EP2214425A1 - Binaural audio guide - Google Patents

Binaural audio guide Download PDF

Info

Publication number
EP2214425A1
EP2214425A1 EP09380013A EP09380013A EP2214425A1 EP 2214425 A1 EP2214425 A1 EP 2214425A1 EP 09380013 A EP09380013 A EP 09380013A EP 09380013 A EP09380013 A EP 09380013A EP 2214425 A1 EP2214425 A1 EP 2214425A1
Authority
EP
European Patent Office
Prior art keywords
user
binaural audio
orientation
audio file
audio guide
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
EP09380013A
Other languages
German (de)
French (fr)
Inventor
Ivan Portas Arrondo
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Auralia Emotive Media Systems SL
Original Assignee
Auralia Emotive Media Systems SL
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Auralia Emotive Media Systems SL filed Critical Auralia Emotive Media Systems SL
Priority to EP09380013A priority Critical patent/EP2214425A1/en
Priority to PCT/EP2010/050912 priority patent/WO2010086321A1/en
Publication of EP2214425A1 publication Critical patent/EP2214425A1/en
Ceased legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S7/00Indicating arrangements; Control arrangements, e.g. balance control
    • H04S7/30Control circuits for electronic adaptation of the sound field
    • H04S7/302Electronic adaptation of stereophonic sound system to listener position or orientation
    • H04S7/303Tracking of listener position or orientation
    • H04S7/304For headphones
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R2460/00Details of hearing devices, i.e. of ear- or headphones covered by H04R1/10 or H04R5/033 but not provided for in any of their subgroups, or of hearing aids covered by H04R25/00 but not provided for in any of its subgroups
    • H04R2460/07Use of position data from wide-area or local-area positioning systems in hearing devices, e.g. program or information selection

Definitions

  • the main object of this invention is a binaural audio guide that is noninvasive with the environment, which provides users with information on the objects around them through headphones.
  • the acoustic information relative to each object seems to come from the object itself and therefore the information received via the headphones depends on the position and orientation of the user's head.
  • This audio guide is especially useful in museums or similar places.
  • Some audio guides that offer tourist information to users based on their location are known in the state of the art, and which, according to the type thereof, can operate outdoors (for example, in a city) or indoors (for example, in a museum). All of these systems comprise a device transported by the user and a network of information points located next to each object relative to which information is requested, whether a monument in the case of the outdoor type or a work of art in the case of the indoor type. The information points detect whether the user is in the vicinity of the object and, if so, reproduce an audio file relative to the monument or painting in question.
  • the audio files may be stored in a device transported by the users or at each information point, in such a manner that they are transmitted to the user when reproduced.
  • An inconvenience of these systems is their high cost, as they require the prior installation of the information points so the system can operate. Additionally, some environments such as museums and similar do not allow the installation of information points for artwork security reasons or for fear of endangering the surroundings. Another inconvenience is related to the user's impossibility to intuitively identify the painting or monument relative to which they are receiving the information, due to the lack of context thereof. For example, in the case of the museum, there could be several works of art within the same room, which can create confusion in the user.
  • Confusion can also arise due to the lack of precision in the localisation of the user. Normally, the user is only located based on the scope of the means of communication between the device transported by the user and each information point. For example, infrared media are used indoors to ensure that the user is inside a specific room, or media such as Bluetooth or W-Fi which have a known range. However, this provides very little precision, which frequently leads to the reproduction of audio files at times when the user is not in the required position. In addition, due to the type of Boolean detection (true or false), these systems do not solve problems such as occlusion due to physical obstacles between the issuer and the recipient, abrupt audio reproduction, etc.
  • Humans are volumetric sonorous recipients, i.e. we process the sound that reaches us through, for example, reflections created by the shoulders and torso, or diffractions created by the sound as it surrounds the user's head. That is, human hearing is binaural by nature, where the resulting sonorous reception process ends in two single channels: right ear and left ear.
  • the term "binaural” relates to the nature of human hearing, due to the fact that people are capable of capturing all the sonorous spatial information through a single pair of ears and determining the location of the sonorous sources. When this phenomenology is not taken into account the so-called "endocranial sound" is normally produced, such as for example on listening to traditional stereo sound through headphones. Endocranial sound consists of the feeling that the sonorous sources are located inside the user's head, at a point located between the headphones, due to which traditional stereo sound is not a recommendable format when trying to represent three-dimensional sonorous spaces realistically.
  • the invention describes an audio guide system where the audio files that provide tourist information to the user are transformed to binaural format in real time depending on the position and orientation of the user's head in relation to the position of the object relative to which information is being provided. That is, for the user the sound source seems at all times to be the object relative to which they are being informed. If the position or orientation of the user's head changes, the sound source is also displaced in real time, in such a manner that the sound continues to seem to come the object in question. In this manner, the user instantly and intuitively knows which is the object relative to which they are receiving information. That is, in this document, the expression "reproduce a binaural file associated to an object” means that the binaural file being reproduced is such that the apparent sound source is located on said object at all times.
  • localisation and orientation means are used that determine the position and orientation of the user's head without the need for installing information points, in such a manner that the binaural audio guide of the invention can be used without altering the environment.
  • This localisation and orientation means obtains the location and orientation of the user's head through the detection of reference points in the environment by means of at least one image acquisition means in accordance with a MonoSLAM process.
  • the MonoSLAM system is superficially described in " MonoSLAM: SLAM in real time with a single camera," by Andrew J. Davison, lan D. Reid, Nicholas D. Molton and Olivier Stasse, Analysis of trans. Patterns and artificial intelligence, IEEE 2007 .
  • the MonoSLAM system obtains the position and orientation of an image acquisition means based on the detection of fixed reference points and on the modification of its relative position as the position and orientation of said image acquisition means changes.
  • the reference points may be the corner of a painting, a smoke detector, the corner of a room, etc. Based on these data, the MonoSLAM system obtains the exact position and orientation coordinates (X, Y, Z) of the image acquisition means, and therefore of the user's head.
  • MonoSLAM's great accuracy allows a set of reproduction rules to be established, spatial or temporal, that determine how the audio files associated to each object will be reproduced based on user conduct.
  • regions that trigger the reproduction of certain audio files for example regions around objects, door thresholds, etc.
  • the reproduction volume could even be modified based on the user's position.
  • time intervals that trigger the reproduction of certain audio files could be defined. For example, it could be the duration of the user's stay within the region associated to a specific object, the time they spend looking in a specific direction, etc.
  • front direction is the direction located immediately in front of the user's face, i.e. the direction in which the user looks if they continue looking straight ahead, as opposed to the rear direction or side directions, all being comprised within a horizontal plane that crosses the user's head. Consequently, for example a front-upper or side-upper direction, in the present document comprises those directions that, starting from the front or side direction, are raised to an angle of between 0° and 90° in a vertical direction.
  • a first aspect of the present invention describes a binaural audio guide which provides information to a user relative to objects located within an enclosure, for example some works of art inside the rooms of a museum.
  • each object is associated to at least one audio file and reproduction rules.
  • the reproduction rules associated to each object are individually chosen according to each specific application. For example, and in the case of information relative to works of art inside a museum, greater regions could be assigned to the main works and smaller regions to the secondary works. Or, in the case of paintings, the associated regions could be located only in front of them, while the regions associated to sculptures could also include the space located behind them. Depending on the design of the reproduction rules, the combination of several sound files corresponding to one or more objects is possible.
  • the information contained in the audio files associated to each object can be of different kinds.
  • the information contained in the audio files associated to each object can be of different kinds.
  • a museum it could be a locution with artistic, historical, tourist or similar information, or atmosphere sound, dialogues or music that help the visitor to understand the work of art.
  • the elements that comprise the audio guide are fundamentally an upper module and a console, each of which are described below:
  • the function of the headphones is to reproduce binaural audio files, and therefore can be of any type provided that they can carry out said objective.
  • the images acquired by, at least, one image acquisition means will be used to determine the position and orientation of the user's head based on a group of reference points detected by the enclosure walls and/or ceiling.
  • the image acquisition means must be fixed to the upper module, and at the same time ahead of the user, in such a manner that no relative displacements take place between the image acquisition module and the user's head.
  • the image acquisition means could be oriented in any direction provided that there are fixed reference points within their viewing angle, even though in a specific embodiment of the invention it is oriented in a front-upper direction.
  • it is preferably disposed inside a transparent dome located in the upper module, which also facilitates the cleaning thereof.
  • the image acquisition means is a CCD or CMOS-type camera, although any type of camera can be used.
  • the localisation and orientation means, the sonorous synthesis means and the storage means can be implemented, jointly or individually, using any types of devices known in the state of the art, such as microcontrollers, micro-processors, FPGAs, DSPs, ASICs or similar, provided that these are capable of implementing the necessary operations for the correct operation of said means.
  • the binaural audio guide also comprises a removable battery, to provide it with autonomy, and an interface medium to allow users to interact with the audio guide, in order to stop or repeat the reproduction of the binaural audio files.
  • a preferred embodiment of the invention includes the modification of the binaural audio file reproduction volume depending on the distance between the user and the object. This characteristic can be used to increase the realism of the information provided to the users to attract their attention. In this manner, for example, the volume can diminish as the user draws away from the object and disappear altogether when they leave the region of the enclosure associated to said object.
  • Another specific embodiment of the invention includes selecting, based on the reproduction rules, the binaural audio file that will be reproduced through the headphones from among a group of binaural audio files associated to an object.
  • a sub-region could be defined within the region associated to an object, in such a manner that when the user enters the region, a first binaural audio file is firstly reproduced with atmosphere sound to draw the attention of the user. If the user advances and draws nearer to the object, on entering the sub-region a second binaural audio file will be reproduced, for example an introductory track with information about the object.
  • the reproduction of the second binaural audio file could take place if the user remains within the region associated to the object for a certain time interval.
  • the user can request the reproduction of binaural audio files using the user interface, for example by pressing a button.
  • Another advantage of the system and procedure of the present invention is that it allows the reproduction of audio help files for the blind, in such a manner that these can move with greater freedom around the museum.
  • These audio help files could be locutions informing about the size of the room, the distance between the user and the nearest obstacle in a certain direction, the location of entries and exits, etc. For example, if the blind user wishes to move to the following room, a locution with the word "exit" that seems to come from the location of the exit could be reproduced, in such a manner that the blind user would only have to move towards the place from which the sound comes.
  • This audio guide (1) has immediate application, especially in museums, and substitutes the traditional audio guides considerably improving user experience (20).
  • the audio guide (1) is comprised by two differentiated units.
  • the upper module (2) that accompanies the user (20) throughout the visit, and which comprises light and ergonomic headphones (4) and, in this example, two image acquisition means (5, 6).
  • the second unit, or console (3) is fundamentally an electronic device having sufficient capacity to process the images and generate the binaural audio.
  • the upper module is comprised of two main components, the image acquisition means (5, 6) and the headphones (4).
  • the transmission of the digital video signals coming from the image acquisition means (5, 6) towards the console (2) is carried out via USB connection through screened interlaced wiring (7), that runs parallel to the audio signal cable (8).
  • the image acquisition means (5, 6) are two CMOS cameras, although in other embodiments of the invention the images of a single camera would be enough to carry out MonoSLAM processing. Based on the two signals obtained, and through MonoSLAM processing, the absolute position and angular orientation of the user's head (20) within a two-dimensional map of the enclosure.
  • the requirements that must be fulfilled by the image acquisition means (5, 6) for these position and orientation coordinates to be correct comprise the use of VGA or higher-resolution cameras, with a wide-angle lens.
  • the frame rate of the image acquisition means (5, 6) is at least 30 fps (frames per second).
  • To prevent the image of the image acquisition means (5, 6) is comprised by moving objects (such as other museum visitors), are disposed on the upper part of the upper module (2) at a 45° angle with the horizontal. Additionally, the image acquisition means (5, 6) are covered with a transparent dome (9) to avoid the deposition of dust particles on the lenses and facilitate cleaning of the optical zone.
  • the console (3) is divided into four functional blocks:
  • the localisation and orientation means (10) is a device which, by using MonoSLAM-type algorithms, obtains the absolute position and orientation of the user's head (20) within an enclosure, such as the rooms of a museum, using the two image acquisition means (5, 6) disposed on the upper module (2) as a sensor.
  • the position obtained consists of the absolute x, y, z parameters and the tilt and orientation angle of the upper module (2), information that is transmitted to a sonorous synthesis means (10) with a frequency of 30 times per second through a serial communications port (SPI, I2C, RS232,).
  • the localisation and orientation means (10) is an Intel Core 2 Duo 2.4 Ghz processor, although it is possible to use other solutions based on low-consumption processors such as Intel Atom or ARM.
  • the sonorous synthesis means (11) is a DSP by Texas Instruments of the C600 family having a storage means (12) or 4 GB Secure Digital memory in which all the audio tracks are stored in a single channel compressed into mp3 (this implies more than 70 hours of information). That is, in this example, the sonorous synthesis means (11) and the storage means (12) are implemented in a single device.
  • the sonorous synthesis means (11) includes the localisation and orientation means and, based on this information and the reproduction rules, selects and generates the binaural audio files to be reproduced. At certain times, the localisation and orientation means (10) can show undetermination when giving the position due, for example, to abrupt user movements (20). It is possible to implement a complementary module based on accelerometers and gyroscopes to compensate these deficiencies.
  • the selected audio files are processed according to the user's relative position (20) with respect to the objects (17), in this case paintings, and time, obtaining a single stereo track. This processing will depend on the relative distance to the object (17), in addition to the reception angle.
  • the stereo tracks are processed with enclosure-modelling algorithms (reverberation, echoes, ...) to give the experience greater realism.
  • enclosure-modelling algorithms reverberation, echoes, ...) to give the experience greater realism.
  • the audio guide (1) of the invention will be powered by a Li-Po four-cell battery (14) that is located within the console (130 x 75 x 27.5 mm). Therefore, the space available for the battery (14) could be 124 x 61 x 12 mm with a capacity of 2.5 Ah and a nominal voltage of 14.8 V. With this capacity, to obtain an audio guide (1) with four hours of autonomy throughout its useful life, the average consumption of the audio guide (1) should be less than 5W.
  • the console (3) has a removable lid that gives access to the battery (14) and an SD card.
  • the battery (14) is charged by extracting the console (3), which allows the audio guide (1) to be used continuously, thereby eliminating battery (14) recharge times.
  • the batteries (14) will be recharged at a station with a capacity for several units and the recharge time must be less than that of a user's visit (20) to the museum.
  • the audio guide (1) of this example also includes a user interface (13) that comprises a monochrome graphic LCD, six push buttons and a switch. Additionally, a sliding switch is disposed on the upper part of the console (3) that blocks the operation of the rest.
  • Fig. 4 represents a user (20) who has contracted the use of the audio guide (1) of the invention at the start of a visit to a museum.
  • the MonoSLAM system must be started up so that it recognises the reference points of the different rooms.
  • the reproduction rules associated to each object (17) or work of art must also be defined, normally using a specific software or through a computer or PC.
  • the audio guide (1) of the invention is ready to operate: as the user (20) moves through each room of the museum, they enter and exit the regions (15) and sub-regions (16) associated to each object (17), which are represented in Fig. 5 and, depending on the reproduction rules associated to each object (17), the corresponding information is reproduced through their headphones (2).
  • a first type of binaural audio file consisting of atmosphere sound related to the object (17) in question is reproduced. This sound is reproduced when the user (20) is still close to the region (15) limits, and attempts to draw the user's attention.
  • the object (17) is the painting by Goya "Shootings at Principe Pio"
  • the atmosphere sound could be the shots and cries of the persons who appear in the painting.
  • a binaural audio file is reproduced with introductory information.
  • This track will have relevant information about the object, but it will not be very long so as not to tire the user (20), and can contain music, dialogues or atmosphere sounds to improve the experience.
  • the reproduction volume will gradually diminish until disappearing altogether when the user (20) leaves the region (15) associated to the object (17).

Landscapes

  • Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Stereophonic System (AREA)

Abstract

The main object of the present invention is binaural audio guide (1), preferably for use in museums, which provides users (20) with information about the objects (17) around them, in such a manner that the information provided seems to come from the specific objects (17) relative to which it informs.

Description

    OBJECT OF THE INVENTION
  • The main object of this invention is a binaural audio guide that is noninvasive with the environment, which provides users with information on the objects around them through headphones. The acoustic information relative to each object seems to come from the object itself and therefore the information received via the headphones depends on the position and orientation of the user's head. This audio guide is especially useful in museums or similar places.
  • BACKGROUND OF THE INVENTION
  • Some audio guides that offer tourist information to users based on their location are known in the state of the art, and which, according to the type thereof, can operate outdoors (for example, in a city) or indoors (for example, in a museum). All of these systems comprise a device transported by the user and a network of information points located next to each object relative to which information is requested, whether a monument in the case of the outdoor type or a work of art in the case of the indoor type. The information points detect whether the user is in the vicinity of the object and, if so, reproduce an audio file relative to the monument or painting in question. The audio files may be stored in a device transported by the users or at each information point, in such a manner that they are transmitted to the user when reproduced.
  • An inconvenience of these systems is their high cost, as they require the prior installation of the information points so the system can operate. Additionally, some environments such as museums and similar do not allow the installation of information points for artwork security reasons or for fear of endangering the surroundings. Another inconvenience is related to the user's impossibility to intuitively identify the painting or monument relative to which they are receiving the information, due to the lack of context thereof. For example, in the case of the museum, there could be several works of art within the same room, which can create confusion in the user.
  • Confusion can also arise due to the lack of precision in the localisation of the user. Normally, the user is only located based on the scope of the means of communication between the device transported by the user and each information point. For example, infrared media are used indoors to ensure that the user is inside a specific room, or media such as Bluetooth or W-Fi which have a known range. However, this provides very little precision, which frequently leads to the reproduction of audio files at times when the user is not in the required position. In addition, due to the type of Boolean detection (true or false), these systems do not solve problems such as occlusion due to physical obstacles between the issuer and the recipient, abrupt audio reproduction, etc.
  • DESCRIPTION OF THE INVENTION
  • Humans are volumetric sonorous recipients, i.e. we process the sound that reaches us through, for example, reflections created by the shoulders and torso, or diffractions created by the sound as it surrounds the user's head. That is, human hearing is binaural by nature, where the resulting sonorous reception process ends in two single channels: right ear and left ear. The term "binaural" relates to the nature of human hearing, due to the fact that people are capable of capturing all the sonorous spatial information through a single pair of ears and determining the location of the sonorous sources. When this phenomenology is not taken into account the so-called "endocranial sound" is normally produced, such as for example on listening to traditional stereo sound through headphones. Endocranial sound consists of the feeling that the sonorous sources are located inside the user's head, at a point located between the headphones, due to which traditional stereo sound is not a recommendable format when trying to represent three-dimensional sonorous spaces realistically.
  • The invention describes an audio guide system where the audio files that provide tourist information to the user are transformed to binaural format in real time depending on the position and orientation of the user's head in relation to the position of the object relative to which information is being provided. That is, for the user the sound source seems at all times to be the object relative to which they are being informed. If the position or orientation of the user's head changes, the sound source is also displaced in real time, in such a manner that the sound continues to seem to come the object in question. In this manner, the user instantly and intuitively knows which is the object relative to which they are receiving information. That is, in this document, the expression "reproduce a binaural file associated to an object" means that the binaural file being reproduced is such that the apparent sound source is located on said object at all times.
  • In order to generate the binaural audio files, localisation and orientation means are used that determine the position and orientation of the user's head without the need for installing information points, in such a manner that the binaural audio guide of the invention can be used without altering the environment. This localisation and orientation means obtains the location and orientation of the user's head through the detection of reference points in the environment by means of at least one image acquisition means in accordance with a MonoSLAM process. The MonoSLAM system is superficially described in "MonoSLAM: SLAM in real time with a single camera," by Andrew J. Davison, lan D. Reid, Nicholas D. Molton and Olivier Stasse, Analysis of trans. Patterns and artificial intelligence, IEEE 2007. Fundamentally, the MonoSLAM system obtains the position and orientation of an image acquisition means based on the detection of fixed reference points and on the modification of its relative position as the position and orientation of said image acquisition means changes. For example, in the case of the present invention, the reference points may be the corner of a painting, a smoke detector, the corner of a room, etc. Based on these data, the MonoSLAM system obtains the exact position and orientation coordinates (X, Y, Z) of the image acquisition means, and therefore of the user's head.
  • MonoSLAM's great accuracy allows a set of reproduction rules to be established, spatial or temporal, that determine how the audio files associated to each object will be reproduced based on user conduct.
  • In the case of spatial rules, regions that trigger the reproduction of certain audio files, for example regions around objects, door thresholds, etc., could be defined. The reproduction volume could even be modified based on the user's position. In the case of temporal rules, time intervals that trigger the reproduction of certain audio files could be defined. For example, it could be the duration of the user's stay within the region associated to a specific object, the time they spend looking in a specific direction, etc.
  • In the present document, we will consider front direction as being the direction located immediately in front of the user's face, i.e. the direction in which the user looks if they continue looking straight ahead, as opposed to the rear direction or side directions, all being comprised within a horizontal plane that crosses the user's head. Consequently, for example a front-upper or side-upper direction, in the present document comprises those directions that, starting from the front or side direction, are raised to an angle of between 0° and 90° in a vertical direction.
  • In this manner, a first aspect of the present invention describes a binaural audio guide which provides information to a user relative to objects located within an enclosure, for example some works of art inside the rooms of a museum. In accordance with the invention, each object is associated to at least one audio file and reproduction rules.
  • The reproduction rules associated to each object are individually chosen according to each specific application. For example, and in the case of information relative to works of art inside a museum, greater regions could be assigned to the main works and smaller regions to the secondary works. Or, in the case of paintings, the associated regions could be located only in front of them, while the regions associated to sculptures could also include the space located behind them. Depending on the design of the reproduction rules, the combination of several sound files corresponding to one or more objects is possible.
  • On the other hand, the information contained in the audio files associated to each object can be of different kinds. For example, in the case of a museum, it could be a locution with artistic, historical, tourist or similar information, or atmosphere sound, dialogues or music that help the visitor to understand the work of art.
  • The elements that comprise the audio guide are fundamentally an upper module and a console, each of which are described below:
  • a) Upper module
  • It is a module that can be coupled to the user's head, and which comprises headphones and at least one image acquisition means to acquire images of the user's surroundings.
    The function of the headphones is to reproduce binaural audio files, and therefore can be of any type provided that they can carry out said objective.
    On the other hand, the images acquired by, at least, one image acquisition means will be used to determine the position and orientation of the user's head based on a group of reference points detected by the enclosure walls and/or ceiling. To this end, the image acquisition means must be fixed to the upper module, and at the same time ahead of the user, in such a manner that no relative displacements take place between the image acquisition module and the user's head. With respect to the orientation, in principle the image acquisition means could be oriented in any direction provided that there are fixed reference points within their viewing angle, even though in a specific embodiment of the invention it is oriented in a front-upper direction.
    For the purpose of preventing the image acquisition means from suffering damage, it is preferably disposed inside a transparent dome located in the upper module, which also facilitates the cleaning thereof. Normally, the image acquisition means is a CCD or CMOS-type camera, although any type of camera can be used.
  • b) A console
  • It is a device connected to the upper module and which also comprises:
    • A localisation and orientation means that serves to determine the position and orientation of the user's head based on the images acquired by at least one image acquisition means. To this end, MonoSLAM-type algorithms are used, capable of calculating the position and orientation of an image acquisition means at great speed based on the images that it obtains.
      In a preferred embodiment of the invention, an additional localisation and orientation means based on accelerometers and gyroscopes to improve system accuracy is also used.
    • A storage means that stores audio files associated to the objects relative to which we wish to inform the user.
    • A sonorous synthesis means that generates, based on the position and orientation of the user's head, time and reproduction rules associated to the objects, a binaural audio file based on at least one audio file associated to each of said objects. That is, the binaural audio file can be generated from the combination of several audio files associated to several objects. In the audio file generated, the sound corresponding to each of the objects will seem to come from the object in question.
  • The localisation and orientation means, the sonorous synthesis means and the storage means can be implemented, jointly or individually, using any types of devices known in the state of the art, such as microcontrollers, micro-processors, FPGAs, DSPs, ASICs or similar, provided that these are capable of implementing the necessary operations for the correct operation of said means.
  • Additionally, in preferred embodiments of the invention the binaural audio guide also comprises a removable battery, to provide it with autonomy, and an interface medium to allow users to interact with the audio guide, in order to stop or repeat the reproduction of the binaural audio files.
  • According to a second aspect of the invention, a description is given of an operating procedure of a binaural audio guide such as that previously described, where each object is associated to at least one audio file and reproduction rules, and which comprises the following operations:
    1. 1) Acquire, through an image acquisition means coupled to a user's head, images of said user's surroundings.
    2. 2) Process the images acquired in a localisation and orientation means through a MonoSLAM process to obtain the position and orientation of the user's head.
    3. 3) Determine, based on the position and orientation of the user's head and the time, if any of the reproduction rules corresponding to any of the objects are fulfilled.
    4. 4) If so generate, by means of a sonorous synthesis module, and based on the position and orientation of the user's head, on time and the reproduction rules of said objects, a binaural audio file from at least one audio file associated to each object in such a manner that the user's feeling on listening to the binaural audio file is that each object comes from said object in question.
    5. 5) Reproduce the binaural audio file through the user's headphones.
  • In view of the fact that the generation and reproduction of the binaural audio files is carried out in real time, as the user modifies the position and orientation of his/her head, the foregoing operations must be carried out quite frequently, preferably greater than 10 cycles per second.
  • Additionally, a preferred embodiment of the invention includes the modification of the binaural audio file reproduction volume depending on the distance between the user and the object. This characteristic can be used to increase the realism of the information provided to the users to attract their attention. In this manner, for example, the volume can diminish as the user draws away from the object and disappear altogether when they leave the region of the enclosure associated to said object.
  • Another specific embodiment of the invention includes selecting, based on the reproduction rules, the binaural audio file that will be reproduced through the headphones from among a group of binaural audio files associated to an object. For example, a sub-region could be defined within the region associated to an object, in such a manner that when the user enters the region, a first binaural audio file is firstly reproduced with atmosphere sound to draw the attention of the user. If the user advances and draws nearer to the object, on entering the sub-region a second binaural audio file will be reproduced, for example an introductory track with information about the object. In another specific example, the reproduction of the second binaural audio file could take place if the user remains within the region associated to the object for a certain time interval.
  • In yet another preferred embodiment, the user can request the reproduction of binaural audio files using the user interface, for example by pressing a button.
  • Additionally, another advantage of the system and procedure of the present invention is that it allows the reproduction of audio help files for the blind, in such a manner that these can move with greater freedom around the museum. These audio help files could be locutions informing about the size of the room, the distance between the user and the nearest obstacle in a certain direction, the location of entries and exits, etc. For example, if the blind user wishes to move to the following room, a locution with the word "exit" that seems to come from the location of the exit could be reproduced, in such a manner that the blind user would only have to move towards the place from which the sound comes.
  • DESCRIPTION OF THE DRAWINGS
  • For the purpose of complementing this description and helping to better understand the characteristics of the invention, a set of drawings in accordance with a preferred practical embodiment thereof has been included as an integral part of this description, in which the following figures have been represented in an illustrative and unlimitative manner:
    • Figure 1 shows a block drawing of the audio guide of the invention.
    • Figure 2 shows a perspective view of the upper module of the audio guide of the invention.
    • Figure 3 shows a perspective view of the console of the audio guide of the invention.
    • Figure 4 shows a user who is using the audio guide of the present invention.
    • Figure 5 shows a plan view of a museum where the regions and sub-regions associated to each table can be observed.
    PREFERRED EMBODIMENT OF THE INVENTION
  • Below we describe a preferred embodiment of the audio guide (1) of the invention making reference to the figures attached hereto. This audio guide (1) has immediate application, especially in museums, and substitutes the traditional audio guides considerably improving user experience (20).
  • The main technological challenges that the development of this audio guide (1) faces are, firstly, obtain accurate localisation indoors without modifying the surroundings and, secondly, ensure that the volume, weight and energy consumption of the audio guide (1) are sufficiently low for it to be really portable.
  • As can be observed in figures 1, 2 and 3, the audio guide (1) is comprised by two differentiated units. On one hand there is the upper module (2) that accompanies the user (20) throughout the visit, and which comprises light and ergonomic headphones (4) and, in this example, two image acquisition means (5, 6). The second unit, or console (3), is fundamentally an electronic device having sufficient capacity to process the images and generate the binaural audio. These components are described below:
  • a) Upper module (2)
  • The upper module is comprised of two main components, the image acquisition means (5, 6) and the headphones (4). The transmission of the digital video signals coming from the image acquisition means (5, 6) towards the console (2) is carried out via USB connection through screened interlaced wiring (7), that runs parallel to the audio signal cable (8).
  • Image acquisition means (5, 6)
  • In this example, the image acquisition means (5, 6) are two CMOS cameras, although in other embodiments of the invention the images of a single camera would be enough to carry out MonoSLAM processing. Based on the two signals obtained, and through MonoSLAM processing, the absolute position and angular orientation of the user's head (20) within a two-dimensional map of the enclosure. The requirements that must be fulfilled by the image acquisition means (5, 6) for these position and orientation coordinates to be correct comprise the use of VGA or higher-resolution cameras, with a wide-angle lens. Additionally, to achieve the necessary position updating frequency, the frame rate of the image acquisition means (5, 6) is at least 30 fps (frames per second).
  • Moving objects hinder normal MonoSLAM operation. To prevent the image of the image acquisition means (5, 6) is comprised by moving objects (such as other museum visitors), are disposed on the upper part of the upper module (2) at a 45° angle with the horizontal. Additionally, the image acquisition means (5, 6) are covered with a transparent dome (9) to avoid the deposition of dust particles on the lenses and facilitate cleaning of the optical zone.
  • Image Acquisition Means (5, 6) Technical Characteristics:
    • Sensor: BGA CMOS
    • Maximum lens aperture: F:2.4
    • Lens angle of vision: 120 degrees
    • Data format: YUY2
    • Colour depth: 24 bits
    • Interface: USB 2.0 Video Class: UVC
    • Power supply: Via USB < 1 W
    Headphones (4)
  • Given that the acoustic contamination of the real atmosphere sound is considered a problem, we decided to find a headphone (4) coupling system that will insulate against the entrance of external noise to a greater extent. Among the existing alternatives to implement the headphones (4), intra-auricular devices have been ruled out for hygiene reasons, and those that encapsulate the ear for being too bulky and compromise transpiration.
  • The technical characteristics of the headphones (4) used in this example are set out below, although other alternatives that can also be functionally correct are not ruled out:
    • Response frequency: 12-22000 Hz
    • Impedance: 24 ohms
    • Magnet type: Neodimio
    • Maximum incoming power: 100 mW
    • Sensitivity: 106 dB
    • Injected black ABS casing with lacquered ultra-polished finish
    • Spherical articulation in both transductors
    • Cable outlet through a single headphone (arc with a hollow for cables)
    • Disposable covers
    • Upper transparent polycarbonate dome
    b) Console (3)
  • The console (3) is divided into four functional blocks:
    • Localisation and orientation means (10)
    • Sonorous synthesis means (11)
    • Storage means (12)
    • Interface medium (13)
    • Battery (14)
    Localisation and orientation means (10)
  • The localisation and orientation means (10) is a device which, by using MonoSLAM-type algorithms, obtains the absolute position and orientation of the user's head (20) within an enclosure, such as the rooms of a museum, using the two image acquisition means (5, 6) disposed on the upper module (2) as a sensor. The position obtained consists of the absolute x, y, z parameters and the tilt and orientation angle of the upper module (2), information that is transmitted to a sonorous synthesis means (10) with a frequency of 30 times per second through a serial communications port (SPI, I2C, RS232,...).
  • The MonoSLAM algorithms have a fairly high processing load. To carry out the necessary functions with ease, in the present example the localisation and orientation means (10) is an Intel Core 2 Duo 2.4 Ghz processor, although it is possible to use other solutions based on low-consumption processors such as Intel Atom or ARM.
  • Sonorous synthesis means (11)
  • The sonorous synthesis means (11) is a DSP by Texas Instruments of the C600 family having a storage means (12) or 4 GB Secure Digital memory in which all the audio tracks are stored in a single channel compressed into mp3 (this implies more than 70 hours of information). That is, in this example, the sonorous synthesis means (11) and the storage means (12) are implemented in a single device.
  • The sonorous synthesis means (11) includes the localisation and orientation means and, based on this information and the reproduction rules, selects and generates the binaural audio files to be reproduced. At certain times, the localisation and orientation means (10) can show undetermination when giving the position due, for example, to abrupt user movements (20). It is possible to implement a complementary module based on accelerometers and gyroscopes to compensate these deficiencies.
  • The selected audio files are processed according to the user's relative position (20) with respect to the objects (17), in this case paintings, and time, obtaining a single stereo track. This processing will depend on the relative distance to the object (17), in addition to the reception angle. The stereo tracks are processed with enclosure-modelling algorithms (reverberation, echoes, ...) to give the experience greater realism. Once all the tracks have been independently processed, they are introduced in a mixer that generates a stereo audio track. Next, this stereo audio track is processed through an outgoing codec and an audio amplification system, giving rise to the signal that is reproduced in the headphones (4). The maximum total consumption of the sonorous synthesis means (11) is 2W.
  • Battery (14)
  • The audio guide (1) of the invention will be powered by a Li-Po four-cell battery (14) that is located within the console (130 x 75 x 27.5 mm). Therefore, the space available for the battery (14) could be 124 x 61 x 12 mm with a capacity of 2.5 Ah and a nominal voltage of 14.8 V. With this capacity, to obtain an audio guide (1) with four hours of autonomy throughout its useful life, the average consumption of the audio guide (1) should be less than 5W.
  • The console (3) has a removable lid that gives access to the battery (14) and an SD card. The battery (14) is charged by extracting the console (3), which allows the audio guide (1) to be used continuously, thereby eliminating battery (14) recharge times. The batteries (14) will be recharged at a station with a capacity for several units and the recharge time must be less than that of a user's visit (20) to the museum.
  • Interface (13)
  • The audio guide (1) of this example also includes a user interface (13) that comprises a monochrome graphic LCD, six push buttons and a switch. Additionally, a sliding switch is disposed on the upper part of the console (3) that blocks the operation of the rest.
  • Operation
  • Below we describe the operation of the audio guide (1) of the invention. Fig. 4 represents a user (20) who has contracted the use of the audio guide (1) of the invention at the start of a visit to a museum. Firstly, and specifically in the case of museums, the MonoSLAM system must be started up so that it recognises the reference points of the different rooms. The reproduction rules associated to each object (17) or work of art must also be defined, normally using a specific software or through a computer or PC.
  • After it has been started up, the audio guide (1) of the invention is ready to operate: as the user (20) moves through each room of the museum, they enter and exit the regions (15) and sub-regions (16) associated to each object (17), which are represented in Fig. 5 and, depending on the reproduction rules associated to each object (17), the corresponding information is reproduced through their headphones (2).
  • In this example, when the user (20) enters the region (15) associated to the object (17), a first type of binaural audio file consisting of atmosphere sound related to the object (17) in question is reproduced. This sound is reproduced when the user (20) is still close to the region (15) limits, and attempts to draw the user's attention. For example, if the object (17) is the painting by Goya "Shootings at Principe Pio", the atmosphere sound could be the shots and cries of the persons who appear in the painting.
  • Next, when the user (20) approaches the work of art and is located within a sub-region (16) associated to the object (17), a binaural audio file is reproduced with introductory information. This track will have relevant information about the object, but it will not be very long so as not to tire the user (20), and can contain music, dialogues or atmosphere sounds to improve the experience.
  • Finally, once the introductory track has finished, additional information can be reproduced if the user (20) requests it via the console (3) interface medium (13), for example by pressing a button.
  • If during the reproduction of any of these tracks, the user (20) moves away from the object (17) towards another location, the reproduction volume will gradually diminish until disappearing altogether when the user (20) leaves the region (15) associated to the object (17).

Claims (15)

  1. Binaural audio guide (1) to provide information to a user (20) relative to objects (17) located within an enclosure, characterised in that each object (17) is associated to at least one audio file and reproduction rules, which comprises:
    a) an upper module (2) that can be coupled to the user's (20) head, which comprises headphones (4) and at least one image acquisition means (5, 6) which acquires images from the user's (20) surroundings; and
    b) a console (3) connected to said upper module (2), which comprises:
    - a localisation and orientation means (10) which determines, according to a MonoSLAM process, the position and orientation of the user's (20) head based on the images acquired by it at least one image acquisition means (5, 6);
    - a storage means (12) that stores the audio files associated to the objects (17); and
    - a sonorous synthesis means that generates, according to the position and orientation of the user's (20) head, time and the reproduction rules associated to the objects (17), a binaural audio file based on at least one audio file associated to each of said objects (17), in such a manner that, in the binaural audio file generated, the sound corresponding to each of the objects (17) seems to come from said specific object (17).
  2. Binaural audio guide (1), according to claim 1, characterised in that at least one image acquisition means (5, 6) is oriented in a front-upper direction.
  3. Binaural audio guide (1), according to any of preceding claims, characterised in that at least one image acquisition means (5, 6) is located inside a transparent dome (9) disposed in the upper module (2).
  4. Binaural audio guide (1), according to any of the preceding claims, characterised in that it comprises two image acquisition means (5, 6).
  5. Binaural audio guide (1), according to any of the preceding claims, characterised in that it also comprises a localisation and orientation means based on accelerometers and gyroscopes.
  6. Binaural audio guide (1), according to any of the preceding claims, characterised in that the localisation and orientation means (10), the sonorous synthesis means (11) and the storage means (12) are implemented by means of at least one of the devices of the following list: a micro-controller, a micro-processor, a DSP, a FPGA and an ASIC.
  7. Binaural audio guide (1), according to any of the preceding claims, characterised in that it also comprises a battery (14).
  8. Binaural audio guide (1), according to any of the preceding claims, characterised in that it also comprises an interface medium (13) with the user (20).
  9. Operating procedure of a binaural audio guide (1) for providing a user (20) with information about objects (17) located within an enclosure, characterised in that each object (17) is associated to at least one audio file and reproduction rules, and which comprises the following operations:
    - acquire, through at least one image acquisition means (5, 6) coupled to a user's (20) head, images of said user's (20) surroundings;
    - process the images acquired by a localisation and orientation means (10) based on a MonoSLAM process to obtain the position and orientation of the user's (20) head;
    - determine, depending on the position and orientation of the user's (20) head and time, if any of the reproduction rules corresponding to any of the objects (17) are fulfilled;
    - if so generate, through a sonorous synthesis module (11), and according to the position and orientation of the user's (20) head, time and the reproduction rules of said objects (17), a binaural audio file based on at least one audio file associated to each object (17), in such a manner that the user's feeling on hearing the binaural audio file generated is that the sound of each object (17) comes from said specific object (17); and
    - reproduce the binaural audio file generated through headphones (4) wom by the user (20).
  10. Operating procedure of a binaural audio guide (1), according to claim 9, characterised in that the preceding operations are carried out with a frequency of above 10 cycles per second.
  11. Operating procedure of a binaural audio guide (1), according to claims 9 to 10, characterised in that the reproduction rules comprise spatial reproduction rules and temporal reproduction rules.
  12. Operating procedure of a binaural audio guide (1), according to claims 9 to 11, characterised in that it consists of reproducing a first type of binaural audio file when the user (20) enters the region (15) associated to an object (17).
  13. Operating procedure of a binaural audio guide (1), according to claim 12, characterised in that it consists of reproducing a second type of binaural audio file when the user (20) enters a sub-region (16) located inside the region (15).
  14. Operating procedure of a binaural audio guide (1), according to claim 12, characterised in that it consists of reproducing a second binaural audio file when the user (20) remains for a certain time interval within the region (15).
  15. Operating procedure of a binaural audio guide (1), according to claims 9 to 14, characterised in that it also comprises the additional operation of modifying the binaural audio file reproduction volume based on the distance between the user (20) and the object (17).
EP09380013A 2009-01-28 2009-01-28 Binaural audio guide Ceased EP2214425A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
EP09380013A EP2214425A1 (en) 2009-01-28 2009-01-28 Binaural audio guide
PCT/EP2010/050912 WO2010086321A1 (en) 2009-01-28 2010-01-27 Binaural audio guide

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
EP09380013A EP2214425A1 (en) 2009-01-28 2009-01-28 Binaural audio guide

Publications (1)

Publication Number Publication Date
EP2214425A1 true EP2214425A1 (en) 2010-08-04

Family

ID=40636123

Family Applications (1)

Application Number Title Priority Date Filing Date
EP09380013A Ceased EP2214425A1 (en) 2009-01-28 2009-01-28 Binaural audio guide

Country Status (2)

Country Link
EP (1) EP2214425A1 (en)
WO (1) WO2010086321A1 (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120148055A1 (en) * 2010-12-13 2012-06-14 Samsung Electronics Co., Ltd. Audio processing apparatus, audio receiver and method for providing audio thereof
EP2824649A1 (en) * 2013-07-12 2015-01-14 GN Store Nord A/S Audio based learning system comprising a portable terminal connected to an audio unit and plurality of zones
EP2866217A1 (en) * 2013-10-25 2015-04-29 GN Store Nord A/S An audio information system based on zones and contexts
EP2777040A4 (en) * 2011-11-09 2015-09-09 Sony Corp Voice control device, voice control method and program
DE102014204630A1 (en) * 2014-03-13 2015-09-17 Steffen Armbruster Mobile device for immersive sound experiences
EP3410747A1 (en) * 2017-06-02 2018-12-05 Nokia Technologies Oy Switching rendering mode based on location data
CN111448805A (en) * 2017-12-29 2020-07-24 诺基亚技术有限公司 Apparatus, method and computer program for providing notifications
US11091350B2 (en) 2017-04-28 2021-08-17 Otis Elevator Company Audio orientation systems for elevator cars

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2014111829A1 (en) * 2013-01-17 2014-07-24 Koninklijke Philips N.V. Binaural audio processing
JP6263098B2 (en) * 2014-07-15 2018-01-17 Kddi株式会社 Portable terminal for arranging virtual sound source at provided information position, voice presentation program, and voice presentation method
EP3220667A1 (en) * 2016-03-14 2017-09-20 Thomson Licensing Headphones for binaural experience and audio device
CN108008394B (en) * 2017-11-28 2019-09-24 安徽工程大学 Biomimetic sonar two dimension undergoes drawing drawing method, storage medium and equipment
CN117765839A (en) * 2023-12-25 2024-03-26 广东保伦电子股份有限公司 Indoor intelligent navigation method, device and storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5745126A (en) * 1995-03-31 1998-04-28 The Regents Of The University Of California Machine synthesis of a virtual video camera/image of a scene from multiple video cameras/images of the scene in accordance with a particular perspective on the scene, an object in the scene, or an event in the scene
WO1999049574A1 (en) * 1998-03-25 1999-09-30 Lake Technology Limited Audio signal processing method and apparatus
US6259795B1 (en) * 1996-07-12 2001-07-10 Lake Dsp Pty Ltd. Methods and apparatus for processing spatialized audio
WO2001055833A1 (en) * 2000-01-28 2001-08-02 Lake Technology Limited Spatialized audio system for use in a geographical environment
WO2007004134A2 (en) * 2005-06-30 2007-01-11 Philips Intellectual Property & Standards Gmbh Method of controlling a system

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5745126A (en) * 1995-03-31 1998-04-28 The Regents Of The University Of California Machine synthesis of a virtual video camera/image of a scene from multiple video cameras/images of the scene in accordance with a particular perspective on the scene, an object in the scene, or an event in the scene
US6259795B1 (en) * 1996-07-12 2001-07-10 Lake Dsp Pty Ltd. Methods and apparatus for processing spatialized audio
WO1999049574A1 (en) * 1998-03-25 1999-09-30 Lake Technology Limited Audio signal processing method and apparatus
WO2001055833A1 (en) * 2000-01-28 2001-08-02 Lake Technology Limited Spatialized audio system for use in a geographical environment
WO2007004134A2 (en) * 2005-06-30 2007-01-11 Philips Intellectual Property & Standards Gmbh Method of controlling a system

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
ANDREW J DAVISON ET AL: "MonoSLAM: Real-Time Single Camera SLAM", IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, IEEE SERVICE CENTER, LOS ALAMITOS, CA, US, vol. 29, no. 6, 1 June 2007 (2007-06-01), pages 1052 - 1067, XP011179664, ISSN: 0162-8828 *
ANDREW J. DAVISON ET AL.: "MonoSLAM: SLAM in real time with a single camera", ANALYSIS OF TRANS. PATTERNS AND ARTIFICIAL INTELLIGENCE, IEEE, 2007
MEYER ET AL: "A Survey of Position Trackers", PRESENCE, CAMBRIDGE, MA, US, vol. 1, no. 2, 21 March 1992 (1992-03-21), pages 173 - 200, XP002103552, ISSN: 1054-7460 *

Cited By (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120148055A1 (en) * 2010-12-13 2012-06-14 Samsung Electronics Co., Ltd. Audio processing apparatus, audio receiver and method for providing audio thereof
US9830128B2 (en) 2011-11-09 2017-11-28 Sony Corporation Voice control device, voice control method and program
EP2777040A4 (en) * 2011-11-09 2015-09-09 Sony Corp Voice control device, voice control method and program
US9299349B2 (en) 2011-11-09 2016-03-29 Sony Corporation Voice control device, voice control method and program
US9557962B2 (en) 2011-11-09 2017-01-31 Sony Corporation Voice control device, voice control method and program
EP2824649A1 (en) * 2013-07-12 2015-01-14 GN Store Nord A/S Audio based learning system comprising a portable terminal connected to an audio unit and plurality of zones
EP2866217A1 (en) * 2013-10-25 2015-04-29 GN Store Nord A/S An audio information system based on zones and contexts
EP3709677A1 (en) * 2014-03-13 2020-09-16 Framed Immersive Projects GmbH & Co. Kg Mobile device for immersive sound experiences
EP3117632B1 (en) * 2014-03-13 2020-04-29 Framed Immersive Projects GmbH & Co. Kg Mobile device for immersive sound experiences
DE102014204630A1 (en) * 2014-03-13 2015-09-17 Steffen Armbruster Mobile device for immersive sound experiences
US11091350B2 (en) 2017-04-28 2021-08-17 Otis Elevator Company Audio orientation systems for elevator cars
EP3410747A1 (en) * 2017-06-02 2018-12-05 Nokia Technologies Oy Switching rendering mode based on location data
WO2018220278A1 (en) * 2017-06-02 2018-12-06 Nokia Technologies Oy Switching rendering mode based on location data
US10827296B2 (en) 2017-06-02 2020-11-03 Nokia Technologies Oy Switching rendering mode based on location data
CN111448805A (en) * 2017-12-29 2020-07-24 诺基亚技术有限公司 Apparatus, method and computer program for providing notifications
US11696085B2 (en) 2017-12-29 2023-07-04 Nokia Technologies Oy Apparatus, method and computer program for providing notifications

Also Published As

Publication number Publication date
WO2010086321A1 (en) 2010-08-05

Similar Documents

Publication Publication Date Title
EP2214425A1 (en) Binaural audio guide
US20230209295A1 (en) Systems and methods for sound source virtualization
US9906885B2 (en) Methods and systems for inserting virtual sounds into an environment
US20190313201A1 (en) Systems and methods for sound externalization over headphones
WO2018068689A1 (en) Volume adjustment method and device
KR20160113666A (en) Audio navigation assistance
WO2016099052A1 (en) Three-dimensional guidance device for informing visually handicapped person of obstacle, guidance system for providing surrounding information using same, and method therefor
Harma et al. Techniques and applications of wearable augmented reality audio
EP2736276A1 (en) Personal communications unit for observing from a point of view and team communications system comprising multiple personal communications units for observing from a point of view
US10080086B2 (en) Dummy head that captures binaural sound
CN108632695B (en) Earphone set
CN108169927A (en) A kind of glasses for guiding blind based on binocular stereo vision
US20240042318A1 (en) Gaming with earpiece 3d audio
JP2017062598A (en) Information processing device, information processing method, and program
JP6627775B2 (en) Information processing apparatus, information processing method and program
CN109996170B (en) Indoor route generation method, device and system
WO2020026548A1 (en) Information processing device, information processing method, and acoustic system
CN117685968A (en) Method for navigating intelligent agent and intelligent agent
JP4743305B2 (en) Function control method by boundary definition, function control system by boundary definition, function control server and program by boundary definition
GB2558279A (en) Head mountable display system
CN113707165A (en) Audio processing method and device, electronic equipment and storage medium
Cohen et al. Applications of Audio Augmented Reality: Wearware, Everyware, Anyware, and Awareware
JP2021141505A (en) Imaging apparatus, control method, and program
JPH08336166A (en) Video viewing device
CN115209032B (en) Image acquisition method and device based on cleaning robot, electronic equipment and medium

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO SE SI SK TR

AX Request for extension of the european patent

Extension state: AL BA RS

RIN1 Information on inventor provided before grant (corrected)

Inventor name: AZPIAZU LOZANO, JON

Inventor name: MARTINEZ MONTENIEL, JOSE MARIA

Inventor name: PORTAS ARRONDO, IVAN

17P Request for examination filed

Effective date: 20110125

AKX Designation fees paid

Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO SE SI SK TR

17Q First examination report despatched

Effective date: 20110414

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION HAS BEEN REFUSED

18R Application refused

Effective date: 20120605