US9271103B2 - Audio control based on orientation - Google Patents

Audio control based on orientation Download PDF

Info

Publication number
US9271103B2
US9271103B2 US13/997,423 US201213997423A US9271103B2 US 9271103 B2 US9271103 B2 US 9271103B2 US 201213997423 A US201213997423 A US 201213997423A US 9271103 B2 US9271103 B2 US 9271103B2
Authority
US
United States
Prior art keywords
orientation
electronic device
mobile electronic
respect
audio signal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active, expires
Application number
US13/997,423
Other versions
US20140153751A1 (en
Inventor
Kevin C. Wells
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Intel Corp
Original Assignee
Intel Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Intel Corp filed Critical Intel Corp
Assigned to INTEL CORPORATION reassignment INTEL CORPORATION ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: WELLS, KEVIN C.
Publication of US20140153751A1 publication Critical patent/US20140153751A1/en
Application granted granted Critical
Publication of US9271103B2 publication Critical patent/US9271103B2/en
Active legal-status Critical Current
Adjusted expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S7/00Indicating arrangements; Control arrangements, e.g. balance control
    • H04S7/30Control circuits for electronic adaptation of the sound field
    • H04S7/302Electronic adaptation of stereophonic sound system to listener position or orientation
    • H04S7/303Tracking of listener position or orientation
    • H04S7/304For headphones
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R2460/00Details of hearing devices, i.e. of ear- or headphones covered by H04R1/10 or H04R5/033 but not provided for in any of their subgroups, or of hearing aids covered by H04R25/00 but not provided for in any of its subgroups
    • H04R2460/07Use of position data from wide-area or local-area positioning systems in hearing devices, e.g. program or information selection

Definitions

  • Three-Dimensional (3-D) audio is a technology in which sound is generated in a manner that creates the illusion that the source of the sound is somewhere in a 3-D space surrounding the hearer.
  • movies and video games may employ 3-D audio to create an “immersive” effect wherein the participant (e.g., user) is made to feel as if actually a part of the content being viewed by generating sound that appears to come from various directions (e.g., left, center right, back, etc.).
  • this effect may be simulated through the placement of speakers in the various directions. Sound from content such as, for example, a movie, a video game, etc. may then be directed to a speaker corresponding to the direction from which the sound is meant to be heard.
  • the coordinate system is relative to the headphones of the user, and thus, the head of the user.
  • the direction from which a sound emanates no longer matches the direction intended by the creation of the content, destroying the intended sense of immersion to be created by the content.
  • FIG. 1 illustrates an example system configured for audio control based on orientation in accordance with at least one embodiment of the present disclosure
  • FIG. 2 illustrates an example device usable with at least one embodiment of the present disclosure
  • FIG. 3 illustrates example interaction between audio control circuitry and other circuitry in a device in accordance with at least one embodiment of the present disclosure
  • FIG. 4 illustrates an example initial aligned orientation in accordance with at least one embodiment of the present disclosure
  • FIG. 5 illustrates an example of a displaced orientation in accordance with at least one embodiment of the present disclosure
  • FIG. 6 illustrates a second example of a displaced orientation in accordance with at least one embodiment of the present disclosure.
  • FIG. 7 is a flowchart of example operations for audio control based on orientation in accordance with at least one embodiment of the present disclosure.
  • Audio or “audio signal,” as referred to herein, comprises three-dimensional (3-D) audio that is configured to simulate sound sources at various locations within a 3-D sound field so that the listener (e.g., user) perceives that he/she is immersed in the content corresponding to the audio.
  • Any 3-D currently known or later emerging audio technology e.g., DirectSound® from Microsoft Corp., open AL® from Creative Technology Ltd., etc.
  • audio signal generation in a device may be controlled to maintain the perceived location of a sound source constant with respect to coordinate system, regardless of changes of position with respect to the device and/or user.
  • a coordinate system as referred to herein, may be a fixed coordinate system such as, for example, the World Geodetic System (WGS), longitude/latitude, compass directions (North (N), East (E), South (S) and West (W)), etc.
  • WMS World Geodetic System
  • Audio signal generation may be controlled based on the orientation of a device with respect to a coordinate system and/or a head of the user with respect to the device.
  • Orientation may include both the locations of the device/head in the coordinate system and a relative position (e.g., angle, tilt, offset, etc.) of the device with respect to the coordinate system, and the head with respect to the device.
  • a device configured to generate audio that is, for example, associated with video content such as a movie, game, etc. being played on the device may determine its orientation with respect to a fixed coordinate system.
  • the device may further determine the orientation of a user's head with respect to the device. The device may then determine if the orientation of the device has changed with respect to the fixed coordinate system and/or if the orientation of the user's head has changed with respect to the device.
  • Audio signal generation may then be controlled in the device (e.g., simulated positions of sound sources in the 3-D sound field relative to the user's headphones may be changed) to maintain the position of the sound sources perceived by the user with respect to the coordinate system.
  • the sense of immersion intended by the content creator is maintained regardless of device/head movement.
  • the device orientation with respect to the coordinate system may be determined based on, for example, coordinate information obtained from a Global Positioning System (GPS) receiver in the device, position information determined based on a wireless link to an access point (AP), direction information obtained from an electronic compass in the device, orientation information provided by orientation sensors, motion information provided by motion sensors, acceleration sensors, etc.
  • GPS Global Positioning System
  • AP access point
  • direction information obtained from an electronic compass in the device
  • orientation information provided by orientation sensors motion information provided by motion sensors, acceleration sensors, etc.
  • the head orientation with respect to the device may be determined based on detection of the user's face or detection of a position corresponding to an apparatus worn by the user. In user face detection the device may capture an image, detect a head within the image, determine a face in the detected head, and may determine the orientation of the user's head based on the detected face.
  • the device may receive a signal from the worn apparatus indicating the orientation of the worn apparatus, and orientation of the user's head may be determined based on the orientation of the worn apparatus.
  • the worn apparatus may provide location information to the apparatus via wireless communication, the worn apparatus may transmit a wireless signal that the device may use to determine orientation for the worn apparatus, etc.
  • the device may then determine if the device orientation has changed with respect to the coordinate system from the last time device orientation was determined and/or if the head orientation has changed with respect to the device from the last time head orientation was determined. If changes are determined to have occurred, at least one setting may be changed in the audio signal generation configuration within the device in order to maintain the perceived locations of the simulated sound sources constant with respect to the coordinate system.
  • FIG. 1 illustrates example system 100 configured for audio control based on orientation in accordance with at least one embodiment of the present disclosure.
  • System 100 may include a device 102 configured to generate audio (e.g., a 3-D audio signal).
  • Examples of system 100 may include a mobile communication device such as cellular handset or a smartphone based on the Android® operating system (OS), iOS®, Blackberry® OS, Palm® OS, Symbian® OS, etc., a mobile computing device such as a tablet computer like an ipad®, Galaxy Tab®, Kindle Fire®, etc., an Ultrabook® including a low-power chipset manufactured by Intel Corp., a netbook, a notebook computer, a laptop computer, etc.
  • OS Android® operating system
  • Examples of device 102 may also include typically stationary devices when used in conjunction with headphones or another listening apparatus that allows for user mobility.
  • Stationary systems may include a desktop computer with an integrated or separate display, a standalone monitor (e.g., television) and/or systems that may comprise a standalone monitor such as a home entertainment system, a videoconferencing system, etc.
  • the orientation of device 102 may be determined with respect to coordinate system 104 .
  • coordinate system 104 may be fixed with respect to the physical location in which system 100 is operating.
  • the orientation of device 102 and/or a user's head 106 may be determined with respect to coordinate system 104 in order to lock the sound field generated by device 102 into a fixed perspective that does not vary with the motion of device 102 and/or head 106 .
  • orientations in coordinate system 104 may be determined based on the type of technology being employed by device 102 , which will be discussed in further detail in regard to FIG. 4 .
  • a user that is, for example, viewing video content on device 102 may listen to an audio signal generated in association with the content on headphones 108 .
  • Headphones 108 may include any apparatus configured to generate sound based on an audio signal that may be worn by a user, and may include various configurations such as over-the-ear headphones, in-ear headphones (e.g., ear buds), headphones integrated into other devices (e.g., headphones that are part of a content viewing apparatus worn by a user), etc.
  • headphones 108 may be specially equipped to generate a 3-D sound field based on based on a 3-D audio signal (e.g., headphones 108 may include multiple speaker drivers, specially placed speaker drivers, etc.
  • Device 102 may be configured to determine the orientation of head 106 as shown at 110 , and may then generate audio signal 112 based on the determined orientation of device 102 with respect to coordinate system 104 and/or the orientation of head 106 with respect to device 102 . Audio signal 112 may be transmitted to headphones 106 via wired or wireless communication.
  • FIG. 2 illustrates example device 102 ′ usable with at least one embodiment of the present disclosure.
  • Device 102 ′ comprises circuitry capable of implementing the functionality illustrated in FIG. 1 .
  • System circuitry 200 may be configured to perform various functions that may occur during normal operation of device 102 ′.
  • processing circuitry 202 may comprise one or more processors situated in separate components, or alternatively, may comprise one or more processing cores situated in a single component (e.g., in a System-on-a-Chip (SOC) configuration).
  • SOC System-on-a-Chip
  • Example processors may include various X86-based microprocessors available from the Intel Corporation including those in the Pentium, Xeon, Itanium, Celeron, Atom, Core i-series product families.
  • Processing circuitry 202 may be configured to execute instructions in device 102 ′. Instructions may include program code configured to cause processing circuitry 202 to perform activities related to reading data, writing data, processing data, formulating data, converting data, transforming data, etc. Instructions, data, etc. may be stored in memory 204 .
  • Memory 204 may comprise random access memory (RAM) or read-only memory (ROM) in a fixed or removable format. RAM may include memory configured to hold information during the operation of device 102 ′ such as, for example, static RAM (SRAM) or Dynamic RAM (DRAM). ROM may include memories such as bios memory configured to provide instructions when device 102 ′ activates, programmable memories such as electronic programmable ROMs, (EPROMS), Flash, etc.
  • Power Circuitry 206 may include internal (e.g., battery) and/or external (e.g., wall plug) power sources and circuitry configured to supply device 102 ′ with the power needed to operate.
  • Communications interface circuitry 208 may be configured to handle packet routing and various control functions for communication circuitry 212 , which may include various resources for conducting wired and/or wireless communications.
  • Wired communications may include mediums such as, for example, Universal Serial Bus (USB), Ethernet, etc.
  • Wireless communications may include, for example, close-proximity wireless mediums (e.g., radio frequency (RF), infrared (IR), etc.), short-range wireless mediums (e.g., Bluetooth, wireless local area networking (WLAN), etc.) and long range wireless mediums (e.g., cellular, satellite, etc.).
  • RF radio frequency
  • IR infrared
  • WLAN wireless local area networking
  • communications interface circuitry 208 may be configured to prevent wireless communications active in communication circuitry 212 from interfering with each other. In performing this function, communications interface circuitry 208 may schedule activities for communication circuitry 212 based on the relative priority of the pending messages.
  • User interface circuitry 210 may include circuitry configured to allow a user to interact with device 102 ′ such as, for example, various input mechanisms (e.g., microphones, switches, buttons, knobs, keyboards, speakers, touch-sensitive surfaces, one or more sensors configured to capture images and/or sense proximity, distance, motion, gestures, etc.) and output mechanisms (e.g., speakers, displays, indicators, electromechanical components for vibration, motion, etc.).
  • various input mechanisms e.g., microphones, switches, buttons, knobs, keyboards, speakers, touch-sensitive surfaces, one or more sensors configured to capture images and/or sense proximity, distance, motion, gestures, etc.
  • output mechanisms e.g., speakers, displays, indicators, electromechanical components for vibration, motion, etc.
  • user interface circuitry 210 may include, or may be coupled to, audio control circuitry 214 .
  • Audio control circuitry 214 may be configured to receive orientation information for device 102 ′ and/or head 106 from user interface circuitry 214 or other circuitry in device 102 ′, to determine if the orientation of the device 102 ′ has changed with respect to coordinate system 104 and/or if the orientation of head 106 has changed with respect to device 102 ′, and to control generation of the audio signal based on the determination.
  • the audio signal may then be output from device 102 ′ via, for example, communication circuitry 212 in instances where wireless communication (e.g., Bluetooth) is employed to transmit the audio signal to headphones 108 .
  • wireless communication e.g., Bluetooth
  • FIG. 3 illustrates an example interaction between audio control circuitry 214 and other circuitry in device 102 in accordance with at least one embodiment of the present disclosure.
  • Audio control circuitry 214 may receive device orientation information from device orientation detection circuitry 300 .
  • Device orientation detection circuitry 300 may be configured to provide position and/or orientation information for device 102 with respect to coordinate system 104 such as, for example, GPS receiver circuitry configured to provide position/orientation information, compass circuitry configured to provide orientation information with regard to magnetic direction (e.g., in terms of degrees from true North or magnetic North), short-range wireless communication circuitry configured to provide relative or absolute position/orientation information, etc.
  • device 102 may be coupled to another device via short-range wireless communication (e.g., to an access point (AP) via WLAN communication) and may be able to determine relative position and/or orientation based on the strength and/or direction of signals received from the AP. Absolute position may then be determined based on the location of the AP.
  • Device orientation circuitry 300 may also determine the orientation of device 102 based on various sensors not tied to coordinate system 104 such as, for example, orientation sensors (e.g., tilt/angle sensors, etc.), motion sensors, acceleration sensors, etc.
  • Audio control circuitry 214 may also receive head orientation information using a variety of techniques, examples of which are illustrated at 302 A and 302 B.
  • the example illustrated at 302 determines head orientation based on face detection.
  • image capture circuitry e.g., a camera in user interface circuitry 210
  • Face detection circuitry 306 may be configured to identify a face and/or facial region in the images provided by image capture circuitry 304 , and in turn, an orientation for head 106 .
  • face detection circuitry 306 may include custom, proprietary, known and/or after-developed face recognition code (or instruction sets), hardware, and/or firmware that are generally well-defined and operable to receive an image (e.g., but not limited to, a RGB color image) and to identify, at least to a certain extent, a face in the image. Face detection circuitry 306 may also be configured to track the face through a series of images (e.g., video frames at 24 frames per second). Detection systems usable by face detection circuitry 306 include particle filtering, mean shift, Kalman filtering, etc., each of which may utilize edge analysis, sum-of-square-difference analysis, feature point analysis, histogram analysis, skin tone analysis, etc.
  • the orientation of head 106 may be determined based on an apparatus worn by the user (e.g., worn apparatus or WA) that is in communication with apparatus 102 .
  • An example of an apparatus worn by the user is headphones 108 or a visual aid apparatus used to view 3-D video content (e.g., glasses, goggles, etc.), but may also include apparatuses specifically designed to provide head orientation information (e.g., a sensor affixed to a headband, etc.).
  • a position/orientation sensor and transmitter may be affixed to headphones 108 and apparatus signal receiving circuitry 308 may be configured to receive information from the sensor.
  • the information may then be provided to apparatus orientation determination circuitry 310 , which may be configured to determine orientation of the apparatus based on the information. Since headphones 108 are attached to head 106 in a fixed orientation, the orientation of head 106 may be derived from the orientation of headphones 108 . In another implementation, a transmitter in headphones 108 may continually transmit a beacon signal (e.g., a signal that is identifiable as being transmitted by headphones 108 ) that is received by apparatus position receiving circuitry 308 . Apparatus orientation determination circuitry 310 may then determine the orientation of headphones 108 based on characteristics of the received signal (e.g., a direction from which the signal was received, received signal strength, etc.).
  • a beacon signal e.g., a signal that is identifiable as being transmitted by headphones 108
  • Apparatus orientation determination circuitry 310 may then determine the orientation of headphones 108 based on characteristics of the received signal (e.g., a direction from which the signal was received, received signal strength, etc.).
  • Orientation determination circuitry 312 may receive device orientation information from device orientation detection circuitry 300 , and may receive head orientation information via one of the example techniques illustrated at 302 A and 302 B. Orientation determination circuitry 312 may then be configured to determine if the orientation of device 102 has changed with respect to coordinate system 104 and/or the orientation of head 106 has changed with respect to device 102 . Orientation change may be quantified by, for example, angle offset from the previous orientation measurement, distance offset from the previous orientation measurement, etc. Sensed changes in orientation may be provided to audio adjustment circuitry 314 , which may be configured to alter the generation of the audio signal based on the change in orientation. The manner in which the generation is altered may depend on, for example, the particular 3-D audio technology being employed.
  • any changes in orientation determined by orientation determination circuitry 306 may be used to offset the simulated position of sound sources in the 3-D audio field so that the simulated sound sources appear fixed to coordinate system 104 regardless of the position of device 102 or head 106 .
  • the resulting audio signal may then be transmitted from device 102 (e.g., to headphones 108 ) via communication circuitry 212 .
  • FIG. 4 illustrates an example initial aligned orientation in accordance with at least one embodiment of the present disclosure.
  • head 106 is aligned with device 102 along coordinate system 104 , which in the example of FIG. 4 is based on magnetic compass headings (e.g., N, E, S and W).
  • Simulated sound source 400 does not actually exist, but is perceived by the user as producing sound from a NW direction, and may correspond to, for example, sound sources to the left of the user as depicted in content being viewed on device 102 (e.g., a movie, video game, etc.).
  • Simulated sound source 400 may be generated by headphones 108 using an audio signal generated by device 102 and then transmitted to headphones 108 .
  • FIG. 5 illustrates an example of a displaced orientation in accordance with at least one embodiment of the present disclosure.
  • device 102 and head 106 are still aligned along axis 500 , but device 102 is no longer aligned with “N” (North) in coordinate system 104 .
  • N North
  • the sound generated by headphones 108 stays relative to the position of headphones 108 . Therefore, when device 102 is offset as shown at 502 , head 106 and headphones 108 are also displaced a proportional amount, and the perceived position of the simulated sound source shifts to 400 A.
  • the displaced position of simulated sound source 400 A no longer matches the position for simulated sound source 400 in coordinate system 104 as intended by the content creator, which may disrupt the sense of 3-D immersion for the user.
  • device 102 may determine that its orientation with respect to coordinate system 104 has been displaced by offset 502 , and may change the position of the simulated sound source to 400 B as shown at 504 .
  • the sound coming from sound source 400 B shifts from the perspective of the user (e.g., the sound seems to be coming from closer to the front of head 106 ), while remaining in alignment with coordinate system 104 .
  • FIG. 6 illustrates a second example of a displaced orientation in accordance with at least one embodiment of the present disclosure.
  • head 106 is further displaced from axis 500 by offset 600 . Similar to the example of FIG. 5 , without audio control such as disclosed herein, displacement 600 would cause the perceived location of simulated sound source to shift to 400 C in FIG. 500 .
  • both the displacement of device 102 from coordinate system 104 (e.g., offset 502 ) and the displacement from head 106 to device 102 (e.g., offset 600 ) may be determined, and the generation of the audio signal may be controlled so that the perceived position of the simulated sound source is shifted to 400 D as illustrated at 602 .
  • the sound from simulated sound source 400 D now seems to be coming from directly in front of head 106 .
  • the perceived location of simulated sound source 400 remains tied to coordinate system 104 , and thus, the sense of 3-D immersion intended by the creator of the content may be preserved.
  • FIG. 7 is a flowchart of example operations for audio control based on orientation in accordance with at least one embodiment of the present disclosure.
  • device orientation may be determined. For example, current device orientation may be detected and a determination may be made as to whether the orientation of the device configured to generate an audio signal has changed with respect to a fixed coordinate system. Head orientation may then be determined in operation 702 . For example, current user head orientation may be detected and a determination may be made as to whether the orientation of the user's head has changed with respect to the device. A determination may then be made in operation 704 as to whether there has been a change in orientation (e.g., the device with respect to the coordinate system and/or the head with respect to the device).
  • a change in orientation e.g., the device with respect to the coordinate system and/or the head with respect to the device.
  • the device may maintain the existing audio configuration.
  • the device may control audio generation based on the orientation change. For example, the device may reconfigure audio parameters based on the orientation change in order to change the position of a simulated sound source in the 3-D audio field. As a result, the perceived position of the simulated sound source may remain constant with respect to the coordinate system regardless of changes in device or head orientation.
  • FIG. 7 illustrates various operations according to an embodiment
  • FIG. 7 illustrates various operations according to an embodiment
  • the operations depicted in FIG. 7 are necessary for other embodiments.
  • the operations depicted in FIG. 7 may be combined in a manner not specifically shown in any of the drawings, but still fully consistent with the present disclosure.
  • claims directed to features and/or operations that are not exactly shown in one drawing are deemed within the scope and content of the present disclosure.
  • module may refer to software, firmware and/or circuitry configured to perform any of the aforementioned operations.
  • Software may be embodied as a software package, code, instructions, instruction sets and/or data recorded on non-transitory computer readable storage mediums.
  • Firmware may be embodied as code, instructions or instruction sets and/or data that are hard-coded (e.g., nonvolatile) in memory devices.
  • Circuitry may comprise, for example, singly or in any combination, hardwired circuitry, programmable circuitry such as computer processors comprising one or more individual instruction processing cores, state machine circuitry, and/or firmware that stores instructions executed by programmable circuitry.
  • the modules may, collectively or individually, be embodied as circuitry that forms part of a larger system, for example, an integrated circuit (IC), system on-chip (SoC), desktop computers, laptop computers, tablet computers, servers, smart phones, etc.
  • IC integrated circuit
  • SoC system on-chip
  • any of the operations described herein may be implemented in a system that includes one or more storage mediums having stored thereon, individually or in combination, instructions that when executed by one or more processors perform the methods.
  • the processor may include, for example, a server CPU, a mobile device CPU, and/or other programmable circuitry. Also, it is intended that operations described herein may be distributed across a plurality of physical devices, such as processing structures at more than one different physical location.
  • the storage medium may include any type of tangible medium, for example, any type of disk including hard disks, floppy disks, optical disks, compact disk read-only memories (CD-ROMs), compact disk rewritables (CD-RWs), and magneto-optical disks, semiconductor devices such as read-only memories (ROMs), random access memories (RAMs) such as dynamic and static RAMs, erasable programmable read-only memories (EPROMs), electrically erasable programmable read-only memories (EEPROMs), flash memories, Solid State Disks (SSDs), embedded multimedia cards (eMMCs), secure digital input/output (SDIO) cards, magnetic or optical cards, or any type of media suitable for storing electronic instructions.
  • ROMs read-only memories
  • RAMs random access memories
  • EPROMs erasable programmable read-only memories
  • EEPROMs electrically erasable programmable read-only memories
  • flash memories Solid State Disks (SSDs), embedded multimedia cards (eMMC
  • a device may be configured to sense its own orientation with respect to a coordinate system, and to also sense the orientation of a user's head with respect to the device. The device may then determine whether the device orientation has changed with respect to the coordinate system and/or whether the head orientation has changed with respect to the device. Audio signal generation in the device may then be controlled based on the determination. For example, the position of a simulated sound source may be changed in a 3-D audio field based on determined changes in device and/or head orientation. The audio signal may then be transmitted from the device to an apparatus configured to generate sound based on the audio signal (e.g., headphones).
  • an apparatus configured to generate sound based on the audio signal (e.g., headphones).
  • the device may include device orientation detection circuitry configured to determine device orientation with respect to a coordinate system, user head orientation detection circuitry configured to determine user head orientation with respect to the device and audio adjustment circuitry configured to control the generation of an audio signal based on the device orientation and user head orientation.
  • the above example device may be further configured, wherein the device further comprises image capture circuitry and determining the user head orientation with respect to the device comprises capturing an image using the image capture circuitry, detecting a face of a user in the image and determining an orientation for the head of the user based on the detected user face.
  • the above example device may be further configured, wherein determining the user head orientation with respect to the device comprises detecting an orientation of an apparatus worn by a user and determining the user head orientation based on the orientation of the worn apparatus.
  • the device may further comprise at least a wireless receiver and detecting the worn apparatus orientation comprises receiving position information from the worn apparatus via wireless communication.
  • the above example device may be further configured, wherein controlling the generation of the audio signal is based on the relative orientation of the device with respect to the coordinate system or the relative orientation of the user head with respect to the device.
  • the above example device may be further configured, wherein controlling the generation of the audio signal is based on the relative orientation of the device with respect to the coordinate system and the relative orientation of the user head with respect to the device.
  • the above example device may be further configured, wherein the audio signal is a surround sound audio signal configured to simulate a three-dimensional (3-D) sound field.
  • controlling the generation of the audio signal may comprise changing the location of a simulated sound source within the 3-D sound field.
  • changing the location of the simulated sound source within the 3-D sound field may further comprise changing the location of the simulated sound source to maintain the relative position of the simulated source within the coordinate system with respect to at least the user head orientation.
  • the above example device may be further configured, wherein the device further comprises at least a wireless transmitter configured to transmit the audio signal.
  • the method may comprise determining device orientation with respect to a coordinate system, determining user head orientation with respect to the device and controlling generation of an audio signal based on the device orientation and user head orientation.
  • determining the user head orientation with respect to the device comprises capturing an image, detecting a face of a user in the image and determining an orientation for the head of the user based on the detected user face.
  • determining the user head orientation with respect to the device comprises detecting an orientation of an apparatus worn by a user and determining the user head orientation based on the orientation of the worn apparatus.
  • detecting the worn apparatus orientation may further comprise receiving position information from the worn apparatus via wireless communication.
  • the above example method may be further configured, wherein controlling the generation of the audio signal is based on the relative orientation of the device with respect to the coordinate system or the relative orientation of the user head with respect to the device.
  • the above example method may be further configured, wherein controlling the generation of the audio signal is based on the relative orientation of the device with respect to the coordinate system and the relative orientation of the user head with respect to the device.
  • the above example method may be further configured, wherein the audio signal is a surround sound audio signal configured to simulate a three-dimensional (3-D) sound field.
  • controlling the generation of the audio signal may further comprise changing the location of a simulated sound source within the 3-D sound field.
  • changing the location of the simulated sound source within the 3-D sound field may further comprise changing the location of the simulated sound source to maintain the relative position of the simulated source within the coordinate system with respect to at least the user head orientation.
  • the above example method may further comprise transmitting the audio signal via wireless communication.
  • a system comprising at least a device configured to generate an audio signal, the system being arranged to perform any of the above example methods.
  • At least one machine readable medium comprising a plurality of instructions that, in response to be being executed on a computing device, cause the computing device to carry out any of the above example methods.
  • an apparatus for audio control based on orientation the apparatus being arranged to perform any of the above example methods.

Landscapes

  • Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Stereophonic System (AREA)

Abstract

A device may be configured to sense its own orientation with respect to a coordinate system, and to also sense the orientation of a user's head with respect to the device. The device may then determine whether the device orientation has changed with respect to the coordinate system and/or whether the head orientation has changed with respect to the device. Audio signal generation in the device may then be controlled based on the determination. For example, the position of a simulated sound source may be changed in a 3-D audio field based on determined changes in device and/or head orientation. The audio signal may then be transmitted from the device to an apparatus configured to generate sound based on the audio signal (e.g., headphones).

Description

BACKGROUND
Three-Dimensional (3-D) audio is a technology in which sound is generated in a manner that creates the illusion that the source of the sound is somewhere in a 3-D space surrounding the hearer. For example, movies and video games may employ 3-D audio to create an “immersive” effect wherein the participant (e.g., user) is made to feel as if actually a part of the content being viewed by generating sound that appears to come from various directions (e.g., left, center right, back, etc.). In stationary systems this effect may be simulated through the placement of speakers in the various directions. Sound from content such as, for example, a movie, a video game, etc. may then be directed to a speaker corresponding to the direction from which the sound is meant to be heard. Since the system is fixed, the sound always appears to be coming from the correct direction (e.g., the direction intended by the creator of the content) regardless of where the user is positioned. More recently, portable systems (e.g., mobile communication and/or computing devices) have evolved to have the capability of generating 3-D audio. A user may view content on a portable system and listen to audio associated with the content on headphones configured to generate directional sound in accordance with the content. However, at least one limitation that exists in portable 3-D audio is that the sound field is relative to the headphones. As opposed to a stationary system where the sound generation is relative to a fixed coordinate system, and so the user is free to move about the physical space without changing the intended sense of immersion, in portable devices the coordinate system is relative to the headphones of the user, and thus, the head of the user. When the head of the user moves the sound field moves as well. As a result, the direction from which a sound emanates no longer matches the direction intended by the creation of the content, destroying the intended sense of immersion to be created by the content.
BRIEF DESCRIPTION OF THE DRAWINGS
Features and advantages of various embodiments of the claimed subject matter will become apparent as the following Detailed Description proceeds, and upon reference to the Drawings, wherein like numerals designate like parts, and in which:
FIG. 1 illustrates an example system configured for audio control based on orientation in accordance with at least one embodiment of the present disclosure;
FIG. 2 illustrates an example device usable with at least one embodiment of the present disclosure;
FIG. 3 illustrates example interaction between audio control circuitry and other circuitry in a device in accordance with at least one embodiment of the present disclosure;
FIG. 4 illustrates an example initial aligned orientation in accordance with at least one embodiment of the present disclosure;
FIG. 5 illustrates an example of a displaced orientation in accordance with at least one embodiment of the present disclosure;
FIG. 6 illustrates a second example of a displaced orientation in accordance with at least one embodiment of the present disclosure; and
FIG. 7 is a flowchart of example operations for audio control based on orientation in accordance with at least one embodiment of the present disclosure.
Although the following Detailed Description will proceed with reference being made to illustrative embodiments, many alternatives, modifications and variations thereof will be apparent to those skilled in the art.
DETAILED DESCRIPTION
Generally, this disclosure describes systems and methods for audio control based on orientation. “Audio” or “audio signal,” as referred to herein, comprises three-dimensional (3-D) audio that is configured to simulate sound sources at various locations within a 3-D sound field so that the listener (e.g., user) perceives that he/she is immersed in the content corresponding to the audio. Any 3-D currently known or later emerging audio technology (e.g., DirectSound® from Microsoft Corp., open AL® from Creative Technology Ltd., etc.) may be employed as the various embodiments disclosed herein to automatically manipulate 3-D audio configuration, not the manner in which 3-D is produced. In one embodiment, audio signal generation in a device may be controlled to maintain the perceived location of a sound source constant with respect to coordinate system, regardless of changes of position with respect to the device and/or user. A coordinate system, as referred to herein, may be a fixed coordinate system such as, for example, the World Geodetic System (WGS), longitude/latitude, compass directions (North (N), East (E), South (S) and West (W)), etc. Audio signal generation may be controlled based on the orientation of a device with respect to a coordinate system and/or a head of the user with respect to the device. Orientation, as referred to herein, may include both the locations of the device/head in the coordinate system and a relative position (e.g., angle, tilt, offset, etc.) of the device with respect to the coordinate system, and the head with respect to the device. For example, a device configured to generate audio that is, for example, associated with video content such as a movie, game, etc. being played on the device may determine its orientation with respect to a fixed coordinate system. In addition, the device may further determine the orientation of a user's head with respect to the device. The device may then determine if the orientation of the device has changed with respect to the fixed coordinate system and/or if the orientation of the user's head has changed with respect to the device. Audio signal generation may then be controlled in the device (e.g., simulated positions of sound sources in the 3-D sound field relative to the user's headphones may be changed) to maintain the position of the sound sources perceived by the user with respect to the coordinate system. Thus, the sense of immersion intended by the content creator is maintained regardless of device/head movement.
In one embodiment, the device orientation with respect to the coordinate system may be determined based on, for example, coordinate information obtained from a Global Positioning System (GPS) receiver in the device, position information determined based on a wireless link to an access point (AP), direction information obtained from an electronic compass in the device, orientation information provided by orientation sensors, motion information provided by motion sensors, acceleration sensors, etc. In the same or a different embodiment, the head orientation with respect to the device may be determined based on detection of the user's face or detection of a position corresponding to an apparatus worn by the user. In user face detection the device may capture an image, detect a head within the image, determine a face in the detected head, and may determine the orientation of the user's head based on the detected face. In worn apparatus (e.g., headphones or another apparatus worn by the user), the device may receive a signal from the worn apparatus indicating the orientation of the worn apparatus, and orientation of the user's head may be determined based on the orientation of the worn apparatus. For example, the worn apparatus may provide location information to the apparatus via wireless communication, the worn apparatus may transmit a wireless signal that the device may use to determine orientation for the worn apparatus, etc. The device may then determine if the device orientation has changed with respect to the coordinate system from the last time device orientation was determined and/or if the head orientation has changed with respect to the device from the last time head orientation was determined. If changes are determined to have occurred, at least one setting may be changed in the audio signal generation configuration within the device in order to maintain the perceived locations of the simulated sound sources constant with respect to the coordinate system.
FIG. 1 illustrates example system 100 configured for audio control based on orientation in accordance with at least one embodiment of the present disclosure. System 100 may include a device 102 configured to generate audio (e.g., a 3-D audio signal). Examples of system 100 may include a mobile communication device such as cellular handset or a smartphone based on the Android® operating system (OS), iOS®, Blackberry® OS, Palm® OS, Symbian® OS, etc., a mobile computing device such as a tablet computer like an ipad®, Galaxy Tab®, Kindle Fire®, etc., an Ultrabook® including a low-power chipset manufactured by Intel Corp., a netbook, a notebook computer, a laptop computer, etc. Examples of device 102 may also include typically stationary devices when used in conjunction with headphones or another listening apparatus that allows for user mobility. Stationary systems may include a desktop computer with an integrated or separate display, a standalone monitor (e.g., television) and/or systems that may comprise a standalone monitor such as a home entertainment system, a videoconferencing system, etc.
The orientation of device 102 may be determined with respect to coordinate system 104. As indicated above, coordinate system 104 may be fixed with respect to the physical location in which system 100 is operating. The orientation of device 102 and/or a user's head 106 may be determined with respect to coordinate system 104 in order to lock the sound field generated by device 102 into a fixed perspective that does not vary with the motion of device 102 and/or head 106. As discussed above, orientations in coordinate system 104 may be determined based on the type of technology being employed by device 102, which will be discussed in further detail in regard to FIG. 4. A user that is, for example, viewing video content on device 102 may listen to an audio signal generated in association with the content on headphones 108. Headphones 108 may include any apparatus configured to generate sound based on an audio signal that may be worn by a user, and may include various configurations such as over-the-ear headphones, in-ear headphones (e.g., ear buds), headphones integrated into other devices (e.g., headphones that are part of a content viewing apparatus worn by a user), etc. In one embodiment, headphones 108 may be specially equipped to generate a 3-D sound field based on based on a 3-D audio signal (e.g., headphones 108 may include multiple speaker drivers, specially placed speaker drivers, etc. Device 102 may be configured to determine the orientation of head 106 as shown at 110, and may then generate audio signal 112 based on the determined orientation of device 102 with respect to coordinate system 104 and/or the orientation of head 106 with respect to device 102. Audio signal 112 may be transmitted to headphones 106 via wired or wireless communication.
FIG. 2 illustrates example device 102′ usable with at least one embodiment of the present disclosure. Device 102′ comprises circuitry capable of implementing the functionality illustrated in FIG. 1. System circuitry 200 may be configured to perform various functions that may occur during normal operation of device 102′. For example, processing circuitry 202 may comprise one or more processors situated in separate components, or alternatively, may comprise one or more processing cores situated in a single component (e.g., in a System-on-a-Chip (SOC) configuration). Example processors may include various X86-based microprocessors available from the Intel Corporation including those in the Pentium, Xeon, Itanium, Celeron, Atom, Core i-series product families. Processing circuitry 202 may be configured to execute instructions in device 102′. Instructions may include program code configured to cause processing circuitry 202 to perform activities related to reading data, writing data, processing data, formulating data, converting data, transforming data, etc. Instructions, data, etc. may be stored in memory 204. Memory 204 may comprise random access memory (RAM) or read-only memory (ROM) in a fixed or removable format. RAM may include memory configured to hold information during the operation of device 102′ such as, for example, static RAM (SRAM) or Dynamic RAM (DRAM). ROM may include memories such as bios memory configured to provide instructions when device 102′ activates, programmable memories such as electronic programmable ROMs, (EPROMS), Flash, etc. Other fixed and/or removable memory may include magnetic memories such as floppy disks, hard drives, etc., electronic memories such as solid state flash memory (e.g., eMMC, etc.), removable memory cards or sticks (e.g., uSD, USB, etc.), optical memories such as compact disc-based ROM (CD-ROM), etc. Power Circuitry 206 may include internal (e.g., battery) and/or external (e.g., wall plug) power sources and circuitry configured to supply device 102′ with the power needed to operate. Communications interface circuitry 208 may be configured to handle packet routing and various control functions for communication circuitry 212, which may include various resources for conducting wired and/or wireless communications. Wired communications may include mediums such as, for example, Universal Serial Bus (USB), Ethernet, etc. Wireless communications may include, for example, close-proximity wireless mediums (e.g., radio frequency (RF), infrared (IR), etc.), short-range wireless mediums (e.g., Bluetooth, wireless local area networking (WLAN), etc.) and long range wireless mediums (e.g., cellular, satellite, etc.). For example, communications interface circuitry 208 may be configured to prevent wireless communications active in communication circuitry 212 from interfering with each other. In performing this function, communications interface circuitry 208 may schedule activities for communication circuitry 212 based on the relative priority of the pending messages.
User interface circuitry 210 may include circuitry configured to allow a user to interact with device 102′ such as, for example, various input mechanisms (e.g., microphones, switches, buttons, knobs, keyboards, speakers, touch-sensitive surfaces, one or more sensors configured to capture images and/or sense proximity, distance, motion, gestures, etc.) and output mechanisms (e.g., speakers, displays, indicators, electromechanical components for vibration, motion, etc.). In one embodiment, user interface circuitry 210 may include, or may be coupled to, audio control circuitry 214. Audio control circuitry 214 may be configured to receive orientation information for device 102′ and/or head 106 from user interface circuitry 214 or other circuitry in device 102′, to determine if the orientation of the device 102′ has changed with respect to coordinate system 104 and/or if the orientation of head 106 has changed with respect to device 102′, and to control generation of the audio signal based on the determination. The audio signal may then be output from device 102′ via, for example, communication circuitry 212 in instances where wireless communication (e.g., Bluetooth) is employed to transmit the audio signal to headphones 108.
FIG. 3 illustrates an example interaction between audio control circuitry 214 and other circuitry in device 102 in accordance with at least one embodiment of the present disclosure. Audio control circuitry 214 may receive device orientation information from device orientation detection circuitry 300. Device orientation detection circuitry 300 may be configured to provide position and/or orientation information for device 102 with respect to coordinate system 104 such as, for example, GPS receiver circuitry configured to provide position/orientation information, compass circuitry configured to provide orientation information with regard to magnetic direction (e.g., in terms of degrees from true North or magnetic North), short-range wireless communication circuitry configured to provide relative or absolute position/orientation information, etc. For the short-range wireless communication scenario, device 102 may be coupled to another device via short-range wireless communication (e.g., to an access point (AP) via WLAN communication) and may be able to determine relative position and/or orientation based on the strength and/or direction of signals received from the AP. Absolute position may then be determined based on the location of the AP. Device orientation circuitry 300 may also determine the orientation of device 102 based on various sensors not tied to coordinate system 104 such as, for example, orientation sensors (e.g., tilt/angle sensors, etc.), motion sensors, acceleration sensors, etc.
Audio control circuitry 214 may also receive head orientation information using a variety of techniques, examples of which are illustrated at 302A and 302B. The example illustrated at 302 determines head orientation based on face detection. In this regard, image capture circuitry (e.g., a camera in user interface circuitry 210) may be configured to capture images periodically, on a set interval, etc. Face detection circuitry 306 may be configured to identify a face and/or facial region in the images provided by image capture circuitry 304, and in turn, an orientation for head 106. For example, face detection circuitry 306 may include custom, proprietary, known and/or after-developed face recognition code (or instruction sets), hardware, and/or firmware that are generally well-defined and operable to receive an image (e.g., but not limited to, a RGB color image) and to identify, at least to a certain extent, a face in the image. Face detection circuitry 306 may also be configured to track the face through a series of images (e.g., video frames at 24 frames per second). Detection systems usable by face detection circuitry 306 include particle filtering, mean shift, Kalman filtering, etc., each of which may utilize edge analysis, sum-of-square-difference analysis, feature point analysis, histogram analysis, skin tone analysis, etc.
In the embodiment illustrated at 302B, the orientation of head 106 may be determined based on an apparatus worn by the user (e.g., worn apparatus or WA) that is in communication with apparatus 102. An example of an apparatus worn by the user is headphones 108 or a visual aid apparatus used to view 3-D video content (e.g., glasses, goggles, etc.), but may also include apparatuses specifically designed to provide head orientation information (e.g., a sensor affixed to a headband, etc.). In one embodiment, a position/orientation sensor and transmitter may be affixed to headphones 108 and apparatus signal receiving circuitry 308 may be configured to receive information from the sensor. The information may then be provided to apparatus orientation determination circuitry 310, which may be configured to determine orientation of the apparatus based on the information. Since headphones 108 are attached to head 106 in a fixed orientation, the orientation of head 106 may be derived from the orientation of headphones 108. In another implementation, a transmitter in headphones 108 may continually transmit a beacon signal (e.g., a signal that is identifiable as being transmitted by headphones 108) that is received by apparatus position receiving circuitry 308. Apparatus orientation determination circuitry 310 may then determine the orientation of headphones 108 based on characteristics of the received signal (e.g., a direction from which the signal was received, received signal strength, etc.).
Orientation determination circuitry 312 may receive device orientation information from device orientation detection circuitry 300, and may receive head orientation information via one of the example techniques illustrated at 302A and 302B. Orientation determination circuitry 312 may then be configured to determine if the orientation of device 102 has changed with respect to coordinate system 104 and/or the orientation of head 106 has changed with respect to device 102. Orientation change may be quantified by, for example, angle offset from the previous orientation measurement, distance offset from the previous orientation measurement, etc. Sensed changes in orientation may be provided to audio adjustment circuitry 314, which may be configured to alter the generation of the audio signal based on the change in orientation. The manner in which the generation is altered may depend on, for example, the particular 3-D audio technology being employed. For example, many 3-D audio systems allow the location of simulated sound sources to be modified in the 3-D audio field (e.g., via the configuration of certain software parameters in the 3-D audio control software). In one embodiment, any changes in orientation determined by orientation determination circuitry 306 may be used to offset the simulated position of sound sources in the 3-D audio field so that the simulated sound sources appear fixed to coordinate system 104 regardless of the position of device 102 or head 106. The resulting audio signal may then be transmitted from device 102 (e.g., to headphones 108) via communication circuitry 212.
FIG. 4 illustrates an example initial aligned orientation in accordance with at least one embodiment of the present disclosure. In particular, head 106 is aligned with device 102 along coordinate system 104, which in the example of FIG. 4 is based on magnetic compass headings (e.g., N, E, S and W). Simulated sound source 400 does not actually exist, but is perceived by the user as producing sound from a NW direction, and may correspond to, for example, sound sources to the left of the user as depicted in content being viewed on device 102 (e.g., a movie, video game, etc.). Simulated sound source 400 may be generated by headphones 108 using an audio signal generated by device 102 and then transmitted to headphones 108.
FIG. 5 illustrates an example of a displaced orientation in accordance with at least one embodiment of the present disclosure. In particular, device 102 and head 106 are still aligned along axis 500, but device 102 is no longer aligned with “N” (North) in coordinate system 104. It is important to note that without audio control such as disclosed herein, the sound generated by headphones 108 stays relative to the position of headphones 108. Therefore, when device 102 is offset as shown at 502, head 106 and headphones 108 are also displaced a proportional amount, and the perceived position of the simulated sound source shifts to 400A. The displaced position of simulated sound source 400A no longer matches the position for simulated sound source 400 in coordinate system 104 as intended by the content creator, which may disrupt the sense of 3-D immersion for the user. In one embodiment, device 102 may determine that its orientation with respect to coordinate system 104 has been displaced by offset 502, and may change the position of the simulated sound source to 400B as shown at 504. As a result, the sound coming from sound source 400B shifts from the perspective of the user (e.g., the sound seems to be coming from closer to the front of head 106), while remaining in alignment with coordinate system 104.
FIG. 6 illustrates a second example of a displaced orientation in accordance with at least one embodiment of the present disclosure. In addition to device 102 being displaced from “N” in coordinate system 104 by offset 502, head 106 is further displaced from axis 500 by offset 600. Similar to the example of FIG. 5, without audio control such as disclosed herein, displacement 600 would cause the perceived location of simulated sound source to shift to 400C in FIG. 500. Instead, in one embodiment of the present invention both the displacement of device 102 from coordinate system 104 (e.g., offset 502) and the displacement from head 106 to device 102 (e.g., offset 600) may be determined, and the generation of the audio signal may be controlled so that the perceived position of the simulated sound source is shifted to 400D as illustrated at 602. The sound from simulated sound source 400D now seems to be coming from directly in front of head 106. Thus, while device 102 and/or head 106 may move with respect to coordinate system 104, the perceived location of simulated sound source 400 remains tied to coordinate system 104, and thus, the sense of 3-D immersion intended by the creator of the content may be preserved.
FIG. 7 is a flowchart of example operations for audio control based on orientation in accordance with at least one embodiment of the present disclosure. In operation 700 device orientation may be determined. For example, current device orientation may be detected and a determination may be made as to whether the orientation of the device configured to generate an audio signal has changed with respect to a fixed coordinate system. Head orientation may then be determined in operation 702. For example, current user head orientation may be detected and a determination may be made as to whether the orientation of the user's head has changed with respect to the device. A determination may then be made in operation 704 as to whether there has been a change in orientation (e.g., the device with respect to the coordinate system and/or the head with respect to the device). If it is determined in operation 704 that there was no change in orientation, then in operation 706 the device may maintain the existing audio configuration. Alternatively, if it is determined that an orientation change has occurred, then in operation 708 the device may control audio generation based on the orientation change. For example, the device may reconfigure audio parameters based on the orientation change in order to change the position of a simulated sound source in the 3-D audio field. As a result, the perceived position of the simulated sound source may remain constant with respect to the coordinate system regardless of changes in device or head orientation.
While FIG. 7 illustrates various operations according to an embodiment, it is to be understood that not all of the operations depicted in FIG. 7 are necessary for other embodiments. Indeed, it is fully contemplated herein that in other embodiments of the present disclosure, the operations depicted in FIG. 7, and/or other operations described herein, may be combined in a manner not specifically shown in any of the drawings, but still fully consistent with the present disclosure. Thus, claims directed to features and/or operations that are not exactly shown in one drawing are deemed within the scope and content of the present disclosure.
As used in any embodiment herein, the term “module” may refer to software, firmware and/or circuitry configured to perform any of the aforementioned operations.
Software may be embodied as a software package, code, instructions, instruction sets and/or data recorded on non-transitory computer readable storage mediums. Firmware may be embodied as code, instructions or instruction sets and/or data that are hard-coded (e.g., nonvolatile) in memory devices.
“Circuitry”, as used in any embodiment herein, may comprise, for example, singly or in any combination, hardwired circuitry, programmable circuitry such as computer processors comprising one or more individual instruction processing cores, state machine circuitry, and/or firmware that stores instructions executed by programmable circuitry. The modules may, collectively or individually, be embodied as circuitry that forms part of a larger system, for example, an integrated circuit (IC), system on-chip (SoC), desktop computers, laptop computers, tablet computers, servers, smart phones, etc.
Any of the operations described herein may be implemented in a system that includes one or more storage mediums having stored thereon, individually or in combination, instructions that when executed by one or more processors perform the methods. Here, the processor may include, for example, a server CPU, a mobile device CPU, and/or other programmable circuitry. Also, it is intended that operations described herein may be distributed across a plurality of physical devices, such as processing structures at more than one different physical location. The storage medium may include any type of tangible medium, for example, any type of disk including hard disks, floppy disks, optical disks, compact disk read-only memories (CD-ROMs), compact disk rewritables (CD-RWs), and magneto-optical disks, semiconductor devices such as read-only memories (ROMs), random access memories (RAMs) such as dynamic and static RAMs, erasable programmable read-only memories (EPROMs), electrically erasable programmable read-only memories (EEPROMs), flash memories, Solid State Disks (SSDs), embedded multimedia cards (eMMCs), secure digital input/output (SDIO) cards, magnetic or optical cards, or any type of media suitable for storing electronic instructions. Other embodiments may be implemented as software modules executed by a programmable control device.
Thus, the present disclosure provides systems and methods for audio control based on orientation. A device may be configured to sense its own orientation with respect to a coordinate system, and to also sense the orientation of a user's head with respect to the device. The device may then determine whether the device orientation has changed with respect to the coordinate system and/or whether the head orientation has changed with respect to the device. Audio signal generation in the device may then be controlled based on the determination. For example, the position of a simulated sound source may be changed in a 3-D audio field based on determined changes in device and/or head orientation. The audio signal may then be transmitted from the device to an apparatus configured to generate sound based on the audio signal (e.g., headphones).
The following examples pertain to further embodiments. In one example embodiment there is provided a device. The device may include device orientation detection circuitry configured to determine device orientation with respect to a coordinate system, user head orientation detection circuitry configured to determine user head orientation with respect to the device and audio adjustment circuitry configured to control the generation of an audio signal based on the device orientation and user head orientation.
The above example device may be further configured, wherein the device further comprises image capture circuitry and determining the user head orientation with respect to the device comprises capturing an image using the image capture circuitry, detecting a face of a user in the image and determining an orientation for the head of the user based on the detected user face.
The above example device may be further configured, wherein determining the user head orientation with respect to the device comprises detecting an orientation of an apparatus worn by a user and determining the user head orientation based on the orientation of the worn apparatus. In this example configuration the device may further comprise at least a wireless receiver and detecting the worn apparatus orientation comprises receiving position information from the worn apparatus via wireless communication.
The above example device may be further configured, wherein controlling the generation of the audio signal is based on the relative orientation of the device with respect to the coordinate system or the relative orientation of the user head with respect to the device.
The above example device may be further configured, wherein controlling the generation of the audio signal is based on the relative orientation of the device with respect to the coordinate system and the relative orientation of the user head with respect to the device.
The above example device may be further configured, wherein the audio signal is a surround sound audio signal configured to simulate a three-dimensional (3-D) sound field. In this example configuration controlling the generation of the audio signal may comprise changing the location of a simulated sound source within the 3-D sound field. In this example configuration changing the location of the simulated sound source within the 3-D sound field may further comprise changing the location of the simulated sound source to maintain the relative position of the simulated source within the coordinate system with respect to at least the user head orientation.
The above example device may be further configured, wherein the device further comprises at least a wireless transmitter configured to transmit the audio signal.
In another example embodiment there is provided a method. The method may comprise determining device orientation with respect to a coordinate system, determining user head orientation with respect to the device and controlling generation of an audio signal based on the device orientation and user head orientation.
The above example method may be further configured, wherein determining the user head orientation with respect to the device comprises capturing an image, detecting a face of a user in the image and determining an orientation for the head of the user based on the detected user face.
The above example method may be further configured, wherein determining the user head orientation with respect to the device comprises detecting an orientation of an apparatus worn by a user and determining the user head orientation based on the orientation of the worn apparatus. In this example configuration detecting the worn apparatus orientation may further comprise receiving position information from the worn apparatus via wireless communication.
The above example method may be further configured, wherein controlling the generation of the audio signal is based on the relative orientation of the device with respect to the coordinate system or the relative orientation of the user head with respect to the device.
The above example method may be further configured, wherein controlling the generation of the audio signal is based on the relative orientation of the device with respect to the coordinate system and the relative orientation of the user head with respect to the device.
The above example method may be further configured, wherein the audio signal is a surround sound audio signal configured to simulate a three-dimensional (3-D) sound field. In this example configuration controlling the generation of the audio signal may further comprise changing the location of a simulated sound source within the 3-D sound field. In this example configuration changing the location of the simulated sound source within the 3-D sound field may further comprise changing the location of the simulated sound source to maintain the relative position of the simulated source within the coordinate system with respect to at least the user head orientation.
The above example method may further comprise transmitting the audio signal via wireless communication.
In another example embodiment there is provided a system comprising at least a device configured to generate an audio signal, the system being arranged to perform any of the above example methods.
In another example embodiment there is provided a chipset arranged to perform any of the above example methods.
In another example embodiment there is provided at least one machine readable medium comprising a plurality of instructions that, in response to be being executed on a computing device, cause the computing device to carry out any of the above example methods.
In another example embodiment there is provided an apparatus for audio control based on orientation, the apparatus being arranged to perform any of the above example methods.
The terms and expressions which have been employed herein are used as terms of description and not of limitation, and there is no intention, in the use of such terms and expressions, of excluding any equivalents of the features shown and described (or portions thereof), and it is recognized that various modifications are possible within the scope of the claims. Accordingly, the claims are intended to cover all such equivalents.

Claims (30)

What is claimed:
1. A mobile electronic device, comprising:
device orientation detection circuitry configured to determine a device orientation of said mobile electronic device with respect to a coordinate system;
user head orientation detection circuitry configured to determine user head orientation with respect to the mobile electronic device; and
audio adjustment circuitry configured to control the generation of an audio signal based on the device orientation and user head orientation;
wherein:
the mobile electronic device is a mobile communication device or a mobile computing device;
the audio signal is configured to cause the generation of a sound field; and
the audio signal is configured to lock the sound field in a fixed perspective regardless of a change in said device orientation.
2. The mobile electronic device of claim 1, wherein:
the mobile electronic device further comprises image capture circuitry; and
determining the user head orientation with respect to the mobile electronic device comprises capturing an image using the image capture circuitry, detecting a face of a user in the image and determining an orientation for the head of the user based on the detected user face.
3. The mobile electronic device of claim 1, wherein determining the user head orientation with respect to the mobile electronic device comprises detecting an orientation of an apparatus worn by a user and determining the user head orientation based on the orientation of the apparatus.
4. The mobile electronic device of claim 3, wherein:
the mobile electronic device further comprises at least a wireless receiver; and
detecting the worn apparatus orientation comprises receiving position information from the apparatus via wireless communication.
5. The mobile electronic device of claim 1, wherein controlling the generation of the audio signal is based on the relative orientation of the mobile electronic device with respect to the coordinate system or the relative orientation of the user head with respect to the mobile electronic device.
6. The mobile electronic device of claim 1, wherein controlling the generation of the audio signal is based on the relative orientation of the mobile electronic device with respect to the coordinate system and the relative orientation of the user head with respect to the mobile electronic device.
7. The mobile electronic device of claim 1, wherein the audio signal is a surround sound audio signal configured to simulate a three-dimensional (3-D) sound field.
8. The mobile electronic device of claim 7, wherein controlling the generation of the audio signal comprises changing the location of a simulated sound source within the 3-D sound field.
9. The mobile electronic device of claim 8, wherein changing the location of the simulated sound source within the 3-D sound field comprises changing the location of the simulated sound source to maintain the relative position of the simulated source within the coordinate system with respect to at least the user head orientation.
10. The mobile electronic device of claim 1, wherein the mobile electronic device further comprises at least a wireless transmitter configured to transmit the audio signal.
11. A method, comprising:
determining a device orientation of a mobile electronic device with respect to a coordinate system;
determining user head orientation with respect to the mobile electronic device; and
controlling generation of an audio signal based on the device orientation and user head orientation;
wherein:
the mobile electronic device is a mobile communication device or a mobile computing device;
the audio signal is configured to cause the generation of a sound field; and
the audio signal is configured to lock the sound field in a fixed perspective regardless of a change in said device orientation.
12. The method of claim 11, wherein determining the user head orientation with respect to the mobile electronic device comprises capturing an image, detecting a face of a user in the image and determining an orientation for the head of the user based on the detected user face.
13. The method of claim 11, wherein determining the user head orientation with respect to the mobile electronic device comprises detecting an orientation of an apparatus worn by a user and determining the user head orientation based on the orientation of the worn apparatus.
14. The method of claim 13, wherein detecting the worn apparatus orientation comprises receiving position information from the worn apparatus via wireless communication.
15. The method of claim 11, wherein controlling the generation of the audio signal is based on the relative orientation of the mobile electronic device with respect to the coordinate system or the relative orientation of the user head with respect to the mobile electronic device.
16. The method of claim 11, wherein controlling the generation of the audio signal is based on the relative orientation of the mobile electronic device with respect to the coordinate system and the relative orientation of the user head with respect to the mobile electronic device.
17. The method of claim 11, wherein the audio signal is a surround sound audio signal configured to simulate a three-dimensional (3-D) sound field.
18. The method of claim 17, wherein controlling the generation of the audio signal comprises changing the location of a simulated sound source within the 3-D sound field.
19. The method of claim 18, wherein changing the location of the simulated sound source within the 3-D sound field comprises changing the location of the simulated sound source to maintain the relative position of the simulated source within the coordinate system with respect to at least the user head orientation.
20. The method of claim 11, further comprising transmitting the audio signal via wireless communication.
21. A system comprising at least one machine-readable storage medium having stored thereon, individually or in combination, instructions that when executed by one or more processors result in the following operations comprising:
determining a device orientation of a mobile electronic device with respect to a coordinate system;
determining user head orientation with respect to the mobile electronic device; and
controlling generation of an audio signal based on the device orientation and user head orientation;
wherein:
the mobile electronic device is a mobile communication device or a mobile computing device;
the audio signal is configured to cause the generation of a sound field; and
the audio signal is configured to lock the sound field in a fixed perspective regardless of a change in said device orientation.
22. The system of claim 21, wherein determining the user head orientation with respect to the mobile electronic device comprises capturing an image, detecting a face of a user in the image and determining an orientation for the head of the user based on the detected user face.
23. The system of claim 21, wherein determining the user head orientation with respect to the mobile electronic device comprises detecting an orientation of an apparatus worn by a user and determining the user head orientation based on the orientation of the worn apparatus.
24. The system of claim 23, wherein detecting the worn apparatus orientation comprises receiving position information from the worn apparatus via wireless communication.
25. The system of claim 21, wherein controlling the generation of the audio signal is based on the relative orientation of the mobile electronic device with respect to the coordinate system or the relative orientation of the user head with respect to the mobile electronic device.
26. The system of claim 21, wherein controlling the generation of the audio signal is based on the relative orientation of the mobile electronic device with respect to the coordinate system and the relative orientation of the user head with respect to the mobile electronic device.
27. The system of claim 21, wherein the audio signal is a surround sound audio signal configured to simulate a three-dimensional (3-D) sound field.
28. The system of claim 27, wherein controlling the generation of the audio signal comprises changing the location of a simulated sound source within the 3-D sound field.
29. The system of claim 28, wherein changing the location of the simulated sound source within the 3-D sound field comprises changing the location of the simulated sound source to maintain the relative position of the simulated source within the coordinate system with respect to at least the user head orientation.
30. The system of claim 21, further comprising transmitting the audio signal via wireless communication.
US13/997,423 2012-03-29 2012-03-29 Audio control based on orientation Active 2032-10-15 US9271103B2 (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/US2012/031188 WO2013147791A1 (en) 2012-03-29 2012-03-29 Audio control based on orientation

Publications (2)

Publication Number Publication Date
US20140153751A1 US20140153751A1 (en) 2014-06-05
US9271103B2 true US9271103B2 (en) 2016-02-23

Family

ID=49260855

Family Applications (1)

Application Number Title Priority Date Filing Date
US13/997,423 Active 2032-10-15 US9271103B2 (en) 2012-03-29 2012-03-29 Audio control based on orientation

Country Status (3)

Country Link
US (1) US9271103B2 (en)
CN (1) CN104205880B (en)
WO (1) WO2013147791A1 (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150063613A1 (en) * 2013-09-04 2015-03-05 Samsung Electronics Co., Ltd. Method of preventing feedback based on detection of posture and devices for performing the method
US10448189B2 (en) * 2016-09-14 2019-10-15 Magic Leap, Inc. Virtual reality, augmented reality, and mixed reality systems with spatialized audio
WO2019220009A1 (en) 2018-05-18 2019-11-21 Nokia Technologies Oy Methods and apparatuses for implementing a head tracking headset
US10705338B2 (en) 2016-05-02 2020-07-07 Waves Audio Ltd. Head tracking with adaptive reference
US11182930B2 (en) 2016-05-02 2021-11-23 Waves Audio Ltd. Head tracking with adaptive reference
US11330388B2 (en) * 2016-11-18 2022-05-10 Stages Llc Audio source spatialization relative to orientation sensor and output
US11601764B2 (en) 2016-11-18 2023-03-07 Stages Llc Audio analysis and processing system
US11689846B2 (en) 2014-12-05 2023-06-27 Stages Llc Active noise control and customized audio system

Families Citing this family (31)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2013147791A1 (en) 2012-03-29 2013-10-03 Intel Corporation Audio control based on orientation
US20130332156A1 (en) * 2012-06-11 2013-12-12 Apple Inc. Sensor Fusion to Improve Speech/Audio Processing in a Mobile Device
US10203839B2 (en) 2012-12-27 2019-02-12 Avaya Inc. Three-dimensional generalized space
US9892743B2 (en) * 2012-12-27 2018-02-13 Avaya Inc. Security surveillance via three-dimensional audio space presentation
JP2014143470A (en) * 2013-01-22 2014-08-07 Sony Corp Information processing unit, information processing method, and program
US20140227977A1 (en) * 2013-02-12 2014-08-14 Zary Segall Method, node, device, and computer program for interaction
US9769585B1 (en) * 2013-08-30 2017-09-19 Sprint Communications Company L.P. Positioning surround sound for virtual acoustic presence
US20160238408A1 (en) * 2015-02-18 2016-08-18 Plantronics, Inc. Automatic Determination of User Direction Based on Direction Reported by Mobile Device
US10327089B2 (en) 2015-04-14 2019-06-18 Dsp4You Ltd. Positioning an output element within a three-dimensional environment
CN105007553A (en) * 2015-07-23 2015-10-28 惠州Tcl移动通信有限公司 Sound oriented transmission method of mobile terminal and mobile terminal
GB2542609A (en) * 2015-09-25 2017-03-29 Nokia Technologies Oy Differential headtracking apparatus
US10089063B2 (en) * 2016-08-10 2018-10-02 Qualcomm Incorporated Multimedia device for processing spatialized audio based on movement
US10028071B2 (en) 2016-09-23 2018-07-17 Apple Inc. Binaural sound reproduction system having dynamically adjusted audio output
CN107249166A (en) * 2017-06-19 2017-10-13 依偎科技(南昌)有限公司 A kind of earphone stereo realization method and system of complete immersion
TW201914314A (en) * 2017-08-31 2019-04-01 宏碁股份有限公司 Audio processing device and audio processing method thereof
CN109672956A (en) * 2017-10-16 2019-04-23 宏碁股份有限公司 Apparatus for processing audio and its audio-frequency processing method
US10375506B1 (en) 2018-02-28 2019-08-06 Google Llc Spatial audio to enable safe headphone use during exercise and commuting
US11617050B2 (en) 2018-04-04 2023-03-28 Bose Corporation Systems and methods for sound source virtualization
CN112771893A (en) * 2018-11-20 2021-05-07 深圳市欢太科技有限公司 3D sound effect implementation method and device, storage medium and electronic equipment
US11105636B2 (en) * 2019-04-17 2021-08-31 Google Llc Radio enhanced augmented reality and virtual reality with truly wireless earbuds
JP7342451B2 (en) * 2019-06-27 2023-09-12 ヤマハ株式会社 Audio processing device and audio processing method
CN113574849A (en) * 2019-07-29 2021-10-29 苹果公司 Object scanning for subsequent object detection
US11356795B2 (en) 2020-06-17 2022-06-07 Bose Corporation Spatialized audio relative to a peripheral device
US11675423B2 (en) 2020-06-19 2023-06-13 Apple Inc. User posture change detection for head pose tracking in spatial audio applications
US11586280B2 (en) * 2020-06-19 2023-02-21 Apple Inc. Head motion prediction for spatial audio applications
US12069469B2 (en) 2020-06-20 2024-08-20 Apple Inc. Head dimension estimation for spatial audio applications
US11589183B2 (en) * 2020-06-20 2023-02-21 Apple Inc. Inertially stable virtual auditory space for spatial audio applications
US11647352B2 (en) 2020-06-20 2023-05-09 Apple Inc. Head to headset rotation transform estimation for head pose tracking in spatial audio applications
US12108237B2 (en) 2020-06-20 2024-10-01 Apple Inc. Head tracking correlated motion detection for spatial audio applications
US11982738B2 (en) 2020-09-16 2024-05-14 Bose Corporation Methods and systems for determining position and orientation of a device using acoustic beacons
US11582573B2 (en) 2020-09-25 2023-02-14 Apple Inc. Disabling/re-enabling head tracking for distracted user of spatial audio application

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100322428A1 (en) 2009-06-23 2010-12-23 Sony Corporation Audio signal processing device and audio signal processing method
US20110026745A1 (en) * 2009-07-31 2011-02-03 Amir Said Distributed signal processing of immersive three-dimensional sound for audio conferences
US20110191674A1 (en) 2004-08-06 2011-08-04 Sensable Technologies, Inc. Virtual musical interface in a haptic virtual environment
US20120062729A1 (en) * 2010-09-10 2012-03-15 Amazon Technologies, Inc. Relative position-inclusive device interfaces
US20120093320A1 (en) 2010-10-13 2012-04-19 Microsoft Corporation System and method for high-precision 3-dimensional audio for augmented reality
WO2013147791A1 (en) 2012-03-29 2013-10-03 Intel Corporation Audio control based on orientation

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3994296B2 (en) * 1998-01-19 2007-10-17 ソニー株式会社 Audio playback device
JP3982431B2 (en) * 2002-08-27 2007-09-26 ヤマハ株式会社 Sound data distribution system and sound data distribution apparatus
AU2005282680A1 (en) * 2004-09-03 2006-03-16 Parker Tsuhako Method and apparatus for producing a phantom three-dimensional sound space with recorded sound
JP4735993B2 (en) * 2008-08-26 2011-07-27 ソニー株式会社 Audio processing apparatus, sound image localization position adjusting method, video processing apparatus, and video processing method
JP4849121B2 (en) * 2008-12-16 2012-01-11 ソニー株式会社 Information processing system and information processing method
KR20130122516A (en) * 2010-04-26 2013-11-07 캠브리지 메카트로닉스 리미티드 Loudspeakers with position tracking

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110191674A1 (en) 2004-08-06 2011-08-04 Sensable Technologies, Inc. Virtual musical interface in a haptic virtual environment
US20100322428A1 (en) 2009-06-23 2010-12-23 Sony Corporation Audio signal processing device and audio signal processing method
US20110026745A1 (en) * 2009-07-31 2011-02-03 Amir Said Distributed signal processing of immersive three-dimensional sound for audio conferences
US20120062729A1 (en) * 2010-09-10 2012-03-15 Amazon Technologies, Inc. Relative position-inclusive device interfaces
US20120093320A1 (en) 2010-10-13 2012-04-19 Microsoft Corporation System and method for high-precision 3-dimensional audio for augmented reality
WO2013147791A1 (en) 2012-03-29 2013-10-03 Intel Corporation Audio control based on orientation

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
International Preliminary Report on Patentability and Written Opinion received for PCT Patent Application No. PCT/US2012/031188, mailed on Oct. 9, 2014, 6 pages.
International Search Report and Written Opinion received for PCT application No. No. PCT/US2012/031188, mailed on Nov. 20, 2012, 9 Pages.

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150063613A1 (en) * 2013-09-04 2015-03-05 Samsung Electronics Co., Ltd. Method of preventing feedback based on detection of posture and devices for performing the method
US9635476B2 (en) * 2013-09-04 2017-04-25 Samsung Electronics Co., Ltd. Method of preventing feedback based on detection of posture and devices for performing the method
US11689846B2 (en) 2014-12-05 2023-06-27 Stages Llc Active noise control and customized audio system
US10705338B2 (en) 2016-05-02 2020-07-07 Waves Audio Ltd. Head tracking with adaptive reference
US11182930B2 (en) 2016-05-02 2021-11-23 Waves Audio Ltd. Head tracking with adaptive reference
US10448189B2 (en) * 2016-09-14 2019-10-15 Magic Leap, Inc. Virtual reality, augmented reality, and mixed reality systems with spatialized audio
US11310618B2 (en) 2016-09-14 2022-04-19 Magic Leap, Inc. Virtual reality, augmented reality, and mixed reality systems with spatialized audio
US11330388B2 (en) * 2016-11-18 2022-05-10 Stages Llc Audio source spatialization relative to orientation sensor and output
US11601764B2 (en) 2016-11-18 2023-03-07 Stages Llc Audio analysis and processing system
WO2019220009A1 (en) 2018-05-18 2019-11-21 Nokia Technologies Oy Methods and apparatuses for implementing a head tracking headset
EP3794846A4 (en) * 2018-05-18 2022-03-09 Nokia Technologies Oy Methods and apparatuses for implementing a head tracking headset

Also Published As

Publication number Publication date
CN104205880A (en) 2014-12-10
WO2013147791A1 (en) 2013-10-03
CN104205880B (en) 2019-06-11
US20140153751A1 (en) 2014-06-05

Similar Documents

Publication Publication Date Title
US9271103B2 (en) Audio control based on orientation
CN112425187B (en) Apparatus, method, and storage medium for spatial audio rendering
EP3227788B1 (en) Master device for using connection attribute of electronic accessories connections to facilitate locating an accessory
US20210153629A1 (en) Method for processing images, electronic device, and storage medium
US11647352B2 (en) Head to headset rotation transform estimation for head pose tracking in spatial audio applications
US9113246B2 (en) Automated left-right headphone earpiece identifier
CN110618805B (en) Method and device for adjusting electric quantity of equipment, electronic equipment and medium
US11309947B2 (en) Systems and methods for maintaining directional wireless links of motile devices
US11582573B2 (en) Disabling/re-enabling head tracking for distracted user of spatial audio application
US11520550B1 (en) Electronic system for producing a coordinated output using wireless localization of multiple portable electronic devices
US9832587B1 (en) Assisted near-distance communication using binaural cues
US10530818B2 (en) Server-based sound mixing for multiuser voice chat system
US20230024547A1 (en) Terminal for controlling wireless sound device, and method therefor
CN116601514A (en) Method and system for determining a position and orientation of a device using acoustic beacons
US20240176414A1 (en) Systems and methods for labeling and prioritization of sensory events in sensory environments
WO2019128430A1 (en) Method, apparatus and device for determining bandwidth, and storage medium
WO2024134736A1 (en) Head-mounted display device and stereophonic sound control method
WO2024040527A1 (en) Spatial audio using a single audio device
CN116743913B (en) Audio processing method and device
WO2021112161A1 (en) Information processing device, control method, and non-transitory computer-readable medium
CN116048241A (en) Prompting method, augmented reality device and medium

Legal Events

Date Code Title Description
AS Assignment

Owner name: INTEL CORPORATION, CALIFORNIA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:WELLS, KEVIN C.;REEL/FRAME:032276/0927

Effective date: 20130909

FEPP Fee payment procedure

Free format text: PAYOR NUMBER ASSIGNED (ORIGINAL EVENT CODE: ASPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

STCF Information on status: patent grant

Free format text: PATENTED CASE

MAFP Maintenance fee payment

Free format text: PAYMENT OF MAINTENANCE FEE, 4TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1551); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

Year of fee payment: 4

MAFP Maintenance fee payment

Free format text: PAYMENT OF MAINTENANCE FEE, 8TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1552); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

Year of fee payment: 8