WO2024154416A1 - Voice output device, voice output method, and program - Google Patents

Info

Publication number
WO2024154416A1
Authority
WO
WIPO (PCT)
Prior art keywords
user
audio output
state
output device
audio
Application number
PCT/JP2023/040242
Other languages
French (fr)
Japanese (ja)
Inventor
領平 須永
Original Assignee
株式会社Jvcケンウッド
Application filed by 株式会社Jvcケンウッド
Publication of WO2024154416A1

Classifications

    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10K: SOUND-PRODUCING DEVICES; METHODS OR DEVICES FOR PROTECTING AGAINST, OR FOR DAMPING, NOISE OR OTHER ACOUSTIC WAVES IN GENERAL; ACOUSTICS NOT OTHERWISE PROVIDED FOR
    • G10K11/00: Methods or devices for transmitting, conducting or directing sound in general; Methods or devices for protecting against, or for damping, noise or other acoustic waves in general
    • G10K11/16: Methods or devices for protecting against, or for damping, noise or other acoustic waves in general
    • G10K11/175: Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound
    • G10K11/178: Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound by electro-acoustically regenerating the original acoustic waves in anti-phase
    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04R: LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R3/00: Circuits for transducers, loudspeakers or microphones

Definitions

  • This disclosure relates to an audio output device, an audio output method, and a program.
  • Voice calls using wireless communication are used in a wide range of situations as a means of communication between users. For example, when running, trekking, cycling, etc., multiple people may use devices that make voice calls via wireless communication to communicate with each other.
  • Patent Document 1 discloses earphones that take in outside sounds when spoken to.
  • The present disclosure aims to provide an audio output device, an audio output method, and a program that enable a user to properly understand the surrounding situation.
  • The audio output device of the present disclosure includes a detection unit that detects that the user is in a state of high physical load, an audio output unit that outputs audio to the user, and an audio output control unit that, when the detection unit detects that the physical load of the user is high, puts the audio output unit into a state in which the user can easily hear surrounding sounds.
  • According to the present disclosure, it is possible to provide an audio output device, an audio output method, and a program that enable a user to properly understand the surrounding situation.
  • FIG. 1 is a diagram illustrating an example of the configuration of a first embodiment of an audio output device according to the present disclosure.
  • FIG. 2 is a flowchart showing a flow of a first aspect of the processing of the audio output device according to the present disclosure.
  • FIG. 3 is a flowchart showing a flow of a second aspect of the processing of the audio output device according to the present disclosure.
  • FIG. 4 is a diagram illustrating an example of the configuration of a second embodiment of an audio output device according to the present disclosure.
  • First Embodiment: FIG. 1 is a diagram showing a configuration example of the first embodiment of the audio output device according to the present disclosure.
  • The audio output device 100 includes a microphone 110, an audio output unit 120, a control unit 130, an audio input unit 140, and a sensor 150.
  • The audio output device 100 may also have a storage unit for storing various information. These components are described in order below.
  • The audio output device 100 is a device that allows a user to listen to audio content while walking, running, or the like.
  • For example, the audio output device 100 is configured from a portable information terminal, such as a smartphone or an audio player, and headphones or earphones.
  • The audio output device 100 may also be configured from a helmet-integrated headset with a communication function, a portable information terminal such as a smartphone or an audio player, and a neck speaker.
  • The audio output device 100 may be a single device, or may consist of one device containing the control unit 130, audio input unit 140, and sensor 150 and another device containing the microphone 110 and audio output unit 120, connected by wire or wirelessly.
  • The microphone 110 picks up various sounds.
  • The microphone 110 includes a microphone L111 and a microphone R112.
  • Microphone L111 is provided on the user's left side. It picks up environmental sounds around the user of the audio output device 100, for example for external sound capture or noise cancellation.
  • Microphone R112 is provided on the user's right side. Its function is the same as that of microphone L111, so its description is omitted.
  • When the audio output device 100 is attached to a helmet worn by the user, the microphone 110 is arranged on the outside of the helmet, for example on the left or right side.
  • When the audio output unit 120 is configured as earphones, headphones, a neck speaker, or the like, the microphone 110 is arranged near the audio output unit 120 and serves as a so-called external sound capture microphone or noise cancellation microphone.
  • The audio output unit 120 outputs audio to the user.
  • The audio output unit 120 includes an audio output unit L121 and an audio output unit R122.
  • The audio output unit L121 is an audio output device, such as a speaker, provided on the user's left side.
  • The audio output unit L121 outputs various sounds to the user.
  • For example, the audio output unit L121 outputs the voice of a user of another audio output device 100.
  • The audio output unit R122 is an audio output device, such as a speaker, provided on the user's right side.
  • The function of the audio output unit R122 is the same as that of the audio output unit L121, so its description is omitted.
  • When the audio output unit 120 is attached to a helmet worn by the user, it is positioned so as not to block the user's ears while the helmet is worn.
  • When the audio output unit 120 is configured as earphones, headphones, a neck speaker, or the like, it is the speaker or sound-producing element of that device.
  • The audio input unit 140 receives audio content from a storage unit (not shown) that stores audio content or from a communication unit (not shown) that acquires audio content, and outputs the received audio content to the audio output unit 120.
  • The sensor 150 is a sensor or device that detects or measures the state of the audio output device 100 or the state of the user using it.
  • The sensor 150 may be a gyro sensor or an acceleration sensor.
  • The gyro sensor may be, for example, a capacitive MEMS (Micro Electro Mechanical Systems) gyro sensor. In such a sensor, a movable electrode is driven in a primary vibration along one direction. When rotation is applied to the movable electrode, the Coriolis force acts at 90° to the vibration direction and produces a secondary vibration, and the sensor detects the resulting change in capacitance.
  • The angular velocity can be determined from the change in capacitance and the vibration phase of the movable electrode.
  • The acceleration sensor may be, for example, a capacitive acceleration sensor in which a movable electrode and a fixed electrode are formed using MEMS, and acceleration is measured from the relationship between the acceleration and the change in capacitance caused by the movement of the movable electrode.
  • The sensor 150 may be a device worn by the user, such as a smart watch.
  • The functions of a device worn by the user may replace the sensor 150, or may complement the sensor 150 provided in the audio output device 100.
  • In that case, the user's activity level, heart rate, blood oxygen concentration, and the like can be obtained, so the user's biological information can be obtained more appropriately.
  • The gyro sensor and acceleration sensor may be provided in the audio output device 100 or in a device worn by the user.
  • The control unit 130 is a controller that manages and controls the audio output device 100.
  • The control unit 130 is realized by a processor, such as a CPU (Central Processing Unit) or MPU (Micro Processing Unit), that executes various programs using RAM as a working area.
  • The control unit 130 may also be realized by an integrated circuit such as an ASIC (Application Specific Integrated Circuit) or FPGA (Field Programmable Gate Array).
  • The control unit 130 includes an audio processing unit 131 and a detection unit 132 as functional blocks realized by program execution, circuit configuration, and the like.
  • The control unit 130 may execute these processes on a single CPU, or may include multiple CPUs and execute the processes in parallel. Each of these components is described in order below.
  • The audio processing unit 131 performs various processes on the audio input to the audio output device 100 and the audio output by it.
  • The audio processing unit 131 includes, as functional blocks, an audio acquisition unit 1311, an ambient sound acquisition unit 1312, and an audio output control unit 1313.
  • The audio acquisition unit 1311 acquires the audio input from the audio input unit 140.
  • The audio acquisition unit 1311 performs a decoding process corresponding to the codec of the audio input from the audio input unit 140.
  • The ambient sound acquisition unit 1312 acquires audio around the user from microphone L111 and microphone R112.
  • The ambient sound acquisition unit 1312 may apply any filtering process to the audio acquired from the microphone 110.
  • For example, the ambient sound acquisition unit 1312 may use a frequency selection filter.
  • A frequency selection filter extracts audio in a frequency band corresponding to a frequency selected from the frequency distribution of the audio data.
  • A band-pass filter, band-stop filter, high-pass filter, or low-pass filter may be used as the frequency selection filter.
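  • The filter types listed above can be sketched with simple one-pole filters. This is only an illustration under stated assumptions: the disclosure does not specify a filter design, and the function names, cutoff frequencies, and sample rate below are hypothetical.

```python
import math

def low_pass(samples, cutoff_hz, sample_rate_hz):
    """One-pole low-pass filter (illustrative design, not from the disclosure)."""
    dt = 1.0 / sample_rate_hz
    rc = 1.0 / (2.0 * math.pi * cutoff_hz)
    alpha = dt / (rc + dt)          # smoothing factor in (0, 1)
    out, y = [], 0.0
    for x in samples:
        y += alpha * (x - y)        # move the output toward the input
        out.append(y)
    return out

def high_pass(samples, cutoff_hz, sample_rate_hz):
    """High-pass obtained as the input minus its low-passed version."""
    low = low_pass(samples, cutoff_hz, sample_rate_hz)
    return [x - l for x, l in zip(samples, low)]

def band_pass(samples, low_cut_hz, high_cut_hz, sample_rate_hz):
    """Band-pass: remove content below low_cut_hz, then above high_cut_hz."""
    return low_pass(high_pass(samples, low_cut_hz, sample_rate_hz),
                    high_cut_hz, sample_rate_hz)
```

  • With these definitions, a constant (0 Hz) input passes through `low_pass` almost unchanged and is removed by `high_pass` and `band_pass`, which is the behavior expected of a filter that keeps only a selected band.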
  • The audio output control unit 1313 controls the output of audio to the audio output unit 120. Specifically, it controls the output, to the audio output unit 120, of the audio acquired by the audio acquisition unit 1311 and the ambient sound acquired by the ambient sound acquisition unit 1312. For both, it switches the output on and off and adjusts the output volume. It also performs noise cancellation processing on the audio acquired by the audio acquisition unit 1311 using the ambient sound acquired by the ambient sound acquisition unit 1312.
  • The audio output control unit 1313 puts the audio output unit 120 into a state in which the user can easily hear surrounding sounds. Specifically, it controls the audio output unit 120 to output the ambient sound acquired by the ambient sound acquisition unit 1312 so that the user can hear it.
  • For example, the audio output control unit 1313 outputs ambient sound to the audio output unit 120.
  • The audio output control unit 1313 either mixes the ambient sound with the audio content and outputs the result, or lowers the volume of the audio content before mixing in the ambient sound.
  • The audio output control unit 1313 outputs the ambient sound to the audio output unit 120 at a relatively high volume.
  • Because the user can also hear the ambient sound directly, a relatively high volume here means a volume higher than that at which the user hears the ambient sound directly.
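  • The mixing described above can be sketched as follows. This is a minimal illustration, assuming audio samples normalized to the range -1.0 to 1.0; the function name and the gain values are hypothetical and are not taken from the disclosure.

```python
def mix_with_ambient(content, ambient, content_gain=0.3, ambient_gain=1.5):
    """Lower the content volume, boost the ambient sound, and mix the two.

    Both inputs are equal-length lists of samples in [-1.0, 1.0].
    The gains are illustrative placeholders.
    """
    mixed = []
    for c, a in zip(content, ambient):
        s = content_gain * c + ambient_gain * a
        mixed.append(max(-1.0, min(1.0, s)))  # clip to the valid sample range
    return mixed
```

  • An ambient gain above 1.0 corresponds to outputting the ambient sound at a relatively high volume, while a content gain below 1.0 corresponds to lowering the volume of the audio content.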
  • The audio output control unit 1313 reduces or stops the noise cancellation effect, which is based on the audio around the user acquired by the ambient sound acquisition unit 1312, for the audio output to the user from the audio output unit 120.
  • Noise cancellation may be achieved by outputting from the audio output unit 120 a sound in opposite phase to the ambient audio acquired by the ambient sound acquisition unit 1312, thereby canceling the sound regarded as noise.
  • The following describes examples of how the audio output control unit 1313 reduces the noise cancellation effect at the audio output unit 120.
  • The audio output control unit 1313 reduces the noise cancellation effect or stops the noise cancellation processing.
  • The audio output control unit 1313 may also lower or mute the volume of the audio content when it reduces the noise cancellation effect or stops the noise cancellation processing.
  • To stop noise cancellation, the audio output control unit 1313 stops outputting the inverse-phase sound of the ambient sound acquired by the ambient sound acquisition unit 1312 to the audio output unit 120.
  • To reduce the effect, the audio output control unit 1313 lowers the output level of the inverse-phase sound of the ambient sound acquired by the ambient sound acquisition unit 1312 before outputting it to the audio output unit 120.
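  • The effect of lowering or stopping the inverse-phase output can be illustrated with an idealized model in which the anti-phase signal adds directly to the ambient sound at the ear. Real noise cancellation must also account for the acoustic path and latency, which this sketch ignores; the `strength` parameter and function names are hypothetical.

```python
def anti_phase(ambient, strength=1.0):
    """Inverse-phase signal; strength 1.0 is full level, 0.0 is stopped."""
    return [-strength * a for a in ambient]

def residual_at_ear(ambient, strength):
    """Idealized sum of the ambient sound and the anti-phase output."""
    return [a + n for a, n in zip(ambient, anti_phase(ambient, strength))]
```

  • In this model, a strength of 1.0 cancels the ambient sound completely, a strength of 0.0 leaves it untouched, and intermediate values let part of the ambient sound through.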
  • The detection unit 132 detects that the user is in a state of high physical load.
  • The detection unit 132 makes this detection based on information indicating the results of detection or measurement by the sensor 150.
  • In addition to information acquired from the sensor 150, the detection unit 132 may use current position information based on signals from positioning satellites received by a GNSS (Global Navigation Satellite System) receiving unit (not shown), map information or topographical information acquired via a communication unit (not shown), or information provided by the audio output device.
  • The detection unit 132 detects that the user's physical load is high based on the user's operational load. Specifically, it treats a state in which the user's operational load is high as a state in which the user's physical load is high.
  • A state of high operational load detected by the detection unit 132 is, for example, a state in which the user is moving uphill.
  • The detection unit 132 detects that the user is moving uphill as a state in which the user's physical load is high.
  • Moving uphill here covers any state in which the user's operational load is high, such as walking uphill, running uphill, or pedaling a bicycle uphill; the method of movement does not matter.
  • An uphill slope here is a slope whose gradient is at or above a predetermined level that increases the user's operational load, and it includes stairs. Such a state can also be described as an environment in which the user's operational load is high.
  • The detection unit 132 detects that the user is going uphill from, for example, information from the gyro sensor, acceleration sensor, or the like of the sensor 150.
  • For example, the detection unit 132 acquires information on the user's inclination and detects that the user is going uphill from the average value of the inclination.
  • The detection unit 132 also detects that the user is walking, running, or pedaling a bicycle from continuous vibrations and changes in inclination, based on information from the gyro sensor, acceleration sensor, or the like of the sensor 150. For example, the detection unit 132 detects that the user is going uphill when the direction of gravitational acceleration indicated by these sensors changes to point backward relative to the user's traveling direction.
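  • The inclination-based detection can be sketched as follows. This is a minimal illustration, assuming each accelerometer sample is a pair (ax, az) in a device frame where x points in the travel direction and z points up, so that level ground reads roughly (0, 9.81). The axis convention, function names, and the 5-degree threshold are assumptions, not values from the disclosure.

```python
import math

def mean_pitch_deg(accel_samples):
    """Average pitch angle in degrees from (ax, az) accelerometer samples."""
    pitches = [math.degrees(math.atan2(ax, az)) for ax, az in accel_samples]
    return sum(pitches) / len(pitches)

def is_going_uphill(accel_samples, threshold_deg=5.0):
    """True when the average inclination is at or above the threshold."""
    if not accel_samples:
        return False  # no data, so no uphill detection
    return mean_pitch_deg(accel_samples) >= threshold_deg
```

  • Averaging over a window of samples, rather than using a single reading, reduces the influence of the step-by-step vibrations that walking or running adds to the signal.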
  • The detection unit 132 calculates the moving speed from the distance moved per unit time in the current position information, which is based on signals from positioning satellites received by the GNSS receiving unit. Combined with continuous vibrations reported by the gyro sensor and acceleration sensor of the sensor 150, this lets it determine whether the user is walking, running, or pedaling a bicycle uphill, and thereby detect whether the user's operational load is high.
  • The detection unit 132 may also detect that the user's operational load is high by detecting that the user is moving uphill, based on current position information from signals received by the GNSS receiving unit together with map information and topographical information.
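  • A map-based uphill check can be sketched by comparing elevations at two positions. This is only an illustration: the disclosure does not say how topographical information is used, and the function names and the 5 percent threshold are hypothetical.

```python
def gradient_percent(elev_start_m, elev_end_m, horizontal_distance_m):
    """Slope gradient in percent between two points on the route."""
    if horizontal_distance_m <= 0:
        raise ValueError("horizontal distance must be positive")
    return 100.0 * (elev_end_m - elev_start_m) / horizontal_distance_m

def is_uphill_from_map(elev_start_m, elev_end_m, horizontal_distance_m,
                       threshold_percent=5.0):
    """True when the gradient is at or above the predetermined level."""
    return gradient_percent(elev_start_m, elev_end_m,
                            horizontal_distance_m) >= threshold_percent
```

  • For example, climbing from 100 m to 108 m over 100 m of horizontal distance gives a gradient of 8 percent, which exceeds the illustrative threshold.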
  • The detection unit 132 detects, for example, that the user is moving at a predetermined speed or faster as a state in which the user's physical load is high.
  • The detection unit 132 detects that the person holding the audio output device 100 is running from information from the gyro sensor, acceleration sensor, or the like of the sensor 150. In other words, it detects that the user is running (that is, moving at walking speed or faster) as a state in which the user's operational load is high.
  • The detection unit 132 may also detect that the user is moving at a predetermined speed or faster from the distance traveled per unit time in the current position information, which is based on signals from positioning satellites received by the GNSS receiving unit.
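  • The speed calculation from position fixes can be sketched with the haversine great-circle distance. This is a minimal illustration; the fix format (latitude in degrees, longitude in degrees, time in seconds), the function names, and the 2.5 m/s threshold are assumptions, not values from the disclosure.

```python
import math

EARTH_RADIUS_M = 6_371_000.0  # mean Earth radius

def haversine_m(lat1, lon1, lat2, lon2):
    """Great-circle distance in meters between two points given in degrees."""
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dphi = p2 - p1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dlmb / 2) ** 2
    return 2 * EARTH_RADIUS_M * math.asin(math.sqrt(a))

def speed_mps(fix_a, fix_b):
    """Speed between two fixes, each (lat_deg, lon_deg, t_seconds)."""
    dt = fix_b[2] - fix_a[2]
    if dt <= 0:
        raise ValueError("fixes must be in increasing time order")
    return haversine_m(fix_a[0], fix_a[1], fix_b[0], fix_b[1]) / dt

def is_moving_fast(fix_a, fix_b, threshold_mps=2.5):
    """True when the user moves at the predetermined speed or faster."""
    return speed_mps(fix_a, fix_b) >= threshold_mps
```

  • In practice, consecutive fixes are noisy, so a real implementation would likely smooth the speed over several fixes before comparing it with the threshold.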
  • The detection unit 132 detects that the user is in a state of high physical load based on the user's fatigue state. Specifically, it treats the user being fatigued as a state of high physical load.
  • Being fatigued here means that the user is physically fatigued.
  • The detection unit 132 detects that the user is fatigued based on, for example, the user's heart rate variability acquired from the sensor 150.
  • For example, the detection unit 132 stores the average value of the user's heart rate variability, derived from the heart rate detected by the sensor 150, and detects that the user is fatigued when the heart rate variability falls below the normal range.
  • The detection unit 132 also detects that the user is in a state of high physical load based on the user's stress level. Specifically, it treats a high stress level as a state of high physical load.
  • The detection unit 132 detects the user's stress level based on, for example, the user's heart rate variability acquired from the sensor 150.
  • For example, the detection unit 132 stores the average value of the user's heart rate variability, derived from the heart rate detected by the sensor 150, detects stress when the heart rate variability falls below the normal range, and judges that the stress level is high from the degree of deviation from the average value.
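  • The comparison of heart rate variability against a stored baseline can be sketched as follows. The disclosure does not name a specific metric; this illustration swaps in RMSSD (root mean square of successive differences between heartbeat intervals), a commonly used measure. The function names and the tolerance factor are hypothetical.

```python
import math

def rmssd_ms(rr_intervals_ms):
    """RMSSD of successive R-R intervals, in milliseconds."""
    diffs = [b - a for a, b in zip(rr_intervals_ms, rr_intervals_ms[1:])]
    if not diffs:
        raise ValueError("need at least two R-R intervals")
    return math.sqrt(sum(d * d for d in diffs) / len(diffs))

def is_hrv_below_normal(current_rr_ms, baseline_rmssd_ms, tolerance=0.7):
    """True when current variability falls below a fraction of the baseline.

    baseline_rmssd_ms stands in for the stored average value; the
    tolerance of 0.7 is an illustrative placeholder.
    """
    return rmssd_ms(current_rr_ms) < tolerance * baseline_rmssd_ms
```

  • The same comparison could serve both the fatigue check and the stress check, with the degree of deviation from the baseline used to grade the stress level.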
  • The detection unit 132 can detect a state in which the user's physical load is high using a variety of sensing methods and techniques beyond those described above, and these methods can also be combined.
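  • One simple way to combine the individual detections is a logical OR over their results. This is only a sketch of one possible combination; the disclosure does not specify how methods are combined, and the class and field names are hypothetical.

```python
from dataclasses import dataclass

@dataclass
class LoadSignals:
    """Results of the individual detections (all names are illustrative)."""
    going_uphill: bool = False
    moving_fast: bool = False
    fatigued: bool = False
    stressed: bool = False

def is_physical_load_high(signals):
    """High physical load if any individual detection fired."""
    return any((signals.going_uphill, signals.moving_fast,
                signals.fatigued, signals.stressed))
```

  • Other combinations, such as requiring two signals at once or weighting them, would fit the same interface.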
  • FIG. 2 is a flowchart showing a flow of the first aspect of the processing of the audio output device according to the present disclosure.
  • FIG. 2 shows an example of an audio output method executed by the audio output device according to the present disclosure, and is also an example of processing based on a program executed by the control unit 130.
  • The first aspect of the processing of the audio output device 100 according to the present disclosure is described below along the flow shown in FIG. 2.
  • The process shown in FIG. 2 starts when the audio output device 100 is powered on, or when an application that executes the process is started on the audio output device 100.
  • The process may also start when the user puts on a helmet, earphones, headphones, neck speaker, or the like equipped with the audio output unit 120.
  • The process may also start at any time in response to a user operation on the audio output device 100 while it is in use.
  • During the process shown in FIG. 2, the audio acquired by the audio acquisition unit 1311 may or may not be output to the audio output unit 120.
  • The audio output device 100 detects whether or not the physical load of its user is high (step S101). Specifically, the detection unit 132 performs this detection. A state of high physical load is detected based on the user's operational load being high, the user being fatigued, the user's stress level being high, and the like.
  • If it is detected in step S101 that the user's physical load is high (step S101: Yes), the audio output device 100 starts outputting external audio (step S102). Specifically, the audio output device 100 outputs the audio acquired by the ambient sound acquisition unit 1312 to the audio output unit 120. If the audio output device 100 is a device for voice calls with other audio output devices and a voice call is in progress, the audio acquired by the ambient sound acquisition unit 1312 is output in addition to the audio of the voice call. If the audio output device 100 is a device for listening to audio content and audio content is being output, the audio acquired by the ambient sound acquisition unit 1312 is output in addition to the audio content.
  • In other words, both the audio acquired by the ambient sound acquisition unit 1312 and the audio input from the audio input unit 140 are output from the audio output unit 120.
  • The audio output device 100 determines whether the process shown in FIG. 2 has ended (step S103).
  • The process is determined to have ended when the audio output device 100 is powered off, or when the application that executes the process has ended on the audio output device 100.
  • The process may also be determined to have ended when the user removes the helmet, earphones, headphones, neck speaker, or the like equipped with the audio output unit 120.
  • The process may also be determined to have ended when the user performs an end operation on the audio output device 100 at any time while it is in use.
  • If it is determined in step S103 that the processing has not ended (step S103: No), the audio output device 100 determines whether or not external audio is being output (step S104). If it is determined that external audio is being output (step S104: Yes), the audio output device 100 determines whether or not the state of high physical load continues (step S105). If step S104 is reached via No in step S101, step S102 has not been performed, so external audio is not being output; if it is reached via Yes in step S101, step S102 has been performed, so external audio is being output.
  • If it is determined in step S105 that the state of high physical load continues (step S105: Yes), the determination in step S105 is executed again. While the determination in step S105 remains Yes, a determination as to whether the processing has ended, as in step S103, may also be made. If it is determined in step S105 that the state of high physical load does not continue (step S105: No), the audio output device 100 stops outputting external audio (step S106).
  • After step S106, the audio output device 100 determines whether the processing shown in FIG. 2 has ended (step S107), as in step S103. If it is determined that the processing has ended (step S107: Yes), the audio output device 100 ends the processing shown in FIG. 2.
  • If it is not detected in step S101 that the physical load is high (step S101: No), the audio output device 100 proceeds to step S103 and executes the processes from step S103 onward.
  • If it is determined in step S103 that the processing has ended (step S103: Yes), the audio output device 100 ends the processing shown in FIG. 2.
  • If it is determined in step S104 that external audio is not being output (step S104: No), the audio output device 100 proceeds to step S107 and executes the processes from step S107 onward.
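  • The flow of steps S101 to S107 can be sketched as a loop driven by callbacks. This is an illustration, not the disclosed implementation: the four callback names are hypothetical, and returning to step S101 after a No in step S107 is an assumption, since the text above does not state where that branch leads.

```python
def run_external_audio_flow(is_load_high, has_ended,
                            start_external, stop_external):
    """Simulate the FIG. 2 flow with caller-supplied callbacks."""
    while True:
        external_on = False
        if is_load_high():              # S101: is the physical load high?
            start_external()            # S102: start outputting external audio
            external_on = True
        if has_ended():                 # S103: has the process ended?
            return
        if external_on:                 # S104: is external audio being output?
            while is_load_high():       # S105: does the high load continue?
                if has_ended():         # optional end check while S105 is Yes
                    return
            stop_external()             # S106: stop outputting external audio
        if has_ended():                 # S107: has the process ended?
            return
        # S107: No -> assumed here to return to S101
```

  • A real device would wait between polls instead of looping as fast as possible; the sketch omits timing to keep the control flow visible.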
  • FIG. 3 is a flowchart showing a flow of the second aspect of the processing of the audio output device according to the present disclosure.
  • FIG. 3 shows an example of an audio output method executed by the audio output device according to the present disclosure, and is also an example of processing based on a program executed by the control unit 130.
  • The second aspect of the processing of the audio output device 100 according to the present disclosure is described below along the flow shown in FIG. 3.
  • The process in FIG. 3 applies when the audio output unit 120 of the audio output device 100 is an earphone or headphone equipped with a noise cancellation function.
  • The process in FIG. 3 also assumes that the audio output device 100 performs noise cancellation processing on the audio content acquired from the audio input unit 140, based on the environmental sound acquired by the microphone 110.
  • Steps S201, S203, S205, and S207 shown in FIG. 3 are the same as steps S101, S103, S105, and S107 shown in FIG. 2, so their explanations are omitted.
  • If it is detected in step S201 that the user's physical load is high (step S201: Yes), the audio output device 100 limits noise canceling (N/C) (step S202). Specifically, the audio output device 100 reduces or stops the noise canceling effect, which is based on the sound acquired by the ambient sound acquisition unit 1312, for the sound output to the user.
  • In step S203, the audio output device 100 determines whether the processing shown in FIG. 3 has ended. If it is determined in step S203 that the processing has not ended (step S203: No), the audio output device 100 determines whether the noise canceling restriction state continues (step S204). If it is determined that the noise canceling restriction state continues (step S204: Yes), the audio output device 100 determines whether the state of high physical load continues (step S205). If step S204 is reached via No in step S201, step S202 has not been performed, so the noise canceling restriction state is not continuing; if it is reached via Yes in step S201, step S202 has been performed, so the noise canceling restriction state is continuing.
  • If it is determined in step S205 that the state of high physical load is not continuing (step S205: No), the audio output device 100 removes the restriction on noise canceling (step S206).
  • If it is determined in step S204 that the noise canceling restriction state is not continuing (step S204: No), the audio output device 100 proceeds to step S207 and executes the processes from step S207 onward.
  • With this processing, the noise cancellation effect can be reduced or stopped when the user's physical load is high. This makes it easier for the user to hear surrounding sounds. It is therefore possible to provide an audio output device 100 that allows the user to properly grasp the surrounding situation.
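  • The limiting and restoring of noise canceling in steps S202 and S206 can be sketched as a small state holder that is polled with the latest detection result. This is a simplified restatement rather than the disclosed flow; the class name, the `update` method, and the strength values are hypothetical.

```python
class NoiseCancelController:
    """Tracks whether noise canceling is limited (illustrative sketch)."""

    def __init__(self, normal_strength=1.0, limited_strength=0.0):
        self.normal_strength = normal_strength
        self.limited_strength = limited_strength  # 0.0 means stopped
        self.strength = normal_strength
        self.limited = False

    def update(self, load_high):
        """Apply one detection result and return the current N/C strength."""
        if load_high and not self.limited:
            # S201: Yes -> S202: limit noise canceling
            self.strength = self.limited_strength
            self.limited = True
        elif not load_high and self.limited:
            # S204: Yes and S205: No -> S206: remove the restriction
            self.strength = self.normal_strength
            self.limited = False
        return self.strength
```

  • Setting `limited_strength` to a value between 0.0 and 1.0 models reducing the effect, while 0.0 models stopping it.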
  • FIG. 4 is a diagram showing a configuration example of the second embodiment of the audio output device according to the present disclosure.
  • In the audio output device 100 according to the second embodiment, the audio input unit 140 of the first embodiment is replaced by a communication control unit 160 and a wireless communication module 170.
  • In addition, a call microphone 101 is added, and the function of the audio acquisition unit 1311 differs.
  • The other components are the same as those of the audio output device 100 according to the first embodiment, so their description is omitted.
  • Specifically, the audio output device 100 shown in FIG. 4 is a device that allows a user to communicate with other users via wireless communication while riding a bicycle, climbing a mountain, trekking, or the like.
  • For example, the audio output device 100 is configured from a helmet-mounted headset with communication capabilities, a portable information terminal such as a smartphone, and a neck speaker.
  • The communication control unit 160 controls wireless communication by the audio output device 100. As shown in FIG. 4, the communication control unit 160 includes a transmission control unit 161 and a reception control unit 162.
  • The transmission control unit 161 transmits the audio acquired by the audio acquisition unit 1311 to another audio output device 100 that has been set in advance, either via wireless communication such as a public communication network or directly.
  • The reception control unit 162 controls the reception of audio transmitted from another audio output device 100.
  • The reception control unit 162 may control the reception of audio transmitted from another audio output device 100 that has been set in advance.
  • The wireless communication module 170 performs wireless communication with other audio output devices 100.
  • The wireless communication module 170 is, for example, a module for wireless communication such as Wi-Fi (registered trademark) or 5G, or a module for medium-range wireless communication using Bluetooth (registered trademark) or digital simple wireless communication.
  • the call microphone 101 picks up the speech of the user of the audio output device 100 in order to make a voice call via wireless communication between the audio output device 100 and another audio output device 100.
  • the call microphone 101 is placed in a position close to the user's mouth when the user is wearing the helmet.
  • the voice acquisition unit 1311 in the control unit 130 acquires the voice picked up by the call microphone 101, which picks up the user's speech. In other words, it can be said that the voice acquisition unit 1311 detects the user's speech.
  • the voice acquisition unit 1311 also acquires the voice received by the communication control unit 160 from another audio output device, and outputs the voice picked up by the call microphone 101 to the communication control unit 160.
  • the processing of the audio output device 100 shown in FIG. 4 is as shown in FIG. 2 in the first embodiment.
  • The audio output device 100 includes a detection unit 132 that detects that the user is in a state in which the user's physical load is high, an audio output unit 120 that outputs audio to the user, and an audio output control unit 1313 that, when the detection unit 132 detects that the user's physical load is high, puts the audio output unit 120 in a state in which the user can easily hear the sounds around the user.
  • This configuration makes the sounds around the user easier to hear when the user is under a high physical load. As a result, it is possible to provide an audio output device 100 that allows the user to properly understand the surrounding situation.
  • The audio output device 100 further includes an ambient sound acquisition unit 1312 that acquires audio around the user, and when the detection unit 132 detects that the user's physical load is high, the audio output control unit 1313 outputs the audio around the user acquired by the ambient sound acquisition unit 1312 to the user.
  • The audio output device 100 further includes an ambient sound acquisition unit 1312 that acquires audio around the user, and when the detection unit 132 detects that the user's physical load is high, the audio output control unit 1313 reduces or stops the noise cancellation effect, based on the audio around the user acquired by the ambient sound acquisition unit 1312, that is applied to the audio output to the user.
  • The detection unit 132 of the audio output device 100 detects a state in which the user's operational load is high, such as when the user is going uphill or moving at a predetermined speed or higher, as a state in which the user's physical load is high.
  • This configuration makes it possible to provide an audio output device 100 that allows the user to properly understand the surrounding situation when the user's physical load is high, such as when the user is going uphill or moving at a predetermined speed or higher.
  • The detection unit 132 of the audio output device 100 detects that the user is in a fatigued state or in a state of high stress as a state in which the user's physical load is high.
  • The audio output method executed by the audio output device 100 includes a detection step of detecting that the user is in a state in which the user's physical load is high, and an audio output control step of making the sounds around the user easier to hear in the audio output to the user when it is detected that the user's physical load is high.
  • This method makes the sounds around the user easier to hear when the user is under a high physical load. This makes it possible to provide a method for controlling the audio output device 100 that allows the user to properly understand the surrounding situation.
  • The audio output device, audio output method, and program according to this embodiment can be used, for example, as an audio output device, audio output method, and program that allow a user to properly understand the surrounding situation.
  • 100 Audio output device
  • 110 Microphone
  • 111 Microphone L
  • 112 Microphone R
  • 120 Audio output unit
  • 121 Audio output unit L
  • 122 Audio output unit R
  • 130 Control unit
  • 131 Audio processing unit
  • 1311 Audio acquisition unit
  • 1312 Ambient sound acquisition unit
  • 1313 Audio output control unit
  • 132 Detection unit
  • 140 Audio input unit
  • 150 Sensor
  • 160 Communication control unit
  • 161 Transmission control unit
  • 162 Reception control unit
  • 170 Wireless communication module

Landscapes

  • Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Multimedia (AREA)
  • Circuit For Audible Band Transducer (AREA)

Abstract

Provided are a voice output device and a voice output method that enable a user to appropriately ascertain conditions surrounding the user. This voice output device comprises: a detection unit that detects that a user is in a state where the physical stress of the user is significant; a voice output unit that outputs a voice to the user; and a voice output control unit that, if the detection unit detects the state where the physical stress of the user is significant, controls the voice output unit so as to create a state where surrounding noises around the user can be heard more easily.

Description

Audio output device, audio output method, and program
The present disclosure relates to an audio output device, an audio output method, and a program.
Voice calls over wireless communication are used in a wide range of situations as a means of communication between users. For example, during running, trekking, or cycling, devices that make voice calls via wireless communication are sometimes used so that several people can communicate with one another.
For example, Patent Document 1 below discloses earphones that take in external sound when the wearer is spoken to.
Patent Document 1: JP 2022-98974 A
However, when such a device is used while the user is under a high physical load, the user's attention to the surroundings decreases, and the user may miss sounds such as those of nearby vehicles. In such a case, the user may miss surrounding sounds even with a device that does not cover the user's ears.
In view of the above problem, the present disclosure aims to provide an audio output device, an audio output method, and a program that enable a user to properly understand the surrounding situation.
To solve the above problem and achieve this aim, the audio output device according to the present disclosure includes a detection unit that detects that the user is in a state in which the user's physical load is high, an audio output unit that outputs audio to the user, and an audio output control unit that, when the detection unit detects that the user's physical load is high, puts the audio output unit in a state in which the user can easily hear the sounds around the user.
According to the present disclosure, it is possible to provide an audio output device, an audio output method, and a program that enable a user to properly understand the surrounding situation.
FIG. 1 is a diagram showing a configuration example of a first embodiment of the audio output device according to the present disclosure.
FIG. 2 is a flowchart showing the flow of a first aspect of the processing of the audio output device according to the present disclosure.
FIG. 3 is a flowchart showing the flow of a second aspect of the processing of the audio output device according to the present disclosure.
FIG. 4 is a diagram showing a configuration example of a second embodiment of the audio output device according to the present disclosure.
Embodiments of the present disclosure are described in detail below with reference to the drawings. Note that the present invention is not limited to the embodiments described below.
(Configuration of audio output device)
(First embodiment)
FIG. 1 is a diagram showing a configuration example of a first embodiment of the audio output device according to the present disclosure. As shown in FIG. 1, the audio output device 100 according to the present disclosure includes a microphone 110, an audio output unit 120, a control unit 130, an audio input unit 140, and a sensor 150. Although not shown in FIG. 1, the device may also have a storage unit that stores various kinds of information. These components are described in order below.
Specifically, the audio output device 100 is a device with which a user listens to audio content while walking, running, or the like. In this case, the audio output device 100 is composed of a portable information terminal, such as a smartphone or an audio player, and headphones or earphones. As another specific example, the audio output device 100 is a device with which a user talks with other users or listens to audio content while riding a bicycle, climbing a mountain, trekking, or the like. In this case, the audio output device 100 is composed of a helmet-integrated headset with a communication function, a portable information terminal such as a smartphone or an audio player, and a neck speaker. Accordingly, the audio output device 100 may be a single device, or may be configured such that a device comprising the control unit 130, the audio input unit 140, and the sensor 150 is connected by wire or wirelessly to a device comprising the microphone 110 and the audio output unit 120.
The microphone 110 picks up various sounds. The microphone 110 includes a microphone L111 and a microphone R112.
The microphone L111 is provided on the user's left side. The microphone L111 picks up environmental sounds around the user of the audio output device 100, for example, for external sound capture or noise cancellation. The microphone R112 is provided on the user's right side. The function of the microphone R112 is the same as that of the microphone L111, so its description is omitted.
When the audio output device 100 is attached to a helmet or the like worn by the user, the microphone 110 is arranged, for example, on the left and right exterior of the helmet. When the audio output unit 120 is composed of earphones, headphones, a neck speaker, or the like, the microphone 110 is arranged near the audio output unit 120 and is a so-called external sound capture microphone or noise cancellation microphone.
The audio output unit 120 outputs audio to the user. The audio output unit 120 includes an audio output unit L121 and an audio output unit R122.
The audio output unit L121 is an audio output device, such as a speaker, provided on the user's left side. The audio output unit L121 outputs various sounds to the user. For example, the audio output unit L121 outputs the voice of a user of another audio output device 100. The audio output unit R122 is an audio output device, such as a speaker, provided on the user's right side. The function of the audio output unit R122 is the same as that of the audio output unit L121, so its description is omitted.
When the audio output unit 120 is attached to a helmet or the like worn by the user, it is positioned so as not to block the user's ears when the helmet is worn. When the audio output unit 120 is composed of earphones, headphones, a neck speaker, or the like, it is the speaker or sound-producing element of those devices.
The audio input unit 140 receives audio content from, for example, a storage unit (not shown) that stores audio content or a communication unit (not shown) that acquires audio content, and outputs the received audio content to the audio output unit 120.
The sensor 150 is a sensor or device that detects or measures the state of the audio output device 100 or of the user using the audio output device 100.
The sensor 150 may be a gyro sensor or an acceleration sensor. The gyro sensor may be a capacitive MEMS (Micro Electro Mechanical Systems) gyro sensor. In such a sensor, a movable electrode is driven in a primary vibration along one direction; when the movable electrode is rotated, the Coriolis force acting at 90° to the vibration direction produces a secondary vibration, which changes the capacitance, and the sensor detects this change. The angular velocity can be determined from the change in capacitance and the vibration phase of the movable electrode. The acceleration sensor may be, for example, a capacitive acceleration sensor in which a movable electrode and a fixed electrode are formed using MEMS, and acceleration is measured using the relationship between the acceleration and the change in capacitance caused by the movement of the movable electrode.
A device worn by the user, such as a smartwatch, may also be used as the sensor 150. That is, the sensor 150 may be replaced by the functions of a device worn by the user, or the functions of such a device may be used to complement the sensor 150 provided in the audio output device 100. Such a device can provide the user's activity level, heart rate, blood oxygen concentration, and the like, so the user's biological information can be acquired more appropriately. The gyro sensor and the acceleration sensor may be provided in the audio output device 100 or in a device worn by the user.
The control unit 130 is a controller that manages and controls the audio output device 100. The control unit 130 is realized by a processor, such as a CPU (Central Processing Unit) or MPU (Micro Processing Unit), executing various programs using RAM as a working area. The control unit 130 may also be realized by an integrated circuit such as an ASIC (Application Specific Integrated Circuit) or FPGA (Field Programmable Gate Array).
As shown in FIG. 1, the control unit 130 includes an audio processing unit 131 and a detection unit 132 as functional blocks realized by program execution, circuit configuration, or the like. The control unit 130 may execute these processes on a single CPU, or may include multiple CPUs and execute the processes in parallel on them. These components are described in order below.
The audio processing unit 131 performs various kinds of processing on the audio input to the audio output device 100 and on the audio output by the audio output device 100. The audio processing unit 131 includes, as functional blocks, an audio acquisition unit 1311, an ambient sound acquisition unit 1312, and an audio output control unit 1313.
The audio acquisition unit 1311 acquires the audio input from the audio input unit 140. The audio acquisition unit 1311 performs, for example, decoding corresponding to the codec of the audio input from the audio input unit 140.
The ambient sound acquisition unit 1312 acquires audio around the user. That is, the ambient sound acquisition unit 1312 acquires audio around the user from the microphone L111 and the microphone R112. The ambient sound acquisition unit 1312 may apply any filtering to the audio acquired from the microphone 110. A frequency selection filter may be used in the ambient sound acquisition unit 1312. A frequency selection filter extracts audio in the frequency band corresponding to frequencies selected from the frequency distribution of the audio data. For example, a band-pass filter, band-stop filter, high-pass filter, or low-pass filter may be used as the frequency selection filter.
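As an illustration of frequency selection, the following is a minimal sketch and not the filter used in this disclosure. It assumes single-channel samples held as Python floats and uses simple first-order filters; the function names, cutoff frequencies, and sample rate are hypothetical.

```python
import math


def low_pass(samples, cutoff_hz, sample_rate_hz):
    """First-order low-pass filter: attenuates content above cutoff_hz."""
    dt = 1.0 / sample_rate_hz
    rc = 1.0 / (2.0 * math.pi * cutoff_hz)
    alpha = dt / (rc + dt)  # smoothing factor in (0, 1)
    out = []
    y = 0.0
    for x in samples:
        y += alpha * (x - y)  # move the output toward the input
        out.append(y)
    return out


def high_pass(samples, cutoff_hz, sample_rate_hz):
    """Complementary high-pass: the input minus its low-pass component."""
    low = low_pass(samples, cutoff_hz, sample_rate_hz)
    return [x - l for x, l in zip(samples, low)]


def band_pass(samples, low_cut_hz, high_cut_hz, sample_rate_hz):
    """Band-pass built by cascading a high-pass and a low-pass stage."""
    stage = high_pass(samples, low_cut_hz, sample_rate_hz)
    return low_pass(stage, high_cut_hz, sample_rate_hz)
```

A practical device would more likely use higher-order filters with steeper roll-off; the first-order form is chosen here only to keep the sketch short.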
The audio output control unit 1313 controls the output of audio to the audio output unit 120. Specifically, the audio output control unit 1313 controls the output, to the audio output unit 120, of the audio acquired by the audio acquisition unit 1311 and the ambient sound acquired by the ambient sound acquisition unit 1312. For both of these, the audio output control unit 1313 also switches output to the audio output unit 120 on and off and adjusts the output volume. In addition, the audio output control unit 1313 performs noise cancellation on the audio acquired by the audio acquisition unit 1311, using the ambient sound acquired by the ambient sound acquisition unit 1312.
When the detection unit 132 detects that the user's physical load is high, the audio output control unit 1313 puts the audio output unit 120 in a state in which the user can easily hear the sounds around the user. Specifically, the audio output control unit 1313 outputs the ambient sound acquired by the ambient sound acquisition unit 1312 to the audio output unit 120 so that the user can hear the ambient sound; in other words, it performs control that makes the ambient sound easy for the user to hear.
Examples of how the audio output control unit 1313 outputs ambient sound to the audio output unit 120 are described below.
For example, when the audio output unit 120 is of a type that covers the user's ears, such as earphones or headphones, and the audio content acquired by the audio acquisition unit 1311 is not being output to the audio output unit 120, the audio output control unit 1313 outputs the ambient sound to the audio output unit 120.
Similarly, when the audio output unit 120 is of a type that covers the user's ears, such as earphones or headphones, and the audio content acquired by the audio acquisition unit 1311 is being output to the audio output unit 120, the audio output control unit 1313 either mixes the ambient sound into the audio content and outputs the result, or lowers the volume of the audio content, mixes in the ambient sound, and outputs the result.
When the audio output unit 120 is of a type that does not cover the user's ears, such as a helmet-integrated headset or a neck speaker, the audio output control unit 1313 outputs the ambient sound to the audio output unit 120 at a relatively high volume. Because the user already hears the ambient sound directly, a relatively high volume here means a volume higher than that at which the user hears the ambient sound directly.
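The three output cases above can be sketched as a gain selection followed by a mix. This is a minimal illustration under stated assumptions, not the control logic of this disclosure: the function names, the gain values (0.3 for lowered content, 2.0 for boosted ambient sound), and the handling of content on an open-ear device are all hypothetical, and samples are assumed to be floats in the range -1.0 to 1.0.

```python
def choose_gains(ears_covered, content_playing, lower_content=True):
    """Pick (content_gain, ambient_gain) for the high-physical-load state."""
    if not ears_covered:
        # Open-ear device: the user already hears ambient sound directly,
        # so it is reproduced with a gain above 1 (the value is an assumption).
        # Leaving content unchanged here is also an assumption.
        return (1.0 if content_playing else 0.0, 2.0)
    if not content_playing:
        # Ears covered, no content: output the ambient sound only.
        return (0.0, 1.0)
    # Ears covered, content playing: mix, optionally lowering the content.
    return (0.3 if lower_content else 1.0, 1.0)


def mix(content, ambient, content_gain, ambient_gain):
    """Mix two equal-length sample lists and clip the result to [-1.0, 1.0]."""
    return [
        max(-1.0, min(1.0, content_gain * c + ambient_gain * a))
        for c, a in zip(content, ambient)
    ]
```

For example, `mix(content, ambient, *choose_gains(True, True))` lowers the content and adds the ambient sound at unity gain.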
In addition, when the detection unit 132 detects that the user's physical load is high, the audio output control unit 1313 reduces or stops the noise cancellation effect, based on the audio around the user acquired by the ambient sound acquisition unit 1312, that is applied to the audio output from the audio output unit 120 to the user. Noise cancellation may be achieved by outputting, from the audio output unit 120, a sound in opposite phase to the ambient audio acquired by the ambient sound acquisition unit 1312, thereby canceling the sound regarded as noise.
Examples of how the audio output control unit 1313 reduces the noise cancellation effect for the audio output unit 120 are described below.
For example, when the audio output unit 120 is of a type that covers the user's ears, such as earphones or headphones, and the audio content acquired by the audio acquisition unit 1311 is being output to the audio output unit 120 with noise cancellation applied, the audio output control unit 1313 reduces the noise cancellation effect or stops the noise cancellation processing. When doing so, the audio output control unit 1313 may also lower the volume of the audio content or mute it.
To stop the noise cancellation processing, the audio output control unit 1313 stops outputting, to the audio output unit 120, the opposite-phase sound of the ambient sound acquired by the ambient sound acquisition unit 1312. To reduce the noise cancellation effect, the audio output control unit 1313 performs processing such as lowering the output level of that opposite-phase sound before outputting it to the audio output unit 120.
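The relation between the level of the opposite-phase sound and the remaining ambient sound can be sketched as follows. This is an idealized illustration only: it ignores the acoustic path, latency, and filtering that a real noise cancellation system must handle, and the function names and the `strength` parameter are hypothetical.

```python
def anti_phase(ambient, strength=1.0):
    """Opposite-phase signal scaled by strength.

    strength = 1.0 models full cancellation, values between 0 and 1 model a
    reduced effect, and 0.0 models stopped noise cancellation.
    """
    if not 0.0 <= strength <= 1.0:
        raise ValueError("strength must be between 0.0 and 1.0")
    return [-strength * a for a in ambient]


def heard_ambient(ambient, strength):
    """Idealized ambient sound left at the ear: ambient plus opposite phase."""
    return [a + n for a, n in zip(ambient, anti_phase(ambient, strength))]
```

In this model, lowering `strength` from 1.0 toward 0.0 leaves a proportionally larger share of the ambient sound audible.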
The detection unit 132 detects that the user is in a state in which the user's physical load is high. It does so based on information indicating the results of detection or measurement by the sensor 150. In addition to the information acquired from the sensor 150, the detection unit 132 may use current position information based on signals from positioning satellites received by a GNSS (Global Navigation Satellite System) receiving unit (not shown), and map information or topographical information that is acquired via a communication unit (not shown) or held by the audio output device 100.
The detection unit 132 detects that the user's physical load is high based on the user's operational load. Specifically, the detection unit 132 detects a state in which the user's operational load is high as a state in which the user's physical load is high.
A state of high operational load detected by the detection unit 132 is, for example, a state in which the user is going uphill. That is, the detection unit 132 detects that the user is going uphill as a state in which the user's physical load is high. The means of movement does not matter as long as the user's operational load becomes high: the user may be walking uphill, running uphill, or pedaling a bicycle uphill. Here, an uphill slope is one whose gradient is at or above a predetermined level at which the user's operational load becomes high, and it includes stairs. Such a state can also be described as the user being in an environment in which the operational load is high.
The detection unit 132 detects that the user is going uphill from, for example, information from the gyro sensor, acceleration sensor, or the like of the sensor 150. The detection unit 132 acquires information on the user's inclination and detects from the average inclination that the user is going uphill. The detection unit 132 also detects, from continuous vibrations and changes in inclination indicated by the gyro sensor, acceleration sensor, or the like of the sensor 150, that the user is walking, running, or pedaling a bicycle. For example, the detection unit 132 detects that the user is going uphill when the direction of gravitational acceleration indicated by the gyro sensor, acceleration sensor, or the like of the sensor 150 changes so as to point rearward relative to the user's direction of travel.
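The uphill check based on average inclination and the direction of gravity can be sketched as follows. This is a minimal illustration under assumptions that are not stated in this disclosure: the sensor is fixed relative to the user's direction of travel, the device frame has x pointing forward and z pointing up, each sample is a gravity estimate in that frame (about (0, -9.81) for (gx, gz) on flat ground), and the 3-degree threshold is a hypothetical value.

```python
import math


def pitch_deg(gx, gz):
    """Pitch angle in degrees from gravity components in the device frame.

    Assumed axes: x forward, z up. When the user tilts upward on a slope,
    gravity gains a rearward (negative x) component, giving a positive pitch.
    """
    return math.degrees(math.atan2(-gx, -gz))


def is_uphill(gravity_samples, threshold_deg=3.0):
    """True if the average pitch over the window reaches threshold_deg.

    gravity_samples is a list of (gx, gz) tuples in m/s^2.
    """
    if not gravity_samples:
        return False
    pitches = [pitch_deg(gx, gz) for gx, gz in gravity_samples]
    return sum(pitches) / len(pitches) >= threshold_deg
```

Averaging over a window, rather than testing single samples, is what keeps step-by-step body motion from triggering the check.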
The detection unit 132 calculates the moving speed from the distance traveled per unit time in the current position information, which is based on signals from positioning satellites received by the GNSS receiving unit. Combining this with, for example, the continuous vibrations indicated by the gyro sensor, acceleration sensor, or the like of the sensor 150, the detection unit 132 determines whether the user is walking uphill, running uphill, or pedaling a bicycle uphill, and thereby detects that the user's operational load is high.
The detection unit 132 may also detect that the user's operational load is high by detecting that the user is traveling in an uphill direction, based on the current position information derived from signals from positioning satellites received by the GNSS receiving unit and on map information or topographical information.
 検出部132は、例えば、ユーザが所定速度以上で移動していることを、ユーザの身体的負荷が大きい状態であることとして検出する。検出部132は、センサ150のジャイロセンサや、加速度センサなどの情報から、音声出力装置100を持っている人物が走っていることを検出する。つまり、検出部132は、ユーザが、走っていること(言い換えると、歩行速度以上で移動していること)を、動作負荷が大きい状態であることとして検出する。 The detection unit 132 detects, for example, that the user is moving at a predetermined speed or faster as a state in which the user's physical load is high. The detection unit 132 detects that the person holding the audio output device 100 is running from information from the gyro sensor, acceleration sensor, etc. of the sensor 150. In other words, the detection unit 132 detects that the user is running (in other words, moving at a walking speed or faster) as a state in which the user's operational load is high.
 検出部132は、GNSS受信部が受信した測位衛星からの信号に基づく現在位置情報の単位時間に対する移動距離に基づいて、ユーザが所定速度以上で移動していることを検出してもよい。 The detection unit 132 may detect that the user is moving at a predetermined speed or faster based on the distance traveled per unit time of the current location information based on a signal from a positioning satellite received by the GNSS receiving unit.
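The speed check above, distance traveled per unit time compared with a predetermined speed, can be illustrated with two timestamped position fixes. The sketch below assumes fixes of the form (time in seconds, latitude, longitude), uses the haversine great-circle distance on a spherical Earth, and picks 2.0 m/s as a purely illustrative threshold; none of these details come from the source.

```python
import math

EARTH_RADIUS_M = 6_371_000.0  # mean Earth radius, spherical approximation


def haversine_m(lat1, lon1, lat2, lon2):
    """Great-circle distance in metres between two points given in degrees."""
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dphi = p2 - p1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dlmb / 2) ** 2
    return 2 * EARTH_RADIUS_M * math.asin(math.sqrt(a))


def speed_mps(fix_a, fix_b):
    """Average speed in m/s between two fixes (t_seconds, lat_deg, lon_deg)."""
    (t1, lat1, lon1), (t2, lat2, lon2) = fix_a, fix_b
    dt = t2 - t1
    if dt <= 0:
        raise ValueError("fixes must be in increasing time order")
    return haversine_m(lat1, lon1, lat2, lon2) / dt


def is_at_or_above_speed(fix_a, fix_b, threshold_mps=2.0):
    """True when the average speed reaches the (hypothetical) threshold."""
    return speed_mps(fix_a, fix_b) >= threshold_mps
```

In practice consecutive fixes are noisy, so a real implementation would likely smooth over several fixes before comparing against the threshold.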
The detection unit 132 detects that the user is in a state of high physical load based on the user's fatigue state. Specifically, the detection unit 132 detects that the user is fatigued as a state in which the user's physical load is high.
The fatigued state detected by the detection unit 132 is a state of physical fatigue. The detection unit 132 detects that the user is fatigued based on, for example, the user's heart rate variability acquired from the sensor 150. The detection unit 132 stores the average value of the user's heart rate variability, derived from the user's heart rate detected by the sensor 150, and detects that the user is fatigued by detecting that the heart rate variability has fallen below the normal range.
The detection unit 132 detects that the user is in a state of high physical load based on the user's stress level. Specifically, the detection unit 132 detects that the user's stress level is high as a state in which the user's physical load is high.
The detection unit 132 detects the user's stress level based on, for example, the user's heart rate variability acquired from the sensor 150. The detection unit 132 stores the average value of the user's heart rate variability, derived from the user's heart rate detected by the sensor 150, detects the user's stress level by detecting that the heart rate variability has fallen below the normal range, and detects that the user's stress level is high from the degree of deviation from the average value.
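The heart-rate-variability checks above store an average and flag a drop below the normal range. A minimal sketch follows, under two assumptions that the source does not state: heart rate variability is measured as RMSSD over beat-to-beat (RR) intervals in milliseconds, and "below the normal range" means below a fixed fraction (`drop_ratio`, hypothetical) of the stored average.

```python
import math


def rmssd(rr_ms):
    """Root mean square of successive differences of RR intervals (ms)."""
    if len(rr_ms) < 2:
        raise ValueError("need at least two RR intervals")
    diffs = [b - a for a, b in zip(rr_ms, rr_ms[1:])]
    return math.sqrt(sum(d * d for d in diffs) / len(diffs))


class HrvBaseline:
    """Keeps a running mean of HRV values and flags values well below it."""

    def __init__(self, drop_ratio=0.7):
        self.drop_ratio = drop_ratio  # hypothetical fraction of the mean
        self.mean = None
        self.count = 0

    def update(self, value):
        """Fold one HRV measurement into the stored average."""
        self.count += 1
        if self.mean is None:
            self.mean = value
        else:
            self.mean += (value - self.mean) / self.count

    def is_lowered(self, value):
        """True when value falls below drop_ratio times the stored average."""
        return self.mean is not None and value < self.drop_ratio * self.mean
```

The same baseline could serve both the fatigue and the stress checks, with the stress level graded by how far the value sits below the average; that grading is left out here.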
Detection by the detection unit 132 of a state in which the user's physical load is high is not limited to the methods described above; various sensing methods and techniques are applicable, and they may be used in combination.
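One simple way to combine the individual detections is a logical OR, so that any single indicator is enough to report a state of high physical load. The source only says the methods may be combined; the OR rule, the `LoadSignals` type, and its field names below are assumptions for illustration.

```python
from dataclasses import dataclass


@dataclass
class LoadSignals:
    """Hypothetical container for the individual detection results."""
    uphill: bool = False
    fast: bool = False
    fatigued: bool = False
    stressed: bool = False


def is_high_physical_load(signals: LoadSignals) -> bool:
    """Report high physical load when any single indicator is set."""
    return signals.uphill or signals.fast or signals.fatigued or signals.stressed
```

Other combinations, such as requiring two indicators at once or weighting them, would fit the same interface.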
(First aspect of processing of the audio output device)
Next, a first aspect of the processing of the audio output device 100 according to the present disclosure will be described with reference to FIG. 2. FIG. 2 is a flowchart showing the flow of the first aspect of the processing of the audio output device according to the present disclosure. FIG. 2 shows an example of an audio output method executed by the audio output device according to the present disclosure, and is also an example of processing based on a program executed by the control unit 130. The first aspect of the processing of the audio output device 100 is described below along the flow shown in FIG. 2.
The process shown in FIG. 2 is started, for example, when the audio output device 100 is powered on or when an application that executes the process shown in FIG. 2 is started on the audio output device 100. The process shown in FIG. 2 may also be started when the user of the audio output device 100 puts on a helmet, earphones, headphones, a neck speaker, or the like equipped with the audio output unit 120. The process shown in FIG. 2 may also be started at any time by the user operating the audio output device 100 while using it. While the process shown in FIG. 2 runs, the audio acquired by the audio acquisition unit 1311 may or may not be output to the audio output unit 120.
First, the audio output device 100 detects whether or not the user of the audio output device 100 is in a state of high physical load (step S101). Specifically, the detection unit 132 detects whether or not the user of the audio output device 100 is in a state of high physical load. The state of high physical load is determined based on detection of, for example, a state of high operational load, a fatigued state, or a high stress level of the user.
If it is detected in step S101 that the user is in a state of high physical load (step S101: Yes), the audio output device 100 starts outputting external audio (step S102). Specifically, the audio output device 100 outputs the audio acquired by the ambient sound acquisition unit 1312 to the audio output unit 120. At this time, if the audio output device 100 is a device that performs voice calls with other audio output devices and a voice call is in progress, the audio acquired by the ambient sound acquisition unit 1312 is output in addition to the audio of the voice call. Likewise, if the audio output device 100 is a device for listening to audio content and audio content is being output, the audio acquired by the ambient sound acquisition unit 1312 is output in addition to the audio of the audio content. That is, the audio output device 100 outputs the audio acquired by the ambient sound acquisition unit 1312 in addition to the audio input from the audio input unit 140. In other words, both the audio acquired by the ambient sound acquisition unit 1312 and the audio input from the audio input unit 140 are output from the audio output unit 120.
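Outputting the ambient sound in addition to the call or content audio, as in step S102, amounts to mixing two sample streams. The sketch below assumes floating-point samples in the range -1.0 to 1.0, a hypothetical `ambient_gain` parameter, and hard clipping of the sum; the source does not specify how the two signals are combined.

```python
def mix(primary, ambient, ambient_gain=1.0):
    """Add ambient samples to primary samples and clip to [-1.0, 1.0].

    primary: call or content audio; ambient: audio from the ambient
    microphone. Streams of different length are padded with silence.
    """
    n = max(len(primary), len(ambient))
    out = []
    for i in range(n):
        p = primary[i] if i < len(primary) else 0.0
        a = ambient[i] if i < len(ambient) else 0.0
        out.append(max(-1.0, min(1.0, p + ambient_gain * a)))
    return out
```

When no call or content is playing, `primary` is empty and the output is just the ambient sound, which matches the case where only the external audio is output.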
Next, the audio output device 100 determines whether or not the process shown in FIG. 2 has ended (step S103). The process shown in FIG. 2 is determined to have ended, for example, when the audio output device 100 is powered off or when the application that executes the process shown in FIG. 2 is terminated on the audio output device 100. The process may also be determined to have ended when the user of the audio output device 100 takes off the helmet, earphones, headphones, neck speaker, or the like equipped with the audio output unit 120. The process may also be determined to have ended when the user performs an end operation on the audio output device 100 at any time while using it.
If it is determined in step S103 that the process has not ended (step S103: No), the audio output device 100 determines whether or not external audio is being output (step S104). If it is determined that external audio is being output (step S104: Yes), the audio output device 100 determines whether or not the state of high physical load continues (step S105). Regarding the determination in step S104: when the flow arrives from No in step S101, step S102 has not been performed, so external audio is not being output; when the flow arrives from Yes in step S101, step S102 has been performed, so external audio is being output.
If it is determined in step S105 that the state of high physical load continues (step S105: Yes), the determination in step S105 is performed again. While step S105 continues to return Yes, a determination of whether or not the process has ended, as in step S103, may also be included. If it is determined in step S105 that the state of high physical load does not continue (step S105: No), the audio output device 100 stops outputting the external audio (step S106).
After step S106, the audio output device 100 determines, as in step S103, whether or not the process shown in FIG. 2 has ended (step S107). If it is determined that the process has ended (step S107: Yes), the audio output device 100 ends the process shown in FIG. 2.
If it is not detected in step S101 that the user is in a state of high physical load (step S101: No), the audio output device 100 proceeds to step S103 and executes the processing from step S103 onward.
If it is determined in step S103 that the process has ended (step S103: Yes), the audio output device 100 ends the process shown in FIG. 2.
If it is determined in step S104 that external audio is not being output (step S104: No), the audio output device 100 proceeds to step S107 and executes the processing from step S107 onward.
According to the first aspect of the processing of the audio output device 100 described above, ambient sounds can be output to the user when it is detected that the user is in a state of high physical load. It is therefore possible to provide an audio output device 100 that allows the user to properly grasp the surrounding situation.
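The net effect of steps S101 to S106 can be condensed into a small state machine that is updated once per detection result. This is a simplified sketch, not the flowchart itself: the class and action names are hypothetical, and the end checks of steps S103 and S107 are assumed to live in the caller's loop.

```python
class PassthroughController:
    """Tracks whether external (ambient) audio is currently being output."""

    def __init__(self):
        self.external_audio_on = False

    def step(self, high_load: bool) -> str:
        """Apply one detection result and return the action taken."""
        if not self.external_audio_on:
            if high_load:  # S101: Yes -> S102, start external audio
                self.external_audio_on = True
                return "start"
            return "idle"  # S101: No, S104: No
        if high_load:  # S105: Yes, keep outputting external audio
            return "keep"
        self.external_audio_on = False  # S105: No -> S106, stop external audio
        return "stop"
```

A caller would invoke `step` with each new detection result and leave the loop when the device is powered off, the application ends, or the user removes the headset.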
(Second aspect of processing of the audio output device)
Next, a second aspect of the processing of the audio output device 100 according to the present disclosure will be described with reference to FIG. 3. FIG. 3 is a flowchart showing the flow of the second aspect of the processing of the audio output device according to the present disclosure. FIG. 3 shows an example of an audio output method executed by the audio output device according to the present disclosure, and is also an example of processing based on a program executed by the control unit 130. The second aspect of the processing of the audio output device 100 is described below along the flow shown in FIG. 3.
The process of FIG. 3 applies when the audio output unit 120 of the audio output device 100 is earphones or headphones with a noise canceling function. The process of FIG. 3 also presupposes that the audio output device 100 is applying noise canceling, based on the environmental sound acquired by the microphone 110, to the audio content acquired from the audio input unit 140.
Steps S201, S203, S205, and S207 shown in FIG. 3 are the same as steps S101, S103, S105, and S107 shown in FIG. 2, respectively, so their description is omitted.
If it is detected in step S201 that the user is in a state of high physical load (step S201: Yes), the audio output device 100 limits noise canceling (N/C) (step S202). Specifically, the audio output device 100 reduces or stops the noise cancellation effect, which is based on the audio acquired by the ambient sound acquisition unit 1312, applied to the audio output to the user.
Next, the audio output device 100 determines whether or not the process shown in FIG. 3 has ended (step S203). If it is determined in step S203 that the process has not ended (step S203: No), the audio output device 100 determines whether or not the noise canceling limitation is in effect (step S204). If it is determined that the noise canceling limitation is in effect (step S204: Yes), the audio output device 100 determines whether or not the state of high physical load continues (step S205). Regarding the determination in step S204: when the flow arrives from No in step S201, step S202 has not been performed, so the noise canceling limitation is not in effect; when the flow arrives from Yes in step S201, step S202 has been performed, so the noise canceling limitation is in effect.
If it is determined in step S205 that the state of high physical load does not continue (step S205: No), the audio output device 100 lifts the noise canceling limitation (step S206).
If it is determined in step S204 that the noise canceling limitation is not in effect (step S204: No), the audio output device 100 proceeds to step S207 and executes the processing from step S207 onward.
According to the second aspect of the processing of the audio output device 100 described above, the noise cancellation effect can be reduced or stopped when it is detected that the user is in a state of high physical load. This makes it easier for the user to hear ambient sounds. It is therefore possible to provide an audio output device 100 that allows the user to properly grasp the surrounding situation.
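Reducing or stopping the noise cancellation effect can be pictured as scaling the anti-noise component by a strength between 0 and 1. The sketch below uses an idealized model in which the anti-noise is simply the negated ambient microphone signal; actual noise canceling relies on adaptive filtering, and the function names and the default strengths here are assumptions for illustration only.

```python
def nc_strength(high_load, normal=1.0, limited=0.0):
    """Pick the noise canceling strength for the current load state.

    limited=0.0 stops the effect; a value between 0 and normal reduces it.
    """
    return limited if high_load else normal


def driver_signal(content, ambient, strength):
    """Content plus scaled anti-noise (idealized: anti-noise = -ambient)."""
    if not 0.0 <= strength <= 1.0:
        raise ValueError("strength must be between 0.0 and 1.0")
    return [c - strength * a for c, a in zip(content, ambient)]
```

With `strength` at 0.0 the driver signal is the content alone, so ambient sound reaches the ear uncancelled; intermediate values correspond to a reduced rather than stopped effect.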
(Configuration of audio output device)
(Second embodiment)
Next, the audio output device 100 according to the second embodiment will be described with reference to FIG. 4. FIG. 4 is a diagram showing a configuration example of the second embodiment of the audio output device according to the present disclosure. In the audio output device 100 according to the second embodiment, compared with the audio output device 100 according to the first embodiment, the audio input unit 140 is replaced by a communication control unit 160 and a wireless communication module 170. In addition, a call microphone 101 is added, and the function of the audio acquisition unit 1311 differs. Other than the above, the audio output device 100 according to the second embodiment is the same as the audio output device 100 according to the first embodiment, so the description is omitted.
Specifically, the audio output device 100 shown in FIG. 4 is a device with which a user makes calls with other users by wireless communication while, for example, riding a bicycle, mountain climbing, or trekking. As such a device, the audio output device 100 is composed of a helmet-integrated headset with a communication function, a portable information terminal such as a smartphone, and a neck speaker.
The communication control unit 160 controls wireless communication by the audio output device 100. As shown in FIG. 4, the communication control unit 160 includes a transmission control unit 161 and a reception control unit 162.
The transmission control unit 161 transmits the audio acquired by the audio acquisition unit 1311 to another audio output device 100 that has been set in advance. That is, the transmission control unit 161 transmits the audio acquired by the audio acquisition unit 1311 to the preset other audio output device 100, either via wireless communication such as a public communication network or directly.
The reception control unit 162 controls the reception of audio transmitted from another audio output device 100. For example, the reception control unit 162 may perform control so as to receive audio transmitted from a preset other audio output device 100.
The wireless communication module 170 performs mutual wireless communication with other audio output devices 100. The wireless communication module 170 is, for example, a wireless communication module for wireless communication such as Wi-Fi (registered trademark) or 5G, or a wireless communication module for medium-range wireless communication using Bluetooth (registered trademark) or for digital simple wireless communication.
The call microphone 101 picks up the speech of the user of the audio output device 100 in order to make voice calls by wireless communication with other audio output devices 100. When the audio output device 100 is mounted on a helmet or the like worn by the user, the call microphone 101 is placed at a position close to the user's mouth when the user wears the helmet.
The audio acquisition unit 1311 in the control unit 130 acquires the audio picked up by the call microphone 101, which picks up the user's speech. In other words, the audio acquisition unit 1311 can be said to detect the user's speech. The audio acquisition unit 1311 also acquires the audio that the communication control unit 160 has received from another audio output device, and outputs the audio picked up by the call microphone 101 to the communication control unit 160.
The processing of the audio output device 100 shown in FIG. 4 follows FIG. 2 described for the first embodiment.
(Configuration and effects)
The audio output device 100 according to the present disclosure includes a detection unit 132 that detects that the user is in a state of high physical load, an audio output unit 120 that outputs audio to the user, and an audio output control unit 1313 that, when the detection unit 132 detects that the user is in a state of high physical load, places the audio output unit 120 in a state in which ambient sounds around the user are easy to hear.
With this configuration, ambient sounds around the user can be made easy to hear when the user is in a state of high physical load. It is therefore possible to provide an audio output device 100 that allows the user to properly grasp the surrounding situation.
The audio output device 100 according to the present disclosure further includes an ambient sound acquisition unit 1312 that acquires audio around the user, and when the detection unit 132 detects that the user is in a state of high physical load, the audio output control unit 1313 outputs to the user the audio around the user acquired by the ambient sound acquisition unit 1312.
With this configuration, ambient sounds can be output to the user when it is detected that the user is in a state of high physical load. It is therefore possible to provide an audio output device 100 that allows the user to properly grasp the surrounding situation.
The audio output device 100 according to the present disclosure further includes an ambient sound acquisition unit 1312 that acquires audio around the user, and when the detection unit 132 detects that the user is in a state of high physical load, the audio output control unit 1313 reduces or stops the noise cancellation effect, which is based on the audio around the user acquired by the ambient sound acquisition unit 1312, applied to the audio output to the user.
With this configuration, the noise cancellation effect can be reduced or stopped when it is detected that the user is in a state of high physical load. This makes it easier for the user to hear ambient sounds. It is therefore possible to provide an audio output device 100 that allows the user to properly grasp the surrounding situation.
The detection unit 132 of the audio output device 100 according to the present disclosure detects a state in which the load on the user is high, such as a state in which the user is going uphill or moving at or above a predetermined speed, as a state in which the user's physical load is high.
With this configuration, it is possible to provide an audio output device 100 that allows the user to properly grasp the surrounding situation when the user is in a state of high physical load, such as when going uphill or moving at or above a predetermined speed.
The detection unit 132 of the audio output device 100 according to the present disclosure detects that the user is fatigued, or that the user's stress level is high, as a state in which the user's physical load is high.
With this configuration, the fact that the user is fatigued or has a high stress level can be detected as a state in which the user's physical load is high.
The audio output method executed by the audio output device 100 according to the present disclosure includes a detection step of detecting that the user is in a state of high physical load, and an audio output control step of, when it is detected that the user is in a state of high physical load, making ambient sounds around the user easy to hear relative to the audio output to the user.
With this configuration, ambient sounds around the user can be made easy to hear when the user is in a state of high physical load. It is therefore possible to provide a method of controlling the audio output device 100 that allows the user to properly grasp the surrounding situation.
Embodiments of the present disclosure have been described above, but the embodiments are not limited by the contents described. The components described above include those that a person skilled in the art could easily conceive of, those that are substantially the same, and those within the so-called range of equivalents. The components described above can also be combined as appropriate. Furthermore, various omissions, substitutions, or modifications of the components can be made without departing from the gist of the embodiments described above.
The audio output device, audio output method, and program according to the present embodiment can be used, for example, in an audio output device, audio output method, and program that allow a user to properly grasp the surrounding situation.
100 Audio output device
110 Microphone
111 Microphone L
112 Microphone R
120 Audio output unit
121 Audio output unit L
122 Audio output unit R
130 Control unit
131 Audio processing unit
1311 Audio acquisition unit
1312 Ambient sound acquisition unit
1313 Audio output control unit
132 Detection unit
140 Audio input unit
150 Sensor
160 Communication control unit
161 Transmission control unit
162 Reception control unit
170 Wireless communication module

Claims (8)

  1.  ユーザの状態が前記ユーザの身体的負荷が大きい状態であることを検出する検出部と、
     前記ユーザに対して音声を出力する音声出力部と、
     前記検出部が前記ユーザの身体的負荷が大きい状態であることを検出した場合、前記音声出力部において、前記ユーザの周辺音が聞こえやすい状態とする音声出力制御部と、を備える、
     音声出力装置。
    A detection unit that detects that a state of a user is a state in which a physical load of the user is large;
    a voice output unit for outputting voice to the user;
    and an audio output control unit that, when the detection unit detects that the user is in a state of high physical load, makes the audio output unit in a state where the user can easily hear ambient sounds.
    Audio output device.
  2.  前記ユーザの周囲の音声を取得する周辺音取得部と、をさらに備え、
     前記音声出力制御部は、前記検出部が前記ユーザの身体的負荷が大きい状態であることを検出した場合、前記ユーザに前記周辺音取得部が取得した前記ユーザの周囲の音声を出力する、
     請求項1に記載の音声出力装置。
    An ambient sound acquisition unit that acquires sounds around the user,
    The audio output control unit outputs, to the user, audio around the user acquired by the ambient sound acquisition unit when the detection unit detects that the user is in a state of high physical load.
    The audio output device according to claim 1 .
  3.  前記ユーザの周囲の音声を取得する周辺音取得部をさらに備え、
     前記音声出力制御部は、前記検出部が前記ユーザの身体的負荷が大きい状態であることを検出した場合、前記ユーザに対して出力する音声に対して前記周辺音取得部が取得した前記ユーザの周囲の音声に基づくノイズキャンセル効果を低減または停止する、
     請求項1に記載の音声出力装置。
    An ambient sound acquisition unit that acquires sounds around the user,
    When the detection unit detects that the physical load of the user is high, the audio output control unit reduces or stops a noise cancellation effect based on the audio around the user acquired by the ambient sound acquisition unit for the audio to be output to the user.
    The audio output device according to claim 1 .
  4.  前記検出部は、前記ユーザの動作負荷が大きい状態にあることを、前記ユーザの身体的負荷が大きい状態であることとして検出する、
     請求項1から3のいずれか1項に記載の音声出力装置。
    The detection unit detects a state in which the user's operational load is high as a state in which the user's physical load is high.
    The audio output device according to claim 1 .
  5.  前記検出部は、前記ユーザが疲労状態にあることを、前記ユーザの身体的負荷が大きい状態であることとして検出する、
     請求項1から3のいずれか1項に記載の音声出力装置。
    The detection unit detects that the user is in a fatigued state as a state in which the user is under a large physical load.
    The audio output device according to claim 1 .
  6.  前記検出部は、前記ユーザのストレス度が高い状態にあることを、前記ユーザの身体的負荷が大きい状態であることとして検出する、
     請求項1から3のいずれか1項に記載の音声出力装置。
    The detection unit detects that the user's stress level is high as a state in which the user's physical load is high.
    The audio output device according to claim 1 .
  7.  ユーザの状態が、前記ユーザの身体的負荷が大きい状態であることを検出する検出ステップと、
     前記ユーザの身体的負荷が大きい状態であることが検出された場合、前記ユーザに対して出力する音声に対して、前記ユーザの周辺音が聞こえやすい状態とする音声出力制御ステップと、を含む、
     音声出力装置が実行する音声出力方法。
    A detection step of detecting that a state of the user is a state in which a physical load of the user is large;
    and when it is detected that the physical load of the user is high, a sound output control step of making the sound output to the user easy to hear sounds around the user.
    An audio output method executed by an audio output device.
  8.  A program that causes a processor controlling an audio output device to execute processing comprising:
     a detection process of detecting that a user is in a state of high physical load; and
     a process of, when the state of high physical load is detected, placing the audio output to the user in a state in which sounds around the user are easy to hear.
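
The detection and control steps recited in the claims above can be illustrated with a minimal Python sketch. It is not the applicant's implementation. The names `UserState`, `detect_high_load`, and `noise_cancel_gain`, the normalized scores, and the 0.7 threshold are all hypothetical, since the claims do not state how load is measured or how strongly noise cancellation is reduced. Combining the three indicators with a logical OR is likewise an assumption; the claims present them as alternatives.

```python
from dataclasses import dataclass
from enum import Enum


class LoadSource(Enum):
    """Indicators that the claims treat as 'high physical load'."""
    MOTION = "motion load"
    FATIGUE = "fatigue"
    STRESS = "stress"


@dataclass
class UserState:
    # Hypothetical scores normalized to [0, 1]; the claims do not
    # specify sensors, units, or scales.
    motion_load: float
    fatigue: float
    stress: float


# Hypothetical threshold; not taken from the patent text.
HIGH_LOAD_THRESHOLD = 0.7


def detect_high_load(state, threshold=HIGH_LOAD_THRESHOLD):
    """Detection step: return the indicators at or above the threshold."""
    scores = {
        LoadSource.MOTION: state.motion_load,
        LoadSource.FATIGUE: state.fatigue,
        LoadSource.STRESS: state.stress,
    }
    return [source for source, score in scores.items() if score >= threshold]


def noise_cancel_gain(state, normal_gain=1.0, reduced_gain=0.0):
    """Audio output control step.

    Returns the strength of the noise cancellation effect: the normal
    value when no high-load indicator is detected, otherwise a reduced
    value (0.0 stops cancellation so ambient sound is easier to hear).
    """
    return reduced_gain if detect_high_load(state) else normal_gain


if __name__ == "__main__":
    resting = UserState(motion_load=0.1, fatigue=0.2, stress=0.3)
    tired = UserState(motion_load=0.1, fatigue=0.9, stress=0.3)
    print(noise_cancel_gain(resting))
    print(noise_cancel_gain(tired))
```

Passing a nonzero `reduced_gain` models the "reduces" branch of the claim language, while the default of 0.0 models the "stops" branch.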
PCT/JP2023/040242 2023-01-17 2023-11-08 Voice output device, voice output method, and program WO2024154416A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2023005168 2023-01-17
JP2023-005168 2023-01-17

Publications (1)

Publication Number Publication Date
WO2024154416A1 (en) 2024-07-25

Family

ID=91955789

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2023/040242 WO2024154416A1 (en) 2023-01-17 2023-11-08 Voice output device, voice output method, and program

Country Status (1)

Country Link
WO (1) WO2024154416A1 (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS5756083U (en) * 1980-09-17 1982-04-01
JPH05145985A (en) * 1991-11-18 1993-06-11 Oki Electric Ind Co Ltd Portable stereo unit
JP2009017083A (en) * 2007-07-03 2009-01-22 Data Bank Commerce:Kk Noise canceller
JP2020184692A (en) * 2019-05-08 2020-11-12 シャープ株式会社 Control system and control method of the same

Legal Events

Date Code Title Description
121 EP: the EPO has been informed by WIPO that EP was designated in this application

Ref document number: 23917628

Country of ref document: EP

Kind code of ref document: A1