WO2024154416A1

WO2024154416A1 - Voice output device, voice output method, and program

Info

Publication number: WO2024154416A1
Application number: PCT/JP2023/040242
Authority: WO
Inventors: 領平須永
Original assignee: 株式会社Ｊｖｃケンウッド
Priority date: 2023-01-17
Filing date: 2023-11-08
Publication date: 2024-07-25

Abstract

Provided are a voice output device and a voice output method that enable a user to appropriately ascertain conditions surrounding the user. This voice output device comprises: a detection unit that detects that a user is in a state where the physical stress of the user is significant; a voice output unit that outputs a voice to the user; and a voice output control unit that, if the detection unit detects the state where the physical stress of the user is significant, controls the voice output unit so as to create a state where surrounding noises around the user can be heard more easily.

Description

Audio output device, audio output method, and program

This disclosure relates to an audio output device, an audio output method, and a program.

　Voice calls using wireless communication are used in a wide range of situations as a means of communication between users. For example, when running, trekking, cycling, etc., multiple people may use devices that make voice calls via wireless communication to communicate with each other.

For example, the following Patent Document 1 discloses earphones that take in outside sounds when spoken to.

JP 2022-98974 A

However, when using such a device, if the user is under a great physical strain, their attention to the surroundings may decrease, and the user may miss sounds such as those of nearby vehicles. In such a case, even if the device does not cover the user's ears, there is a similar possibility that the user may miss surrounding sounds.

In consideration of the above problems, the present disclosure aims to provide an audio output device, an audio output method, and a program that enable a user to properly understand the surrounding situation.

In order to solve the above-mentioned problems and achieve the objectives, the audio output device of the present disclosure includes a detection unit that detects that the user's state is one in which the physical load of the user is high, an audio output unit that outputs audio to the user, and an audio output control unit that, when the detection unit detects that the physical load of the user is high, causes the audio output unit to make it easier for the user to hear sounds around the user.

According to the present disclosure, it is possible to provide an audio output device, an audio output method, and a program that enable a user to properly understand the surrounding situation.

FIG. 1 is a diagram illustrating an example of the configuration of a first embodiment of an audio output device according to the present disclosure. FIG. 2 is a flowchart showing a flow of a first aspect of the processing of the audio output device according to the present disclosure. FIG. 3 is a flowchart showing a flow of a second aspect of the processing of the audio output device according to the present disclosure. FIG. 4 is a diagram illustrating an example of the configuration of a second embodiment of a sound output device according to the present disclosure.

Below, an embodiment of the present disclosure will be described in detail with reference to the drawings. Note that the present invention is not limited to the embodiment described below.

(Configuration of audio output device)
First Embodiment
Fig. 1 is a diagram showing a configuration example of a first embodiment of a sound output device according to the present disclosure. As shown in Fig. 1, the sound output device 100 according to the present disclosure includes a microphone 110, a sound output unit 120, a control unit 130, a sound input unit 140, and a sensor 150. Although not shown in Fig. 1, the device may have a storage unit for storing various information. Below, these configurations will be described in order.

Specifically, the audio output device 100 is a device that allows a user to listen to audio content while walking, running, or the like. As such a device, the audio output device 100 is a device that is configured from a portable information terminal such as a smartphone or an audio player, and headphones or earphones. Another specific example is a device that allows a user to talk to other users or listen to audio content while riding a bicycle, climbing a mountain, trekking, or the like. As such a device, the audio output device 100 is a device that is configured from a helmet-integrated headset with a communication function, a portable information terminal such as a smartphone or an audio player, and a neck speaker. For this reason, the audio output device 100 may be a single device, or may be configured such that a device configured from the control unit 130, audio input unit 140, and sensor 150 and a device configured from the microphone 110 and audio output unit 120 are connected by wire or wirelessly.

The microphone 110 picks up various sounds. The microphone 110 includes a microphone L111 and a microphone R112.

Microphone L111 is a microphone provided on the left side of the user. Microphone L111 picks up environmental sounds around the user using audio output device 100, for example, for external sound capture or noise cancellation. Microphone R112 is a microphone provided on the right side of the user. Note that the function of microphone R112 is the same as that of microphone L111, so a description thereof will be omitted.

When the audio output device 100 is attached to a helmet worn by a user, the microphone 110 is arranged on the outside of the helmet, for example, on the left or right side. When the audio output unit 120 is configured as an earphone, headphone, neck speaker, or the like, the microphone 110 is arranged near the audio output unit 120, and is a so-called microphone for capturing external sound or a microphone for noise cancellation.

The audio output unit 120 outputs audio to the user. The audio output unit 120 includes an audio output unit L121 and an audio output unit R122.

The audio output unit L121 is an audio output device such as a speaker provided on the left side of the user. The audio output unit L121 outputs various sounds to the user. For example, the audio output unit L121 outputs the voice of a user of another audio output device 100. The audio output unit R122 is an audio output device such as a speaker provided on the right side of the user. The function of the audio output unit R122 is the same as that of the audio output unit L121, so a description thereof will be omitted.

When the audio output unit 120 is attached to a helmet worn by the user, it is positioned so as not to block the user's ears when the helmet is worn. When the audio output unit 120 is configured as earphones, headphones, a neck speaker, or the like, it is the speaker or sound-producing element of those devices.

The audio input unit 140 receives audio content from a storage unit (not shown) that stores audio content, or a communication unit (not shown) that acquires audio content, and outputs the input audio content to the audio output unit 120.

The sensor 150 is a sensor or device that detects or measures the state of the audio output device 100 or the state of the user using the audio output device 100.

The sensor 150 may be a gyro sensor or an acceleration sensor. The gyro sensor generates a primary vibration that vibrates in one direction in the movable electrode, and when rotation is applied to the movable electrode, a secondary vibration occurs due to the Coriolis force acting in a direction 90° from the vibration direction, causing a change in capacitance, and may be a capacitive MEMS (Micro Electro Mechanical Systems) gyro sensor that detects this. The angular velocity can be determined from the change in capacitance and the vibration phase of the movable electrode. The acceleration sensor may be, for example, a capacitive acceleration sensor that creates a movable electrode and a fixed electrode using MEMS, and measures acceleration using the relationship between the acceleration and the change in capacitance caused by the movement of the movable electrode.

The sensor 150 may be a device worn by the user, such as a smart watch. In other words, the sensor 150 may be replaced by the functions of a device worn by the user, or the functions of the device worn by the user may be used to complement the sensor 150 provided in the audio output device 100. By using such a device, the user's activity level, heart rate, blood oxygen concentration, etc. can be obtained, making it possible to obtain the user's biological information more appropriately. The gyro sensor and acceleration sensor may be provided in the audio output device 100, or in a device worn by the user.

The control unit 130 is a controller that manages and controls the audio output device 100. The control unit 130 is realized by a processor such as a CPU (Central Processing Unit) or MPU (Micro Processing Unit) that executes various programs using RAM as a working area. The control unit 130 may also be realized by an integrated circuit such as an ASIC (Application Specific Integrated Circuit) or FPGA (Field Programmable Gate Array).

As shown in FIG. 1, the control unit 130 includes an audio processing unit 131 and a detection unit 132 as functional blocks realized by program execution, circuit configuration, and the like. The control unit 130 may execute these processes using a single CPU, or may include multiple CPUs and execute these processes in parallel using the multiple CPUs. Each of these configurations will be described below in order.

The audio processing unit 131 performs various processes on the audio input to the audio output device 100 and the audio output by the audio output device 100. The audio processing unit 131 includes, as functional blocks, an audio acquisition unit 1311, an ambient sound acquisition unit 1312, and an audio output control unit 1313.

The audio acquisition unit 1311 acquires the audio input from the audio input unit 140. The audio input unit 140 performs a decoding process corresponding to the codec of the audio input from the audio input unit 140.

The ambient sound acquisition unit 1312 acquires audio around the user. That is, the ambient sound acquisition unit 1312 acquires audio around the user from microphone L111 and microphone R112. The ambient sound acquisition unit 1312 may perform any filtering process on the audio acquired from microphone 110. A frequency selection filter may be used for the ambient sound acquisition unit 1312. The frequency selection filter can acquire audio in a frequency band corresponding to a frequency selected from the frequency distribution of the audio data. As the frequency selection filter, for example, a band pass filter, band stop filter, high pass filter, or low pass filter may be used.

The audio output control unit 1313 controls the output of audio to the audio output unit 120. Specifically, the audio output control unit 1313 controls the output of the audio acquired by the audio acquisition unit 1311 and the ambient sound acquired by the ambient sound acquisition unit 1312 to the audio output unit 120. The audio output control unit 1313 also controls the on/off of the audio output to the audio output unit 120 and the adjustment of the volume of the audio output to the audio output unit 120 for the audio acquired by the audio acquisition unit 1311 and the ambient sound acquired by the ambient sound acquisition unit 1312. The audio output control unit 1313 also performs noise cancellation processing on the audio acquired by the audio acquisition unit 1311 using the ambient sound acquired by the ambient sound acquisition unit 1312.

When the detection unit 132 detects that the user is under a large physical load, the audio output control unit 1313 makes the audio output unit 120 in a state where the user's surrounding sounds are easily heard. Specifically, the audio output control unit 1313 controls the audio output unit 120 to output the surrounding sounds acquired by the surrounding sound acquisition unit 1312 so that the user can hear the surrounding sounds, in other words, to make the user in a state where the surrounding sounds are easily heard.

An example of the output of ambient sound to the audio output unit 120 by the audio output control unit 1313 is described below.

For example, if the audio output unit 120 is in a form that covers the user's ears, such as earphones or headphones, and the audio content acquired by the audio acquisition unit 1311 is not output to the audio output unit 120, the audio output control unit 1313 outputs ambient sound to the audio output unit 120.

Similarly, when the audio output unit 120 is in a form that covers the user's ears, such as earphones or headphones, and the audio content acquired by the audio acquisition unit 1311 is output to the audio output unit 120, the audio output control unit 1313 mixes the audio content with ambient sound and outputs it, or lowers the volume of the audio content and mixes in ambient sound and outputs it.

In addition, when the audio output unit 120 is in a form that does not cover the user's ears, such as a headset built into a helmet or a neck speaker, the audio output control unit 1313 outputs the ambient sound to the audio output unit 120 at a relatively high volume. A relatively high volume is a volume that is higher than the volume at which the user hears the ambient sound directly, since the user can hear the ambient sound directly.

Furthermore, when the detection unit 132 detects that the user is under a heavy physical load, the audio output control unit 1313 reduces or stops the noise cancellation effect based on the audio around the user acquired by the ambient sound acquisition unit 1312 for the audio to be output to the user. In other words, when the detection unit 132 detects that the user is under a heavy physical load, the audio output control unit 1313 reduces or stops the noise cancellation effect based on the audio acquired by the ambient sound acquisition unit 1312 for the audio to be output to the user from the audio output unit 120. Note that noise cancellation may be achieved by outputting from the audio output unit 120 a sound that is in the opposite phase to the ambient audio acquired by the ambient sound acquisition unit 1312, thereby canceling out the sound that is considered to be noise.

This section describes an example of the reduction in the noise cancellation effect on the audio output unit 120 by the audio output control unit 1313.

For example, when the audio output unit 120 is in a form that covers the user's ears, such as earphones or headphones, and the audio content acquired by the audio acquisition unit 1311 is output to the audio output unit 120 while noise cancellation processing is being performed, the audio output control unit 1313 reduces the noise cancellation effect and stops the noise cancellation processing. The audio output control unit 1313 may lower or mute the volume of the audio content in response to the reduction in the noise cancellation effect and the stop of the noise cancellation processing.

As an example of reducing the noise cancellation effect on the audio output unit 120, when the noise cancellation process is stopped, the audio output control unit 1313 stops outputting the inverse phase sound of the ambient sound acquired by the ambient sound acquisition unit 1312 to the audio output unit 120. When reducing the noise cancellation effect, the audio output control unit 1313 performs processing such as lowering the output level of the inverse phase sound of the ambient sound acquired by the ambient sound acquisition unit 1312 and outputting it to the audio output unit 120.

The detection unit 132 detects that the user's state is a state of high physical load. The detection unit 132 detects that the user's state is a state of high physical load based on information indicating the results of detection or measurement by the sensor 150. The detection unit 132 may detect that the user's state is a state of high physical load using not only information acquired from the sensor 150, but also current position information based on signals from positioning satellites received by a GNSS (Global Navigation Satellite System) receiving unit (not shown), map information or topographical information acquired via a communication unit (not shown), or information provided by the audio output device.

The detection unit 132 detects that the user's physical load is high based on the user's operational load. Specifically, the detection unit 132 detects that the user's operational load is high as a state in which the user's physical load is high.

The state in which the user's operational load is high that is detected by the detection unit 132 is, for example, a state in which the user is walking uphill. In other words, the detection unit 132 detects that the user is walking uphill as a state in which the user's physical load is high. The state in which the user is walking uphill can be any state in which the user's operational load is high, such as a state in which the user is walking uphill, a state in which the user is running uphill, or a state in which the user is pedaling uphill on a bicycle, and the method of movement does not matter. In addition, an uphill slope is an uphill slope with a gradient of a predetermined level or more that increases the user's operational load, and includes stairs. Such a state can also be said to be in an environment in which the user's operational load is high.

The detection unit 132 detects that the user is going uphill from information, for example, from the gyro sensor, acceleration sensor, etc. of the sensor 150. The detection unit 132 acquires information on the user's inclination and detects that the user is going uphill from the average value of the inclination. The detection unit 132 also detects that the user is walking, running, or pedaling a bicycle from continuous vibrations and changes in inclination based on information from the gyro sensor, acceleration sensor, etc. of the sensor 150. For example, the detection unit 132 detects that the user is going uphill when the direction of gravitational acceleration indicated by the gyro sensor, acceleration sensor, etc. of the sensor 150 changes to point backward from the user's traveling direction.

The detection unit 132 calculates the moving speed based on the moving distance per unit time of the current position information based on the signal from the positioning satellite received by the GNSS receiving unit, and detects whether the user is walking uphill, running uphill, or even pedaling uphill on a bicycle, based on continuous vibrations from the gyro sensor and acceleration sensor of the sensor 150, thereby detecting whether the user is under a heavy load of motion.

The detection unit 132 may detect that the user is under a heavy operational load by detecting that the user is moving uphill based on current location information based on signals from positioning satellites received by the GNSS receiving unit, and based on map information and topographical information.

The detection unit 132 detects, for example, that the user is moving at a predetermined speed or faster as a state in which the user's physical load is high. The detection unit 132 detects that the person holding the audio output device 100 is running from information from the gyro sensor, acceleration sensor, etc. of the sensor 150. In other words, the detection unit 132 detects that the user is running (in other words, moving at a walking speed or faster) as a state in which the user's operational load is high.

The detection unit 132 may detect that the user is moving at a predetermined speed or faster based on the distance traveled per unit time of the current location information based on a signal from a positioning satellite received by the GNSS receiving unit.

The detection unit 132 detects that the user is in a state of high physical load based on the user's fatigue state. Specifically, the detection unit 132 detects that the user is in a fatigue state as a state of high physical load.

The user being in a fatigued state detected by the detection unit 132 means that the user is in a physically fatigued state. The detection unit 132 detects that the user is in a fatigued state, for example, based on the user's heart rate variability acquired from the sensor 150. The detection unit 132 stores the average value of the user's heart rate variability based on the user's heart rate detected by the sensor 150, and detects that the user is in a fatigued state by detecting that the value of the user's heart rate variability has dropped from the normal range.

The detection unit 132 detects that the user is in a state of high physical load based on the user's stress level. Specifically, the detection unit 132 detects that the user's stress level is high as the user's state of high physical load.

The detection unit 132 detects the user's stress level based on, for example, the user's heart rate variability acquired from the sensor 150. The detection unit 132 stores the average value of the user's heart rate variability based on the user's heart rate detected by the sensor 150, detects the user's stress level by detecting that the user's heart rate variability value has fallen below the normal range, and detects that the user's stress level is high based on the degree of deviation from the average value.

The detection unit 132 can detect a state in which the user's physical load is high by a variety of sensing methods and techniques, not limited to those described above, and they can also be used in combination.

(First aspect of processing of audio output device)
Next, a first aspect of the processing of the audio output device 100 according to the present disclosure will be described with reference to FIG. 2. FIG. 2 is a flowchart showing a flow of the first aspect of the processing of the audio output device according to the present disclosure. FIG. 2 shows an example of an audio output method executed by the audio output device according to the present disclosure, and is also an example of processing based on a program executed by the control unit 130. The first aspect of the processing of the audio output device 100 according to the present disclosure will be described along the flow shown in FIG. 2.

2 is started when the power supply of the audio output device 100 is turned on, or when an application that executes the process shown in FIG. 2 is started in the audio output device 100. The process shown in FIG. 2 may also be started when a user of the audio output device 100 puts on a helmet, earphones, headphones, neck speaker, or the like that is equipped with the audio output unit 120. The process shown in FIG. 2 may also be started at any timing when a user of the audio output device 100 operates the audio output device 100 while using the audio output device 100. The process shown in FIG. 2 may or may not involve the audio acquired by the audio acquisition unit 1311 being output to the audio output unit 120.

First, the audio output device 100 detects whether or not the physical load of the user of the audio output device 100 is high (step S101). Specifically, the detection unit 132 detects whether or not the physical load of the user of the audio output device 100 is high. A state in which the physical load of the user of the audio output device 100 is high is based on the detection of a state in which the user's operating load is high, the user being fatigued, the user's stress level being high, and the like.

If it is detected in step S101 that the user is under a large physical load (step S101: Yes), the audio output device 100 starts outputting external audio (step S102). Specifically, the audio output device 100 outputs the audio acquired by the ambient sound acquisition unit 1312 to the audio output unit 120. At this time, if the audio output device 100 is a device that performs audio calls with other audio output devices and a voice call is being performed, the audio acquired by the ambient sound acquisition unit 1312 is output in addition to the audio of the voice call. Also, if the audio output device 100 is a device that listens to audio content and the audio content is being output, the audio acquired by the ambient sound acquisition unit 1312 is output in addition to the audio of the audio content. In other words, the audio output device 100 outputs the audio acquired by the ambient sound acquisition unit 1312 in addition to the audio input from the audio input unit 140. The sound acquired by the ambient sound acquisition unit 1312 is output in addition to the sound input from the sound input unit 140; in other words, both the sound acquired by the ambient sound acquisition unit 1312 and the sound input from the sound input unit 140 are output from the sound output unit 120.

Next, the audio output device 100 determines whether the process shown in FIG. 2 has ended (step S103). The process shown in FIG. 2 is determined to have ended when the audio output device 100 is powered off, or when an application that executes the process shown in FIG. 2 has ended in the audio output device 100. The process may also be determined to have ended when the user of the audio output device 100 removes the helmet, earphones, headphones, neck speaker, or the like that is equipped with the audio output unit 120. The process may also be determined to have ended when the user of the audio output device 100 performs an operation on the audio output device 100 while using the audio output device 100, resulting in an end operation being performed at any timing.

If it is determined in step S103 that the processing has not ended (step S103: No), the audio output device 100 determines whether or not an external sound is being output (step S104). If it is determined that an external sound is being output (step S104: Yes), the audio output device 100 determines whether or not a state of high physical load continues (step S105). If the determination in step S104 has progressed from No in step S101, the processing in step S102 has not been performed, and therefore an external sound is not being output; if the determination in step S104 has progressed from Yes in step S101, the processing in step S102 has been performed, and therefore an external sound is being output.

If it is determined in step S105 that the state of high physical load continues (step S105: Yes), the determination in step S105 is executed again. Note that the period during which the determination in step S105 is Yes may also include a determination as to whether or not the processing has ended, as in step S103. If it is determined in step S105 that the state of high physical load does not continue (step S105: No), the audio output device 100 stops outputting external audio (step S106).

After step S106, the audio output device 100 determines whether the processing shown in FIG. 2 has ended (step S107), similar to step S103. If it is determined that the processing has ended (step S107: Yes), the audio output device 100 ends the processing shown in FIG. 2.

If it is not detected in step S101 that the physical load is high (step S101: No), the audio output device 100 proceeds to step S103 and executes the processes from step S103 onward.

If it is determined in step S103 that the processing has ended (step S103: Yes), the audio output device 100 ends the processing shown in FIG. 2.

Also, if it is determined in step S104 that external audio is not being output (step S104: No), the audio output device 100 proceeds to the process of step S107 and executes the processes from step S107 onwards.

According to the processing of the first aspect of the audio output device 100 described above, when it is detected that the user is in a state of high physical load, ambient sounds can be output to the user. Therefore, it is possible to provide an audio output device 100 that allows the user to properly understand the situation around them.

(Second aspect of processing of audio output device)
Next, a second aspect of the processing of the audio output device 100 according to the present disclosure will be described with reference to FIG. 3. FIG. 3 is a flowchart showing a flow of the second aspect of the processing of the audio output device according to the present disclosure. FIG. 3 shows an example of an audio output method executed by the audio output device according to the present disclosure, and is also an example of processing based on a program executed by the control unit 130. The second aspect of the processing of the audio output device 100 according to the present disclosure will be described along the flow shown in FIG. 3.

The process in FIG. 3 is applied when the audio output unit 120 in the audio output device 100 is an earphone or a headphone equipped with a noise cancellation function. The process in FIG. 3 is also based on the premise that the audio output device 100 performs noise cancellation processing on the audio content acquired from the audio input unit 140 based on the environmental sound acquired by the microphone 110.

Steps S201, S203, S205, and S207 shown in FIG. 3 are the same as steps S101, S103, S105, and S107 shown in FIG. 2, so their explanations are omitted.

If it is detected in step S201 that the user is under a large physical load (step S201: Yes), the audio output device 100 limits noise canceling (N/C) (step S202). Specifically, the audio output device 100 reduces or stops the noise canceling effect based on the sound acquired by the ambient sound acquisition unit 1312 for the sound to be output to the user.

Next, the audio output device 100 determines whether the processing shown in FIG. 3 has ended (step S203). If it is determined in step S203 that the processing has not ended (step S203: No), the audio output device 100 determines whether the noise canceling restriction state continues (step S204). If it is determined that the noise canceling restriction state continues (step S204: Yes), the audio output device 100 determines whether the state of high physical load continues (step S205). If the determination in step S204 has progressed from No in step S201, the processing in step S202 has not been performed, so the noise canceling restriction state is not continuing, and if the determination has progressed from Yes in step S101, the processing in step S202 has been performed, so the noise canceling restriction state is continuing.

If it is determined in step S205 that the state of high physical load is not continuing (step S205: No), the audio output device 100 removes the restriction on noise canceling (step S206).

If it is determined in step S204 that the noise canceling restriction state is not continuing (step S204: No), the audio output device 100 proceeds to step S207 and executes the processes from step S207 onward.

According to the processing of the second aspect of the audio output device 100 described above, when it is detected that the user is in a state of high physical load, the noise cancellation effect can be reduced or stopped. This makes it easier for the user to hear surrounding sounds. Therefore, it is possible to provide an audio output device 100 that allows the user to properly grasp the surrounding situation.

(Configuration of audio output device)
Second Embodiment
Next, the audio output device 100 according to the second embodiment will be described with reference to FIG. 4. FIG. 4 is a diagram showing a configuration example of the second embodiment of the audio output device according to the present disclosure. In the audio output device 100 according to the second embodiment, the audio input unit 140 is changed to a communication control unit 160 and a wireless communication module 170 compared to the audio output device 100 according to the first embodiment. In addition, a microphone 101 for calling is added, and the function of the audio acquisition unit 1311 is different. Other than the above, the audio output device 100 according to the first embodiment is the same as the audio output device 100 according to the first embodiment, so the description will be omitted.

The audio output device 100 shown in FIG. 4 is specifically a device that allows a user to communicate with other users via wireless communication while riding a bicycle, climbing a mountain, trekking, or the like. As such a device, the audio output device 100 is a device that is composed of a helmet-mounted headset with communication capabilities, a portable information terminal such as a smartphone, and a neck speaker.

The communication control unit 160 controls wireless communication by the audio output device 100. As shown in FIG. 4, the communication control unit 160 includes a transmission control unit 161 and a reception control unit 162.

The transmission control unit 161 transmits the audio acquired by the audio acquisition unit 1311 to another audio output device 100 that has been set in advance. In other words, the transmission control unit 161 transmits the audio acquired by the audio acquisition unit 1311 to the other audio output device 100 that has been set in advance via wireless communication such as a public communication network, or directly.

The reception control unit 162 controls the reception of audio transmitted from another audio output device 100. For example, the reception control unit 162 may control the reception of audio transmitted from another audio output device 100 that has been set in advance.

The wireless communication module 170 performs wireless communication with other audio output devices 100. The wireless communication module 170 is, for example, a wireless communication module for performing wireless communication such as Wi-Fi (registered trademark) or 5G, or a wireless communication module for performing medium-range wireless communication using Bluetooth (registered trademark) or digital simple wireless communication.

The call microphone 101 picks up the speech of the user of the audio output device 100 in order to make a voice call via wireless communication between the audio output device 100 and another audio output device 100. When the audio output device 100 is attached to a helmet or the like worn by a user, the call microphone 101 is placed in a position close to the user's mouth when the user is wearing the helmet.

The voice acquisition unit 1311 in the control unit 130 acquires the voice picked up by the call microphone 101, which picks up the user's speech. In other words, it can be said that the voice acquisition unit 1311 detects the user's speech. The voice acquisition unit 1311 also acquires the voice received by the communication control unit 160 from another audio output device, and outputs the voice picked up by the call microphone 101 to the communication control unit 160.

The processing of the audio output device 100 shown in FIG. 4 is as shown in FIG. 2 in the first embodiment.

(Composition and Effects)
The audio output device 100 according to the present disclosure includes a detection unit 132 that detects whether the user's state is one in which the user's physical load is high, an audio output unit 120 that outputs audio to the user, and an audio output control unit 1313 that, when the detection unit 132 detects that the user's physical load is high, causes the audio output unit 120 to make it easier for the user to hear sounds around the user.

This configuration makes it possible to make the user's surrounding sounds easier to hear when the user is under a large physical load. As a result, it is possible to provide an audio output device 100 that allows the user to properly understand the situation around them.

The audio output device 100 according to the present disclosure further includes an ambient sound acquisition unit 1312 that acquires audio around the user, and when the detection unit 132 detects that the user is under a high level of physical stress, the audio output control unit 1313 outputs the audio around the user acquired by the ambient sound acquisition unit 1312 to the user.

With this configuration, when it is detected that the user is under a large physical load, ambient sounds can be output to the user. Therefore, it is possible to provide an audio output device 100 that allows the user to properly understand the situation around them.

The audio output device 100 according to the present disclosure further includes an ambient sound acquisition unit 1312 that acquires audio around the user, and when the detection unit 132 detects that the user is under a high level of physical stress, the audio output control unit 1313 reduces or stops the noise cancellation effect based on the audio around the user acquired by the ambient sound acquisition unit 1312 for the audio to be output to the user.

With this configuration, when it is detected that the user is under a large physical load, the noise cancellation effect can be reduced or stopped. This makes it easier for the user to hear surrounding sounds. Therefore, it is possible to provide an audio output device 100 that allows the user to properly understand the surrounding situation.

The detection unit 132 of the audio output device 100 according to the present disclosure detects a state in which the user's physical load is high, such as when the user is walking uphill or moving at a predetermined speed or higher, as a state in which the user's physical load is high.

This configuration makes it possible to provide an audio output device 100 that allows the user to properly understand the surrounding situation when the user is in a state where the user's physical burden is high, such as when the user is walking uphill or moving at a predetermined speed or faster.

The detection unit 132 of the audio output device 100 according to the present disclosure detects when the user is in a fatigued state or in a state of high stress level as a state in which the user's physical load is high.

With this configuration, when the user is in a fatigued state or has a high level of stress, it can be detected as a state in which the user's physical load is high.

The audio output method executed by the audio output device 100 according to the present disclosure includes a detection step of detecting that the user's state is one in which the user's physical load is high, and an audio output control step of making the user's surrounding sounds easier to hear in relation to the audio output to the user when it is detected that the user's physical load is high.

This configuration makes it possible to make the user's surrounding sounds easier to hear when the user is under a large physical load. This makes it possible to provide a method for controlling the audio output device 100 that allows the user to properly understand the situation around them.

The above describes an embodiment of the present disclosure, but the embodiment is not limited to the contents of this embodiment. The above-mentioned components include those that a person skilled in the art can easily imagine, those that are substantially the same, and those that are within the so-called equivalent range. Furthermore, the above-mentioned components can be combined as appropriate. Furthermore, various omissions, substitutions, or modifications of the components can be made without departing from the spirit of the above-mentioned embodiment.

The audio output device, audio output method, and program according to this embodiment can be used, for example, as an audio output device, audio output method, and program that allows a user to properly understand the surrounding situation.

100 Audio output device 110 Microphone 111 Microphone L
112 Microphone R
120 Audio output unit 121 Audio output unit L
122 Audio output unit R
130 Control unit 131 Audio processing unit 1311 Audio acquisition unit 1312 Ambient sound acquisition unit 1313 Audio output control unit 132 Detection unit 140 Audio input unit 150 Sensor 160 Communication control unit 161 Transmission control unit 162 Reception control unit 170 Wireless communication module

Claims

A detection unit that detects that a state of a user is a state in which a physical load of the user is large;
a voice output unit for outputting voice to the user;
and an audio output control unit that, when the detection unit detects that the user is in a state of high physical load, makes the audio output unit in a state where the user can easily hear ambient sounds.
Audio output device.
An ambient sound acquisition unit that acquires sounds around the user,
The audio output control unit outputs, to the user, audio around the user acquired by the ambient sound acquisition unit when the detection unit detects that the user is in a state of high physical load.
The audio output device according to claim 1 .
An ambient sound acquisition unit that acquires sounds around the user,
When the detection unit detects that the physical load of the user is high, the audio output control unit reduces or stops a noise cancellation effect based on the audio around the user acquired by the ambient sound acquisition unit for the audio to be output to the user.
The audio output device according to claim 1 .
The detection unit detects a state in which the user's operational load is high as a state in which the user's physical load is high.
The audio output device according to claim 1 .
The detection unit detects that the user is in a fatigued state as a state in which the user is under a large physical load.
The audio output device according to claim 1 .
The detection unit detects that the user's stress level is high as a state in which the user's physical load is high.
The audio output device according to claim 1 .
A detection step of detecting that a state of the user is a state in which a physical load of the user is large;
and when it is detected that the physical load of the user is high, a sound output control step of making the sound output to the user easy to hear sounds around the user.
An audio output method executed by an audio output device.
A processor for controlling an audio output device
A detection process for detecting that a user's state is a state in which the user's physical load is high;
A program for executing a process including: when it is detected that the user is in a state of high physical load, making the user's surrounding sounds easier to hear in relation to the audio output to the user.