EP3328090A1 - System and method for enabling communication of ambient sound as an audio stream - Google Patents
- Publication number
- EP3328090A1 EP16201116.7A
- Authority
- EP
- European Patent Office
- Prior art keywords
- audio stream
- communication device
- indication
- playback
- headphone
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R1/00—Details of transducers, loudspeakers or microphones
- H04R1/10—Earpieces; Attachments therefor ; Earphones; Monophonic headphones
- H04R1/1041—Mechanical or electronic switches, or control elements
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R29/00—Monitoring arrangements; Testing arrangements
- H04R29/001—Monitoring arrangements; Testing arrangements for loudspeakers
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R1/00—Details of transducers, loudspeakers or microphones
- H04R1/10—Earpieces; Attachments therefor ; Earphones; Monophonic headphones
- H04R1/1083—Reduction of ambient noise
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R2420/00—Details of connection covered by H04R, not provided for in its groups
- H04R2420/01—Input selection or mixing for amplifiers or loudspeakers
Definitions
- the present disclosure relates to a communication device connected to a headphone and to enabling communication of ambient sound, including outputting an audio stream to the headphone for playback to a user of the communication device.
- the method comprises outputting a first audio stream to the headphone for playback to a user of the communication device.
- the method also comprises, via an interface of the communication device to its surroundings, obtaining an indication that the playback of the first audio stream should be altered.
- the method also comprises, in response to the obtained indication, altering the output of the first audio stream; by means of a microphone of the communication device, recording a second audio stream; and outputting the second audio stream to the headphone for playback to the user.
- a computer program product comprising computer-executable components for causing a communication device to perform an embodiment of the method of the present disclosure when the computer-executable components are run on processing circuitry comprised in the communication device.
- a communication device comprising processing circuitry, and storage storing instructions executable by said processing circuitry whereby said communication device is operative to output a first audio stream to a headphone for playback to a user of the communication device.
- the communication device is also operative to, via an interface of the communication device to its surroundings, obtain an indication that the playback of the first audio stream should be altered.
- the communication device is also operative to, in response to the obtained indication: alter the output of the first audio stream; by means of a microphone of the communication device, record a second audio stream; and output the second audio stream to the headphone for playback to the user.
- the user of the communication device wearing the headphone(s) may better hear the ambient sound (via the microphone) without the need to remove the headphone(s). This may be called a voice mode of the device.
- FIGS 1a-d and 2a-d illustrate steps of some embodiments of the present invention.
- a communication device 1 is communicatively connected to a headphone or headphones 2 worn by a person 3 who is herein called a user of the communication device.
- the headphone(s) comprises speakers for playing back audio, e.g. music, to the user 3, and is e.g. arranged in, on or over an ear (or both ears) of the user.
- the communication device 1 may e.g. be configured for wired power supply or comprise a battery.
- the communication device may e.g. be a radio device such as any device or user equipment (UE), mobile or stationary, enabled to communicate over a radio channel in a communication network, for instance but not limited to e.g. mobile phone, smartphone, media players, or any type of consumer electronic, for instance but not limited to television, radio, tablet computer, laptop, or personal computer (PC).
- the device 1 is communicatively connected, wired or wirelessly, to the headphone 2 via a headphone interface 8 for outputting an audio stream to the headphone 2 which may then be played back to the user by means of its speakers.
- the headphone interface may comprise a receiver for a headphone connector such as a 3.5 mm connector, a Lightning connector or a USB connector (e.g. a micro USB or USB-C).
- the headphone interface may comprise a radio interface e.g. for Bluetooth, Local Area Network (LAN) or Wi-Fi, or Near-Field Communication (NFC).
- the device may also comprise a communication interface for a data connection e.g. to the Internet, which may be wired or wireless e.g. in accordance with a LAN or Third Generation Partnership Project (3GPP) communication standard.
- the device 1 also comprises a microphone interface 5, e.g. a microphone, and a User Interface (UI) 4 e.g. a Graphical UI (GUI) optionally comprising a touchscreen. Additionally or alternatively, the UI 4 may comprise mechanical buttons or keys.
- UI User Interface
- GUI Graphical UI
- a user 3 listens to an audio stream (herein called a first audio stream) e.g. music or an audio book, by means of the headphone 2 connected to the device 1.
- the first audio stream may be of a media file, or playlist of a plurality of media files, which is e.g. stored in a storage in the device 1 or streamed by the device from an external media server (being buffered in the device) and outputted to the headphone.
- the user may e.g. be working or travelling and uses the headphone and the first audio stream to avoid disturbing ambient sounds.
- the user 3 decides that he/she wants to, e.g. temporarily, hear ambient sound e.g. of what another person 6 is saying.
- the user 3 uses the UI 4 to input a command to the device 1 to put the device in what is herein called voice mode, whereby the device receives an indication that the playback of the first audio stream should be altered in accordance with the voice mode.
- the UI comprises a touchscreen
- the user may e.g. input the command by making a touch gesture or by pressing a graphical element 7 of the GUI, which graphical element is associated with the voice mode and thus provides the indication to the device.
- the graphical element 7 may e.g. be presented by a software (SW) application (app) or widget running in the device, e.g. integrated in a media player in the device. The user may thus easily switch to voice mode by interaction via the UI 4.
- SW software
- the switching to voice mode may be initiated automatically, without the need for the user 3 to interact with the device 1 via the UI 4.
- the device 1 detects a predefined sound by means of the microphone 5.
- the device has been preprogrammed to associate this sound with an indication that the device should be put in voice mode.
- the microphone may thus be active and, when the sound is detected, the device 1 is automatically put in voice mode.
- the sound may e.g. be a human voice.
- the human voice may have to have a volume which is above a predetermined threshold, e.g. a static threshold or a threshold which is relative to background noise, in order to qualify as an indication for putting the device in voice mode.
- the human voice may have to speak a predetermined phrase, e.g. an activation word or phrase such as a name of the user 3.
- another person 6, or e.g. a speaker system in a train or plane, may automatically activate the voice mode without the user 3 having to see that the other person 6 is trying to make contact or without the other person having to speak loudly to be heard over the playback of the first audio stream. This may make it easier and less awkward to make contact with the user 3. For instance, if the user 3 is working while listening via headphones 2, it may be socially awkward to approach him/her, which may require either entering the field of vision of the user 3, gesturing or tapping him/her, or talking really loudly in order to get noticed and start a conversation.
- Figures 1c and 2c, respectively, show the situation after the device 1 has been put in voice mode, e.g. following any of the situations of figures 1b or 2b.
- the output of the first audio stream to the headphone 2 has been altered, e.g. such that the playback by means of the speakers in the headphone has been interrupted (stopped), muted, faded out, or reduced in volume, in order to allow the user 3 to hear ambient sound.
- the ambient sound is obtained/recorded by means of the microphone 5 and outputted to the headphone 2 as a second audio stream for playback to the user via the speakers.
- the ambient sound of the second audio stream typically comprises a human voice, and in some embodiments an audio filter (typically a digital audio filter) may be used to enhance the human voice and/or reduce noise before outputting the second audio stream to the headphone.
- visual feedback to the user that the voice mode is active may be presented by means of the GUI 4.
- the user may hear another person (or a speaker system) via the microphone 5 in the device 1 and the speakers in the headphone 2, without the need for removing the headphone(s).
- the device 1 may be kept in voice mode until the device, e.g. via an interface (e.g. UI 4 and/or microphone 5), obtains an indication that the playback of the first audio stream should be restored to as before the obtaining of the indication that the playback of the first audio stream should be altered.
- the device 1 may discontinue the recording and outputting of the second audio stream, and alter the output of the first audio stream such that the playback of the first audio stream is restored to as it was before the obtaining of the indication that the playback of the first audio stream should be altered (e.g. as discussed in respect of figures 1a and 2a ).
- FIGS. 1d and 2d illustrate embodiments of the present invention after the playback of the first audio stream should be restored, similar to figures 1a and 2a.
- the first audio stream output may be similarly restored, e.g. resumed (started), unmuted, faded in, or increased in volume.
- the indication that the playback of the first audio stream should be altered was obtained via the UI 4
- the indication that the first audio stream should be restored may similarly be obtained via the UI 4, e.g. by making a touch gesture or by the user pressing the same, or a different, graphical element 7 of the GUI, or by releasing pressure on said graphical element 7 if the voice mode is only active while the user is continuously pressing the graphical element.
- the indication that the playback of the first audio stream should be altered was obtained via the microphone 5
- the indication that the first audio stream should be restored may similarly be obtained via the microphone 5, e.g. by detecting that the human voice is no longer heard, or is below a predetermined volume threshold, during a predetermined time period.
- the indication that the first audio stream should be restored may be obtained by the expiry of a timer which was activated when the device 1 was put in the voice mode.
- FIG. 3 schematically illustrates an embodiment of a communication device 1 of the present disclosure.
- the device 1 comprises processing circuitry 31 e.g. a central processing unit (CPU).
- the processing circuitry 31 may comprise one or a plurality of processing units in the form of microprocessor(s).
- ASIC application specific integrated circuit
- FPGA field programmable gate array
- CPLD complex programmable logic device
- the processing circuitry 31 is configured to run one or several computer program(s) or software (SW) 41 (see also figure 4 ) stored in a storage 32 of one or several storage unit(s) e.g. a memory.
- the storage unit is regarded as a computer readable means 42 (see figure 4 ) as discussed herein and may e.g. be in the form of a Random Access Memory (RAM), a Flash memory or other solid state memory, or a hard disk, or be a combination thereof.
- the processing circuitry 31 may also be configured to store data in the storage 32, as needed.
- the SW 41 may comprise SW for making the device perform embodiments of the method of the present disclosure.
- the SW 41 may e.g. comprise app SW 33 which, when run on the processing circuitry 31 forms the app 34 by means of which the device 1 may perform at least a part of embodiments of the method.
- the device 1 also comprises the audio output/headphone interface 8, the microphone 5 and the UI 4 as previously discussed.
- Figure 4 illustrates an embodiment of a computer program product 40.
- the computer program product 40 comprises a computer readable (e.g. nonvolatile and/or non-transitory) medium 42 comprising software/computer program 41 in the form of computer-executable components.
- the computer program 41 may be configured to cause a device 1, e.g. as discussed herein, to perform an embodiment of the method of the present disclosure.
- the computer program may be run on the processing circuitry 31 of the device 1 for causing it to perform the method.
- the computer program product 40 may e.g. be comprised in a storage unit or memory 32 comprised in the device 1 and associated with the processing circuitry 31.
- the computer program product 40 may be, or be part of, a separate, e.g.
- a computer readable disc such as CD or DVD or hard disc/drive, or a solid state storage medium, e.g. a RAM or Flash memory.
- the storage medium can include, but is not limited to, any type of disk including floppy disks, optical discs, DVD, CD-ROMs, microdrive, and magneto-optical disks, ROMs, RAMs, EPROMs, EEPROMs, DRAMs, VRAMs, flash memory devices, magnetic or optical cards, nanosystems (including molecular memory ICs), or any type of media or device suitable for storing instructions and/or data.
- Embodiments of the present disclosure may be conveniently implemented using one or more conventional general purpose or specialized digital computer, computing device, machine, or microprocessor, including one or more processors, memory and/or computer readable storage media programmed according to the teachings of the present disclosure.
- Appropriate software coding can readily be prepared by skilled programmers based on the teachings of the present disclosure, as will be apparent to those skilled in the software art.
- FIG. 5 is a schematic flow chart of some embodiments of the method of the present invention.
- the method is performed by a communication device 1 communicatively connected to a headphone 2.
- the method comprises outputting S1 a first audio stream to the headphone for playback to a user 3 of the communication device.
- the method also comprises, via an interface (e.g. UI 4 and/or microphone 5) to the surroundings of the device 1, obtaining S2 an indication that the playback of the first audio stream should be altered.
- the method also comprises, in response to the obtained S2 indication: altering S3 the output of the first audio stream; by means of the microphone 5 of the communication device, recording S4 a second audio stream; and outputting S5 the second audio stream to the headphone for playback to the user.
- the method may further comprise, via the interface 4 and/or 5, obtaining S6 an indication that the playback of the first audio stream should be restored to as before the obtaining S2 of the indication that the playback of the first audio stream should be altered.
- the method may also comprise, in response to the obtained S6 indication that the playback of the first audio stream should be restored: discontinuing S7 the recording S4 and outputting S5 of the second audio stream; and altering S8 the output of the first audio stream such that the playback of the first audio stream is restored.
- the first audio stream is of a media file stored in the communication device 1 or streamed from a media server.
- the interface 4 comprises a touchscreen of a GUI
- the indication is obtained S2 by detecting an input via the touchscreen corresponding to the user 3 pressing a graphical element of the GUI associated with the indication that the playback of the first audio stream should be altered.
- the interface comprises the microphone 5, and the indication is obtained S2 by detecting, via the microphone, sound which the communication device 1 has been preprogrammed to associate with the indication that the playback of the first audio stream should be altered.
- the sound comprises a human voice.
- the detected human voice sound has a volume above a predetermined threshold.
- the detected human voice sound corresponds to a predetermined phrase.
- the recording S4 of the second audio stream comprises using an audio filter to reduce noise in the second audio stream.
- the method is performed at least partly by means of a software application 34 running on the communication device 1.
- the communication device is a mobile phone, e.g. a smartphone.
- the interface of the device 1 comprises a touchscreen of a UI 4 e.g. GUI, or the interface comprises a microphone 5.
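The microphone-triggered indication listed above (a sound whose volume is above a predetermined threshold, static or relative to background noise) and the corresponding restore condition (the sound staying below the threshold during a predetermined time period) might be sketched as follows. This is a minimal, loudness-only sketch: it does not recognise a human voice or a predetermined phrase, and the class `VoiceTrigger`, its parameters and their default values are assumptions made for illustration, not details from the disclosure.

```python
import math


def rms(frame):
    """Root-mean-square level of one frame of float samples in [-1.0, 1.0]."""
    if not frame:
        return 0.0
    return math.sqrt(sum(s * s for s in frame) / len(frame))


class VoiceTrigger:
    """Hypothetical loudness trigger; all names and defaults are assumptions."""

    def __init__(self, ratio=3.0, static_floor=0.01, noise_alpha=0.05, hold_frames=50):
        self.ratio = ratio                # threshold relative to background noise
        self.static_floor = static_floor  # static lower bound on the threshold
        self.noise_alpha = noise_alpha    # smoothing factor of the noise estimate
        self.hold_frames = hold_frames    # quiet frames required before restoring
        self.noise_level = static_floor   # running background-noise estimate
        self.quiet_frames = 0
        self.active = False               # True while voice mode should be on

    def process(self, frame):
        """Feed one microphone frame; return True while voice mode should be on."""
        level = rms(frame)
        threshold = max(self.static_floor, self.ratio * self.noise_level)
        if level > threshold:
            # Loud enough: indication that playback should be altered.
            self.active = True
            self.quiet_frames = 0
        else:
            # Track background noise only while nothing loud is present.
            self.noise_level += self.noise_alpha * (level - self.noise_level)
            if self.active:
                self.quiet_frames += 1
                if self.quiet_frames >= self.hold_frames:
                    # Quiet for the whole hold period: indication to restore.
                    self.active = False
        return self.active
```

With a frame length of, say, 20 ms, `hold_frames=50` would correspond to roughly one second of silence before playback is restored; both numbers are illustrative.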
Landscapes
- Physics & Mathematics (AREA)
- Engineering & Computer Science (AREA)
- Acoustics & Sound (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Otolaryngology (AREA)
- Circuit For Audible Band Transducer (AREA)
Abstract
The present disclosure relates to a method performed by a communication device 1 communicatively connected to a headphone 2. The method comprises outputting a first audio stream to the headphone for playback to a user 3 of the communication device. The method also comprises, via an interface 4 and/or 5 of the communication device to its surroundings, obtaining an indication that the playback of the first audio stream should be altered. The method also comprises, in response to the obtained indication: altering the output of the first audio stream; by means of a microphone 5 of the communication device, recording a second audio stream; and outputting the second audio stream to the headphone for playback to the user.
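The sequence of steps in the abstract, together with the restore step described later in the disclosure, can be read as a small two-state controller. The following is a minimal sketch under that reading; the names `Mode` and `VoiceModeController`, and the modelling of the altered output as a single volume value, are assumptions for illustration only.

```python
from enum import Enum, auto


class Mode(Enum):
    PLAYBACK = auto()  # first audio stream is played back unaltered
    VOICE = auto()     # first stream altered, microphone stream output to headphone


class VoiceModeController:
    """Hypothetical controller; names and volume values are illustrative."""

    def __init__(self, normal_volume=1.0, altered_volume=0.0):
        self.normal_volume = normal_volume
        # 0.0 models muting; a value such as 0.2 would model reduced volume.
        self.altered_volume = altered_volume
        self.mode = Mode.PLAYBACK
        self.first_stream_volume = normal_volume
        self.mic_passthrough = False

    def on_alter_indication(self):
        """Indication obtained via the UI or the microphone: enter voice mode."""
        if self.mode is Mode.VOICE:
            return
        self.first_stream_volume = self.altered_volume  # alter the first stream
        self.mic_passthrough = True                     # record and output second stream
        self.mode = Mode.VOICE

    def on_restore_indication(self):
        """Indication that playback should be restored: leave voice mode."""
        if self.mode is Mode.PLAYBACK:
            return
        self.mic_passthrough = False                    # stop the second stream
        self.first_stream_volume = self.normal_volume   # restore the first stream
        self.mode = Mode.PLAYBACK
```

Both handlers are idempotent, so repeated indications of the same kind (for example several touch events) leave the state unchanged.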
Description
- The present disclosure relates to a communication device connected to a headphone and to enabling communication of ambient sound, including outputting an audio stream to the headphone for playback to a user of the communication device.
- When using headphones for e.g. listening to music, it is often desirable to shut out ambient sounds in order to improve the listening experience. There are also active noise-cancelling headphones on the market for further reduction of sound pollution when using the headphones. This implies that it may be difficult for a person using the headphones to hear another person trying to talk to him/her, unless the headphones are turned off or removed.
- It is an objective of the present invention to improve verbal communication with a person wearing headphones, without the need to remove the headphones from the ears of said person.
- According to an aspect of the present invention, there is provided a method performed by a communication device communicatively connected to a headphone (or headphones). The method comprises outputting a first audio stream to the headphone for playback to a user of the communication device. The method also comprises, via an interface of the communication device to its surroundings, obtaining an indication that the playback of the first audio stream should be altered. The method also comprises, in response to the obtained indication, altering the output of the first audio stream; by means of a microphone of the communication device, recording a second audio stream; and outputting the second audio stream to the headphone for playback to the user.
- According to another aspect of the present invention, there is provided a computer program product comprising computer-executable components for causing a communication device to perform an embodiment of the method of the present disclosure when the computer-executable components are run on processing circuitry comprised in the communication device.
- According to another aspect of the present invention, there is provided a communication device comprising processing circuitry, and storage storing instructions executable by said processing circuitry whereby said communication device is operative to output a first audio stream to a headphone for playback to a user of the communication device. The communication device is also operative to, via an interface of the communication device to its surroundings, obtain an indication that the playback of the first audio stream should be altered. The communication device is also operative to, in response to the obtained indication: alter the output of the first audio stream; by means of a microphone of the communication device, record a second audio stream; and output the second audio stream to the headphone for playback to the user.
- By altering the output of the first audio stream, e.g. discontinuing it, muting it, fading it out or reducing the volume of it, and using the microphone of the communication device (also called only device herein) for capturing and playing back, effectively amplifying, ambient sound (typically voice), the user of the communication device wearing the headphone(s) may better hear the ambient sound (via the microphone) without the need to remove the headphone(s). This may be called a voice mode of the device.
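The alteration options named above (muting, fading out or reducing the volume of the first audio stream while the microphone signal is played back) can be illustrated with a per-frame gain and a simple mixer. This is a minimal sketch assuming float samples in [-1.0, 1.0] and a linear fade; the functions `fade_ramp` and `mix_frame` are hypothetical helpers, not part of the disclosure.

```python
def fade_ramp(start_gain, end_gain, steps):
    """Linear gain ramp from start_gain to end_gain, one value per frame."""
    if steps < 1:
        raise ValueError("steps must be >= 1")
    if steps == 1:
        return [end_gain]
    return [start_gain + (end_gain - start_gain) * i / (steps - 1)
            for i in range(steps)]


def mix_frame(first_frame, mic_frame, first_gain, mic_gain=1.0):
    """Mix one frame of the first audio stream with one microphone frame.

    first_gain = 0.0 models muting, 0 < first_gain < 1 models reduced volume.
    The mixed samples are clipped to [-1.0, 1.0].
    """
    return [max(-1.0, min(1.0, first_gain * a + mic_gain * b))
            for a, b in zip(first_frame, mic_frame)]
```

Applying the gains from `fade_ramp(1.0, 0.0, n)` to `n` successive frames fades the first stream out; reversing the arguments fades it back in when playback is restored.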
- It is to be noted that any feature of any of the aspects may be applied to any other aspect, wherever appropriate. Likewise, any advantage of any of the aspects may apply to any of the other aspects. Other objectives, features and advantages of the enclosed embodiments will be apparent from the following detailed disclosure, from the attached dependent claims as well as from the drawings.
- Generally, all terms used in the claims are to be interpreted according to their ordinary meaning in the technical field, unless explicitly defined otherwise herein. All references to "a/an/the element, apparatus, component, means, step, etc." are to be interpreted openly as referring to at least one instance of the element, apparatus, component, means, step, etc., unless explicitly stated otherwise. The steps of any method disclosed herein do not have to be performed in the exact order disclosed, unless explicitly stated. The use of "first", "second" etc. for different features/components of the present disclosure is only intended to distinguish the features/components from other similar features/components and not to impart any order or hierarchy to the features/components.
- Embodiments will be described, by way of example, with reference to the accompanying drawings, in which:
- Fig 1a-d schematically illustrates some embodiments of the present invention.
- Fig 2a-d schematically illustrates some other embodiments of the present invention.
- Fig 3 is a schematic block diagram of an embodiment of a communication device of the present invention.
- Fig 4 is a schematic illustration of an embodiment of a computer program product of the present invention.
- Fig 5 is a schematic flow chart of embodiments of the method of the present invention.
- Embodiments will now be described more fully hereinafter with reference to the accompanying drawings, in which certain embodiments are shown. However, other embodiments in many different forms are possible within the scope of the present disclosure. Rather, the following embodiments are provided by way of example so that this disclosure will be thorough and complete, and will fully convey the scope of the disclosure to those skilled in the art. Like numbers refer to like elements throughout the description.
-
Figures 1a-d and2a-d illustrate steps of some embodiments of the present invention. Acommunication device 1 is communicatively connected to a headphone orheadphones 2 worn by aperson 3 who is herein called a user of the communication device. The headphone(s) comprises speakers for playing back audio, e.g. music, to theuser 3, and is e.g. arranged in, on or over an ear (or both ears) of the user. - The
communication device 1 may e.g. be configured for wired power supply or comprise a battery. The communication device may e.g. be a radio device such as any device or user equipment (UE), mobile or stationary, enabled to communicate over a radio channel in a communication network, for instance but not limited to e.g. mobile phone, smartphone, media players, or any type of consumer electronic, for instance but not limited to television, radio, tablet computer, laptop, or personal computer (PC). Thedevice 1 is communicatively connected, wired or wirelessly, to theheadphone 2 via aheadphone interface 8 for outputting an audio stream to theheadphone 2 which may then be played back to the user by means of its speakers. In case of a wired headphone connection, the headphone interface may e.g. comprise a receiver for a headphone connector such as a 3.5 mm connector, a Lightning connector or a USB connector (e.g. a micro USB or USB-C). In case of a wireless headphone interface, the headphone interface may comprise a radio interface e.g. for Bluetooth, Local Area Network (LAN) or Wi-Fi, or Near-Field Communication (NFC). The device may also comprise a communication interface for a data connection e.g. to the Internet, which may be wired or wireless e.g. in accordance with a LAN or Third Generation Partnership Project (3GPP) communication standard. Thedevice 1 also comprises amicrophone interface 5, e.g. a microphone, and a User Interface (UI) 4 e.g. a Graphical UI (GUI) optionally comprising a touchscreen. Additionally or alternatively, theUI 4 may comprise mechanical buttons or keys. - In the situation shown in
figures 1a and2a , respectively, auser 3 listens to an audio stream (herein called a first audio stream) e.g. music or an audio book, by means of theheadphone 2 connected to thedevice 1. The first audio stream may be of a media file, or playlist of a plurality of media files, which is e.g. stored in a storage in thedevice 1 or streamed by the device from an external media server (being buffered in the device) and outputted to the headphone. This may be regarded as a starting situation for embodiments of the method of the present invention. The user may e.g. be working or travelling and uses the headphone and the first audio stream avoid disturbing ambient sounds. - In the example situation shown in
figure 1b , theuser 3 decides that he/she wants to, e.g. temporarily, hear ambient sound e.g. of what anotherperson 6 is saying. Theuser 3 then uses theUI 4 to input a command to thedevice 1 to put the device in what is herein called voice mode, whereby the device receives an indication that the playback of the first audio stream should be altered in accordance with the voice mode. If the UI comprises a touchscreen, the user may e.g. input the command by making a touch gesture or by pressing agraphical element 7 of the GUI, which graphical element is associated with the voice mode and thus provides the indication to the device. Thegraphical element 7 may e.g. be presented by a software (SW) application (app) or widget running in the device, e.g. integrated in a media player in the device. The user may thus easily switch to voice mode by interaction via theUI 4. - Additionally or alternatively, in the example situation shown in
figure 2b , the switching to voice mode may be initiated automatically, without the need for theuser 3 to interact with thedevice 1 via theUI 4. In this situation, thedevice 1 detects a predefined sound by means of themicrophone 5. The device has been preprogrammed to associate this sound with an indication that the device should be put in voice mode. The microphone may thus be active and, when the sound is detected, thedevice 1 is automatically put in voice mode. The sound may e.g. be a human voice. The human voice may have a volume which is above a predetermined threshold, e.g. a static threshold or a threshold which is relative to background noise in order to qualify as an indication for putting the device in voice mode. Additionally or alternatively, the human voice may have to speak a predetermined phrase, e.g. an activation word or phrase such as a name of theuser 3. By this, another person 6, or e.g. a speaker system in a train or plane, may automatically activate the voice mode without theuser 3 having to see that theother person 6 is trying to make contact or without the other person having to speak loudly to be heard over the playback of the first audio stream. This may make it easier and less awkward to make contact with theuser 3. For instance, if theuser 3 is working while listening viaheadphones 2 it may be socially awkward to approach him/her which may require either entering the field of vision of theuser 3, gesturing or tapping him/her or talking really loudly in order to get noticed and start a conversation. -
Figures 1c and2c , respectively, shows the situation after thedevice 1 has been put in voice mode, e.g. following any of the situations offigures 1b or2b . The output of the first audio stream to theheadphone 2 has been altered, e.g. such that the playback by means of the speakers in the headphone has been interrupted (stopped), muted, faded out, or reduced in volume, in order to allow theuser 3 to hear ambient sound. The ambient sound is obtained/recorded by means of themicrophone 5 and outputted to theheadphone 2 as a second audio stream for playback to the user via the speakers. The ambient sound of the second audio stream typically comprises a human voice, and in some embodiments an audio filter (typically a digital audio filter) may be used to enhance the human voice and/or reduce noise before outputting the second audio stream to the headphone. In some embodiments, visual feedback to the user that the voice mode is active may be presented by means of theGUI 4. Thus, the user may hear another person (or a speaker system) via themicrophone 5 in thedevice 1 and the speakers in theheadphone 2, without the need for removing the headphone(s). - The
device 1 may be kept in voice mode until the device, e.g. via an interface (e.g. UI 4 and/or microphone 5), obtains an indication that the playback of the first audio stream should be restored to as before the obtaining of the indication that the playback of the first audio stream should be altered. In response to the obtained indication that the playback of the first audio stream should be restored, thedevice 1 may discontinue the recording and outputting of the second audio stream, and alter the output of the first audio stream such that the playback of the first audio stream is restored to as it was before the obtaining of the indication that the playback of the first audio stream should be altered (e.g. as discussed in respect offigures 1a and2a ). - The situations shown in
figures 1d and 2d, respectively, illustrate embodiments of the present invention after the playback of the first audio stream has been restored, similar to figures 1a and 2a. Depending on how the output of the first audio stream was altered, the first audio stream output may be similarly restored, e.g. resumed (started), unmuted, faded in, or increased in volume. - In
figure 1d, where the indication that the playback of the first audio stream should be altered was obtained via the UI 4, the indication that the first audio stream should be restored may similarly be obtained via the UI 4, e.g. by making a touch gesture or by the user pressing the same, or a different, graphical element 7 of the GUI, or by releasing pressure on said graphical element 7 if the voice mode is only active while the user is continuously pressing the graphical element. - In
figure 2d, where the indication that the playback of the first audio stream should be altered was obtained via the microphone 5, the indication that the first audio stream should be restored may similarly be obtained via the microphone 5, e.g. by detecting that the human voice is no longer heard, or is below a predetermined volume threshold, during a predetermined time period. - Additionally or alternatively, the indication that the first audio stream should be restored may be obtained by the expiry of a timer which was activated when the
device 1 was put in the voice mode. -
Figure 3 schematically illustrates an embodiment of a communication device 1 of the present disclosure. The device 1 comprises processing circuitry 31, e.g. a central processing unit (CPU). The processing circuitry 31 may comprise one or a plurality of processing units in the form of microprocessor(s). However, other suitable devices with computing capabilities could be comprised in the processing circuitry 31, e.g. an application specific integrated circuit (ASIC), a field programmable gate array (FPGA) or a complex programmable logic device (CPLD). The processing circuitry 31 is configured to run one or several computer program(s) or software (SW) 41 (see also figure 4) stored in a storage 32 of one or several storage unit(s), e.g. a memory. The storage unit is regarded as a computer readable means 42 (see figure 4) as discussed herein and may e.g. be in the form of a Random Access Memory (RAM), a Flash memory or other solid state memory, or a hard disk, or be a combination thereof. The processing circuitry 31 may also be configured to store data in the storage 32, as needed. The SW 41 may comprise SW for making the device perform embodiments of the method of the present disclosure. The SW 41 may e.g. comprise app SW 33 which, when run on the processing circuitry 31, forms the app 34 by means of which the device 1 may perform at least a part of embodiments of the method. The device 1 also comprises the audio output/headphone interface 8, the microphone 5 and the UI 4 as previously discussed. -
Figure 4 illustrates an embodiment of a computer program product 40. The computer program product 40 comprises a computer readable (e.g. nonvolatile and/or non-transitory) medium 42 comprising software/computer program 41 in the form of computer-executable components. The computer program 41 may be configured to cause a device 1, e.g. as discussed herein, to perform an embodiment of the method of the present disclosure. The computer program may be run on the processing circuitry 31 of the device 1 for causing it to perform the method. The computer program product 40 may e.g. be comprised in a storage unit or memory 32 comprised in the device 1 and associated with the processing circuitry 31. Alternatively, the computer program product 40 may be, or be part of, a separate, e.g. mobile, storage means/medium, such as a computer readable disc, e.g. CD or DVD or hard disc/drive, or a solid state storage medium, e.g. a RAM or Flash memory. Further examples of the storage medium can include, but are not limited to, any type of disk including floppy disks, optical discs, DVD, CD-ROMs, microdrive, and magneto-optical disks, ROMs, RAMs, EPROMs, EEPROMs, DRAMs, VRAMs, flash memory devices, magnetic or optical cards, nanosystems (including molecular memory ICs), or any type of media or device suitable for storing instructions and/or data. Embodiments of the present disclosure may be conveniently implemented using one or more conventional general purpose or specialized digital computers, computing devices, machines, or microprocessors, including one or more processors, memory and/or computer readable storage media programmed according to the teachings of the present disclosure. Appropriate software coding can readily be prepared by skilled programmers based on the teachings of the present disclosure, as will be apparent to those skilled in the software art. -
Figure 5 is a schematic flow chart of some embodiments of the method of the present invention. The method is performed by a communication device 1 communicatively connected to a headphone 2. The method comprises outputting S1 a first audio stream to the headphone for playback to a user 3 of the communication device. The method also comprises, via an interface (e.g. UI 4 and/or microphone 5) to the surroundings of the device 1, obtaining S2 an indication that the playback of the first audio stream should be altered. The method also comprises, in response to the obtained S2 indication: altering S3 the output of the first audio stream; by means of the microphone 5 of the communication device, recording S4 a second audio stream; and outputting S5 the second audio stream to the headphone for playback to the user. - In some embodiments, the method may further comprise, via the
interface 4 and/or 5, obtaining S6 an indication that the playback of the first audio stream should be restored to as before the obtaining S2 of the indication that the playback of the first audio stream should be altered. The method may also comprise, in response to the obtained S6 indication that the playback of the first audio stream should be restored: discontinuing S7 the recording S4 and outputting S5 of the second audio stream; and altering S8 the output of the first audio stream such that the playback of the first audio stream is restored. - In some embodiments, the first audio stream is of a media file stored in the
communication device 1 or streamed from a media server. - In some embodiments, the
interface 4 comprises a touchscreen of a GUI, and the indication is obtained S2 by detecting an input via the touchscreen corresponding to the user 3 pressing a graphical element of the GUI associated with the indication that the playback of the first audio stream should be altered. - In some embodiments, the interface comprises the
microphone 5, and the indication is obtained S2 by detecting, via the microphone, sound which the communication device 1 has been preprogrammed to associate with the indication that the playback of the first audio stream should be altered. In some embodiments, the sound comprises a human voice. In some embodiments, the detected human voice sound has a volume above a predetermined threshold. In some embodiments, the detected human voice sound corresponds to a predetermined phrase. - In some embodiments, the recording S4 of the second audio stream comprises using an audio filter to reduce noise in the second audio stream.
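The volume criterion for the microphone-triggered embodiments (a static threshold, or a threshold relative to background noise) could be sketched roughly as follows. This is only an illustrative sketch and not the implementation of the disclosure: the class name `VolumeTrigger`, its parameters, the frame-wise RMS level, the exponential noise-floor estimate and the choice to combine both thresholds by taking the larger one are all assumptions made for the example. Detecting a predetermined phrase would require a separate speech recognizer and is not shown.

```python
import math
from typing import Sequence


def rms(frame: Sequence[float]) -> float:
    """Root-mean-square level of one frame of samples in [-1.0, 1.0]."""
    if not frame:
        return 0.0
    return math.sqrt(sum(s * s for s in frame) / len(frame))


class VolumeTrigger:
    """Hypothetical detector for the 'volume above a threshold' indication.

    A frame counts as an indication when its level exceeds both a static
    threshold and a multiple of the estimated background-noise level.
    Combining the two with max() is an assumption of this sketch; the
    disclosure only names the two kinds of threshold as alternatives.
    """

    def __init__(self, static_threshold: float = 0.1,
                 relative_factor: float = 3.0,
                 noise_smoothing: float = 0.9) -> None:
        self.static_threshold = static_threshold
        self.relative_factor = relative_factor
        self.noise_smoothing = noise_smoothing
        self.noise_floor = 0.0  # running estimate of the background level

    def update(self, frame: Sequence[float]) -> bool:
        """Process one microphone frame; return True if it should trigger."""
        level = rms(frame)
        threshold = max(self.static_threshold,
                        self.relative_factor * self.noise_floor)
        triggered = level > threshold
        if not triggered:
            # Only non-triggering frames feed the background estimate,
            # so a loud voice does not raise its own threshold.
            a = self.noise_smoothing
            self.noise_floor = a * self.noise_floor + (1.0 - a) * level
        return triggered
```

In use, each microphone frame would be passed to `update()`, and a `True` result would be treated as the obtained S2 indication. In a noisy environment the relative threshold rises with the background level, so a sound that would pass the static threshold alone may no longer trigger voice mode.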
- In some embodiments, the method is performed at least partly by means of a
software application 34 running on the communication device 1. - In some embodiments, the communication device is a mobile phone, e.g. a smartphone.
- In some embodiments, the interface of the
device 1 comprises a touchscreen of a UI 4, e.g. a GUI, or the interface comprises a microphone 5. - The present disclosure has mainly been described above with reference to a few embodiments. However, as is readily appreciated by a person skilled in the art, other embodiments than the ones disclosed above are equally possible within the scope of the present disclosure, as defined by the appended claims.
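The method steps S1 to S8 described with reference to figure 5, together with the timer-based restore, can be pictured as a small two-state controller. The following is a minimal sketch under stated assumptions, not the disclosed implementation: the names `Mode`, `VoiceModeController`, `duck_gain` and `timeout_s` are hypothetical, the first audio stream is represented only by a gain value, and the recording and output of the second audio stream are represented by a single flag rather than real audio I/O.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class Mode(Enum):
    PLAYBACK = "playback"  # first audio stream plays normally (S1)
    VOICE = "voice"        # ambient sound is passed to the headphone


@dataclass
class VoiceModeController:
    """Hypothetical controller tracking the state changes of steps S2 to S8."""

    duck_gain: float = 0.0             # gain of the first stream in voice mode (0.0 = muted)
    timeout_s: Optional[float] = None  # optional timer for automatic restore
    mode: Mode = Mode.PLAYBACK
    first_stream_gain: float = 1.0
    passthrough_active: bool = False   # stands in for recording S4 and outputting S5
    _saved_gain: float = 1.0
    _entered_at: float = 0.0

    def on_alter_indication(self, now: float) -> None:
        """S2: indication obtained via UI or microphone; enter voice mode."""
        if self.mode is Mode.VOICE:
            return
        self._saved_gain = self.first_stream_gain
        self.first_stream_gain = self.duck_gain  # S3: alter the first stream
        self.passthrough_active = True           # S4 + S5: record and output
        self.mode = Mode.VOICE
        self._entered_at = now

    def on_restore_indication(self) -> None:
        """S6: indication that playback should be restored."""
        if self.mode is Mode.PLAYBACK:
            return
        self.passthrough_active = False            # S7: stop the second stream
        self.first_stream_gain = self._saved_gain  # S8: restore the first stream
        self.mode = Mode.PLAYBACK

    def tick(self, now: float) -> None:
        """Restore automatically once the timer started on entry has expired."""
        if (self.mode is Mode.VOICE and self.timeout_s is not None
                and now - self._entered_at >= self.timeout_s):
            self.on_restore_indication()
```

Saving the gain on entry and writing it back on exit is one simple way to restore playback "as before". A real application would additionally have to handle fading, pausing and resuming of the media player, which this sketch leaves out.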
Claims (14)
- A method performed by a communication device (1) communicatively connected to a headphone (2), the method comprising: outputting (S1) a first audio stream to the headphone for playback to a user (3) of the communication device; via an interface (4;5) of the communication device to its surroundings, obtaining (S2) an indication that the playback of the first audio stream should be altered; and in response to the obtained (S2) indication: altering (S3) the output of the first audio stream, by means of a microphone (5) of the communication device, recording (S4) a second audio stream, and outputting (S5) the second audio stream to the headphone for playback to the user.
- The method of claim 1, wherein the first audio stream is of a media file stored in the communication device (1) or streamed from a media server.
- The method of claim 1 or 2, wherein the interface (4) comprises a touchscreen of a GUI, and wherein the indication is obtained (S2) by detecting an input via the touchscreen corresponding to the user (3) pressing a graphical element of the GUI associated with the indication.
- The method of claim 1 or 2, wherein the interface comprises the microphone (5), and wherein the indication is obtained (S2) by detecting, via the microphone, sound which the communication device (1) has been preprogrammed to associate with the indication.
- The method of claim 4, wherein the sound comprises a human voice.
- The method of claim 5, wherein the detected voice sound has a volume above a predetermined threshold.
- The method of claim 5 or 6, wherein the detected voice sound corresponds to a predetermined phrase.
- The method of any preceding claim, wherein the recording (S4) of the second audio stream comprises using an audio filter to reduce noise in the second audio stream.
- The method of any preceding claim, further comprising: via the interface (4;5), obtaining (S6) an indication that the playback of the first audio stream should be restored to as before the obtaining (S2) of the indication that the playback of the first audio stream should be altered; and in response to the obtained (S6) indication that the playback of the first audio stream should be restored: discontinuing (S7) the recording (S4) and outputting (S5) of the second audio stream, and altering (S8) the output of the first audio stream such that the playback of the first audio stream is restored.
- The method of any preceding claim, wherein the method is performed by means of a software application (34) running on the communication device (1).
- A computer program product (40) comprising computer-executable components (41) for causing a communication device (1) to perform the method of any one of claims 1-9 when the computer-executable components are run on processing circuitry (31) comprised in the communication device.
- A communication device (1) comprising: processing circuitry (31); and storage (32) storing instructions (41) executable by said processing circuitry whereby said communication device is operative to: output a first audio stream to a headphone (2) for playback to a user (3) of the communication device; via an interface (4;5) of the communication device to its surroundings, obtain an indication that the playback of the first audio stream should be altered; and in response to the obtained indication: alter the output of the first audio stream, by means of a microphone (5) of the communication device, record a second audio stream, and output the second audio stream to the headphone for playback to the user.
- The communication device of claim 12, wherein the communication device is a mobile phone, e.g. a smartphone.
- The communication device of claim 11 or 12, wherein the interface (4) comprises a touchscreen of a GUI or wherein the interface comprises the microphone (5).
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP16201116.7A EP3328090A1 (en) | 2016-11-29 | 2016-11-29 | System and method for enabling communication of ambient sound as an audio stream |
Publications (1)
Publication Number | Publication Date |
---|---|
EP3328090A1 (en) | 2018-05-30 |
Family
ID=57442503
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP16201116.7A Withdrawn EP3328090A1 (en) | 2016-11-29 | 2016-11-29 | System and method for enabling communication of ambient sound as an audio stream |
Country Status (1)
Country | Link |
---|---|
EP (1) | EP3328090A1 (en) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10467998B2 (en) | 2015-09-29 | 2019-11-05 | Amper Music, Inc. | Automated music composition and generation system for spotting digital media objects and event markers using emotion-type, style-type, timing-type and accent-type musical experience descriptors that characterize the digital music to be automatically composed and generated by the system |
US10854180B2 (en) | 2015-09-29 | 2020-12-01 | Amper Music, Inc. | Method of and system for controlling the qualities of musical energy embodied in and expressed by digital music to be automatically composed and generated by an automated music composition and generation engine |
US10964299B1 (en) | 2019-10-15 | 2021-03-30 | Shutterstock, Inc. | Method of and system for automatically generating digital performances of music compositions using notes selected from virtual musical instruments based on the music-theoretic states of the music compositions |
US11024275B2 (en) | 2019-10-15 | 2021-06-01 | Shutterstock, Inc. | Method of digitally performing a music composition using virtual musical instruments having performance logic executing within a virtual musical instrument (VMI) library management system |
US11037538B2 (en) | 2019-10-15 | 2021-06-15 | Shutterstock, Inc. | Method of and system for automated musical arrangement and musical instrument performance style transformation supported within an automated music performance system |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20150170645A1 (en) * | 2013-12-13 | 2015-06-18 | Harman International Industries, Inc. | Name-sensitive listening device |
US20150222977A1 (en) * | 2014-02-06 | 2015-08-06 | Sol Republic Inc. | Awareness intelligence headphone |
-
2016
- 2016-11-29 EP EP16201116.7A patent/EP3328090A1/en not_active Withdrawn
Cited By (19)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11037539B2 (en) | 2015-09-29 | 2021-06-15 | Shutterstock, Inc. | Autonomous music composition and performance system employing real-time analysis of a musical performance to automatically compose and perform music to accompany the musical performance |
US12039959B2 (en) | 2015-09-29 | 2024-07-16 | Shutterstock, Inc. | Automated music composition and generation system employing virtual musical instrument libraries for producing notes contained in the digital pieces of automatically composed music |
US10467998B2 (en) | 2015-09-29 | 2019-11-05 | Amper Music, Inc. | Automated music composition and generation system for spotting digital media objects and event markers using emotion-type, style-type, timing-type and accent-type musical experience descriptors that characterize the digital music to be automatically composed and generated by the system |
US11776518B2 (en) | 2015-09-29 | 2023-10-03 | Shutterstock, Inc. | Automated music composition and generation system employing virtual musical instrument libraries for producing notes contained in the digital pieces of automatically composed music |
US11011144B2 (en) | 2015-09-29 | 2021-05-18 | Shutterstock, Inc. | Automated music composition and generation system supporting automated generation of musical kernels for use in replicating future music compositions and production environments |
US11017750B2 (en) | 2015-09-29 | 2021-05-25 | Shutterstock, Inc. | Method of automatically confirming the uniqueness of digital pieces of music produced by an automated music composition and generation system while satisfying the creative intentions of system users |
US11657787B2 (en) | 2015-09-29 | 2023-05-23 | Shutterstock, Inc. | Method of and system for automatically generating music compositions and productions using lyrical input and music experience descriptors |
US11030984B2 (en) | 2015-09-29 | 2021-06-08 | Shutterstock, Inc. | Method of scoring digital media objects using musical experience descriptors to indicate what, where and when musical events should appear in pieces of digital music automatically composed and generated by an automated music composition and generation system |
US10854180B2 (en) | 2015-09-29 | 2020-12-01 | Amper Music, Inc. | Method of and system for controlling the qualities of musical energy embodied in and expressed by digital music to be automatically composed and generated by an automated music composition and generation engine |
US10672371B2 (en) | 2015-09-29 | 2020-06-02 | Amper Music, Inc. | Method of and system for spotting digital media objects and event markers using musical experience descriptors to characterize digital music to be automatically composed and generated by an automated music composition and generation engine |
US11037540B2 (en) | 2015-09-29 | 2021-06-15 | Shutterstock, Inc. | Automated music composition and generation systems, engines and methods employing parameter mapping configurations to enable automated music composition and generation |
US11037541B2 (en) | 2015-09-29 | 2021-06-15 | Shutterstock, Inc. | Method of composing a piece of digital music using musical experience descriptors to indicate what, when and how musical events should appear in the piece of digital music automatically composed and generated by an automated music composition and generation system |
US11430419B2 (en) | 2015-09-29 | 2022-08-30 | Shutterstock, Inc. | Automatically managing the musical tastes and preferences of a population of users requesting digital pieces of music automatically composed and generated by an automated music composition and generation system |
US11430418B2 (en) | 2015-09-29 | 2022-08-30 | Shutterstock, Inc. | Automatically managing the musical tastes and preferences of system users based on user feedback and autonomous analysis of music automatically composed and generated by an automated music composition and generation system |
US11468871B2 (en) | 2015-09-29 | 2022-10-11 | Shutterstock, Inc. | Automated music composition and generation system employing an instrument selector for automatically selecting virtual instruments from a library of virtual instruments to perform the notes of the composed piece of digital music |
US11651757B2 (en) | 2015-09-29 | 2023-05-16 | Shutterstock, Inc. | Automated music composition and generation system driven by lyrical input |
US11024275B2 (en) | 2019-10-15 | 2021-06-01 | Shutterstock, Inc. | Method of digitally performing a music composition using virtual musical instruments having performance logic executing within a virtual musical instrument (VMI) library management system |
US10964299B1 (en) | 2019-10-15 | 2021-03-30 | Shutterstock, Inc. | Method of and system for automatically generating digital performances of music compositions using notes selected from virtual musical instruments based on the music-theoretic states of the music compositions |
US11037538B2 (en) | 2019-10-15 | 2021-06-15 | Shutterstock, Inc. | Method of and system for automated musical arrangement and musical instrument performance style transformation supported within an automated music performance system |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20180150276A1 (en) | System and method for enabling communication of ambient sound as an audio stream | |
EP3328090A1 (en) | System and method for enabling communication of ambient sound as an audio stream | |
US11630636B2 (en) | Changing companion communication device behavior based on status of wearable device | |
US11705878B2 (en) | Intelligent audio output devices | |
US20170318374A1 (en) | Headset, an apparatus and a method with automatic selective voice pass-through | |
JP6419222B2 (en) | Method and headset for improving sound quality | |
US20150036835A1 (en) | Earpieces with gesture control | |
CN112770214B (en) | Earphone control method and device and earphone | |
KR102513461B1 (en) | Headphone system | |
CN107995360B (en) | Call processing method and related product | |
CN107493500B (en) | Multimedia resource playing method and device | |
WO2017032030A1 (en) | Volume adjusting method and user terminal | |
WO2018018705A1 (en) | Voice communication method, device, and terminal | |
US20130101125A1 (en) | Acoustic Shock Prevention Apparatus | |
WO2017045453A1 (en) | Monitoring method and device based on earphone | |
EP2508007B1 (en) | Arrangement in a device and method for use with a service involving play out of media | |
WO2018018782A1 (en) | Noise reduction method, terminal, and computer storage medium | |
KR101693483B1 (en) | Method and computer program for cancelling howling and echo in a headset | |
US11122160B1 (en) | Detecting and correcting audio echo | |
KR102110515B1 (en) | Hearing aid device of playing audible advertisement or audible data | |
CN109511040B (en) | Whisper amplifying method and device and earphone | |
TW201812570A (en) | Method for automatic adjusting output of sound and electronic device | |
KR101693482B1 (en) | Headset with a function for cancelling howling and echo | |
US20130039154A1 (en) | Remote control of a portable electronic device and method therefor | |
CN106856537B (en) | Volume adjustment method and device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
| | PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase | Free format text: ORIGINAL CODE: 0009012 |
| | AK | Designated contracting states | Kind code of ref document: A1 Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR |
| | AX | Request for extension of the european patent | Extension state: BA ME |
| | RAP1 | Party data changed (applicant data changed or rights of an application transferred) | Owner name: SPOTIFY AB |
| | STAA | Information on the status of an ep patent application or granted ep patent | Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN |
| | 18D | Application deemed to be withdrawn | Effective date: 20181201 |