WO2019036092A1 - Masquage dynamique de transfert de données audio - Google Patents

Masquage dynamique de transfert de données audio Download PDF

Info

Publication number
WO2019036092A1
WO2019036092A1 PCT/US2018/036783 US2018036783W WO2019036092A1 WO 2019036092 A1 WO2019036092 A1 WO 2019036092A1 US 2018036783 W US2018036783 W US 2018036783W WO 2019036092 A1 WO2019036092 A1 WO 2019036092A1
Authority
WO
WIPO (PCT)
Prior art keywords
sound
file
encoded audio
computer
audio file
Prior art date
Application number
PCT/US2018/036783
Other languages
English (en)
Inventor
Arjita MADAN
Aviral GUPTA
Sumit Gwalani
Mrinal AHLAWAT
Heman KHANNA
Rohan LAISHRAM
Original Assignee
Google Llc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Google Llc filed Critical Google Llc
Priority to CN201880053363.8A priority Critical patent/CN110998711A/zh
Publication of WO2019036092A1 publication Critical patent/WO2019036092A1/fr

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04BTRANSMISSION
    • H04B11/00Transmission systems employing sonic, ultrasonic or infrasonic waves
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10KSOUND-PRODUCING DEVICES; METHODS OR DEVICES FOR PROTECTING AGAINST, OR FOR DAMPING, NOISE OR OTHER ACOUSTIC WAVES IN GENERAL; ACOUSTICS NOT OTHERWISE PROVIDED FOR
    • G10K11/00Methods or devices for transmitting, conducting or directing sound in general; Methods or devices for protecting against, or for damping, noise or other acoustic waves in general
    • G10K11/16Methods or devices for protecting against, or for damping, noise or other acoustic waves in general
    • G10K11/175Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound
    • G10K11/1752Masking
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10KSOUND-PRODUCING DEVICES; METHODS OR DEVICES FOR PROTECTING AGAINST, OR FOR DAMPING, NOISE OR OTHER ACOUSTIC WAVES IN GENERAL; ACOUSTICS NOT OTHERWISE PROVIDED FOR
    • G10K11/00Methods or devices for transmitting, conducting or directing sound in general; Methods or devices for protecting against, or for damping, noise or other acoustic waves in general
    • G10K11/16Methods or devices for protecting against, or for damping, noise or other acoustic waves in general
    • G10K11/175Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound
    • G10K11/178Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound by electro-acoustically regenerating the original acoustic waves in anti-phase
    • G10K11/1781Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound by electro-acoustically regenerating the original acoustic waves in anti-phase characterised by the analysis of input or output signals, e.g. frequency range, modes, transfer functions
    • G10K11/17821Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound by electro-acoustically regenerating the original acoustic waves in anti-phase characterised by the analysis of input or output signals, e.g. frequency range, modes, transfer functions characterised by the analysis of the input signals only
    • G10K11/17827Desired external signals, e.g. pass-through audio such as music or speech

Definitions

  • the technology disclosed herein relates to dynamic creation of optimum audio output to mask audio-based data during transfer.
  • Mobile computing devices commonly exchange data via the Internet.
  • data can be transferred using peer-to-peer connectivity, such as Bluetooth or near field communications.
  • peer-to-peer connectivity solutions require specific hardware and APIs to function. Accordingly, there is a need to enable and use features and hardware commonly found on mobile computing devices to exchange data.
  • Each phone or mobile communication device has, by definition, a microphone and a speaker.
  • data can be transmitted over the sound waves.
  • the audio vibrates it vibrates with many frequencies, and each frequency produces a sound wave.
  • High frequencies of audio result in less environmental noise.
  • high frequencies of audio are ideal for the transfer of data.
  • data is transmitted over the high frequency sound waves it produces an unpleasant sound.
  • a sound can be played to mask the unpleasant sound produced by the transmission of data.
  • the resulting frequency and amplitude of the encoded audio fluctuates from one transmission to another.
  • a computing device encodes data to be transferred into an audio file for an audio-based transmission, wherein the encoded audio file produces a sound audible to a human ear (which may be or may be not unpleasant to a human ear).
  • the computing device determines frequency points for the encoded audio file and amplitude for the encoded audio file, and creates a masking sound file based on the determined frequency points and amplitude for the encoded audio file.
  • the computing device plays the encoded audio file and the masking sound file.
  • the computing device combines the encoded audio file and the masking sound file into a single sound file and plays single sound file.
  • the computing device plays the encoded audio file and the masking sound file as two separate sound files simultaneously.
  • An ideal masking sound that may effectively mask an unpleasant sound produced by a data transfer can depend on the particular frequency and amplitude of the encoded audio. Accordingly, a masking sound may be dynamically produced that changes depending upon the data that is encoded for transfer.
  • the masking sound file may, for example, comprises a masking sound that is pleasant to the human ear.
  • the masking sound could thus mask un unpleasant sound produced by the encoded audio file.
  • Figure 1 is a block diagram depicting a system for dynamic audio-based data transfer masking, in accordance with certain examples.
  • Figure 2 is a block flow diagram depicting a method for dynamic audio-based data transfer masking, in accordance with certain examples.
  • Figure 3 is a block flow diagram depicting a method for creating a masking sound, in accordance with certain examples.
  • FIG. 4 is a block diagram depicting a computing machine and module, in accordance with certain examples. DETAILED DESCRIPTION OF THE EXAMPLES
  • the examples described herein provide computer-implemented techniques for dynamic audio-based data transfer masking.
  • the broadcasting computing device and the account management computing system provide the capability to transmit data via audio communication channels in a manner that is more pleasant to the human ear.
  • the systems and methods described herein enables transmitting data via audio communication channels by broadcasting computing devices wherein a second sound is produced to mask the unpleasant nature of the transmitted data.
  • an account management computing system produces a set of rules that can be applied by a broadcasting computing device to create an ideal masking sound.
  • the set of rules comprises a function or algorithm that, when applied to known data points from an encoded sound, produces an ideal masking sound.
  • the masking sound is dynamically produced for each encoded audio transmission.
  • the account management computing system transmits the rules for creating the masking sound to the broadcasting computing device.
  • the account management computing system pushes the rules as an application update.
  • the broadcasting computing device encodes data for audio- based data transfer.
  • the data is encoded in sound waves via modulation by varying one or more properties of the carrier sound wave (for example, amplitude, frequency, and/or phase).
  • the encoded audio has known frequencies.
  • the known frequencies comprise a pitch or note higher than a threshold pitch or note, which results in an unpleasant sound.
  • the broadcasting computing device creates a masking sound.
  • the broadcasting computing device retrieves the rules for creating the masking sound and applies the rules to the known frequency points and amplitude of the encoded audio.
  • the ideal masking sound plays at the right frequencies points and right amplitude to mask the unpleasant sound of the encoded audio.
  • the broadcasting computing device will create a masking sound that will play at the ideal frequency points and amplitude.
  • the rules for creating the masking sound comprise a function where the frequency points and amplitude of the encoded audio are entered as input and the masking sound is produced as an output.
  • the broadcasting computing device encodes the outputted masking sound as a sound file.
  • An example broadcasting computing device is capable of playing two separate sounds as separate files simultaneously on separate streams.
  • the broadcasting computing device plays the encoded audio file and the encoded masking sound file simultaneously.
  • the broadcasting computing device plays the encoded audio file and the encoded masking sound file simultaneously through an audio component.
  • the masking sound blocks the unpleasant sound of the encoded audio, resulting in a sound that is more pleasant to the human ear.
  • the broadcast computing device is not capable of playing two separate sounds as separate files simultaneously on separate streams.
  • the broadcasting computing device combines the encoded audio file and the encoded masking sound file to create a single sound file.
  • the broadcasting computing device then plays the single sound file.
  • the broadcasting computing device plays the single sound file through an audio component.
  • the masking sound blocks the unpleasant sound of the encoded audio, resulting in a sound that is more pleasant to the human ear.
  • the broadcasting computing device and the account management computing system enable a user to transmit relevant information directly from a broadcasting computing device without having to listen to the unpleasant nature of the sound produced by the encoded data.
  • the masking sound is dynamically produced as a result of the audio encoded for the transmission of the data. Because the masking sound is dynamically produced, the methods and systems described herein reduce inputs required by users with respect to broadcasting computing devices to transmit information.
  • the systems and methods described herein may be employed to preemptively find the best masking sound without requiring a user to physically manipulate the audio configurations.
  • the system communicates the rules for creating the masking sound with an account management computing system that can push the rules to all like computing devices, saving time and resources.
  • the automatic and dynamic nature of the system operates during the transmission of audio-based data. Thus, the system occurs in the background of the transmission at a rapid pace that is quicker than can be achieved by a human performing a like action.
  • a dynamic rules scheme can be shared among all different computing devices that supports audio-based data transfer. This dynamic rules scheme is beneficial, for example, when a new computing device is utilized for audio-based data transfer. In this example, when a new computing device model is launched in the market, lab testing can be used to derive the optimum rules and share it across computing devices of the same model.
  • this dynamic rules scheme is beneficial for example, when there is a shift in the intended audio-based data transfer scheme. For example changing the frequency bands at which all computing devices must send/receive.
  • the determined configuration can be communicated to all computing devices. Such changes can be timed so that all computing devices move to the new rules scheme at a predefined time.
  • the determined rules are communicated to groups of devices, which behave similarly (for example, those that are the same model or related models from the same manufacturer).
  • the determined rules from a single computing device can trigger audio rule changes in a large number of devices, which belong to the same group.
  • FIG. 1 is a block diagram depicting a system for dynamic audio-based data transfer masking, in accordance with certain examples.
  • the example operating environment 100 comprises computing systems 110, 120, and 130 that are configured to communicate with one another via one or more networks 140 via network computing devices.
  • two or more of these computing systems are integrated into the same system.
  • a user associated with a computing device must install an application and/or make a feature selection to obtain the benefits of the techniques described herein.
  • Each network 140 comprises a wired or wireless telecommunication mechanism by which network computing systems (including systems 110, 120, and 130) can communicate and exchange data.
  • each network 140 can be implemented as, or may be a part of, a storage area network (SAN), personal area network (PAN), a metropolitan area network (MAN), a local area network (LAN), a wide area network (WAN), a wireless local area network (WLAN), a virtual private network (VPN), an intranet, an Internet, a mobile telephone network, a card network, Bluetooth, Bluetooth Low Energy (BLE), near field communication network (NFC), any form of standardized radio frequency, infrared, sound (for example, audible sounds, melodies, and ultrasound), other short range communication channel, or any combination thereof, or any other appropriate architecture or system that facilitates the communication of signals, data, and/or messages (generally referred to as data).
  • data data
  • information are used interchangeably herein to refer to text, images, audio
  • each network computing system comprises a computing device having a communication module capable of transmitting and receiving data over the network 140.
  • each network computing system may comprise a server, personal computer, mobile device (for example, notebook computer, tablet computer, netbook computer, personal digital assistant (PDA), video game device, GPS locator device, cellular telephone, Smartphone, or other mobile device), a television with one or more processors embedded therein and/or coupled thereto, or other appropriate technology that comprises or is coupled to a web browser or other application for communicating via the network 140.
  • the network computing systems are operated by users and an account management computing system operator, respectively.
  • An example broadcasting computing device 110 comprises a user interface
  • the broadcasting computing device 110 may be a personal computer, mobile device (for example, notebook, computer, tablet computer, netbook computer, personal digital assistant (PDA), video game device, GPS locator device, cellular telephone, Smartphone or other mobile device), television, wearable computing devices (for example, watches, rings, or glasses), or other appropriate technology that comprises or is coupled to a web server (or other suitable application for interacting with web page files) or that comprises or is coupled to an application 113.
  • PDA personal digital assistant
  • the user can use the broadcasting computing device 110 to broadcast audio- based data via the audio component 117 using the user interface 111 and the application 113.
  • the user interface 111 comprises a touch screen, a voice-based interface, or any other interface that allows the user to provide input and receive output from the application 113.
  • the user interacts with the application 113 via the user interface 111 to select or instruct the broadcasting computing device 110 to broadcast audio-based data via the audio component 117.
  • the application 113 is a program, function, routine, applet or similar entity that exists on and performs its operations on the broadcasting computing device 110.
  • the application 113 may be one or more of an audio application, a data application, an account management computing system 130 application, an Internet browser, a user interface 111 application, or other suitable application operating on the broadcasting computing device 110.
  • the user must install an application 113 and/or make a feature selection on the broadcasting computing device 110 to obtain the benefits of the techniques described herein.
  • the data storage unit 119 and application 113 may be implemented in a secure element or other secure memory (not shown) on the broadcasting computing device 110.
  • the data storage unit 119 may be a separate memory unit resident on the broadcasting computing device 110.
  • An example data storage unit 119 enables storage of rules for creating an optimum masking sound.
  • the data storage unit 119 can comprise a local or remote data storage structure accessible to the broadcasting computing device 110 suitable for storing information.
  • the data storage unit 119 stores encrypted information, such as HTML5 local storage.
  • the audio component 117 comprises a speaker device or other device capable of producing a sound output.
  • An example sound output comprises an ultrasound output.
  • the audio component 117 communicates with the application 113 to receive an instruction to broadcast a sound output.
  • the audio component 117 is a component of the broadcasting computing device 110.
  • the audio component 117 is communicatively coupled to the broadcasting computing device 110.
  • An example broadcasting computing device 110 communicates with a receiving computing device 120 via an audio communication channel.
  • An example communication via the audio communication channel comprises the transmission of audio- based data.
  • the data is transferred from the broadcasting computing device 110 to the receiving computing device 120 over sound waves.
  • An example receiving computing device 120 comprises a user interface 121, an application 123, a microphone component 125, and a data storage unit 129.
  • the receiving computing device 120 may be a personal computer, mobile device (for example, notebook, computer, tablet computer, netbook computer, personal digital assistant (PDA), video game device, GPS locator device, cellular telephone, Smartphone or other mobile device), television, wearable computing devices (for example, watches, rings, or glasses), or other appropriate technology that comprises or is coupled to a web server (or other suitable application for interacting with web page files) or that comprises or is coupled to an application 123.
  • PDA personal digital assistant
  • the user can use the receiving computing device 120 to receive audio-based data via the microphone component 125 using the user interface 121 and the application 123.
  • the user interface 121 comprises a touch screen, a voice-based interface, or any other interface that allows the user to provide input and receive output from the application 123.
  • the user interacts with the application 123 via the user interface 121 to receive, read, or interact with the audio-based data received via the microphone component 125.
  • the application 123 is a program, function, routine, applet or similar entity that exists on and performs its operations on the receiving computing device 120.
  • the application 123 can be one or more of an audio application, a data application, an account management computing system 130 application, an Internet browser, a user interface 121 application, or other suitable application operating on the receiving computing device 120.
  • the user must install an application 123 and/or make a feature selection on the receiving computing device 120 to obtain the benefits of the techniques described herein.
  • the data storage unit 129 and application 123 may be implemented in a secure element or other secure memory (not shown) on the receiving computing device 120.
  • the data storage unit 129 may be a separate memory unit resident on the receiving computing device 120.
  • the data storage unit 129 can comprise a local or remote data storage structure accessible to the receiving computing device 120 suitable for storing information.
  • the data storage unit 129 stores encrypted information, such as HTML5 local storage.
  • the microphone component 125 comprises a microphone device that is capable of receiving sound inputs from an environment of the receiving computing device 120.
  • the microphone component 125 communicates with the application 123 to receive an instruction to transition from a passive mode to an active mode and listen for sound inputs.
  • the microphone component 125 receives sound inputs while in the active mode and transmits the received sound inputs to the application 123.
  • An example receiving computing device 120 and broadcasting computing device 110 communicate with the account management computing system 130.
  • An example account management computing system 130 comprises an account management component 131, an audio configuration component 133, and a data storage unit 137.
  • the receiving computing device 120 and broadcasting computing device 110 register with or are otherwise associated with the account management computing system 130.
  • the account management computing system 130 is capable of identifying the receiving computing device 120 and broadcasting computing device 110 and transmitting hardware configurations, instructions, updates, or other forms of data transmission to each computing device 110 and 120.
  • the account management computing system 130 is capable of identifying communications or transmissions received from the receiving computing device 120 and broadcasting computing device 110.
  • each device has a unique or otherwise identifiable code associated with it.
  • each computing device (including 110 and 120) downloads or authorizes an application (including 113 and 123) associated with the account management computing system 130 onto the device to perform the techniques described herein. In an example, this information is maintained within the account management component 131.
  • the broadcasting computing device 110 comprises rules for creating an optimal masking sound.
  • the account management computing device 130 communicates with the broadcasting computing device 110 to provide the rules for creating an optimal masking sound.
  • the audio configuration component 133 determines the rules for creating an optimum masking sound for multiple broadcasting computing devices (including 110).
  • the set of rules utilizes harmonic frequencies to produce the ideal masking sound.
  • the rules utilize multiples of the same fundamental frequency to produce a more pleasant masking sound.
  • the masking sound and encoded audio play at frequencies that are multiples of a specific, predetermined frequency.
  • the ideal masking sound comprises high enough amplitude to mask the encoded audio.
  • the ideal masking sound also comprises low enough amplitude so it does not interfere with the data transfer.
  • an unpleasant sound comprises a sound with one or more features or attributes that are outside of a predefined acceptable range.
  • the rules for creating optimum masking sounds are saved in the data storage unit 137.
  • the data storage unit 137 can comprise any local or remote data storage structure accessible to the account management computing system 130 suitable for storing information.
  • the data storage unit 137 stores encrypted information, such as HTML5 local storage.
  • the computing device (including 110 and 120) perform some or all of the functions of the account management computing system 130.
  • FIG. 1 It will be appreciated that the network connections shown are example and other means of establishing a communications link between the computers and devices can be used. Additionally, those having ordinary skill in the art and having the benefit of the present disclosure will appreciate that the computing devices illustrated in Figure 1 can have any of several other suitable computer system configurations. For example a receiving computing device 120 or a broadcasting computing device 110 embodied as a mobile phone or handheld computer may not include all the components described above.
  • the network computing devices and any other computing machines associated with the technology presented herein may be any type of computing machine such as, but not limited to, those discussed in more detail with respect to Figure 4.
  • any functions, applications, or components associated with any of these computing machines, such as those described herein or any others (for example, scripts, web content, software, firmware, hardware, or modules) associated with the technology presented herein may by any of the components discussed in more detail with respect to Figure 4.
  • the computing machines discussed herein may communicate with one another, as well as with other computing machines or communication systems over one or more networks, such as network 140.
  • the network 140 may comprise any type of data or communications network, including any of the network technology discussed with respect to Figure 4.
  • FIG. 2-3 The components of the example operating environment 100 are described hereinafter with reference to the example methods illustrated in Figures 2-3.
  • the example methods of Figures 2-3 may also be performed with other systems and in other environments.
  • the operations described with respect to any of the Figures 2-3 can be implemented as executable code stored on a computer or machine readable non-transitory tangible storage medium (e.g., floppy disk, hard disk, ROM, EEPROM, nonvolatile RAM, CD-ROM, etc.) that are completed based on execution of the code by a processor circuit implemented using one or more integrated circuits; the operations described herein also can be implemented as executable logic that is encoded in one or more non-transitory tangible media for execution (e.g., programmable logic arrays or devices, field programmable gate arrays, programmable array logic, application specific integrated circuits, etc.
  • executable logic e.g., programmable logic arrays or devices, field programmable gate arrays, programmable array logic, application specific integrated circuits, etc.
  • Figure 2 is a block flow diagram depicting a method for dynamic audio-based data transfer masking, in accordance with certain examples. The method 200 is described with reference to the components illustrated in Figure 1.
  • transmission of data encoded into an audio file produces a sound unpleasant to the human ear.
  • the broadcasting computing device 110 produces a second sound to mask the unpleasant nature of the data transmitted in the encoded audio file.
  • the account management computing system 130 determines rules for creating a masking sound.
  • the masking sound is the second sound played simultaneous with or combined with an encoded audio file to mask the unpleasant sound of the encoded audio file.
  • an account management computing system produces a set of rules that can be applied by a broadcasting computing device to create an ideal masking sound.
  • the set of rules comprises a function or algorithm that, when applied to known data points from an encoded sound, produces an ideal masking sound.
  • the masking sound is dynamically produced for each encoded audio transmission.
  • the ideal masking sound comprises a lower frequency than the encoded audio transmission.
  • the encoded audio transmission plays at a higher, more unpleasant frequency.
  • the ideal masking sound comprises a lower, more pleasant frequency.
  • the set of rules utilizes harmonic frequencies to produce the ideal masking sound.
  • the rules utilize multiples of the same fundamental frequency to produce a more pleasant masking sound.
  • the masking sound and encoded audio play at frequencies that are multiples of a specific, predetermined frequency, e.g., multiples of 100 (for example, 200 hertz, 300 hertz, and 400 hertz).
  • the ideal masking sound comprises high enough amplitude to mask the encoded audio.
  • an unpleasant sound comprises a sound with one or more features or attributes that are outside of a predefined acceptable range.
  • the unpleasant sound's frequency, amplitude, volume, or other attribute are outside of a pre-defined acceptable range for sound frequency, amplitude, volume, or other attribute.
  • an unpleasant sound comprises a sound that is perceptible to the average human ear.
  • a pleasant sound comprises a frequency, amplitude, volume, and/or other attribute that are within pre-define acceptable ranges.
  • the account management computing system 130 creates a function that, when the known frequencies and amplitudes of encoded data are fed into, produces an ideal masking sound for the encoded data.
  • the masking sound changes depending upon the data to be transmitted. This provides the masking sound with the maximum ability to mask the unpleasant sound produced by the encoded data.
  • the account management computing system 130 transmits the rules for creating masking sounds to the broadcasting computing device 110.
  • the rules are applicable for multiple different types of broadcasting computing devices 110.
  • the rules are transmitted to all broadcasting computing devices 110 that comprise the same make or model.
  • the rules device-specific In an example, the account management computing system 130 transmits the rules for creating the masking sounds to the broadcasting computing device 110 via the network 140.
  • the broadcasting computing device 110 receives the rules for creating the masking sound.
  • the account management computing system 130 pushes the rules to the broadcasting computing device 110 as an application 113 update.
  • the broadcasting computing device 110 saves the rules for crating the masking sound.
  • the rules are saved by the application 113 in the data storage unit 119.
  • the broadcasting computing device 110 encodes data for audio- based data transfer.
  • the application 113 on the broadcasting computing device 110 encodes the data for audio-based data transfer.
  • the data is encoded in sound waves via modulation by varying one or more properties of the carrier sound wave.
  • Example varied property of the carrier sound wave comprises amplitude, frequency, and/or phase.
  • the encoded audio has known frequencies.
  • the known frequencies comprise a pitch or note higher than a threshold pitch or note, which results in an unpleasant sound.
  • the encoded audio has known amplitudes.
  • the audio component 117 of the broadcasting computing device 110 is capable of broadcasting (and the microphone component 125 of the receiving computing device 120 is capable of receiving) a limited spectrum of frequencies, which results in a restricted bandwidth.
  • the broadcasting computing device 130 creates the masking sound.
  • the method for creating a masking sound is described in more detail hereinafter with reference to the methods described in Figure 3.
  • Characteristics of a sound comprise the pitch and loudness, both of which are determined by the frequency and amplitude of the sound wave.
  • the pitch of the sound depends on the frequency of the wave. The higher the frequency of the sound wave, the higher the pitch. A higher pitch results in a higher perceived shrillness of the sound.
  • the loudness of the sound depends on the amplitude of the vibration producing the sound. The higher the amplitude of the vibration, the louder the sound. A louder sound results in a higher perceived intensity of the sound.
  • the ideal masking sound plays at the right frequencies points and right amplitude to mask the unpleasant sound of the encoded audio.
  • the broadcasting computing device 110 Based on the specific frequency points and the amplitude of the encoded audio, the broadcasting computing device 110 creates a masking sound plays at the ideal frequency points and amplitude.
  • Figure 3 is a block flow diagram depicting a method 250 for creating a masking sound, in accordance with certain examples, as referenced in block 250. The method 200 is described with reference to the components illustrated in Figure 1.
  • the broadcasting computing device 110 retrieves the rules for creating the masking sound.
  • the rules for creating the masking sound were determined by the account management computing system 130 in block 210 of Figure 2, and saved by the broadcasting computing device 110 in block 230 of Figure 2.
  • the rules for creating the masking sound are retrieved by the application 113 from the data storage unit 119.
  • the broadcasting computing device 110 determines the frequency points for the encoded audio.
  • the audio-based data has known frequencies when encoded, and the broadcasting computing device 110 retrieves the known frequency points.
  • the frequency points are measured or calculated.
  • the broadcasting computing device 110 determines the amplitude of the encoded audio.
  • the audio-based data has known amplitudes when encoded, and the broadcasting computing device 110 retrieves the known amplitudes.
  • the amplitudes are measured or calculated.
  • the broadcasting computing device 110 applies the rules for creating the masking sound to the determined frequency points and amplitude.
  • the ideal masking sound plays at the right frequencies points and right amplitude to mask the unpleasant sound of the encoded audio.
  • the broadcasting computing device 110 Based on the specific frequency points and the amplitude of the encoded audio, the broadcasting computing device 110 creates a masking sound that plays at the ideal frequency points and amplitude.
  • the rules for creating the masking sound comprise a function where the frequency points and amplitude of the encoded audio are entered as input and the masking sound is produced as an output.
  • the broadcasting computing device 110 encodes the outputted masking sound as a sound file. In an example,
  • the broadcasting computing device 110 determines whether it is capable of playing two separate sounds as separate files simultaneously on separate streams.
  • the broadcasting computing device 110 comprises hardware that can play two sounds at the same time.
  • the audio component 117 can play two sounds at the same time.
  • the broadcasting computing device 110 If the broadcasting computing device 110 cannot play two separate sounds as separate files simultaneously on separate streams, the method 200 proceeds to block 270 in Figure 2.
  • the broadcasting computing device 110 combines the encoded audio file and the encoded masking sound file to create a single sound file.
  • the application 113 merges or combines the audio files to create a single sound file.
  • the broadcasting computing device 110 plays the single sound file.
  • the broadcasting computing device 110 plays the single sound file through the audio component 117.
  • the masking sound blocks the unpleasant sound of the encoded audio, resulting in a sound that is more pleasant to the human ear.
  • the broadcasting computing device 110 plays the encoded audio file and the encoded masking sound file simultaneously.
  • the broadcasting computing device 110 plays the encoded audio file and the encoded masking sound file simultaneously through the audio component 117.
  • the masking sound blocks the unpleasant sound of the encoded audio, resulting in a sound that is more pleasant to the human ear.
  • the receiving computing device 120 receives the encoded audio file.
  • the receiving computing device 120 receives the encoded audio file via the microphone component 125.
  • the application 123 of the receiving computing device 120 is capable of decoding the encoded audio file.
  • FIG. 4 depicts a computing machine 2000 and a module 2050 in accordance with certain examples.
  • the computing machine 2000 may correspond to any of the various computers, servers, mobile devices, embedded systems, or computing systems presented herein.
  • the module 2050 may comprise one or more hardware or software elements configured to facilitate the computing machine 2000 in performing the various methods and processing functions presented herein.
  • the computing machine 2000 may include various internal or attached components such as a processor 2010, system bus 2020, system memory 2030, storage media 2040, input/output interface 2060, and a network interface 2070 for communicating with a network 2080.
  • the computing machine 2000 may be implemented as a conventional computer system, an embedded controller, a laptop, a server, a mobile device, a smartphone, a set-top box, a kiosk, a router or other network node, a vehicular information system, one more processors associated with a television, a customized machine, any other hardware platform, or any combination or multiplicity thereof.
  • the computing machine 2000 may be a distributed system configured to function using multiple computing machines interconnected via a data network or bus system.
  • the processor 2010 may be configured to execute code or instructions to perform the operations and functionality described herein, manage request flow and address mappings, and to perform calculations and generate commands.
  • the processor 2010 may be configured to monitor and control the operation of the components in the computing machine 2000.
  • the processor 2010 may be a general purpose processor, a processor core, a multiprocessor, a reconfigurable processor, a microcontroller, a digital signal processor ("DSP"), an application specific integrated circuit (“ASIC”), a graphics processing unit (“GPU”), a field programmable gate array (“FPGA”), a programmable logic device (“PLD”), a controller, a state machine, gated logic, discrete hardware components, any other processing unit, or any combination or multiplicity thereof.
  • DSP digital signal processor
  • ASIC application specific integrated circuit
  • GPU graphics processing unit
  • FPGA field programmable gate array
  • PLD programmable logic device
  • the processor 2010 may be a single processing unit, multiple processing units, a single processing core, multiple processing cores, special purpose processing cores, co-processors, or any combination thereof. According to certain examples, the processor 2010 along with other components of the computing machine 2000 may be a virtualized computing machine executing within one or more other computing machines.
  • the system memory 2030 may include non-volatile memories such as readonly memory (“ROM”), programmable read-only memory (“PROM”), erasable programmable read-only memory (“EPROM”), flash memory, or any other device capable of storing program instructions or data with or without applied power.
  • the system memory 2030 may also include volatile memories such as random access memory (“RAM”), static random access memory (“SRAM”), dynamic random access memory (“DRAM”), and synchronous dynamic random access memory (“SDRAM”). Other types of RAM also may be used to implement the system memory 2030.
  • RAM random access memory
  • SRAM static random access memory
  • DRAM dynamic random access memory
  • SDRAM synchronous dynamic random access memory
  • Other types of RAM also may be used to implement the system memory 2030.
  • the system memory 2030 may be implemented using a single memory module or multiple memory modules.
  • system memory 2030 is depicted as being part of the computing machine 2000, one skilled in the art will recognize that the system memory 2030 may be separate from the computing machine 2000 without departing from the scope of the subject technology. It should also be appreciated that the system memory 2030 may include, or operate in conjunction with, a nonvolatile storage device such as the storage media 2040.
  • the storage media 2040 may include a hard disk, a floppy disk, a compact disc read only memory (“CD-ROM”), a digital versatile disc (“DVD”), a Blu-ray disc, a magnetic tape, a flash memory, other non-volatile memory device, a solid state drive (“SSD”), any magnetic storage device, any optical storage device, any electrical storage device, any semiconductor storage device, any physical-based storage device, any other data storage device, or any combination or multiplicity thereof.
  • the storage media 2040 may store one or more operating systems, application programs and program modules such as module 2050, data, or any other information.
  • the storage media 2040 may be part of, or connected to, the computing machine 2000.
  • the storage media 2040 may also be part of one or more other computing machines that are in communication with the computing machine 2000 such as servers, database servers, cloud storage, network attached storage, and so forth.
  • the module 2050 may comprise one or more hardware or software elements configured to facilitate the computing machine 2000 with performing the various methods and processing functions presented herein.
  • the module 2050 may include one or more sequences of instructions stored as software or firmware in association with the system memory 2030, the storage media 2040, or both.
  • the storage media 2040 may therefore represent examples of machine or computer readable media on which instructions or code may be stored for execution by the processor 2010.
  • Machine or computer readable media may generally refer to any medium or media used to provide instructions to the processor 2010.
  • Such machine or computer readable media associated with the module 2050 may comprise a computer software product.
  • a computer software product comprising the module 2050 may also be associated with one or more processes or methods for delivering the module 2050 to the computing machine 2000 via the network 2080, any signal-bearing medium, or any other communication or delivery technology.
  • the module 2050 may also comprise hardware circuits or information for configuring hardware circuits such as microcode or configuration information for an FPGA or other PLD.
  • the input/output (“I/O”) interface 2060 may be configured to couple to one or more external devices, to receive data from the one or more external devices, and to send data to the one or more external devices. Such external devices along with the various internal devices may also be known as peripheral devices.
  • the I/O interface 2060 may include both electrical and physical connections for operably coupling the various peripheral devices to the computing machine 2000 or the processor 2010.
  • the I/O interface 2060 may be configured to communicate data, addresses, and control signals between the peripheral devices, the computing machine 2000, or the processor 2010.
  • the I/O interface 2060 may be configured to implement any standard interface, such as small computer system interface (“SCSI”), serial-attached SCSI (“SAS”), fiber channel, peripheral component interconnect (“PCI”), PCI express (PCIe), serial bus, parallel bus, advanced technology attached (“ATA”), serial ATA (“SAT A”), universal serial bus (“USB”), Thunderbolt, Fire Wire, various video buses, and the like.
  • SCSI small computer system interface
  • SAS serial-attached SCSI
  • PCIe peripheral component interconnect
  • PCIe PCI express
  • serial bus parallel bus
  • advanced technology attached ATA
  • serial SAT A serial ATA
  • USB universal serial bus
  • Thunderbolt Fire Wire
  • the I/O interface 2060 may be configured to implement only one interface or bus technology.
  • the I/O interface 2060 may be configured to implement multiple interfaces or bus technologies.
  • the I/O interface 2060 may be configured as part of, all of, or to operate in conjunction with, the system bus 2020.
  • the I/O interface 2060 may couple the computing machine 2000 to various input devices including mice, touch-screens, scanners, electronic digitizers, sensors, receivers, touchpads, trackballs, cameras, microphones, keyboards, any other pointing devices, or any combinations thereof.
  • the I/O interface 2060 may couple the computing machine 2000 to various output devices including video displays, speakers, printers, projectors, tactile feedback devices, automation control, robotic components, actuators, motors, fans, solenoids, valves, pumps, transmitters, signal emitters, lights, and so forth.
  • the computing machine 2000 may operate in a networked environment using logical connections through the network interface 2070 to one or more other systems or computing machines across the network 2080.
  • the network 2080 may include wide area networks (WAN), local area networks (LAN), intranets, the Internet, wireless access networks, wired networks, mobile networks, telephone networks, optical networks, or combinations thereof.
  • the network 2080 may be packet switched, circuit switched, of any topology, and may use any communication protocol. Communication links within the network 2080 may involve various digital or an analog communication media such as fiber optic cables, free-space optics, waveguides, electrical conductors, wireless links, antennas, radio-frequency communications, and so forth.
  • the processor 2010 may be connected to the other elements of the computing machine 2000 or the various peripherals discussed herein through the system bus 2020. It should be appreciated that the system bus 2020 may be within the processor 2010, outside the processor 2010, or both. According to certain examples, any of the processor 2010, the other elements of the computing machine 2000, or the various peripherals discussed herein may be integrated into a single device such as a system on chip (“SOC”), system on package (“SOP”), or ASIC device.
  • SOC system on chip
  • SOP system on package
  • ASIC application specific integrated circuit
  • the users may be provided with an opportunity or option to control whether programs or features collect user information (e.g., information about a user's social network, social actions or activities, profession, a user's preferences, or a user's current location), or to control whether and/or how to receive content from the content server that may be more relevant to the user.
  • user information e.g., information about a user's social network, social actions or activities, profession, a user's preferences, or a user's current location
  • certain data may be treated in one or more ways before it is stored or used, so that personally identifiable information is removed.
  • a user's identity may be treated so that no personally identifiable information can be determined for the user, or a user's geographic location may be generalized where location information is obtained (such as to a city, ZIP code, or state level), so that a location of a user cannot be determined.
  • location information such as to a city, ZIP code, or state level
  • the user may have control over how information is collected about the user and used by a content server.
  • Examples may comprise a computer program that embodies the functions described and illustrated herein, wherein the computer program is implemented in a computer system that comprises instructions stored in a machine-readable medium and a processor that executes the instructions.
  • the examples should not be construed as limited to any one set of computer program instructions.
  • a skilled programmer would be able to write such a computer program to implement an example of the disclosed examples based on the appended flow charts and associated description in the application text. Therefore, disclosure of a particular set of program code instructions is not considered necessary for an adequate understanding of how to make and use examples.
  • the examples described herein can be used with computer hardware and software that perform the methods and processing functions described herein.
  • the systems, methods, and procedures described herein can be embodied in a programmable computer, computer-executable software, or digital circuitry.
  • the software can be stored on computer- readable media.
  • computer-readable media can include a floppy disk, RAM, ROM, hard disk, removable media, flash memory, memory stick, optical media, magneto- optical media, CD-ROM, etc.
  • Digital circuitry can include integrated circuits, gate arrays, building block logic, field programmable gate arrays (FPGA), etc.

Abstract

Les techniques de la présente invention concernent des procédés mis en œuvre par ordinateur pour masquer dynamiquement un transfert de données basé sur un contenu audio. Un dispositif informatique code des données à transférer dans un fichier audio qui produit un son déplaisant à l'oreille humaine. Le dispositif informatique détermine des points de fréquence et une amplitude pour le fichier audio codé, et crée un fichier sonore de masquage sur la base des points de fréquence et de l'amplitude déterminés. Le fichier sonore de masquage peut comprendre un son de masquage qui est agréable à l'oreille humaine. Le dispositif informatique lit le fichier audio codé et le fichier sonore de masquage. Dans un exemple, le dispositif informatique combine le fichier audio codé et le fichier sonore de masquage en un seul fichier sonore et lit un seul fichier sonore. Selon un autre exemple, le dispositif informatique lit le fichier audio codé et le fichier sonore de masquage en tant que deux fichiers sonores séparés simultanément.
PCT/US2018/036783 2017-08-16 2018-06-08 Masquage dynamique de transfert de données audio WO2019036092A1 (fr)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201880053363.8A CN110998711A (zh) 2017-08-16 2018-06-08 动态音频数据传输掩蔽

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US201762546133P 2017-08-16 2017-08-16
US62/546,133 2017-08-16

Publications (1)

Publication Number Publication Date
WO2019036092A1 true WO2019036092A1 (fr) 2019-02-21

Family

ID=62815149

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2018/036783 WO2019036092A1 (fr) 2017-08-16 2018-06-08 Masquage dynamique de transfert de données audio

Country Status (2)

Country Link
CN (1) CN110998711A (fr)
WO (1) WO2019036092A1 (fr)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20180294905A1 (en) * 2017-04-10 2018-10-11 Google Llc Mobile service requests to any sound emitting device

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113593602B (zh) * 2021-07-19 2023-12-05 深圳市雷鸟网络传媒有限公司 一种音频处理方法、装置、电子设备和存储介质

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070083361A1 (en) * 2005-10-12 2007-04-12 Samsung Electronics Co., Ltd. Method and apparatus for disturbing the radiated voice signal by attenuation and masking
US20100104112A1 (en) * 2008-10-23 2010-04-29 Temic Automotive Of North America, Inc. Variable Noise Masking During Periods of Substantial Silence
US20140006017A1 (en) * 2012-06-29 2014-01-02 Qualcomm Incorporated Systems, methods, apparatus, and computer-readable media for generating obfuscated speech signal
WO2014191798A1 (fr) * 2013-05-31 2014-12-04 Nokia Corporation Appareil de scene audio

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR101764926B1 (ko) * 2009-12-10 2017-08-03 삼성전자주식회사 음향 통신을 위한 장치 및 방법
JP5644359B2 (ja) * 2010-10-21 2014-12-24 ヤマハ株式会社 音声処理装置
CN104505096B (zh) * 2014-05-30 2018-02-27 华南理工大学 一种用音乐传输隐藏信息的方法及装置
US10134416B2 (en) * 2015-05-11 2018-11-20 Microsoft Technology Licensing, Llc Privacy-preserving energy-efficient speakers for personal sound
CN205028649U (zh) * 2015-09-29 2016-02-10 苏州一天声学科技有限公司 多通道声音掩蔽器

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070083361A1 (en) * 2005-10-12 2007-04-12 Samsung Electronics Co., Ltd. Method and apparatus for disturbing the radiated voice signal by attenuation and masking
US20100104112A1 (en) * 2008-10-23 2010-04-29 Temic Automotive Of North America, Inc. Variable Noise Masking During Periods of Substantial Silence
US20140006017A1 (en) * 2012-06-29 2014-01-02 Qualcomm Incorporated Systems, methods, apparatus, and computer-readable media for generating obfuscated speech signal
WO2014191798A1 (fr) * 2013-05-31 2014-12-04 Nokia Corporation Appareil de scene audio

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20180294905A1 (en) * 2017-04-10 2018-10-11 Google Llc Mobile service requests to any sound emitting device
US10833786B2 (en) * 2017-04-10 2020-11-10 Google Llc Mobile service requests to any sound emitting device
US11431426B2 (en) 2017-04-10 2022-08-30 Google Llc Mobile service requests to any sound emitting device

Also Published As

Publication number Publication date
CN110998711A (zh) 2020-04-10

Similar Documents

Publication Publication Date Title
KR102660922B1 (ko) 복수의 지능형 개인 비서 서비스를 위한 관리 계층
US10812423B2 (en) Method, apparatus, system, and non-transitory computer readable medium for chatting on mobile device using an external device
RU2689203C2 (ru) Гибкая схема для настройки языковой модели
CN111192591A (zh) 智能设备的唤醒方法、装置、智能音箱及存储介质
CN110266505B (zh) 一种管理会话群的方法与设备
JP6337066B2 (ja) 位置依存的無線スピーカ構成の技術
EP3147730B1 (fr) Procédé de configuration de paramètres de haut-parleur, terminal mobile, serveur et système
CN104144093A (zh) 一种智能设备控制方法及相关设备、系统
CN109151671B (zh) 音频处理装置、音频处理方法和计算机程序产品
CN110062309B (zh) 用于控制智能音箱的方法和装置
CN110765395B (zh) 一种用于提供小说信息的方法与设备
WO2019129127A1 (fr) Procédé de lecture coopérative multi-terminal d'un fichier audio et terminal
CN111787540A (zh) 接入物联网的方法、装置、电子设备及可读存储介质
WO2019036092A1 (fr) Masquage dynamique de transfert de données audio
EP3659274B1 (fr) Étalonnage dynamique d'un transfert de données audio
CN104994237A (zh) 音频接入方法、设备及wifi耳机
US11683104B2 (en) Audio based service set identifier
US20170178636A1 (en) Method and electronic device for jointly playing high-fidelity sounds of multiple players
JP2015529999A (ja) プッシュ管理スキーム
WO2019001073A1 (fr) Procédé et appareil d'appel de processus à distance, et dispositif informatique
CN113518297A (zh) 音箱交互方法、装置、系统和音箱
CN112788004B (zh) 一种通过虚拟会议机器人执行指令的方法、设备与计算机可读介质
US11979197B2 (en) Audio pairing between electronic devices
US11317289B2 (en) Audio communication tokens
CN113892080A (zh) 将多个端点表示为一个端点的设备聚合

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18737081

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 18737081

Country of ref document: EP

Kind code of ref document: A1