CN112752191A - Audio acquisition method, device and storage medium - Google Patents

Audio acquisition method, device and storage medium Download PDF

Info

Publication number
CN112752191A
CN112752191A CN201911052026.XA CN201911052026A CN112752191A CN 112752191 A CN112752191 A CN 112752191A CN 201911052026 A CN201911052026 A CN 201911052026A CN 112752191 A CN112752191 A CN 112752191A
Authority
CN
China
Prior art keywords
audio
acquisition
frequency
mobile terminal
components
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201911052026.XA
Other languages
Chinese (zh)
Inventor
刘力
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Xiaomi Mobile Software Co Ltd
Original Assignee
Beijing Xiaomi Mobile Software Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Xiaomi Mobile Software Co Ltd filed Critical Beijing Xiaomi Mobile Software Co Ltd
Priority to CN201911052026.XA priority Critical patent/CN112752191A/en
Publication of CN112752191A publication Critical patent/CN112752191A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R3/00Circuits for transducers, loudspeakers or microphones

Abstract

The present disclosure relates to an audio acquisition method, an audio acquisition device and a storage medium, wherein the method is applied to a mobile terminal, and at least two audio acquisition components are arranged at different positions on the mobile terminal, and the method comprises the following steps: outputting a first audio frequency with a preset frequency by using an audio output component contained in the mobile terminal, wherein the preset frequency is as follows: the frequency of ultrasonic waves or the frequency of infrasonic waves; acquiring a second audio frequency returned by acting on a target sound source based on the first audio frequency by utilizing at least two audio frequency acquisition components contained in the mobile terminal; determining a relative position between the target sound source and the mobile terminal based on the acquisition time of the second audio acquired by the at least two audio acquisition components; adjusting the acquisition parameters of the audio acquisition assembly according to the relative position; and carrying out audio acquisition on the sound emitted by the target sound source based on the adjusted acquisition parameters. Through the technical scheme of the disclosure, the calculation amount and the convergence time required for determining the relative position can be reduced.

Description

Audio acquisition method, device and storage medium
Technical Field
The present disclosure relates to the field of communications technologies, and in particular, to an audio acquisition method, an audio acquisition device, and a storage medium.
Background
As the functionality of audio capture components of mobile terminals, such as microphones, has increased, more and more users have chosen to use microphones on mobile terminals for audio capture. For better acquisition, the position of the sound source needs to be localized before the microphone performs audio acquisition. Such as positioning by sound collection using a microphone in advance, the positioning method in the related art has a problem of long convergence time.
Disclosure of Invention
The present disclosure provides an audio acquisition method, apparatus, and storage medium.
According to a first aspect of the embodiments of the present disclosure, there is provided an audio acquisition method, where the method is applied to a mobile terminal, and at least two audio acquisition components are disposed at different positions on the mobile terminal, including:
outputting a first audio frequency with a preset frequency by using an audio output component contained in the mobile terminal, wherein the preset frequency is as follows: the frequency of ultrasonic waves or the frequency of infrasonic waves;
acquiring a second audio frequency returned by acting on a target sound source based on the first audio frequency by utilizing at least two audio frequency acquisition components contained in the mobile terminal;
determining a relative position between the target sound source and the mobile terminal based on the acquisition time of the second audio acquired by the at least two audio acquisition components;
adjusting the acquisition parameters of the audio acquisition assembly according to the relative position;
and carrying out audio acquisition on the sound emitted by the target sound source based on the adjusted acquisition parameters.
Optionally, the determining the relative position between the target sound source and the mobile terminal based on the acquisition time of the second audio acquired by the at least two audio acquisition components includes:
determining a time difference between the acquisition times of the second audio acquired by each of the at least two audio acquisition assemblies;
determining the relative position based on the time difference.
Optionally, the determining a time difference between the capturing times of the second audios captured by each of the at least two audio capturing assemblies includes:
determining two audio acquisition components which acquire the second audio firstly in the at least two audio acquisition components;
determining a time difference between the acquisition times of the second audio acquired by the two audio acquisition components.
Optionally, the acquisition parameter includes at least one of:
the acquisition direction of the audio acquisition assembly;
the acquisition power of the audio acquisition assembly.
According to a second aspect of the embodiments of the present disclosure, there is provided an audio acquisition apparatus including:
the audio output module is configured to output a first audio with a predetermined frequency by using an audio output component included in the mobile terminal, wherein the predetermined frequency is: the frequency of ultrasonic waves or the frequency of infrasonic waves;
the first acquisition module is configured to acquire a second audio frequency returned by acting on a target sound source based on the first audio frequency by utilizing at least two audio frequency acquisition components contained in the mobile terminal;
a position determination module configured to determine a relative position between the target sound source and the mobile terminal based on a collection time of the second audio collected by the at least two audio collection components;
the parameter adjusting module is configured to adjust the acquisition parameters of the audio acquisition assembly according to the relative position;
and the second acquisition module is configured to acquire the audio of the sound emitted by the target sound source based on the adjusted acquisition parameters.
Optionally, the position determining module includes:
a first determining submodule configured to determine a time difference between acquisition times of the second audio acquired by each of the at least two audio acquisition assemblies;
a second determination submodule configured to determine the relative position based on the time difference.
Optionally, the first determining sub-module is further configured to:
determining two audio acquisition components which acquire the second audio firstly in the at least two audio acquisition components;
determining a time difference between the acquisition times of the second audio acquired by the two audio acquisition components.
Optionally, the at least two audio collection assemblies include four microphones, and the four microphones are respectively located on four side surfaces of the mobile terminal.
According to a third aspect of the embodiments of the present disclosure, there is provided an audio acquisition apparatus including:
a processor;
a memory configured to store processor-executable instructions;
wherein the processor is configured to: when executed, implement the steps in the audio acquisition method of the first aspect.
According to a fourth aspect of embodiments of the present disclosure, there is provided a non-transitory computer-readable storage medium, wherein instructions of the storage medium, when executed by a processor of an audio acquisition apparatus, enable the apparatus to perform the audio acquisition method of the first aspect.
The technical scheme provided by the embodiment of the disclosure can have the following beneficial effects:
according to the embodiment, in the disclosure, the first audio frequency of which the output frequency is the frequency of the ultrasonic wave or the frequency of the infrasound wave is output by the audio output assembly included in the mobile terminal, and the second audio frequency returned by acting on the target sound source based on the first audio frequency is respectively collected by the at least two audio collection assemblies, so that the relative position between the target sound source and the mobile terminal is determined by the collection time of the second audio frequency collected by the at least two audio collection assemblies, and then the collection parameters of the audio collection assemblies are adjusted according to the relative position, and the audio collection is performed on the sound emitted by the target sound source based on the adjusted collection parameters. Because the positioning is carried out by utilizing the ultrasonic waves or the infrasonic waves, the sound between the ultrasonic waves and the infrasonic waves can be collected by utilizing the microphone in the positioning process, so that the interference of the sound waves in the space where the mobile terminal is positioned on the positioning is small, and the positioning convergence can be quickly realized; meanwhile, due to the frequency difference among the ultrasonic waves, the infrasonic waves and the sound waves, the sound collection does not need to be carried out after the positioning is finished, and equivalently, the sound source positioning and the sound collection can be carried out synchronously, so that the waiting time for the positioning is shortened.
It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the disclosure.
Drawings
The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the present disclosure and together with the description, serve to explain the principles of the disclosure.
Fig. 1 is a flow chart of an audio acquisition method shown in accordance with an exemplary embodiment.
Fig. 2 is a flowchart illustrating an audio acquisition method according to an exemplary embodiment.
Fig. 3 is a flowchart illustrating an audio acquisition method according to an exemplary embodiment.
Fig. 4 is a schematic diagram illustrating a structure of a handset according to an exemplary embodiment.
Fig. 5 is a block diagram of an audio capture device, shown in accordance with an exemplary embodiment.
Fig. 6 is a hardware block diagram illustrating an audio capture device according to an example embodiment.
Detailed Description
Reference will now be made in detail to the exemplary embodiments, examples of which are illustrated in the accompanying drawings. When the following description refers to the accompanying drawings, like numbers in different drawings represent the same or similar elements unless otherwise indicated. The implementations described in the exemplary embodiments below are not intended to represent all implementations consistent with the present disclosure. Rather, they are merely examples of apparatus and methods consistent with certain aspects of the present disclosure, as detailed in the appended claims.
An audio capture method is provided in an embodiment of the present disclosure, and fig. 1 is a flowchart illustrating an audio capture method according to an exemplary embodiment, where as shown in fig. 1, the method mainly includes the following steps:
in step 101, a first audio with a predetermined frequency is output by using an audio output component included in the mobile terminal, where the predetermined frequency is: the frequency of the ultrasonic waves or the frequency of the infrasonic waves.
Here, the predetermined frequency is an arbitrary frequency inaudible to the human ear. The mobile terminal may include: a mobile phone, a tablet computer, or a wearable device, and in an embodiment of the present disclosure, the mobile terminal includes an audio output component and an audio acquisition component. Taking a mobile phone as an example, the audio output component in the mobile phone may include a speaker, and the audio collection component may include a microphone. Wherein, the frequency of the first audio frequency can be the frequency of ultrasonic wave or the frequency of infrasonic wave.
In step 102, a second audio which acts on the target sound source and returns based on the first audio is collected by utilizing at least two audio collection components contained in the mobile terminal.
Here, after the audio output module outputs the first audio, the first audio may propagate in the form of sound wave in space, and when propagating, reflection, refraction, scattering, diffraction, and transmission may be generated, and the propagation of the first audio is greatly affected (transmitted) by the thickness or material of the target sound source, and is also affected by environmental parameters such as temperature. In addition, the first audio may be diffracted, scattered, or attenuated when propagating in space. The mobile terminal acquires a second audio which is not an original first audio but acts on a target sound source to return based on the first audio by utilizing an own audio acquisition component.
The mobile terminal may include one or more audio capture components, among others. When the mobile terminal only comprises one audio acquisition component, the audio acquisition component can be arranged at the bottom, the top, the left side or the right side of the mobile terminal; when mobile terminal contains a plurality of audio acquisition subassemblies, can set up each audio acquisition subassembly in the different positions on mobile terminal respectively, here, can set up the audio acquisition subassembly in the different positions of mutual symmetry on mobile terminal to can all-roundly cover the near space of mobile terminal, gather the audio signal of each angle, and then improve the audio acquisition effect.
For example, if the mobile terminal includes two audio capturing components, the two audio capturing components may be disposed at the top and the bottom of the mobile terminal, respectively, or disposed at the left and the right of the mobile terminal, respectively. If the mobile terminal comprises three audio acquisition components, the three audio acquisition components can be respectively arranged at the left side, the right side and the bottom of the mobile terminal, or the three audio acquisition components can be respectively arranged at the top, the bottom and the right side of the mobile terminal. If the mobile terminal comprises four audio acquisition components, the four audio acquisition components can be respectively arranged at the top, the bottom, the left side and the right side of the mobile terminal.
And when the mobile terminal comprises a plurality of audio acquisition components, each audio acquisition component can respectively acquire second audio returned by acting on the target sound source based on the first audio based on different positions of each audio acquisition component.
In step 103, a relative position between the target sound source and the mobile terminal is determined based on the acquisition time of the second audio acquired by the at least two audio acquisition components.
Here, when there are a plurality of audio capturing assemblies, since the position of each audio capturing assembly is different, the capturing time for each audio capturing assembly to capture the second audio is also different, that is, each audio capturing assembly has a capturing time, and each capturing time is different. The target sound source may be an object that affects the propagation of the first audio, such as a human body, or other objects that may affect the propagation of the first audio.
In step 104, the acquisition parameters of the audio acquisition component are adjusted according to the relative position.
Here, after the relative position is determined, the acquisition parameter of the audio acquisition component may be adjusted according to the relative position, where the acquisition parameter may be a parameter in an algorithm for performing audio acquisition by the audio acquisition component, and is used to adjust an audio acquisition effect.
In step 105, audio acquisition is performed on the sound emitted by the target sound source based on the adjusted acquisition parameters.
Here, when the target sound source is a person, the person is a sound source. Taking the acquisition parameter as one parameter in the algorithm for audio acquisition as an example, the voice of the person speaking can be acquired based on the adjusted acquisition parameter algorithm.
In the embodiment of the disclosure, according to the acquisition time of the second audio acquired by each audio acquisition component and the propagation speed of the first audio in the space, the relative position between the target sound source and the mobile terminal can be directly calculated, and then the acquisition parameters in the audio acquisition algorithm are adjusted according to the relative position, so that the audio acquisition effect can be improved. Because the ultrasonic wave or the infrasonic wave is used for positioning, the audio acquisition assembly can be used for acquiring the sound between the ultrasonic wave and the infrasonic wave in the positioning process, the interference of the sound wave in the space where the mobile terminal is located to the positioning is small, and the positioning convergence can be quickly realized; meanwhile, due to the frequency difference among the ultrasonic waves, the infrasonic waves and the sound waves, the sound collection does not need to be carried out after the positioning is finished, and equivalently, the sound source positioning and the sound collection can be carried out synchronously, so that the waiting time for the positioning is shortened.
Fig. 2 is a flowchart of a second audio acquisition method according to an exemplary embodiment, as shown in fig. 2, the method mainly includes the following steps:
in step 201, a first audio with a predetermined frequency is output by using an audio output component included in the mobile terminal, where the predetermined frequency is: the frequency of the ultrasonic waves or the frequency of the infrasonic waves.
In step 202, at least two audio acquisition components included in the mobile terminal are utilized to acquire second audio returned by acting on the target sound source based on the first audio.
In step 203, a time difference between the capturing times of the second audio captured by each of the at least two audio capturing components is determined.
Here, the second audio capturing time is a time when the second audio is captured by the at least two audio capturing components. In the embodiment of the disclosure, when there are a plurality of audio acquisition assemblies, because the position of each audio acquisition assembly is different, the acquisition time for each audio acquisition assembly to acquire the second audio is also different, that is, each audio acquisition assembly corresponds to an acquisition time, and each acquisition time is different. In the embodiment of the present disclosure, the time difference between different acquisition times may be calculated respectively.
In step 204, a relative position is determined based on the time difference.
Here, after the time difference between the respective different acquisition times is calculated, the relative distance between the target sound source and the movement can be determined based on the time difference and the propagation speed of the first audio in space, and the relative position can be determined more accurately.
In step 205, the acquisition parameters of the audio acquisition component are adjusted according to the relative position.
In step 206, audio acquisition is performed on the sound emitted by the target sound source based on the adjusted acquisition parameters.
In the embodiment of the disclosure, the audio acquisition component and the audio output component which are contained in the mobile terminal can be directly utilized to determine the relative position between the target sound source and the movement, and the audio signal sent by the target sound source is acquired, so that a hardware device does not need to be additionally arranged, and the audio acquisition effect can be improved.
Fig. 3 is a flowchart illustrating a third method for audio acquisition according to an exemplary embodiment, as shown in fig. 3, the method mainly includes the following steps:
in step 301, a first audio with a predetermined frequency is output by an audio output component included in the mobile terminal, where the predetermined frequency is: the frequency of the ultrasonic waves or the frequency of the infrasonic waves.
In step 302, at least two audio acquisition components included in the mobile terminal are utilized to acquire a second audio which is returned by acting on the target sound source based on the first audio.
In step 303, the two audio capture components of the at least two audio capture components that capture the second audio first are determined.
In step 304, a time difference between the capture times of the second audio captured by the two audio capture components is determined.
In step 305, a relative position is determined based on the time difference.
In step 306, the acquisition parameters of the audio acquisition component are adjusted according to the relative position.
In step 307, audio acquisition is performed on the sound emitted by the target sound source based on the adjusted acquisition parameters.
Here, when a plurality of audio capture components are installed on the mobile terminal, if the capture time of the second audio captured by each audio capture component is calculated to determine the relative position, the amount of calculation is increased, and excessive resources of the mobile terminal are occupied. At this time, by determining two audio acquisition assemblies which acquire the second audio firstly, and based on the time difference between the acquisition times of the second audio acquired by the two audio acquisition assemblies, the time difference of the acquisition times corresponding to the audio acquisition assemblies which meet the set conditions is determined in a targeted manner, and then the relative position is determined, so that not only can the calculated amount be further reduced, but also the resource utilization rate of the mobile terminal can be improved.
Here, by determining the two audio capturing components that receive the second audio first, it can be determined that the direction of the sound source is between the two audio capturing components, and the approximate angular range can be determined by the ratio of the time difference between the capturing times corresponding to the two audio capturing components. Two audio acquisition assemblies are determined from the plurality of audio acquisition assemblies, and the relative position is determined based on the ratio of the time difference of the acquisition time corresponding to the two audio acquisition assemblies, so that the calculated amount can be further reduced, and the resource utilization rate of the mobile terminal can be improved.
In other optional embodiments, the method further comprises: and when the mobile terminal receives the starting instructions of the at least two audio acquisition components, starting the audio output components contained in the mobile terminal.
Here, when the mobile terminal receives an opening instruction of the audio acquisition component, the audio output component is started, and before an audio signal sent by a sound source is acquired, the target sound source can be positioned based on the first audio output by the audio output component, so that the time for acquiring the audio signal can be saved, and the convergence time can be shortened.
In other optional embodiments, the acquisition parameters include at least one of: the acquisition direction of the audio acquisition assembly; the acquisition power of the audio acquisition component.
In an optional embodiment, taking the mobile terminal as a mobile phone as an example, when a person speaking is recorded by using the mobile phone, because the position of the person who is not speaking is uncertain, an algorithm for performing directional recording by using an array formed by multiple microphones often needs a certain convergence time, which may affect the recording effect. Therefore, the present embodiment provides a method for performing auxiliary positioning by using ultrasonic waves to improve the recording effect. For example, when a mobile phone is used for recording and a person speaking is located near the mobile phone, ultrasonic waves can be emitted through a loudspeaker of the mobile phone and received through a microphone, the direction of the human body can be positioned according to the arrival time difference of echoes reflected by the human body and received by each microphone on the mobile phone, so that the relative position of the human body and the mobile phone is determined, and thus the parameters of a multi-microphone array recording algorithm can be adjusted according to the position during recording, and the recording effect is improved.
In an optional embodiment, the mobile terminal may include at least two audio acquisition components, and taking the mobile terminal includes four audio acquisition components as an example, the four audio acquisition components may be respectively disposed on the top, the bottom, the left side and the right side of the mobile terminal, so as to enable the acquisition of audio signals from various angles.
Here, taking the mobile terminal as a mobile phone and the audio collection component as a microphone as an example, fig. 4 is a schematic structural diagram of a mobile phone according to an exemplary embodiment, as shown in fig. 4, a mobile phone 50 includes four microphones, wherein a first microphone 51 is located at the top of the mobile phone 50; the second microphone 52 is located at the bottom of the handset 50; the third microphone 53 is located on the left side of the handset 50; the fourth microphone 54 is located on the right side of the handset 50 and can be used for audio acquisition from various angles based on the respective microphones provided on the handset.
In an alternative embodiment, a microphone is disposed on each of the upper, lower, left, and right sides of the mobile phone, and then ultrasonic waves (first audio) are emitted through a speaker on the mobile phone, and when the ultrasonic waves encounter a target sound source, a reflected signal (second audio) is generated. In this case, the reflected signals may be received by four microphones, the times at which the reflected signals are received by the four microphones may be recorded, the two microphones which received the reflected signals first may be determined, the direction of the sound source may be determined to be between the two microphones, and the angular range of the sound source may be estimated by the ratio of the times at which the reflected signals are received by the two microphones.
Fig. 5 is a block diagram of an audio capture device, shown in accordance with an exemplary embodiment. As shown in fig. 5, the audio capture device 500 mainly includes:
the audio output module 501 is configured to output a first audio with a predetermined frequency by using an audio output component included in the mobile terminal, where the predetermined frequency is: the frequency of ultrasonic waves or the frequency of infrasonic waves;
a first collecting module 502 configured to collect, by using at least two audio collecting components included in the mobile terminal, a second audio returned by acting on a target sound source based on the first audio;
a position determination module 503 configured to determine a relative position between the target sound source and the mobile terminal based on a collection time of the second audio collected by the at least two audio collection components;
a parameter adjusting module 504 configured to adjust the acquisition parameters of the audio acquisition component according to the relative position;
and a second collecting module 505 configured to perform audio collection on the sound emitted by the target sound source based on the adjusted collecting parameters.
In other optional embodiments, the position determining module 503 includes:
a first determining submodule configured to determine a time difference between acquisition times of the second audio acquired by each of the at least two audio acquisition assemblies;
a second determination submodule configured to determine the relative position based on the time difference.
In other optional embodiments, the first determining sub-module is further configured to:
determining two audio acquisition components which acquire the second audio firstly in the at least two audio acquisition components;
determining a time difference between the acquisition times of the second audio acquired by the two audio acquisition components.
In other optional embodiments, the at least two audio acquisition components include four microphones, and the four microphones are respectively located on four sides of the mobile terminal.
In other optional embodiments, the acquisition parameters include at least one of: the acquisition direction of the audio acquisition assembly; the acquisition power of the audio acquisition component.
With regard to the apparatus in the above-described embodiment, the specific manner in which each module performs the operation has been described in detail in the embodiment related to the method, and will not be elaborated here.
Correspondingly, this disclosure still provides audio acquisition device, includes: a processor; a memory for storing processor-executable instructions; wherein the processor is configured to: when executed, implement the steps of any of the audio acquisition methods in the above embodiments.
Fig. 6 is a hardware block diagram illustrating an audio capture device 800 according to an example embodiment. For example, the apparatus 800 may be a mobile phone, a computer, a digital broadcast terminal, a messaging device, a game console, a tablet device, a medical device, an exercise device, a personal digital assistant, and the like.
Referring to fig. 6, the apparatus 800 may include one or more of the following components: a processing component 802, a memory 804, a power component 806, a multimedia component 808, an audio component 810, an input/output (I/O) interface 812, a sensor component 814, and a communication component 816.
The processing component 802 generally controls overall operation of the device 800, such as operations associated with display, telephone calls, data communications, camera operations, and recording operations. The processing components 802 may include one or more processors 820 to execute instructions to perform all or a portion of the steps of the methods described above. Further, the processing component 802 can include one or more modules that facilitate interaction between the processing component 802 and other components. For example, the processing component 802 can include a multimedia module to facilitate interaction between the multimedia component 808 and the processing component 802.
The memory 804 is configured to store various types of data to support operation at the device 800. Examples of such data include instructions for any application or method operating on device 800, contact data, phonebook data, messages, pictures, videos, and so forth. The memory 804 may be implemented by any type or combination of volatile or non-volatile memory devices such as Static Random Access Memory (SRAM), electrically erasable programmable read-only memory (EEPROM), erasable programmable read-only memory (EPROM), programmable read-only memory (PROM), read-only memory (ROM), magnetic memory, flash memory, magnetic or optical disks.
Power component 806 provides power to the various components of device 800. The power components 806 may include a power management system, one or more power sources, and other components associated with generating, managing, and distributing power for the device 800.
The multimedia component 808 includes a screen that provides an output interface between the device 800 and a user. In some embodiments, the screen may include a Liquid Crystal Display (LCD) and a Touch Panel (TP). If the screen includes a touch panel, the screen may be implemented as a touch screen to receive an input signal from a user. The touch panel includes one or more touch sensors to sense touch, slide, and gestures on the touch panel. The touch sensor may not only sense the boundary of a touch or slide action, but also detect the duration and pressure associated with the touch or slide operation. In some embodiments, the multimedia component 808 includes a front facing camera and/or a rear facing camera. The front-facing camera and/or the rear-facing camera may receive external multimedia data when the device 800 is in an operating mode, such as a shooting mode or a video mode. Each front camera and rear camera may be a fixed optical lens system or have a focal length and optical zoom capability.
The audio component 810 is configured to output and/or input audio signals. For example, the audio component 810 includes a Microphone (MIC) configured to receive external audio signals when the apparatus 800 is in an operational mode, such as a call mode, a recording mode, and a voice recognition mode. The received audio signals may further be stored in the memory 804 or transmitted via the communication component 816. In some embodiments, audio component 810 also includes a speaker for outputting audio signals.
The I/O interface 812 provides an interface between the processing component 802 and peripheral interface modules, which may be keyboards, click wheels, buttons, etc. These buttons may include, but are not limited to: a home button, a volume button, a start button, and a lock button.
The sensor assembly 814 includes one or more sensors for providing various aspects of state assessment for the device 800. For example, the sensor assembly 814 may detect the open/closed state of the device 800, the relative positioning of the components, such as a display and keypad of the apparatus 800, the sensor assembly 814 may also detect a change in position of the apparatus 800 or a component of the apparatus 800, the presence or absence of user contact with the apparatus 800, orientation or acceleration/deceleration of the apparatus 800, and a change in temperature of the apparatus 800. Sensor assembly 814 may include a proximity sensor configured to detect the presence of a nearby object without any physical contact. The sensor assembly 814 may also include a light sensor, such as a CMOS or CCD image sensor, for use in imaging applications. In some embodiments, the sensor assembly 814 may also include an acceleration sensor, a gyroscope sensor, a magnetic sensor, a pressure sensor, or a temperature sensor.
The communication component 816 is configured to facilitate communications between the apparatus 800 and other devices in a wired or wireless manner. The device 800 may access a wireless network based on a communication standard, such as WiFi, 2G or 3G, or a combination thereof. In an exemplary embodiment, the communication component 816 receives a broadcast signal or broadcast related information from an external broadcast management system via a broadcast channel. In an exemplary embodiment, the communication component 816 further includes a Near Field Communication (NFC) module to facilitate short-range communications. For example, the NFC module may be implemented based on Radio Frequency Identification (RFID) technology, infrared data association (IrDA) technology, Ultra Wideband (UWB) technology, Bluetooth (BT) technology, and other technologies.
In an exemplary embodiment, the apparatus 800 may be implemented by one or more Application Specific Integrated Circuits (ASICs), Digital Signal Processors (DSPs), Digital Signal Processing Devices (DSPDs), Programmable Logic Devices (PLDs), Field Programmable Gate Arrays (FPGAs), controllers, micro-controllers, microprocessors or other electronic components for performing the above-described methods.
In an exemplary embodiment, a non-transitory computer-readable storage medium comprising instructions, such as the memory 804 comprising instructions, executable by the processor 820 of the device 800 to perform the above-described method is also provided. For example, the non-transitory computer readable storage medium may be a ROM, a Random Access Memory (RAM), a CD-ROM, a magnetic tape, a floppy disk, an optical data storage device, and the like.
Accordingly, the present disclosure also provides a non-transitory computer readable storage medium having instructions that, when executed by an audio capture device, enable a mobile terminal to perform an audio capture method, the method comprising:
outputting a first audio frequency with a preset frequency by using an audio output component contained in the mobile terminal, wherein the preset frequency is as follows: the frequency of ultrasonic waves or the frequency of infrasonic waves;
acquiring a second audio frequency returned by acting on a target sound source based on the first audio frequency by utilizing at least two audio frequency acquisition components contained in the mobile terminal;
determining a relative position between the target sound source and the mobile terminal based on the acquisition time of the second audio acquired by the at least two audio acquisition components;
adjusting the acquisition parameters of the audio acquisition assembly according to the relative position;
and carrying out audio acquisition on the sound emitted by the target sound source based on the adjusted acquisition parameters.
Other embodiments of the disclosure will be apparent to those skilled in the art from consideration of the specification and practice of the disclosure disclosed herein. This disclosure is intended to cover any variations, uses, or adaptations of the disclosure following, in general, the principles of the disclosure and including such departures from the present disclosure as come within known or customary practice within the art to which the disclosure pertains. It is intended that the specification and examples be considered as exemplary only, with a true scope and spirit of the disclosure being indicated by the following claims.
It will be understood that the present disclosure is not limited to the precise arrangements described above and shown in the drawings and that various modifications and changes may be made without departing from the scope thereof. The scope of the present disclosure is limited only by the appended claims.

Claims (10)

1. The audio acquisition method is applied to a mobile terminal, wherein at least two audio acquisition components are arranged at different positions on the mobile terminal, and the method comprises the following steps:
outputting a first audio frequency with a preset frequency by using an audio output component contained in the mobile terminal, wherein the preset frequency is as follows: the frequency of ultrasonic waves or the frequency of infrasonic waves;
acquiring a second audio frequency returned by acting on a target sound source based on the first audio frequency by utilizing at least two audio frequency acquisition components contained in the mobile terminal;
determining a relative position between the target sound source and the mobile terminal based on the acquisition time of the second audio acquired by the at least two audio acquisition components;
adjusting the acquisition parameters of the audio acquisition assembly according to the relative position;
and carrying out audio acquisition on the sound emitted by the target sound source based on the adjusted acquisition parameters.
2. The method of claim 1, wherein the determining the relative position between the target sound source and the mobile terminal based on the acquisition time of the second audio acquired by the at least two audio acquisition components comprises:
determining a time difference between the acquisition times of the second audio acquired by each of the at least two audio acquisition assemblies;
determining the relative position based on the time difference.
3. The method of claim 2, wherein determining a time difference between acquisition times of the second audio acquired by each of the at least two audio acquisition components comprises:
determining two audio acquisition components which acquire the second audio firstly in the at least two audio acquisition components;
determining a time difference between the acquisition times of the second audio acquired by the two audio acquisition components.
4. The method according to any one of claims 1 to 3, wherein the acquisition parameters include at least one of:
the acquisition direction of the audio acquisition assembly;
the acquisition power of the audio acquisition assembly.
5. An audio acquisition device, comprising:
the audio output module is configured to output a first audio with a predetermined frequency by using an audio output component included in the mobile terminal, wherein the predetermined frequency is: the frequency of ultrasonic waves or the frequency of infrasonic waves;
the first acquisition module is configured to acquire a second audio frequency returned by acting on a target sound source based on the first audio frequency by utilizing at least two audio frequency acquisition components contained in the mobile terminal;
a position determination module configured to determine a relative position between the target sound source and the mobile terminal based on a collection time of the second audio collected by the at least two audio collection components;
the parameter adjusting module is configured to adjust the acquisition parameters of the audio acquisition assembly according to the relative position;
and the second acquisition module is configured to acquire the audio of the sound emitted by the target sound source based on the adjusted acquisition parameters.
6. The apparatus of claim 5, wherein the position determining module comprises:
a first determining submodule configured to determine a time difference between acquisition times of the second audio acquired by each of the at least two audio acquisition assemblies;
a second determination submodule configured to determine the relative position based on the time difference.
7. The apparatus of claim 6, wherein the first determination submodule is further configured to:
determining two audio acquisition components which acquire the second audio firstly in the at least two audio acquisition components;
determining a time difference between the acquisition times of the second audio acquired by the two audio acquisition components.
8. The apparatus according to any one of claims 5 to 7, wherein the at least two audio capturing components comprise four microphones, and the four microphones are respectively located on four sides of the mobile terminal.
9. An audio acquisition device, comprising:
a processor;
a memory configured to store processor-executable instructions;
wherein the processor is configured to: when executed, implement the steps of any of the audio acquisition methods of claims 1 to 4.
10. A non-transitory computer readable storage medium having instructions therein which, when executed by a processor of an audio capture device, enable the device to perform any of the audio capture methods of claims 1-4 above.
CN201911052026.XA 2019-10-31 2019-10-31 Audio acquisition method, device and storage medium Pending CN112752191A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911052026.XA CN112752191A (en) 2019-10-31 2019-10-31 Audio acquisition method, device and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911052026.XA CN112752191A (en) 2019-10-31 2019-10-31 Audio acquisition method, device and storage medium

Publications (1)

Publication Number Publication Date
CN112752191A true CN112752191A (en) 2021-05-04

Family

ID=75641461

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911052026.XA Pending CN112752191A (en) 2019-10-31 2019-10-31 Audio acquisition method, device and storage medium

Country Status (1)

Country Link
CN (1) CN112752191A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113553022A (en) * 2021-07-16 2021-10-26 Oppo广东移动通信有限公司 Equipment adjusting method and device, mobile terminal and storage medium

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2271134A1 (en) * 2009-07-02 2011-01-05 Nxp B.V. Proximity sensor comprising an acoustic transducer for receiving sound signals in the human audible range and for emitting and receiving ultrasonic signals.
US20110129101A1 (en) * 2004-07-13 2011-06-02 1...Limited Directional Microphone
CN102800325A (en) * 2012-08-31 2012-11-28 厦门大学 Ultrasonic-assisted microphone array speech enhancement device
CN106054133A (en) * 2016-05-11 2016-10-26 北京地平线信息技术有限公司 Far field sound source positioning system and method
CN106125771A (en) * 2016-08-16 2016-11-16 江西联创宏声电子有限公司 Audio directional loudspeaker and forward method thereof
CN107271963A (en) * 2017-06-22 2017-10-20 广东美的制冷设备有限公司 The method and apparatus and air conditioner of auditory localization
CN108989687A (en) * 2018-09-07 2018-12-11 北京小米移动软件有限公司 camera focusing method and device
CN109639863A (en) * 2018-12-29 2019-04-16 维沃移动通信有限公司 A kind of method of speech processing and device

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110129101A1 (en) * 2004-07-13 2011-06-02 1...Limited Directional Microphone
EP2271134A1 (en) * 2009-07-02 2011-01-05 Nxp B.V. Proximity sensor comprising an acoustic transducer for receiving sound signals in the human audible range and for emitting and receiving ultrasonic signals.
CN102800325A (en) * 2012-08-31 2012-11-28 厦门大学 Ultrasonic-assisted microphone array speech enhancement device
CN106054133A (en) * 2016-05-11 2016-10-26 北京地平线信息技术有限公司 Far field sound source positioning system and method
CN106125771A (en) * 2016-08-16 2016-11-16 江西联创宏声电子有限公司 Audio directional loudspeaker and forward method thereof
CN107271963A (en) * 2017-06-22 2017-10-20 广东美的制冷设备有限公司 The method and apparatus and air conditioner of auditory localization
CN108989687A (en) * 2018-09-07 2018-12-11 北京小米移动软件有限公司 camera focusing method and device
CN109639863A (en) * 2018-12-29 2019-04-16 维沃移动通信有限公司 A kind of method of speech processing and device

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113553022A (en) * 2021-07-16 2021-10-26 Oppo广东移动通信有限公司 Equipment adjusting method and device, mobile terminal and storage medium

Similar Documents

Publication Publication Date Title
CN106797416B (en) Screen control method and device
CN108766457B (en) Audio signal processing method, audio signal processing device, electronic equipment and storage medium
EP3163748A2 (en) Method, device and terminal for adjusting volume
EP3091753B1 (en) Method and device of optimizing sound signal
CN107749925B (en) Audio playing method and device
US10027785B2 (en) Method for switching screen state of terminal, terminal thereof, and computer-readable medium thereof
JP6964666B2 (en) Multi-beam selection method and equipment
CN112866894B (en) Sound field control method and device, mobile terminal and storage medium
CN111007462A (en) Positioning method, positioning device, positioning equipment and electronic equipment
CN106060707B (en) Reverberation processing method and device
CN111698593B (en) Active noise reduction method and device, and terminal
CN112752191A (en) Audio acquisition method, device and storage medium
CN113596662B (en) Method for suppressing howling, device for suppressing howling, earphone, and storage medium
CN111694539B (en) Method, device and medium for switching between earphone and loudspeaker
CN109712629B (en) Audio file synthesis method and device
CN112185353A (en) Audio signal processing method and device, terminal and storage medium
CN110632600B (en) Environment identification method and device
CN116705047B (en) Audio acquisition method, device and storage medium
CN112019677B (en) Electronic equipment control method and device
CN112804462B (en) Multi-point focusing imaging method and device, mobile terminal and storage medium
CN117880731A (en) Audio and video recording method and device and storage medium
CN112462928A (en) Operation execution method, device and storage medium
CN117636893A (en) Wind noise detection method and device, wearable equipment and readable storage medium
CN116193319A (en) Speech enhancement processing method, speech enhancement processing device, and storage medium
CN116758895A (en) Shooting method, shooting device, electronic equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20210504