CN113903350A - Audio processing method and device and electronic equipment - Google Patents

Audio processing method and device and electronic equipment Download PDF

Info

Publication number
CN113903350A
CN113903350A CN202111150399.8A CN202111150399A CN113903350A CN 113903350 A CN113903350 A CN 113903350A CN 202111150399 A CN202111150399 A CN 202111150399A CN 113903350 A CN113903350 A CN 113903350A
Authority
CN
China
Prior art keywords
audio
audio data
sound
condition
sound acquisition
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202111150399.8A
Other languages
Chinese (zh)
Inventor
彭金刚
张双翼
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Lenovo Beijing Ltd
Original Assignee
Lenovo Beijing Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Lenovo Beijing Ltd filed Critical Lenovo Beijing Ltd
Priority to CN202111150399.8A priority Critical patent/CN113903350A/en
Publication of CN113903350A publication Critical patent/CN113903350A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0316Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Quality & Reliability (AREA)
  • Telephone Function (AREA)

Abstract

The application discloses an audio processing method, an audio processing device and electronic equipment, wherein the method comprises the following steps: acquiring a first audio frequency through a sound acquisition device of second equipment to obtain first audio frequency data; obtaining second audio data transmitted by the first device through the network; in response to a first condition that the first audio data and the second audio data contain the sound of the first object at the same time, the second audio data and/or the first audio are subjected to a first process so that the audio of the first object in the sound heard by the second object on the second device side is weakened. According to the implementation scheme, under the condition that the sound of the first object exists in the environment and the audio data transmitted by the network, the two sounds are determined to cause interference to the user, so that corresponding processing is performed, the sound of the first object in the environment or the audio data transmitted by the network is weakened, the user can only clearly hear the sound of the first object transmitted by one path, and the use experience of the user is improved.

Description

Audio processing method and device and electronic equipment
Technical Field
The present application relates to audio control technologies, and in particular, to an audio processing method and apparatus, and an electronic device.
Background
In an office scenario, a web conference involving multiple people is often required. However, such conferences often suffer from some complications. Referring to fig. 1, which is a schematic view of a scene in which a user can hear the same voice twice at the same time, in combination with fig. 1, when some seats of participants are close to each other, when a member a speaks, a member B beside the seat can directly hear the voice of the member a from the environment, and can also hear the voice of the member a transmitted from the network in an earphone, and due to the time difference between the environmental sound wave transmission and the network data transmission, the member B can hear the same voice twice. This situation can greatly affect the meeting experience of some participants.
Disclosure of Invention
In view of this, the present application provides the following technical solutions:
an audio processing method applied to a second device comprises the following steps:
acquiring a first audio frequency through a sound acquisition device of second equipment to obtain first audio frequency data;
obtaining second audio data transmitted by the first device through the network;
in response to a first condition that the first audio data and the second audio data contain a first object sound at the same time, performing a first process on the second audio data and/or the first audio so that an audio of the first object is weakened in a sound heard by a second object on the second device side.
Optionally, the performing, in response to a first condition that the first audio data and the second audio data simultaneously contain a first object sound, a first process on the second audio data and/or the first audio includes:
and in response to a first condition that the first audio data and the second audio data simultaneously contain first object sound, performing silencing or muting processing on the audio data of the first object in the second audio data.
Optionally, the performing, in response to a first condition that the first audio data and the second audio data simultaneously contain a first object sound, a first process on the second audio data and/or the first audio includes:
in response to a first condition that the first audio data and the second audio data simultaneously contain a first object sound, obtaining distance information of the first object, and processing to obtain reverse audio data based on the distance information, wherein the reverse audio data is used for canceling out at least part of the sound of the first object in the environment after being output, so that the audio of the first object in the sound heard by the second object from the environment is weakened.
Optionally, the method further includes:
and adjusting the output volume parameter of the second audio data in response to a second condition that the first audio data and the second audio data do not contain the same object sound, so that the output volume when the second audio data is output is increased.
Optionally, the acquiring, by a sound acquisition device of the second device, the first audio to obtain the first audio data includes:
and acquiring first audio through a sound acquisition device arranged in the second device and/or an externally connected sound acquisition device to obtain first audio data.
Optionally, the acquiring, by a sound acquisition device built in the second device and/or an externally connected sound acquisition device, the first audio to obtain the first audio data includes:
acquiring performance parameters of the built-in sound acquisition equipment and the externally connected sound acquisition equipment under the condition that the externally connected sound acquisition equipment is detected;
and determining the sound acquisition equipment with better performance parameters as the used sound acquisition equipment and acquiring the first audio to obtain first audio data.
Optionally, the determining, as the sound collecting device used, the sound collecting device with the better performance parameter and collecting the first audio to obtain the first audio data includes:
detecting a first distance between the second object and the built-in sound collection device in real time;
under the condition that the first distance is smaller than the effective distance of the built-in sound acquisition equipment, determining the sound acquisition equipment with better performance parameters as used voice acquisition equipment and acquiring first audio to obtain first audio data;
and under the condition that the first distance is not less than the effective distance of the built-in sound acquisition equipment, determining the externally-connected sound acquisition equipment as the used sound acquisition equipment and acquiring a first audio to obtain first audio data.
Optionally, the detecting a first distance between the second object and the built-in sound collecting device in real time includes:
and detecting a first distance between the second object and the built-in sound acquisition equipment in real time by adopting a flight time sensor.
An audio processing apparatus applied to a second device, comprising:
the audio acquisition module is used for acquiring a first audio through sound acquisition equipment of second equipment to obtain first audio data;
the data receiving module is used for obtaining second audio data transmitted by the first equipment through a network;
and the data processing module is used for responding to a first condition that the first audio data and the second audio data simultaneously contain first object sound, and performing first processing on the second audio data and/or the first audio so that the audio of the first object in the sound heard by a second object on the second equipment side is weakened.
An electronic device, comprising:
a processor;
a memory for storing executable instructions of the processor;
wherein the executable instructions comprise:
acquiring a first audio frequency through a sound acquisition device of second equipment to obtain first audio frequency data; obtaining second audio data transmitted by the first device through the network; in response to a first condition that the first audio data and the second audio data contain a first object sound at the same time, performing a first process on the second audio data and/or the first audio so that an audio of the first object is weakened in a sound heard by a second object on the second device side.
Compared with the prior art, the embodiment of the application discloses an audio processing method, an audio processing device and electronic equipment, and the method comprises the following steps: acquiring a first audio frequency through a sound acquisition device of second equipment to obtain first audio frequency data; obtaining second audio data transmitted by the first device through the network; in response to a first condition that the first audio data and the second audio data contain a first object sound at the same time, performing a first process on the second audio data and/or the first audio so that an audio of the first object is weakened in a sound heard by a second object on the second device side. According to the implementation scheme, under the condition that the sound of the first object exists in the environment and the audio data transmitted by the network, the two sounds are determined to cause interference to the user, so that corresponding processing is performed, the sound of the first object in the environment or the audio data transmitted by the network is weakened, the user can only clearly hear the sound of the first object transmitted by one path, and the use experience of the user is improved.
Drawings
In order to more clearly illustrate the embodiments of the present application or the technical solutions in the prior art, the drawings needed to be used in the description of the embodiments or the prior art will be briefly introduced below, it is obvious that the drawings in the following description are only embodiments of the present application, and for those skilled in the art, other drawings can be obtained according to the provided drawings without creative efforts.
FIG. 1 is a diagram of a scenario in which a user can hear the same sound twice at the same time;
fig. 2 is a flowchart of an audio processing method disclosed in an embodiment of the present application;
FIG. 3 is a schematic diagram illustrating processing of second audio data according to an embodiment of the disclosure;
FIG. 4 is a schematic diagram illustrating a principle of processing a first audio according to an embodiment of the present disclosure;
FIG. 5 is a flow chart of another audio processing method disclosed in the embodiments of the present application;
FIG. 6 is a flow chart of determining a sound collection device and collecting a first audio disclosed in an embodiment of the present application;
fig. 7 is a schematic structural diagram of an audio processing apparatus according to an embodiment of the present application.
Detailed Description
The technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the drawings in the embodiments of the present application, and it is obvious that the described embodiments are only a part of the embodiments of the present application, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present application.
The embodiment of the application can be applied to electronic equipment, the product form of the electronic equipment is not limited by the application, the electronic equipment can include but is not limited to a smart phone, a tablet computer, wearable equipment, a Personal Computer (PC), a netbook and the like, and the electronic equipment can be selected according to application requirements.
Fig. 2 is a flowchart of an audio processing method disclosed in an embodiment of the present application, where the method in fig. 2 is applicable to a second device, and referring to fig. 1, the audio processing method may include:
step 201: and acquiring a first audio frequency through a sound acquisition device of the second device to obtain first audio frequency data.
In a web conference scenario, multiple conference members are often participating, some of the conference members may be located in the same office, while other conference members may be remotely located. When the member A and the member B are located in the same office and are close to each other, and the member A speaks, the member B can hear the sound of the member A from the environment and also hear the sound of the member A from the earphone. Therefore, to deal with this situation, it is first necessary to determine whether the member B can hear the sound of the user a even in the environment.
Of course, the application scenario of the present application is not limited to a network conference scenario, and the audio processing method disclosed in the embodiment of the present application may be applied to any scenario in which multiple users realize online voice or video via a network. In this embodiment, only a scene of a web conference is taken as an example to perform related introduction.
In the embodiment of the application, whether the B member can hear the sound of the member A or not is determined by the electronic equipment of the member B. It can be understood that the electronic device of the member B is usually located in close proximity to the member B, so if the member B can hear the sound of the member a in the environment, the electronic device of the member B can also capture the sound of the member a in the environment through the sound capture device. Therefore, the audio processing method of the application firstly acquires first audio (sound in the environment) through a sound acquisition device of a second device (corresponding to the electronic device of the member B) to obtain first audio data, and subsequently can further judge whether the sound of the member a exists or not after the first audio data is obtained. The first audio may include a voice of a person speaking, and may also include other voices, such as a voice played by an electronic device, a voice produced by other objects in the environment, and the like.
Step 202: second audio data transmitted by the first device is obtained through the network.
In a network conference scene, when a member speaks, a sound collecting device, such as a microphone, on the electronic device of the member transmits collected audio data to computers of other members participating in the conference in real time through a network, and then the electronic device of the other member outputs the audio data of the other member received from the network to the user through an earphone. In this application, all audio data acquired from the network is referred to as second audio data.
The first device may be the electronic devices of all members participating in the network conference, including the electronic device located in the same office as the second device, and also including the remote electronic device.
Step 203: in response to a first condition that the first audio data and the second audio data contain a first object sound at the same time, performing a first process on the second audio data and/or the first audio so that an audio of the first object is weakened in a sound heard by a second object on the second device side.
After obtaining the first audio data and the second audio data, it is possible to identify and determine whether or not the first audio data and the second audio data simultaneously contain the sound of the first object by a corresponding technical process. The first object may be any person participating in the network conference. Specifically, the audio characteristics of the first audio data and the second audio data may be compared by performing spectral analysis on the two audio data to determine whether the same object is sound.
If it is determined through corresponding processing analysis that the sound of the first object exists in the first audio data and the second audio data at the same time, the user (corresponding to the second object) of the second device can hear the same utterance of the first object from the environment and the earphone respectively twice, so that the user can be interfered with the accurate information reception, and the first audio data and/or the second audio data can be correspondingly processed, so that the user can hear the speaking sound of the first object only once instead of hearing the speaking sound of the first object twice repeatedly; or, after processing, the user can still hear the same sound of the first object twice, but one sound is very clear, and the volume of the other sound is very small, so that the sound can be ignored, and no large interference can be caused to the user.
It should be noted that, after the processing, the sound of the first object that can only be heard by the user or the sound of the first object that can be clearly heard by the user may be from the environment or from an earphone, which is not limited in this application.
In the audio processing method of this embodiment, when it is determined that the sound of the first object exists in both the environment and the audio data transmitted by the network, it is determined that the two sounds may cause interference to the user, and corresponding processing is performed, so that the sound of the first object in the environment or the audio data transmitted by the network is weakened, and thus the user can only clearly hear the sound of the first object transmitted by one path, and the use experience of the user is improved.
In one implementation, the performing the first processing on the second audio data and/or the first audio in response to the first condition that the first audio data and the second audio data simultaneously contain the first object sound may include: and in response to a first condition that the first audio data and the second audio data simultaneously contain first object sound, performing silencing or muting processing on the audio data of the first object in the second audio data.
As described above, the second audio data may be audio data of all participants in the network conference, and in a practical scenario, not all participants may be located in the same office as the second object. Therefore, on the premise that the first audio data and the second audio data simultaneously contain the sound of the first object, that is, on the premise that the first object and the second object are located in the same office, when the second audio data is processed, only the audio data of the first object in the second audio data is muted or muted. In this way, the second subject will not hear or hardly hear the sound of the first subject from the headphones, but only the sound of speaking to the first subject (corresponding to the first audio) can be heard from the environment, thereby avoiding the interference of hearing the same utterance of the first subject twice in succession. Fig. 3 is a schematic view of a scenario of processing second audio data according to an embodiment of the present application, which can be understood by referring to fig. 3.
In another implementation, the performing the first processing on the second audio data and/or the first audio in response to the first condition that the first audio data and the second audio data simultaneously contain the first object sound may include: in response to a first condition that the first audio data and the second audio data simultaneously contain a first object sound, obtaining distance information of the first object, and processing to obtain reverse audio data based on the distance information, wherein the reverse audio data is used for canceling out at least part of the sound of the first object in the environment after being output, so that the audio of the first object in the sound heard by the second object from the environment is weakened.
In this implementation, when it is determined that the first audio data and the second audio data simultaneously contain the sound of the first object, the second audio data is not processed, but the first audio in the environment is processed, that is, the sound of the first object in the environment is processed. Specifically, the distance information of the first object may be determined first, and in the case where the position of the ear portion of the second object (which can be determined by the image recognition technique) and the position of the sound wave emitting device are known, the orientation of the first object can be determined by the distance information of the first object, and further the sound wave generating device in the second device may adjust the phase, and reverse audio data capable of canceling the sound of the first object in the environment is generated by the reverse audio generating unit, and the reverse audio data is output by the sound wave emitting device. After the reverse audio data is output, the user will not hear the sound of the first object in the environment, or can hear only the sound of a very small first object in the environment, so that the second audio data transmitted in the headphones can be clearly heard. Fig. 4 is a schematic view of a scenario of processing a first audio according to an embodiment of the present application, which can be understood by referring to fig. 4.
It should be noted that, the two implementations do not limit the specific content of the first processing, and in the two implementations, only the first audio or the second audio data is processed separately, and in practical cases, the sound of the first object in the environment may be enhanced at the same time while the sound of the first object in the second audio data is muted or muted; or increasing the output volume of the second audio data or performing other optimization processing while performing reverse cancellation processing on the first audio.
On the basis of the above-mentioned embodiments disclosed in the present application, a flowchart of an audio processing method in another implementation may be shown in fig. 5, and as shown in fig. 5, the audio processing method may include:
step 501: and acquiring a first audio frequency through a sound acquisition device of the second device to obtain first audio frequency data.
Step 502: the second audio data transmitted by the first device is obtained through the network and step 503 or step 504 is entered.
Step 503: in response to a first condition that the first audio data and the second audio data contain a first object sound at the same time, performing a first process on the second audio data and/or the first audio so that an audio of the first object is weakened in a sound heard by a second object on the second device side.
Step 504: and adjusting the output volume parameter of the second audio data in response to a second condition that the first audio data and the second audio data do not contain the same object sound, so that the output volume when the second audio data is output is increased.
Under the condition that the first audio data and the second audio data do not contain the same object sound, the first audio is only considered to be environmental noise irrelevant to the network conference, so that in order to conveniently and accurately obtain the second audio data by the second object, the output volume of the second audio data can be increased, and a user can hear the corresponding sound of the second audio data more clearly.
In this embodiment, when it is determined that the first audio data and the second audio data do not include the same object sound, it is determined that the first audio is ambient noise, and therefore, an effect of the second object receiving the second audio data is ensured by turning up an output volume of the second audio data.
In one implementation, the acquiring, by a sound acquisition device of a second device, first audio data to obtain first audio data may include: and acquiring first audio through a sound acquisition device arranged in the second device and/or an externally connected sound acquisition device to obtain first audio data.
For some network-enabled electronic devices, they are usually equipped with a sound collection device themselves. In a network conference scenario, each member usually connects to an electronic device through a headset device to receive audio data from the headset device, and the headset device usually includes a voice capture device. Therefore, when there are two available sound collection apparatuses for the electronic apparatus, one sound collection apparatus needs to be selected as the sound collection apparatus to be used.
The acquiring of the first audio by the sound acquisition device built in the second device and/or the sound acquisition device connected externally to obtain the first audio data may include: acquiring the first audio through the sound acquisition device built in the second device and/or the sound acquisition device connected externally to obtain the first audio data may include: acquiring performance parameters of the built-in sound acquisition equipment and the externally connected sound acquisition equipment under the condition that the externally connected sound acquisition equipment is detected; and determining the sound acquisition equipment with better performance parameters as the used sound acquisition equipment and acquiring the first audio to obtain first audio data.
It can be understood that when two available sound collection devices exist in the electronic device, the sound collection device with the better performance parameter is selected as the used device, so that the quality of the collected first audio data is better, and the use experience of a user is promoted.
Fig. 6 is a flowchart of determining a sound collection device and collecting a first audio according to an embodiment of the disclosure, and referring to fig. 6, in an implementation, the determining a sound collection device with better performance parameters as a used sound collection device and collecting the first audio to obtain first audio data may include:
step 601: detecting a first distance between the second object and the built-in sound collection device in real time.
In the case that the built-in sound collection device of the second device has better performance, the built-in sound collection device is used for collecting the first audio, but the second object may be far away from the built-in sound collection device of the second device because the second object is wearing the earphone. In this case, if the sound collection device built in the second device is still used to collect the sound (including the sound of the second object), the collection effect is not ideal. Therefore, in this implementation, a first distance between the second object and the built-in sound collection device is detected in real time, and then whether to switch the sound collection device is determined based on the first distance.
Step 602: and under the condition that the first distance is smaller than the effective distance of the built-in sound acquisition equipment, determining the sound acquisition equipment with better performance parameters as the used voice acquisition equipment and acquiring first audio to obtain first audio data.
Step 603: and under the condition that the first distance is not less than the effective distance of the built-in sound acquisition equipment, determining the externally-connected sound acquisition equipment as the used sound acquisition equipment and acquiring a first audio to obtain first audio data.
In the case where the first distance is greater than or equal to the effective distance of the sound collection device built in the second device, the sound of the second object collected with the built-in sound collection device may be small or ambiguous, and therefore, in this case, an externally connected sound collection device is adopted as the sound collection device to be used. Because the user can wear the headset in the whole course of the network meeting, the sound collecting equipment on the headset is always close to the second object, and therefore the sound of the second object can be clearly collected.
In one implementation, the detecting, in real time, the first distance between the second object and the built-in sound collection device may include: and detecting a first distance between the second object and the built-in sound acquisition equipment in real time by adopting a flight time sensor.
While, for purposes of simplicity of explanation, the foregoing method embodiments have been described as a series of acts or combination of acts, it will be appreciated by those skilled in the art that the present application is not limited by the order of acts or acts described, as some steps may occur in other orders or concurrently with other steps in accordance with the application. Further, those skilled in the art should also appreciate that the embodiments described in the specification are preferred embodiments and that the acts and modules referred to are not necessarily required in this application.
The method is described in detail in the embodiments disclosed in the present application, and the method of the present application can be implemented by various types of apparatuses, so that an apparatus is also disclosed in the present application, and the following detailed description is given of specific embodiments.
Fig. 7 is a schematic structural diagram of an audio processing apparatus disclosed in an embodiment of the present application, where the apparatus shown in fig. 7 is applied to a second device, and referring to fig. 7, an audio processing apparatus 70 may include:
the audio acquisition module 701 is configured to acquire a first audio through a sound acquisition device of a second device to obtain first audio data.
A data receiving module 702, configured to obtain, through the network, the second audio data transmitted by the first device.
A data processing module 703, configured to perform, in response to a first condition that the first audio data and the second audio data simultaneously contain a first object sound, a first process on the second audio data and/or the first audio, so that an audio of the first object in a sound heard by a second object on the second device side is weakened.
In the audio processing apparatus of this embodiment, when it is determined that the sound of the first object exists in both the environment and the audio data transmitted by the network, it is determined that the two sounds may cause interference to the user, so that corresponding processing is performed, and the sound of the first object in the environment or the audio data transmitted by the network is weakened, so that the user can only clearly hear the sound of the first object transmitted by one path, thereby improving the user experience.
In one implementation, the data processing module is specifically configured to: and in response to a first condition that the first audio data and the second audio data simultaneously contain first object sound, performing silencing or muting processing on the audio data of the first object in the second audio data.
In one implementation, the data processing module is specifically configured to: in response to a first condition that the first audio data and the second audio data simultaneously contain a first object sound, obtaining distance information of the first object, and processing to obtain reverse audio data based on the distance information, wherein the reverse audio data is used for canceling out at least part of the sound of the first object in the environment after being output, so that the audio of the first object in the sound heard by the second object from the environment is weakened.
In one implementation, the data processing module is further configured to: and adjusting the output volume parameter of the second audio data in response to a second condition that the first audio data and the second audio data do not contain the same object sound, so that the output volume when the second audio data is output is increased.
In one implementation, the audio acquisition module is specifically operable to: and acquiring first audio through a sound acquisition device arranged in the second device and/or an externally connected sound acquisition device to obtain first audio data.
In one implementation, an audio acquisition module includes: the performance acquisition module is used for acquiring performance parameters of the built-in sound acquisition equipment and the externally connected sound acquisition equipment under the condition that the externally connected sound acquisition equipment is detected to exist; and the equipment determining module is used for determining the sound acquisition equipment with the better performance parameters as the used sound acquisition equipment and acquiring the first audio to obtain the first audio data.
In one implementation, the device determination module is specifically configured to: detecting a first distance between the second object and the built-in sound collection device in real time;
under the condition that the first distance is smaller than the effective distance of the built-in sound acquisition equipment, determining the sound acquisition equipment with better performance parameters as used voice acquisition equipment and acquiring first audio to obtain first audio data;
and under the condition that the first distance is not less than the effective distance of the built-in sound acquisition equipment, determining the externally-connected sound acquisition equipment as the used sound acquisition equipment and acquiring a first audio to obtain first audio data.
Wherein the first distance between the second object and the built-in sound collection device is detectable in real time by employing a time-of-flight sensor.
The audio processing apparatus in any of the above embodiments includes a processor and a memory, where the audio acquisition module, the data receiving module, the data processing module, the performance obtaining module, the device determining module, and the like in the above embodiments are all stored in the memory as program modules, and the processor executes the program modules stored in the memory to implement corresponding functions.
The processor comprises a kernel, and the kernel calls the corresponding program module from the memory. The kernel can be provided with one or more, and the processing of the return visit data is realized by adjusting the kernel parameters.
The memory may include volatile memory in a computer readable medium, Random Access Memory (RAM) and/or nonvolatile memory such as Read Only Memory (ROM) or flash memory (flash RAM), and the memory includes at least one memory chip.
An embodiment of the present application provides a storage medium, on which a program is stored, which when executed by a processor implements the audio processing method described in the above embodiments.
The embodiment of the present application provides a processor, where the processor is configured to execute a program, where the program executes the audio processing method described in the foregoing embodiment when running.
Further, the present embodiment provides an electronic device, which includes a processor and a memory. Wherein the memory is used for storing executable instructions of the processor, and the processor is configured to execute the audio processing method described in the above embodiments via executing the executable instructions. Wherein the executable instructions comprise: acquiring a first audio frequency through a sound acquisition device of second equipment to obtain first audio frequency data; obtaining second audio data transmitted by the first device through the network; in response to a first condition that the first audio data and the second audio data contain a first object sound at the same time, performing a first process on the second audio data and/or the first audio so that an audio of the first object is weakened in a sound heard by a second object on the second device side.
The embodiments in the present description are described in a progressive manner, each embodiment focuses on differences from other embodiments, and the same and similar parts among the embodiments are referred to each other. The device disclosed by the embodiment corresponds to the method disclosed by the embodiment, so that the description is simple, and the relevant points can be referred to the method part for description.
It is further noted that, herein, relational terms such as first and second, and the like may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other identical elements in a process, method, article, or apparatus that comprises the element.
The steps of a method or algorithm described in connection with the embodiments disclosed herein may be embodied directly in hardware, in a software module executed by a processor, or in a combination of the two. A software module may reside in Random Access Memory (RAM), memory, Read Only Memory (ROM), electrically programmable ROM, electrically erasable programmable ROM, registers, hard disk, a removable disk, a CD-ROM, or any other form of storage medium known in the art.
The previous description of the disclosed embodiments is provided to enable any person skilled in the art to make or use the present application. Various modifications to these embodiments will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other embodiments without departing from the spirit or scope of the application. Thus, the present application is not intended to be limited to the embodiments shown herein but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.

Claims (10)

1. An audio processing method applied to a second device comprises the following steps:
acquiring a first audio frequency through a sound acquisition device of second equipment to obtain first audio frequency data;
obtaining second audio data transmitted by the first device through the network;
in response to a first condition that the first audio data and the second audio data contain a first object sound at the same time, performing a first process on the second audio data and/or the first audio so that an audio of the first object is weakened in a sound heard by a second object on the second device side.
2. The audio processing method according to claim 1, said first processing the second audio data and/or the first audio in response to a first condition that the first audio data and the second audio data simultaneously contain a first object sound, comprising:
and in response to a first condition that the first audio data and the second audio data simultaneously contain first object sound, performing silencing or muting processing on the audio data of the first object in the second audio data.
3. The audio processing method according to claim 1, said first processing the second audio data and/or the first audio in response to a first condition that the first audio data and the second audio data simultaneously contain a first object sound, comprising:
in response to a first condition that the first audio data and the second audio data simultaneously contain a first object sound, obtaining distance information of the first object, and processing to obtain reverse audio data based on the distance information, wherein the reverse audio data is used for canceling out at least part of the sound of the first object in the environment after being output, so that the audio of the first object in the sound heard by the second object from the environment is weakened.
4. The audio processing method of claim 1, further comprising:
and adjusting the output volume parameter of the second audio data in response to a second condition that the first audio data and the second audio data do not contain the same object sound, so that the output volume when the second audio data is output is increased.
5. The audio processing method of claim 1, wherein the acquiring of the first audio by the sound acquisition device of the second device to obtain the first audio data comprises:
and acquiring first audio through a sound acquisition device arranged in the second device and/or an externally connected sound acquisition device to obtain first audio data.
6. The audio processing method according to claim 5, wherein acquiring the first audio by the sound acquisition device built in the second device and/or the sound acquisition device connected externally to obtain the first audio data comprises:
acquiring performance parameters of the built-in sound acquisition equipment and the externally connected sound acquisition equipment under the condition that the externally connected sound acquisition equipment is detected;
and determining the sound acquisition equipment with better performance parameters as the used sound acquisition equipment and acquiring the first audio to obtain first audio data.
7. The audio processing method according to claim 6, wherein the determining the sound collection device with the better performance parameter as the used sound collection device and collecting the first audio to obtain the first audio data comprises:
detecting a first distance between the second object and the built-in sound collection device in real time;
under the condition that the first distance is smaller than the effective distance of the built-in sound acquisition equipment, determining the sound acquisition equipment with better performance parameters as used voice acquisition equipment and acquiring first audio to obtain first audio data;
and under the condition that the first distance is not less than the effective distance of the built-in sound acquisition equipment, determining the externally-connected sound acquisition equipment as the used sound acquisition equipment and acquiring a first audio to obtain first audio data.
8. The audio processing method of claim 7, the detecting in real-time a first distance between the second object and the built-in sound collection device, comprising:
and detecting a first distance between the second object and the built-in sound acquisition equipment in real time by adopting a flight time sensor.
9. An audio processing apparatus applied to a second device, comprising:
the audio acquisition module is used for acquiring a first audio through sound acquisition equipment of second equipment to obtain first audio data;
the data receiving module is used for obtaining second audio data transmitted by the first equipment through a network;
and the data processing module is used for responding to a first condition that the first audio data and the second audio data simultaneously contain first object sound, and performing first processing on the second audio data and/or the first audio so that the audio of the first object in the sound heard by a second object on the second equipment side is weakened.
10. An electronic device, comprising:
a processor;
a memory for storing executable instructions of the processor;
wherein the executable instructions comprise:
acquiring a first audio frequency through a sound acquisition device of second equipment to obtain first audio frequency data; obtaining second audio data transmitted by the first device through the network; in response to a first condition that the first audio data and the second audio data contain a first object sound at the same time, performing a first process on the second audio data and/or the first audio so that an audio of the first object is weakened in a sound heard by a second object on the second device side.
CN202111150399.8A 2021-09-29 2021-09-29 Audio processing method and device and electronic equipment Pending CN113903350A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111150399.8A CN113903350A (en) 2021-09-29 2021-09-29 Audio processing method and device and electronic equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111150399.8A CN113903350A (en) 2021-09-29 2021-09-29 Audio processing method and device and electronic equipment

Publications (1)

Publication Number Publication Date
CN113903350A true CN113903350A (en) 2022-01-07

Family

ID=79189311

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111150399.8A Pending CN113903350A (en) 2021-09-29 2021-09-29 Audio processing method and device and electronic equipment

Country Status (1)

Country Link
CN (1) CN113903350A (en)

Similar Documents

Publication Publication Date Title
KR102470962B1 (en) Method and apparatus for enhancing sound sources
CN110970057B (en) Sound processing method, device and equipment
US20090279715A1 (en) Method, medium, and apparatus for extracting target sound from mixed sound
EP3350804B1 (en) Collaborative audio processing
CN110177317B (en) Echo cancellation method, echo cancellation device, computer-readable storage medium and computer equipment
CN110956969B (en) Live broadcast audio processing method and device, electronic equipment and storage medium
US11398220B2 (en) Speech processing device, teleconferencing device, speech processing system, and speech processing method
CN113949955A (en) Noise reduction processing method and device, electronic equipment, earphone and storage medium
CN110913312B (en) Echo cancellation method and device
CN113903350A (en) Audio processing method and device and electronic equipment
WO2020043007A1 (en) Method, system, and computer-readable medium for purifying voice using depth information
JP6314475B2 (en) Audio signal processing apparatus and program
CN115410593A (en) Audio channel selection method, device, equipment and storage medium
US9997169B2 (en) Image-based techniques for audio content
US10997984B2 (en) Sounding device, audio transmission system, and audio analysis method thereof
Matsumoto Vision-referential speech enhancement of an audio signal using mask information captured as visual data
WO2022047606A1 (en) Method and system for authentication and compensation
EP3029671A1 (en) Method and apparatus for enhancing sound sources
EP4149120A1 (en) Method, hearing system, and computer program for improving a listening experience of a user wearing a hearing device, and computer-readable medium
US20230360662A1 (en) Method and device for processing a binaural recording
US20230421702A1 (en) Distributed teleconferencing using personalized enhancement models
US20230267942A1 (en) Audio-visual hearing aid
US11810587B2 (en) Noise filtrations based on radar
CN111800552A (en) Audio output processing method, device and system and electronic equipment
CN113038338A (en) Noise reduction processing method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination