CN115499761A - Audio processing apparatus and audio processing system - Google Patents

Audio processing apparatus and audio processing system Download PDF

Info

Publication number
CN115499761A
CN115499761A CN202211104274.6A CN202211104274A CN115499761A CN 115499761 A CN115499761 A CN 115499761A CN 202211104274 A CN202211104274 A CN 202211104274A CN 115499761 A CN115499761 A CN 115499761A
Authority
CN
China
Prior art keywords
audio data
audio
bluetooth
chip
paths
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202211104274.6A
Other languages
Chinese (zh)
Inventor
吴叶富
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Anker Innovations Co Ltd
Original Assignee
Anker Innovations Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Anker Innovations Co Ltd filed Critical Anker Innovations Co Ltd
Priority to CN202211104274.6A priority Critical patent/CN115499761A/en
Publication of CN115499761A publication Critical patent/CN115499761A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R3/00Circuits for transducers, loudspeakers or microphones
    • H04R3/005Circuits for transducers, loudspeakers or microphones for combining the signals of two or more microphones
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04BTRANSMISSION
    • H04B5/00Near-field transmission systems, e.g. inductive or capacitive transmission systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04BTRANSMISSION
    • H04B5/00Near-field transmission systems, e.g. inductive or capacitive transmission systems
    • H04B5/70Near-field transmission systems, e.g. inductive or capacitive transmission systems specially adapted for specific purposes
    • H04B5/72Near-field transmission systems, e.g. inductive or capacitive transmission systems specially adapted for specific purposes for local intradevice communication
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/02Constructional features of telephone sets
    • H04M1/19Arrangements of transmitters, receivers, or complete sets to prevent eavesdropping, to attenuate local noise or to prevent undesired transmission; Mouthpieces or receivers specially adapted therefor
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/60Substation equipment, e.g. for use by subscribers including speech amplifiers
    • H04M1/6033Substation equipment, e.g. for use by subscribers including speech amplifiers for providing handsfree use or a loudspeaker mode in telephone sets
    • H04M1/6041Portable telephones adapted for handsfree use
    • H04M1/6058Portable telephones adapted for handsfree use involving the use of a headset accessory device connected to the portable telephone
    • H04M1/6066Portable telephones adapted for handsfree use involving the use of a headset accessory device connected to the portable telephone including a wireless connection

Landscapes

  • Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Acoustics & Sound (AREA)
  • General Health & Medical Sciences (AREA)
  • Otolaryngology (AREA)
  • Computational Linguistics (AREA)
  • Quality & Reliability (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Multimedia (AREA)
  • Telephone Function (AREA)
  • Circuit For Audible Band Transducer (AREA)

Abstract

An audio processing device and an audio processing system are provided, the audio processing device comprises a Bluetooth chip, the Bluetooth chip is used for receiving at least two paths of audio data, wherein the at least two paths of audio data are obtained by respectively picking up sound of the same call scene by at least two microphones; and the Bluetooth chip is also used for carrying out audio noise reduction processing on the basis of all the received audio data to obtain noise-reduced voice signals of the call scene and transmitting the noise-reduced voice signals to the terminal equipment. The audio processing equipment and the audio processing system can improve the quality of voice communication and improve user experience.

Description

Audio processing apparatus and audio processing system
Technical Field
The present application relates to the field of audio processing technologies, and more particularly, to an audio processing device and an audio processing system.
Background
Today in the information age, people are working and living at a faster and faster pace. People pay more and more attention to the product experience in daily life. Due to epidemic situations and the like, more people live in homes, offices and living rooms, and frequently use computers, conference systems and the like to communicate.
With the rapid development of science and technology, a true wireless communication system has been successfully developed at present. However, wireless audio products such as earphones or speakers currently on the market can only simply use one earphone or one microphone for audio input. The above reasons cause the problem that the user experiences poor conversation after the conversation system comes into the market in recent years. Therefore, it is desirable to provide a solution that can improve the call experience.
Disclosure of Invention
According to an aspect of the present application, an audio processing device is provided, where the audio processing device includes a bluetooth chip, and the bluetooth chip is configured to receive at least two paths of audio data, where the at least two paths of audio data are obtained by at least two microphones respectively picking up sound in a same call scene; the Bluetooth chip is further used for carrying out audio noise reduction processing on the basis of all the received audio data to obtain noise-reduced voice signals of the conversation scene, and transmitting the noise-reduced voice signals to the terminal equipment.
In one embodiment of the present application, the bluetooth chip at least includes a master chip and a slave chip, the master chip is configured to receive one path of the audio data, and the slave chip is configured to receive another path of the audio data, where: the slave chip is also used for transmitting the audio data received by the slave chip to the master chip; the master chip is also used for carrying out the audio noise reduction processing according to all the received audio data after receiving the audio data transmitted by the slave chip; or the slave chip is further configured to perform the audio noise reduction processing according to the audio data received by the slave chip, and transmit a first noise-reduced voice signal obtained after noise reduction to the master chip; the main chip is further configured to perform the audio denoising processing according to the path of audio data received by the main chip to obtain a second denoised voice signal after denoising, and perform voice fusion with the second denoised voice signal after receiving the first denoised voice signal transmitted by the slave chip.
In an embodiment of the present application, the bluetooth chip at least includes an LE-audio bluetooth chip supporting a next generation bluetooth technology, and the LE-audio bluetooth chip supporting the next generation bluetooth technology is configured to receive the at least two paths of audio data and execute the audio denoising process according to all the received paths of audio data.
In an embodiment of the present application, the step of the audio noise reduction processing performed by the bluetooth chip specifically includes: respectively calculating voice energy values of all the received audio data; comparing the voice energy values of all the paths of audio data to obtain a comparison result; and synthesizing multiple paths of audio data in all paths of audio data into one path of audio data based on the comparison result, and then performing audio noise reduction processing, or selecting one path of audio data with the largest voice energy value to perform audio noise reduction processing, so as to obtain noise-reduced voice signals of the call scene.
In an embodiment of the present application, the bluetooth chip is further configured to: for audio data all the way, when audio data's the speech energy value of all the way is less than first threshold value, perhaps another way audio data's the speech energy value with when audio data's the difference of the speech energy value of all the way is greater than the second threshold value, to gathering audio data's the bluetooth sound pickup equipment that the microphone of all the way is located sends control signal, in order to control the gathering audio data's the microphone of all the way is closed.
In one embodiment of the present application, the at least two microphones include at least three microphones, and wherein the two microphones are microphones disposed at different locations on the same bluetooth sound pickup device.
In one embodiment of the present application, the at least two microphones include two microphones respectively disposed on two different bluetooth sound pickup devices.
In one embodiment of the present application, the two bluetooth sound pickup devices include a left ear headphone and a right ear headphone of a bluetooth headset, or include two bluetooth sound devices.
In one embodiment of the application, the audio processing device comprises a bluetooth adapter or a charging bin of a bluetooth headset.
According to another aspect of the present application, an audio processing system is further provided, where the audio processing system includes at least two audio processing devices as described above, and each of the at least two audio processing devices processes at least two paths of audio data, so as to implement noise reduction processing on multiple paths of audio data obtained by picking up sound in the same call scene in a matching manner.
According to the audio processing device and the audio processing system, the quality of voice call can be improved, and user experience is improved.
Drawings
The above and other objects, features and advantages of the present application will become more apparent by describing in more detail embodiments of the present application with reference to the attached drawings. The accompanying drawings are included to provide a further understanding of the embodiments of the application and are incorporated in and constitute a part of this specification, illustrate embodiments of the application and together with the description serve to explain the principles of the application. In the drawings, like reference numbers generally indicate like parts or steps.
Fig. 1 shows a schematic structural block diagram of an audio processing device according to an embodiment of the present application and a data interaction diagram thereof with a microphone and a terminal device.
Fig. 2 shows a schematic block diagram of an audio processing device and its data interaction with a microphone and a terminal device according to a more specific embodiment of the present application.
Fig. 3 shows a schematic block diagram of an audio processing device and its data interaction with a microphone and a terminal device according to another more specific embodiment of the present application.
Fig. 4 shows a schematic block diagram of an audio processing system according to an embodiment of the present application.
Detailed Description
In order to make the objects, technical solutions and advantages of the present application more apparent, exemplary embodiments according to the present application will be described in detail below with reference to the accompanying drawings. It should be understood that the described embodiments are only some embodiments of the present application and not all embodiments of the present application, and that the present application is not limited by the example embodiments described herein. All other embodiments, which can be derived by a person skilled in the art from the embodiments described in the application without inventive step, are intended to fall within the scope of protection of the application.
Fig. 1 shows a schematic block diagram of an audio processing device 100 according to an embodiment of the present application and a schematic diagram of data interaction between the audio processing device and a microphone and a terminal device, where the microphone is disposed on a bluetooth sound pickup device. As shown in fig. 1, the audio processing apparatus 100 includes a bluetooth chip 110, and the bluetooth chip 110 is configured to receive at least two paths of audio data, where the at least two paths of audio data are obtained by at least two microphones respectively picking up sound for a same call scene (for simplicity, only two microphones and corresponding two paths of audio data are shown in fig. 1). The bluetooth chip 110 is further configured to perform audio noise reduction processing based on all the received audio data to obtain noise-reduced voice signals of the call scene, and transmit the noise-reduced voice signals to the terminal device.
In an embodiment of the present application, the bluetooth chip 110 receives audio data from at least two microphones, and each microphone picks up sound for a current call scenario to obtain one path of audio data; correspondingly, the bluetooth chip 110 receives the corresponding at least two paths of audio data from the at least two microphones, and performs audio denoising processing on the audio data to obtain denoised voice signals of the call scene; because the bluetooth chip 110 performs audio noise reduction processing on the basis of at least two paths of audio data acquired by picking up sound in the same call scene, different paths of audio data are mutually referenced, and compared with performing audio noise reduction processing on the basis of only one path of audio data, the reliability of noise reduction can be improved, so that the quality of voice signals is improved; accordingly, the voice signal is transmitted to the terminal device and then transmitted to the other party of the call by the terminal device, so that the call quality can be improved, and the user experience of the call scene is improved. In addition, the terminal device can also be used for performing operations such as real-time transcription, real-time conversation and real-time translation on the acquired voice signals, and can also upload the voice signals to a cloud.
In the embodiment of the present application, the audio noise reduction processing performed by the bluetooth chip 110 may specifically include: respectively calculating voice energy values of all received paths of audio data (such as through Fourier transformation, A weighting and the like); comparing the voice energy values of all the paths of audio data to obtain a comparison result; and synthesizing the multiple paths of audio data in all paths of audio data into one path of audio data based on the comparison result, and then performing audio noise reduction processing, or selecting the path of audio data with the largest voice energy value to perform audio noise reduction processing, so as to obtain the noise-reduced voice signal of the call scene.
For example, when the difference between the voice energy values of all the channels of audio data at each frequency point is not large, it indicates that the sound pickup effect of each channel of audio data is equivalent, and at this time, all the channels of audio data can be synthesized into one channel of audio data and then audio noise reduction processing is performed; for another example, when the difference between the voice energies of different paths of audio data at some frequency points is large, it indicates that the difference between the sound pickup effects of different paths of audio data is large, and at this time, the path of audio data with the largest voice energy value may be selected to perform the audio denoising processing; this is advantageous for obtaining a better quality speech signal.
In an embodiment of the present application, the bluetooth chip 110 may further be configured to: for one path of audio data, when the voice energy value of one path of audio data is smaller than a first threshold (the threshold may be set as required), or the difference between the voice energy value of another path of audio data and the voice energy value of one path of audio data is greater than a second threshold (the threshold may be set as required), sending a control signal to the bluetooth sound pickup device where the microphone collecting the one path of audio data is located, so as to control the microphone collecting the one path of audio data to be turned off.
For example, by comparison, it is determined that the picked-up voice energy value of one microphone (e.g., microphone a) for the current call scenario is significantly greater than that of the other microphone (e.g., microphone B) for the current call scenario, which indicates that in the current call scenario, the speaker is closer to microphone a and farther away from microphone B, or there are no other noise sources near microphone a and other noise sources near microphone B; based on this, the bluetooth chip 110 may control the sound pickup of the microphones a and B, for example, control the microphone a to pick up sound only and control the microphone B to turn off (for example, control the microphone B to turn off by sending a control signal to the bluetooth sound pickup apparatus where the microphone B is located) in the subsequent call of the current call scene, so that the power of the bluetooth sound pickup apparatus may be saved while obtaining a high-quality sound pickup result (for example, the microphone a and the microphone B are located at different positions of the same bluetooth sound pickup apparatus, and the microphone B is turned off at this time, and the power of the bluetooth sound pickup apparatus may be saved while obtaining a high-quality sound pickup result by only using the microphone a to pick up sound).
In one embodiment of the present application, the audio processing apparatus 100 may be implemented as a bluetooth adapter (dongle) or a charging receptacle of a bluetooth headset having the above-described structure. The aforementioned at least two microphones may include at least three microphones, for example, and the two microphones are microphones disposed at different positions on the same bluetooth sound pickup device. Or, the aforementioned at least two microphones may include two microphones respectively disposed on two different bluetooth sound pickup devices, where the two bluetooth sound pickup devices include, for example, a left ear earphone and a right ear earphone of a bluetooth earphone, and the left ear earphone is provided with at least one microphone, and the right ear earphone is provided with at least one other microphone; or two independent bluetooth sound equipment are included, and each bluetooth sound equipment is provided with a microphone respectively.
The interaction between the bluetooth sound pickup device and the audio processing device 100, which can improve the quality of the speech sound, is described below by taking the bluetooth sound pickup device as a wireless bluetooth headset and taking the bluetooth dongle as an example (when the bluetooth sound pickup device is a wireless bluetooth sound, the interaction process is also similar, and is not described in detail below).
For example, in a conversation scenario, a first speaker makes a call with a second speaker through a wireless bluetooth headset, and during the conversation, the left side of the first speaker has other colleagues talking, while the right side is relatively quiet. In the conversation scene, the left earphone and the right earphone of the wireless Bluetooth earphone of the caller A pick up sound to obtain two paths of audio data, namely left path audio data and right path audio data; after the Bluetooth pickup equipment and the Bluetooth dongle are connected through Bluetooth, the left audio data and the right audio data can be respectively transmitted to a Bluetooth chip 110 of the Bluetooth dongle; then, the bluetooth chip 110 determines that there is more noise (speech sound from the co-worker on the left side of the speaker a) in the left audio data by comparing the left audio data with the right audio data, and the voice of the right audio data is relatively pure, so the right audio data is selected to perform audio noise reduction processing to obtain the noise-reduced voice signal of the communication scene, and the noise-reduced voice signal is transmitted to the terminal device (the mobile phone in the scene) in a wired or wireless manner, so that the speaker b can hear the voice with higher quality, thereby having better user experience. Compared with the prior art in which the wireless Bluetooth headset only adopts the pickup of one microphone, for example, the left headset is just adopted to pick up the audio data with much noise, the scheme of the application can reduce the interference noise in the call scene to a greater extent, and effectively improve the voice call quality. In the scene, at least one microphone of each of two earphones of the Bluetooth earphone is fused into a pickup array, and noise reduction processing is preferentially carried out after multipath pickup, so that the communication quality is improved; in addition, multi-channel voice collection is realized by using dongle, and individual control and data processing of the two earphones can also be realized by using dongle, as described above.
In the above call scenario example, the audio processing device 100 is replaced with the charging bin from the bluetooth dongle, and the process is the same, which is not described herein again. Data interaction between the wireless bluetooth headset and the charging bin in another example call scenario is described below.
In another example of a call scenario, the wireless bluetooth headset is not worn on the ear of the user, but is placed in a charging chamber for storage and charging, and at this time, the wireless bluetooth headset and the charging chamber may form a conference sound pickup system. That is, in this example, the wireless bluetooth headset no longer functions as a listening device, but merely as a sound pickup device, and a speaker may be provided on the charging chamber to play sound. In the example, a left earphone and a right earphone of the wireless Bluetooth earphone pick up sound of the participant to obtain two paths of audio data, namely a left path of audio data and a right path of audio data; when the earphone is connected with the pin of the charging bin or connected with the Bluetooth, the left audio data and the right audio data can be respectively transmitted to the Bluetooth chip 110 of the charging bin; then, the bluetooth chip 110 determines that the speech energy value of the left audio data is smaller and the speech energy value of the right audio data is larger by comparing the left audio data and the right audio data (for example, the left earphone is farther away from the participating speaker and the right earphone is closer to the participating speaker), at this time, the right audio data may be selected to perform audio noise reduction processing to obtain the noise-reduced speech signal of the conversation scene, and then, the noise-reduced speech signal is transmitted to the terminal device (which may be a computer in the scene) in a wired or wireless manner, so that the remote participant can hear the speech with higher quality, thereby having better user experience. In the above example, the charging cabin may also digitize the voice signal obtained by noise reduction, convert the voice signal into a recording file, and store the recording file locally to form a local recording system. The audio files can be transmitted to the terminal device in real time or non-real time (in a mode of Bluetooth HID/BLE/SPP/A2DP/HFP or USB and the like), and the terminal device can perform functions of transcription, conversation or translation and the like or upload the audio files to the cloud and the like.
The audio processing apparatus and its data interaction with the bluetooth sound pickup apparatus and the terminal apparatus according to an embodiment of the present application are exemplarily described above. Based on the above description, the audio processing device 100 according to the embodiment of the present application can acquire at least two paths of audio data from at least two microphones on at least one bluetooth sound pickup device for the same call scene, perform audio denoising processing based on the at least two paths of audio data, obtain denoised voice signals of the call scene, and transmit the denoised voice signals to the terminal device, so as to improve the reliability of denoising, thereby improving the quality of voice call and improving user experience.
In an embodiment of the present application, the bluetooth chip 110 may include at least an LE-audio bluetooth chip supporting the next generation bluetooth technology, and the LE-audio bluetooth chip supporting the next generation bluetooth technology is configured to receive at least two paths of audio data and perform audio denoising processing according to all the received paths of audio data.
In another embodiment of the present application, the bluetooth chip 110 may include at least a master chip and a slave chip (not shown, and described later in conjunction with fig. 2), the master chip is configured to receive one channel of audio data, and the slave chip is configured to receive another channel of audio data, where: the slave chip is also used for transmitting one path of audio data received by the slave chip to the master chip; and the master chip is also used for carrying out audio noise reduction processing according to all the received audio data after receiving the audio data transmitted by the slave chip. In this embodiment, the audio processing device 100 includes at least two bluetooth chips 110, each bluetooth chip 110 may receive one path of audio data, and the slave chip transmits the received one path of audio data to the master chip, and the master chip performs noise reduction processing. Because different paths of audio data are picked up aiming at the same call scene, the main chip can mutually reference the different paths of audio data, and can improve the noise reduction reliability and the quality of a voice signal compared with the audio noise reduction processing based on only one path of audio data; accordingly, the voice signal is transmitted to the terminal device and then transmitted to the other party of the call by the terminal device, so that the call quality can be improved, and the user experience of the call scene is improved. In addition, each Bluetooth chip receives one path of audio data, so that compared with the situation that one Bluetooth chip receives at least two paths of audio data, the data transmission rate can be improved, and the diversified requirements of data processing can be met. This embodiment is described later in connection with fig. 2.
In yet another embodiment of the present application, the bluetooth chip 110 may include at least a master chip and a slave chip (not shown, and described later in conjunction with fig. 3), where the master chip is configured to receive one channel of audio data, and the slave chip is configured to receive another channel of audio data. The slave chip is also used for carrying out audio noise reduction processing according to one path of audio data received by the slave chip, and transmitting a first noise-reduced voice signal obtained after noise reduction to the master chip; the main chip is also used for carrying out audio noise reduction processing according to the audio data received by the main chip to obtain a second noise-reduced voice signal after noise reduction, and carrying out voice fusion with the second noise-reduced voice signal after receiving the first noise-reduced voice signal transmitted by the slave chip.
In this embodiment, the audio processing device 100 includes at least two bluetooth chips 110, each bluetooth chip 110 may receive one path of audio data, the slave chip and the master chip respectively perform audio noise reduction on the received one path of audio data to obtain noise-reduced voice signals, the slave chip transmits the noise-reduced voice signals obtained after processing to the master chip, the master chip performs voice fusion on the noise-reduced voice signals from the slave chip and the noise-reduced voice signals obtained after processing by the master chip, and obtains final noise-reduced voice signals of the picked-up conversation scene and transmits the final noise-reduced voice signals to the terminal device. Because different paths of audio data are picked up aiming at the same call scene, the master chip and the slave chip respectively carry out noise reduction processing on the audio data, and after the voice fusion is carried out on the obtained different noise-reduced voice signals, the noise reduction reliability can be improved compared with the audio noise reduction processing only based on one path of audio data, so that the quality of the voice signals is improved; accordingly, the voice signal is transmitted to the terminal device and then transmitted to the other party of the call by the terminal device, so that the call quality can be improved, and the user experience of the call scene is improved. In addition, each Bluetooth chip receives one path of audio data, so that compared with the situation that one Bluetooth chip receives at least two paths of audio data, the data transmission rate can be improved, and the diversified requirements of data processing can be met. Furthermore, each Bluetooth chip only performs noise reduction processing on one path of audio data, so that the data processing efficiency of a single Bluetooth chip can be improved. This embodiment is described later in connection with fig. 3.
A schematic block diagram of an audio processing device 200 and its data interaction with a microphone on a bluetooth sound pickup device and a terminal device according to a more specific embodiment of the present application will now be described with reference to fig. 2. As shown in fig. 2, the audio processing device 200 includes at least two bluetooth chips, each bluetooth chip is configured to receive one path of audio data, so that the audio processing device 200 receives at least two paths of audio data, where the at least two paths of audio data are obtained by at least two microphones respectively picking up sound in the same call scene. Wherein, the at least two bluetooth chips include a slave chip 210 and a master chip 220 (for simplicity, only one slave chip is shown in fig. 2, and in practice, a plurality of slave chips may be included), wherein: the master chip 220 is configured to receive one path of audio data, the slave chip 210 is configured to receive another path of audio data, and the slave chip 210 is further configured to transmit one path of audio data received by the slave chip to the master chip 220; the master chip 220 is further configured to, after receiving the audio data transmitted by the slave chip 210, perform audio noise reduction processing based on all the received audio data to obtain a noise-reduced voice signal of the call scene, and transmit the noise-reduced voice signal to the terminal device.
In an embodiment of the present application, the audio processing 200 includes at least two bluetooth chips, where each bluetooth chip receives one path of audio data from a bluetooth sound pickup device; the slave chip 210 transmits the received audio data to the master chip 220; after the master chip 220 receives the audio data from the slave chip 210, audio denoising processing is performed on the basis of the audio data from the slave chip 210 and the audio data received by the slave chip, so as to obtain a denoised voice signal of a call scene; because the main chip 220 performs audio noise reduction processing on the basis of at least two paths of audio data acquired by picking up sound in the same call scene, different paths of audio data are mutually referenced, and compared with the case of performing audio noise reduction processing on the basis of only one path of audio data, the reliability of noise reduction can be improved, so that the quality of voice signals is improved; accordingly, the voice signal is transmitted to the terminal device and then transmitted to the other party of the call by the terminal device, so that the call quality can be improved, and the user experience of the call scene is improved. In addition, the terminal device can also be used for performing operations such as real-time transcription, real-time conversation and real-time translation on the acquired voice signals, and can also upload the voice signals to a cloud. In addition, since at least two bluetooth chips are included, the master chip 220 and the slave chip 110 each receive one audio data, and thus, compared with a bluetooth chip that receives at least two audio data, the data transmission rate can be improved.
In an embodiment of the present application, the main chip 220 may be further configured to: calculating voice energy values of all the received audio data respectively (such as through Fourier transformation, A weighting and the like); comparing the voice energy values of all paths of audio data to obtain a comparison result; and synthesizing the multiple paths of audio data in all paths of audio data into one path of audio data based on the comparison result, and then carrying out audio noise reduction processing, or selecting the path of audio data with the largest voice energy value to carry out audio noise reduction processing, so as to obtain the noise-reduced voice signal of the call scene.
For example, when the difference between the voice energy values of all the paths of audio data at each frequency point is not large, it indicates that the sound pickup effect of each path of audio data is equivalent, and at this time, all the paths of audio data may be synthesized into one path of audio data and then audio noise reduction processing is performed; for another example, when the difference between the voice energies of different paths of audio data at some frequency points is large, it indicates that the difference between the sound pickup effects of different paths of audio data is large, and at this time, the path of audio data with the largest voice energy value may be selected to perform the audio denoising processing; this is advantageous for obtaining a better quality speech signal.
Further, the main chip 220 may also be used to: for one path of audio data, when the voice energy value of one path of audio data is smaller than a first threshold (the threshold can be set as required), or the difference between the voice energy value of the other path of audio data and the voice energy value of one path of audio data is larger than a second threshold (the threshold can be set as required), a control signal is sent to the bluetooth sound pickup equipment where the microphone collecting the one path of audio data is located, so as to control the microphone collecting the one path of audio data to be turned off.
For example, by comparing, it is determined that the picked-up voice energy value of one microphone (e.g., microphone a) for the current call scenario is significantly larger than that of the other microphone (e.g., microphone B) for the current call scenario, which indicates that in the current call scenario, the speaker is closer to microphone a, farther from microphone B, or there are no other noise sources near microphone a and other noise sources near microphone B; based on this, the main chip 220 may control the sound pickup of the bluetooth sound pickup apparatus, for example, control only the microphone a to pick up sound and control the microphone B to turn off in the subsequent call of the current call scene, so that the power of the bluetooth sound pickup apparatus may be saved and a high-quality sound pickup result may be obtained.
In one embodiment of the present application, the audio processing device 200 may be implemented as a bluetooth adapter (dongle) or a charging cradle of a bluetooth headset having the above-described structure. The aforementioned at least two microphones may include at least three microphones, for example, and the two microphones are microphones disposed at different positions on the same bluetooth sound pickup device. Or, the aforementioned at least two microphones may include two microphones respectively disposed on two different bluetooth sound pickup devices, where the two bluetooth sound pickup devices include, for example, a left ear earphone and a right ear earphone of a bluetooth earphone, and the left ear earphone is provided with at least one microphone, and the right ear earphone is provided with at least one other microphone; or two independent bluetooth sound equipment are included, and each bluetooth sound equipment is provided with a microphone respectively.
The interaction between the bluetooth sound pickup device and the audio processing device 200, which can improve the quality of the speech sound, is described below by taking the bluetooth sound pickup device as a wireless bluetooth headset and taking the bluetooth dongle as an example (when the bluetooth sound pickup device is a wireless bluetooth sound, the interaction process is also similar, and is not described in detail below).
For example, in a conversation scenario, a first speaker makes a call with a second speaker through a wireless bluetooth headset, and during the conversation, the left side of the first speaker has other colleagues talking, while the right side is relatively quiet. In the conversation scene, the left earphone and the right earphone of the wireless Bluetooth earphone of the caller A pick up sound to obtain two paths of audio data, namely left path audio data and right path audio data; after the bluetooth sound pickup device and the bluetooth dongle are connected via bluetooth, the left audio data and the right audio data can be respectively transmitted to two bluetooth chips of the bluetooth dongle, for example, the slave chip 210 receives the left audio data, and the master chip 220 receives the right audio data; then, the slave chip 210 transmits the left audio data to the master chip 220; main chip 220 is through comparing left way audio data and right way audio data, it has more noise (the pronunciation of coming from speaker A left side colleague) to confirm that there is in left way audio data, and the pronunciation of right way audio data is comparatively pure, so select right way audio data to carry out audio frequency and fall the noise processing, obtain the back speech signal of falling the noise of this conversation scene, fall the back speech signal of falling the noise and be conveyed to terminal equipment (be the cell-phone in this scene) through wired or wireless mode, like this, speaker B will hear the pronunciation that has higher quality, thereby have better user experience. Compared with the prior art in which the wireless Bluetooth headset only adopts the pickup of one microphone, for example, the left headset is just adopted to pick up the audio data with much noise, the scheme of the application can reduce the interference noise in the call scene to a greater extent, and effectively improve the voice call quality. In the scene, at least one microphone of each of two earphones of the Bluetooth earphone is fused into a pickup array, and noise reduction processing is preferentially carried out after multipath pickup, so that the conversation quality is improved; in addition, multi-channel voice collection is realized by using dongle, and individual control and data processing of the two earphones can also be realized by using dongle, as described above.
In the above call scenario example, the audio processing device 200 is replaced with the charging bin from the bluetooth dongle, and the process is the same, which is not described herein again. Data interaction between the wireless bluetooth headset and the charging bin in another example call scenario is described below.
In another example of a call scenario, the wireless bluetooth headset is not worn on the ear of the user, but is placed in a charging bin for storage and charging, and the wireless bluetooth headset and the charging bin can form a conference pickup system. That is, in this example, the wireless bluetooth headset no longer functions as a listening device, but merely as a sound pickup device, and a speaker may be provided on the charging chamber to play sound. In the example, a left earphone and a right earphone of the wireless Bluetooth earphone pick up sound of the participant to obtain two paths of audio data, namely a left path of audio data and a right path of audio data; when the earphone is connected with the pins of the charging bin or connected with the Bluetooth, the left audio data and the right audio data can be respectively transmitted to two Bluetooth chips of the charging bin, for example, the slave chip 210 receives the left audio data, and the master chip 220 receives the right audio data; then, the slave chip 210 transmits the left audio data to the master chip 220; the main chip 220 determines that the voice energy value of the left audio data is smaller and the voice energy value of the right audio data is larger by comparing the left audio data and the right audio data (for example, the left earphone may be farther away from the participant speaker and the right earphone may be closer to the participant speaker), and then the right audio data may be selected for audio noise reduction processing to obtain a noise-reduced voice signal of the conversation scene, and then the noise-reduced voice signal is transmitted to the terminal device (which may be a computer in the scene) in a wired or wireless manner, so that the remote participant will hear the voice with higher quality, thereby having better user experience. In the above example, the charging bin may also digitize the voice signal obtained by noise reduction, convert the voice signal into a recording file, and store the recording file locally to form a local recording system. The audio file can be transmitted to the terminal device in real time or non-real time (by means of bluetooth HID/BLE/SPP/A2DP/HFP or USB, etc.), and the terminal device can perform functions of transcription, conversation or translation, etc., or upload to the cloud, etc.
The audio processing device and its data interaction with the bluetooth sound pickup device and the terminal device according to a more specific embodiment of the present application are exemplarily described above. Based on the above description, the audio processing device 200 according to the embodiment of the present application can acquire at least two paths of audio data from at least two microphones on at least one bluetooth sound pickup device for the same call scene, perform audio denoising processing based on the at least two paths of audio data, obtain denoised voice signals of the call scene, and transmit the denoised voice signals to the terminal device, so that the noise reduction reliability can be improved, thereby improving the quality of voice call and improving the user experience. In addition, each Bluetooth chip receives one path of audio data, so that the data transmission rate can be improved compared with the case that one Bluetooth chip receives at least two paths of audio data.
A schematic block diagram of an audio processing device 300 and its data interaction with a microphone on a bluetooth sound pickup device and a terminal device according to another more specific embodiment of the present application will now be described with reference to fig. 3. As shown in fig. 3, the audio processing device 300 includes at least two bluetooth chips, and each bluetooth chip is configured to receive one path of audio data, so that the audio processing device 300 receives at least two paths of audio data, where the at least two paths of audio data are obtained by at least two microphones respectively picking up sound in the same call scene. Wherein, the at least two bluetooth chips include a slave chip 310 and a master chip 320 (for simplicity, only one slave chip is shown in fig. 3, and in practice, a plurality of slave chips may be included), wherein: the main chip 320 is configured to receive one path of audio data, the slave chip 310 is configured to receive another path of audio data, and the slave chip 310 is further configured to perform audio noise reduction processing according to the received one path of audio data, and transmit a first noise-reduced voice signal obtained after noise reduction to the main chip 320; the main chip 320 is further configured to perform audio noise reduction processing according to a path of audio data received by the main chip, so as to obtain a second noise-reduced voice signal after noise reduction, and perform voice fusion with the second noise-reduced voice signal after receiving the first noise-reduced voice signal transmitted by the slave chip 310.
In the embodiment of the present application, the audio processing 300 includes at least two bluetooth chips, and each bluetooth chip receives one path of audio data from a bluetooth sound pickup device; the slave chip 310 and the master chip 320 respectively perform audio noise reduction on one path of received audio data to obtain noise-reduced voice signals, the slave chip 310 transmits the noise-reduced voice signals obtained after processing to the master chip 320, the master chip 320 performs voice fusion on the noise-reduced voice signals from the slave chip 310 and the noise-reduced voice signals obtained after processing, and finally the noise-reduced voice signals of the picked-up call scene are obtained and transmitted to the terminal equipment. Because different paths of audio data are picked up for the same call scene, after the slave chip 310 and the master chip 320 perform noise reduction processing on the audio data, the slave chip 310 and the master chip 320 perform voice fusion on different noise-reduced voice signals, and then the noise reduction reliability can be improved compared with the case that the audio noise reduction processing is performed only on one path of audio data, so that the quality of the voice signals is improved; accordingly, the voice signal is transmitted to the terminal device and then transmitted to the other party of the call by the terminal device, so that the call quality can be improved, and the user experience of the call scene is improved. In addition, each Bluetooth chip receives one path of audio data, so that compared with the situation that one Bluetooth chip receives at least two paths of audio data, the data transmission rate can be improved, and the diversified requirements of data processing can be met. Furthermore, each Bluetooth chip only performs noise reduction processing on one path of audio data, so that the data processing efficiency of a single Bluetooth chip can be improved.
In one embodiment of the present application, the audio processing device 300 may be implemented as a bluetooth adapter (dongle) or a charging cradle of a bluetooth headset having the above-described structure. The aforementioned at least two microphones may comprise at least three microphones, for example, and two of the microphones are microphones disposed at different positions on the same bluetooth sound pickup device to form a microphone array. Alternatively, the aforementioned at least two microphones may include two microphones respectively disposed on two different bluetooth sound pickup devices, for example, a left ear earphone and a right ear earphone of a bluetooth earphone, or two bluetooth sound devices.
According to another aspect of the present application, there is also provided an audio processing system, which may comprise at least two of the audio processing device 100, the audio processing device 200 or the audio processing device 300 described above, described below in connection with fig. 4.
Fig. 4 shows a schematic block diagram of an audio processing system 400 according to an embodiment of the application. As shown in fig. 4, the audio processing system 400 includes at least two audio processing devices (for simplicity, only two audio processing devices 410 and 420 are shown in fig. 4), and each of the at least two audio processing devices processes at least two paths of audio data to cooperatively implement noise reduction processing on multiple paths of audio data collected in the same call scene. The audio processing devices 410 and 420 may be the audio processing device 100, the audio processing device 200, or the audio processing device 300 according to the embodiment of the present application, which are described above. Having described the structure and operation of the audio processing device 100, the audio processing device 200, or the audio processing device 300 in detail, those skilled in the art can understand the structure and operation of the audio processing devices 410 and 420 in conjunction with the above description and apply to more complex multi-person conversation scenarios, for example. And will not be described again for brevity.
Generally, the audio processing system 400 according to the embodiment of the present application includes at least two audio processing devices, and each audio processing device can improve the noise reduction reliability as described above, so as to improve the quality of a voice signal, and therefore, multiple audio processing devices in the audio processing system 400 cooperate with each other, so that the noise reduction processing on multiple channels of audio data acquired by picking up sound in the same call scene under a more complex scene can be implemented, and the quality of the voice signal can be significantly improved.
Although the illustrative embodiments have been described herein with reference to the accompanying drawings, it is to be understood that the above-described illustrative embodiments are only exemplary, and are not intended to limit the scope of the present application thereto. Various changes and modifications may be effected therein by one of ordinary skill in the pertinent art without departing from the scope or spirit of the present application. All such changes and modifications are intended to be included within the scope of the present application as claimed in the appended claims.
Those of ordinary skill in the art will appreciate that the various illustrative elements and algorithm steps described in connection with the embodiments disclosed herein may be implemented as electronic hardware, or combinations of computer software and electronic hardware. Whether such functionality is implemented as hardware or software depends upon the particular application and design constraints imposed on the implementation. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present application.
In the several embodiments provided in the present application, it should be understood that the disclosed apparatus and method may be implemented in other ways. For example, the above-described device embodiments are merely illustrative, and for example, the division of the units is only one logical functional division, and other divisions may be realized in practice, for example, a plurality of units or components may be combined or integrated into another device, or some features may be omitted, or not executed.
In the description provided herein, numerous specific details are set forth. However, it is understood that embodiments of the application may be practiced without these specific details. In some instances, well-known methods, structures and techniques have not been shown in detail in order not to obscure an understanding of this description.
Similarly, it should be appreciated that in the description of exemplary embodiments of the present application, various features of the present application are sometimes grouped together in a single embodiment, figure, or description thereof for the purpose of streamlining the application and aiding in the understanding of one or more of the various inventive aspects. However, the method of the present application should not be construed to reflect the intent: this application is intended to cover such departures from the present disclosure as come within known or customary practice in the art to which this invention pertains. Rather, as the following claims reflect, inventive aspects lie in less than all features of a single disclosed embodiment. Thus, the claims following the detailed description are hereby expressly incorporated into this detailed description, with each claim standing on its own as a separate embodiment of this application.
It will be understood by those skilled in the art that all of the features disclosed in this specification (including any accompanying claims, abstract and drawings), and all of the processes or elements of any method or apparatus so disclosed, may be combined in any combination, except combinations where such features are mutually exclusive. Each feature disclosed in this specification (including any accompanying claims, abstract and drawings) may be replaced by alternative features serving the same, equivalent or similar purpose, unless expressly stated otherwise.
Furthermore, those skilled in the art will appreciate that while some embodiments described herein include some features included in other embodiments, rather than other features, combinations of features of different embodiments are meant to be within the scope of the application and form different embodiments. For example, in the claims, any of the claimed embodiments may be used in any combination.
The various component embodiments of the present application may be implemented in hardware, or in software modules running on one or more processors, or in a combination thereof. Those skilled in the art will appreciate that a microprocessor or Digital Signal Processor (DSP) may be used in practice to implement some or all of the functionality of some of the modules according to the embodiments of the application. The present application may also be embodied as apparatus programs (e.g., computer programs and computer program products) for performing a portion or all of the methods described herein. Such programs implementing the present application may be stored on a computer readable medium or may be in the form of one or more signals. Such a signal may be downloaded from an internet website or provided on a carrier signal or in any other form.
It should be noted that the above-mentioned embodiments illustrate rather than limit the application, and that those skilled in the art will be able to design alternative embodiments without departing from the scope of the appended claims. In the claims, any reference signs placed between parentheses shall not be construed as limiting the claim. The word "comprising" does not exclude the presence of elements or steps not listed in a claim. The word "a" or "an" preceding an element does not exclude the presence of a plurality of such elements. The application may be implemented by means of hardware comprising several distinct elements, and by means of a suitably programmed computer. In the unit claims enumerating several means, several of these means may be embodied by one and the same item of hardware. The usage of the words first, second and third, etcetera do not indicate any ordering. These words may be interpreted as names.
The above description is only for the specific embodiments of the present application or the description thereof, and the protection scope of the present application is not limited thereto, and any person skilled in the art can easily conceive of the changes or substitutions within the technical scope disclosed in the present application, and shall be covered by the protection scope of the present application. The protection scope of the present application shall be subject to the protection scope of the claims.

Claims (10)

1. The audio processing equipment is characterized by comprising a Bluetooth chip, wherein the Bluetooth chip is used for receiving at least two paths of audio data, and the at least two paths of audio data are obtained by respectively picking up sound of at least two microphones in the same conversation scene;
the Bluetooth chip is further used for carrying out audio noise reduction processing on the basis of all the received audio data to obtain noise-reduced voice signals of the conversation scene, and transmitting the noise-reduced voice signals to the terminal equipment.
2. The audio processing device according to claim 1, wherein the bluetooth chip at least includes a master chip and a slave chip, the master chip is configured to receive one path of the audio data, and the slave chip is configured to receive another path of the audio data, wherein:
the slave chip is also used for transmitting the audio data received by the slave chip to the master chip; the master chip is also used for carrying out the audio noise reduction processing according to all the received audio data after receiving the audio data transmitted by the slave chip; alternatively, the first and second electrodes may be,
the slave chip is also used for carrying out audio noise reduction processing according to the audio data received by the slave chip, and transmitting a first noise-reduced voice signal obtained after noise reduction to the master chip; the master chip is further used for carrying out the audio denoising processing according to the audio data received by the master chip, so as to obtain a second denoised voice signal after denoising, and carrying out voice fusion with the second denoised voice signal after receiving the first denoised voice signal transmitted by the slave chip.
3. The audio processing device as claimed in claim 1, wherein the bluetooth chip comprises at least an LE-audio bluetooth chip supporting next generation bluetooth technology, and the LE-audio bluetooth chip supporting next generation bluetooth technology is configured to receive the at least two audio data paths and perform the audio denoising process according to all the received audio data paths.
4. The audio processing device according to any one of claims 1 to 3, wherein the step of the audio noise reduction processing performed by the Bluetooth chip specifically includes:
respectively calculating voice energy values of all the received audio data;
comparing the voice energy values of all the paths of audio data to obtain a comparison result;
and synthesizing multiple paths of audio data in all paths of audio data into one path of audio data based on the comparison result, and then performing audio noise reduction processing, or selecting one path of audio data with the largest voice energy value to perform audio noise reduction processing, so as to obtain noise-reduced voice signals of the call scene.
5. The audio processing device of claim 4, wherein the Bluetooth chip is further configured to:
for audio data all the way, when audio data's the speech energy value of all the way is less than first threshold value, perhaps another way audio data's the speech energy value with when audio data's the difference of the speech energy value of all the way is greater than the second threshold value, to gathering audio data's the bluetooth sound pickup equipment that the microphone of all the way is located sends control signal, in order to control the gathering audio data's the microphone of all the way is closed.
6. The audio processing device of any of claims 1-3, wherein the at least two microphones comprise at least three microphones, and wherein two microphones are microphones disposed at different locations on the same Bluetooth pickup device.
7. The audio processing device of any of claims 1-3, wherein the at least two microphones comprise two microphones disposed on two different Bluetooth enabled sound pickup devices, respectively.
8. The audio processing device of claim 7, wherein the two Bluetooth sound pickup devices comprise a left ear headphone and a right ear headphone of a Bluetooth headphone or comprise two Bluetooth sound devices.
9. The audio processing device of any of claims 1-3, wherein the audio processing device comprises a Bluetooth adapter or a charging cradle of a Bluetooth headset.
10. An audio processing system, characterized in that the audio processing system comprises at least two audio processing devices according to any one of claims 1 to 9, and each of the at least two audio processing devices processes at least two paths of audio data to cooperatively realize the noise reduction processing of multiple paths of audio data obtained by picking up sound in the same call scene.
CN202211104274.6A 2022-09-09 2022-09-09 Audio processing apparatus and audio processing system Pending CN115499761A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202211104274.6A CN115499761A (en) 2022-09-09 2022-09-09 Audio processing apparatus and audio processing system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202211104274.6A CN115499761A (en) 2022-09-09 2022-09-09 Audio processing apparatus and audio processing system

Publications (1)

Publication Number Publication Date
CN115499761A true CN115499761A (en) 2022-12-20

Family

ID=84469193

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202211104274.6A Pending CN115499761A (en) 2022-09-09 2022-09-09 Audio processing apparatus and audio processing system

Country Status (1)

Country Link
CN (1) CN115499761A (en)

Similar Documents

Publication Publication Date Title
US9756422B2 (en) Noise estimation in a mobile device using an external acoustic microphone signal
CN104521247B (en) Bluetooth headset hearing aid and anti-noise method and apparatus
CN103874000B (en) A kind of hearing instrument
CN106960670B (en) Recording method and electronic equipment
US9866947B2 (en) Dual-microphone headset and noise reduction processing method for audio signal in call
CN109195045A (en) The method, apparatus and earphone of test earphone wearing state
CN111464905A (en) Hearing enhancement method and system based on intelligent wearable device and wearable device
CN108549206A (en) A kind of band has the smartwatch of voice interactive function earphone
CN106604167B (en) A kind of method and mobile terminal of adjust automatically earphone left and right acoustic channels output volume
CN111683319A (en) Call pickup noise reduction method, earphone and storage medium
CN111402915A (en) Signal processing method, device and system
WO2021170061A1 (en) Wireless sound amplification system and terminal
CN111491228A (en) Noise reduction earphone and control method thereof
JP2023510159A (en) Bluetooth communication method and apparatus
CN113411726A (en) Audio processing method, device and system
CN108235165B (en) Microphone neck ring earphone
CN105847566A (en) Mobile terminal audio volume adjusting method and device
CN111081238B (en) Bluetooth sound box voice interaction control method, device and system
CN113207056B (en) Wireless earphone and transparent transmission method, device and system thereof
CN114333886A (en) Audio processing method and device, electronic equipment and storage medium
CN113038318B (en) Voice signal processing method and device
CN208597147U (en) A kind of bluetooth headset
CN116033312B (en) Earphone control method and earphone
WO2014006220A1 (en) Consumer electronics device adapted for hearing loss compensation
CN115499761A (en) Audio processing apparatus and audio processing system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination