WO2022218175A1 - Method for estimating echo delay between distributed devices, and electronic device - Google Patents

Method for estimating echo delay between distributed devices, and electronic device Download PDF

Info

Publication number
WO2022218175A1
WO2022218175A1 PCT/CN2022/084862 CN2022084862W WO2022218175A1 WO 2022218175 A1 WO2022218175 A1 WO 2022218175A1 CN 2022084862 W CN2022084862 W CN 2022084862W WO 2022218175 A1 WO2022218175 A1 WO 2022218175A1
Authority
WO
WIPO (PCT)
Prior art keywords
electronic device
distance
delay
audio
echo
Prior art date
Application number
PCT/CN2022/084862
Other languages
French (fr)
Chinese (zh)
Inventor
丁浩
钟小飞
李刚
张斌
Original Assignee
华为技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 华为技术有限公司 filed Critical 华为技术有限公司
Publication of WO2022218175A1 publication Critical patent/WO2022218175A1/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M9/00Arrangements for interconnection not involving centralised switching
    • H04M9/08Two-way loud-speaking telephone systems with means for conditioning the signal, e.g. for suppressing echoes for one or both directions of traffic
    • H04M9/082Two-way loud-speaking telephone systems with means for conditioning the signal, e.g. for suppressing echoes for one or both directions of traffic using echo cancellers
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0264Noise filtering characterised by the type of parameter measurement, e.g. correlation techniques, zero crossing techniques or predictive techniques
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M9/00Arrangements for interconnection not involving centralised switching
    • H04M9/08Two-way loud-speaking telephone systems with means for conditioning the signal, e.g. for suppressing echoes for one or both directions of traffic
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L2021/02082Noise filtering the noise being echo, reverberation of the speech

Definitions

  • the present application relates to the field of electronic technologies, and in particular, to a method for estimating an echo delay of a distributed device and an electronic device.
  • the voice of the remote user played by the audio output device will be acquired by the audio input device and the remote user's voice will be transmitted through the network.
  • the sound is transmitted to the remote user.
  • the user will not only hear the voice he has just spoken, but also the interference of the echo may become larger and larger, resulting in howling, which greatly affects the user's experience.
  • a feasible processing method is: by fixing the relative position of the audio output device and the audio input device, manually measure the propagation delay of the sound between the audio output device and the audio input device, and combine the audio output device with the audio output device.
  • the echo delay is determined by the hardware and software characteristics of the audio input device. After determining the echo delay, manually configure the echo delay parameters for the echo cancellation module on the audio output device, so that the echo cancellation module can work normally and avoid the audio output device from playing echoes.
  • this method can only solve the echo problem when the audio output device and the audio input device are relatively unchanged.
  • the echo delay parameter that has been configured to the echo cancellation module is no longer valid. , the electronic device cannot effectively filter out the echo.
  • the embodiment of the present application provides a method for estimating echo delay of distributed devices.
  • the method includes: an audio output device establishes a spatial coordinate system, continuously receives location information transmitted by multiple audio input devices through a network connection, and based on different audio input devices The location information of the device continuously updates the echo delay of different audio input devices, thereby eliminating the echo.
  • the first electronic device determines a first distance, where the first distance is the distance between the first electronic device and the second electronic device at the first moment; the first electronic device is based on the The first distance and the speed of sound determine a first propagation delay, where the first propagation delay is the time delay between the propagation of the sound signal from the first electronic device to the second electronic device at the first moment; the first electronic device Determining that the first propagation delay is the first echo delay, or the first electronic device determines that the sum of the first propagation delay and the first processing delay is the first echo delay, and the first processing delay The sum of the delay from acquiring the voice signal to playing the voice signal by the first electronic device and the delay of acquiring the voice signal by the second electronic device and forwarding the voice signal to the first electronic device by the second electronic device.
  • the first electronic device determines the propagation delay between the first electronic device and the second electronic device by determining the distance between the first electronic device and the second electronic device, and further based on the propagation delay Determines the echo delay.
  • the method first allows the electronic device itself to move at will, without affecting the effect of echo cancellation, which facilitates the user's experience; and does not require manual measurement of echo delay, users can participate in online video, conferences, and calls at any time with their electronic devices. , without worrying about echo effects.
  • the electronic device determines a second distance, where the second distance is the distance between the first electronic device and the second time at the second time The distance between the electronic devices; the first electronic device determines a second propagation delay based on the second distance and the speed of sound, and the second propagation delay is when the sound signal is transmitted from the first electronic device to the first electronic device at the second moment.
  • the first electronic device continuously updates the distance between the second electronic device and the first electronic device, and further updates the echo delay, so that the user can move the first electronic device/second electronic device at will, which improves the user experience.
  • the method before the first electronic device determines the first distance at the first moment, the method further includes: at a third moment before the first moment, the first electronic device determines the first distance. A processing delay.
  • the first electronic device determines the first processing delay by determining the first processing delay.
  • the echo delay between the second electronic device and the first electronic device is used to perform echo cancellation on the echo related to the second electronic device.
  • the first electronic device determines the first processing delay at a third time before the first time, specifically including: at a third time before the first time, When the distance between the first electronic device and the second electronic device is less than the distance threshold, the first electronic device plays the first audio; the first audio; the first electronic device determines that the time difference between playing the first audio and receiving the first audio is the first processing delay.
  • the user can bring the second electronic device close to the first electronic device, and determine the first processing delay by calculating the time difference between the audio signal passing between the second electronic device and the first electronic device, so that the first electronic device
  • the device can determine the different processing delays caused by different types of electronic devices, and thus determine the echo delays.
  • the first electronic device determines the first processing delay, which specifically includes: at a third time before the first time, responding to According to the user's input, the first electronic device plays the first audio; the first electronic device receives the first audio sent by the second electronic device through the wireless network/near field communication service; the first electronic device determines to play the first audio
  • the time difference between the audio and receiving the first audio is the first processing delay.
  • the user can bring the second electronic device close to the first electronic device, and determine the first processing delay by calculating the time difference between the audio signal passing between the second electronic device and the first electronic device, so that the first electronic device
  • the device can determine the different processing delays caused by different types of electronic devices, and thus determine the echo delays.
  • determining the first distance by the first electronic device specifically includes: the first electronic device receives first motion information, where the first motion information includes the distance of the second electronic device. motion state; the first electronic device determines the first distance based on the first motion information.
  • the first electronic device receives the motion information of the second electronic device, such as the acceleration, speed, attitude angle and other parameters in the X-axis, Y-axis, and Z-axis directions, which helps to determine the first electronic device more accurately distance from the second electronic device.
  • the motion information of the second electronic device such as the acceleration, speed, attitude angle and other parameters in the X-axis, Y-axis, and Z-axis directions, which helps to determine the first electronic device more accurately distance from the second electronic device.
  • determining the first distance by the first electronic device specifically includes: the first electronic device receives first location information, where the first location information includes a distance of the second electronic device. location; the first electronic device determines the first distance based on the first location information.
  • the first electronic device receives the position information of the second electronic device, and then the distance between the first electronic device and the second electronic device can be determined, which is helpful for more accurate determination of the first electronic device and the second electronic device. distance between electronic devices.
  • determining the first distance by the first electronic device specifically includes: the first electronic device receiving first distance information, where the first distance information includes the first distance; the The first electronic device determines the first distance based on the first distance information.
  • the second electronic device determines the distance from the first electronic device, it informs the first electronic device of the distance, thereby reducing the computational burden of the first electronic device.
  • the first electronic device determines a second processing delay, where the second processing delay is from acquiring the voice signal to playing the voice signal by the first electronic device The sum of the delay time and the delay time of the third electronic device acquiring the voice signal to the third electronic device and forwarding the voice signal to the first electronic device; at the fifth time after the fourth time, the first electronic device determines the third distance, the The third distance is the distance between the first electronic device and the third electronic device at the fifth moment; the first electronic device determines a third propagation delay based on the third distance and the speed of sound, and the third propagation delay is At the fifth moment, the time delay between the sound signal propagating from the first electronic device to the third electronic device. The first electronic device determines that the sum of the third propagation delay and the second processing delay is a third echo delay.
  • the electronic device can determine the echo delays of multiple devices, and use the corresponding echo delays to cancel the echoes related to different devices, which improves the user experience.
  • the first electronic device determines the first processing delay.
  • the first electronic device determines a first distance, where the first distance is the distance between the first electronic device and the second electronic device at the first moment; the first electronic device is based on the first distance and the speed of sound to determine a first propagation delay, where the first propagation delay is the delay between the propagation of the sound signal from the first electronic device to the second electronic device at the first moment; the first electronic device determines the first propagation delay A propagation delay is the first echo delay, or the first electronic device determines that the sum of the first propagation delay and the first processing delay is the first echo delay, and the first processing delay is the The sum of the delay from acquiring the voice signal to playing the voice signal by the first electronic device and the delay of the second electronic device acquiring the voice signal to the second electronic device forwarding the voice signal to the first electronic device.
  • the first electronic device determines the propagation delay between the first electronic device and the second electronic device by determining the distance between the first electronic device and the second electronic device, and further based on the propagation delay Determines the echo delay.
  • the method first allows the electronic device itself to move at will, without affecting the effect of echo cancellation, which facilitates the user's experience; and does not require manual measurement of echo delay, users can participate in online video, conferences, and calls at any time with their electronic devices. , without worrying about echo effects.
  • the first electronic device determines the first processing delay, which specifically includes: when the first electronic device and the second electronic device communicate with each other.
  • the first electronic device plays the first audio; the second electronic device collects the first audio; the second electronic device sends the first electronic device through the wireless network/near field communication service. the first audio; the first electronic device determines that the time difference between playing the first audio and receiving the first audio is the first processing delay.
  • the user can bring the second electronic device close to the first electronic device, and determine the first processing delay by calculating the time difference between the audio signal passing between the second electronic device and the first electronic device, so that the first electronic device
  • the device can determine the different processing delays caused by different types of electronic devices, and thus determine the echo delays.
  • the first electronic device plays signaling, which specifically includes: before the first moment At the third moment, in response to the user's input, the first electronic device plays the first audio; the second electronic device collects the first audio; the second electronic device sends the first electronic The device sends the first audio; the first electronic device determines that the time difference between playing the first audio and receiving the first audio is the first processing delay.
  • the user can bring the second electronic device close to the first electronic device, and determine the first processing delay by calculating the time difference between the audio signal passing between the second electronic device and the first electronic device, so that the first electronic device
  • the device can determine the different processing delays caused by different types of electronic devices, and thus determine the echo delays.
  • the first electronic device determining the first distance specifically includes: the second electronic device determining first motion information, where the first motion information includes the second electronic device The second electronic device sends motion information to the first electronic device; the first electronic device receives the motion information, and the first electronic device determines the first distance based on the first motion information.
  • the second electronic device records its own motion information such as acceleration, speed, attitude angle and other parameters in the directions of the X-axis, Y-axis, and Z-axis, and sends the motion information to the first electronic device, which is helpful for The distance between the first electronic device and the second electronic device is more accurately determined.
  • the first electronic device determining the first distance specifically includes: the second electronic device determining first location information, where the first location information includes the second electronic device The second electronic device sends the first position information to the first electronic device; the first electronic device receives the first position information, and the first electronic device determines the first distance based on the first position information.
  • the second electronic device can determine its own position based on the motion information, and send the position information including its own position to the first electronic device, so as to determine the distance between the first electronic device and the second electronic device.
  • the distance helps to more accurately determine the distance between the first electronic device and the second electronic device.
  • the first electronic device determining the first distance specifically includes: the second electronic device determining first distance information, where the first distance information includes the first distance; The second electronic device sends the first distance information to the second electronic device; the first electronic device receives the first distance information, and the first electronic device determines the first distance based on the first position information.
  • the second electronic device determines the distance from the first electronic device, it informs the first electronic device of the distance, thereby reducing the computational burden of the first electronic device.
  • an embodiment of the present application provides an electronic device, the electronic device includes: one or more processors and a memory; the memory is coupled to the one or more processors, and the memory is used to store computer program codes,
  • the computer program code includes computer instructions invoked by the one or more processors to cause the electronic device to perform:
  • the first electronic device determines a first distance, where the first distance is the distance between the first electronic device and the second electronic device at the first moment; the first electronic device is based on the first distance and The speed of sound determines a first propagation delay, where the first propagation delay is the delay between the propagation of the sound signal from the first electronic device to the second electronic device at the first moment; the first electronic device determines the first propagation delay The propagation delay is the first echo delay, or the first electronic device determines that the sum of the first propagation delay and the first processing delay is the first echo delay, and the first processing delay is the first echo delay The sum of the delay from acquiring the voice signal to playing the voice signal by the electronic device and the delay of the second electronic device acquiring the voice signal to the second electronic device forwarding the voice signal to the first electronic device.
  • the one or more processors are further configured to invoke the computer instructions to cause the electronic device to execute: at a second time after the first time, the electronic device determining a second distance, where the second distance is the distance between the first electronic device and the second electronic device at the second moment; the first electronic device determines a second propagation delay based on the second distance and the speed of sound, the The second propagation delay is the delay between the transmission of the sound signal from the first electronic device to the second electronic device at the second moment; the first electronic device determines that the second propagation delay is the second echo delay , or the first electronic device determines that the sum of the second propagation delay and the first processing delay is the second echo delay; the second echo delay is different from the first echo delay.
  • the one or more processors are further configured to invoke the computer instructions to cause the electronic device to execute: at a third moment before the first moment, the first The electronic device determines the first processing delay.
  • the one or more processors are specifically configured to invoke the computer instructions to cause the electronic device to execute: at a third moment before the first moment, when the first moment When the distance between an electronic device and a second electronic device is less than the distance threshold, the first electronic device plays the first audio; the first electronic device receives the first audio sent by the second electronic device through a wireless network/near field communication service ; the first electronic device determines that the time difference between playing the first audio and receiving the first audio is the first processing delay.
  • the one or more processors are specifically configured to invoke the computer instructions to cause the electronic device to execute: at a third time point before the first time point, in response to a user input, the first electronic device plays the first audio; the first electronic device receives the first audio sent by the second electronic device through the wireless network/near field communication service; the first electronic device determines to play the first audio and The time difference between receiving the first audio is the first processing delay.
  • the one or more processors are specifically configured to invoke the computer instructions to cause the electronic device to execute: the first electronic device receives first motion information, the first A motion information includes a motion state of the second electronic device; the first electronic device determines the first distance based on the first motion information.
  • the one or more processors are specifically configured to invoke the computer instructions to cause the electronic device to execute: the first electronic device receives first location information, the first A location information includes the location of the second electronic device; the first electronic device determines the first distance based on the first location information.
  • the one or more processors are specifically configured to invoke the computer instructions to cause the electronic device to execute: the first electronic device receives the first distance information, the first A distance information includes the first distance; the first electronic device determines the first distance based on the first distance information.
  • the one or more processors are further configured to invoke the computer instruction to cause the electronic device to execute: at the fourth moment, the first electronic device determines the second process Delay, the second processing delay is the delay from the acquisition of the voice signal to the playback of the voice signal by the first electronic device and the delay from the acquisition of the voice signal by the third electronic device to the transfer of the voice signal to the first electronic device by the third electronic device. the sum;
  • the first electronic device determines a third distance, where the third distance is the distance between the first electronic device and the third electronic device at the fifth time; the first electronic device is based on The third distance and the speed of sound determine a third propagation time delay, where the third propagation time delay is the time delay between the propagation of the sound signal from the first electronic device to the third electronic device at the fifth moment.
  • the first electronic device determines that the sum of the third propagation delay and the second processing delay is a third echo delay.
  • an embodiment of the present application provides a chip system, the chip system is applied to an electronic device, the chip system includes one or more processors, and the processors are configured to invoke computer instructions to cause the electronic device to execute the first Aspects and methods described in any possible implementation of the first aspect.
  • an embodiment of the present application provides a computer program product containing instructions, when the above computer program product is run on an electronic device, the electronic device is made to perform the first aspect and any possible implementation manner of the first aspect the described method, or perform the method described in the first aspect and any possible implementation manner of the first aspect.
  • the embodiments of the present application provide a computer-readable storage medium, including instructions, when the above-mentioned instructions are run on an electronic device, the electronic device can execute the first aspect and any possible implementation manner of the first aspect. method described.
  • the electronic device provided in the third aspect, the chip system provided in the fourth aspect, the computer program product provided in the fifth aspect, and the computer storage medium provided in the sixth aspect are all used to execute the methods provided by the embodiments of the present application. . Therefore, for the beneficial effects that can be achieved, reference may be made to the beneficial effects in the corresponding method, which will not be repeated here.
  • FIG. 1 is an exemplary schematic diagram of an echo generation process involved in the present application
  • FIG. 2 is an exemplary schematic diagram of propagation delay and processing delay involved in the application
  • FIG. 3 is an exemplary schematic diagram of an echo module involved in the present application.
  • FIG. 4 is an exemplary schematic diagram of a usage scenario of the echo cancellation method involved in the application
  • FIG. 5 is another exemplary schematic diagram of a usage scenario of the echo cancellation method involved in the present application.
  • FIG. 6 is an exemplary schematic diagram of the orientation diagram of the audio input device in FIG. 5;
  • FIG. 7 is an exemplary schematic diagram of a conference scene involved in the application.
  • FIG. 8 is an exemplary schematic diagram of an echo cancellation data flow in the scenario shown in FIG. 7;
  • FIG. 9 is an exemplary schematic diagram of an echo delay estimation method provided by an embodiment of the present application.
  • 10A to 10C are exemplary schematic diagrams of two position calibration methods provided by the embodiments of the present application.
  • FIG. 11A to FIG. 11B are an exemplary schematic diagram of a process of determining an echo delay provided by an embodiment of the present application.
  • FIG. 12 is an exemplary schematic diagram of establishing a three-dimensional space coordinate system by the audio output device 200 provided by the embodiment of the present application;
  • FIG. 13 is an exemplary schematic diagram of an echo delay estimation method under a single device provided by an embodiment of the present application.
  • FIG. 15 is a schematic diagram of an exemplary hardware structure of an electronic device 100 provided by an embodiment of the present application.
  • FIG. 16 is a schematic diagram of an exemplary software structure of an electronic device 100 according to an embodiment of the present application.
  • first and second are only used for descriptive purposes, and should not be construed as implying or implying relative importance or implying the number of indicated technical features. Therefore, the features defined as “first” and “second” may explicitly or implicitly include one or more of the features. In the description of the embodiments of the present application, unless otherwise specified, the “multiple” The meaning is two or more.
  • the local audio output device will play the sound made by the user before, and the sound played by the audio output device is an echo.
  • the far-end audio output device plays the sound, since the near-end audio input device is always working, the sound will be acquired by the near-end audio output device and transmitted to the far-end audio output device through the network.
  • the sound played by the far-end audio output device includes the sound of the far-end user speaking by himself at the previous moment, and the sound is the echo.
  • the following takes the content shown in FIG. 1 as an example to specifically introduce the echo generation process.
  • FIG. 1 is an exemplary schematic diagram of the echo generation process involved in the present application.
  • the voice 1 of the far-end user is transmitted to the near-end audio output device 1 through the network, and the audio output device 1 plays the voice 1 .
  • Both the voice 2 of the near-end user and the voice 1 of the far-end user are acquired by the audio input device 1, and then the voice 1 and the voice 2 are transmitted to the audio output device 2 through the network.
  • Audio output device 2 will play voice 1 and voice 2.
  • voice 1 is an echo.
  • the echo cancellation includes: by pre-estimating the echo delay, combined with an adaptive algorithm, the echo is eliminated from the superimposed signal of the sound signal and the echo signal.
  • the echo cancellation module is the module responsible for echo cancellation.
  • the echo delay is the sum of the propagation delay and the processing delay; when the interaction delay between the audio input device and the audio output device is not When negligible, the echo delay can be the sum of propagation delay, processing delay and interaction delay.
  • the interaction delay between the audio input device and the audio output device can be ignored; when the audio input device and the audio output device are wirelessly connected, In some cases, for example, when the communication quality of the channel carrying the wireless connection is poor, the transmission rate of the channel is low, or the channel delay is large, it is considered that the interaction delay between the audio input device and the audio output device cannot be ignored.
  • wireless connections include wireless networks and short-range communication services.
  • the wireless network includes cellular mobile communication, WIFI, and the like.
  • the short-range communication service can be in many forms, such as Bluetooth, Hi-Link, Near Field Communication (Near Field Communication, NFC), Apple Wireless Direct Link (Apple Wireless Direct Link, AWDL) and other protocols, which are not limited here.
  • the propagation delay is the delay of the spatial propagation of the sound between the audio output device and the audio input device;
  • the processing delay consists of two parts, respectively are processing delay 1 and processing delay 2.
  • the processing delay 1 is the delay from the time when the sound signal on the audio output device is sent to the echo cancellation module until the sound signal is played by the audio playback module of the audio output device; The delay between the acquisition of the audio input device and the sound signal being sent to the echo cancellation module on the audio output device.
  • the processing delay 2 can be divided into a processing delay 21 , an interaction delay, and a processing delay 22 .
  • the processing delay 21 is the time difference between the audio input device collecting the sound signal from the audio input module and the communication module sending the sound signal
  • the interaction delay is the time difference between the audio input device sending the sound signal and the audio output device receiving the sound signal , that is, the interaction delay is related to the communication performance of the channel carrying the data interaction between the audio output device and the audio input device
  • the processing delay 22 is the time between the audio output device receiving the sound signal and the echo cancellation module on the audio output device receiving the sound signal. Time difference.
  • the propagation delay is related to the relative position between the audio output device and the audio input device; the processing delay 1 is related to the hardware and software on the audio output device; the processing delay 2 is related to the audio input device and the audio output device. on the hardware and software.
  • the following takes the content shown in FIG. 2 as an example to exemplarily introduce the propagation delay and the processing delay.
  • FIG. 2 is an exemplary schematic diagram of propagation delay and processing delay involved in the present application.
  • the device will transmit the digital sound signal to the echo cancellation module and to the audio output device 1 at the same time.
  • the audio output device 1 will convert the digital sound signal 1 into an analog sound signal through a D/A converter or the like and play it out.
  • the audio input device 1 acquires the analog sound signal, converts the analog sound signal into a digital sound signal through a D/A converter, etc., and then transmits the digital sound signal to the echo cancellation module. After the echo cancellation module cancels the echo, the digital sound signal output by the echo module is transmitted to the far end through the network.
  • the processing delay 1 is the time difference between the digital sound signals being sent to the audio output device 1 of the echo cancellation module; the propagation delay is the analog sound signal between the audio output device 1 and the audio input device 1.
  • the time of sound wave propagation between the two; the processing delay 2 is the time difference between the audio input device 1 obtains the digital voice signal and the audio input device 1 sends the digital voice signal 2 to the echo cancellation module.
  • the following takes the content shown in FIG. 3 as an example to exemplarily introduce how the echo cancellation module cancels the echo.
  • FIG. 3 is an exemplary schematic diagram of the echo module involved in this application.
  • the input of the echo cancellation module includes: echo delay, near-end sound, and reference sound; the output of the echo cancellation module is near-end sound-echo.
  • the near-end sound is all sound signals obtained by the near-end audio input device; the reference sound is the sound signal transmitted by the far-end through the network.
  • the digital filter in the echo cancellation module calculates the weight of the filter through adaptive algorithms such as: least mean square (LMS) algorithm, normalized least mean square (NLMS) algorithm, proportional normalized least mean square (PNLMS) algorithm .
  • LMS least mean square
  • NLMS normalized least mean square
  • PNLMS proportional normalized least mean square
  • the device manufacturer will pre-test the echo delay of the device, and the tested echo delay The value is written to the echo cancellation module.
  • the echo cancellation module of the device can work normally according to the echo delay parameters determined by the factory test to eliminate the echo.
  • FIG. 4 is an exemplary schematic diagram of a usage scenario of the echo cancellation method involved in this application.
  • the propagation delay can be determined by manually measuring the distance between the audio input device and the audio output device. For example, when the distance between the microphone and the speaker is about 15cm, the propagation delay in the echo delay is about 0.15/340 ⁇ 0 second, which can be ignored.
  • the sum of processing delay 1 and processing delay 2 can be obtained by pre-testing. In this case, the echo delay is equal to the processing delay.
  • the microphone When the user uses the call function, video function, and voice function, after the microphone receives the voice from the local user and the voice of the remote user played by the speaker, the microphone amplifies the analog signal through the amplifier, and quantizes it into digital through the D/A converter. After the signal, the digital signal is passed to the echo cancellation module.
  • the echo cancellation module filters the received sound signal, and after filtering the voice of the remote user, the echo cancellation module encodes the filtered sound signal and transmits it to the opposite end user through the network.
  • the energy of the echo obtained by the audio input device can be reduced to compensate for the inaccurate estimation of the echo delay.
  • the performance of the echo cancellation module to filter out echoes is degraded.
  • FIG. 5 is another exemplary schematic diagram of a usage scenario of the echo cancellation method involved in this application.
  • the user can choose the audio input device and audio output device with strong directionality.
  • the audio input device can choose a pressure-pressure-difference composite microphone.
  • the pressure and differential pressure composite microphone has good directivity, and mainly accepts the sound signal from the main lobe direction, while the sound signal from the side lobe direction will be attenuated to a greater extent.
  • the strength of the directivity is positively related to the ratio of the main lobe to the side lobe in the audio input device/audio output device directional diagram.
  • the direction map is used to describe the normalized response of the audio input device/audio output device to the sound signal in different directions. direction.
  • the direction of the sound played by the audio output device is mainly the direction of the microphone.
  • the side lobe direction will be attenuated to a greater extent.
  • the energy of the sound signal will be further attenuated by reflections from objects such as walls.
  • objects such as walls will be wrapped with absorbing materials, so that the energy of the sound signal propagating along path 2 is further attenuated when reflected by objects such as walls.
  • FIG. 6 is an exemplary schematic diagram of the orientation diagram of the audio input device in FIG. 5 .
  • the audio input device acquires the user's voice from the 0-degree direction, and acquires the echo from the 120-degree to 240-degree direction.
  • the energy of the echo obtained by the audio input device can be reduced by improving the direction map of the audio input device.
  • the propagation delay in the echo delay cannot be ignored, and the echo delays of different audio input devices are different.
  • the absolute spatial positions of the audio output device and the audio input device can change; the relative distance between the audio output device and different audio input devices can also change.
  • the audio output device does not know what type of device the audio input device is; similarly, the audio input device does not know what type of device the audio output device is. In this case, the processing delay of different audio input devices cannot be estimated by pre-measurement.
  • the direction of arrival of the echo on the audio input device is not known, the echo energy cannot be reduced by selecting a highly directional audio input device.
  • the following takes the content shown in FIG. 7 as an example to exemplarily introduce a multi-person conference scenario.
  • FIG. 7 is an exemplary schematic diagram of a conference scenario involved in this application.
  • the audio output device may be a tablet, and the audio input device may be a mobile phone.
  • the local user's random movement will cause the random movement of the audio input device.
  • the propagation delay in the echo delay varies with time.
  • all devices with an audio input function can be used as audio input devices, it is impossible for the audio output device and the audio input device to know each other's processing delay.
  • FIG. 8 is an exemplary schematic diagram of an echo cancellation data flow in the scenario shown in FIG. 7 .
  • the digital sound signal 1 of the far-end user is sent to the near-end audio output device through the network.
  • the audio output device receives the digital sound signal 1, it converts it into an analog sound signal 1 and plays it out.
  • the analog sound signal 1 propagates in the near-end space.
  • the audio input device 1 can acquire the analog sound signal 2 of the near-end user 1 and the analog sound signal 1 propagating in space.
  • the audio input device 1 will acquire all the analog sound signals, convert them into digital sound signals 2, and send them to the audio output module through the network.
  • the audio output module performs echo cancellation on the digital sound signal 2, it sends the echo-cancelled digital sound signal to the remote device.
  • the audio input device 2 can acquire the analog sound signal 3 of the near-end user 2 and the analog sound signal 2 propagating in space.
  • the audio input device 2 will acquire all analog sound signals, convert them into digital sound signals 3, and send them to the audio output module through the network. After the audio output module performs echo cancellation on the digital sound signal 3, it sends the echo-cancelled digital sound signal to the remote device.
  • the echo signal in the sound signal output by the echo cancellation module on the audio output device Since the propagation delay and processing delay in the echo delay cannot be determined, the echo signal in the sound signal output by the echo cancellation module on the audio output device.
  • the processing delay 2 can be determined, but the echo delay cannot be determined.
  • the echo delay can be determined by measuring in advance.
  • the method cannot accurately estimate the echo delay when the user uses it in the real scene, and thus cannot effectively filter and suppress the echo.
  • the audio output device and audio input device can be different types of devices, such as mobile phones, tablets, wristbands, smart eyes, VR/AR devices, vehicle terminals, etc., it is impossible to require audio input devices and audio output devices to have good orientation. sex.
  • the echo delay estimation method provided by the present application is described below by taking a single audio input device 201 and a single audio output device 200 as examples.
  • the echo delay estimation method provided by the present application estimates the processing delay by using signaling. And, using the device's gyroscope and/or accelerometer sensor and/or image sensor and other sensors to perform spatial modeling, estimate the distance between different audio input devices and audio output devices, and then determine the propagation delay.
  • the electronic device may not perform step S905, that is, the processing delay is considered to be negligible, and the propagation delay calculated in steps S906 and S907 is used as the echo delay.
  • FIG. 9 is an exemplary schematic diagram of an echo delay estimation method provided by an embodiment of the present application.
  • the echo delay estimation method provided by the application specifically includes:
  • S901 The user starts a conference application.
  • the conference/call application Before the user is ready to start participating in the network conference/call, the conference/call application will be pre-launched. After starting the conference/call application, the user needs to select a suitable device as the audio input device/audio output device.
  • the electronic device After the electronic device starts the conference application, it will find the available audio input device and audio output device through wired network, wireless network, short-range communication service or image sensor and other methods. After discovering the available audio input devices and audio output devices, the application displays the available devices on the application interface for the user to select.
  • the user can select the tablet as the audio output device 200 and the mobile phone as the audio input device 201; or, the user can select the tablet as the audio output device 200 and the Bluetooth headset as the audio input device 201; or, the user can select the mobile phone as the audio output device 200 , a microphone as the audio input device 201; alternatively, the user can select a projector with an audio function as the audio output device 200, a tablet as the audio input device 201, etc., which are not limited herein.
  • the electronic device reminds the user to start the position calibration.
  • the position calibration is complete.
  • the position calibration mainly includes two methods: first, the user manually confirms that the distance between the audio input device and the audio output device is lower than the distance threshold; second, the audio output device/audio input device determines that the distance between the two is low at the distance threshold.
  • FIGS. 10A to 10C are exemplary schematic diagrams of two position calibration methods provided by the embodiments of the present application.
  • the first position calibration method is shown in FIG. 10A and FIG. 10B .
  • the tablet and the mobile phone will remind the user to bring the audio input device and the audio output device close to each other.
  • an interface 1001 will be displayed on the mobile phone, and the content displayed in the interface 1001 is used to remind the user that the position calibration has been turned on. .
  • the interface 1001 and the interface 2001 may also display other contents, such as informing the user how to complete the position calibration.
  • the user can click the confirmation control 1002 on the audio input device to inform the audio input device that the position calibration has been completed;
  • the OK control of 2002 informs the audio output device that position calibration has been completed.
  • the audio input device can inform the audio output device that the position calibration has been completed; correspondingly, the user clicks the OK control 2002 to inform the audio output device 200 that the position calibration has been completed. calibration.
  • the second position calibration method is shown in FIG. 10A and FIG. 10C .
  • the proximity communication service can be enabled to estimate the distance between the audio input device 201 and the audio output device 200; , laser sensor, etc. to estimate the distance between audio input device 201 and audio output device 200; 200 distance, etc.
  • estimating the distance between the audio input device 201 and the audio output device 200 through the short-range communication service includes: the audio input device 201/audio output device 200 determines the distance between the two according to the strength of the received signal. For example, when the audio input device 201 is connected to the audio output device 200 via Bluetooth, the received signal strength indicator (RSSI) and The distance between the audio input device 201 and the audio output device 200 is determined.
  • RSSI received signal strength indicator
  • d is the distance between the audio input device 201 and the audio output device 200
  • A is the signal strength when the transmitting end and the receiving end are separated by 1 meter
  • n is the attenuation factor related to the environment.
  • the distance between the audio input device 201 and the audio output device 200 is determined, it is determined whether the distance is less than a preset distance threshold. If the distance is less than or equal to the distance threshold, it may be considered that the audio input device 201 is close to the audio output device 200, that is, the position calibration has been completed; if the distance is greater than the distance threshold, it may be considered that the audio input device 201 is not close to the audio output device 200, that is, Position calibration not completed.
  • the mobile phone is used as the audio input device 201
  • the tablet is used as the audio output device 200 .
  • the mobile phone displays whether the calibration is completed on the interface 1001 according to the estimated distance between the mobile phone and the tablet and the distance threshold; similarly, the tablet displays on the interface 1001 whether the calibration is completed according to the estimated distance between the mobile phone and the tablet and the distance threshold. Complete the calibration.
  • the audio output device 200 or the audio input device 201 determines whether a confirmation input from the user is received.
  • the confirmation input is that the user confirms that the audio output device 200 and the audio input device 201 are close to each other in space. How the audio output device 200 or the audio input device 201 obtains the user's input can refer to the content in FIG. 10A , and details are not repeated here. If the audio input device 201 or the audio output device 200 receives the confirmation input of the user, then step S905 and step S906 are performed; if the audio input device 201 or the audio output device 200 does not receive the confirmation input of the user, then step S904 is performed.
  • the audio output device 200 or the audio input device 201 can determine whether the distance between the audio input device 201 and the audio output device 200 is less than or equal to the distance threshold. If it is less than or equal to the distance threshold, go to step S905 and step S906; if it is greater than the distance threshold, go to step S904.
  • S904 remind the user to adjust the positions of the audio input device 201 and the audio output device 200 .
  • the audio input device 201 and/or the audio output device 200 reminds the user to adjust the positions of the audio input device 201 and the audio output device 200 through various methods such as screen display, playing sound, and vibration.
  • the audio input device 201 and/or the audio output device 200 reminds the user to adjust the positions of the audio input device 201 and the audio output device 200 through the content displayed on the screen. Reference may be made to the description in FIG. 10A , which will not be repeated here.
  • Step S903 is executed.
  • the audio output device 200 sends voice signaling, and after receiving the voice signaling, the audio input device 201 forwards the voice signaling to the audio output device 200, and further determines the propagation delay.
  • the audio output device 200 can send signaling to the audio input device 201, and the signaling can be an analog signal (sound wave), which can propagate freely in space, and the audio input device 201 continuously collects sound waves in the space, wherein the sound waves include signaling.
  • the signaling is a signal with specific time-domain waveform characteristics or spectral characteristics.
  • the propagation delay can be ignored in this case, so the echo delay is equal to the processing delay.
  • the audio input device 201 continuously transmits the acquired sound waves to the audio output device 200 through the network or the short-range communication service.
  • the audio output device 200 can determine whether the received signal includes the signaling according to the time domain feature, frequency domain feature, and time domain-frequency domain joint feature of the signaling, and then determine that the signaling is sent from the echo cancellation module to the audio input device 201.
  • the time difference T2 between echo cancellation modules sent to the audio output device 200 is the processing delay.
  • the audio output device 200 can use methods such as cross-correlation function, matched filter, spectrum analysis, time-frequency analysis, etc. to determine whether to The signalling is received, and the time when the signalling is received.
  • the audio output device 200 may inform the audio input device 201 that the position calibration is successful; or, the user may inform the audio input device 201 that the position calibration is successful through interaction .
  • the user can freely move the audio input device 201 and/or the audio output device 200 .
  • the audio input device 201 may send the signaling, and after receiving the signaling, the audio output device 200 forwards the signaling to the audio input device 201 to determine the propagation delay. After the audio input device 201 determines the propagation delay, it sends the value of the propagation delay to the audio output device 200 .
  • FIG. 11A to FIG. 11B are taken as an example to exemplarily introduce the process of the audio output device 200 determining whether the received signal includes signaling and the audio output device 200 determining the processing delay.
  • FIG. 11A to FIG. 11B are exemplary schematic diagrams of a process of determining an echo delay provided by an embodiment of the present application.
  • the signaling 1 is generated by the audio output device 200 , it can be transmitted to the audio output module of the audio output device 200 via the echo cancellation module of the audio output device 200 .
  • the audio output module of the audio output device 200 converts the digital signal signaling 1 into a sound wave signal and plays it through a speaker; the audio input module in the audio input device 201 acquires and converts the sound wave signaling 1 into a digital signal signaling 1.
  • the audio input module transmits signaling 1 to the communication module of the audio input device 201 .
  • the communication module of the audio input device 201 will transmit the signaling 1 to the communication module of the audio output device 200 through the network or the short-range transmission service.
  • the communication module of the audio output device 200 transmits the received signaling 1 to the echo cancellation module of the audio output device 200 .
  • the audio output device 200 records the time when signaling 1 is sent by the echo cancellation module as T 1 , and records the time when signaling 1 is received by the echo cancellation module as T 2 .
  • processing delay calculated by sending the signaling is equivalent to calculating that the sound sent from the remote electronic device reaches the communication module on the audio input device 201 to the audio input device 201 and forwards the echo to the audio output device. Latency between communication modules on 200.
  • the audio output device 200 can determine that the echo module at time T2 receives the signaling 1 sent by the audio input device 201 according to the time domain feature of the signaling 1 . Further, the processing delay in the echo processing is determined.
  • Step S908 is executed.
  • the audio output device 200 may establish a three-dimensional space coordinate system with itself as the origin.
  • both the audio output device 200 and the audio input device 201 may establish a three-dimensional space coordinate system with itself as the origin.
  • the three-dimensional space coordinate system may be a Cartesian coordinate system, a polar coordinate system, a spherical coordinate system, or the like, which is not limited here.
  • Step S907 is executed.
  • the audio input device 201 sends motion information, location information or distance information to the audio output device 200 .
  • the audio input device 201 may periodically send motion information to the audio output device 200 through the network/near field communication service.
  • the motion information includes one or more of speed, acceleration, azimuth angle, pitch angle, yaw angle, and the like.
  • the audio output device 200 receives the motion information sent by the audio input device 201, it can calculate the coordinates of the audio input device 201 in the three-dimensional space coordinate system, and then determine the distance between the audio input device 201 and itself, and determine the echo delay according to the distance. propagation delay.
  • the audio input device 201 or the audio output device 200 may combine the motion information and an inertial navigation algorithm to determine the position information of the device corresponding to the motion.
  • the audio input device 201 can determine its own coordinates according to the motion information.
  • the audio input device 201 sends its own location information to the audio output device 200 .
  • the location information may include coordinates.
  • the audio output device 200 determines the location of the audio input device 201 , the distance between the two can be determined according to its own location and the location of the audio input device 201 .
  • the audio input device 201 also establishes a three-dimensional space coordinate system, when the audio output device 200 is located at the origin, the audio input device can directly determine its own coordinates according to the motion information, and determine itself and the audio output device 200 according to its own coordinates. and send the distance information to the audio output device 200.
  • the distance information includes the distance between the audio input device 201 and the audio output device 200 .
  • Step S908 is executed.
  • the audio input device 201 may periodically send motion information, location information or distance information to the audio output device 200 .
  • the audio input device 201 determines that the distance between the current position and the position when the position information and distance information were sent to the audio output device 200 last time is greater than the threshold, the audio input device 201 sends the position information or distance information to the audio output device 200 .
  • FIG. 12 is an exemplary schematic diagram of establishing a three-dimensional space coordinate system by the audio output device 200 according to an embodiment of the present application.
  • the audio input device 201 can determine its own coordinates according to the motion information. For example, at a certain moment, the audio input device 201 determines its own coordinates as (x 0 , y 0 , z 0 ) according to the motion information; at another moment, the audio input device 201 determines its own coordinates as (x 1 according to the motion information) , y 1 , z 1 );
  • the audio input device 201 can send its own position information such as (x 0 , y 0 , z 0 ) or (x 1 , y 1 , z 1 ) to the audio output device 200 through the network; or, when the audio output device 200 does not move In the case of , the audio input device 201 converts its distance information as or send to the audio output device; or, when the audio input device 201 determines the position information of the audio output device 200 such as (x 2 , y 2 , z 2 ), the audio input device 201 sends its own distance information such as or sent to the audio output device 200 .
  • the audio output device 200 can determine the distance between the audio output device 200 and the audio input device 201 after receiving the motion information or the position information sent by the audio input device 201 .
  • the echo cancellation module determines the echo delay, and performs echo cancellation on the received sound signal.
  • the audio output device 200 can determine the processing delay in the echo delay; according to steps S906 and S907, can determine the propagation delay in the echo delay.
  • the audio output device 200 can filter out the echo in the sound obtained by the audio input device 201 according to the echo delay, and send the filtered sound. to the remote user's device.
  • steps S901 to S908 can be completely executed, so that the audio output device 200 can determine the echo of the new device of the new user. extension.
  • the echo delay of the new device is related to the distance between the new device and the audio output device 200 .
  • the echo delay estimation method provided by the present application as shown in FIG. 9 is exemplarily introduced in the form of a data flow.
  • FIG. 13 is an exemplary schematic diagram of an echo delay estimation method under a single device provided by an embodiment of the present application.
  • step S901 and S902 after the user starts the conference/call application, and selects the appropriate audio input device 201 and audio output device 200, the audio input device 201 and the audio output device 200 are started. Position calibration.
  • the audio input device 201 and the audio output device 200 continuously perform position calibration until the position calibration is successful.
  • the position calibration is successful, when the audio input device 201/audio output device 200 provides interactive controls for the user, whether the audio input device 201/audio output device 200 accepts the user's input position; or, the audio input device 201/
  • the audio output device 200 may determine whether to complete the position calibration through the mobile cellular network, WIFI, and short-range communication services.
  • the audio output device 200 After the audio output device 200 receives the confirmation input from the user, the audio output device 200 determines that the position calibration is successful, and sends a message to inform the audio input device 201 that the position calibration is successful; correspondingly, if the audio input device 201 receives the confirmation input from the user, the audio The input device 201 determines that the position calibration is successful, and sends a message to inform the audio output device 200 that the position calibration is successful.
  • the audio input device 201 or the audio output device 200 uses a mobile cellular network, WIFI, or short-range communication service, it is determined whether the distance between the two is less than a distance threshold. Among them, any one of the audio input device 201 or the audio output device 200 determines that the distance between the two is less than the distance threshold according to the mobile cellular network, WIFI, and short-range communication services, considers that the position calibration is successful, and sends a message to inform the other device. Calibration is successful; or, both the audio input device 201 and the audio output device 200 determine that the distance between them is less than the distance threshold according to the mobile cellular network, WIFI, and short-range communication services, and it is considered that the position calibration is successful.
  • the audio output device 200 plays a signaling (sound wave) in the space, and the audio input device 201 continuously collects the sound in the space, converts the sound into a digital signal, and transmits it through the network/nearby network.
  • the distance communication service is transmitted to the audio output device 200 .
  • the audio output device 200 determines the time difference between sending the signaling and receiving the signaling as the processing delay in the echo delay.
  • the audio output device 200 establishes a three-dimensional space coordinate system, wherein the initial positions of the audio output device 200 and the audio input device 201 are (0, 0, 0).
  • the audio input device 201 may periodically calculate and update its position (coordinates) in the three-dimensional space coordinate system, and send the information to the audio output device 200 .
  • the audio output device 200 can determine the distance between the audio input device 201 and the audio output device 200 in combination with the location of the audio output device 200. Still further, the propagation delay among the echo delays can be determined according to the distance between the audio input device 201 and the audio output device 200 . Combined with the processing delays determined between, the audio output device 200 can determine the echo delays.
  • the audio output device 200 can continuously adjust the parameters of the echo cancellation module based on the determined echo delay, so that echo cancellation can be performed on the received sound signal.
  • the following takes multiple audio input devices and a single audio output device as examples to exemplarily introduce the echo delay estimation method provided by the present application as shown in FIG. 9 in the form of a data flow.
  • the new device when a new device joins the call or conference, the new device, as the audio input device 202, can perform position calibration with other audio input devices 201 that have completed position calibration and have started to work normally, and The location information and the processing delay 21 are sent to the audio output device 200 .
  • the audio output device 200 determines the echo delay of the audio input device 202 according to the network/near field communication service delay, the received location information, the processing delay 21 , and the processing delay 22 .
  • FIG. 14 is an exemplary schematic diagram of an echo delay estimation method under multiple devices provided by an embodiment of the present application.
  • the audio output device 200 has determined and continuously updated the echo delay of the audio input device 201 .
  • the position calibration can be performed with the audio input device 201 as a reference.
  • steps S902 , S903 and S904 when the new audio input device 202 joins the conference/call, it can perform position calibration with the audio input device 201 .
  • the audio input device 201 sends the position (coordinates) information of the audio input device 201 to the audio input device 202 as the position of the audio input device 201 .
  • the audio input device 202 may periodically calculate and update its position (coordinates) in the three-dimensional space coordinate system, and send the information to the audio output device 200 .
  • the audio output device 200 determines and updates the echo delay of the audio input device 202 based on the location information of the audio input device 202.
  • the audio input device 202 sends its own processing delay 21 to the audio output device 200 .
  • the audio output device 200 may determine its own processing delay 22 and may determine the interaction delay determined by the network/near field communication service. In turn, the audio output device 200 may determine the echo delay of the audio input device 202 .
  • the audio output device 200 can adjust the parameters of the echo cancellation module based on the echo delay of the audio input device 202 to filter out the echo in the sound collected by the audio input device 202; correspondingly, the audio output device 200 can be based on the audio input device.
  • the echo delay of 201 adjusts the parameters of the echo cancellation module to filter out the echo in the sound collected by the audio input device 201 .
  • the audio input device 201 , the audio input device 202 , and the audio output device 200 in the embodiments of the present application may be the electronic device 100 hereinafter.
  • FIG. 15 is a schematic diagram of an exemplary hardware structure of an electronic device 100 according to an embodiment of the present application.
  • the electronic device 100 may be a cell phone, tablet computer, desktop computer, laptop computer, handheld computer, notebook computer, ultra-mobile personal computer (UMPC), netbook, as well as cellular telephones, personal digital assistants (personal digital assistants) digital assistant (PDA), augmented reality (AR) devices, virtual reality (VR) devices, artificial intelligence (AI) devices, wearable devices, in-vehicle devices, smart home devices and/or Smart city equipment, the embodiments of the present application do not specifically limit the specific type of the electronic equipment.
  • PDA personal digital assistants
  • AR augmented reality
  • VR virtual reality
  • AI artificial intelligence
  • the electronic device 100 may include a processor 110, an external memory interface 120, an internal memory 121, a universal serial bus (USB) interface 130, a charge management module 140, a power management module 141, a battery 142, an antenna 1, an antenna 2 , mobile communication module 150, wireless communication module 160, audio module 170, speaker 170A, receiver 170B, microphone 170C, headphone jack 170D, sensor module 180, buttons 190, motor 191, indicator 192, camera 193, display screen 194, and Subscriber identification module (subscriber identification module, SIM) card interface 195 and so on.
  • SIM Subscriber identification module
  • the sensor module 180 may include a pressure sensor 180A, a gyroscope sensor 180B, an air pressure sensor 180C, a magnetic sensor 180D, an acceleration sensor 180E, a distance sensor 180F, a proximity light sensor 180G, a fingerprint sensor 180H, a temperature sensor 180J, a touch sensor 180K, and ambient light. Sensor 180L, bone conduction sensor 180M, etc.
  • the structures illustrated in the embodiments of the present invention do not constitute a specific limitation on the electronic device 100 .
  • the electronic device 100 may include more or less components than shown, or combine some components, or separate some components, or arrange different components.
  • the illustrated components may be implemented in hardware, software, or a combination of software and hardware.
  • the processor 110 may include one or more processing units, for example, the processor 110 may include an application processor (application processor, AP), a modem processor, a graphics processor (graphics processing unit, GPU), an image signal processor (image signal processor, ISP), controller, video codec, digital signal processor (digital signal processor, DSP), baseband processor, and/or neural-network processing unit (neural-network processing unit, NPU), etc. Wherein, different processing units may be independent devices, or may be integrated in one or more processors.
  • application processor application processor, AP
  • modem processor graphics processor
  • ISP image signal processor
  • controller video codec
  • digital signal processor digital signal processor
  • baseband processor baseband processor
  • neural-network processing unit neural-network processing unit
  • the controller can generate an operation control signal according to the instruction operation code and timing signal, and complete the control of fetching and executing instructions.
  • a memory may also be provided in the processor 110 for storing instructions and data.
  • the memory in processor 110 is cache memory. This memory may hold instructions or data that have just been used or recycled by the processor 110 . If the processor 110 needs to use the instruction or data again, it can be called directly from the memory. Repeated accesses are avoided and processor 110 latency is reduced, thereby increasing the efficiency of the system.
  • the processor 110 may include one or more interfaces.
  • the interface may include an integrated circuit (inter-integrated circuit, I2C) interface, an integrated circuit built-in audio (inter-integrated circuit sound, I2S) interface, a pulse code modulation (pulse code modulation, PCM) interface, a universal asynchronous transceiver (universal asynchronous transmitter) receiver/transmitter, UART) interface, mobile industry processor interface (MIPI), general-purpose input/output (GPIO) interface, subscriber identity module (SIM) interface, and / or universal serial bus (universal serial bus, USB) interface, etc.
  • I2C integrated circuit
  • I2S integrated circuit built-in audio
  • PCM pulse code modulation
  • PCM pulse code modulation
  • UART universal asynchronous transceiver
  • MIPI mobile industry processor interface
  • GPIO general-purpose input/output
  • SIM subscriber identity module
  • USB universal serial bus
  • the I2C interface is a bidirectional synchronous serial bus that includes a serial data line (SDA) and a serial clock line (SCL).
  • the processor 110 may contain multiple sets of I2C buses.
  • the processor 110 can be respectively coupled to the touch sensor 180K, the charger, the flash, the camera 193 and the like through different I2C bus interfaces.
  • the processor 110 may couple the touch sensor 180K through the I2C interface, so that the processor 110 and the touch sensor 180K communicate with each other through the I2C bus interface, so as to realize the touch function of the electronic device 100 .
  • the I2S interface can be used for audio communication.
  • the processor 110 may contain multiple sets of I2S buses.
  • the processor 110 may be coupled with the audio module 170 through an I2S bus to implement communication between the processor 110 and the audio module 170 .
  • the audio module 170 can transmit audio signals to the wireless communication module 160 through the I2S interface, so as to realize the function of answering calls through a Bluetooth headset.
  • the PCM interface can also be used for audio communications, sampling, quantizing and encoding analog signals.
  • the audio module 170 and the wireless communication module 160 may be coupled through a PCM bus interface.
  • the audio module 170 can also transmit audio signals to the wireless communication module 160 through the PCM interface, so as to realize the function of answering calls through the Bluetooth headset. Both the I2S interface and the PCM interface can be used for audio communication.
  • the UART interface is a universal serial data bus used for asynchronous communication.
  • the bus may be a bidirectional communication bus. It converts the data to be transmitted between serial communication and parallel communication.
  • a UART interface is typically used to connect the processor 110 with the wireless communication module 160 .
  • the processor 110 communicates with the Bluetooth module in the wireless communication module 160 through the UART interface to implement the Bluetooth function.
  • the audio module 170 can transmit audio signals to the wireless communication module 160 through the UART interface, so as to realize the function of playing music through the Bluetooth headset.
  • the MIPI interface can be used to connect the processor 110 with peripheral devices such as the display screen 194 and the camera 193 .
  • MIPI interfaces include camera serial interface (CSI), display serial interface (DSI), etc.
  • the processor 110 communicates with the camera 193 through a CSI interface, so as to realize the photographing function of the electronic device 100 .
  • the processor 110 communicates with the display screen 194 through the DSI interface to implement the display function of the electronic device 100 .
  • the GPIO interface can be configured by software.
  • the GPIO interface can be configured as a control signal or as a data signal.
  • the GPIO interface may be used to connect the processor 110 with the camera 193, the display screen 194, the wireless communication module 160, the audio module 170, the sensor module 180, and the like.
  • the GPIO interface can also be configured as I2C interface, I2S interface, UART interface, MIPI interface, etc.
  • the USB interface 130 is an interface that conforms to the USB standard specification, and may specifically be a Mini USB interface, a Micro USB interface, a USB Type C interface, and the like.
  • the USB interface 130 can be used to connect a charger to charge the electronic device 100, and can also be used to transmit data between the electronic device 100 and peripheral devices. It can also be used to connect headphones to play audio through the headphones.
  • the interface can also be used to connect other electronic devices, such as AR devices.
  • the interface connection relationship between the modules illustrated in the embodiment of the present invention is only a schematic illustration, and does not constitute a structural limitation of the electronic device 100 .
  • the electronic device 100 may also adopt different interface connection manners in the foregoing embodiments, or a combination of multiple interface connection manners.
  • the charging management module 140 is used to receive charging input from the charger.
  • the power management module 141 is used for connecting the battery 142 , the charging management module 140 and the processor 110 .
  • the wireless communication function of the electronic device 100 may be implemented by the antenna 1, the antenna 2, the mobile communication module 150, the wireless communication module 160, the modulation and demodulation processor, the baseband processor, and the like.
  • Antenna 1 and Antenna 2 are used to transmit and receive electromagnetic wave signals.
  • Each antenna in electronic device 100 may be used to cover a single or multiple communication frequency bands. Different antennas can also be reused to improve antenna utilization.
  • the antenna 1 can be multiplexed as a diversity antenna of the wireless local area network. In other embodiments, the antenna may be used in conjunction with a tuning switch.
  • the mobile communication module 150 may provide wireless communication solutions including 2G/3G/4G/5G etc. applied on the electronic device 100 .
  • the mobile communication module 150 may include at least one filter, switch, power amplifier, low noise amplifier (LNA) and the like.
  • the mobile communication module 150 can receive electromagnetic waves from the antenna 1, filter and amplify the received electromagnetic waves, and transmit them to the modulation and demodulation processor for demodulation.
  • the mobile communication module 150 can also amplify the signal modulated by the modulation and demodulation processor, and then turn it into an electromagnetic wave for radiation through the antenna 1 .
  • at least part of the functional modules of the mobile communication module 150 may be provided in the processor 110 .
  • at least part of the functional modules of the mobile communication module 150 may be provided in the same device as at least part of the modules of the processor 110 .
  • the modem processor may include a modulator and a demodulator.
  • the modulator is used to modulate the low frequency baseband signal to be sent into a medium and high frequency signal.
  • the demodulator is used to demodulate the received electromagnetic wave signal into a low frequency baseband signal. Then the demodulator transmits the demodulated low-frequency baseband signal to the baseband processor for processing.
  • the low frequency baseband signal is processed by the baseband processor and passed to the application processor.
  • the application processor outputs sound signals through audio devices (not limited to the speaker 170A, the receiver 170B, etc.), or displays images or videos through the display screen 194 .
  • the modem processor may be a stand-alone device.
  • the modem processor may be independent of the processor 110, and may be provided in the same device as the mobile communication module 150 or other functional modules.
  • the wireless communication module 160 can provide applications on the electronic device 100 including wireless local area networks (WLAN) (such as wireless fidelity (Wi-Fi) networks), bluetooth (BT), global navigation satellites Wireless communication solutions such as global navigation satellite system (GNSS), frequency modulation (FM), near field communication (NFC), and infrared technology (IR).
  • WLAN wireless local area networks
  • BT Bluetooth
  • GNSS global navigation satellite system
  • FM frequency modulation
  • NFC near field communication
  • IR infrared technology
  • the wireless communication module 160 may be one or more devices integrating at least one communication processing module.
  • the wireless communication module 160 receives electromagnetic waves via the antenna 2 , frequency modulates and filters the electromagnetic wave signals, and sends the processed signals to the processor 110 .
  • the wireless communication module 160 can also receive the signal to be sent from the processor 110 , perform frequency modulation on it, amplify it, and convert it into electromagnetic waves for radiation through the antenna 2 .
  • the antenna 1 of the electronic device 100 is coupled with the mobile communication module 150, and the antenna 2 is coupled with the wireless communication module 160, so that the electronic device 100 can communicate with the network and other devices through wireless communication technology.
  • the wireless communication technology may include global system for mobile communications (GSM), general packet radio service (GPRS), code division multiple access (CDMA), broadband Code Division Multiple Access (WCDMA), Time Division Code Division Multiple Access (TD-SCDMA), Long Term Evolution (LTE), BT, GNSS, WLAN, NFC , FM, and/or IR technology, etc.
  • the GNSS may include global positioning system (global positioning system, GPS), global navigation satellite system (global navigation satellite system, GLONASS), Beidou navigation satellite system (beidou navigation satellite system, BDS), quasi-zenith satellite system (quasi -zenith satellite system, QZSS) and/or satellite based augmentation systems (SBAS).
  • global positioning system global positioning system, GPS
  • global navigation satellite system global navigation satellite system, GLONASS
  • Beidou navigation satellite system beidou navigation satellite system, BDS
  • quasi-zenith satellite system quadsi -zenith satellite system, QZSS
  • SBAS satellite based augmentation systems
  • the electronic device 100 implements a display function through a GPU, a display screen 194, an application processor, and the like.
  • the GPU is a microprocessor for image processing, and is connected to the display screen 194 and the application processor.
  • the GPU is used to perform mathematical and geometric calculations for graphics rendering.
  • Processor 110 may include one or more GPUs that execute program instructions to generate or alter display information.
  • Display screen 194 is used to display images, videos, and the like.
  • the electronic device 100 may implement a shooting function through an ISP, a camera 193, a video codec, a GPU, a display screen 194, an application processor, and the like.
  • the ISP is used to process the data fed back by the camera 193 .
  • the shutter is opened, the light is transmitted to the camera photosensitive element through the lens, the light signal is converted into an electrical signal, and the camera photosensitive element transmits the electrical signal to the ISP for processing, and converts it into an image visible to the naked eye.
  • Camera 193 is used to capture still images or video.
  • the object is projected through the lens to generate an optical image onto the photosensitive element.
  • the photosensitive element may be a charge coupled device (CCD) or a complementary metal-oxide-semiconductor (CMOS) phototransistor.
  • CMOS complementary metal-oxide-semiconductor
  • the photosensitive element converts the optical signal into an electrical signal, and then transmits the electrical signal to the ISP to convert it into a digital image signal.
  • the ISP outputs the digital image signal to the DSP for processing.
  • DSP converts digital image signals into standard RGB, YUV and other formats of image signals.
  • the electronic device 100 may include 1 or N cameras 193 , where N is a positive integer greater than 1.
  • a digital signal processor is used to process digital signals, in addition to processing digital image signals, it can also process other digital signals. For example, when the electronic device 100 selects a frequency point, the digital signal processor is used to perform Fourier transform on the frequency point energy and so on.
  • Video codecs are used to compress or decompress digital video.
  • the electronic device 100 may support one or more video codecs.
  • the electronic device 100 can play or record videos of various encoding formats, such as: Moving Picture Experts Group (moving picture experts group, MPEG) 1, MPEG2, MPEG3, MPEG4 and so on.
  • MPEG Moving Picture Experts Group
  • MPEG2 moving picture experts group
  • MPEG3 MPEG4
  • MPEG4 Moving Picture Experts Group
  • the NPU is a neural-network (NN) computing processor.
  • NN neural-network
  • Applications such as intelligent cognition of the electronic device 100 can be implemented through the NPU, such as image recognition, face recognition, speech recognition, text understanding, and the like.
  • the internal memory 121 may include one or more random access memories (RAM) and one or more non-volatile memories (NVM).
  • RAM random access memories
  • NVM non-volatile memories
  • Random access memory can include static random-access memory (SRAM), dynamic random access memory (DRAM), synchronous dynamic random access memory (SDRAM), double data rate synchronization Dynamic random access memory (double data rate synchronous dynamic random access memory, DDR SDRAM, such as the fifth generation DDR SDRAM is generally called DDR5 SDRAM), etc.;
  • SRAM static random-access memory
  • DRAM dynamic random access memory
  • SDRAM synchronous dynamic random access memory
  • DDR SDRAM double data rate synchronous dynamic random access memory
  • DDR SDRAM double data rate synchronous dynamic random access memory
  • DDR SDRAM double data rate synchronous dynamic random access memory
  • DDR5 SDRAM double data rate synchronous dynamic random access memory
  • Non-volatile memory may include magnetic disk storage devices, flash memory.
  • Flash memory can be divided into NOR FLASH, NAND FLASH, 3D NAND FLASH, etc. according to the operating principle, and can include single-level memory cell (SLC), multi-level memory cell (multi-level memory cell, SLC) according to the level of storage cell potential.
  • cell, MLC multi-level memory cell
  • TLC triple-level cell
  • QLC quad-level cell
  • UFS universal flash storage
  • eMMC embedded multimedia memory card
  • the random access memory can be directly read and written by the processor 110, and can be used to store executable programs (eg, machine instructions) of an operating system or other running programs, and can also be used to store data of users and application programs.
  • executable programs eg, machine instructions
  • the random access memory can be directly read and written by the processor 110, and can be used to store executable programs (eg, machine instructions) of an operating system or other running programs, and can also be used to store data of users and application programs.
  • the non-volatile memory can also store executable programs and store data of user and application programs, etc., and can be loaded into the random access memory in advance for the processor 110 to directly read and write.
  • the external memory interface 120 can be used to connect an external non-volatile memory, so as to expand the storage capacity of the electronic device 100 .
  • the external non-volatile memory communicates with the processor 110 through the external memory interface 120 to realize the data storage function. For example, save music, video, etc. files in external non-volatile memory.
  • the electronic device 100 may implement audio functions through an audio module 170, a speaker 170A, a receiver 170B, a microphone 170C, an earphone interface 170D, an application processor, and the like. Such as music playback, recording, etc.
  • the audio module 170 is used for converting digital audio information into analog audio signal output, and also for converting analog audio input into digital audio signal. Audio module 170 may also be used to encode and decode audio signals. In some embodiments, the audio module 170 may be provided in the processor 110 , or some functional modules of the audio module 170 may be provided in the processor 110 .
  • Speaker 170A also referred to as a "speaker" is used to convert audio electrical signals into sound signals.
  • the electronic device 100 can listen to music through the speaker 170A, or listen to a hands-free call.
  • the receiver 170B also referred to as "earpiece" is used to convert audio electrical signals into sound signals.
  • the voice can be answered by placing the receiver 170B close to the human ear.
  • the microphone 170C also called “microphone” or “microphone” is used to convert sound signals into electrical signals.
  • the user can make a sound by approaching the microphone 170C through a human mouth, and input the sound signal into the microphone 170C.
  • the electronic device 100 may be provided with at least one microphone 170C. In other embodiments, the electronic device 100 may be provided with two microphones 170C, which can implement a noise reduction function in addition to collecting sound signals. In other embodiments, the electronic device 100 may further be provided with three, four or more microphones 170C to collect sound signals, reduce noise, identify sound sources, and implement directional recording functions.
  • the earphone jack 170D is used to connect wired earphones.
  • the earphone interface 170D may be the USB interface 130, or may be a 3.5mm open mobile terminal platform (OMTP) standard interface, a cellular telecommunications industry association of the USA (CTIA) standard interface.
  • OMTP open mobile terminal platform
  • CTIA cellular telecommunications industry association of the USA
  • the pressure sensor 180A is used to sense pressure signals, and can convert the pressure signals into electrical signals.
  • the pressure sensor 180A may be provided on the display screen 194 .
  • the capacitive pressure sensor may be comprised of at least two parallel plates of conductive material. When a force is applied to the pressure sensor 180A, the capacitance between the electrodes changes.
  • the electronic device 100 determines the intensity of the pressure according to the change in capacitance. When a touch operation acts on the display screen 194, the electronic device 100 detects the intensity of the touch operation according to the pressure sensor 180A.
  • the electronic device 100 may also calculate the touched position according to the detection signal of the pressure sensor 180A.
  • touch operations acting on the same touch position but with different touch operation intensities may correspond to different operation instructions. For example, when a touch operation whose intensity is less than the first pressure threshold acts on the short message application icon, the instruction for viewing the short message is executed. When a touch operation with a touch operation intensity greater than or equal to the first pressure threshold acts on the short message application icon, the instruction to create a new short message is executed.
  • the gyro sensor 180B may be used to determine the motion attitude of the electronic device 100 .
  • the angular velocity of electronic device 100 about three axes ie, x, y, and z axes
  • the gyro sensor 180B can be used for image stabilization.
  • the gyro sensor 180B detects the shaking angle of the electronic device 100, calculates the distance that the lens module needs to compensate according to the angle, and allows the lens to offset the shaking of the electronic device 100 through reverse motion to achieve anti-shake.
  • the gyro sensor 180B can also be used for navigation and somatosensory game scenarios.
  • the air pressure sensor 180C is used to measure air pressure.
  • the electronic device 100 calculates the altitude through the air pressure value measured by the air pressure sensor 180C to assist in positioning and navigation.
  • the magnetic sensor 180D includes a Hall sensor.
  • the electronic device 100 can detect the opening and closing of the flip holster using the magnetic sensor 180D.
  • the electronic device 100 can detect the opening and closing of the flip according to the magnetic sensor 180D. Further, according to the detected opening and closing state of the leather case or the opening and closing state of the flip cover, characteristics such as automatic unlocking of the flip cover are set.
  • the acceleration sensor 180E can detect the magnitude of the acceleration of the electronic device 100 in various directions (generally three axes).
  • the magnitude and direction of gravity can be detected when the electronic device 100 is stationary. It can also be used to identify the posture of electronic devices, and can be used in applications such as horizontal and vertical screen switching, pedometers, etc.
  • the electronic device 100 can measure the distance through infrared or laser. In some embodiments, when shooting a scene, the electronic device 100 can use the distance sensor 180F to measure the distance to achieve fast focusing.
  • Proximity light sensor 180G may include, for example, light emitting diodes (LEDs) and light detectors, such as photodiodes.
  • the light emitting diodes may be infrared light emitting diodes.
  • the electronic device 100 emits infrared light to the outside through the light emitting diode.
  • Electronic device 100 uses photodiodes to detect infrared reflected light from nearby objects. When sufficient reflected light is detected, it can be determined that there is an object near the electronic device 100 . When insufficient reflected light is detected, the electronic device 100 may determine that there is no object near the electronic device 100 .
  • the electronic device 100 can use the proximity light sensor 180G to detect that the user holds the electronic device 100 close to the ear to talk, so as to automatically turn off the screen to save power.
  • Proximity light sensor 180G can also be used in holster mode, pocket mode automatically unlocks and locks the screen.
  • the ambient light sensor 180L is used to sense ambient light brightness.
  • the fingerprint sensor 180H is used to collect fingerprints.
  • the temperature sensor 180J is used to detect the temperature.
  • the electronic device 100 uses the temperature detected by the temperature sensor 180J to execute a temperature processing strategy.
  • Touch sensor 180K also called “touch device”.
  • the touch sensor 180K may be disposed on the display screen 194 , and the touch sensor 180K and the display screen 194 form a touch screen, also called a “touch screen”.
  • the touch sensor 180K is used to detect a touch operation on or near it.
  • the touch sensor can pass the detected touch operation to the application processor to determine the type of touch event.
  • Visual output related to touch operations may be provided through display screen 194 .
  • the touch sensor 180K may also be disposed on the surface of the electronic device 100 , which is different from the location where the display screen 194 is located.
  • the bone conduction sensor 180M can acquire vibration signals.
  • the keys 190 include a power-on key, a volume key, and the like. Keys 190 may be mechanical keys. It can also be a touch key.
  • the electronic device 100 may receive key inputs and generate key signal inputs related to user settings and function control of the electronic device 100 .
  • Motor 191 can generate vibrating cues.
  • the indicator 192 can be an indicator light, which can be used to indicate the charging state, the change of the power, and can also be used to indicate a message, a missed call, a notification, and the like.
  • the SIM card interface 195 is used to connect a SIM card.
  • the audio input device 201/audio output device 200 may use the distance sensor 180F to determine the distance between the audio input device 201 and the audio output device 200.
  • the audio input device 201/audio output device 200 may use the wireless communication module 160 and the mobile communication module 150 to determine the distance between the audio input device 201 and the audio output device 200 the distance.
  • step S905 after determining the distance between the audio input device 201 and the audio output device 200, information such as temperature and air pressure in the current space can be obtained according to the air pressure sensor 180C, the temperature sensor 180J, etc. , and then more accurately determine the speed of sound, and then more accurately determine the propagation delay in the echo delay.
  • the audio input device may not have the receiver 170B; the audio output device may not have the microphone 170C.
  • the software system of the electronic device 100 may adopt a layered architecture, an event-driven architecture, a microkernel architecture, a microservice architecture, or a cloud architecture.
  • the embodiment of the present invention takes an Android system with a layered architecture as an example to illustrate the software structure of the electronic device 100 as an example.
  • FIG. 16 is a schematic diagram of an exemplary software structure of an electronic device 100 according to an embodiment of the present application.
  • the layered architecture divides the software into several layers, and each layer has a clear role and division of labor. Layers communicate with each other through software interfaces.
  • the Android system is divided into four layers, which are, from top to bottom, an application layer, an application framework layer, an Android runtime (Android runtime) and a system library, and a kernel layer.
  • the application layer can include a series of application packages.
  • the application package can include applications such as camera, gallery, calendar, call, map, navigation, WLAN, Bluetooth, music, video, short message and so on.
  • Application bundles can also include meeting/calling applications such as WeLink.
  • the application framework layer provides an application programming interface (application programming interface, API) and a programming framework for applications in the application layer.
  • the application framework layer includes some predefined functions.
  • the function may be a method for determining the distance between the audio input device 201 and the audio output device 200; or, the function may be a method for establishing a three-dimensional space coordinate system; or, the function may be to send motion information, position to other devices information or distance information, etc.
  • the application framework layer may include window managers, content providers, view systems, telephony managers, resource managers, notification managers, and the like.
  • a window manager is used to manage window programs.
  • the window manager can get the size of the display screen, determine whether there is a status bar, lock the screen, take screenshots, etc.
  • Content providers are used to store and retrieve data and make these data accessible to applications.
  • the data may include video, images, audio, calls made and received, browsing history and bookmarks, phone book, etc.
  • the view system includes visual controls, such as controls for displaying text, controls for displaying pictures, and so on. View systems can be used to build applications.
  • a display interface can consist of one or more views.
  • the display interface including the short message notification icon may include a view for displaying text and a view for displaying pictures, and may include displaying an interface as shown in FIG. 7 .
  • the phone manager is used to provide the communication function of the electronic device 100 .
  • the management of call status including connecting, hanging up, etc.).
  • the resource manager provides various resources for the application, such as localization strings, icons, pictures, layout files, video files and so on.
  • the notification manager enables applications to display notification information in the status bar, which can be used to convey notification-type messages, and can disappear automatically after a brief pause without user interaction. For example, the notification manager is used to notify download completion, message reminders, etc.
  • the notification manager can also display notifications in the status bar at the top of the system in the form of graphs or scroll bar text, such as notifications of applications running in the background, and notifications on the screen in the form of dialog windows. For example, text information is prompted in the status bar, a prompt sound is issued, the electronic device vibrates, and the indicator light flashes.
  • Android Runtime includes core libraries and a virtual machine. Android runtime is responsible for scheduling and management of the Android system.
  • the core library consists of two parts: one is the function functions that the java language needs to call, and the other is the core library of Android.
  • the application layer and the application framework layer run in virtual machines.
  • the virtual machine executes the java files of the application layer and the application framework layer as binary files.
  • the virtual machine is used to perform functions such as object lifecycle management, stack management, thread management, safety and exception management, and garbage collection.
  • a system library can include multiple functional modules. For example: surface manager (surface manager), media library (Media Libraries), 3D graphics processing library (eg: OpenGL ES), 2D graphics engine (eg: SGL), etc.
  • surface manager surface manager
  • media library Media Libraries
  • 3D graphics processing library eg: OpenGL ES
  • 2D graphics engine eg: SGL
  • the Surface Manager is used to manage the display subsystem and provides a fusion of 2D and 3D layers for multiple applications.
  • the media library supports playback and recording of a variety of commonly used audio and video formats, as well as still image files.
  • the media library can support a variety of audio and video encoding formats, such as: MPEG4, H.264, MP3, AAC, AMR, JPG, PNG, etc.
  • the 3D graphics processing library is used to implement 3D graphics drawing, image rendering, compositing, and layer processing.
  • 2D graphics engine is a drawing engine for 2D drawing.
  • the kernel layer is the layer between hardware and software.
  • the kernel layer contains at least display drivers, camera drivers, audio drivers, and sensor drivers.
  • the application can call the driver of the kernel layer through the framework layer of the application, for example, call the audio driver to play signaling, call the sensor driver to determine the distance from other devices, etc., call the sensor driver to determine its own position, etc.
  • the term “when” may be interpreted to mean “if” or “after” or “in response to determining" or “in response to detecting" depending on the context.
  • the phrases “in determining" or “if detecting (the stated condition or event)” can be interpreted to mean “if determining" or “in response to determining" or “on detecting (the stated condition or event)” or “in response to the detection of (the stated condition or event)”.
  • the above-mentioned embodiments it may be implemented in whole or in part by software, hardware, firmware or any combination thereof.
  • software it can be implemented in whole or in part in the form of a computer program product.
  • the computer program product includes one or more computer instructions. When the computer program instructions are loaded and executed on a computer, all or part of the processes or functions according to the embodiments of the present application are generated.
  • the computer may be a general purpose computer, a special purpose computer, a computer network, or other programmable device.
  • the computer instructions may be stored on or transmitted from one computer readable storage medium to another computer readable storage medium, for example, the computer instructions may be transmitted over a wire from a website site, computer, server or data center (eg coaxial cable, optical fiber, digital subscriber line) or wireless (eg infrared, wireless, microwave, etc.) to another website site, computer, server or data center.
  • the computer-readable storage medium can be any available medium that can be accessed by a computer or a data storage device such as a server, data center, etc. that includes an integration of one or more available media.
  • the available media may be magnetic media (eg, floppy disks, hard disks, magnetic tapes), optical media (eg, DVDs), or semiconductor media (eg, solid state drives), and the like.
  • the process can be completed by instructing the relevant hardware by a computer program, and the program can be stored in a computer-readable storage medium.
  • the program When the program is executed , which may include the processes of the foregoing method embodiments.
  • the aforementioned storage medium includes: ROM or random storage memory RAM, magnetic disk or optical disk and other mediums that can store program codes.

Landscapes

  • Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Computational Linguistics (AREA)
  • Quality & Reliability (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Cable Transmission Systems, Equalization Of Radio And Reduction Of Echo (AREA)

Abstract

A method for estimating an echo delay between distributed devices, and an electronic device, a system, a computer program product and a readable storage medium. The method comprises: firstly, a device determining the distance between devices, and determining a propagation delay between different devices; secondly, determining a processing delay between the different devices by means of an audio signal; and finally, determining an echo delay between the different devices on the basis of the propagation delay and the processing delay, and then performing echo cancellation.

Description

分布式设备回声时延估计方法及电子设备Distributed equipment echo delay estimation method and electronic equipment
本申请要求于2021年04月17日提交中国专利局、申请号为202110415110.4、申请名称为“分布式设备回声时延估计方法及电子设备”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of the Chinese patent application filed on April 17, 2021 with the application number 202110415110.4 and titled "Echo Delay Estimation Method for Distributed Equipment and Electronic Equipment", the entire contents of which are incorporated by reference in this application.
技术领域technical field
本申请涉及电子技术领域,尤其涉及分布式设备回声时延估计方法及电子设备。The present application relates to the field of electronic technologies, and in particular, to a method for estimating an echo delay of a distributed device and an electronic device.
背景技术Background technique
随着智能设备和基础通信建设的普及,用户可以通过网络与远方的朋友进行直接的语音、视频交流,极大的拉近了人与人之间的交互距离。在网络通话、视频会议等场景中,远端的用户的声音通过网络传输到近端的设备后,通过近端的音频输出设备播放远端说话者的声音,使得近端的用户能够直接听到远端用户的声音。With the popularization of smart devices and basic communication construction, users can communicate with distant friends directly through the Internet, which greatly shortens the interaction distance between people. In scenarios such as network calls and video conferences, after the voice of the far-end user is transmitted to the near-end device through the network, the voice of the far-end speaker is played through the near-end audio output device, so that the near-end user can directly hear the voice of the far-end speaker. The voice of the far end user.
但是,在网络通话、视频会议场景中,由于音频输出设备和音频输入设备均一直处在工作状态,音频输出设备播放的远端用户的声音会被音频输入设备获取并通过网络将远端用户的声音传输给远端的用户。在该情况下,用户不仅会听到自己刚刚说话的声音,并且回声的干扰可能越来越大,产生啸叫,极大的影响了用户的体验。However, in the scenarios of network calls and video conferences, since the audio output device and the audio input device are always working, the voice of the remote user played by the audio output device will be acquired by the audio input device and the remote user's voice will be transmitted through the network. The sound is transmitted to the remote user. In this case, the user will not only hear the voice he has just spoken, but also the interference of the echo may become larger and larger, resulting in howling, which greatly affects the user's experience.
为了消除回声的影响,一种可行的处理方法为:通过固定音频输出设备和音频输入设备的相对位置,人工测量声音在音频输出设备与音频输入设备之间的传播时延,并结合音频输出设备和音频输入设备的硬件、软件特点确定回声时延。在确定回声时延之后,手动的为音频输出设备上的回声消除模块配置回声时延参数,使得回声消除模块能够正常工作,避免了音频输出设备播放回声。In order to eliminate the influence of echo, a feasible processing method is: by fixing the relative position of the audio output device and the audio input device, manually measure the propagation delay of the sound between the audio output device and the audio input device, and combine the audio output device with the audio output device. The echo delay is determined by the hardware and software characteristics of the audio input device. After determining the echo delay, manually configure the echo delay parameters for the echo cancellation module on the audio output device, so that the echo cancellation module can work normally and avoid the audio output device from playing echoes.
但是,该方法只能解决音频输出设备与音频输入设备相对不变情况下的回声问题,当用户移动音频输入设备/音频输入设备后,已经被配置给回声消除模块的回声时延参数不再有效,电子设备不能有效滤除回声。However, this method can only solve the echo problem when the audio output device and the audio input device are relatively unchanged. When the user moves the audio input device/audio input device, the echo delay parameter that has been configured to the echo cancellation module is no longer valid. , the electronic device cannot effectively filter out the echo.
发明内容SUMMARY OF THE INVENTION
本申请实施例提供了一种分布式设备回声时延估计方法,该方法包括:音频输出设备建立空间坐标系,并通过网络连接不断接收多个音频输入设备传递的位置信息,并且基于不同音频输入设备的位置信息不断更新不同音频输入设备的回声时延,进而消除回声。The embodiment of the present application provides a method for estimating echo delay of distributed devices. The method includes: an audio output device establishes a spatial coordinate system, continuously receives location information transmitted by multiple audio input devices through a network connection, and based on different audio input devices The location information of the device continuously updates the echo delay of different audio input devices, thereby eliminating the echo.
第一方面,在第一时刻,第一电子设备确定第一距离,该第一距离为在第一时刻时该第一电子设备与第二电子设备之间的距离;该第一电子设备基于该第一距离和声速确定第一传播时延,该第一传播时延为在第一时刻时,声音信号从该第一电子设备传播到第二电子设备之间的时延;该第一电子设备确定该第一传播时延为第一回声时延,或者,该第一电子设备确定该第一传播时延与第一处理时延的和为该第一回声时延,该第一处理时延为该第一电子设备从获取语音信号到播放语音信号的时延与第二电子设备获取语音信号到第二电子设备转发语音信号至第一电子设备上时延的和。In the first aspect, at the first moment, the first electronic device determines a first distance, where the first distance is the distance between the first electronic device and the second electronic device at the first moment; the first electronic device is based on the The first distance and the speed of sound determine a first propagation delay, where the first propagation delay is the time delay between the propagation of the sound signal from the first electronic device to the second electronic device at the first moment; the first electronic device Determining that the first propagation delay is the first echo delay, or the first electronic device determines that the sum of the first propagation delay and the first processing delay is the first echo delay, and the first processing delay The sum of the delay from acquiring the voice signal to playing the voice signal by the first electronic device and the delay of acquiring the voice signal by the second electronic device and forwarding the voice signal to the first electronic device by the second electronic device.
在上述实施例中,第一电子设备通过确定第一电子设备和第二电子设备之间的距离,进而确定第一电子设备与第二电子设备之间的传播时延,再进一步基于传播时延确定回声时延。该方法首先使得电子设备本身可以随意移动,不影响回声消除的效果,方便了用户的体验;且不需要人工测量回声时延,用户可以随时将自己的电子设备参与到网络视频、会议、通话 中,而无需担忧回声的影响。In the above embodiment, the first electronic device determines the propagation delay between the first electronic device and the second electronic device by determining the distance between the first electronic device and the second electronic device, and further based on the propagation delay Determines the echo delay. The method first allows the electronic device itself to move at will, without affecting the effect of echo cancellation, which facilitates the user's experience; and does not require manual measurement of echo delay, users can participate in online video, conferences, and calls at any time with their electronic devices. , without worrying about echo effects.
结合第一方面的实施例,在一些实施例中,在第一时刻后的第二时刻,该电子设备确定第二距离,该第二距离为在第二时刻时该第一电子设备与第二电子设备之间的距离;该第一电子设备基于该第二距离和声速确定第二传播时延,该第二传播时延为在第二时刻时,声音信号从该第一电子设备传输到第二电子设备之间的时延;该第一电子设备确定该第二传播时延为第二回声时延,或者该第一电子设备确定该第二传播时延与第一处理时延的和为该第二回声时延;该第二回声时延与该第一回声时延不同。In combination with the embodiments of the first aspect, in some embodiments, at a second time after the first time, the electronic device determines a second distance, where the second distance is the distance between the first electronic device and the second time at the second time The distance between the electronic devices; the first electronic device determines a second propagation delay based on the second distance and the speed of sound, and the second propagation delay is when the sound signal is transmitted from the first electronic device to the first electronic device at the second moment. The delay between two electronic devices; the first electronic device determines that the second propagation delay is the second echo delay, or the first electronic device determines that the sum of the second propagation delay and the first processing delay is the second echo delay; the second echo delay is different from the first echo delay.
在上述实施例中,第一电子设备不断更新第二电子设备与第一电子设备之间的距离,进而不断更新回声时延,使得用户可以随意移动第一电子设备/第二电子设备,提高了用户的体验。In the above embodiment, the first electronic device continuously updates the distance between the second electronic device and the first electronic device, and further updates the echo delay, so that the user can move the first electronic device/second electronic device at will, which improves the user experience.
结合第一方面的实施例,在一些实施例中,在第一时刻,第一电子设备确定第一距离前,还包括:在第一时刻前的第三时刻,该第一电子设备确定该第一处理时延。With reference to the embodiments of the first aspect, in some embodiments, before the first electronic device determines the first distance at the first moment, the method further includes: at a third moment before the first moment, the first electronic device determines the first distance. A processing delay.
在上述实施例中,考虑到在一部分情况中,回声时延中处理时延不能忽略,而不同的电子设备对应不同的处理时延,所以第一电子设备通过确定第一处理时延,确定第二电子设备与第一电子设备之间的回声时延来对第二电子设备有关的回声进行回声消除。In the above embodiment, considering that in some cases, the processing delay in the echo delay cannot be ignored, and different electronic devices correspond to different processing delays, the first electronic device determines the first processing delay by determining the first processing delay. The echo delay between the second electronic device and the first electronic device is used to perform echo cancellation on the echo related to the second electronic device.
结合第一方面的实施例,在一些实施例中,在第一时刻前的第三时刻,该第一电子设备确定该第一处理时延,具体包括:在第一时刻前的第三时刻,当该第一电子设备与第二电子设备的距离小于距离阈值时,该第一电子设备播放第一音频;该第一电子设备通过无线网络/近距离通信服务接收该第二电子设备发送的该第一音频;该第一电子设备确定播放第一音频和接收该第一音频之间的时间差为该第一处理时延。With reference to the embodiments of the first aspect, in some embodiments, the first electronic device determines the first processing delay at a third time before the first time, specifically including: at a third time before the first time, When the distance between the first electronic device and the second electronic device is less than the distance threshold, the first electronic device plays the first audio; the first audio; the first electronic device determines that the time difference between playing the first audio and receiving the first audio is the first processing delay.
在上述实施例中,用户可以将第二电子设备靠近第一电子设备,通过计算音频信号在第二电子设备与第一电子设备之间流传的时间差,确定第一处理时延,使得第一电子设备可以确定不同类型的电子设备导致的不同的处理时延,进而确定回声时延。In the above embodiment, the user can bring the second electronic device close to the first electronic device, and determine the first processing delay by calculating the time difference between the audio signal passing between the second electronic device and the first electronic device, so that the first electronic device The device can determine the different processing delays caused by different types of electronic devices, and thus determine the echo delays.
结合第一方面的实施例,在一些实施例中,在第一时刻前的第三时刻,该第一电子设备确定第一处理时延,具体包括:在第一时刻前的第三时刻,响应于用户的输入,该第一电子设备播放第一音频;该第一电子设备通过无线网络/近距离通信服务接收该第二电子设备发送的该第一音频;该第一电子设备确定播放第一音频和接收该第一音频之间的时间差为该第一处理时延。With reference to the embodiments of the first aspect, in some embodiments, at a third time before the first time, the first electronic device determines the first processing delay, which specifically includes: at a third time before the first time, responding to According to the user's input, the first electronic device plays the first audio; the first electronic device receives the first audio sent by the second electronic device through the wireless network/near field communication service; the first electronic device determines to play the first audio The time difference between the audio and receiving the first audio is the first processing delay.
在上述实施例中,用户可以将第二电子设备靠近第一电子设备,通过计算音频信号在第二电子设备与第一电子设备之间流传的时间差,确定第一处理时延,使得第一电子设备可以确定不同类型的电子设备导致的不同的处理时延,进而确定回声时延。In the above embodiment, the user can bring the second electronic device close to the first electronic device, and determine the first processing delay by calculating the time difference between the audio signal passing between the second electronic device and the first electronic device, so that the first electronic device The device can determine the different processing delays caused by different types of electronic devices, and thus determine the echo delays.
结合第一方面的实施例,在一些实施例中,该第一电子设备确定第一距离,具体包括:该第一电子设备接收第一运动信息,该第一运动信息包括该第二电子设备的运动状态;该第一电子设备基于该第一运动信息确定该第一距离。With reference to the embodiments of the first aspect, in some embodiments, determining the first distance by the first electronic device specifically includes: the first electronic device receives first motion information, where the first motion information includes the distance of the second electronic device. motion state; the first electronic device determines the first distance based on the first motion information.
在上述实施例中,第一电子设备接收第二电子设备的运动信息如X轴、Y轴、Z轴方向上的加速度、速度、姿态角等参数,有助于更精准的确定第一电子设备与第二电子设备之间的距离。In the above-mentioned embodiment, the first electronic device receives the motion information of the second electronic device, such as the acceleration, speed, attitude angle and other parameters in the X-axis, Y-axis, and Z-axis directions, which helps to determine the first electronic device more accurately distance from the second electronic device.
结合第一方面的实施例,在一些实施例中,该第一电子设备确定第一距离,具体包括:该第一电子设备接收第一位置信息,该第一位置信息包括该第二电子设备的位置;该第一电子设备基于该第一位置信息确定该第一距离。With reference to the embodiments of the first aspect, in some embodiments, determining the first distance by the first electronic device specifically includes: the first electronic device receives first location information, where the first location information includes a distance of the second electronic device. location; the first electronic device determines the first distance based on the first location information.
在上述实施例中,第一电子设备接收第二电子设备的位置信息,进而可以确定第一电子 设备与第二电子设备之间的距离,有助于更精确的确定第一电子设备与第二电子设备之间的距离。In the above-mentioned embodiment, the first electronic device receives the position information of the second electronic device, and then the distance between the first electronic device and the second electronic device can be determined, which is helpful for more accurate determination of the first electronic device and the second electronic device. distance between electronic devices.
结合第一方面的实施例,在一些实施例中,该第一电子设备确定第一距离,具体包括:该第一电子设备接收第一距离信息,该第一距离信息包括该第一距离;该第一电子设备基于该第一距离信息确定该第一距离。With reference to the embodiments of the first aspect, in some embodiments, determining the first distance by the first electronic device specifically includes: the first electronic device receiving first distance information, where the first distance information includes the first distance; the The first electronic device determines the first distance based on the first distance information.
在上述实施例中,第二电子设备确定与第一电子设备的距离后,将该距离告知第一电子设备,降低第一电子设备的计算负担。In the above embodiment, after the second electronic device determines the distance from the first electronic device, it informs the first electronic device of the distance, thereby reducing the computational burden of the first electronic device.
结合第一方面的实施例,在一些实施例中,在第四时刻,第一电子设备确定第二处理时延,该第二处理时延为该第一电子设备从获取语音信号到播放语音信号的时延与第三电子设备获取语音信号到第三电子设备转发语音信号至第一电子设备上时延的和;在第四时刻后的第五时刻,第一电子设备确定第三距离,该第三距离为在第五时刻时该第一电子设备与第三电子设备之间的距离;该第一电子设备基于该第三距离和声速确定第三传播时延,该第三传播时延为在第五时刻时,声音信号从该第一电子设备传播到第三电子设备之间的时延。该第一电子设备确定该第三传播时延与该第二处理时延的和为第三回声时延。With reference to the embodiments of the first aspect, in some embodiments, at the fourth moment, the first electronic device determines a second processing delay, where the second processing delay is from acquiring the voice signal to playing the voice signal by the first electronic device The sum of the delay time and the delay time of the third electronic device acquiring the voice signal to the third electronic device and forwarding the voice signal to the first electronic device; at the fifth time after the fourth time, the first electronic device determines the third distance, the The third distance is the distance between the first electronic device and the third electronic device at the fifth moment; the first electronic device determines a third propagation delay based on the third distance and the speed of sound, and the third propagation delay is At the fifth moment, the time delay between the sound signal propagating from the first electronic device to the third electronic device. The first electronic device determines that the sum of the third propagation delay and the second processing delay is a third echo delay.
在上述实施例中,电子设备可以确定多个设备的回声时延,并对与不同设备相关的回声采用对应的回声时延进行消除,提升了用户的体验。In the above embodiment, the electronic device can determine the echo delays of multiple devices, and use the corresponding echo delays to cancel the echoes related to different devices, which improves the user experience.
第二方面,在第一时刻前的第三时刻,第一电子设备确定第一处理时延。在第一时刻,该第一电子设备确定第一距离,该第一距离为在第一时刻时该第一电子设备与第二电子设备之间的距离;该第一电子设备基于该第一距离和声速确定第一传播时延,该第一传播时延为在第一时刻时,声音信号从该第一电子设备传播到第二电子设备之间的时延;该第一电子设备确定该第一传播时延为第一回声时延,或者,该第一电子设备确定该第一传播时延与该第一处理时延的和为该第一回声时延,该第一处理时延为该第一电子设备从获取语音信号到播放语音信号的时延与第二电子设备获取语音信号到第二电子设备转发语音信号至第一电子设备上时延的和。In a second aspect, at a third time point before the first time point, the first electronic device determines the first processing delay. At the first moment, the first electronic device determines a first distance, where the first distance is the distance between the first electronic device and the second electronic device at the first moment; the first electronic device is based on the first distance and the speed of sound to determine a first propagation delay, where the first propagation delay is the delay between the propagation of the sound signal from the first electronic device to the second electronic device at the first moment; the first electronic device determines the first propagation delay A propagation delay is the first echo delay, or the first electronic device determines that the sum of the first propagation delay and the first processing delay is the first echo delay, and the first processing delay is the The sum of the delay from acquiring the voice signal to playing the voice signal by the first electronic device and the delay of the second electronic device acquiring the voice signal to the second electronic device forwarding the voice signal to the first electronic device.
在上述实施例中,第一电子设备通过确定第一电子设备和第二电子设备之间的距离,进而确定第一电子设备与第二电子设备之间的传播时延,再进一步基于传播时延确定回声时延。该方法首先使得电子设备本身可以随意移动,不影响回声消除的效果,方便了用户的体验;且不需要人工测量回声时延,用户可以随时将自己的电子设备参与到网络视频、会议、通话中,而无需担忧回声的影响。In the above embodiment, the first electronic device determines the propagation delay between the first electronic device and the second electronic device by determining the distance between the first electronic device and the second electronic device, and further based on the propagation delay Determines the echo delay. The method first allows the electronic device itself to move at will, without affecting the effect of echo cancellation, which facilitates the user's experience; and does not require manual measurement of echo delay, users can participate in online video, conferences, and calls at any time with their electronic devices. , without worrying about echo effects.
结合第二方面的一些实施例,在一些实施例中,在第一时刻前的第三时刻,该第一电子设备确定第一处理时延,具体包括:当该第一电子设备与第二电子设备的距离小于距离阈值时,该第一电子设备播放第一音频;该第二电子设备采集该第一音频;该第二电子设备通过无线网络/近距离通信服务向该第一电子设备发送该第一音频;该第一电子设备确定播放第一音频和接收该第一音频之间的时间差为该第一处理时延。With reference to some embodiments of the second aspect, in some embodiments, at a third time point before the first time point, the first electronic device determines the first processing delay, which specifically includes: when the first electronic device and the second electronic device communicate with each other. When the distance of the device is less than the distance threshold, the first electronic device plays the first audio; the second electronic device collects the first audio; the second electronic device sends the first electronic device through the wireless network/near field communication service. the first audio; the first electronic device determines that the time difference between playing the first audio and receiving the first audio is the first processing delay.
在上述实施例中,用户可以将第二电子设备靠近第一电子设备,通过计算音频信号在第二电子设备与第一电子设备之间流传的时间差,确定第一处理时延,使得第一电子设备可以确定不同类型的电子设备导致的不同的处理时延,进而确定回声时延。In the above embodiment, the user can bring the second electronic device close to the first electronic device, and determine the first processing delay by calculating the time difference between the audio signal passing between the second electronic device and the first electronic device, so that the first electronic device The device can determine the different processing delays caused by different types of electronic devices, and thus determine the echo delays.
结合第二方面的一些实施例,在一些实施例中,当该第一电子设备与第二电子设备的距离小于距离阈值时,该第一电子设备播放信令,具体包括:在第一时刻前的第三时刻,响应于用户的输入,该第一电子设备播放第一音频;该第二电子设备采集该第一音频;该第二电 子设备通过无线网络/近距离通信服务向该第一电子设备发送该第一音频;该第一电子设备确定播放第一音频和接收该第一音频之间的时间差为该第一处理时延。With reference to some embodiments of the second aspect, in some embodiments, when the distance between the first electronic device and the second electronic device is less than a distance threshold, the first electronic device plays signaling, which specifically includes: before the first moment At the third moment, in response to the user's input, the first electronic device plays the first audio; the second electronic device collects the first audio; the second electronic device sends the first electronic The device sends the first audio; the first electronic device determines that the time difference between playing the first audio and receiving the first audio is the first processing delay.
在上述实施例中,用户可以将第二电子设备靠近第一电子设备,通过计算音频信号在第二电子设备与第一电子设备之间流传的时间差,确定第一处理时延,使得第一电子设备可以确定不同类型的电子设备导致的不同的处理时延,进而确定回声时延。In the above embodiment, the user can bring the second electronic device close to the first electronic device, and determine the first processing delay by calculating the time difference between the audio signal passing between the second electronic device and the first electronic device, so that the first electronic device The device can determine the different processing delays caused by different types of electronic devices, and thus determine the echo delays.
结合第二方面的一些实施例,在一些实施例中,该第一电子设备确定第一距离,具体包括:该第二电子设备确定第一运动信息,该第一运动信息包括该第二电子设备的运动状态;该第二电子设备向该第一电子设备发送运动信息;该第一电子设备接收该运动信息,该第一电子设备基于该第一运动信息确定该第一距离。With reference to some embodiments of the second aspect, in some embodiments, the first electronic device determining the first distance specifically includes: the second electronic device determining first motion information, where the first motion information includes the second electronic device The second electronic device sends motion information to the first electronic device; the first electronic device receives the motion information, and the first electronic device determines the first distance based on the first motion information.
在上述实施例中,第二电子设备记录自己的运动信息如X轴、Y轴、Z轴方向上的加速度、速度、姿态角等参数,并将运动信息发送给第一电子设备,有助于更精准的确定第一电子设备与第二电子设备之间的距离。In the above-mentioned embodiment, the second electronic device records its own motion information such as acceleration, speed, attitude angle and other parameters in the directions of the X-axis, Y-axis, and Z-axis, and sends the motion information to the first electronic device, which is helpful for The distance between the first electronic device and the second electronic device is more accurately determined.
结合第二方面的一些实施例,在一些实施例中,该第一电子设备确定第一距离,具体包括:该第二电子设备确定第一位置信息,该第一位置信息包括该第二电子设备的位置;该第二电子设备向该第一电子设备发送该第一位置信息;该第一电子设备接收该第一位置信息,该第一电子设备基于该第一位置信息确定该第一距离。With reference to some embodiments of the second aspect, in some embodiments, the first electronic device determining the first distance specifically includes: the second electronic device determining first location information, where the first location information includes the second electronic device The second electronic device sends the first position information to the first electronic device; the first electronic device receives the first position information, and the first electronic device determines the first distance based on the first position information.
在上述实施例中,第二电子设备基于运动信息可以确定自己的位置,并将包括自己的位置的位置信息发送给第一电子设备,进而可以确定第一电子设备与第二电子设备之间的距离,有助于更精确的确定第一电子设备与第二电子设备之间的距离。In the above embodiment, the second electronic device can determine its own position based on the motion information, and send the position information including its own position to the first electronic device, so as to determine the distance between the first electronic device and the second electronic device. The distance helps to more accurately determine the distance between the first electronic device and the second electronic device.
结合第二方面的一些实施例,在一些实施例中,该第一电子设备确定第一距离,具体包括:该第二电子设备确定第一距离信息,该第一距离信息包括该第一距离;该第二电子设备向该第二电子设备发送该第一距离信息;该第一电子设备接收该第一距离信息,该第一电子设备基于该第一位置信息确定该第一距离。With reference to some embodiments of the second aspect, in some embodiments, the first electronic device determining the first distance specifically includes: the second electronic device determining first distance information, where the first distance information includes the first distance; The second electronic device sends the first distance information to the second electronic device; the first electronic device receives the first distance information, and the first electronic device determines the first distance based on the first position information.
在上述实施例中,第二电子设备确定与第一电子设备的距离后,将该距离告知第一电子设备,降低第一电子设备的计算负担。In the above embodiment, after the second electronic device determines the distance from the first electronic device, it informs the first electronic device of the distance, thereby reducing the computational burden of the first electronic device.
第三方面,本申请实施例提供了一种电子设备,该电子设备包括:一个或多个处理器和存储器;该存储器与该一个或多个处理器耦合,该存储器用于存储计算机程序代码,该计算机程序代码包括计算机指令,该一个或多个处理器调用该计算机指令以使得该电子设备执行:In a third aspect, an embodiment of the present application provides an electronic device, the electronic device includes: one or more processors and a memory; the memory is coupled to the one or more processors, and the memory is used to store computer program codes, The computer program code includes computer instructions invoked by the one or more processors to cause the electronic device to perform:
在第一时刻,第一电子设备确定第一距离,该第一距离为在第一时刻时该第一电子设备与第二电子设备之间的距离;该第一电子设备基于该第一距离和声速确定第一传播时延,该第一传播时延为在第一时刻时,声音信号从该第一电子设备传播到第二电子设备之间的时延;该第一电子设备确定该第一传播时延为第一回声时延,或者,该第一电子设备确定该第一传播时延与第一处理时延的和为该第一回声时延,该第一处理时延为该第一电子设备从获取语音信号到播放语音信号的时延与第二电子设备获取语音信号到第二电子设备转发语音信号至第一电子设备上时延的和。At the first moment, the first electronic device determines a first distance, where the first distance is the distance between the first electronic device and the second electronic device at the first moment; the first electronic device is based on the first distance and The speed of sound determines a first propagation delay, where the first propagation delay is the delay between the propagation of the sound signal from the first electronic device to the second electronic device at the first moment; the first electronic device determines the first propagation delay The propagation delay is the first echo delay, or the first electronic device determines that the sum of the first propagation delay and the first processing delay is the first echo delay, and the first processing delay is the first echo delay The sum of the delay from acquiring the voice signal to playing the voice signal by the electronic device and the delay of the second electronic device acquiring the voice signal to the second electronic device forwarding the voice signal to the first electronic device.
结合第三方面的一些实施例,在一些实施例中,该一个或多个处理器,还用于调用该计算机指令以使得该电子设备执行:在第一时刻后的第二时刻,该电子设备确定第二距离,该第二距离为在第二时刻时该第一电子设备与第二电子设备之间的距离;该第一电子设备基于该第二距离和声速确定第二传播时延,该第二传播时延为在第二时刻时,声音信号从该第一电子设备传输到第二电子设备之间的时延;该第一电子设备确定该第二传播时延为第二回声 时延,或者该第一电子设备确定该第二传播时延与第一处理时延的和为该第二回声时延;该第二回声时延与该第一回声时延不同。With reference to some embodiments of the third aspect, in some embodiments, the one or more processors are further configured to invoke the computer instructions to cause the electronic device to execute: at a second time after the first time, the electronic device determining a second distance, where the second distance is the distance between the first electronic device and the second electronic device at the second moment; the first electronic device determines a second propagation delay based on the second distance and the speed of sound, the The second propagation delay is the delay between the transmission of the sound signal from the first electronic device to the second electronic device at the second moment; the first electronic device determines that the second propagation delay is the second echo delay , or the first electronic device determines that the sum of the second propagation delay and the first processing delay is the second echo delay; the second echo delay is different from the first echo delay.
结合第三方面的一些实施例,在一些实施例中,该一个或多个处理器,还用于调用该计算机指令以使得该电子设备执行:在第一时刻前的第三时刻,该第一电子设备确定该第一处理时延。With reference to some embodiments of the third aspect, in some embodiments, the one or more processors are further configured to invoke the computer instructions to cause the electronic device to execute: at a third moment before the first moment, the first The electronic device determines the first processing delay.
结合第三方面的一些实施例,在一些实施例中,该一个或多个处理器,具体用于调用该计算机指令以使得该电子设备执行:在第一时刻前的第三时刻,当该第一电子设备与第二电子设备的距离小于距离阈值时,该第一电子设备播放第一音频;该第一电子设备通过无线网络/近距离通信服务接收该第二电子设备发送的该第一音频;该第一电子设备确定播放第一音频和接收该第一音频之间的时间差为该第一处理时延。With reference to some embodiments of the third aspect, in some embodiments, the one or more processors are specifically configured to invoke the computer instructions to cause the electronic device to execute: at a third moment before the first moment, when the first moment When the distance between an electronic device and a second electronic device is less than the distance threshold, the first electronic device plays the first audio; the first electronic device receives the first audio sent by the second electronic device through a wireless network/near field communication service ; the first electronic device determines that the time difference between playing the first audio and receiving the first audio is the first processing delay.
结合第三方面的一些实施例,在一些实施例中,该一个或多个处理器,具体用于调用该计算机指令以使得该电子设备执行:在第一时刻前的第三时刻,响应于用户的输入,该第一电子设备播放第一音频;该第一电子设备通过无线网络/近距离通信服务接收该第二电子设备发送的该第一音频;该第一电子设备确定播放第一音频和接收该第一音频之间的时间差为该第一处理时延。With reference to some embodiments of the third aspect, in some embodiments, the one or more processors are specifically configured to invoke the computer instructions to cause the electronic device to execute: at a third time point before the first time point, in response to a user input, the first electronic device plays the first audio; the first electronic device receives the first audio sent by the second electronic device through the wireless network/near field communication service; the first electronic device determines to play the first audio and The time difference between receiving the first audio is the first processing delay.
结合第三方面的一些实施例,在一些实施例中,该一个或多个处理器,具体用于调用该计算机指令以使得该电子设备执行:该第一电子设备接收第一运动信息,该第一运动信息包括该第二电子设备的运动状态;该第一电子设备基于该第一运动信息确定该第一距离。With reference to some embodiments of the third aspect, in some embodiments, the one or more processors are specifically configured to invoke the computer instructions to cause the electronic device to execute: the first electronic device receives first motion information, the first A motion information includes a motion state of the second electronic device; the first electronic device determines the first distance based on the first motion information.
结合第三方面的一些实施例,在一些实施例中,该一个或多个处理器,具体用于调用该计算机指令以使得该电子设备执行:该第一电子设备接收第一位置信息,该第一位置信息包括该第二电子设备的位置;该第一电子设备基于该第一位置信息确定该第一距离。With reference to some embodiments of the third aspect, in some embodiments, the one or more processors are specifically configured to invoke the computer instructions to cause the electronic device to execute: the first electronic device receives first location information, the first A location information includes the location of the second electronic device; the first electronic device determines the first distance based on the first location information.
结合第三方面的一些实施例,在一些实施例中,该一个或多个处理器,具体用于调用该计算机指令以使得该电子设备执行:该第一电子设备接收第一距离信息,该第一距离信息包括该第一距离;该第一电子设备基于该第一距离信息确定该第一距离。With reference to some embodiments of the third aspect, in some embodiments, the one or more processors are specifically configured to invoke the computer instructions to cause the electronic device to execute: the first electronic device receives the first distance information, the first A distance information includes the first distance; the first electronic device determines the first distance based on the first distance information.
结合第三方面的一些实施例,在一些实施例中,该一个或多个处理器,还用于调用该计算机指令以使得该电子设备执行:在第四时刻,第一电子设备确定第二处理时延,该第二处理时延为该第一电子设备从获取语音信号到播放语音信号的时延与第三电子设备获取语音信号到第三电子设备转发语音信号至第一电子设备上时延的和;With reference to some embodiments of the third aspect, in some embodiments, the one or more processors are further configured to invoke the computer instruction to cause the electronic device to execute: at the fourth moment, the first electronic device determines the second process Delay, the second processing delay is the delay from the acquisition of the voice signal to the playback of the voice signal by the first electronic device and the delay from the acquisition of the voice signal by the third electronic device to the transfer of the voice signal to the first electronic device by the third electronic device. the sum;
在第四时刻后的第五时刻,第一电子设备确定第三距离,该第三距离为在第五时刻时该第一电子设备与第三电子设备之间的距离;该第一电子设备基于该第三距离和声速确定第三传播时延,该第三传播时延为在第五时刻时,声音信号从该第一电子设备传播到第三电子设备之间的时延。该第一电子设备确定该第三传播时延与该第二处理时延的和为第三回声时延。At a fifth time after the fourth time, the first electronic device determines a third distance, where the third distance is the distance between the first electronic device and the third electronic device at the fifth time; the first electronic device is based on The third distance and the speed of sound determine a third propagation time delay, where the third propagation time delay is the time delay between the propagation of the sound signal from the first electronic device to the third electronic device at the fifth moment. The first electronic device determines that the sum of the third propagation delay and the second processing delay is a third echo delay.
第四方面,本申请实施例提供了一种芯片系统,该芯片系统应用于电子设备,该芯片系统包括一个或多个处理器,该处理器用于调用计算机指令以使得该电子设备执行如第一方面以及第一方面中任一可能的实现方式描述的方法。In a fourth aspect, an embodiment of the present application provides a chip system, the chip system is applied to an electronic device, the chip system includes one or more processors, and the processors are configured to invoke computer instructions to cause the electronic device to execute the first Aspects and methods described in any possible implementation of the first aspect.
第五方面,本申请实施例提供一种包含指令的计算机程序产品,当上述计算机程序产品在电子设备上运行时,使得该电子设备执行如第一方面以及第一方面中任一可能的实现方式描述的方法,或执行如第一方面以及第一方面中任一可能的实现方式描述的方法。In a fifth aspect, an embodiment of the present application provides a computer program product containing instructions, when the above computer program product is run on an electronic device, the electronic device is made to perform the first aspect and any possible implementation manner of the first aspect the described method, or perform the method described in the first aspect and any possible implementation manner of the first aspect.
第六方面,本申请实施例提供一种计算机可读存储介质,包括指令,当上述指令在电子设备上运行时,得该电子设备执行如第一方面以及第一方面中任一可能的实现方式描述的方法。In a sixth aspect, the embodiments of the present application provide a computer-readable storage medium, including instructions, when the above-mentioned instructions are run on an electronic device, the electronic device can execute the first aspect and any possible implementation manner of the first aspect. method described.
可以理解地,上述第三方面提供的电子设备、第四方面提供的芯片系统、第五方面提供的计算机程序产品以及第六方面提供的计算机存储介质均用于执行本申请实施例所提供的方法。因此,其所能达到的有益效果可参考对应方法中的有益效果,此处不再赘述。It can be understood that the electronic device provided in the third aspect, the chip system provided in the fourth aspect, the computer program product provided in the fifth aspect, and the computer storage medium provided in the sixth aspect are all used to execute the methods provided by the embodiments of the present application. . Therefore, for the beneficial effects that can be achieved, reference may be made to the beneficial effects in the corresponding method, which will not be repeated here.
附图说明Description of drawings
图1为本申请涉及的回声产生过程的一个示例性示意图;FIG. 1 is an exemplary schematic diagram of an echo generation process involved in the present application;
图2为本申请涉及的传播时延和处理时延的一个示例性示意图;FIG. 2 is an exemplary schematic diagram of propagation delay and processing delay involved in the application;
图3为本申请涉及的回声模块的一个示例性示意图;3 is an exemplary schematic diagram of an echo module involved in the present application;
图4为本申请涉及的回声消除方法使用场景的一个示例性示意图;FIG. 4 is an exemplary schematic diagram of a usage scenario of the echo cancellation method involved in the application;
图5为本申请涉及的回声消除方法使用场景的另一个示例性示意图;FIG. 5 is another exemplary schematic diagram of a usage scenario of the echo cancellation method involved in the present application;
图6为图5中音频输入设备的方向图的一个示例性示意图;FIG. 6 is an exemplary schematic diagram of the orientation diagram of the audio input device in FIG. 5;
图7为本申请涉及的会议场景的一个示例性示意图;FIG. 7 is an exemplary schematic diagram of a conference scene involved in the application;
图8为图7所示场景下回声消除数据流程的一个示例性示意图;FIG. 8 is an exemplary schematic diagram of an echo cancellation data flow in the scenario shown in FIG. 7;
图9为本申请实施例提供的回声时延估计方法的一个示例性示意图;FIG. 9 is an exemplary schematic diagram of an echo delay estimation method provided by an embodiment of the present application;
图10A至图10C为本申请实施例提供的两种位置校准方法的示例性示意图;10A to 10C are exemplary schematic diagrams of two position calibration methods provided by the embodiments of the present application;
图11A至图11B为本申请实施例提供的确定回声时延过程的一个示例性示意图;FIG. 11A to FIG. 11B are an exemplary schematic diagram of a process of determining an echo delay provided by an embodiment of the present application;
图12为本申请实施例提供的音频输出设备200建立三维空间坐标系的一个示例性示意图;FIG. 12 is an exemplary schematic diagram of establishing a three-dimensional space coordinate system by the audio output device 200 provided by the embodiment of the present application;
图13为本申请实施例提供的单设备下的回声时延估计方法的一个示例性示意图;13 is an exemplary schematic diagram of an echo delay estimation method under a single device provided by an embodiment of the present application;
图14为本申请实施例提供的多设备下的回声时延估计方法的一个示例性示意图;14 is an exemplary schematic diagram of an echo delay estimation method under multiple devices provided by an embodiment of the present application;
图15为本申请实施例提供的电子设备100的一个示例性硬件结构示意图;FIG. 15 is a schematic diagram of an exemplary hardware structure of an electronic device 100 provided by an embodiment of the present application;
图16为本申请实施例提供的电子设备100的一个示例性软件结构示意图。FIG. 16 is a schematic diagram of an exemplary software structure of an electronic device 100 according to an embodiment of the present application.
具体实施方式Detailed ways
本申请以下实施例中所使用的术语只是为了描述特定实施例的目的,而并非旨在作为对本申请的限制。如在本申请的说明书和所附权利要求书中所使用的那样,单数表达形式“一个”、“一种”、“该”、“上述”、“该”和“这一”旨在也包括复数表达形式,除非其上下文中明确地有相反指示。还应当理解,本申请中使用的术语“和/或”是指并包含一个或多个所列出项目的任何或所有可能组合。The terms used in the following embodiments of the present application are only for the purpose of describing specific embodiments, and are not intended to be used as limitations of the present application. As used in the specification of this application and the appended claims, the singular expressions "a," "an," "the," "above," "the," and "the" are intended to also include Plural expressions unless the context clearly dictates otherwise. It will also be understood that, as used in this application, the term "and/or" refers to and includes any and all possible combinations of one or more of the listed items.
以下,术语“第一”、“第二”仅用于描述目的,而不能理解为暗示或暗示相对重要性或者隐含指明所指示的技术特征的数量。由此,限定有“第一”、“第二”的特征可以明示或者隐含地包括一个或者更多个该特征,在本申请实施例的描述中,除非另有说明,“多个”的含义是两个或两个以上。Hereinafter, the terms "first" and "second" are only used for descriptive purposes, and should not be construed as implying or implying relative importance or implying the number of indicated technical features. Therefore, the features defined as "first" and "second" may explicitly or implicitly include one or more of the features. In the description of the embodiments of the present application, unless otherwise specified, the "multiple" The meaning is two or more.
为了便于理解,下面先对本申请实施例涉及的相关术语及相关概念进行介绍。本发明的实施方式部分使用的术语仅用于对本发明的具体实施例进行解释,而非旨在限定本发明。For ease of understanding, related terms and related concepts involved in the embodiments of the present application are first introduced below. The terms used in the embodiments of the present invention are only used to explain specific embodiments of the present invention, and are not intended to limit the present invention.
(1)回声(1) Echo
在网络通话、视频会议等场景中,远端或近端的用户发出声音后,本地的音频输出设备 会播放出用户之前发出的声音,则该音频输出设备播放的声音为回声。In scenarios such as network calls and video conferences, after the far-end or near-end user makes a sound, the local audio output device will play the sound made by the user before, and the sound played by the audio output device is an echo.
具体的,以远端用户为例,当远端用户发出声音后,该声音被远端的音频输入设备获取,并通过网络传输给近端的音频输出设备。近端的音频输出设备播放该声音后,由于近端的音频输入设备一直处于工作状态,所以该声音又会被近端的音频输出设备获取,并通过网络传输给远端的音频输出设备。远端的音频输出设备播放的声音中包括有前一时刻远端用户自己说话的声音,该声音即为回声。Specifically, taking the far-end user as an example, when the far-end user makes a sound, the sound is acquired by the far-end audio input device, and transmitted to the near-end audio output device through the network. After the near-end audio output device plays the sound, since the near-end audio input device is always working, the sound will be acquired by the near-end audio output device and transmitted to the far-end audio output device through the network. The sound played by the far-end audio output device includes the sound of the far-end user speaking by himself at the previous moment, and the sound is the echo.
下面以图1所示的内容为例,具体的介绍回声的产生过程。The following takes the content shown in FIG. 1 as an example to specifically introduce the echo generation process.
图1为本申请涉及的回声产生过程的一个示例性示意图。FIG. 1 is an exemplary schematic diagram of the echo generation process involved in the present application.
如图1所示,远端用户的语音1通过网络传输到近端的音频输出设备1,音频输出设备1播放该语音1。近端用户的语音2和远端用户的语音1都会被音频输入设备1获取到,进而将语音1和语音2通过网络传输到音频输出设备2上。音频输出设备2会播放语音1和语音2,对于远端用户1来说,语音1为回声。As shown in FIG. 1 , the voice 1 of the far-end user is transmitted to the near-end audio output device 1 through the network, and the audio output device 1 plays the voice 1 . Both the voice 2 of the near-end user and the voice 1 of the far-end user are acquired by the audio input device 1, and then the voice 1 and the voice 2 are transmitted to the audio output device 2 through the network. Audio output device 2 will play voice 1 and voice 2. For remote user 1, voice 1 is an echo.
(2)回声消除(2) Echo cancellation
回声消除包括:通过预先估计回声时延,结合自适应算法,从声音信号和回声信号的叠加信号中消除回声。回声消除模块为负责回声消除的模块。The echo cancellation includes: by pre-estimating the echo delay, combined with an adaptive algorithm, the echo is eliminated from the superimposed signal of the sound signal and the echo signal. The echo cancellation module is the module responsible for echo cancellation.
其中,当音频输入设备与音频输出设备之间通信的交互时延可以忽略时,回声时延为传播时延和处理时延的和;当音频输入设备与音频输出设备之间的交互时延不可以忽略时,回声时延可以为传播时延、处理时延和交互时延的和。Among them, when the interaction delay of the communication between the audio input device and the audio output device can be ignored, the echo delay is the sum of the propagation delay and the processing delay; when the interaction delay between the audio input device and the audio output device is not When negligible, the echo delay can be the sum of propagation delay, processing delay and interaction delay.
具体的,当音频输入设备和音频输出设备之间为有线连接时,认为音频输入设备与音频输出设备之间的交互时延可以忽略;当音频输入设备和音频输出设备之间为无线连接时,在一些情况下,例如,承载无线连接的信道通信质量较差、信道传输速率低或者信道时延较大时,认为音频输入设备与音频输出设备之间的交互时延不可以忽略。Specifically, when there is a wired connection between the audio input device and the audio output device, it is considered that the interaction delay between the audio input device and the audio output device can be ignored; when the audio input device and the audio output device are wirelessly connected, In some cases, for example, when the communication quality of the channel carrying the wireless connection is poor, the transmission rate of the channel is low, or the channel delay is large, it is considered that the interaction delay between the audio input device and the audio output device cannot be ignored.
其中,无线连接包括无线网络和近距离通信服务等。其中,无线网络包括蜂窝移动通信、WIFI等。其中,近距离通信服务可以是很多种形式,例如蓝牙、Hi-Link、近场通信(Near Field Communication,NFC)、Apple无线直连(Apple Wireles Direct Link,AWDL)等协议,在此不作限定。Among them, wireless connections include wireless networks and short-range communication services. The wireless network includes cellular mobile communication, WIFI, and the like. The short-range communication service can be in many forms, such as Bluetooth, Hi-Link, Near Field Communication (Near Field Communication, NFC), Apple Wireless Direct Link (Apple Wireless Direct Link, AWDL) and other protocols, which are not limited here.
其中,当音频输入设备与音频输出设备之间的交互时延可以忽略时,传播时延为声音在音频输出设备与音频输入设备之间空间传播的时延;处理时延由两部分组成,分别为处理时延1和处理时延2。其中,处理时延1为音频输出设备上声音信号从送到回声消除模块起,至声音信号被音频输出设备的音频播放模块播放该声音信号之间的时延;处理时延2为声音信号被音频输入设备获取,至声音信号被送到音频输出设备上的回声消除模块之间的时延。Among them, when the interaction delay between the audio input device and the audio output device can be ignored, the propagation delay is the delay of the spatial propagation of the sound between the audio output device and the audio input device; the processing delay consists of two parts, respectively are processing delay 1 and processing delay 2. Among them, the processing delay 1 is the delay from the time when the sound signal on the audio output device is sent to the echo cancellation module until the sound signal is played by the audio playback module of the audio output device; The delay between the acquisition of the audio input device and the sound signal being sent to the echo cancellation module on the audio output device.
其中,当音频输入设备与音频输出设备之间的交互时延不可以忽略时,进一步的,处理时延2可以分为处理时延21、交互时延、处理时延22。其中,处理时延21为音频输入设备从音频输入模块采集声音信号到通信模块发送声音信号之间的时间差;交互时延为从音频输入设备发送声音信号到音频输出设备接收声音信号之间的时间差,即交互时延与承载音频输出设备与音频输入设备数据交互的信道的通信性能有关;处理时延22为音频输出设备接收声音信号之间到音频输出设备上回声消除模块接收声音信号之间的时间差。Wherein, when the interaction delay between the audio input device and the audio output device cannot be ignored, further, the processing delay 2 can be divided into a processing delay 21 , an interaction delay, and a processing delay 22 . Among them, the processing delay 21 is the time difference between the audio input device collecting the sound signal from the audio input module and the communication module sending the sound signal; the interaction delay is the time difference between the audio input device sending the sound signal and the audio output device receiving the sound signal , that is, the interaction delay is related to the communication performance of the channel carrying the data interaction between the audio output device and the audio input device; the processing delay 22 is the time between the audio output device receiving the sound signal and the echo cancellation module on the audio output device receiving the sound signal. Time difference.
值得说明的是,传播时延与音频输出设备与音频输入设备之间的相对位置有关;处理时延1与音频输出设备上的硬件、软件有关;处理时延2与音频输入设备和音频输出设备上的硬件、软件有关。It is worth noting that the propagation delay is related to the relative position between the audio output device and the audio input device; the processing delay 1 is related to the hardware and software on the audio output device; the processing delay 2 is related to the audio input device and the audio output device. on the hardware and software.
下面以图2所示的内容为例,示例性的介绍传播时延、处理时延。The following takes the content shown in FIG. 2 as an example to exemplarily introduce the propagation delay and the processing delay.
图2为本申请涉及的传播时延和处理时延的一个示例性示意图。FIG. 2 is an exemplary schematic diagram of propagation delay and processing delay involved in the present application.
如图2所示,远端用户的数字声音信号通过网络传输到近端的设备后,设备会将该数字声音信号传输到回声消除模块上并且同时传送到音频输出设备1上。音频输出设备1会将该数字声音信号1通过D/A转换器等,将数字声音信号转变为模拟声音信号播放出去。音频输入设备1会获取该模拟声音信号,通过D/A转换器等将该模拟声音信号转变为数字声音信号,进而将数字声音信号传输到回声消除模块上。回声消除模块将回声消除后,将回声模块输出的数字声音信号通过网络传输到远端。As shown in FIG. 2 , after the digital sound signal of the far-end user is transmitted to the near-end device through the network, the device will transmit the digital sound signal to the echo cancellation module and to the audio output device 1 at the same time. The audio output device 1 will convert the digital sound signal 1 into an analog sound signal through a D/A converter or the like and play it out. The audio input device 1 acquires the analog sound signal, converts the analog sound signal into a digital sound signal through a D/A converter, etc., and then transmits the digital sound signal to the echo cancellation module. After the echo cancellation module cancels the echo, the digital sound signal output by the echo module is transmitted to the far end through the network.
在图2所示的内容中,处理时延1为数字声音信号分别被送到回声消除模块音频输出设备1之间的时间差;传播时延为模拟声音信号在音频输出设备1与音频输入设备1之间声波传播的时间;处理时延2为音频输入设备1获取到数字语音信号至音频输入设备1将该数字语音信号2送到回声消除模块之间的时间差。In the content shown in Figure 2, the processing delay 1 is the time difference between the digital sound signals being sent to the audio output device 1 of the echo cancellation module; the propagation delay is the analog sound signal between the audio output device 1 and the audio input device 1. The time of sound wave propagation between the two; the processing delay 2 is the time difference between the audio input device 1 obtains the digital voice signal and the audio input device 1 sends the digital voice signal 2 to the echo cancellation module.
下面以图3所示的内容为例,示例性的介绍回声消除模块是如何消除回声的。The following takes the content shown in FIG. 3 as an example to exemplarily introduce how the echo cancellation module cancels the echo.
图3为本申请涉及的回声模块的一个示例性示意图。FIG. 3 is an exemplary schematic diagram of the echo module involved in this application.
如图3所示,回声消除模块的输入包括:回声时延、近端声音、参考声音;回声消除模块的输出为近端声音-回声。其中,近端声音为近端音频输入设备获取到的所有声音信号;参考声音为远端通过网络传输过来的声音信号。As shown in FIG. 3 , the input of the echo cancellation module includes: echo delay, near-end sound, and reference sound; the output of the echo cancellation module is near-end sound-echo. Among them, the near-end sound is all sound signals obtained by the near-end audio input device; the reference sound is the sound signal transmitted by the far-end through the network.
回声消除模块中的数字滤波器通过自适应算法如:最小均方(LMS)算法、归一化最小均方(NLMS)算法、比例归一化最小均方(PNLMS)算法计算滤波器的权值。The digital filter in the echo cancellation module calculates the weight of the filter through adaptive algorithms such as: least mean square (LMS) algorithm, normalized least mean square (NLMS) algorithm, proportional normalized least mean square (PNLMS) algorithm .
值得说明的是,当回声消除模块的输入回声时延越接近真实回声的时延时,回声消除模块输出中回声的剩余越少。It is worth noting that, when the input echo delay of the echo cancellation module is closer to the delay time of the real echo, the less the residual echo in the output of the echo cancellation module is.
其次,下面介绍与本申请有关的一种回声消除方法:Secondly, an echo cancellation method related to the present application is introduced below:
具体的,当音频输入设备与音频输出设备位于同一设备上或者音频输入设备和音频输出设备之间的位置相对固定时,设备制造者会预先测试设备的回声时延,并将测试的回声时延值写入回声消除模块中。用户在设备使用该设备过程中,设备的回声消除模块根据出厂测试确定的回声时延参数能够正常工作,消除回声。Specifically, when the audio input device and the audio output device are located on the same device or the positions between the audio input device and the audio output device are relatively fixed, the device manufacturer will pre-test the echo delay of the device, and the tested echo delay The value is written to the echo cancellation module. When the user uses the device, the echo cancellation module of the device can work normally according to the echo delay parameters determined by the factory test to eliminate the echo.
图4为本申请涉及的回声消除方法使用场景的一个示例性示意图。FIG. 4 is an exemplary schematic diagram of a usage scenario of the echo cancellation method involved in this application.
如图4所示,对于手机来说,由于麦克风和喇叭的位置相对固定,可以通过人工测量音频输入设备和音频输出设备之间的距离,确定传播时延。例如,当麦克风和喇叭之间的距离大约为15cm时,回声时延中的传播时延大约为0.15/340≈0秒,可以忽略不计。处理时延1和处理时延2的和可以通过预先测试获得。在该情况下,回声时延等于处理时延。As shown in Figure 4, for a mobile phone, since the positions of the microphone and the speaker are relatively fixed, the propagation delay can be determined by manually measuring the distance between the audio input device and the audio output device. For example, when the distance between the microphone and the speaker is about 15cm, the propagation delay in the echo delay is about 0.15/340≈0 second, which can be ignored. The sum of processing delay 1 and processing delay 2 can be obtained by pre-testing. In this case, the echo delay is equal to the processing delay.
当用户使用通话功能、视频功能、语音功能时,麦克风接收到本地用户发出的声音以及喇叭播放的远端用户的声音后,麦克风将模拟信号通过放大器放大,并经过D/A转换器量化为数字信号后,将该数字信号传递到回声消除模块。回声消除模块对收到的声音信号进行滤波,滤除远端用户的声音后,回声消除模块将滤波后的声音信号编码后通过网络传输到对端用户处。When the user uses the call function, video function, and voice function, after the microphone receives the voice from the local user and the voice of the remote user played by the speaker, the microphone amplifies the analog signal through the amplifier, and quantizes it into digital through the D/A converter. After the signal, the digital signal is passed to the echo cancellation module. The echo cancellation module filters the received sound signal, and after filtering the voice of the remote user, the echo cancellation module encodes the filtered sound signal and transmits it to the opposite end user through the network.
当音频输入设备与音频输出设备之间的相对位置不固定,且回声时延不能准确估计的情况下,可以通过降低音频输入设备获取到的回声的能量,进而弥补回声时延估计不准导致的回声消除模块滤除回声的性能下降。When the relative position between the audio input device and the audio output device is not fixed, and the echo delay cannot be accurately estimated, the energy of the echo obtained by the audio input device can be reduced to compensate for the inaccurate estimation of the echo delay. The performance of the echo cancellation module to filter out echoes is degraded.
图5为本申请涉及的回声消除方法使用场景的另一个示例性示意图。FIG. 5 is another exemplary schematic diagram of a usage scenario of the echo cancellation method involved in this application.
在如图5所示的场景中,由于音频输入设备与音频输出设备之间的距离不确定,故无法确定回声时延中的传播时延。为了降低该场景下的回声,用户可以选择方向性强的音频输入设备和音频输出设备。例如,音频输入设备可以选择压强压差复合式的麦克风。压强压差复合式的麦克风具有较好的方向性,主要接受来自主瓣方向的声音信号,而来自旁瓣方向上的声音信号会被较大程度的衰减。In the scenario shown in FIG. 5, since the distance between the audio input device and the audio output device is uncertain, the propagation delay in the echo delay cannot be determined. In order to reduce the echo in this scene, the user can choose the audio input device and audio output device with strong directionality. For example, the audio input device can choose a pressure-pressure-difference composite microphone. The pressure and differential pressure composite microphone has good directivity, and mainly accepts the sound signal from the main lobe direction, while the sound signal from the side lobe direction will be attenuated to a greater extent.
其中,方向性的强弱与音频输入设备/音频输出设备方向图中主瓣与旁瓣的比值正相关。方向图用于描述音频输入设备/音频输出设备对不同方向上声音信号的归一化响应,其中,主瓣为方向图中归一化响应最大的方向,旁瓣为主瓣对应方向之外的方向。Among them, the strength of the directivity is positively related to the ratio of the main lobe to the side lobe in the audio input device/audio output device directional diagram. The direction map is used to describe the normalized response of the audio input device/audio output device to the sound signal in different directions. direction.
在图5所示的场景中,音频输出设备发出的声音信号沿路径1或路径2传播到音频输入设备上时,对于音频输入设备,音频输出设备播放的声音的来波方向主要是麦克风方向的旁瓣方向,会被较大程度的衰减。对于路径2来说,受到墙壁等物体的反射,声音信号的能量会被进一步衰减。在演唱会、会议室等类似场景中,墙壁等物体会被包裹上吸波材料,进而使得沿着路径2传播的声音信号的能量在被墙壁等物体反射时被再进一步的衰减。In the scenario shown in Figure 5, when the sound signal from the audio output device propagates to the audio input device along path 1 or path 2, for the audio input device, the direction of the sound played by the audio output device is mainly the direction of the microphone. The side lobe direction will be attenuated to a greater extent. For path 2, the energy of the sound signal will be further attenuated by reflections from objects such as walls. In concerts, conference rooms and other similar scenes, objects such as walls will be wrapped with absorbing materials, so that the energy of the sound signal propagating along path 2 is further attenuated when reflected by objects such as walls.
图6为图5中音频输入设备的方向图的一个示例性示意图。FIG. 6 is an exemplary schematic diagram of the orientation diagram of the audio input device in FIG. 5 .
当音频输入设备的方向图如图6所示时,可以认为音频输入设备从0度方向获取到用户的声音,并且从120度到240度的方向获取到回声。在该情况下,由于音频输入设备对不同方向的声音的响应不同,故可以通过改善音频输入设备的方向图,进而降低音频输入设备获取到的回声的能量。When the orientation diagram of the audio input device is shown in FIG. 6 , it can be considered that the audio input device acquires the user's voice from the 0-degree direction, and acquires the echo from the 120-degree to 240-degree direction. In this case, since the response of the audio input device to sounds in different directions is different, the energy of the echo obtained by the audio input device can be reduced by improving the direction map of the audio input device.
在会议/通话等场景中,由于多个音频输入设备散落在空间内的不同位置,回声时延中的传播时延不能忽略,且不同音频输入设备的回声时延不同。在会议进行中,音频输出设备和音频输入设备的绝对空间位置可以变化;音频输出设备和不同音频输入设备之间的相对距离也可以发生变化。其次,音频输出设备不知道音频输入设备是什么类型的设备;同样的,音频输入设备也不知道音频输出设备是什么类型的设备。在该情况下,无法通过预先测量估计不同音频输入设备的处理时延。其次,由于不知道音频输入设备上回声的来波方向,无法通过选择方向性强的音频输入设备降低回声能量。In scenarios such as conferences/calls, since multiple audio input devices are scattered at different positions in the space, the propagation delay in the echo delay cannot be ignored, and the echo delays of different audio input devices are different. During the conference, the absolute spatial positions of the audio output device and the audio input device can change; the relative distance between the audio output device and different audio input devices can also change. Second, the audio output device does not know what type of device the audio input device is; similarly, the audio input device does not know what type of device the audio output device is. In this case, the processing delay of different audio input devices cannot be estimated by pre-measurement. Second, since the direction of arrival of the echo on the audio input device is not known, the echo energy cannot be reduced by selecting a highly directional audio input device.
下面以图7所示的内容为例,示例性的介绍多人会议场景。The following takes the content shown in FIG. 7 as an example to exemplarily introduce a multi-person conference scenario.
图7为本申请涉及的会议场景的一个示例性示意图。FIG. 7 is an exemplary schematic diagram of a conference scenario involved in this application.
如图7所示,在会议/通话场景中,音频输出设备可以是平板,音频输入设备可以是手机。本地用户在该会议场景中,用户的随机移动会导致音频输入设备的随机移动。在该情况下,回声时延中传播时延随着时间变化而变换。并且,由于具备音频输入功能的设备均可以作为音频输入设备,音频输出设备与音频输入设备之间不可能互相知道对方的处理时延。As shown in FIG. 7 , in a conference/call scenario, the audio output device may be a tablet, and the audio input device may be a mobile phone. In this conference scenario, the local user's random movement will cause the random movement of the audio input device. In this case, the propagation delay in the echo delay varies with time. Moreover, since all devices with an audio input function can be used as audio input devices, it is impossible for the audio output device and the audio input device to know each other's processing delay.
图8为图7所示场景下回声消除数据流程的一个示例性示意图。FIG. 8 is an exemplary schematic diagram of an echo cancellation data flow in the scenario shown in FIG. 7 .
如图8所示,远端用户的数字声音信号1通过网络发送到近端的音频输出设备上。音频输出设备收到数字声音信号1后,转换为模拟声音信号1后播放出来。模拟声音信号1在近端的空间中传播。As shown in FIG. 8 , the digital sound signal 1 of the far-end user is sent to the near-end audio output device through the network. After the audio output device receives the digital sound signal 1, it converts it into an analog sound signal 1 and plays it out. The analog sound signal 1 propagates in the near-end space.
当近端用户1说话后,音频输入设备1可以获取到近端用户1的模拟声音信号2和在空间中传播的模拟声音信号1。音频输入设备1会将获取到所有模拟声音信号,转换为数字声音信号2,通过网络发送给音频输出模块。音频输出模块对数字声音信号2进行回声消除后,将回声消除后的数字声音信号发送给远端设备。同理,当近端用户2说话后,音频输入设备2可以获取到近端用户2的模拟声音信号3和在空间中传播的模拟声音信号2。音频输入设备2会将获取到所有模拟声音信号,转换为数字声音信号3,通过网络发送给音频输出模块。音 频输出模块对数字声音信号3进行回声消除后,将回声消除后的数字声音信号发送给远端设备。After the near-end user 1 speaks, the audio input device 1 can acquire the analog sound signal 2 of the near-end user 1 and the analog sound signal 1 propagating in space. The audio input device 1 will acquire all the analog sound signals, convert them into digital sound signals 2, and send them to the audio output module through the network. After the audio output module performs echo cancellation on the digital sound signal 2, it sends the echo-cancelled digital sound signal to the remote device. Similarly, after the near-end user 2 speaks, the audio input device 2 can acquire the analog sound signal 3 of the near-end user 2 and the analog sound signal 2 propagating in space. The audio input device 2 will acquire all analog sound signals, convert them into digital sound signals 3, and send them to the audio output module through the network. After the audio output module performs echo cancellation on the digital sound signal 3, it sends the echo-cancelled digital sound signal to the remote device.
由于无法确定回声时延中的传播时延、处理时延,音频输出设备上的回声消除模块输出的声音信号中的回声信号。Since the propagation delay and processing delay in the echo delay cannot be determined, the echo signal in the sound signal output by the echo cancellation module on the audio output device.
值得说明的是,若用户使用的会议软件、通话软件等软件为设备的本地应用,并且该本地应用可以访问电子设备的驱动,则可以确定处理时延2,但仍然无法确定回声时延。It is worth noting that if the conference software, call software and other software used by the user are local applications of the device, and the local application can access the driver of the electronic device, the processing delay 2 can be determined, but the echo delay cannot be determined.
结合图5至图8所示的内容,可以确定,在音频输入设备与音频输出设备之间距离不固定的情况下,尤其是在会议/通话等日常场景中,通过提前测量确定回声时延的方法不能够准确估计真实场景中用户使用时的回声时延,进而不能有效滤除、抑制回声。其次,由于音频输出设备与音频输入设备可以为不同类型的设备,例如手机、平板、手环、智能眼睛、VR/AR设备、车载终端等,无法要求音频输入设备以及音频输出设备具有良好的方向性。Combining the content shown in Figure 5 to Figure 8, it can be determined that when the distance between the audio input device and the audio output device is not fixed, especially in daily scenarios such as conferences/calls, the echo delay can be determined by measuring in advance. The method cannot accurately estimate the echo delay when the user uses it in the real scene, and thus cannot effectively filter and suppress the echo. Secondly, since the audio output device and audio input device can be different types of devices, such as mobile phones, tablets, wristbands, smart eyes, VR/AR devices, vehicle terminals, etc., it is impossible to require audio input devices and audio output devices to have good orientation. sex.
再次,下面以单个音频输入设备201、单个音频输出设备200为例介绍本申请提供的回声时延估计方法。Again, the echo delay estimation method provided by the present application is described below by taking a single audio input device 201 and a single audio output device 200 as examples.
本申请提供的回声时延估计方法,通过使用信令对处理时延进行估计。并且,利用设备的陀螺仪和/或加速度计传感器和/或图像传感器等传感器,进行空间建模,估计不同音频输入设备与音频输出设备之间的距离,进而确定传播时延。The echo delay estimation method provided by the present application estimates the processing delay by using signaling. And, using the device's gyroscope and/or accelerometer sensor and/or image sensor and other sensors to perform spatial modeling, estimate the distance between different audio input devices and audio output devices, and then determine the propagation delay.
在本申请的一些实施例中,电子设备可以不执行步骤S905,即认为处理时延可以忽略,以步骤S906、步骤S907计算确定的传播时延作为回声时延。In some embodiments of the present application, the electronic device may not perform step S905, that is, the processing delay is considered to be negligible, and the propagation delay calculated in steps S906 and S907 is used as the echo delay.
图9为本申请实施例提供的回声时延估计方法的一个示例性示意图。FIG. 9 is an exemplary schematic diagram of an echo delay estimation method provided by an embodiment of the present application.
如图9所示,本申请提供的回声时延估计方法具体包括:As shown in Figure 9, the echo delay estimation method provided by the application specifically includes:
S901:用户启动会议类应用。S901: The user starts a conference application.
具体的,当用户准备开始参与网络会议/通话前,会预先启动会议/通话类应用。在启动会议/通话类应用后,用户需要选择合适的设备作为音频输入设备/音频输出设备。Specifically, before the user is ready to start participating in the network conference/call, the conference/call application will be pre-launched. After starting the conference/call application, the user needs to select a suitable device as the audio input device/audio output device.
电子设备启动会议类应用后,会通过有线网络、无线网络、近距离通信服务或图像传感器等多种方式去发现可用的音频输入设备、音频输出设备。在发现到可用的音频输入设备、音频输出设备后,应用将可用的设备展现在应用的界面上,供用户选择。After the electronic device starts the conference application, it will find the available audio input device and audio output device through wired network, wireless network, short-range communication service or image sensor and other methods. After discovering the available audio input devices and audio output devices, the application displays the available devices on the application interface for the user to select.
例如,用户可以选择平板作为音频输出设备200、手机作为音频输入设备201;或者,用户可以选择平板作为音频输出设备200、蓝牙耳机作为音频输入设备201;或者,用户可以选择手机作为音频输出设备200、麦克风作为音频输入设备201;或者,用户可以选择带有音响功能的投影仪作为音频输出设备200、平板作为音频输入设备201等,在此不做限定。For example, the user can select the tablet as the audio output device 200 and the mobile phone as the audio input device 201; or, the user can select the tablet as the audio output device 200 and the Bluetooth headset as the audio input device 201; or, the user can select the mobile phone as the audio output device 200 , a microphone as the audio input device 201; alternatively, the user can select a projector with an audio function as the audio output device 200, a tablet as the audio input device 201, etc., which are not limited herein.
S902:开启位置校准。S902: Enable position calibration.
具体的,当用户选择完音频输出设备之后、音频输入设备之后,电子设备提醒用户开始位置校准。当音频输入设备至音频输出设备之间的距离低于距离阈值后,位置校准完成。Specifically, after the user selects the audio output device and the audio input device, the electronic device reminds the user to start the position calibration. When the distance between the audio input device and the audio output device falls below the distance threshold, the position calibration is complete.
其中,位置校准主要包括两种方式:其一,用户手动确认音频输入设备与音频输出设备之间的距离低于距离阈值;其二,音频输出设备/音频输入设备确定两者之间的距离低于距离阈值。Among them, the position calibration mainly includes two methods: first, the user manually confirms that the distance between the audio input device and the audio output device is lower than the distance threshold; second, the audio output device/audio input device determines that the distance between the two is low at the distance threshold.
下面以图10A至图10C所示的内容为例,示例性的分别介绍两种位置校准方式。Taking the content shown in FIG. 10A to FIG. 10C as an example below, two position calibration methods are exemplarily introduced respectively.
图10A至图10C为本申请实施例提供的两种位置校准方法的示例性示意图。FIGS. 10A to 10C are exemplary schematic diagrams of two position calibration methods provided by the embodiments of the present application.
其中,第一种位置校准方法如图10A与图10B所示。The first position calibration method is shown in FIG. 10A and FIG. 10B .
如图10A所示,当用户选择手机作为音频输入设备201、平板作为音频输出设备200后,平板和手机会提醒用户将音频输入设备与音频输出设备靠近。例如,手机上会显示界面1001,界面1001中显示的内容用于提醒用户当前已经开启位置校准,同样的,平板上会显示界面2001,界面2001中显示的内容用于提醒用户当前已经开启位置校准。As shown in FIG. 10A , when the user selects the mobile phone as the audio input device 201 and the tablet as the audio output device 200 , the tablet and the mobile phone will remind the user to bring the audio input device and the audio output device close to each other. For example, an interface 1001 will be displayed on the mobile phone, and the content displayed in the interface 1001 is used to remind the user that the position calibration has been turned on. .
如图10B所示,界面1001与界面2001还可以显示其他内容,例如告知用户如何完成位置校准。当用户通过移动音频输入设备和/或音频输出设备使得两者靠近后,可以通过点击音频输入设备上的确定控件1002告知音频输入设备已经完成位置校准;或者,用户可以通过点自己音频输出设备上的确定控件2002告知音频输出设备已经完成位置校准。As shown in FIG. 10B , the interface 1001 and the interface 2001 may also display other contents, such as informing the user how to complete the position calibration. When the user moves the audio input device and/or the audio output device so that they are close to each other, the user can click the confirmation control 1002 on the audio input device to inform the audio input device that the position calibration has been completed; The OK control of 2002 informs the audio output device that position calibration has been completed.
值得说明的是,用户点击确定控件1002告知音频输入设备201已经完成位置校准后,音频输入设备可以告知音频输出设备已经完成位置校准;对应的,用户点击确定控件2002告知音频输出设备200已经完成位置校准。It is worth noting that, after the user clicks the OK control 1002 to inform the audio input device 201 that the position calibration has been completed, the audio input device can inform the audio output device that the position calibration has been completed; correspondingly, the user clicks the OK control 2002 to inform the audio output device 200 that the position calibration has been completed. calibration.
其中,第二种位置校准方法如图10A与图10C所示。The second position calibration method is shown in FIG. 10A and FIG. 10C .
当用户选择手机作为音频输入设备201、平板作为音频输出设备200后。音频输入设备201和音频输出设备200,可以开启近距离通信服务去估计音频输入设备201与音频输出设备200的距离;或者,音频输入设备201和/或音频输出设备200之间通过传感器如图像传感器、激光传感器等估计音频输入设备201与音频输出设备200的距离;或者,音频输入设备201和/或音频输出设备200之间通过路由器、基站等网络接入设备估计音频输入设备201与音频输出设备200的距离等。When the user selects the mobile phone as the audio input device 201 and the tablet as the audio output device 200 . For the audio input device 201 and the audio output device 200, the proximity communication service can be enabled to estimate the distance between the audio input device 201 and the audio output device 200; , laser sensor, etc. to estimate the distance between audio input device 201 and audio output device 200; 200 distance, etc.
其中,通过近距离通信服务估计音频输入设备201与音频输出设备200的距离包括:音频输入设备201/音频输出设备200根据接收信号的强度确定两者之间的距离。例如,当音频输入设备201与音频输出设备200蓝牙连接时,可以根据接收信号强度指示(Received signal strengthindicator,RSSI)以及
Figure PCTCN2022084862-appb-000001
确定音频输入设备201与音频输出设备200的距离。其中,d为音频输入设备201与音频输出设备200的距离,A为发射端和接收端相隔1米时的信号强度,η为与环境有关的衰减因子。
Wherein, estimating the distance between the audio input device 201 and the audio output device 200 through the short-range communication service includes: the audio input device 201/audio output device 200 determines the distance between the two according to the strength of the received signal. For example, when the audio input device 201 is connected to the audio output device 200 via Bluetooth, the received signal strength indicator (RSSI) and
Figure PCTCN2022084862-appb-000001
The distance between the audio input device 201 and the audio output device 200 is determined. Among them, d is the distance between the audio input device 201 and the audio output device 200, A is the signal strength when the transmitting end and the receiving end are separated by 1 meter, and n is the attenuation factor related to the environment.
当确定音频输入设备201和音频输出设备200之间的距离后,判断是否该距离是否小于预设的距离阈值。若该距离小于等于距离阈值,则可以认为音频输入设备201靠近音频输出设备200,即已经完成位置校准;若该距离大于距离阈值,则可以认为音频输入设备201没有靠近音频输出设备200,即还未完成位置校准。After the distance between the audio input device 201 and the audio output device 200 is determined, it is determined whether the distance is less than a preset distance threshold. If the distance is less than or equal to the distance threshold, it may be considered that the audio input device 201 is close to the audio output device 200, that is, the position calibration has been completed; if the distance is greater than the distance threshold, it may be considered that the audio input device 201 is not close to the audio output device 200, that is, Position calibration not completed.
如图10C所示,手机作为音频输入设备201,平板作为音频输出设备200。手机根据估计手机与平板之间的距离与距离阈值的大小,在界面1001上显示是否完成校准;同样的,平板据估计手机与平板之间的距离与距离阈值的大小,在界面1001上显示是否完成校准。As shown in FIG. 10C , the mobile phone is used as the audio input device 201 , and the tablet is used as the audio output device 200 . The mobile phone displays whether the calibration is completed on the interface 1001 according to the estimated distance between the mobile phone and the tablet and the distance threshold; similarly, the tablet displays on the interface 1001 whether the calibration is completed according to the estimated distance between the mobile phone and the tablet and the distance threshold. Complete the calibration.
S903:音频输入设备201是否靠近音频输出设备200。S903: Whether the audio input device 201 is close to the audio output device 200.
具体的,当步骤S902中使用的第一种位置校准方法时,音频输出设备200或音频输入设备201确定是否接收到用户的确认输入。其中,确认输入为用户确认音频输出设备200和音频输入设备201已经在空间位置上互相靠近。音频输出设备200或音频输入设备201如何获取用户的输入可以参考图10A中的内容,此处不再赘述。若音频输入设备201或音频输出设备200接收到用户的确认输入,则执行步骤S905、步骤S906;若音频输入设备201或音频输 出设备200没有接收到用户的确认输入,则执行步骤S904。Specifically, when the first position calibration method is used in step S902, the audio output device 200 or the audio input device 201 determines whether a confirmation input from the user is received. The confirmation input is that the user confirms that the audio output device 200 and the audio input device 201 are close to each other in space. How the audio output device 200 or the audio input device 201 obtains the user's input can refer to the content in FIG. 10A , and details are not repeated here. If the audio input device 201 or the audio output device 200 receives the confirmation input of the user, then step S905 and step S906 are performed; if the audio input device 201 or the audio output device 200 does not receive the confirmation input of the user, then step S904 is performed.
当步骤S902中使用第二种位置校准方法时,音频输出设备200或音频输入设备201可以判断音频输入设备201与音频输出设备200之间的距离是否小于等于距离阈值。若小于等于距离阈值,执行步骤S905、步骤S906;若大于距离阈值,执行步骤S904。When the second position calibration method is used in step S902, the audio output device 200 or the audio input device 201 can determine whether the distance between the audio input device 201 and the audio output device 200 is less than or equal to the distance threshold. If it is less than or equal to the distance threshold, go to step S905 and step S906; if it is greater than the distance threshold, go to step S904.
S904:提醒用户调整音频输入设备201与音频输出设备200的位置。S904 : remind the user to adjust the positions of the audio input device 201 and the audio output device 200 .
音频输入设备201和/或音频输出设备200通过屏幕显示、播放声音、震动等多种方式提醒用户去调整音频输入设备201与音频输出设备200的位置。其中,音频输入设备201和/或音频输出设备200通过屏幕显示的内容提醒用户调整音频输入设备201与音频输出设备200位置,可以参考图10A的描述,此处不再赘述。The audio input device 201 and/or the audio output device 200 reminds the user to adjust the positions of the audio input device 201 and the audio output device 200 through various methods such as screen display, playing sound, and vibration. The audio input device 201 and/or the audio output device 200 reminds the user to adjust the positions of the audio input device 201 and the audio output device 200 through the content displayed on the screen. Reference may be made to the description in FIG. 10A , which will not be repeated here.
执行步骤S903。Step S903 is executed.
S905:音频输出设备200发送语音信令,音频输入设备201接收到语音信令后,将语音信令转发给音频输出设备200,进而确定传播时延。S905: The audio output device 200 sends voice signaling, and after receiving the voice signaling, the audio input device 201 forwards the voice signaling to the audio output device 200, and further determines the propagation delay.
具体的,当位置校准完成后,认为音频输入设备201与音频输出设备200的空间位置重合或接近重合。此时,音频输出设备200可以向音频输入设备201发送信令,该信令可以是模拟信号(声波),并在空间中自由传播,音频输入设备201在空间中不断的采集声波,其中声波包括信令。其中,信令为具有特定时域波形特征或者是频谱特征的信号。Specifically, after the position calibration is completed, it is considered that the spatial positions of the audio input device 201 and the audio output device 200 are coincident or nearly coincident. At this time, the audio output device 200 can send signaling to the audio input device 201, and the signaling can be an analog signal (sound wave), which can propagate freely in space, and the audio input device 201 continuously collects sound waves in the space, wherein the sound waves include signaling. The signaling is a signal with specific time-domain waveform characteristics or spectral characteristics.
由于认为音频输入设备201与音频输出设备200的空间位置重合,在该情况下可以忽略传播时延,故回声时延等于处理时延。Since it is considered that the spatial positions of the audio input device 201 and the audio output device 200 are coincident, the propagation delay can be ignored in this case, so the echo delay is equal to the processing delay.
音频输入设备201不断将获取的声波通过网络或近距离通服务不断的将该声波对应的数字信号传输给音频输出设备200。音频输出设备200可以根据信令的时域特征、频域特征、时域-频域联合特征,确定接收到信号中是否包括该信令,进而确定该信令从回声消除模块至音频输入设备201发送到音频输出设备200的回声消除模块之间的时间差T2。其中,该时间差T2为处理时延。The audio input device 201 continuously transmits the acquired sound waves to the audio output device 200 through the network or the short-range communication service. The audio output device 200 can determine whether the received signal includes the signaling according to the time domain feature, frequency domain feature, and time domain-frequency domain joint feature of the signaling, and then determine that the signaling is sent from the echo cancellation module to the audio input device 201. The time difference T2 between echo cancellation modules sent to the audio output device 200 . Wherein, the time difference T2 is the processing delay.
其中,由于音频输出设备200已知信令的时域特征、频域特征或时频域联合特征,音频输出设备200可以利用互相关函数、匹配滤波器、频谱分析、时频分析等方法确定是否收到信令,以及收到信令的时间。Wherein, since the audio output device 200 knows the time domain feature, frequency domain feature or time-frequency domain joint feature of the signaling, the audio output device 200 can use methods such as cross-correlation function, matched filter, spectrum analysis, time-frequency analysis, etc. to determine whether to The signalling is received, and the time when the signalling is received.
值得说明的是,当音频输出设备200和/或音频输入设备201完成位置校准后,音频输出设备200可以告知音频输入设备201位置校准成功;或者,用户可以通过交互告知音频输入设备201位置校准成功。在位置校准成功后,用户可以自由的移动音频输入设备201和/或音频输出设备200。It should be noted that, after the audio output device 200 and/or the audio input device 201 completes the position calibration, the audio output device 200 may inform the audio input device 201 that the position calibration is successful; or, the user may inform the audio input device 201 that the position calibration is successful through interaction . After the position calibration is successful, the user can freely move the audio input device 201 and/or the audio output device 200 .
值得说明的是,在部分情况下,可以由音频输入设备201发送信令,音频输出设备200接收到信令后,将信令转发给音频输入设备201,进而确定传播时延。音频输入设备201确定传播时延后,将该传播时延的值发送给音频输出设备200。It should be noted that, in some cases, the audio input device 201 may send the signaling, and after receiving the signaling, the audio output device 200 forwards the signaling to the audio input device 201 to determine the propagation delay. After the audio input device 201 determines the propagation delay, it sends the value of the propagation delay to the audio output device 200 .
下面以图11A至图11B所示的内容为例,示例性的分别介绍音频输出设备200确定接收到信号是否包括信令以及音频输出设备200确定处理时延的过程。11A to FIG. 11B are taken as an example to exemplarily introduce the process of the audio output device 200 determining whether the received signal includes signaling and the audio output device 200 determining the processing delay.
图11A至图11B为本申请实施例提供的确定回声时延过程的一个示例性示意图。FIG. 11A to FIG. 11B are exemplary schematic diagrams of a process of determining an echo delay provided by an embodiment of the present application.
如图11A所示,信令1由音频输出设备200产生后,可以经由音频输出设备200的回声消除模块传输到音频输出设备200的音频输出模块。音频输出设备200的音频输出模块将数 字信号信令1通过喇叭转变为声波信号后播放;音频输入设备201中的音频输入模块获取,并将该声波信令1转变为数字信号信令1。音频输入模块将信令1传输到音频输入设备201的通信模块。音频输入设备201的通信模块会将该信令1通过网络或近距离传输服务传输到音频输出设备200的通信模块。音频输出设备200的通信模块将收到的信令1传输给音频输出设备200的回声消除模块。As shown in FIG. 11A , after the signaling 1 is generated by the audio output device 200 , it can be transmitted to the audio output module of the audio output device 200 via the echo cancellation module of the audio output device 200 . The audio output module of the audio output device 200 converts the digital signal signaling 1 into a sound wave signal and plays it through a speaker; the audio input module in the audio input device 201 acquires and converts the sound wave signaling 1 into a digital signal signaling 1. The audio input module transmits signaling 1 to the communication module of the audio input device 201 . The communication module of the audio input device 201 will transmit the signaling 1 to the communication module of the audio output device 200 through the network or the short-range transmission service. The communication module of the audio output device 200 transmits the received signaling 1 to the echo cancellation module of the audio output device 200 .
由于音频输入设备201和音频输出设备200已经完成位置校准,所以可以忽略音频输入设备201和音频输出设备200之间的空间距离,进而可以忽略回声时延中的传播时延。音频输出设备200记录信令1被回声消除模块发送的时刻为T 1,以及记录信令1被回声消除模块接收的时刻为T 2。音频输出设备200可以根据T 1、T 2确定回声时延中的处理时延。例如,处理时延=T 2-T 1Since the position calibration of the audio input device 201 and the audio output device 200 has been completed, the spatial distance between the audio input device 201 and the audio output device 200 can be ignored, and thus the propagation delay in the echo delay can be ignored. The audio output device 200 records the time when signaling 1 is sent by the echo cancellation module as T 1 , and records the time when signaling 1 is received by the echo cancellation module as T 2 . The audio output device 200 may determine the processing delay among the echo delays according to T 1 and T 2 . For example, processing delay = T 2 -T 1 .
可以理解的是,通过发送信令计算出的处理时延等价于计算出从远端的电子设备发送的声音到达音频输入设备201上的通信模块至音频输入设备201将回声转发给音频输出设备200上的通信模块之间的时延。It can be understood that the processing delay calculated by sending the signaling is equivalent to calculating that the sound sent from the remote electronic device reaches the communication module on the audio input device 201 to the audio input device 201 and forwards the echo to the audio output device. Latency between communication modules on 200.
如图11B所示,音频输出设备200可以根据信令1的时域特征,确定T 2时刻回声模块接收到音频输入设备201发送的信令1。进而,确定回声处理中的处理时延。 As shown in FIG. 11B , the audio output device 200 can determine that the echo module at time T2 receives the signaling 1 sent by the audio input device 201 according to the time domain feature of the signaling 1 . Further, the processing delay in the echo processing is determined.
执行步骤S908。Step S908 is executed.
S906:建立三维空间坐标系。S906: Establish a three-dimensional space coordinate system.
具体的,在校准完成后,音频输出设备200可以以自己为原点建立三维空间坐标系。Specifically, after the calibration is completed, the audio output device 200 may establish a three-dimensional space coordinate system with itself as the origin.
在本申请一些实施例中,音频输出设备200和音频输入设备201都可以以自己为原点建立三维空间坐标系。In some embodiments of the present application, both the audio output device 200 and the audio input device 201 may establish a three-dimensional space coordinate system with itself as the origin.
其中,三维空间坐标系可以是笛卡尔坐标系、极坐标系、球坐标系等,在此不作限定。The three-dimensional space coordinate system may be a Cartesian coordinate system, a polar coordinate system, a spherical coordinate system, or the like, which is not limited here.
执行步骤S907。Step S907 is executed.
S907:音频输入设备201向音频输出设备200发送运动信息、位置信息或距离信息。S907 : The audio input device 201 sends motion information, location information or distance information to the audio output device 200 .
在音频输出设备200建立三维空间坐标系后,音频输入设备201可以通过网络/近距离通信服务周期性的向音频输出设备200发送运动信息。其中,运动信息包括速度、加速度、方位角、俯仰角、偏转角等中的一种或多种。音频输出设备200收到音频输入设备201发送的运动信息后,可以计算出音频输入设备201在三维空间坐标系的坐标,进而确定音频输入设备201与自己的距离,并根据该距离确定回声时延的传播时延。其中,当确定运动信息后,音频输入设备201或音频输出设备200可以结合运动信息和惯性导航算法,去确定运动所对应的设备的位置信息。After the audio output device 200 establishes the three-dimensional space coordinate system, the audio input device 201 may periodically send motion information to the audio output device 200 through the network/near field communication service. The motion information includes one or more of speed, acceleration, azimuth angle, pitch angle, yaw angle, and the like. After the audio output device 200 receives the motion information sent by the audio input device 201, it can calculate the coordinates of the audio input device 201 in the three-dimensional space coordinate system, and then determine the distance between the audio input device 201 and itself, and determine the echo delay according to the distance. propagation delay. Wherein, after determining the motion information, the audio input device 201 or the audio output device 200 may combine the motion information and an inertial navigation algorithm to determine the position information of the device corresponding to the motion.
若音频输入设备201也建立了三维空间坐标系,音频输入设备201可以根据运动信息确定自己的坐标。音频输入设备201将自己的位置信息发送给音频输出设备200。其中,位置信息可以包括坐标。音频输出设备200确定音频输入设备201的位置后,可以根据自己的位置和音频输入设备201的位置确定两者之间的距离。If the audio input device 201 also establishes a three-dimensional space coordinate system, the audio input device 201 can determine its own coordinates according to the motion information. The audio input device 201 sends its own location information to the audio output device 200 . The location information may include coordinates. After the audio output device 200 determines the location of the audio input device 201 , the distance between the two can be determined according to its own location and the location of the audio input device 201 .
若音频输入设备201也建立了三维空间坐标系,在音频输出设备200位于原点的情况下,音频输入设备还可以直接根据运动信息确定自己的坐标、并根据自己的坐标确定自己与音频输出设备200的距离,并将距离信息发送给音频输出设备200。其中,距离信息包括音频输入设备201与音频输出设备200之间的距离。If the audio input device 201 also establishes a three-dimensional space coordinate system, when the audio output device 200 is located at the origin, the audio input device can directly determine its own coordinates according to the motion information, and determine itself and the audio output device 200 according to its own coordinates. and send the distance information to the audio output device 200. The distance information includes the distance between the audio input device 201 and the audio output device 200 .
执行步骤S908。Step S908 is executed.
值得说明的是,音频输入设备201可以周期性的向音频输出设备200发送运动信息、位置信息或者距离信息。或者,在音频输入设备201判断当前位置与上一次向音频输出设备200发送位置信息、距离信息时的位置之间的距离大于阈值时,音频输入设备201向音频输出设备200发送位置信息或距离信息。It should be noted that the audio input device 201 may periodically send motion information, location information or distance information to the audio output device 200 . Alternatively, when the audio input device 201 determines that the distance between the current position and the position when the position information and distance information were sent to the audio output device 200 last time is greater than the threshold, the audio input device 201 sends the position information or distance information to the audio output device 200 .
图12为本申请实施例提供的音频输出设备200建立三维空间坐标系的一个示例性示意图。FIG. 12 is an exemplary schematic diagram of establishing a three-dimensional space coordinate system by the audio output device 200 according to an embodiment of the present application.
如图12所示,音频输入设备201在建立坐标系后,可以根据运动信息确定自己的坐标。例如,在某一时刻,音频输入设备201根据运动信息确定自己的坐标为(x 0,y 0,z 0);在另一时刻,音频输入设备201根据运动信息确定自己的坐标为(x 1,y 1,z 1); As shown in FIG. 12 , after establishing the coordinate system, the audio input device 201 can determine its own coordinates according to the motion information. For example, at a certain moment, the audio input device 201 determines its own coordinates as (x 0 , y 0 , z 0 ) according to the motion information; at another moment, the audio input device 201 determines its own coordinates as (x 1 according to the motion information) , y 1 , z 1 );
音频输入设备201可以将自己的位置信息如(x 0,y 0,z 0)或(x 1,y 1,z 1)通过网络发送到音频输出设备200;或者,当音频输出设备200未移动的情况下,音频输入设备201将自己的距离信息如
Figure PCTCN2022084862-appb-000002
或者
Figure PCTCN2022084862-appb-000003
发送给音频输出设备;或者,当音频输入设备201确定音频输出设备200的位置信息如(x 2,y 2,z 2),音频输入设备201将自己的距离信息如
Figure PCTCN2022084862-appb-000004
或者
Figure PCTCN2022084862-appb-000005
发送给音频输出设备200。
The audio input device 201 can send its own position information such as (x 0 , y 0 , z 0 ) or (x 1 , y 1 , z 1 ) to the audio output device 200 through the network; or, when the audio output device 200 does not move In the case of , the audio input device 201 converts its distance information as
Figure PCTCN2022084862-appb-000002
or
Figure PCTCN2022084862-appb-000003
send to the audio output device; or, when the audio input device 201 determines the position information of the audio output device 200 such as (x 2 , y 2 , z 2 ), the audio input device 201 sends its own distance information such as
Figure PCTCN2022084862-appb-000004
or
Figure PCTCN2022084862-appb-000005
sent to the audio output device 200 .
同理,音频输出设备200接收到音频输入设备201发送的运动信息或者位置信息,可以确定音频输出设备200与音频输入设备201之间的距离。Similarly, the audio output device 200 can determine the distance between the audio output device 200 and the audio input device 201 after receiving the motion information or the position information sent by the audio input device 201 .
S908:回声消除模块确定回声时延,对收到的声音信号进行回声消除。S908: The echo cancellation module determines the echo delay, and performs echo cancellation on the received sound signal.
根据步骤S905,音频输出设备200可以确定回声时延中的处理时延;根据步骤S906、S907,可以确定回声时延中的传播时延。音频输出设备200可以确定针对音频输入设备201的回声时延,其中回声时延=处理时延+传播时延。音频输出设备200在确定针对音频输入设备201的回声时延后,可以根据该回声时延,音频输出设备200滤除音频输入设备201获取到的声音中的回声,并将滤除后的声音发送到远端用户的设备上。According to step S905, the audio output device 200 can determine the processing delay in the echo delay; according to steps S906 and S907, can determine the propagation delay in the echo delay. The audio output device 200 may determine the echo delay for the audio input device 201, where echo delay=processing delay+propagation delay. After the audio output device 200 determines the echo delay for the audio input device 201, the audio output device 200 can filter out the echo in the sound obtained by the audio input device 201 according to the echo delay, and send the filtered sound. to the remote user's device.
结束。Finish.
值得说明的是,在网络通话、视频会议等场景中,当有新设备加入通话或会议时,可以通过完整执行步骤S901至步骤S908,使得音频输出设备200可以确定新用户的新设备的回声时延。其中,新设备的回声时延与新设备与音频输出设备200之间的距离有关。It is worth noting that, in scenarios such as network calls and video conferences, when a new device joins the call or conference, steps S901 to S908 can be completely executed, so that the audio output device 200 can determine the echo of the new device of the new user. extension. The echo delay of the new device is related to the distance between the new device and the audio output device 200 .
下面以单个音频输入设备、单个音频输出设备为例,以数据流程的形式示例性的介绍如图9所示的本申请提供的回声时延估计方法。Taking a single audio input device and a single audio output device as examples below, the echo delay estimation method provided by the present application as shown in FIG. 9 is exemplarily introduced in the form of a data flow.
图13为本申请实施例提供的单设备下的回声时延估计方法的一个示例性示意图。FIG. 13 is an exemplary schematic diagram of an echo delay estimation method under a single device provided by an embodiment of the present application.
如图13所示,对应于步骤S901、步骤S902,用户启动会议类/通话类应用后,并且选择合适的音频输入设备201和音频输出设备200后,开始音频输入设备201和音频输出设备200的位置校准。As shown in FIG. 13, corresponding to steps S901 and S902, after the user starts the conference/call application, and selects the appropriate audio input device 201 and audio output device 200, the audio input device 201 and the audio output device 200 are started. Position calibration.
对应于步骤S903、步骤S904,音频输入设备201和音频输出设备200不断的进行位置校准,直到位置校准成功。其中,位置校准是否成功,当音频输入设备201/音频输出设备200为用户提供可交互的控件,以音频输入设备201/音频输出设备200是否接受到用户的输入位置;或者,音频输入设备201/音频输出设备200可以通过移动蜂窝网络、WIFI、近距离通信服务,确定是否完成位置校准。Corresponding to steps S903 and S904, the audio input device 201 and the audio output device 200 continuously perform position calibration until the position calibration is successful. Wherein, whether the position calibration is successful, when the audio input device 201/audio output device 200 provides interactive controls for the user, whether the audio input device 201/audio output device 200 accepts the user's input position; or, the audio input device 201/ The audio output device 200 may determine whether to complete the position calibration through the mobile cellular network, WIFI, and short-range communication services.
当音频输出设备200接收到用户的确认输入后,音频输出设备200确定位置校准成功,发送消息告知音频输入设备201位置校准成功;对应的,若是音频输入设备201接收到用户 的确认输入后,音频输入设备201确定位置校准成功,发送消息告知音频输出设备200位置校准成功。After the audio output device 200 receives the confirmation input from the user, the audio output device 200 determines that the position calibration is successful, and sends a message to inform the audio input device 201 that the position calibration is successful; correspondingly, if the audio input device 201 receives the confirmation input from the user, the audio The input device 201 determines that the position calibration is successful, and sends a message to inform the audio output device 200 that the position calibration is successful.
当音频输入设备201或音频输出设备200,通过移动蜂窝网络、WIFI、近距离通信服务,确定两者之间的距离是否小于距离阈值。其中,音频输入设备201或音频输出设备200中任一个设备根据移动蜂窝网络、WIFI、近距离通信服务,确定两者之间的距离小于距离阈值,认为位置校准成功,发送消息告知另外一个设备位置校准成功;或者,音频输入设备201和音频输出设备200均根据移动蜂窝网络、WIFI、近距离通信服务确定两者之间的距离小于距离阈值,认为位置校准成功。When the audio input device 201 or the audio output device 200 uses a mobile cellular network, WIFI, or short-range communication service, it is determined whether the distance between the two is less than a distance threshold. Among them, any one of the audio input device 201 or the audio output device 200 determines that the distance between the two is less than the distance threshold according to the mobile cellular network, WIFI, and short-range communication services, considers that the position calibration is successful, and sends a message to inform the other device. Calibration is successful; or, both the audio input device 201 and the audio output device 200 determine that the distance between them is less than the distance threshold according to the mobile cellular network, WIFI, and short-range communication services, and it is considered that the position calibration is successful.
对应于步骤S905,在位置校准成功后,音频输出设备200在空间中播放信令(声波),音频输入设备201不断采集空间中的声音,并将该声音转变为数字信号后,通过网络/近距离通信服务发送给音频输出设备200。音频输出设备200确定发送信令与接收信令之间的时间差为回声时延中的处理时延。Corresponding to step S905, after the position calibration is successful, the audio output device 200 plays a signaling (sound wave) in the space, and the audio input device 201 continuously collects the sound in the space, converts the sound into a digital signal, and transmits it through the network/nearby network. The distance communication service is transmitted to the audio output device 200 . The audio output device 200 determines the time difference between sending the signaling and receiving the signaling as the processing delay in the echo delay.
对应于步骤S906,音频输出设备200建立三维空间坐标系,其中音频输出设备200与音频输入设备201的初始位置为(0,0,0)。Corresponding to step S906, the audio output device 200 establishes a three-dimensional space coordinate system, wherein the initial positions of the audio output device 200 and the audio input device 201 are (0, 0, 0).
对应于步骤S907,音频输入设备201可以周期性的计算并更新自己在三维空间坐标系中的位置(坐标),并将该信息发送给音频输出设备200。Corresponding to step S907 , the audio input device 201 may periodically calculate and update its position (coordinates) in the three-dimensional space coordinate system, and send the information to the audio output device 200 .
对应于步骤S908,音频输出设备200接收到音频输入设备201的位置信息后,结合音频输出设备200的位置,可以确定音频输入设备201和音频输出设备200之间的距离。再进一步的,根据音频输入设备201和音频输出设备200之间的距离可以确定回声时延之中的传播时延。结合之间确定的处理时延,音频输出设备200可以确定回声时延。Corresponding to step S908, after receiving the location information of the audio input device 201, the audio output device 200 can determine the distance between the audio input device 201 and the audio output device 200 in combination with the location of the audio output device 200. Still further, the propagation delay among the echo delays can be determined according to the distance between the audio input device 201 and the audio output device 200 . Combined with the processing delays determined between, the audio output device 200 can determine the echo delays.
音频输出设备200可以基于确定的回声时延不断调整回声消除模块的参数,使得可以对收到的声音信号进行回声消除。The audio output device 200 can continuously adjust the parameters of the echo cancellation module based on the determined echo delay, so that echo cancellation can be performed on the received sound signal.
其次,下面以多个音频输入设备、单个音频输出设备为例,以数据流程的形式示例性的介绍如图9所示的本申请提供的回声时延估计方法。Secondly, the following takes multiple audio input devices and a single audio output device as examples to exemplarily introduce the echo delay estimation method provided by the present application as shown in FIG. 9 in the form of a data flow.
在网络通话、视频会议等场景中,当有新设备加入通话或会议时,新设备作为音频输入设备202,可以与其他已经完成位置校准并已经开始正常工作的音频输入设备201进行位置校准,并将位置信息、处理时延21发送给音频输出设备200。音频输出设备200根据网络/近距离通信服务时延,以及接收到的位置信息、处理时延21、处理时延22确定音频输入设备202的回声时延。In scenarios such as network calls and video conferences, when a new device joins the call or conference, the new device, as the audio input device 202, can perform position calibration with other audio input devices 201 that have completed position calibration and have started to work normally, and The location information and the processing delay 21 are sent to the audio output device 200 . The audio output device 200 determines the echo delay of the audio input device 202 according to the network/near field communication service delay, the received location information, the processing delay 21 , and the processing delay 22 .
图14为本申请实施例提供的多设备下的回声时延估计方法的一个示例性示意图。FIG. 14 is an exemplary schematic diagram of an echo delay estimation method under multiple devices provided by an embodiment of the present application.
如图14所示,音频输出设备200已经确定并且不断更新音频输入设备201的回声时延。当音频输入设备202作为新设备加入会议时,可以与音频输入设备201作为基准进行位置校准。As shown in FIG. 14 , the audio output device 200 has determined and continuously updated the echo delay of the audio input device 201 . When the audio input device 202 joins the conference as a new device, the position calibration can be performed with the audio input device 201 as a reference.
对应于步骤S902、S903、S904,新的音频输入设备202加入会议/通话时,可以与音频输入设备201进行位置校准。Corresponding to steps S902 , S903 and S904 , when the new audio input device 202 joins the conference/call, it can perform position calibration with the audio input device 201 .
当位置校准完成后,音频输入设备201向音频输入设备202发送音频输入设备201的位置(坐标)信息作为音频输入设备的201的位置。When the position calibration is completed, the audio input device 201 sends the position (coordinates) information of the audio input device 201 to the audio input device 202 as the position of the audio input device 201 .
对应于步骤S907,音频输入设备202可以周期性的计算并更新自己在三维空间坐标系中的位置(坐标),并将该信息发送给音频输出设备200。Corresponding to step S907 , the audio input device 202 may periodically calculate and update its position (coordinates) in the three-dimensional space coordinate system, and send the information to the audio output device 200 .
音频输出设备200基于音频输入设备202的位置信息,确定并更新音频输入设备202的 回声时延。The audio output device 200 determines and updates the echo delay of the audio input device 202 based on the location information of the audio input device 202.
音频输入设备202发送自己的处理时延21给音频输出设备200。音频输出设备200可以确定自己的处理时延22以及可以确定由网络/近距离通信服务决定的交互时延。进而,音频输出设备200可以确定音频输入设备202的回声时延。The audio input device 202 sends its own processing delay 21 to the audio output device 200 . The audio output device 200 may determine its own processing delay 22 and may determine the interaction delay determined by the network/near field communication service. In turn, the audio output device 200 may determine the echo delay of the audio input device 202 .
对应于步骤S908,音频输出设备200可以基于音频输入设备202的回声时延调整回声消除模块的参数以滤除音频输入设备202采集声音中的回声;对应的,音频输出设备200可以基于音频输入设备201的回声时延调整回声消除模块的参数以滤除音频输入设备201采集声音中的回声。Corresponding to step S908, the audio output device 200 can adjust the parameters of the echo cancellation module based on the echo delay of the audio input device 202 to filter out the echo in the sound collected by the audio input device 202; correspondingly, the audio output device 200 can be based on the audio input device. The echo delay of 201 adjusts the parameters of the echo cancellation module to filter out the echo in the sound collected by the audio input device 201 .
再次,下面介绍本申请实施例提供的电子设备。Again, the electronic device provided by the embodiments of the present application is described below.
其中,本申请实施例中的音频输入设备201、音频输入设备202、音频输出设备200可以是下文中的电子设备100。The audio input device 201 , the audio input device 202 , and the audio output device 200 in the embodiments of the present application may be the electronic device 100 hereinafter.
图15为本申请实施例提供的电子设备100的一个示例性硬件结构示意图。FIG. 15 is a schematic diagram of an exemplary hardware structure of an electronic device 100 according to an embodiment of the present application.
电子设备100可以是手机、平板电脑、桌面型计算机、膝上型计算机、手持计算机、笔记本电脑、超级移动个人计算机(ultra-mobile personal computer,UMPC)、上网本,以及蜂窝电话、个人数字助理(personal digital assistant,PDA)、增强现实(augmented reality,AR)设备、虚拟现实(virtual reality,VR)设备、人工智能(artificial intelligence,AI)设备、可穿戴式设备、车载设备、智能家居设备和/或智慧城市设备,本申请实施例对该电子设备的具体类型不作特殊限制。The electronic device 100 may be a cell phone, tablet computer, desktop computer, laptop computer, handheld computer, notebook computer, ultra-mobile personal computer (UMPC), netbook, as well as cellular telephones, personal digital assistants (personal digital assistants) digital assistant (PDA), augmented reality (AR) devices, virtual reality (VR) devices, artificial intelligence (AI) devices, wearable devices, in-vehicle devices, smart home devices and/or Smart city equipment, the embodiments of the present application do not specifically limit the specific type of the electronic equipment.
电子设备100可以包括处理器110,外部存储器接口120,内部存储器121,通用串行总线(universal serial bus,USB)接口130,充电管理模块140,电源管理模块141,电池142,天线1,天线2,移动通信模块150,无线通信模块160,音频模块170,扬声器170A,受话器170B,麦克风170C,耳机接口170D,传感器模块180,按键190,马达191,指示器192,摄像头193,显示屏194,以及用户标识模块(subscriber identification module,SIM)卡接口195等。其中传感器模块180可以包括压力传感器180A,陀螺仪传感器180B,气压传感器180C,磁传感器180D,加速度传感器180E,距离传感器180F,接近光传感器180G,指纹传感器180H,温度传感器180J,触摸传感器180K,环境光传感器180L,骨传导传感器180M等。The electronic device 100 may include a processor 110, an external memory interface 120, an internal memory 121, a universal serial bus (USB) interface 130, a charge management module 140, a power management module 141, a battery 142, an antenna 1, an antenna 2 , mobile communication module 150, wireless communication module 160, audio module 170, speaker 170A, receiver 170B, microphone 170C, headphone jack 170D, sensor module 180, buttons 190, motor 191, indicator 192, camera 193, display screen 194, and Subscriber identification module (subscriber identification module, SIM) card interface 195 and so on. The sensor module 180 may include a pressure sensor 180A, a gyroscope sensor 180B, an air pressure sensor 180C, a magnetic sensor 180D, an acceleration sensor 180E, a distance sensor 180F, a proximity light sensor 180G, a fingerprint sensor 180H, a temperature sensor 180J, a touch sensor 180K, and ambient light. Sensor 180L, bone conduction sensor 180M, etc.
可以理解的是,本发明实施例示意的结构并不构成对电子设备100的具体限定。在本申请另一些实施例中,电子设备100可以包括比图示更多或更少的部件,或者组合某些部件,或者拆分某些部件,或者不同的部件布置。图示的部件可以以硬件,软件或软件和硬件的组合实现。It can be understood that, the structures illustrated in the embodiments of the present invention do not constitute a specific limitation on the electronic device 100 . In other embodiments of the present application, the electronic device 100 may include more or less components than shown, or combine some components, or separate some components, or arrange different components. The illustrated components may be implemented in hardware, software, or a combination of software and hardware.
处理器110可以包括一个或多个处理单元,例如:处理器110可以包括应用处理器(application processor,AP),调制解调处理器,图形处理器(graphics processing unit,GPU),图像信号处理器(image signal processor,ISP),控制器,视频编解码器,数字信号处理器(digital signal processor,DSP),基带处理器,和/或神经网络处理器(neural-network processing unit,NPU)等。其中,不同的处理单元可以是独立的器件,也可以集成在一个或多个处理器中。The processor 110 may include one or more processing units, for example, the processor 110 may include an application processor (application processor, AP), a modem processor, a graphics processor (graphics processing unit, GPU), an image signal processor (image signal processor, ISP), controller, video codec, digital signal processor (digital signal processor, DSP), baseband processor, and/or neural-network processing unit (neural-network processing unit, NPU), etc. Wherein, different processing units may be independent devices, or may be integrated in one or more processors.
控制器可以根据指令操作码和时序信号,产生操作控制信号,完成取指令和执行指令的控制。The controller can generate an operation control signal according to the instruction operation code and timing signal, and complete the control of fetching and executing instructions.
处理器110中还可以设置存储器,用于存储指令和数据。在一些实施例中,处理器110中的存储器为高速缓冲存储器。该存储器可以保存处理器110刚用过或循环使用的指令或数据。如果处理器110需要再次使用该指令或数据,可从所述存储器中直接调用。避免了重复 存取,减少了处理器110的等待时间,因而提高了系统的效率。A memory may also be provided in the processor 110 for storing instructions and data. In some embodiments, the memory in processor 110 is cache memory. This memory may hold instructions or data that have just been used or recycled by the processor 110 . If the processor 110 needs to use the instruction or data again, it can be called directly from the memory. Repeated accesses are avoided and processor 110 latency is reduced, thereby increasing the efficiency of the system.
在一些实施例中,处理器110可以包括一个或多个接口。接口可以包括集成电路(inter-integrated circuit,I2C)接口,集成电路内置音频(inter-integrated circuit sound,I2S)接口,脉冲编码调制(pulse code modulation,PCM)接口,通用异步收发传输器(universal asynchronous receiver/transmitter,UART)接口,移动产业处理器接口(mobile industry processor interface,MIPI),通用输入输出(general-purpose input/output,GPIO)接口,用户标识模块(subscriber identity module,SIM)接口,和/或通用串行总线(universal serial bus,USB)接口等。In some embodiments, the processor 110 may include one or more interfaces. The interface may include an integrated circuit (inter-integrated circuit, I2C) interface, an integrated circuit built-in audio (inter-integrated circuit sound, I2S) interface, a pulse code modulation (pulse code modulation, PCM) interface, a universal asynchronous transceiver (universal asynchronous transmitter) receiver/transmitter, UART) interface, mobile industry processor interface (MIPI), general-purpose input/output (GPIO) interface, subscriber identity module (SIM) interface, and / or universal serial bus (universal serial bus, USB) interface, etc.
I2C接口是一种双向同步串行总线,包括一根串行数据线(serial data line,SDA)和一根串行时钟线(derail clock line,SCL)。在一些实施例中,处理器110可以包含多组I2C总线。处理器110可以通过不同的I2C总线接口分别耦合触摸传感器180K,充电器,闪光灯,摄像头193等。例如:处理器110可以通过I2C接口耦合触摸传感器180K,使处理器110与触摸传感器180K通过I2C总线接口通信,实现电子设备100的触摸功能。The I2C interface is a bidirectional synchronous serial bus that includes a serial data line (SDA) and a serial clock line (SCL). In some embodiments, the processor 110 may contain multiple sets of I2C buses. The processor 110 can be respectively coupled to the touch sensor 180K, the charger, the flash, the camera 193 and the like through different I2C bus interfaces. For example, the processor 110 may couple the touch sensor 180K through the I2C interface, so that the processor 110 and the touch sensor 180K communicate with each other through the I2C bus interface, so as to realize the touch function of the electronic device 100 .
I2S接口可以用于音频通信。在一些实施例中,处理器110可以包含多组I2S总线。处理器110可以通过I2S总线与音频模块170耦合,实现处理器110与音频模块170之间的通信。在一些实施例中,音频模块170可以通过I2S接口向无线通信模块160传递音频信号,实现通过蓝牙耳机接听电话的功能。The I2S interface can be used for audio communication. In some embodiments, the processor 110 may contain multiple sets of I2S buses. The processor 110 may be coupled with the audio module 170 through an I2S bus to implement communication between the processor 110 and the audio module 170 . In some embodiments, the audio module 170 can transmit audio signals to the wireless communication module 160 through the I2S interface, so as to realize the function of answering calls through a Bluetooth headset.
PCM接口也可以用于音频通信,将模拟信号抽样,量化和编码。在一些实施例中,音频模块170与无线通信模块160可以通过PCM总线接口耦合。在一些实施例中,音频模块170也可以通过PCM接口向无线通信模块160传递音频信号,实现通过蓝牙耳机接听电话的功能。所述I2S接口和所述PCM接口都可以用于音频通信。The PCM interface can also be used for audio communications, sampling, quantizing and encoding analog signals. In some embodiments, the audio module 170 and the wireless communication module 160 may be coupled through a PCM bus interface. In some embodiments, the audio module 170 can also transmit audio signals to the wireless communication module 160 through the PCM interface, so as to realize the function of answering calls through the Bluetooth headset. Both the I2S interface and the PCM interface can be used for audio communication.
UART接口是一种通用串行数据总线,用于异步通信。该总线可以为双向通信总线。它将要传输的数据在串行通信与并行通信之间转换。在一些实施例中,UART接口通常被用于连接处理器110与无线通信模块160。例如:处理器110通过UART接口与无线通信模块160中的蓝牙模块通信,实现蓝牙功能。在一些实施例中,音频模块170可以通过UART接口向无线通信模块160传递音频信号,实现通过蓝牙耳机播放音乐的功能。The UART interface is a universal serial data bus used for asynchronous communication. The bus may be a bidirectional communication bus. It converts the data to be transmitted between serial communication and parallel communication. In some embodiments, a UART interface is typically used to connect the processor 110 with the wireless communication module 160 . For example, the processor 110 communicates with the Bluetooth module in the wireless communication module 160 through the UART interface to implement the Bluetooth function. In some embodiments, the audio module 170 can transmit audio signals to the wireless communication module 160 through the UART interface, so as to realize the function of playing music through the Bluetooth headset.
MIPI接口可以被用于连接处理器110与显示屏194,摄像头193等外围器件。MIPI接口包括摄像头串行接口(camera serial interface,CSI),显示屏串行接口(display serial interface,DSI)等。在一些实施例中,处理器110和摄像头193通过CSI接口通信,实现电子设备100的拍摄功能。处理器110和显示屏194通过DSI接口通信,实现电子设备100的显示功能。The MIPI interface can be used to connect the processor 110 with peripheral devices such as the display screen 194 and the camera 193 . MIPI interfaces include camera serial interface (CSI), display serial interface (DSI), etc. In some embodiments, the processor 110 communicates with the camera 193 through a CSI interface, so as to realize the photographing function of the electronic device 100 . The processor 110 communicates with the display screen 194 through the DSI interface to implement the display function of the electronic device 100 .
GPIO接口可以通过软件配置。GPIO接口可以被配置为控制信号,也可被配置为数据信号。在一些实施例中,GPIO接口可以用于连接处理器110与摄像头193,显示屏194,无线通信模块160,音频模块170,传感器模块180等。GPIO接口还可以被配置为I2C接口,I2S接口,UART接口,MIPI接口等。The GPIO interface can be configured by software. The GPIO interface can be configured as a control signal or as a data signal. In some embodiments, the GPIO interface may be used to connect the processor 110 with the camera 193, the display screen 194, the wireless communication module 160, the audio module 170, the sensor module 180, and the like. The GPIO interface can also be configured as I2C interface, I2S interface, UART interface, MIPI interface, etc.
USB接口130是符合USB标准规范的接口,具体可以是Mini USB接口,Micro USB接口,USB Type C接口等。USB接口130可以用于连接充电器为电子设备100充电,也可以用于电子设备100与外围设备之间传输数据。也可以用于连接耳机,通过耳机播放音频。该接口还可以用于连接其他电子设备,例如AR设备等。The USB interface 130 is an interface that conforms to the USB standard specification, and may specifically be a Mini USB interface, a Micro USB interface, a USB Type C interface, and the like. The USB interface 130 can be used to connect a charger to charge the electronic device 100, and can also be used to transmit data between the electronic device 100 and peripheral devices. It can also be used to connect headphones to play audio through the headphones. The interface can also be used to connect other electronic devices, such as AR devices.
可以理解的是,本发明实施例示意的各模块间的接口连接关系,只是示意性说明,并不构成对电子设备100的结构限定。在本申请另一些实施例中,电子设备100也可以采用上述实施例中不同的接口连接方式,或多种接口连接方式的组合。It can be understood that the interface connection relationship between the modules illustrated in the embodiment of the present invention is only a schematic illustration, and does not constitute a structural limitation of the electronic device 100 . In other embodiments of the present application, the electronic device 100 may also adopt different interface connection manners in the foregoing embodiments, or a combination of multiple interface connection manners.
充电管理模块140用于从充电器接收充电输入。The charging management module 140 is used to receive charging input from the charger.
电源管理模块141用于连接电池142,充电管理模块140与处理器110。The power management module 141 is used for connecting the battery 142 , the charging management module 140 and the processor 110 .
电子设备100的无线通信功能可以通过天线1,天线2,移动通信模块150,无线通信模块160,调制解调处理器以及基带处理器等实现。The wireless communication function of the electronic device 100 may be implemented by the antenna 1, the antenna 2, the mobile communication module 150, the wireless communication module 160, the modulation and demodulation processor, the baseband processor, and the like.
天线1和天线2用于发射和接收电磁波信号。电子设备100中的每个天线可用于覆盖单个或多个通信频带。不同的天线还可以复用,以提高天线的利用率。例如:可以将天线1复用为无线局域网的分集天线。在另外一些实施例中,天线可以和调谐开关结合使用。 Antenna 1 and Antenna 2 are used to transmit and receive electromagnetic wave signals. Each antenna in electronic device 100 may be used to cover a single or multiple communication frequency bands. Different antennas can also be reused to improve antenna utilization. For example, the antenna 1 can be multiplexed as a diversity antenna of the wireless local area network. In other embodiments, the antenna may be used in conjunction with a tuning switch.
移动通信模块150可以提供应用在电子设备100上的包括2G/3G/4G/5G等无线通信的解决方案。移动通信模块150可以包括至少一个滤波器,开关,功率放大器,低噪声放大器(low noise amplifier,LNA)等。移动通信模块150可以由天线1接收电磁波,并对接收的电磁波进行滤波,放大等处理,传送至调制解调处理器进行解调。移动通信模块150还可以对经调制解调处理器调制后的信号放大,经天线1转为电磁波辐射出去。在一些实施例中,移动通信模块150的至少部分功能模块可以被设置于处理器110中。在一些实施例中,移动通信模块150的至少部分功能模块可以与处理器110的至少部分模块被设置在同一个器件中。The mobile communication module 150 may provide wireless communication solutions including 2G/3G/4G/5G etc. applied on the electronic device 100 . The mobile communication module 150 may include at least one filter, switch, power amplifier, low noise amplifier (LNA) and the like. The mobile communication module 150 can receive electromagnetic waves from the antenna 1, filter and amplify the received electromagnetic waves, and transmit them to the modulation and demodulation processor for demodulation. The mobile communication module 150 can also amplify the signal modulated by the modulation and demodulation processor, and then turn it into an electromagnetic wave for radiation through the antenna 1 . In some embodiments, at least part of the functional modules of the mobile communication module 150 may be provided in the processor 110 . In some embodiments, at least part of the functional modules of the mobile communication module 150 may be provided in the same device as at least part of the modules of the processor 110 .
调制解调处理器可以包括调制器和解调器。其中,调制器用于将待发送的低频基带信号调制成中高频信号。解调器用于将接收的电磁波信号解调为低频基带信号。随后解调器将解调得到的低频基带信号传送至基带处理器处理。低频基带信号经基带处理器处理后,被传递给应用处理器。应用处理器通过音频设备(不限于扬声器170A,受话器170B等)输出声音信号,或通过显示屏194显示图像或视频。在一些实施例中,调制解调处理器可以是独立的器件。在另一些实施例中,调制解调处理器可以独立于处理器110,与移动通信模块150或其他功能模块设置在同一个器件中。The modem processor may include a modulator and a demodulator. Wherein, the modulator is used to modulate the low frequency baseband signal to be sent into a medium and high frequency signal. The demodulator is used to demodulate the received electromagnetic wave signal into a low frequency baseband signal. Then the demodulator transmits the demodulated low-frequency baseband signal to the baseband processor for processing. The low frequency baseband signal is processed by the baseband processor and passed to the application processor. The application processor outputs sound signals through audio devices (not limited to the speaker 170A, the receiver 170B, etc.), or displays images or videos through the display screen 194 . In some embodiments, the modem processor may be a stand-alone device. In other embodiments, the modem processor may be independent of the processor 110, and may be provided in the same device as the mobile communication module 150 or other functional modules.
无线通信模块160可以提供应用在电子设备100上的包括无线局域网(wireless local area networks,WLAN)(如无线保真(wireless fidelity,Wi-Fi)网络),蓝牙(bluetooth,BT),全球导航卫星系统(global navigation satellite system,GNSS),调频(frequency modulation,FM),近距离无线通信技术(near field communication,NFC),红外技术(infrared,IR)等无线通信的解决方案。无线通信模块160可以是集成至少一个通信处理模块的一个或多个器件。无线通信模块160经由天线2接收电磁波,将电磁波信号调频以及滤波处理,将处理后的信号发送到处理器110。无线通信模块160还可以从处理器110接收待发送的信号,对其进行调频,放大,经天线2转为电磁波辐射出去。The wireless communication module 160 can provide applications on the electronic device 100 including wireless local area networks (WLAN) (such as wireless fidelity (Wi-Fi) networks), bluetooth (BT), global navigation satellites Wireless communication solutions such as global navigation satellite system (GNSS), frequency modulation (FM), near field communication (NFC), and infrared technology (IR). The wireless communication module 160 may be one or more devices integrating at least one communication processing module. The wireless communication module 160 receives electromagnetic waves via the antenna 2 , frequency modulates and filters the electromagnetic wave signals, and sends the processed signals to the processor 110 . The wireless communication module 160 can also receive the signal to be sent from the processor 110 , perform frequency modulation on it, amplify it, and convert it into electromagnetic waves for radiation through the antenna 2 .
在一些实施例中,电子设备100的天线1和移动通信模块150耦合,天线2和无线通信模块160耦合,使得电子设备100可以通过无线通信技术与网络以及其他设备通信。所述无线通信技术可以包括全球移动通讯系统(global system for mobile communications,GSM),通用分组无线服务(general packet radio service,GPRS),码分多址接入(code division multiple access,CDMA),宽带码分多址(wideband code division multiple access,WCDMA),时分码分多址(time-division code division multiple access,TD-SCDMA),长期演进(long term evolution,LTE),BT,GNSS,WLAN,NFC,FM,和/或IR技术等。所述GNSS可以包括全球卫星定位系统(global positioning system,GPS),全球导航卫星系统(global navigation satellite system,GLONASS),北斗卫星导航系统(beidou navigation satellite system,BDS),准天顶卫星系统(quasi-zenith satellite system,QZSS)和/或星基增强系统(satellite based augmentation systems,SBAS)。In some embodiments, the antenna 1 of the electronic device 100 is coupled with the mobile communication module 150, and the antenna 2 is coupled with the wireless communication module 160, so that the electronic device 100 can communicate with the network and other devices through wireless communication technology. The wireless communication technology may include global system for mobile communications (GSM), general packet radio service (GPRS), code division multiple access (CDMA), broadband Code Division Multiple Access (WCDMA), Time Division Code Division Multiple Access (TD-SCDMA), Long Term Evolution (LTE), BT, GNSS, WLAN, NFC , FM, and/or IR technology, etc. The GNSS may include global positioning system (global positioning system, GPS), global navigation satellite system (global navigation satellite system, GLONASS), Beidou navigation satellite system (beidou navigation satellite system, BDS), quasi-zenith satellite system (quasi -zenith satellite system, QZSS) and/or satellite based augmentation systems (SBAS).
电子设备100通过GPU,显示屏194,以及应用处理器等实现显示功能。GPU为图像处理的微处理器,连接显示屏194和应用处理器。GPU用于执行数学和几何计算,用于图形渲 染。处理器110可包括一个或多个GPU,其执行程序指令以生成或改变显示信息。The electronic device 100 implements a display function through a GPU, a display screen 194, an application processor, and the like. The GPU is a microprocessor for image processing, and is connected to the display screen 194 and the application processor. The GPU is used to perform mathematical and geometric calculations for graphics rendering. Processor 110 may include one or more GPUs that execute program instructions to generate or alter display information.
显示屏194用于显示图像,视频等。Display screen 194 is used to display images, videos, and the like.
电子设备100可以通过ISP,摄像头193,视频编解码器,GPU,显示屏194以及应用处理器等实现拍摄功能。The electronic device 100 may implement a shooting function through an ISP, a camera 193, a video codec, a GPU, a display screen 194, an application processor, and the like.
ISP用于处理摄像头193反馈的数据。例如,拍照时,打开快门,光线通过镜头被传递到摄像头感光元件上,光信号转换为电信号,摄像头感光元件将所述电信号传递给ISP处理,转化为肉眼可见的图像。The ISP is used to process the data fed back by the camera 193 . For example, when taking a photo, the shutter is opened, the light is transmitted to the camera photosensitive element through the lens, the light signal is converted into an electrical signal, and the camera photosensitive element transmits the electrical signal to the ISP for processing, and converts it into an image visible to the naked eye.
摄像头193用于捕获静态图像或视频。物体通过镜头生成光学图像投射到感光元件。感光元件可以是电荷耦合器件(charge coupled device,CCD)或互补金属氧化物半导体(complementary metal-oxide-semiconductor,CMOS)光电晶体管。感光元件把光信号转换成电信号,之后将电信号传递给ISP转换成数字图像信号。ISP将数字图像信号输出到DSP加工处理。DSP将数字图像信号转换成标准的RGB,YUV等格式的图像信号。在一些实施例中,电子设备100可以包括1个或N个摄像头193,N为大于1的正整数。Camera 193 is used to capture still images or video. The object is projected through the lens to generate an optical image onto the photosensitive element. The photosensitive element may be a charge coupled device (CCD) or a complementary metal-oxide-semiconductor (CMOS) phototransistor. The photosensitive element converts the optical signal into an electrical signal, and then transmits the electrical signal to the ISP to convert it into a digital image signal. The ISP outputs the digital image signal to the DSP for processing. DSP converts digital image signals into standard RGB, YUV and other formats of image signals. In some embodiments, the electronic device 100 may include 1 or N cameras 193 , where N is a positive integer greater than 1.
数字信号处理器用于处理数字信号,除了可以处理数字图像信号,还可以处理其他数字信号。例如,当电子设备100在频点选择时,数字信号处理器用于对频点能量进行傅里叶变换等。A digital signal processor is used to process digital signals, in addition to processing digital image signals, it can also process other digital signals. For example, when the electronic device 100 selects a frequency point, the digital signal processor is used to perform Fourier transform on the frequency point energy and so on.
视频编解码器用于对数字视频压缩或解压缩。电子设备100可以支持一种或多种视频编解码器。这样,电子设备100可以播放或录制多种编码格式的视频,例如:动态图像专家组(moving picture experts group,MPEG)1,MPEG2,MPEG3,MPEG4等。Video codecs are used to compress or decompress digital video. The electronic device 100 may support one or more video codecs. In this way, the electronic device 100 can play or record videos of various encoding formats, such as: Moving Picture Experts Group (moving picture experts group, MPEG) 1, MPEG2, MPEG3, MPEG4 and so on.
NPU为神经网络(neural-network,NN)计算处理器,通过借鉴生物神经网络结构,例如借鉴人脑神经元之间传递模式,对输入信息快速处理,还可以不断的自学习。通过NPU可以实现电子设备100的智能认知等应用,例如:图像识别,人脸识别,语音识别,文本理解等。The NPU is a neural-network (NN) computing processor. By drawing on the structure of biological neural networks, such as the transfer mode between neurons in the human brain, it can quickly process the input information, and can continuously learn by itself. Applications such as intelligent cognition of the electronic device 100 can be implemented through the NPU, such as image recognition, face recognition, speech recognition, text understanding, and the like.
内部存储器121可以包括一个或多个随机存取存储器(random access memory,RAM)和一个或多个非易失性存储器(non-volatile memory,NVM)。The internal memory 121 may include one or more random access memories (RAM) and one or more non-volatile memories (NVM).
随机存取存储器可以包括静态随机存储器(static random-access memory,SRAM)、动态随机存储器(dynamic random access memory,DRAM)、同步动态随机存储器(synchronous dynamic random access memory,SDRAM)、双倍资料率同步动态随机存取存储器(double data rate synchronous dynamic random access memory,DDR SDRAM,例如第五代DDR SDRAM一般称为DDR5 SDRAM)等;Random access memory can include static random-access memory (SRAM), dynamic random access memory (DRAM), synchronous dynamic random access memory (SDRAM), double data rate synchronization Dynamic random access memory (double data rate synchronous dynamic random access memory, DDR SDRAM, such as the fifth generation DDR SDRAM is generally called DDR5 SDRAM), etc.;
非易失性存储器可以包括磁盘存储器件、快闪存储器(flash memory)。Non-volatile memory may include magnetic disk storage devices, flash memory.
快闪存储器按照运作原理划分可以包括NOR FLASH、NAND FLASH、3D NAND FLASH等,按照存储单元电位阶数划分可以包括单阶存储单元(single-level cell,SLC)、多阶存储单元(multi-level cell,MLC)、三阶储存单元(triple-level cell,TLC)、四阶储存单元(quad-level cell,QLC)等,按照存储规范划分可以包括通用闪存存储(英文:universal flash storage,UFS)、嵌入式多媒体存储卡(embedded multi media Card,eMMC)等。Flash memory can be divided into NOR FLASH, NAND FLASH, 3D NAND FLASH, etc. according to the operating principle, and can include single-level memory cell (SLC), multi-level memory cell (multi-level memory cell, SLC) according to the level of storage cell potential. cell, MLC), triple-level cell (TLC), quad-level cell (QLC), etc., according to the storage specification can include universal flash storage (English: universal flash storage, UFS) , embedded multimedia memory card (embedded multi media Card, eMMC) and so on.
随机存取存储器可以由处理器110直接进行读写,可以用于存储操作系统或其他正在运行中的程序的可执行程序(例如机器指令),还可以用于存储用户及应用程序的数据等。The random access memory can be directly read and written by the processor 110, and can be used to store executable programs (eg, machine instructions) of an operating system or other running programs, and can also be used to store data of users and application programs.
非易失性存储器也可以存储可执行程序和存储用户及应用程序的数据等,可以提前加载到随机存取存储器中,用于处理器110直接进行读写。The non-volatile memory can also store executable programs and store data of user and application programs, etc., and can be loaded into the random access memory in advance for the processor 110 to directly read and write.
外部存储器接口120可以用于连接外部的非易失性存储器,实现扩展电子设备100的存储能力。外部的非易失性存储器通过外部存储器接口120与处理器110通信,实现数据存储 功能。例如将音乐,视频等文件保存在外部的非易失性存储器中。The external memory interface 120 can be used to connect an external non-volatile memory, so as to expand the storage capacity of the electronic device 100 . The external non-volatile memory communicates with the processor 110 through the external memory interface 120 to realize the data storage function. For example, save music, video, etc. files in external non-volatile memory.
电子设备100可以通过音频模块170,扬声器170A,受话器170B,麦克风170C,耳机接口170D,以及应用处理器等实现音频功能。例如音乐播放,录音等。The electronic device 100 may implement audio functions through an audio module 170, a speaker 170A, a receiver 170B, a microphone 170C, an earphone interface 170D, an application processor, and the like. Such as music playback, recording, etc.
音频模块170用于将数字音频信息转换成模拟音频信号输出,也用于将模拟音频输入转换为数字音频信号。音频模块170还可以用于对音频信号编码和解码。在一些实施例中,音频模块170可以设置于处理器110中,或将音频模块170的部分功能模块设置于处理器110中。The audio module 170 is used for converting digital audio information into analog audio signal output, and also for converting analog audio input into digital audio signal. Audio module 170 may also be used to encode and decode audio signals. In some embodiments, the audio module 170 may be provided in the processor 110 , or some functional modules of the audio module 170 may be provided in the processor 110 .
扬声器170A,也称“喇叭”,用于将音频电信号转换为声音信号。电子设备100可以通过扬声器170A收听音乐,或收听免提通话。Speaker 170A, also referred to as a "speaker", is used to convert audio electrical signals into sound signals. The electronic device 100 can listen to music through the speaker 170A, or listen to a hands-free call.
受话器170B,也称“听筒”,用于将音频电信号转换成声音信号。当电子设备100接听电话或语音信息时,可以通过将受话器170B靠近人耳接听语音。The receiver 170B, also referred to as "earpiece", is used to convert audio electrical signals into sound signals. When the electronic device 100 answers a call or a voice message, the voice can be answered by placing the receiver 170B close to the human ear.
麦克风170C,也称“话筒”,“传声器”,用于将声音信号转换为电信号。当拨打电话或发送语音信息时,用户可以通过人嘴靠近麦克风170C发声,将声音信号输入到麦克风170C。电子设备100可以设置至少一个麦克风170C。在另一些实施例中,电子设备100可以设置两个麦克风170C,除了采集声音信号,还可以实现降噪功能。在另一些实施例中,电子设备100还可以设置三个,四个或更多麦克风170C,实现采集声音信号,降噪,还可以识别声音来源,实现定向录音功能等。The microphone 170C, also called "microphone" or "microphone", is used to convert sound signals into electrical signals. When making a call or sending a voice message, the user can make a sound by approaching the microphone 170C through a human mouth, and input the sound signal into the microphone 170C. The electronic device 100 may be provided with at least one microphone 170C. In other embodiments, the electronic device 100 may be provided with two microphones 170C, which can implement a noise reduction function in addition to collecting sound signals. In other embodiments, the electronic device 100 may further be provided with three, four or more microphones 170C to collect sound signals, reduce noise, identify sound sources, and implement directional recording functions.
耳机接口170D用于连接有线耳机。耳机接口170D可以是USB接口130,也可以是3.5mm的开放移动电子设备平台(open mobile terminal platform,OMTP)标准接口,美国蜂窝电信工业协会(cellular telecommunications industry association of the USA,CTIA)标准接口。The earphone jack 170D is used to connect wired earphones. The earphone interface 170D may be the USB interface 130, or may be a 3.5mm open mobile terminal platform (OMTP) standard interface, a cellular telecommunications industry association of the USA (CTIA) standard interface.
压力传感器180A用于感受压力信号,可以将压力信号转换成电信号。在一些实施例中,压力传感器180A可以设置于显示屏194。压力传感器180A的种类很多,如电阻式压力传感器,电感式压力传感器,电容式压力传感器等。电容式压力传感器可以是包括至少两个具有导电材料的平行板。当有力作用于压力传感器180A,电极之间的电容改变。电子设备100根据电容的变化确定压力的强度。当有触摸操作作用于显示屏194,电子设备100根据压力传感器180A检测所述触摸操作强度。电子设备100也可以根据压力传感器180A的检测信号计算触摸的位置。在一些实施例中,作用于相同触摸位置,但不同触摸操作强度的触摸操作,可以对应不同的操作指令。例如:当有触摸操作强度小于第一压力阈值的触摸操作作用于短消息应用图标时,执行查看短消息的指令。当有触摸操作强度大于或等于第一压力阈值的触摸操作作用于短消息应用图标时,执行新建短消息的指令。The pressure sensor 180A is used to sense pressure signals, and can convert the pressure signals into electrical signals. In some embodiments, the pressure sensor 180A may be provided on the display screen 194 . There are many types of pressure sensors 180A, such as resistive pressure sensors, inductive pressure sensors, capacitive pressure sensors, and the like. The capacitive pressure sensor may be comprised of at least two parallel plates of conductive material. When a force is applied to the pressure sensor 180A, the capacitance between the electrodes changes. The electronic device 100 determines the intensity of the pressure according to the change in capacitance. When a touch operation acts on the display screen 194, the electronic device 100 detects the intensity of the touch operation according to the pressure sensor 180A. The electronic device 100 may also calculate the touched position according to the detection signal of the pressure sensor 180A. In some embodiments, touch operations acting on the same touch position but with different touch operation intensities may correspond to different operation instructions. For example, when a touch operation whose intensity is less than the first pressure threshold acts on the short message application icon, the instruction for viewing the short message is executed. When a touch operation with a touch operation intensity greater than or equal to the first pressure threshold acts on the short message application icon, the instruction to create a new short message is executed.
陀螺仪传感器180B可以用于确定电子设备100的运动姿态。在一些实施例中,可以通过陀螺仪传感器180B确定电子设备100围绕三个轴(即,x,y和z轴)的角速度。陀螺仪传感器180B可以用于拍摄防抖。示例性的,当按下快门,陀螺仪传感器180B检测电子设备100抖动的角度,根据角度计算出镜头模组需要补偿的距离,让镜头通过反向运动抵消电子设备100的抖动,实现防抖。陀螺仪传感器180B还可以用于导航,体感游戏场景。The gyro sensor 180B may be used to determine the motion attitude of the electronic device 100 . In some embodiments, the angular velocity of electronic device 100 about three axes (ie, x, y, and z axes) may be determined by gyro sensor 180B. The gyro sensor 180B can be used for image stabilization. Exemplarily, when the shutter is pressed, the gyro sensor 180B detects the shaking angle of the electronic device 100, calculates the distance that the lens module needs to compensate according to the angle, and allows the lens to offset the shaking of the electronic device 100 through reverse motion to achieve anti-shake. The gyro sensor 180B can also be used for navigation and somatosensory game scenarios.
气压传感器180C用于测量气压。在一些实施例中,电子设备100通过气压传感器180C测得的气压值计算海拔高度,辅助定位和导航。The air pressure sensor 180C is used to measure air pressure. In some embodiments, the electronic device 100 calculates the altitude through the air pressure value measured by the air pressure sensor 180C to assist in positioning and navigation.
磁传感器180D包括霍尔传感器。电子设备100可以利用磁传感器180D检测翻盖皮套的开合。在一些实施例中,当电子设备100是翻盖机时,电子设备100可以根据磁传感器180D检测翻盖的开合。进而根据检测到的皮套的开合状态或翻盖的开合状态,设置翻盖自动解锁等特性。The magnetic sensor 180D includes a Hall sensor. The electronic device 100 can detect the opening and closing of the flip holster using the magnetic sensor 180D. In some embodiments, when the electronic device 100 is a flip machine, the electronic device 100 can detect the opening and closing of the flip according to the magnetic sensor 180D. Further, according to the detected opening and closing state of the leather case or the opening and closing state of the flip cover, characteristics such as automatic unlocking of the flip cover are set.
加速度传感器180E可检测电子设备100在各个方向上(一般为三轴)加速度的大小。当电子设备100静止时可检测出重力的大小及方向。还可以用于识别电子设备姿态,应用于横竖屏切换,计步器等应用。The acceleration sensor 180E can detect the magnitude of the acceleration of the electronic device 100 in various directions (generally three axes). The magnitude and direction of gravity can be detected when the electronic device 100 is stationary. It can also be used to identify the posture of electronic devices, and can be used in applications such as horizontal and vertical screen switching, pedometers, etc.
距离传感器180F,用于测量距离。电子设备100可以通过红外或激光测量距离。在一些实施例中,拍摄场景,电子设备100可以利用距离传感器180F测距以实现快速对焦。Distance sensor 180F for measuring distance. The electronic device 100 can measure the distance through infrared or laser. In some embodiments, when shooting a scene, the electronic device 100 can use the distance sensor 180F to measure the distance to achieve fast focusing.
接近光传感器180G可以包括例如发光二极管(LED)和光检测器,例如光电二极管。发光二极管可以是红外发光二极管。电子设备100通过发光二极管向外发射红外光。电子设备100使用光电二极管检测来自附近物体的红外反射光。当检测到充分的反射光时,可以确定电子设备100附近有物体。当检测到不充分的反射光时,电子设备100可以确定电子设备100附近没有物体。电子设备100可以利用接近光传感器180G检测用户手持电子设备100贴近耳朵通话,以便自动熄灭屏幕达到省电的目的。接近光传感器180G也可用于皮套模式,口袋模式自动解锁与锁屏。Proximity light sensor 180G may include, for example, light emitting diodes (LEDs) and light detectors, such as photodiodes. The light emitting diodes may be infrared light emitting diodes. The electronic device 100 emits infrared light to the outside through the light emitting diode. Electronic device 100 uses photodiodes to detect infrared reflected light from nearby objects. When sufficient reflected light is detected, it can be determined that there is an object near the electronic device 100 . When insufficient reflected light is detected, the electronic device 100 may determine that there is no object near the electronic device 100 . The electronic device 100 can use the proximity light sensor 180G to detect that the user holds the electronic device 100 close to the ear to talk, so as to automatically turn off the screen to save power. Proximity light sensor 180G can also be used in holster mode, pocket mode automatically unlocks and locks the screen.
环境光传感器180L用于感知环境光亮度。指纹传感器180H用于采集指纹。The ambient light sensor 180L is used to sense ambient light brightness. The fingerprint sensor 180H is used to collect fingerprints.
温度传感器180J用于检测温度。在一些实施例中,电子设备100利用温度传感器180J检测的温度,执行温度处理策略。The temperature sensor 180J is used to detect the temperature. In some embodiments, the electronic device 100 uses the temperature detected by the temperature sensor 180J to execute a temperature processing strategy.
触摸传感器180K,也称“触控器件”。触摸传感器180K可以设置于显示屏194,由触摸传感器180K与显示屏194组成触摸屏,也称“触控屏”。触摸传感器180K用于检测作用于其上或附近的触摸操作。触摸传感器可以将检测到的触摸操作传递给应用处理器,以确定触摸事件类型。可以通过显示屏194提供与触摸操作相关的视觉输出。在另一些实施例中,触摸传感器180K也可以设置于电子设备100的表面,与显示屏194所处的位置不同。Touch sensor 180K, also called "touch device". The touch sensor 180K may be disposed on the display screen 194 , and the touch sensor 180K and the display screen 194 form a touch screen, also called a “touch screen”. The touch sensor 180K is used to detect a touch operation on or near it. The touch sensor can pass the detected touch operation to the application processor to determine the type of touch event. Visual output related to touch operations may be provided through display screen 194 . In other embodiments, the touch sensor 180K may also be disposed on the surface of the electronic device 100 , which is different from the location where the display screen 194 is located.
骨传导传感器180M可以获取振动信号。The bone conduction sensor 180M can acquire vibration signals.
按键190包括开机键,音量键等。按键190可以是机械按键。也可以是触摸式按键。电子设备100可以接收按键输入,产生与电子设备100的用户设置以及功能控制有关的键信号输入。The keys 190 include a power-on key, a volume key, and the like. Keys 190 may be mechanical keys. It can also be a touch key. The electronic device 100 may receive key inputs and generate key signal inputs related to user settings and function control of the electronic device 100 .
马达191可以产生振动提示。Motor 191 can generate vibrating cues.
指示器192可以是指示灯,可以用于指示充电状态,电量变化,也可以用于指示消息,未接来电,通知等。The indicator 192 can be an indicator light, which can be used to indicate the charging state, the change of the power, and can also be used to indicate a message, a missed call, a notification, and the like.
SIM卡接口195用于连接SIM卡。The SIM card interface 195 is used to connect a SIM card.
在本申请的一些实施例中,在步骤S903和步骤S904中,音频输入设备201/音频输出设备200可以使用距离传感器180F,确定音频输入设备201与音频输出设备200之间的距离。In some embodiments of the present application, in steps S903 and S904, the audio input device 201/audio output device 200 may use the distance sensor 180F to determine the distance between the audio input device 201 and the audio output device 200.
在本申请的一些实施例中,在步骤S903和步骤S904中,音频输入设备201/音频输出设备200可以使用无线通信模块160、移动通信模块150,确定音频输入设备201与音频输出设备200之间的距离。In some embodiments of the present application, in steps S903 and S904, the audio input device 201/audio output device 200 may use the wireless communication module 160 and the mobile communication module 150 to determine the distance between the audio input device 201 and the audio output device 200 the distance.
在本申请的一些实施例中,在步骤S905中,在确定音频输入设备201与音频输出设备200之间的距离后,可以根据气压传感器180C、温度传感器180J等获取当前空间中温度、气压等信息,进而更准确的确定声速,进而更准确的确定回声时延中的传播时延。In some embodiments of the present application, in step S905, after determining the distance between the audio input device 201 and the audio output device 200, information such as temperature and air pressure in the current space can be obtained according to the air pressure sensor 180C, the temperature sensor 180J, etc. , and then more accurately determine the speed of sound, and then more accurately determine the propagation delay in the echo delay.
值得说明的是,音频输入设备可以没有受话器170B;音频输出设备可以没有麦克风170C。It should be noted that the audio input device may not have the receiver 170B; the audio output device may not have the microphone 170C.
电子设备100的软件系统可以采用分层架构,事件驱动架构,微核架构,微服务架构,或云架构。本发明实施例以分层架构的Android系统为例,示例性说明电子设备100的软件结构。The software system of the electronic device 100 may adopt a layered architecture, an event-driven architecture, a microkernel architecture, a microservice architecture, or a cloud architecture. The embodiment of the present invention takes an Android system with a layered architecture as an example to illustrate the software structure of the electronic device 100 as an example.
图16为本申请实施例提供的电子设备100的一个示例性软件结构示意图。FIG. 16 is a schematic diagram of an exemplary software structure of an electronic device 100 according to an embodiment of the present application.
分层架构将软件分成若干个层,每一层都有清晰的角色和分工。层与层之间通过软件接口通信。在一些实施例中,将Android系统分为四层,从上至下分别为应用程序层,应用程序框架层,安卓运行时(Android runtime)和系统库,以及内核层。The layered architecture divides the software into several layers, and each layer has a clear role and division of labor. Layers communicate with each other through software interfaces. In some embodiments, the Android system is divided into four layers, which are, from top to bottom, an application layer, an application framework layer, an Android runtime (Android runtime) and a system library, and a kernel layer.
应用程序层可以包括一系列应用程序包。The application layer can include a series of application packages.
如图16所示,应用程序包可以包括相机,图库,日历,通话,地图,导航,WLAN,蓝牙,音乐,视频,短信息等应用程序。应用程序包还可以包括会议/通话类应用,如WeLink。As shown in Figure 16, the application package can include applications such as camera, gallery, calendar, call, map, navigation, WLAN, Bluetooth, music, video, short message and so on. Application bundles can also include meeting/calling applications such as WeLink.
应用程序框架层为应用程序层的应用程序提供应用编程接口(application programming interface,API)和编程框架。应用程序框架层包括一些预先定义的函数。例如,函数可以是用于确定音频输入设备201与音频输出设备200的距离的方法;或者,函数可以是用于建立三维空间坐标系的方法;或者,函数可以是向其他设备发送运动信息、位置信息或距离信息等的方法。The application framework layer provides an application programming interface (application programming interface, API) and a programming framework for applications in the application layer. The application framework layer includes some predefined functions. For example, the function may be a method for determining the distance between the audio input device 201 and the audio output device 200; or, the function may be a method for establishing a three-dimensional space coordinate system; or, the function may be to send motion information, position to other devices information or distance information, etc.
如图16所示,应用程序框架层可以包括窗口管理器,内容提供器,视图系统,电话管理器,资源管理器,通知管理器等。As shown in Figure 16, the application framework layer may include window managers, content providers, view systems, telephony managers, resource managers, notification managers, and the like.
窗口管理器用于管理窗口程序。窗口管理器可以获取显示屏大小,判断是否有状态栏,锁定屏幕,截取屏幕等。A window manager is used to manage window programs. The window manager can get the size of the display screen, determine whether there is a status bar, lock the screen, take screenshots, etc.
内容提供器用来存放和获取数据,并使这些数据可以被应用程序访问。所述数据可以包括视频,图像,音频,拨打和接听的电话,浏览历史和书签,电话簿等。Content providers are used to store and retrieve data and make these data accessible to applications. The data may include video, images, audio, calls made and received, browsing history and bookmarks, phone book, etc.
视图系统包括可视控件,例如显示文字的控件,显示图片的控件等。视图系统可用于构建应用程序。显示界面可以由一个或多个视图组成的。例如,包括短信通知图标的显示界面,可以包括显示文字的视图以及显示图片的视图,可以包括显示如图7中内容所示的界面。The view system includes visual controls, such as controls for displaying text, controls for displaying pictures, and so on. View systems can be used to build applications. A display interface can consist of one or more views. For example, the display interface including the short message notification icon may include a view for displaying text and a view for displaying pictures, and may include displaying an interface as shown in FIG. 7 .
电话管理器用于提供电子设备100的通信功能。例如通话状态的管理(包括接通,挂断等)。The phone manager is used to provide the communication function of the electronic device 100 . For example, the management of call status (including connecting, hanging up, etc.).
资源管理器为应用程序提供各种资源,比如本地化字符串,图标,图片,布局文件,视频文件等等。The resource manager provides various resources for the application, such as localization strings, icons, pictures, layout files, video files and so on.
通知管理器使应用程序可以在状态栏中显示通知信息,可以用于传达告知类型的消息,可以短暂停留后自动消失,无需用户交互。比如通知管理器被用于告知下载完成,消息提醒等。通知管理器还可以是以图表或者滚动条文本形式出现在系统顶部状态栏的通知,例如后台运行的应用程序的通知,还可以是以对话窗口形式出现在屏幕上的通知。例如在状态栏提示文本信息,发出提示音,电子设备振动,指示灯闪烁等。The notification manager enables applications to display notification information in the status bar, which can be used to convey notification-type messages, and can disappear automatically after a brief pause without user interaction. For example, the notification manager is used to notify download completion, message reminders, etc. The notification manager can also display notifications in the status bar at the top of the system in the form of graphs or scroll bar text, such as notifications of applications running in the background, and notifications on the screen in the form of dialog windows. For example, text information is prompted in the status bar, a prompt sound is issued, the electronic device vibrates, and the indicator light flashes.
Android Runtime包括核心库和虚拟机。Android runtime负责安卓系统的调度和管理。Android Runtime includes core libraries and a virtual machine. Android runtime is responsible for scheduling and management of the Android system.
核心库包含两部分:一部分是java语言需要调用的功能函数,另一部分是安卓的核心库。The core library consists of two parts: one is the function functions that the java language needs to call, and the other is the core library of Android.
应用程序层和应用程序框架层运行在虚拟机中。虚拟机将应用程序层和应用程序框架层的java文件执行为二进制文件。虚拟机用于执行对象生命周期的管理,堆栈管理,线程管理,安全和异常的管理,以及垃圾回收等功能。The application layer and the application framework layer run in virtual machines. The virtual machine executes the java files of the application layer and the application framework layer as binary files. The virtual machine is used to perform functions such as object lifecycle management, stack management, thread management, safety and exception management, and garbage collection.
系统库可以包括多个功能模块。例如:表面管理器(surface manager),媒体库(Media Libraries),三维图形处理库(例如:OpenGL ES),2D图形引擎(例如:SGL)等。A system library can include multiple functional modules. For example: surface manager (surface manager), media library (Media Libraries), 3D graphics processing library (eg: OpenGL ES), 2D graphics engine (eg: SGL), etc.
表面管理器用于对显示子系统进行管理,并且为多个应用程序提供了2D和3D图层的融合。The Surface Manager is used to manage the display subsystem and provides a fusion of 2D and 3D layers for multiple applications.
媒体库支持多种常用的音频,视频格式回放和录制,以及静态图像文件等。媒体库可以支持多种音视频编码格式,例如:MPEG4,H.264,MP3,AAC,AMR,JPG,PNG等。The media library supports playback and recording of a variety of commonly used audio and video formats, as well as still image files. The media library can support a variety of audio and video encoding formats, such as: MPEG4, H.264, MP3, AAC, AMR, JPG, PNG, etc.
三维图形处理库用于实现三维图形绘图,图像渲染,合成,和图层处理等。The 3D graphics processing library is used to implement 3D graphics drawing, image rendering, compositing, and layer processing.
2D图形引擎是2D绘图的绘图引擎。2D graphics engine is a drawing engine for 2D drawing.
内核层是硬件和软件之间的层。内核层至少包含显示驱动,摄像头驱动,音频驱动,传感器驱动。其中,应用程序可以通过应用程序的框架层调用内核层的驱动,例如,调用音频驱动播放信令,调用传感器驱动确定与其他设备的距离等,调用传感器驱动确定自己的位置等。The kernel layer is the layer between hardware and software. The kernel layer contains at least display drivers, camera drivers, audio drivers, and sensor drivers. Among them, the application can call the driver of the kernel layer through the framework layer of the application, for example, call the audio driver to play signaling, call the sensor driver to determine the distance from other devices, etc., call the sensor driver to determine its own position, etc.
上述实施例中所用,根据上下文,术语“当…时”可以被解释为意思是“如果…”或“在…后”或“响应于确定…”或“响应于检测到…”。类似地,根据上下文,短语“在确定…时”或“如果检测到(所陈述的条件或事件)”可以被解释为意思是“如果确定…”或“响应于确定…”或“在检测到(所陈述的条件或事件)时”或“响应于检测到(所陈述的条件或事件)”。As used in the above embodiments, the term "when" may be interpreted to mean "if" or "after" or "in response to determining..." or "in response to detecting..." depending on the context. Similarly, depending on the context, the phrases "in determining..." or "if detecting (the stated condition or event)" can be interpreted to mean "if determining..." or "in response to determining..." or "on detecting (the stated condition or event)" or "in response to the detection of (the stated condition or event)".
在上述实施例中,可以全部或部分地通过软件、硬件、固件或者其任意组合来实现。当使用软件实现时,可以全部或部分地以计算机程序产品的形式实现。该计算机程序产品包括一个或多个计算机指令。在计算机上加载和执行该计算机程序指令时,全部或部分地产生按照本申请实施例该的流程或功能。该计算机可以是通用计算机、专用计算机、计算机网络、或者其他可编程装置。该计算机指令可以存储在计算机可读存储介质中,或者从一个计算机可读存储介质向另一个计算机可读存储介质传输,例如,该计算机指令可以从一个网站站点、计算机、服务器或数据中心通过有线(例如同轴电缆、光纤、数字用户线)或无线(例如红外、无线、微波等)方式向另一个网站站点、计算机、服务器或数据中心进行传输。该计算机可读存储介质可以是计算机能够存取的任何可用介质或者是包含一个或多个可用介质集成的服务器、数据中心等数据存储设备。该可用介质可以是磁性介质,(例如,软盘、硬盘、磁带)、光介质(例如DVD)、或者半导体介质(例如固态硬盘)等。In the above-mentioned embodiments, it may be implemented in whole or in part by software, hardware, firmware or any combination thereof. When implemented in software, it can be implemented in whole or in part in the form of a computer program product. The computer program product includes one or more computer instructions. When the computer program instructions are loaded and executed on a computer, all or part of the processes or functions according to the embodiments of the present application are generated. The computer may be a general purpose computer, a special purpose computer, a computer network, or other programmable device. The computer instructions may be stored on or transmitted from one computer readable storage medium to another computer readable storage medium, for example, the computer instructions may be transmitted over a wire from a website site, computer, server or data center (eg coaxial cable, optical fiber, digital subscriber line) or wireless (eg infrared, wireless, microwave, etc.) to another website site, computer, server or data center. The computer-readable storage medium can be any available medium that can be accessed by a computer or a data storage device such as a server, data center, etc. that includes an integration of one or more available media. The available media may be magnetic media (eg, floppy disks, hard disks, magnetic tapes), optical media (eg, DVDs), or semiconductor media (eg, solid state drives), and the like.
本领域普通技术人员可以理解实现上述实施例方法中的全部或部分流程,该流程可以由计算机程序来指令相关的硬件完成,该程序可存储于计算机可读取存储介质中,该程序在执行时,可包括如上述各方法实施例的流程。而前述的存储介质包括:ROM或随机存储记忆体RAM、磁碟或者光盘等各种可存储程序代码的介质。Those of ordinary skill in the art can understand that all or part of the processes in the methods of the above embodiments can be implemented. The process can be completed by instructing the relevant hardware by a computer program, and the program can be stored in a computer-readable storage medium. When the program is executed , which may include the processes of the foregoing method embodiments. The aforementioned storage medium includes: ROM or random storage memory RAM, magnetic disk or optical disk and other mediums that can store program codes.

Claims (18)

  1. 一种回声时延估计方法,其特征在于,包括:A method for echo delay estimation, comprising:
    在第一时刻,第一电子设备确定第一距离,所述第一距离为在第一时刻时所述第一电子设备与第二电子设备之间的距离;At the first moment, the first electronic device determines a first distance, where the first distance is the distance between the first electronic device and the second electronic device at the first moment;
    所述第一电子设备基于所述第一距离和声速确定第一传播时延,所述第一传播时延为在第一时刻时,声音信号从所述第一电子设备传播到第二电子设备之间的时延;The first electronic device determines a first propagation delay based on the first distance and the speed of sound, where the first propagation delay is when the sound signal propagates from the first electronic device to the second electronic device at a first moment delay between;
    所述第一电子设备确定所述第一传播时延为第一回声时延,或者,所述第一电子设备确定所述第一传播时延与第一处理时延的和为所述第一回声时延,所述第一处理时延为所述第一电子设备从获取语音信号到播放语音信号的时延与第二电子设备获取语音信号到第二电子设备转发语音信号至第一电子设备上时延的和。The first electronic device determines that the first propagation delay is the first echo delay, or the first electronic device determines that the sum of the first propagation delay and the first processing delay is the first echo delay Echo delay, the first processing delay is the delay from acquiring the voice signal to playing the voice signal by the first electronic device and the second electronic device acquiring the voice signal to the second electronic device and forwarding the voice signal to the first electronic device. the sum of the upper delays.
  2. 根据权利要求1所述的方法,其特征在于,所述方法还包括:The method according to claim 1, wherein the method further comprises:
    在第一时刻后的第二时刻,所述第一电子设备确定第二距离,所述第二距离为在第二时刻时所述第一电子设备与第二电子设备之间的距离;At a second time after the first time, the first electronic device determines a second distance, where the second distance is the distance between the first electronic device and the second electronic device at the second time;
    所述第一电子设备基于所述第二距离和声速确定第二传播时延,所述第二传播时延为在第二时刻时,声音信号从所述第一电子设备传输到第二电子设备之间的时延;The first electronic device determines a second propagation delay based on the second distance and the speed of sound, where the second propagation delay is when the sound signal is transmitted from the first electronic device to the second electronic device at a second moment delay between;
    所述第一电子设备确定所述第二传播时延为第二回声时延,或者所述第一电子设备确定所述第二传播时延与所述第一处理时延的和为所述第二回声时延;The first electronic device determines that the second propagation delay is the second echo delay, or the first electronic device determines that the sum of the second propagation delay and the first processing delay is the first Second echo delay;
    所述第二回声时延与所述第一回声时延不同。The second echo delay is different from the first echo delay.
  3. 根据权利要求1或2所述的方法,其特征在于,在第一时刻,第一电子设备确定第一距离前,还包括:The method according to claim 1 or 2, wherein, at the first moment, before the first electronic device determines the first distance, the method further comprises:
    在第一时刻前的第三时刻,所述第一电子设备确定所述第一处理时延。At a third time point before the first time point, the first electronic device determines the first processing delay.
  4. 根据权利要求3所述的方法,其特征在于,在第一时刻前的第三时刻,所述第一电子设备确定所述第一处理时延,具体包括:The method according to claim 3, wherein, at a third time before the first time, the first electronic device determines the first processing delay, which specifically includes:
    在第一时刻前的第三时刻,当所述第一电子设备与第二电子设备的距离小于距离阈值时,所述第一电子设备播放第一音频;At a third moment before the first moment, when the distance between the first electronic device and the second electronic device is less than a distance threshold, the first electronic device plays the first audio;
    所述第一电子设备通过无线网络/近距离通信服务接收所述第二电子设备发送的所述第一音频;The first electronic device receives the first audio sent by the second electronic device through a wireless network/near field communication service;
    所述第一电子设备确定播放第一音频和接收所述第一音频之间的时间差为所述第一处理时延。The first electronic device determines that a time difference between playing the first audio and receiving the first audio is the first processing delay.
  5. 根据权利要求3所述的方法,其特征在于,在第一时刻前的第三时刻,所述第一电子设备确定第一处理时延,具体包括:The method according to claim 3, wherein, at a third time before the first time, the first electronic device determines the first processing delay, which specifically includes:
    在第一时刻前的第三时刻,响应于用户的输入,所述第一电子设备播放第一音频;At a third moment before the first moment, in response to the user's input, the first electronic device plays the first audio;
    所述第一电子设备通过无线网络/近距离通信服务接收所述第二电子设备发送的所述第一音频;The first electronic device receives the first audio sent by the second electronic device through a wireless network/near field communication service;
    所述第一电子设备确定播放所述第一音频和接收所述第一音频之间的时间差为所述第一处理时延。The first electronic device determines that a time difference between playing the first audio and receiving the first audio is the first processing delay.
  6. 根据权利要求1至5中任一所述的方法,其特征在于,所述第一电子设备确定第一距离,具体包括:The method according to any one of claims 1 to 5, wherein the determining of the first distance by the first electronic device specifically includes:
    所述第一电子设备接收第一运动信息,所述第一运动信息包括所述第二电子设备的运动状态;the first electronic device receives first motion information, the first motion information includes a motion state of the second electronic device;
    所述第一电子设备基于所述第一运动信息确定所述第一距离。The first electronic device determines the first distance based on the first motion information.
  7. 根据权利要求1至5中任一所述的方法,其特征在于,所述第一电子设备确定第一距离,具体包括:The method according to any one of claims 1 to 5, wherein the determining of the first distance by the first electronic device specifically includes:
    所述第一电子设备接收第一位置信息,所述第一位置信息包括所述第二电子设备的位置;the first electronic device receives first location information, the first location information including the location of the second electronic device;
    所述第一电子设备基于所述第一位置信息确定所述第一距离。The first electronic device determines the first distance based on the first location information.
  8. 根据权利要求1至5中任一所述的方法,其特征在于,所述第一电子设备确定第一距离,具体包括:The method according to any one of claims 1 to 5, wherein the determining of the first distance by the first electronic device specifically includes:
    所述第一电子设备接收第一距离信息,所述第一距离信息包括所述第一距离;the first electronic device receives first distance information, the first distance information includes the first distance;
    所述第一电子设备基于所述第一距离信息确定所述第一距离。The first electronic device determines the first distance based on the first distance information.
  9. 根据权利要求1至8中任一项所述的方法,其特征在于,所述方法还包括:The method according to any one of claims 1 to 8, wherein the method further comprises:
    在第四时刻,第一电子设备确定第二处理时延,所述第二处理时延为所述第一电子设备从获取语音信号到播放语音信号的时延与第三电子设备获取语音信号到第三电子设备转发语音信号至第一电子设备上时延的和;At the fourth moment, the first electronic device determines a second processing delay, where the second processing delay is the delay from acquiring the voice signal to playing the voice signal by the first electronic device and the time from acquiring the voice signal to the third electronic device The third electronic device forwards the voice signal to the sum of the delays on the first electronic device;
    在第四时刻后的第五时刻,第一电子设备确定第三距离,所述第三距离为在第五时刻时所述第一电子设备与第三电子设备之间的距离;At a fifth time after the fourth time, the first electronic device determines a third distance, where the third distance is the distance between the first electronic device and the third electronic device at the fifth time;
    所述第一电子设备基于所述第三距离和声速确定第三传播时延,所述第三传播时延为在第五时刻时,声音信号从所述第一电子设备传播到第三电子设备之间的时延;The first electronic device determines a third propagation delay based on the third distance and the speed of sound, where the third propagation delay is when the sound signal propagates from the first electronic device to the third electronic device at a fifth moment in time delay between;
    所述第一电子设备确定所述第三传播时延与所述第二处理时延的和为第三回声时延。The first electronic device determines that the sum of the third propagation delay and the second processing delay is a third echo delay.
  10. 根据权利要求3所述的方法,其特征在于,在第一时刻前的第三时刻,所述第一电子设备确定第一处理时延,具体包括:The method according to claim 3, wherein, at a third time before the first time, the first electronic device determines the first processing delay, which specifically includes:
    当所述第一电子设备与第二电子设备的距离小于距离阈值时,所述第一电子设备播放第一音频;When the distance between the first electronic device and the second electronic device is less than a distance threshold, the first electronic device plays the first audio;
    所述第二电子设备采集所述第一音频;the second electronic device collects the first audio;
    所述第二电子设备通过无线网络/近距离通信服务向所述第一电子设备发送所述第一音频;The second electronic device sends the first audio to the first electronic device through a wireless network/near field communication service;
    所述第一电子设备确定播放第一音频和接收所述第一音频之间的时间差为所述第一处理时延。The first electronic device determines that a time difference between playing the first audio and receiving the first audio is the first processing delay.
  11. 根据权利要求3所述的方法,其特征在于,当所述第一电子设备与第二电子设备的距离小于距离阈值时,所述第一电子设备播放信令,具体包括:The method according to claim 3, wherein when the distance between the first electronic device and the second electronic device is less than a distance threshold, the first electronic device plays signaling, which specifically includes:
    在第一时刻前的第三时刻,响应于用户的输入,所述第一电子设备播放第一音频;At a third moment before the first moment, in response to the user's input, the first electronic device plays the first audio;
    所述第二电子设备采集所述第一音频;the second electronic device collects the first audio;
    所述第二电子设备通过无线网络/近距离通信服务向所述第一电子设备发送所述第一音频;The second electronic device sends the first audio to the first electronic device through a wireless network/near field communication service;
    所述第一电子设备确定播放第一音频和接收所述第一音频之间的时间差为所述第一处理时延。The first electronic device determines that a time difference between playing the first audio and receiving the first audio is the first processing delay.
  12. 根据权利要求10或11所述的方法,其特征在于,所述第一电子设备确定第一距离,具体包括:The method according to claim 10 or 11, wherein the determining of the first distance by the first electronic device specifically includes:
    所述第二电子设备确定第一运动信息,所述第一运动信息包括所述第二电子设备的运动状态;the second electronic device determines first motion information, where the first motion information includes a motion state of the second electronic device;
    所述第二电子设备向所述第一电子设备发送运动信息;the second electronic device sends motion information to the first electronic device;
    所述第一电子设备接收所述运动信息,所述第一电子设备基于所述第一运动信息确定所述第一距离。The first electronic device receives the motion information, and the first electronic device determines the first distance based on the first motion information.
  13. 根据权利要求10或11所述的方法,其特征在于,所述第一电子设备确定第一距离,具体包括:The method according to claim 10 or 11, wherein the determining of the first distance by the first electronic device specifically includes:
    所述第二电子设备确定第一位置信息,所述第一位置信息包括所述第二电子设备的位置;the second electronic device determines first location information, the first location information including the location of the second electronic device;
    所述第二电子设备向所述第一电子设备发送所述第一位置信息;the second electronic device sends the first location information to the first electronic device;
    所述第一电子设备接收所述第一位置信息,所述第一电子设备基于所述第一位置信息确定所述第一距离。The first electronic device receives the first location information, and the first electronic device determines the first distance based on the first location information.
  14. 根据权利要求10或11所述的方法,其特征在于,所述第一电子设备确定第一距离,具体包括:The method according to claim 10 or 11, wherein the determining of the first distance by the first electronic device specifically includes:
    所述第二电子设备确定第一距离信息,所述第一距离信息包括所述第一距离;the second electronic device determines first distance information, the first distance information includes the first distance;
    所述第二电子设备向所述第二电子设备发送所述第一距离信息;The second electronic device sends the first distance information to the second electronic device;
    所述第一电子设备接收所述第一距离信息,所述第一电子设备基于所述第一位置信息确定所述第一距离。The first electronic device receives the first distance information, and the first electronic device determines the first distance based on the first location information.
  15. 一种电子设备,其特征在于,所述电子设备包括:一个或多个处理器和存储器;An electronic device, characterized in that the electronic device comprises: one or more processors and a memory;
    所述存储器与所述一个或多个处理器耦合,所述存储器用于存储计算机程序代码,所述计算机程序代码包括计算机指令,所述一个或多个处理器调用所述计算机指令以使得所述电子设备执行如权利要求1至9中任一项所述的方法。The memory is coupled to the one or more processors for storing computer program code, the computer program code comprising computer instructions that the one or more processors invoke to cause the The electronic device performs the method of any one of claims 1 to 9.
  16. 一种芯片系统,所述芯片系统应用于电子设备,所述芯片系统包括一个或多个处理器,所述处理器用于调用计算机指令以使得所述电子设备执行如权利要求1至9中任一项所述的方法。A chip system, the chip system applied to an electronic device, the chip system comprising one or more processors for invoking computer instructions to cause the electronic device to execute any one of claims 1 to 9 method described in item.
  17. 一种包含指令的计算机程序产品,其特征在于,当所述计算机程序产品在电子设备上运行时,使得所述电子设备执行如权利要求1至9中任一项所述的方法。A computer program product comprising instructions, wherein, when the computer program product is run on an electronic device, the electronic device is caused to perform the method according to any one of claims 1 to 9.
  18. 一种计算机可读存储介质,包括指令,其特征在于,当所述指令在电子设备上运行时,使得所述电子设备执行如权利要求1至9中任一项所述的方法。A computer-readable storage medium comprising instructions, characterized in that, when the instructions are executed on an electronic device, the electronic device is caused to perform the method according to any one of claims 1 to 9.
PCT/CN2022/084862 2021-04-17 2022-04-01 Method for estimating echo delay between distributed devices, and electronic device WO2022218175A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202110415110.4 2021-04-17
CN202110415110.4A CN115223581A (en) 2021-04-17 2021-04-17 Echo time delay estimation method for distributed equipment and electronic equipment

Publications (1)

Publication Number Publication Date
WO2022218175A1 true WO2022218175A1 (en) 2022-10-20

Family

ID=83605202

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/084862 WO2022218175A1 (en) 2021-04-17 2022-04-01 Method for estimating echo delay between distributed devices, and electronic device

Country Status (2)

Country Link
CN (1) CN115223581A (en)
WO (1) WO2022218175A1 (en)

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9246545B1 (en) * 2014-04-11 2016-01-26 Amazon Technologies, Inc. Adaptive estimation of delay in audio systems
US20160171988A1 (en) * 2014-12-15 2016-06-16 Wire Swiss Gmbh Delay estimation for echo cancellation using ultrasonic markers
CN108091343A (en) * 2018-01-16 2018-05-29 北京三体云联科技有限公司 A kind of echo cancel method and device
CN109040501A (en) * 2018-09-10 2018-12-18 成都擎天树科技有限公司 A kind of echo cancel method improving VOIP phone quality
CN109961797A (en) * 2017-12-25 2019-07-02 阿里巴巴集团控股有限公司 A kind of echo cancel method, device and electronic equipment
CN110769352A (en) * 2018-07-25 2020-02-07 西安中兴新软件有限责任公司 Signal processing method and device and computer storage medium
CN110956974A (en) * 2019-12-05 2020-04-03 浙江大华技术股份有限公司 Echo cancellation method and related device
CN111060875A (en) * 2019-12-12 2020-04-24 北京声智科技有限公司 Method and device for acquiring relative position information of equipment and storage medium
CN112116919A (en) * 2019-06-19 2020-12-22 杭州海康威视数字技术股份有限公司 Echo cancellation method and device and electronic equipment

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9246545B1 (en) * 2014-04-11 2016-01-26 Amazon Technologies, Inc. Adaptive estimation of delay in audio systems
US20160171988A1 (en) * 2014-12-15 2016-06-16 Wire Swiss Gmbh Delay estimation for echo cancellation using ultrasonic markers
CN109961797A (en) * 2017-12-25 2019-07-02 阿里巴巴集团控股有限公司 A kind of echo cancel method, device and electronic equipment
CN108091343A (en) * 2018-01-16 2018-05-29 北京三体云联科技有限公司 A kind of echo cancel method and device
CN110769352A (en) * 2018-07-25 2020-02-07 西安中兴新软件有限责任公司 Signal processing method and device and computer storage medium
CN109040501A (en) * 2018-09-10 2018-12-18 成都擎天树科技有限公司 A kind of echo cancel method improving VOIP phone quality
CN112116919A (en) * 2019-06-19 2020-12-22 杭州海康威视数字技术股份有限公司 Echo cancellation method and device and electronic equipment
CN110956974A (en) * 2019-12-05 2020-04-03 浙江大华技术股份有限公司 Echo cancellation method and related device
CN111060875A (en) * 2019-12-12 2020-04-24 北京声智科技有限公司 Method and device for acquiring relative position information of equipment and storage medium

Also Published As

Publication number Publication date
CN115223581A (en) 2022-10-21

Similar Documents

Publication Publication Date Title
WO2021000876A1 (en) Voice control method, electronic equipment and system
CN111666119A (en) UI component display method and electronic equipment
CN112119641B (en) Method and device for realizing automatic translation through multiple TWS (time and frequency) earphones connected in forwarding mode
CN114449599A (en) Network link switching method based on electronic equipment position and electronic equipment
CN112637758B (en) Equipment positioning method and related equipment thereof
WO2022022609A1 (en) Method for preventing inadvertent touch and electronic device
WO2022160991A1 (en) Permission control method and electronic device
WO2022007944A1 (en) Device control method, and related apparatus
EP4044578A1 (en) Audio processing method and electronic device
CN114466107A (en) Sound effect control method and device, electronic equipment and computer readable storage medium
WO2022143258A1 (en) Voice interaction processing method and related apparatus
WO2022105702A1 (en) Method and electronic device for saving image
WO2022170856A1 (en) Method for establishing connection, and electronic device
CN115641867B (en) Voice processing method and terminal equipment
CN116708751B (en) Method and device for determining photographing duration and electronic equipment
CN114006698B (en) token refreshing method and device, electronic equipment and readable storage medium
WO2022218175A1 (en) Method for estimating echo delay between distributed devices, and electronic device
WO2022135195A1 (en) Method and apparatus for displaying virtual reality interface, device, and readable storage medium
CN113467904B (en) Method and device for determining collaboration mode, electronic equipment and readable storage medium
WO2022007757A1 (en) Cross-device voiceprint registration method, electronic device and storage medium
WO2020233581A1 (en) Height measuring method and electronic device
CN115706755A (en) Echo cancellation method, electronic device, and storage medium
CN115480250A (en) Voice recognition method and device, electronic equipment and storage medium
WO2020233554A1 (en) Angle measurement method and electronic device
WO2022052767A1 (en) Method for controlling device, electronic device, and system

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22787402

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 22787402

Country of ref document: EP

Kind code of ref document: A1