WO2022022647A1 - 电子设备的录音方法及录音装置 - Google Patents
电子设备的录音方法及录音装置 Download PDFInfo
- Publication number
- WO2022022647A1 WO2022022647A1 PCT/CN2021/109323 CN2021109323W WO2022022647A1 WO 2022022647 A1 WO2022022647 A1 WO 2022022647A1 CN 2021109323 W CN2021109323 W CN 2021109323W WO 2022022647 A1 WO2022022647 A1 WO 2022022647A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- focal length
- gain
- signal
- initial
- target
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 62
- 238000007499 fusion processing Methods 0.000 claims abstract description 17
- 230000004927 fusion Effects 0.000 claims description 41
- 238000004590 computer program Methods 0.000 claims 4
- 230000000694 effects Effects 0.000 description 9
- 230000007423 decrease Effects 0.000 description 8
- 238000010586 diagram Methods 0.000 description 8
- 238000004891 communication Methods 0.000 description 7
- 230000006870 function Effects 0.000 description 7
- 238000005516 engineering process Methods 0.000 description 3
- 238000003672 processing method Methods 0.000 description 2
- 230000005236 sound signal Effects 0.000 description 2
- 241000699670 Mus sp. Species 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000007599 discharging Methods 0.000 description 1
- 239000012634 fragment Substances 0.000 description 1
- 239000004973 liquid crystal related substance Substances 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0316—Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0316—Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
- G10L21/0356—Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude for synchronising with other signals, e.g. video signals
Definitions
- the present application belongs to the field of communication technologies, and in particular relates to a recording method and a recording device of an electronic device.
- the distance between the sound source and the mobile terminal usually changes.
- the sound recorded by the mobile terminal will gradually decrease, so that the user may not be able to hear the sound clearly; as the distance between the sound source and the mobile terminal gradually decreases, the sound recorded by the mobile terminal will gradually decrease. Gradually increase, resulting in possible breakouts. Therefore, in view of the situation where the distance between the sound source and the mobile terminal changes, how to improve the recording quality has become an urgent problem to be solved.
- the purpose of the embodiments of the present application is to provide a recording method and a recording device for an electronic device, which can solve the problem of how to improve the recording quality.
- an embodiment of the present application provides a recording method of an electronic device.
- the electronic device includes M microphones, and each microphone is connected to a first voice path and a second voice path, where M is an integer greater than or equal to 2.
- the method includes: obtaining the shooting focal length of the camera in a state of video shooting; if the shooting focal length of the camera changes from the initial focal length to the target focal length, determining the target gain according to the initial focal length, the target focal length and the initial gain, and comparing the target gain with the The gain of the second voice path connected to the ith microphone is adjusted to the target gain, the initial gain is the gain of the first voice path connected to the ith microphone, and i takes values 1, 2...M in turn; Perform signal enhancement processing on the voice signal output by the first voice channel connected to the ith microphone and the voice signal output by the second voice channel connected with the ith microphone to obtain the ith voice enhanced signal; perform signal fusion on the M voice enhanced signals processing to obtain a first recording signal.
- an embodiment of the present application provides a recording device.
- the recording device includes M microphones, and each microphone is connected with a first voice channel and a second voice channel, where M is an integer greater than or equal to 2.
- the recording device includes an acquisition module, a determination module and a processing module.
- the acquiring module is used for acquiring the shooting focal length of the camera when the video is in the state of shooting.
- the determining module is used to determine the target gain according to the initial focal length, the target focal length and the initial gain if the shooting focal length obtained by the acquiring module is changed from the initial focal length to the target focal length, and the initial gain is the first voice path connected with the i-th microphone. gain.
- the processing module is used for adjusting the gain of the second voice path connected with the i-th microphone to the target gain determined by the determination module; and to the voice signal output from the first voice path connected with the i-th microphone and with the i-th microphone.
- i takes the value of 1, 2...M in turn.
- an embodiment of the present application provides an electronic device, the electronic device includes a processor, a memory, and a program or instruction stored in the memory and executable on the processor, the program or instruction being executed by the processor When executed, the steps of the method as provided in the first aspect are implemented.
- an embodiment of the present application provides a readable storage medium, where a program or an instruction is stored on the readable storage medium, and when the program or instruction is executed by a processor, the steps of the method provided in the first aspect are implemented.
- an embodiment of the present application provides a chip, the chip includes a processor and a communication interface, the communication interface is coupled to the processor, and the processor is configured to run a program or an instruction to implement the method provided in the first aspect.
- the shooting focal length of the camera can be obtained in the state of video shooting; if the shooting focal length of the camera changes from the initial focal length to the target focal length, the target gain is determined according to the initial focal length, the target focal length and the initial gain , and adjust the gain of the second voice path connected with the ith microphone to the target gain, the initial gain is the gain of the first voice path connected with the ith microphone, and i takes values 1, 2...M in turn; Carry out signal enhancement processing to the voice signal output by the first voice path connected with the ith microphone and the voice signal output by the second voice path connected with the ith microphone to obtain the ith voice enhancement signal; The signal is subjected to signal fusion processing to obtain a first recording signal.
- the shooting focal length of the camera of the electronic device also changes. Therefore, by setting the first voice path connected to the i-th microphone as Fixed gain, the second voice channel connected with the i-th microphone is set to a variable gain that changes with the shooting focal length, so that when the shooting focal length becomes larger, the gain of the second voice path becomes larger to record signals at longer distances.
- the gain of the second voice channel is smaller to record a signal at a closer distance, and then the voice signal is enhanced by comparing the difference between the two signals, thereby improving the quality of the voice signal obtained by the final fusion.
- the effect of video shooting is
- FIG. 1 is a schematic structural diagram of an electronic device according to an embodiment of the present application.
- FIG. 2 is one of the schematic diagrams of a recording method of an electronic device provided by an embodiment of the present application.
- FIG 3 is the second schematic diagram of the recording method of the electronic device provided by the embodiment of the present application.
- FIG. 4 is a schematic diagram of a field of view of a camera provided by an embodiment of the present application.
- FIG. 5 is a schematic structural diagram of a recording device provided by an embodiment of the present application.
- FIG. 6 is one of the schematic hardware diagrams of the electronic device provided by the embodiment of the present application.
- FIG. 7 is the second schematic diagram of the hardware of the electronic device provided by the embodiment of the present application.
- first, second and the like in the description and claims of the present application are used to distinguish similar objects, and are not used to describe a specific order or sequence. It is to be understood that the data so used are interchangeable under appropriate circumstances so that the embodiments of the present application can be practiced in sequences other than those illustrated or described herein, and distinguish between “first”, “second”, etc.
- the objects are usually of one type, and the number of objects is not limited.
- the first object may be one or more than one.
- “and/or” in the description and claims indicates at least one of the connected objects, and the character “/" generally indicates that the associated objects are in an "or” relationship.
- the embodiment of the present application provides a recording method and a recording device for an electronic device, which can obtain the shooting focal length of a camera in a video shooting state; Target focal length and initial gain, determine the target gain, and adjust the gain of the second voice path connected with the ith microphone to the target gain, and the initial gain is the gain of the first voice path connected with the ith microphone, i is taken in turn.
- the value is 1, 2...M; perform signal enhancement processing on the voice signal output by the first voice path connected to the ith microphone and the voice signal output by the second voice path connected with the ith microphone to obtain the ith voice signal.
- Speech enhancement signal perform signal fusion processing on the M speech enhancement signals to obtain a first recording signal.
- the shooting focal length of the camera of the electronic device also changes. Therefore, by setting the first voice path connected to the i-th microphone as Fixed gain, the second voice channel connected with the i-th microphone is set to a variable gain that changes with the shooting focal length, so that when the shooting focal length becomes larger, the gain of the second voice path becomes larger to record signals at longer distances.
- the gain of the second voice channel is smaller to record a signal at a closer distance, and then the voice signal is enhanced by comparing the difference between the two signals, thereby improving the quality of the voice signal obtained by the final fusion.
- the effect of video shooting is
- an embodiment of the present application provides an electronic device.
- the electronic device includes M microphones, and each microphone is connected to a first voice path and a second voice path, where M is an integer greater than or equal to 2.
- the electronic device is a mobile phone, and the mobile phone is provided with two microphones (mic), a codec (codec) and a digital signal processor (adsp) as an example.
- the microphone 1 is connected with the first voice path 01 and the second voice path 02
- the microphone 2 is connected with the first voice path 03 and the second voice path 04
- each voice path in the four voice paths respectively includes an analog-to-digital converter (analog-to-digital converter, ADC), and the other end of the 4 voice paths is connected to the coding module of the codec.
- the digital signal processor includes a recording enhancement module connected with the encoding module, and a noise reduction module connected with the recording enhancement module.
- the microphone is used to collect sound signals (also known as sound source signals, sound wave signals) emitted by sound sources (such as characters, musical instruments, water currents, wind waves, etc.);
- the ADC is used to convert the sound signals collected by the microphone by analog The signal is converted into a digital signal;
- the encoding module is used to encode the digital signal output by the ADC to obtain the encoded signal;
- the recording enhancement module is used to enhance the 2-channel encoded signal connected with each microphone, and obtain the same signal as each microphone.
- the 1-channel enhanced signal corresponding to the microphone, and then the 2-channel enhanced signal corresponding to the two microphones is obtained;
- the noise reduction module is used to fuse the 2-channel enhanced signal to obtain a 1-channel fusion signal, and perform noise reduction processing on the fusion signal. , to get the final recording signal.
- the specific description of the recording method of the electronic device may be referred to the following embodiments, which will not be repeated here.
- FIG. 1 is an example of an electronic device including two microphones. These two microphones can be used to collect different types of audio respectively.
- the microphone 1 can be mainly used to collect human voices
- the microphone 2 It is used to collect ambient sound, but it does not form any limitation to the embodiments of the present application.
- the electronic device may further include three microphones or more than three microphones, and as the number of microphones increases, the recording effect is gradually enhanced.
- an embodiment of the present application provides a recording method of an electronic device.
- the method may include the following S201 to S204.
- the method will be exemplarily described below by taking the execution subject as an electronic device as shown in FIG. 1 as an example.
- the electronic device acquires the shooting focal length of the camera.
- the above-mentioned video shooting state is a state in which the user triggers the camera to collect and cache video frames through touch input on the video shooting control or video recording control after running the camera application, that is, the electronic device is in the process of video shooting among.
- the shooting focal length of the camera may change at any time.
- the electronic device is in an auto-focus state, and as the subject moves, the focal length of the camera is automatically adjusted; another possible implementation is that as the subject moves or the subject moves To change, the user triggers the camera to adjust the focus through manual input. Therefore, in order to detect the change of the shooting focal length of the camera in time, after the electronic device enters the video shooting state, the electronic device can periodically detect the shooting focal length of the camera to determine whether the shooting focal length of the camera has changed, and then determine whether The gain of the speech path needs to be adjusted.
- the recording method provided by the embodiment of the present application may further include: running a camera application; after that, setting the shooting focal length of the camera to the initial focal length, and setting the field of view of the camera to the initial field of view , and set the gain of each speech channel to the initial gain.
- the initial focal length, initial field angle and initial gain may all be preset values.
- the electronic device runs the camera application and performs initial settings. For example, set the initial focal length of the camera to 4 times the focal length, set the initial field of view of the camera to ⁇ , and set the gain of the ADC in the first voice path and the second voice path connected to each camera to 12dB (decibels ).
- the gain of the first voice channel is a fixed gain, and as the shooting focal length of the camera changes, the gain of the first voice channel remains unchanged.
- the gain of the second voice path is a variable gain. As the focal length of the camera increases, the gain of the second voice path increases, and as the focal length of the camera decreases, the gain of the second voice path decreases.
- the above-mentioned embodiment is described by taking an example that the initial gain of the second voice path is equal to the initial gain of the first voice path. It can be understood that, in actual implementation, the initial gain of the second voice path may be larger or smaller than that of the first voice path.
- the initial gain of the channel may be specifically determined according to actual usage requirements, which is not limited in this embodiment of the present application.
- the electronic device determines the target gain according to the initial focal length, the target focal length and the initial gain, and adjusts the gain of the second voice path connected to the ith microphone as the target gain.
- the above-mentioned initial gain is the gain of the first voice path connected to the ith microphone, that is, the fixed gain set for the first voice path connected to the ith microphone when the shooting focal length of the camera is the initial focal length.
- i takes values of 1, 2...M in sequence. That is, if the shooting focal length of the camera changes from the initial focal length to the target focal length, the electronic device determines the second voice connected to the first microphone according to the initial focal length, the target focal length and the gain of the first voice path connected to the first microphone the gain of the channel and adjust it; according to the initial focal length, the target focal length and the gain of the first voice channel connected with the second microphone, determine the gain of the second voice channel connected with the second microphone and adjust it; ...; According to the initial focal length, the target focal length and the gain of the first voice path connected with the Mth microphone, determine and adjust the gain of the second voice path connected with the Mth microphone.
- the above-mentioned target focal length is larger than the initial focal length, or smaller than the initial focal length.
- the gain of the second voice path connected to the ith microphone needs to be increased; if the target focal length is smaller than the initial focal length, the gain of the second voice path connected to the ith microphone needs to be reduced.
- the above-mentioned "determining the target gain according to the initial focal length, target focal length and initial gain” can be implemented in any one of the following two ways:
- Method 1 If the target focal length is greater than the initial focal length, the electronic device uses the sum of the initial gain and the first gain as the target gain, wherein the first gain is the product of the target value and the preset value, and the target value is the target focal length.
- *preset value initial gain+(target focal length-initial focal length)*preset value, and the preset value can be used to represent The gain difference between two consecutive focal lengths.
- the initial gain is 12 dB
- the preset value is 3 dB
- the initial focal length is ⁇ 4 (ie, 4 times the focal length).
- Method 2 If the target focal length is smaller than the initial focal length, the electronic device uses the difference between the initial gain and the first gain as the target gain, where the first gain is the product of the target value and the preset value, and the target value is the target focal length.
- *preset value initial gain-(initial focal length-target focal length)*preset value, and the preset value can be used to represent The gain difference between two consecutive focal lengths.
- the initial gain is 12 dB
- the preset value is 3 dB
- the initial focal length is ⁇ 4 (ie, 4 times the focal length).
- the focal length of the camera is changed to ⁇ 1 (ie, 1 times the focal length), the target gain
- variable gain range is set for the gain of the second speech path, that is, the gain of the second speech passage can only be adjusted within the variable gain range.
- the variable gain range is 0-30dB, combined with the example in the above-mentioned way 1, when the shooting focal length of the camera changes to ⁇ 10 (ie, 10 times the focal length), the target gain is 30dB, and when the shooting focal length of the camera changes When the focal length is greater than 10 times, the target gain is still 30dB.
- the above embodiment is exemplified by taking the initial gain of the first voice path connected to each microphone as the same gain as an example. It can be understood that in actual implementation, the initial gain of the first voice path connected to each microphone is It can also be unequal, which can be determined according to the actual use requirements.
- a voice enhancement signal can be obtained by comparing the voice signal output from the first voice path and the voice signal output from the second voice path and performing enhancement processing.
- the electronic device may compare the signal-to-noise ratio of the speech signal output by the first speech channel with the signal-to-noise ratio of the speech signal output by the second speech channel. If the signal-to-noise ratio of the speech signal output by the first speech channel is greater than the signal-to-noise ratio of the speech signal output by the second speech channel, then the speech signal output by the first speech channel is enhanced to obtain a speech enhancement signal; If the signal-to-noise ratio of the speech signal output by the speech channel is smaller than that of the speech signal output by the second speech channel, the speech signal output by the second speech channel is enhanced to obtain a speech enhancement signal.
- the electronic device can compare the preset feature parameters of the voice signal output by the first voice channel with the preset feature parameters of the voice signal output by the second voice channel;
- the speech fragments that meet the requirements are synthesized to obtain a speech enhancement signal.
- the preset characteristic parameters may include at least one of acoustic wave amplitude information, voiceprint information, and signal-to-noise ratio.
- the output voice signal and the voice signal output from the second voice channel 04 connected to the microphone 2 are input into the recording enhancement module after being respectively encoded by the ADC analog-to-digital conversion and the encoding module.
- the recording enhancement module can compare the voice signal output by the first voice path 01 and the voice signal output by the second voice path 02 to obtain the first voice enhancement signal, and compare the voice signal output by the first voice path 03 with the second voice path. 04 The output voice signal, get the second voice enhancement signal.
- the two speech enhancement signals can be directly fused to obtain the first recording signal; or, the two speech enhancement signals can be fused through the noise reduction module to obtain a fusion signal, and then the fusion signal can be processed by fusion processing. Perform noise reduction processing to obtain a first recording signal.
- the embodiments of the present application provide multiple noise reduction processing methods:
- the first method is that the electronic device performs noise reduction processing on the first fusion signal according to the field of view and shooting direction of the camera to eliminate noise from outside the shooting range, thereby obtaining the first recording signal.
- the second method is that the electronic device is provided with at least three microphones, and these three microphones can form a microphone array.
- Each sound source is located, so as to obtain the position information of each sound source, and then eliminate the noise information outside the azimuth of the sound source.
- the position information can be understood as the distance information of the sound source signal from the earphone and the position information relative to the earphone.
- the electronic device may also use other noise reduction processing methods to perform noise reduction processing on the fusion signal to obtain the first recording signal, which may be determined according to actual usage requirements, which is not limited in the embodiments of the present application.
- An embodiment of the present application provides a recording method for an electronic device, since when the distance between the sound source and the electronic device changes, the shooting focal length of the camera of the electronic device also changes.
- the connected first voice path is set to a fixed gain
- the second voice path connected to the i-th microphone is set to a variable gain that changes with the shooting focal length, so that the gain of the second voice path when the shooting focal length becomes larger
- the gain of the second voice channel is smaller to record the signal at a closer distance, and then the voice signal is enhanced by comparing the difference between the two signals, thereby improving the final fusion.
- the quality of the obtained voice signal is improved, and the shooting effect of the video is improved.
- noise reduction processing may be performed on the fusion signals first, and then a final recording signal is obtained.
- S204 may be specifically implemented by the following S204A to S204C.
- the electronic device performs signal fusion processing on the M speech enhancement signals to obtain a first fusion signal.
- the electronic device acquires the field of view and the shooting direction of the camera.
- the angle of view of the camera When the electronic device is in the process of video shooting, when the focal length of the camera changes, the angle of view of the camera also changes. For example, when the focal length increases, the angle of view decreases, and when the focal length decreases, the angle of view decreases. Increase; when the electronic device is turned, such as moving from left to right along a horizontal line or from top to bottom along a vertical line, the camera's shooting direction will change.
- the embodiment of the present application can acquire the field of view and shooting direction of the camera, so as to eliminate noise from outside the shooting range according to the field of view and shooting direction of the camera.
- the electronic device performs noise reduction processing on the first fusion signal according to the field of view angle and shooting direction of the camera to obtain a first recording signal.
- the above noise reduction processing is used to eliminate noise from outside the shooting range, that is, to eliminate sound from outside the actual viewing area.
- the focal length of the camera of the mobile phone is ⁇ 4 (ie, 4 times the focal length)
- the field of view (also called the wide angle) of the camera can be a. Since the fusion signal includes the recording signal within the shooting range and the recording signal outside the shooting range, after the mobile phone determines the shooting range according to the field of view and shooting direction of the camera, the mobile phone can use the preset algorithm according to the shooting range, The recording signal outside the shooting range in the fusion signal is eliminated or canceled, so as to obtain the recording signal within the shooting range, that is, the first recording signal.
- the recording method provided by the embodiment of the present application can perform noise reduction processing on the fusion signal according to the field of view angle and the shooting direction of the camera, so as to eliminate noise from outside the shooting range, thereby improving the recording quality of the finally obtained recording signal.
- the embodiment of the present application may Periodically acquire the field of view and shooting direction of the camera to determine whether the field of view and shooting direction of the camera change.
- the recording method provided in this embodiment of the present application may further include the following S205 to S207.
- the electronic device obtains the shooting focal length of the camera again to obtain the second fusion signal.
- the shooting focal length may not have changed or may have changed relative to the shooting focal length obtained last time.
- the shooting focal length does not change, then there is no need to adjust the gain of the second voice path, and directly compare the voice signal output by the first voice path and the voice signal output by the second voice path to obtain a voice enhancement signal, that is, the second voice signal. fusion signal.
- the gain of the second voice path needs to be re-adjusted; after that, the gain to be adjusted is determined according to the initial focal length, the adjusted focal length and the initial gain, and the gain of the second voice path is adjusted to this The gain is to be adjusted; after that, compare the speech signal output by the first speech channel and the speech signal output by the second speech channel to obtain a speech enhancement signal, that is, a second fusion signal.
- a speech enhancement signal that is, a second fusion signal.
- the electronic device obtains the field of view and the shooting direction of the camera again.
- the electronic device when the target object changes, the electronic device performs noise reduction processing on the second fusion signal according to the changed target object to obtain a second recording signal.
- the above-mentioned target object includes at least one of a field of view angle and a shooting direction.
- the electronic device may periodically acquire the field of view angle and shooting direction of the camera according to a preset period, so as to detect whether the field of view angle and shooting direction of the camera have changed. If at least one of the field of view and shooting direction of the camera changes, the electronic device needs to re-determine the noise reduction direction according to the changed field of view and shooting direction, and perform noise reduction processing on the new fusion signal. For example, if the user turns the electronic device and the position of the sound source remains the same, since the relative position of the sound source and the microphone changes, the shooting direction also changes.
- the electronic device needs to Shooting direction, re-determine the shooting range, and then use a preset algorithm according to the shooting range to eliminate or cancel the recording signal outside the shooting range in the second fusion signal, so as to obtain the recording signal within the shooting range, that is, the second recording signal .
- the relative position of the sound source and the electronic device may change. Therefore, by periodically acquiring the field of view of the camera and shooting direction, can accurately determine the shooting range, and more accurately denoise the voice signal.
- the execution subject may be an electronic device, a recording device, or a control module in the recording device for executing the recording method.
- the recording device provided by the embodiment of the present application is described by taking the recording method performed by the recording device as an example.
- an embodiment of the present application provides a recording apparatus 500 .
- the recording device includes M microphones, and each microphone is connected with a first voice channel and a second voice channel, where M is an integer greater than or equal to 2.
- the recording device 500 includes an acquisition module 501 , a determination module 502 and a processing module 503 .
- the obtaining module 501 may be configured to obtain the shooting focal length of the camera in the state of video shooting.
- the determining module 502 can be used to determine the target gain according to the initial focal length, the target focal length and the initial gain if the shooting focal length obtained by the obtaining module 501 is changed from the initial focal length to the target focal length, and the initial gain is connected to the ith microphone The gain of the first speech path.
- the processing module 503 can be used to adjust the gain of the second voice path connected with the i-th microphone to the target gain determined by the determination module 502; Perform signal enhancement processing on the voice signal output by the second voice channel connected to the ith microphone to obtain the ith voice enhancement signal; and perform signal fusion processing on the M voice enhancement signals to obtain the first recording signal.
- i takes the value of 1, 2...M in turn.
- the determination module 502 can be specifically used for: if the target focal length is greater than the initial focal length, then the sum of the initial gain and the first gain is taken as the target gain; or, if the target focal length is less than the initial focal length, then the initial gain and the first The difference between a gain is used as the target gain.
- the first gain is a product of a target value and a preset value, and the target value is an absolute value of a difference between the target focal length and the initial focal length.
- the processing module 503 may be specifically configured to perform signal fusion processing on the M speech enhancement signals to obtain a first fusion signal.
- the acquiring module 501 can also be used to acquire the field of view and shooting direction of the camera.
- the processing module 503 can be specifically configured to perform noise reduction processing on the first fusion signal according to the field of view and shooting direction of the camera acquired by the acquisition module 501 to obtain a first recording signal, and the noise reduction processing is used to eliminate the noise from the shooting range. outside noise.
- the acquiring module 501 may be configured to re-acquire the shooting focal length of the camera after obtaining the first recording signal to obtain the second fusion signal; and re-acquire the field of view and shooting direction of the camera.
- the processing module 503 can also be used to perform noise reduction processing on the second fusion signal according to the changed target object when the target object changes to obtain a second recording signal, wherein the target object includes the angle of view and the At least one of the shooting directions.
- the processing module 503 may also be configured to run a camera application before acquiring the focal length of the camera.
- the processing module 503 can also be used to set the shooting focal length of the camera as the initial focal length, set the field of view of the camera as the initial field of view, and set the gain of each speech channel as the initial gain.
- An embodiment of the present application provides a recording device, because when the distance between the sound source and the electronic device changes, the shooting focal length of the camera also changes. Therefore, the recording device passes through the first voice channel that connects with the i-th microphone. Set as a fixed gain, and set the second voice path connected to the i-th microphone to a variable gain that changes with the shooting focal length, so that when the shooting focal length becomes larger, the gain of the second voice path becomes larger to record farther When the focal length of the shooting becomes smaller, the gain of the second voice channel is smaller to record the signal at a closer distance, and then the recording device enhances the voice signal by comparing the difference between the two signals, thereby improving the voice signal obtained by the final fusion. quality, and improve the shooting effect of the video.
- the recording device in this embodiment of the present application may be a device, or may be a component, an integrated circuit, or a chip in a terminal.
- the apparatus may be a mobile electronic device or a non-mobile electronic device.
- the mobile electronic device may be a mobile phone, a tablet computer, a notebook computer, a palmtop computer, an in-vehicle electronic device, a wearable device, an ultra-mobile personal computer (UMPC), a netbook, or a personal digital assistant (personal digital assistant).
- UMPC ultra-mobile personal computer
- netbook or a personal digital assistant (personal digital assistant).
- non-mobile electronic devices can be servers, network attached storage (NAS), personal computer (personal computer, PC), television (television, TV), teller machine or self-service machine, etc., this application Examples are not specifically limited.
- the recording device in the embodiment of the present application may be a device having an operating system.
- the operating system may be an Android (Android) operating system, an ios operating system, or other possible operating systems, which are not specifically limited in the embodiments of the present application.
- the recording device provided in the embodiment of the present application can implement each process implemented by the method embodiments in FIG. 1 to FIG. 4 , and to avoid repetition, details are not described here.
- an embodiment of the present application further provides an electronic device 600, including a processor 601, a memory 602, and a program or instruction stored in the memory 602 and executable on the processor 601, the program Or, when the instruction is executed by the processor 601, each process of the above-mentioned recording method embodiment can be implemented, and the same technical effect can be achieved. To avoid repetition, details are not repeated here.
- the electronic devices in the embodiments of the present application include the above-mentioned mobile electronic devices and non-mobile electronic devices.
- FIG. 7 is a schematic diagram of a hardware structure of an electronic device implementing an embodiment of the present application.
- the electronic device 700 includes but is not limited to: a radio frequency unit 701, a network module 702, an audio output unit 703, an input unit 704, a sensor 705, a display unit 706, a user input unit 707, an interface unit 708, a memory 709, and a processor 710, etc. part.
- the electronic device 700 may also include a power source (such as a battery) for supplying power to various components, and the power source may be logically connected to the processor 710 through a power management system, so as to manage charging, discharging, and power management through the power management system. consumption management and other functions.
- the electronic device includes M microphones, and each microphone is connected to a first voice path and a second voice path, where M is an integer greater than or equal to 2.
- the structure of the electronic device shown in FIG. 7 does not constitute a limitation on the electronic device.
- the electronic device may include more or less components than the one shown, or combine some components, or arrange different components, which will not be repeated here. .
- the processor 710 may be configured to acquire the shooting focal length of the camera in the state of video shooting.
- the processor 710 can also be used to determine the target gain according to the initial focal length, the target focal length and the initial gain if the shooting focal length obtained by the processor 710 is changed from the initial focal length to the target focal length, and the initial gain is the same as that of the ith microphone.
- the gain of the connected first speech path can also be used to adjust the gain of the second voice path connected with the i-th microphone to the target gain; Perform signal enhancement processing on the voice signals output by the connected second voice channel to obtain the i-th voice enhanced signal; and perform signal fusion processing on the M voice enhanced signals to obtain the first recording signal.
- i takes the value of 1, 2...M in turn.
- the processor 710 may be specifically configured to: if the target focal length is greater than the initial focal length, use the sum of the initial gain and the first gain as the target gain; or, if the target focal length is less than the initial focal length, then use the initial gain and the first The difference between a gain is used as the target gain.
- the first gain is the product of the target value and the preset value, and the target value is the absolute value of the difference between the target focal length and the initial focal length.
- the processor 710 may be specifically configured to: perform signal fusion processing on the M speech enhancement signals to obtain a first fusion signal; and obtain the field of view and shooting direction of the camera; and according to the field of view and shooting direction of the camera In the direction, noise reduction processing is performed on the first fusion signal to obtain a first recording signal, and the noise reduction processing is used to eliminate noise from outside the shooting range.
- the processor 710 can be used to re-acquire the shooting focal length of the camera after obtaining the first recording signal to obtain the second fusion signal; and re-acquire the field of view and shooting direction of the camera; when the target object changes In the case of , according to the changed target object, noise reduction processing is performed on the second fusion signal to obtain a second recording signal.
- the target object includes at least one of a field of view angle and a shooting direction.
- the processor 710 may be further configured to run a camera application before acquiring the shooting focal length of the camera; set the shooting focal length of the camera to the initial focal length, and set the field of view of the camera to the initial field of view, and Set the gain of each speech channel to the initial gain.
- the embodiment of the present application provides an electronic device, because when the distance between the sound source and the electronic device changes, the shooting focal length of the camera of the electronic device also changes, therefore, the electronic device is connected to the ith microphone by connecting The first voice path is set to a fixed gain, and the second voice path connected to the i-th microphone is set to a variable gain that changes with the shooting focal length, so that the gain of the second voice path changes when the shooting focal length becomes larger.
- the focal length becomes smaller the gain of the second voice channel is smaller to record the signal at a closer distance, and then the electronic device enhances the voice signal by comparing the difference between the two signals, thereby improving the final performance.
- the quality of the voice signal obtained by fusion is improved, and the shooting effect of the video is improved.
- the input unit 704 may include a graphics processing unit (graphics processing unit, GPU) 7041 and a microphone 7042. Such as camera) to obtain still pictures or video image data for processing.
- the display unit 706 may include a display panel 7061, which may be configured in the form of a liquid crystal display, an organic light emitting diode, or the like.
- the user input unit 707 includes a touch panel 7071 and other input devices 7072 .
- the touch panel 7071 is also called a touch screen.
- the touch panel 7071 may include two parts, a touch detection device and a touch controller.
- Other input devices 7072 may include, but are not limited to, physical keyboards, function keys (such as volume control keys, switch keys, etc.), trackballs, mice, and joysticks, which will not be repeated here.
- Memory 709 may be used to store software programs as well as various data including, but not limited to, application programs and operating systems.
- the processor 710 may integrate an application processor and a modem processor, wherein the application processor mainly handles the operating system, user interface, and application programs, and the like, and the modem processor mainly handles wireless communication. It can be understood that, the above-mentioned modulation and demodulation processor may not be integrated into the processor 710.
- Embodiments of the present application further provide a readable storage medium, where a program or an instruction is stored on the readable storage medium, and when the program or instruction is executed by a processor, each process of the above-mentioned recording method embodiment can be implemented, and the same technology can be achieved. The effect, in order to avoid repetition, is not repeated here.
- the processor is the processor in the electronic device in the above embodiment.
- the readable storage medium includes a computer-readable storage medium, such as computer read-only memory (read-only memory, ROM), random access memory (random access memory, RAM), magnetic disk or optical disk, etc.
- An embodiment of the present application further provides a chip, the chip includes a processor and a communication interface, the communication interface is coupled to the processor, the processor is used for running a program or an instruction, implements each process of the above recording method embodiment, and can To achieve the same technical effect, in order to avoid repetition, details are not repeated here.
- the chip mentioned in the embodiments of the present application may also be referred to as a system-on-chip, a system-on-chip, a system-on-a-chip, or a system-on-a-chip, or the like.
- the method of the above embodiment can be implemented by means of software plus a necessary general hardware platform, and of course can also be implemented by hardware, but in many cases the former is better implementation.
- the technical solution of the present application can be embodied in the form of a software product in essence or in a part that contributes to the prior art, and the computer software product is stored in a storage medium (such as ROM/RAM, magnetic disk, CD-ROM), including several instructions to make a terminal (which may be a mobile phone, a computer, a server, an air conditioner, or a network device, etc.) execute the methods in the various embodiments of the present application.
- a storage medium such as ROM/RAM, magnetic disk, CD-ROM
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Quality & Reliability (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Studio Devices (AREA)
- Circuit For Audible Band Transducer (AREA)
Abstract
Description
Claims (13)
- 一种电子设备的录音方法,所述电子设备包括M个麦克风,且每个麦克风与一个第一语音通路和一个第二语音通路连接,M为大于或等于2的整数,所述方法包括:在处于视频拍摄状态的情况下,获取摄像头的拍摄焦距;若所述摄像头的拍摄焦距由初始焦距变化为目标焦距,则根据所述初始焦距、所述目标焦距和初始增益,确定目标增益,并将与第i个麦克风连接的第二语音通路的增益调整为目标增益,所述初始增益为与第i个麦克风连接的第一语音通路的增益,i依次取值为1、2……M;对与第i个麦克风连接的第一语音通路输出的语音信号和与第i个麦克风连接的第二语音通路输出的语音信号进行信号增强处理,得到第i个语音增强信号;对M个语音增强信号进行信号融合处理,得到第一录音信号。
- 根据权利要求1所述的方法,其中,所述根据所述初始焦距、所述目标焦距和初始增益,确定目标增益,包括:若所述目标焦距大于所述初始焦距,则将所述初始增益与第一增益之和,作为所述目标增益;或者,若所述目标焦距小于所述初始焦距,则将所述初始增益与第一增益之差,作为所述目标增益;其中,所述第一增益为目标值和预设值的乘积,所述目标值为所述目标焦距与所述初始焦距的差值的绝对值。
- 根据权利要求1或2所述的方法,其中,所述对M个语音增强信号进行信号融合处理,得到第一录音信号,包括:对所述M个语音增强信号进行信号融合处理,得到第一融合信号;获取所述摄像头的视场角和拍摄方向;根据所述摄像头的视场角和拍摄方向,对所述第一融合信号进行降噪处理,得到所述第一录音信号,所述降噪处理用于消除来自拍摄范围之外的噪声。
- 根据权利要求3所述的方法,其中,所述根据所述摄像头的视场角和拍摄方向,对所述第一融合信号进行降噪处理,得到所述第一录音信号之后,所述方法还包括:重新获取所述摄像头的拍摄焦距,以得到第二融合信号;重新获取所述摄像头的视场角和拍摄方向;在目标对象发生变化的情况下,根据变化后的目标对象,对所述第二融合信号进行降噪处理,得到第二录音信号,其中,所述目标对象包括视场角和拍摄方向中的至少一项。
- 根据权利要求1或2所述的方法,其中,所述在处于视频拍摄状态的情况下,获取摄像头的拍摄焦距之前,所述方法还包括:运行相机应用程序;将所述摄像头的拍摄焦距设置为初始焦距,并将所述摄像头的视场角设置为初始视场角,以及将每个语音通路的增益设置为所述初始增益。
- 一种录音装置,其中,所述录音装置包括M个麦克风,且每个麦克风与一个第一语音通路和一个第二语音通路连接,M为大于或等于2的整数,所述录音装置包括获取模块、确定模块和处理模块;所述获取模块,用于在处于视频拍摄状态的情况下,获取摄像头的拍摄焦距;所述确定模块,用于若所述获取模块获取的所述拍摄焦距由初始焦距变化为目标焦距,则根据所述初始焦距、所述目标焦距和初始增益,确定目标增益,所述初始增益为与第i个麦克风连接的所述第一语音通路的增益;所述处理模块,用于将与第i个麦克风连接的第二语音通路的增益调整为所述确定模块确定的所述目标增益;并对与第i个麦克风连接的第一语音通路输出的语音信号和与第i个麦克风连接的第二语音通路输出的语音信号进行信号增强处理,得到第i个语音增强信号;以及对M个语音增强信号进行信号融合处理,得到第一录音信号;其中,i依次取值为1、2……M。
- 根据权利要求6所述的录音装置,其中,所述确定模块,具体用于:若所述目标焦距大于所述初始焦距,则将所述初始增益与第一增益之和,作为所述目标增益;或者,若所述目标焦距小于所述初始焦距,则将所述初始增益与第一增益之差,作为所述目标增益;其中,所述第一增益为目标值和预设值的乘积,所述目标值为所述目标焦距与所述初始焦距的差值的绝对值。
- 根据权利要求6或7所述的录音装置,其中,所述处理模块,具体用于对所述M个语音增强信号进行信号融合处理,得到第一融合信号;所述获取模块,还用于获取所述摄像头的视场角和拍摄方向;所述处理模块,具体用于根据所述获取模块获取的所述摄像头的视场角和拍摄方向,对所述第一融合信号进行降噪处理,得到所述第一录音信号,所述降噪处理用于消除来自拍摄范围之外的噪声。
- 根据权利要求8所述的录音装置,其中,所述获取模块,用于在得到所述第一录音信号之后,重新获取所述摄像头的拍摄焦距,以得到第二融合信号;以及重新获取所述摄像头的视场角和拍摄方向;所述处理模块,还用于在目标对象发生变化的情况下,根据变化后的目标对象,对所述第二融合信号进行降噪处理,得到第二录音信号,其中,所述目标对象包括视场角和拍摄方向中的至少一项。
- 根据权利要求6或7所述的录音装置,其中,所述处理模块,还用于在获取摄像头的拍摄焦距之前,运行相机应用程序;所述处理模块,还用于将所述摄像头的拍摄焦距设置为初始焦距,并将所述摄像头的视场角设置为初始视场角,以及将每个语音通路的增益设置为所述初始增益。
- 一种电子设备,其中,包括处理器、存储器及存储在所述存储器上并可在所述处理器上运行的计算机程序,所述计算机程序被所述处理器执行时实现如权利要求1至5中任一项所述的录音方法的步骤。
- 一种电子设备,被配置成用于执行如权利要求1至5中任一项所述的录音方法。
- 一种可读存储介质,其中,所述可读存储介质上存储计算机程序,所述计算机程序被处理器执行时实现如权利要求1至4中任一项所述的录音方法的步骤。
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010760783.9 | 2020-07-31 | ||
CN202010760783.9A CN111916102B (zh) | 2020-07-31 | 2020-07-31 | 电子设备的录音方法及录音装置 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2022022647A1 true WO2022022647A1 (zh) | 2022-02-03 |
Family
ID=73287363
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2021/109323 WO2022022647A1 (zh) | 2020-07-31 | 2021-07-29 | 电子设备的录音方法及录音装置 |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN111916102B (zh) |
WO (1) | WO2022022647A1 (zh) |
Families Citing this family (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111916102B (zh) * | 2020-07-31 | 2024-05-28 | 维沃移动通信有限公司 | 电子设备的录音方法及录音装置 |
CN112492430B (zh) * | 2020-12-17 | 2023-12-15 | 维沃移动通信有限公司 | 电子设备和电子设备的录音方法 |
CN112689221B (zh) * | 2020-12-18 | 2023-05-30 | Oppo广东移动通信有限公司 | 录音方法、录音装置、电子设备及计算机可读存储介质 |
CN113099031B (zh) * | 2021-02-26 | 2022-05-17 | 华为技术有限公司 | 声音录制方法及相关设备 |
CN113472943B (zh) * | 2021-06-30 | 2022-12-09 | 维沃移动通信有限公司 | 音频处理方法、装置、设备及存储介质 |
Citations (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101510425A (zh) * | 2008-02-15 | 2009-08-19 | 株式会社东芝 | 声音识别装置以及用于执行声音识别的方法 |
CN103079148A (zh) * | 2012-12-28 | 2013-05-01 | 中兴通讯股份有限公司 | 一种终端双麦克风降噪的方法及装置 |
CN103888703A (zh) * | 2014-03-28 | 2014-06-25 | 深圳市中兴移动通信有限公司 | 增强录音的拍摄方法和摄像装置 |
CN104376847A (zh) * | 2013-08-12 | 2015-02-25 | 联想(北京)有限公司 | 一种语音信号处理方法和装置 |
CN104699445A (zh) * | 2013-12-06 | 2015-06-10 | 华为技术有限公司 | 一种音频信息处理方法及装置 |
CN106713793A (zh) * | 2015-11-18 | 2017-05-24 | 天津三星电子有限公司 | 一种声音播放控制方法及其装置 |
CN106774882A (zh) * | 2012-09-17 | 2017-05-31 | 联想(北京)有限公司 | 一种信息处理的方法及电子设备 |
CN107197090A (zh) * | 2017-05-18 | 2017-09-22 | 维沃移动通信有限公司 | 一种语音信号的接收方法及移动终端 |
EP3373037A1 (en) * | 2017-03-10 | 2018-09-12 | The Hi-Tech Robotic Systemz Ltd | Single casing advanced driver assistance system |
CN109313904A (zh) * | 2016-05-30 | 2019-02-05 | 索尼公司 | 视频音频处理设备、视频音频处理方法和程序 |
CN110970057A (zh) * | 2018-09-29 | 2020-04-07 | 华为技术有限公司 | 一种声音处理方法、装置与设备 |
CN111050269A (zh) * | 2018-10-15 | 2020-04-21 | 华为技术有限公司 | 音频处理方法和电子设备 |
CN111385728A (zh) * | 2018-12-29 | 2020-07-07 | 华为技术有限公司 | 一种音频信号处理方法及装置 |
CN111916102A (zh) * | 2020-07-31 | 2020-11-10 | 维沃移动通信有限公司 | 电子设备的录音方法及录音装置 |
US10923124B2 (en) * | 2013-05-24 | 2021-02-16 | Google Llc | Method and apparatus for using image data to aid voice recognition |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107197187A (zh) * | 2017-05-27 | 2017-09-22 | 维沃移动通信有限公司 | 一种视频的拍摄方法及移动终端 |
CN110995909B (zh) * | 2019-11-20 | 2021-03-30 | 维沃移动通信有限公司 | 一种声音补偿方法及装置 |
-
2020
- 2020-07-31 CN CN202010760783.9A patent/CN111916102B/zh active Active
-
2021
- 2021-07-29 WO PCT/CN2021/109323 patent/WO2022022647A1/zh active Application Filing
Patent Citations (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101510425A (zh) * | 2008-02-15 | 2009-08-19 | 株式会社东芝 | 声音识别装置以及用于执行声音识别的方法 |
CN106774882A (zh) * | 2012-09-17 | 2017-05-31 | 联想(北京)有限公司 | 一种信息处理的方法及电子设备 |
CN103079148A (zh) * | 2012-12-28 | 2013-05-01 | 中兴通讯股份有限公司 | 一种终端双麦克风降噪的方法及装置 |
US10923124B2 (en) * | 2013-05-24 | 2021-02-16 | Google Llc | Method and apparatus for using image data to aid voice recognition |
CN104376847A (zh) * | 2013-08-12 | 2015-02-25 | 联想(北京)有限公司 | 一种语音信号处理方法和装置 |
CN104699445A (zh) * | 2013-12-06 | 2015-06-10 | 华为技术有限公司 | 一种音频信息处理方法及装置 |
CN103888703A (zh) * | 2014-03-28 | 2014-06-25 | 深圳市中兴移动通信有限公司 | 增强录音的拍摄方法和摄像装置 |
CN106713793A (zh) * | 2015-11-18 | 2017-05-24 | 天津三星电子有限公司 | 一种声音播放控制方法及其装置 |
CN109313904A (zh) * | 2016-05-30 | 2019-02-05 | 索尼公司 | 视频音频处理设备、视频音频处理方法和程序 |
EP3373037A1 (en) * | 2017-03-10 | 2018-09-12 | The Hi-Tech Robotic Systemz Ltd | Single casing advanced driver assistance system |
CN107197090A (zh) * | 2017-05-18 | 2017-09-22 | 维沃移动通信有限公司 | 一种语音信号的接收方法及移动终端 |
CN110970057A (zh) * | 2018-09-29 | 2020-04-07 | 华为技术有限公司 | 一种声音处理方法、装置与设备 |
CN111050269A (zh) * | 2018-10-15 | 2020-04-21 | 华为技术有限公司 | 音频处理方法和电子设备 |
CN111385728A (zh) * | 2018-12-29 | 2020-07-07 | 华为技术有限公司 | 一种音频信号处理方法及装置 |
CN111916102A (zh) * | 2020-07-31 | 2020-11-10 | 维沃移动通信有限公司 | 电子设备的录音方法及录音装置 |
Also Published As
Publication number | Publication date |
---|---|
CN111916102B (zh) | 2024-05-28 |
CN111916102A (zh) | 2020-11-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2022022647A1 (zh) | 电子设备的录音方法及录音装置 | |
CN110970057B (zh) | 一种声音处理方法、装置与设备 | |
CN107105367B (zh) | 一种音频信号处理方法及终端 | |
US20160227336A1 (en) | Contextual Switching of Microphones | |
US20100150360A1 (en) | Audio source localization system and method | |
US10461712B1 (en) | Automatic volume leveling | |
CN112291672B (zh) | 扬声器的控制方法、控制装置以及电子设备 | |
CN110390953B (zh) | 啸叫语音信号的检测方法、装置、终端及存储介质 | |
CN115831155A (zh) | 音频信号的处理方法、装置、电子设备及存储介质 | |
US20230014836A1 (en) | Method for chorus mixing, apparatus, electronic device and storage medium | |
CN111462764A (zh) | 音频编码方法、装置、计算机可读存储介质及设备 | |
WO2023151526A1 (zh) | 音频采集方法、装置、电子设备及外设组件 | |
CN113160846A (zh) | 噪声抑制方法和电子设备 | |
CN112735370A (zh) | 一种语音信号处理方法、装置、电子设备和存储介质 | |
WO2023016053A1 (zh) | 一种声音信号处理方法及电子设备 | |
CN113077808B (zh) | 一种语音处理方法、装置和用于语音处理的装置 | |
CN109348021B (zh) | 移动终端及音频播放方法 | |
WO2016109103A1 (en) | Directional audio capture | |
CN111508513A (zh) | 音频处理方法及装置、计算机存储介质 | |
US11646046B2 (en) | Psychoacoustic enhancement based on audio source directivity | |
CN114758669B (zh) | 音频处理模型的训练、音频处理方法、装置及电子设备 | |
WO2024077452A1 (zh) | 音频处理方法、装置、设备及存储介质 | |
CN113450823B (zh) | 基于音频的场景识别方法、装置、设备及存储介质 | |
CN116913328B (zh) | 音频处理方法、电子设备及存储介质 | |
CN113380248B (zh) | 语音控制方法、装置、设备及存储介质 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 21850535 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 21850535 Country of ref document: EP Kind code of ref document: A1 |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 21850535 Country of ref document: EP Kind code of ref document: A1 |
|
32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 02.08.2023) |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 21850535 Country of ref document: EP Kind code of ref document: A1 |