WO2019061192A1 - Audio processing method and related products - Google Patents

Audio processing method and related products

Info

Publication number
WO2019061192A1
Authority
WO
WIPO (PCT)
Prior art keywords
audio
digital signal
recording
converting
memory
Prior art date
Application number
PCT/CN2017/104090
Other languages
English (en)
French (fr)
Inventor
王周丹
夏相声
Original Assignee
深圳传音通讯有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳传音通讯有限公司
Priority to PCT/CN2017/104090
Publication of WO2019061192A1

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F15/00 Digital computers in general; Data processing equipment in general
    • G06F15/16 Combinations of two or more digital computers each having at least an arithmetic unit, a program unit and a register, e.g. for a simultaneous processing of several programs
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04M TELEPHONIC COMMUNICATION
    • H04M1/00 Substation equipment, e.g. for use by subscribers
    • H04M1/72 Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/725 Cordless telephones

Definitions

  • The present invention relates to the field of audio technologies, and in particular to an audio processing method and related products.
  • With the rapid development of information technology, mobile terminals (such as mobile phones and tablet computers) are becoming increasingly popular, and users' requirements for them keep rising.
  • Many mobile terminals (mobile phones, tablet computers, etc.) have a recording function, so that people can keep the original voice they want to preserve.
  • In the related art, the recording software of a mobile terminal records audio through the built-in microphone.
  • During recording, when re-recording is needed, the currently recorded audio has to be abandoned: the user either stops the current recording and deletes it, or saves it and creates a new file for recording. This series of operations takes a long time and may cause the best recording moment to be missed.
  • Embodiments of the invention provide an audio processing method and related products that can simplify the re-recording steps.
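  • A first aspect of the embodiments of the present invention provides an audio processing method, including:
  • during recording of a first audio, if it is detected that the first audio needs to be re-recorded, converting the first audio into a first digital signal and saving the first digital signal in a memory;
  • acquiring a second audio and converting the second audio into a second digital signal;
  • calling the first digital signal and overwriting the first digital signal with the second digital signal to obtain a target audio file.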
  • A second aspect of the embodiments of the present invention provides an audio processing apparatus, including:
  • a first processing unit configured to, during recording of a first audio, if it is detected that the first audio needs to be re-recorded, convert the first audio into a first digital signal and save the first digital signal in a memory;
  • a first recording unit configured to acquire a second audio and convert the second audio into a second digital signal;
  • a second processing unit configured to call the first digital signal and overwrite the first digital signal with the second digital signal to obtain a target audio file.
  • In a third aspect, an embodiment of the present invention provides a mobile terminal, including: a processor and a memory; and one or more programs, where the one or more programs are stored in the memory and configured to be executed by the processor, the programs including instructions for some or all of the steps described in the first aspect.
  • In a fourth aspect, an embodiment of the present invention provides a computer readable storage medium configured to store a computer program, where the computer program causes a computer to execute instructions for some or all of the steps described in the first aspect of the embodiments of the present invention.
  • An embodiment of the present invention provides a computer program product, where the computer program product comprises a non-transitory computer readable storage medium storing a computer program, the computer program being operative to cause a computer to execute some or all of the steps described in the first aspect of the embodiments of the invention.
  • The computer program product can be a software installation package.
  • FIG. 1a is a schematic flowchart diagram of a first embodiment of an audio processing method according to an embodiment of the present invention.
  • FIG. 1b is a schematic diagram showing a recording interface of a mobile terminal according to an embodiment of the present invention.
  • FIG. 2 is a schematic flowchart diagram of a second embodiment of an audio processing method according to an embodiment of the present disclosure
  • FIG. 3 is a schematic flow chart of a third embodiment of an audio processing method according to an embodiment of the present invention.
  • FIG. 4a is a schematic structural diagram of an embodiment of an audio processing device according to an embodiment of the present invention.
  • FIG. 4b is a schematic structural diagram of a first processing unit of the audio processing device described in FIG. 4a according to an embodiment of the present invention
  • FIG. 4c is another schematic structural diagram of a first processing unit of the audio processing device described in FIG. 4a according to an embodiment of the present invention
  • FIG. 4d is a schematic structural diagram of a second processing unit of the audio processing device described in FIG. 4a according to an embodiment of the present invention
  • FIG. 4e is another schematic structural diagram of a second processing unit of the audio processing device described in FIG. 4a according to an embodiment of the present invention
  • FIG. 4f is still another schematic structural diagram of the audio processing apparatus described in FIG. 4a according to an embodiment of the present invention.
  • FIG. 4g is still another schematic structural diagram of the audio processing device depicted in FIG. 4a according to an embodiment of the present disclosure
  • FIG. 5 is a schematic structural diagram of an embodiment of a mobile terminal according to an embodiment of the present invention.
  • the embodiment of the present invention provides an audio processing method and related products, which can simplify the operation steps of re-recording.
  • References to "an embodiment" herein mean that a particular feature, structure, or characteristic described in connection with the embodiment can be included in at least one embodiment of the invention. The appearances of the phrase in various places in the specification do not necessarily all refer to the same embodiment, nor to separate or alternative embodiments that are mutually exclusive of other embodiments. Those skilled in the art will understand, explicitly and implicitly, that the embodiments described herein can be combined with other embodiments.
  • the mobile terminal described in the embodiments of the present invention may include a smart phone (such as an Android mobile phone, an IOS mobile phone, a Windows Phone mobile phone, etc.), a tablet computer, a palmtop computer, a notebook computer, a mobile Internet device (MID, Mobile Internet Devices), or a wearable device.
  • FIG. 1a is a schematic flowchart diagram of a first embodiment of an audio processing method according to an embodiment of the present invention. As shown in FIG. 1a, the audio processing method described in this embodiment includes the following steps:
  • 101. During recording of the first audio, if it is detected that the first audio needs to be re-recorded, the first audio is converted into a first digital signal, and the first digital signal is saved in a memory.
  • While the first audio is being recorded and the recording is not yet finished, a virtual button for re-recording can be added to the recording interface of the mobile terminal so that the user can input a re-recording instruction.
  • If the user wants to abandon the current recording and record a second audio instead, the re-recording instruction can be input directly in the recording interface during the recording of the first audio.
  • In step 101, converting the first audio into the first digital signal may include the following steps:
  • dividing the first audio into a plurality of audio segments;
  • performing analog-to-digital conversion on the plurality of audio segments by using a plurality of processes to obtain the first digital signal, where the plurality of audio segments are in one-to-one correspondence with the plurality of processes.
  • The first audio is a sound signal recorded by the mobile terminal.
  • The durations of the audio segments may be equal or unequal.
  • Using multiple processes to perform analog-to-digital conversion on the audio segments can improve the conversion speed, so that the first digital signal is obtained quickly.
  • Each of the audio segments corresponds to exactly one of the processes.
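The one-to-one mapping between audio segments and workers described above can be sketched as follows. This is only an illustration under stated assumptions: the names `convert_segment` and `convert_in_parallel` are hypothetical, the input is assumed to be already-sampled values in [-1.0, 1.0], and a thread pool stands in for the "processes" of the text.

```python
from concurrent.futures import ThreadPoolExecutor


def convert_segment(segment, levels=256):
    """Hypothetical per-segment converter: map samples in [-1.0, 1.0] to integer codes."""
    top = levels - 1
    return [min(top, max(0, round((s + 1.0) / 2.0 * top))) for s in segment]


def convert_in_parallel(segments):
    """Convert every segment with its own worker and join the results in order."""
    if not segments:
        return []
    # One worker per segment mirrors the one-to-one correspondence in the text
    # (assumption: threads are used here in place of separate processes).
    with ThreadPoolExecutor(max_workers=len(segments)) as pool:
        converted = list(pool.map(convert_segment, segments))
    # pool.map returns results in input order, so the joined signal stays in time order.
    return [code for segment in converted for code in segment]
```

Because the results are joined in input order, segments of unequal length are handled the same way as equal ones.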
  • To convert the first audio into the first digital signal, the first audio may first be converted into an analog signal, then amplified by a processor, and then subjected to analog-to-digital conversion by a converter to obtain the first digital signal.
  • Dividing the first audio into multiple audio segments may include the following steps:
  • A1. Determine a duration of the first audio.
  • A2. Divide the first audio into the plurality of audio segments according to the duration.
  • The first audio may be divided according to its duration.
  • A division rule is preset. The rule may fix the duration T of each audio segment at a standard duration T0, where T0 is a number greater than 0, in which case the number of audio segments is determined by the duration of the first audio.
  • Alternatively, the duration T of each audio segment may be preset to lie within a range [T1, T2], where T1 and T2 are numbers greater than 0, and the number of audio segments is again determined by the duration of the first audio.
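The fixed-duration rule above can be sketched as follows. The function name `split_by_fixed_duration` and the choice to let the last segment be shorter than T0 are assumptions made for illustration; the [T1, T2] variant is not covered.

```python
import math


def split_by_fixed_duration(total_seconds, t0):
    """Return (start, end) pairs covering [0, total_seconds] in steps of t0 seconds."""
    if t0 <= 0:
        raise ValueError("t0 must be greater than 0")
    # The number of segments follows from the duration of the audio.
    count = math.ceil(total_seconds / t0)
    # The last segment is clipped so that it ends with the audio (assumption).
    return [(i * t0, min((i + 1) * t0, total_seconds)) for i in range(count)]
```

For example, a 10-second recording with T0 = 4 yields three segments, the last one 2 seconds long.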
  • In converting the first audio into the first digital signal, the first audio may first be quantized to obtain a first quantized audio, which is then converted into the first digital signal.
  • The second audio may be acquired in either of two ways: first, through the microphone of the mobile terminal; second, by reading an audio file from a designated location in the memory as the second audio. The designated location can be set by the user or by system default.
  • To convert the second audio into the second digital signal, the method of converting the first audio into the first digital signal in step 101 may be adopted.
  • Specifically, the second audio is divided into a plurality of audio segments, and the audio segments are subjected to analog-to-digital conversion by using a plurality of processes to obtain the second digital signal, where the audio segments are in one-to-one correspondence with the processes.
  • In converting the second audio into the second digital signal, the second audio may first be quantized to obtain a second quantized audio, which is then converted into the second digital signal.
  • In step 102, converting the second audio into a second digital signal may include the following steps:
  • quantizing the second audio to obtain a second quantized audio;
  • converting the second quantized audio into the second digital signal.
  • The second audio can be quantized directly by using a quantization function to define a quantizer.
  • Specifically, the second audio may first be sampled to obtain sample points, and the sampled audio signal is then quantized using the quantization function.
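The sample-then-quantize sequence above can be sketched as follows. The text does not say which quantization function is used, so a uniform quantizer is assumed here; `sample`, `quantize`, the [-1.0, 1.0] input range, and the 8-bit default are all hypothetical choices.

```python
def sample(signal, duration, rate):
    """Evaluate a continuous-time function signal(t) at `rate` samples per second."""
    count = int(duration * rate)
    return [signal(n / rate) for n in range(count)]


def quantize(samples, bits=8):
    """Uniform quantizer (assumption): clamp to [-1.0, 1.0], map to codes 0 .. 2**bits - 1."""
    top = (1 << bits) - 1
    codes = []
    for s in samples:
        clamped = max(-1.0, min(1.0, s))
        codes.append(round((clamped + 1.0) / 2.0 * top))
    return codes
```

Values outside the assumed range are clipped rather than rejected, which is the usual behavior when an input overdrives a converter.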
  • The first digital signal may be called from the memory, and the second digital signal overwrites the first digital signal, so that the audio track of the first audio is overwritten.
  • Overwriting the first digital signal with the second digital signal may include the following steps:
  • acquiring a starting position of the first digital signal;
  • overwriting the first digital signal with the second digital signal, taking the starting position as the starting point.
  • The starting position of the first digital signal may be determined according to the time header file of the first digital signal.
  • Alternatively, when the first digital signal is saved in the memory in step 101, its starting position may be recorded; the second digital signal then overwrites the first digital signal from that starting position.
  • Alternatively, overwriting the first digital signal with the second digital signal may include the following steps:
  • clearing the first digital signal;
  • overwriting the cleared first digital signal with the second digital signal.
  • In step 101, the first digital signal is stored in the memory and its end position is recorded. After the first digital signal is cleared, the second digital signal overwrites the first digital signal, taking the above end position as the starting point.
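The two overwrite variants above can be sketched on plain lists of sample codes. The function names are hypothetical, and two points are assumptions rather than statements from the text: any part of the first signal beyond the end of the second is kept, and the clearing variant writes from index 0 (the recorded end position mentioned in the text is not modeled).

```python
def overlay_from_start(first, second, start=0):
    """Write `second` over `first` beginning at index `start`; `first` is not modified."""
    if start < 0 or start > len(first):
        raise ValueError("start must lie within the first signal")
    end = start + len(second)
    # Keep what precedes `start`, then the new samples, then any old tail (assumption).
    return first[:start] + second + first[end:]


def overlay_after_clearing(first, second):
    """Zero the first signal, then write `second` over it from index 0 (assumption)."""
    cleared = [0] * len(first)
    return overlay_from_start(cleared, second, 0)
```

The difference shows when the second take is shorter than the first: the direct variant leaves old samples after the new ones, while the clearing variant leaves silence.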
  • In this embodiment, the first audio is converted into a first digital signal and the first digital signal is stored in the memory; a second audio is acquired and converted into a second digital signal; the first digital signal is then called and overwritten with the second digital signal to obtain the target audio file.
  • In this way, re-recording can be started directly during the recording of the first audio, which simplifies the re-recording steps and shortens the time interval before recording of the second audio starts.
  • FIG. 2 is a schematic flowchart of a second embodiment of an audio processing method according to an embodiment of the present invention.
  • While the first audio is being recorded and the recording is not yet finished, a virtual button for re-recording can be added to the recording interface of the mobile terminal so that the user can input a re-recording instruction.
  • If the user wants to abandon the current recording and record a second audio instead, the re-recording instruction can be input directly in the recording interface during the recording of the first audio.
  • The first audio may be divided into multiple audio segments, and each of the audio segments is then subjected to analog-to-digital conversion separately, thereby shortening the time this step takes.
  • In converting the first audio into the first digital signal, the first audio may first be quantized to obtain a first quantized audio, which is then converted into the first digital signal.
  • The first audio is a sound signal recorded by the mobile terminal.
  • The first audio may first be converted into an analog signal, then amplified by the processor, and then subjected to analog-to-digital conversion by the converter to obtain the first digital signal.
  • The second audio may be acquired in either of two ways: first, through the microphone of the mobile terminal; second, by reading an audio file from a designated location in the memory as the second audio. The designated location can be set by the user or by system default.
  • The second audio may be subjected to noise reduction processing to improve its quality.
  • In the noise reduction processing, a waveform sample of the noise is taken, and the second audio and the noise sample are then analyzed together to remove the noise.
  • Performing noise reduction processing on the second audio may include the following steps:
  • The noise reduction processing on the second audio may be implemented by using an adaptive filtering algorithm, for example least mean squares (LMS).
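An LMS-based noise canceller of the kind mentioned above can be sketched as follows. This is a textbook formulation under stated assumptions, not the procedure of the patent: `lms_denoise` is a hypothetical name, `noise_ref` is assumed to be a separately captured noise sample correlated with the noise in the recording, and the tap count and step size `mu` are arbitrary illustrative values.

```python
def lms_denoise(primary, noise_ref, taps=4, mu=0.05):
    """Adaptive noise cancelling with LMS; returns the error signal as the cleaned audio."""
    weights = [0.0] * taps
    history = [0.0] * taps  # most recent reference samples, newest first
    cleaned = []
    for d, x in zip(primary, noise_ref):
        history = [x] + history[:-1]
        # Current estimate of the noise contained in the primary sample.
        estimate = sum(w * h for w, h in zip(weights, history))
        error = d - estimate  # what remains after removing the estimated noise
        # LMS update: move the weights along the reference vector, scaled by the error.
        weights = [w + mu * error * h for w, h in zip(weights, history)]
        cleaned.append(error)
    return cleaned
```

The step size must stay small relative to the energy of the reference signal, otherwise the weight update diverges instead of converging.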
  • To convert the second audio into the second digital signal, the method of converting the first audio into the first digital signal in the foregoing step 202 may be adopted.
  • Specifically, the second audio may be divided into multiple audio segments, and the audio segments are subjected to analog-to-digital conversion by using a plurality of processes to obtain the second digital signal, where the audio segments are in one-to-one correspondence with the processes.
  • In converting the second audio into the second digital signal, the second audio may first be quantized to obtain a second quantized audio, which is then converted into the second digital signal.
  • The first digital signal may be called from the memory, and the second digital signal overwrites the first digital signal, so that the audio track of the first audio is overwritten.
  • When the second digital signal overwrites the first digital signal, the first digital signal may be overwritten directly from its starting position; alternatively, the first digital signal may be cleared first, and the cleared first digital signal is then overwritten with the second digital signal.
  • In this embodiment, the mobile terminal receives the re-recording instruction; during recording of the first audio, if the mobile terminal detects that the first audio needs to be re-recorded, it converts the first audio into a first digital signal and stores the first digital signal in a memory, acquires a second audio, performs noise reduction processing on the second audio, and converts the noise-reduced second audio into a second digital signal. The first digital signal is then called and overwritten with the second digital signal to obtain a target audio file. In this way, re-recording can be started directly during the recording of the first audio, which simplifies the re-recording steps and shortens the time interval before recording of the second audio starts.
  • FIG. 3 is a schematic flowchart of a third embodiment of an audio processing method according to an embodiment of the present invention.
  • The audio processing method described in this embodiment includes the following steps:
  • While the first audio is being recorded and the recording is not yet finished, a virtual button for re-recording can be added to the recording interface of the mobile terminal so that the user can input a re-recording instruction.
  • If the user wants to abandon the current recording and record a second audio instead, the re-recording instruction can be input directly in the recording interface during the recording of the first audio.
  • The first audio may be divided into multiple audio segments, and each of the audio segments is then subjected to analog-to-digital conversion separately, thereby shortening the time this step takes.
  • In converting the first audio into the first digital signal, the first audio may first be quantized to obtain a first quantized audio, which is then converted into the first digital signal.
  • The first audio is a sound signal recorded by the mobile terminal.
  • The first audio may first be converted into an analog signal, then amplified by the processor, and then subjected to analog-to-digital conversion by the converter to obtain the first digital signal.
  • The second audio may be acquired in either of two ways: first, through the microphone of the mobile terminal; second, by reading an audio file from a designated location in the memory as the second audio. The designated location can be set by the user or by system default.
  • To convert the second audio into the second digital signal, the method of converting the first audio into the first digital signal in the foregoing step 302 may be adopted.
  • Specifically, the second audio may be divided into multiple audio segments, and the audio segments are subjected to analog-to-digital conversion by using a plurality of processes to obtain the second digital signal, where the audio segments are in one-to-one correspondence with the processes.
  • In converting the second audio into the second digital signal, the second audio may first be quantized to obtain a second quantized audio, which is then converted into the second digital signal.
  • The first digital signal can be called from the memory, and the second digital signal overwrites the first digital signal, so that the audio track of the first audio is overwritten.
  • When the second digital signal overwrites the first digital signal, the first digital signal may be overwritten directly from its starting position; alternatively, the first digital signal may be cleared first, and the cleared first digital signal is then overwritten with the second digital signal.
  • In this embodiment, the mobile terminal receives the re-recording instruction; during recording of the first audio, if the mobile terminal detects that the first audio needs to be re-recorded, it converts the first audio into a first digital signal and stores the first digital signal in a memory, acquires a second audio, and converts the second audio into a second digital signal. The first digital signal is then called and overwritten with the second digital signal to obtain a target audio file.
  • In this way, re-recording can be started directly during the recording of the first audio, which simplifies the re-recording steps and shortens the time interval before recording of the second audio starts.
  • FIG. 4a is a schematic structural diagram of an embodiment of an audio processing apparatus according to an embodiment of the present invention.
  • The audio processing device described in this embodiment includes: a first processing unit 401, a first recording unit 402, and a second processing unit 403, as follows:
  • the first processing unit 401 is configured to, during recording of a first audio, if it is detected that the first audio needs to be re-recorded, convert the first audio into a first digital signal and store the first digital signal in the memory;
  • the first recording unit 402 is configured to acquire a second audio and convert the second audio into a second digital signal;
  • the second processing unit 403 is configured to call the first digital signal and overwrite the first digital signal with the second digital signal to obtain a target audio file.
  • FIG. 4b is a specific refinement structure of the first processing unit 401 of the audio processing device described in FIG. 4a, and the first processing unit 401 may include: a dividing module 4011 and a converting module 4012, details as follows:
  • a dividing module 4011 configured to divide the first audio into multiple audio segments
  • the converting module 4012 is configured to perform analog-to-digital conversion on the plurality of audio segments by using a plurality of processes to obtain the first digital signal, where the plurality of audio segments are in one-to-one correspondence with the plurality of processes.
  • FIG. 4c is another specific refinement structure of the first processing unit 401 of the audio processing device described in FIG. 4a. The first processing unit 401 may include: a quantization module 4013 and a conversion module 4014, as follows:
  • a quantization module 4013 configured to perform quantization processing on the second audio to obtain second quantized audio;
  • the conversion module 4014 is configured to convert the second quantized audio into the second digital signal.
  • FIG. 4d is a specific refinement structure of the second processing unit 403 of the audio processing device described in FIG. 4a, and the second processing unit 403 may include: an obtaining module 4031 and a first overlay module 4032, as follows:
  • an obtaining module 4031 configured to acquire a starting position of the first digital signal;
  • the first overlay module 4032 is configured to overwrite the first digital signal with the second digital signal, taking the starting position as the starting point.
  • FIG. 4e is another specific refinement structure of the second processing unit 403 of the audio processing device described in FIG. 4a, and the second processing unit 403 may include: a clearing module 4033 and a second overlay module 4034, as follows:
  • a clearing module 4033 configured to perform zeroing processing on the first digital signal;
  • the second overlay module 4034 is configured to overwrite the cleared first digital signal with the second digital signal.
  • FIG. 4f is still another modified structure of the audio processing device described in FIG. 4a.
  • Compared with FIG. 4a, the audio processing device in FIG. 4f may further include a noise reduction unit 404, as follows:
  • the noise reduction unit 404 is configured to perform noise reduction processing on the second audio.
  • FIG. 4g is still another modified structure of the audio processing device described in FIG. 4a.
  • Compared with FIG. 4a, the audio processing device in FIG. 4g may further include a receiving unit 405, as follows:
  • the receiving unit 405 is configured to receive a re-recording instruction.
  • In this embodiment, the mobile terminal receives the re-recording instruction; during recording of the first audio, if the mobile terminal detects that the first audio needs to be re-recorded, it converts the first audio into a first digital signal and stores the first digital signal in a memory, acquires a second audio, and converts the second audio into a second digital signal. The first digital signal is then called and overwritten with the second digital signal to obtain a target audio file. In this way, re-recording can be started directly during the recording of the first audio, which simplifies the re-recording steps and shortens the time interval before recording of the second audio starts.
  • The audio processing device described in the device embodiment of the present invention is presented in the form of functional units.
  • The term "unit" as used herein should be understood in the broadest possible sense. The object implementing the functions described for each "unit" may be, for example, an application-specific integrated circuit (ASIC), a single circuit, a processor (shared, dedicated, or chipset) and memory for executing one or more software or firmware programs, combinational logic circuits, and/or other suitable components that perform the functions described above.
  • For example, the function of the first processing unit 401 (during recording of the first audio, if it is detected that the first audio needs to be re-recorded, converting the first audio into a first digital signal and storing the first digital signal in the memory) may be implemented by the mobile terminal shown in FIG. 5. Specifically, the processor 3000 may call executable program code in the memory 4000 so that, during recording of the first audio, if it is detected that the first audio needs to be re-recorded, the first audio is converted into a first digital signal and the first digital signal is saved in a memory.
  • FIG. 5 is a schematic structural diagram of an embodiment of a mobile terminal according to an embodiment of the present invention.
  • The mobile terminal described in this embodiment includes: at least one input device 1000; at least one output device 2000; at least one processor 3000, such as a CPU; and a memory 4000. The input device 1000, the output device 2000, the processor 3000, and the memory 4000 are connected by a bus 5000.
  • the input device 1000 may specifically include a touch panel, a physical button, a mouse, and a microphone.
  • the output device 2000 described above may specifically be a display screen.
  • the above memory 4000 may be a high speed RAM memory or a non-volatile memory such as a magnetic disk memory.
  • the above memory 4000 is used to store a set of program codes, and the input device 1000, the output device 2000, and the processor 3000 are used to call the program code stored in the memory 4000, and perform the following operations:
  • the processor 3000 is configured to:
  • the first digital signal is called, and the second digital signal is overlaid on the first digital signal to obtain a target audio file.
  • the processor 3000 converts the first audio into a first digital signal, including:
  • the plurality of audio segments are analog-to-digital converted by using a plurality of processes to obtain the first digital signal, and the plurality of audio segments are in one-to-one correspondence with the plurality of processes.
  • the processor 3000 divides the first audio into multiple audio segments, including:
  • the first audio is divided into the plurality of audio segments according to the duration.
  • the processor 3000 converts the second audio into a second digital signal, including:
  • processor 3000 is further configured to:
  • the second audio is subjected to noise reduction processing.
  • the processor 3000 converts the second audio into a second digital signal, including:
  • the second audio after the noise reduction process is converted into a second digital signal.
  • the foregoing processor 3000 acquires the second audio, including:
  • the processor 3000 overwrites the first digital signal with the second digital signal, including:
  • overwriting the first digital signal with the second digital signal, taking the starting position as the starting point.
  • the processor 3000 overwrites the first digital signal with the second digital signal, including:
  • overwriting the cleared first digital signal with the second digital signal.
  • processor 3000 is further configured to:
  • An embodiment of the present invention further provides a computer storage medium, where the computer storage medium can store a program, and the program, when executed, performs some or all of the steps of any one of the audio processing methods described in the foregoing method embodiments.
  • Embodiments of the present invention also provide a computer program product comprising a non-transitory computer readable storage medium storing a computer program, the computer program being operative to cause a computer to perform some or all of the steps of any one of the audio processing methods described in the above method embodiments.
  • embodiments of the present invention can be provided as a method, apparatus (device), or computer program product. Accordingly, the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment, or a combination of software and hardware. Moreover, the invention can take the form of a computer program product embodied on one or more computer-usable storage media (including but not limited to disk storage, CD-ROM, optical storage, etc.) including computer usable program code.
  • the computer program is stored/distributed in a suitable medium, provided with other hardware or as part of the hardware, or in other distributed forms, such as over the Internet or other wired or wireless telecommunication systems.
  • The computer program instructions can also be stored in a computer readable memory that can direct a computer or other programmable data processing device to operate in a particular manner, such that the instructions stored in the computer readable memory produce an article of manufacture comprising an instruction device. The instruction device implements the functions specified in one or more flows of the flowchart and/or one or more blocks of the block diagram.
  • These computer program instructions can also be loaded onto a computer or other programmable data processing device such that a series of operational steps are performed on the computer or other programmable device to produce computer-implemented processing. The instructions executed on the computer or other programmable device thus provide steps for implementing the functions specified in one or more flows of the flowchart and/or one or more blocks of the block diagram.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Hardware Design (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Software Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Telephone Function (AREA)

Abstract

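Embodiments of the present invention provide an audio processing method and related products. The method includes: during recording of a first audio, if it is detected that the first audio needs to be re-recorded, converting the first audio into a first digital signal and saving the first digital signal in a memory; acquiring a second audio and converting the second audio into a second digital signal; and retrieving the first digital signal and overwriting the first digital signal with the second digital signal to obtain a target audio file. The embodiments of the present invention reduce the number of operation steps needed to re-record and shorten the delay before recording of the second audio begins.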

Description

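Audio processing method and related products
Technical Field
The present invention relates to the field of audio technologies, and in particular to an audio processing method and related products.
Background
With the rapid development of information technology, mobile terminals (such as mobile phones and tablet computers) are becoming increasingly widespread, and users' expectations of them keep rising. Many mobile terminals have a recording function, which lets people keep the original audio they want to preserve.
In the related art, recording software on a mobile terminal records audio through a built-in microphone. When re-recording is needed during a recording, the audio recorded so far has to be abandoned: the user stops the current recording and deletes it, or saves it and creates a new file to record into. This series of operations takes a long time and may cause the best recording moment to be missed.
Summary of the Invention
Embodiments of the present invention provide an audio processing method and related products that can simplify the re-recording operation steps.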
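A first aspect of the embodiments of the present invention provides an audio processing method, including:
during recording of a first audio, if it is detected that the first audio needs to be re-recorded, converting the first audio into a first digital signal and saving the first digital signal in a memory;
acquiring a second audio and converting the second audio into a second digital signal; and
retrieving the first digital signal and overwriting the first digital signal with the second digital signal to obtain a target audio file.
A second aspect of the embodiments of the present invention provides an audio processing apparatus, including:
a first processing unit, configured to, during recording of a first audio, if it is detected that the first audio needs to be re-recorded, convert the first audio into a first digital signal and save the first digital signal in a memory;
a first recording unit, configured to acquire a second audio and convert the second audio into a second digital signal; and
a second processing unit, configured to retrieve the first digital signal and overwrite the first digital signal with the second digital signal to obtain a target audio file.
In a third aspect, an embodiment of the present invention provides a mobile terminal, including: a processor and a memory; and one or more programs, where the one or more programs are stored in the memory and configured to be executed by the processor, and the programs include instructions for performing some or all of the steps described in the first aspect.
In a fourth aspect, an embodiment of the present invention provides a computer-readable storage medium for storing a computer program, where the computer program causes a computer to perform some or all of the steps described in the first aspect of the embodiments of the present invention.
In a fifth aspect, an embodiment of the present invention provides a computer program product, where the computer program product includes a non-transitory computer-readable storage medium storing a computer program, and the computer program is operable to cause a computer to perform some or all of the steps described in the first aspect of the embodiments of the present invention. The computer program product may be a software installation package.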
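Brief Description of the Drawings
To describe the technical solutions in the embodiments of the present invention more clearly, the drawings needed for describing the embodiments are briefly introduced below. The drawings described below show some embodiments of the present invention, and a person of ordinary skill in the art can derive other drawings from them without creative effort.
FIG. 1a is a schematic flowchart of a first embodiment of an audio processing method according to an embodiment of the present invention;
FIG. 1b is a schematic illustration of a recording interface of a mobile terminal according to an embodiment of the present invention;
FIG. 2 is a schematic flowchart of a second embodiment of an audio processing method according to an embodiment of the present invention;
FIG. 3 is a schematic flowchart of a third embodiment of an audio processing method according to an embodiment of the present invention;
FIG. 4a is a schematic structural diagram of an embodiment of an audio processing apparatus according to an embodiment of the present invention;
FIG. 4b is a schematic structural diagram of the first processing unit of the audio processing apparatus shown in FIG. 4a;
FIG. 4c is another schematic structural diagram of the first processing unit of the audio processing apparatus shown in FIG. 4a;
FIG. 4d is a schematic structural diagram of the second processing unit of the audio processing apparatus shown in FIG. 4a;
FIG. 4e is another schematic structural diagram of the second processing unit of the audio processing apparatus shown in FIG. 4a;
FIG. 4f is another schematic structural diagram of the audio processing apparatus shown in FIG. 4a;
FIG. 4g is yet another schematic structural diagram of the audio processing apparatus shown in FIG. 4a;
FIG. 5 is a schematic structural diagram of an embodiment of a mobile terminal according to an embodiment of the present invention.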
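Detailed Description
In the related art, the following situation arises while recording audio: partway through recording a first audio, the user wants to abandon the current recording and record a second audio instead. The user has to choose, on the display interface of the mobile terminal, to end the recording of the first audio and then delete the first audio, or save the first audio and create a new file to record into. This series of operations takes a long time and may cause the best recording moment to be missed. The embodiments of the present invention therefore provide an audio processing method and related products that can simplify the re-recording operation steps.
The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the drawings. The described embodiments are some, not all, of the embodiments of the present invention. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments of the present invention without creative effort fall within the protection scope of the present invention.
The terms "first", "second", "third", "fourth", and the like in the specification, claims, and drawings of the present invention are used to distinguish different objects, not to describe a particular order. In addition, the terms "include" and "have" and any variants thereof are intended to cover non-exclusive inclusion. For example, a process, method, system, product, or device that includes a series of steps or units is not limited to the listed steps or units, but optionally further includes steps or units that are not listed, or optionally further includes other steps or units inherent to the process, method, product, or device.
Reference herein to an "embodiment" means that a particular feature, structure, or characteristic described in connection with the embodiment can be included in at least one embodiment of the present invention. The appearances of this phrase in various places in the specification do not necessarily all refer to the same embodiment, nor to separate or alternative embodiments that are mutually exclusive of other embodiments. A person skilled in the art understands, explicitly and implicitly, that the embodiments described herein can be combined with other embodiments.
The mobile terminal described in the embodiments of the present invention may include a smartphone (such as an Android phone, an iOS phone, or a Windows Phone phone), a tablet computer, a palmtop computer, a notebook computer, a mobile Internet device (MID), a wearable device, or the like. These are examples rather than an exhaustive list; the mobile terminal includes but is not limited to the devices above.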
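Refer to FIG. 1a, which is a schematic flowchart of a first embodiment of an audio processing method according to an embodiment of the present invention. As shown in FIG. 1a, the audio processing method described in this embodiment includes the following steps:
101. During recording of a first audio, if it is detected that the first audio needs to be re-recorded, convert the first audio into a first digital signal and save the first digital signal in a memory.
In this embodiment of the present invention, as shown in FIG. 1b, while the first audio is being recorded and the recording has not yet finished, a virtual re-record button added to the recording interface of the mobile terminal lets the user input a re-record instruction.
For example, with a virtual re-record button added to the recording interface, a user who wants to abandon the current recording and record a second audio can input the re-record instruction directly on the recording interface while the first audio is being recorded.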
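Optionally, in step 101, converting the first audio into the first digital signal may include the following steps:
11. Divide the first audio into a plurality of audio segments;
12. Perform analog-to-digital conversion on the plurality of audio segments using a plurality of processes to obtain the first digital signal, where the audio segments correspond one-to-one to the processes.
Here, the first audio is the sound signal recorded by the mobile terminal. The audio segments may be of equal or unequal duration. Converting the segments with a plurality of processes speeds up the analog-to-digital conversion, so the first digital signal is obtained quickly. Each of the audio segments corresponds to exactly one process.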
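The segment-per-process conversion of steps 11 and 12 can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: real analog-to-digital conversion happens in hardware, so `convert_segment` is a hypothetical stand-in that maps floating-point amplitudes to 16-bit integers, and the names `convert_in_parallel` and `executor_cls` are introduced here for illustration only.

```python
from concurrent.futures import ProcessPoolExecutor


def convert_segment(segment):
    """Stand-in for A/D conversion: map floats in [-1.0, 1.0] to 16-bit integers."""
    return [max(-32768, min(32767, round(x * 32767))) for x in segment]


def convert_in_parallel(segments, executor_cls=ProcessPoolExecutor):
    """Convert each segment in its own worker and join the results in segment order."""
    if not segments:
        return []
    # One worker per segment mirrors the one-to-one segment/process correspondence.
    with executor_cls(max_workers=len(segments)) as pool:
        converted = list(pool.map(convert_segment, segments))  # map() keeps input order
    return [sample for part in converted for sample in part]
```

On platforms that start worker processes by spawning a fresh interpreter, the call to `convert_in_parallel` should sit under an `if __name__ == "__main__":` guard. Passing `ThreadPoolExecutor` as `executor_cls` is a convenient way to try the sketch without starting processes.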
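Optionally, to convert the first audio into the first digital signal, the first audio may first be converted into an analog signal, then amplified by the processor, and then analog-to-digital converted by a converter to obtain the first digital signal.
Optionally, in step 11, dividing the first audio into a plurality of audio segments may include the following steps:
A1. Determine the duration of the first audio;
A2. Divide the first audio into the plurality of audio segments according to the duration.
The first audio can be divided according to its duration. For example, a division rule may be preset that fixes the duration T of each resulting audio segment at a standard duration T0, where T0 is a number greater than 0; the number of segments is then determined from the duration of the first audio. Alternatively, the duration T of each resulting audio segment may be preset to lie within a range [T1, T2], where T1 and T2 are numbers greater than 0, and the number of segments is again determined from the duration of the first audio.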
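The two division rules can be sketched as below. The text does not say what happens when the duration is not an exact multiple of T0, nor how a length inside [T1, T2] is chosen, so this sketch assumes a shorter final segment in the first case and equal-length segments in the second. The function names are hypothetical.

```python
import math


def split_fixed(duration, t0):
    """Split `duration` into segments of length t0; the last one may be shorter."""
    if duration <= 0 or t0 <= 0:
        raise ValueError("duration and t0 must be positive")
    count = math.ceil(duration / t0)
    return [(i * t0, min((i + 1) * t0, duration)) for i in range(count)]


def split_in_range(duration, t1, t2):
    """Split `duration` into equal segments whose length lies in [t1, t2]."""
    if duration <= 0 or not 0 < t1 <= t2:
        raise ValueError("require duration > 0 and 0 < t1 <= t2")
    count = math.ceil(duration / t2)  # fewest segments that keep each one <= t2
    length = duration / count
    if length < t1:
        raise ValueError("no equal split fits within [t1, t2]")
    return [(i * length, (i + 1) * length) for i in range(count)]
```

Each returned pair is a (start, end) time for one segment, in the same unit as `duration`.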
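Optionally, in converting the first audio into the first digital signal, the first audio may be quantized, and the resulting first quantized audio may then be converted into the first digital signal.
102. Acquire a second audio and convert the second audio into a second digital signal.
Optionally, the second audio may be acquired in either of two ways: first, through the microphone of the mobile terminal; second, by reading an audio file from a specified location in the memory as the second audio, where the specified location may be set by the user or be the system default.
Optionally, in step 102, the second audio may be converted into the second digital signal in the same way that the first audio is converted into the first digital signal in step 101. For example, the second audio may be divided into a plurality of audio segments, and analog-to-digital conversion may be performed on the segments using a plurality of processes to obtain the second digital signal, where the audio segments correspond one-to-one to the processes.
In converting the second audio into the second digital signal, the second audio may be quantized, and the resulting second quantized audio may then be converted into the second digital signal.
Optionally, in step 102, converting the second audio into the second digital signal may include the following steps:
21. Quantize the second audio to obtain a second quantized audio;
22. Convert the second quantized audio into the second digital signal.
The second audio can be quantized by using a quantization function directly to define a quantizer. Specifically, the second audio may first be sampled to obtain sample points, and the quantization function is then applied to the sampled audio signal.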
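The sample-then-quantize sequence can be illustrated with a uniform quantizer. The text does not name a particular quantization function, so the uniform mapping, the amplitude range [-1.0, 1.0], and the names `sample` and `quantize` are assumptions made for this sketch.

```python
def sample(signal, duration, rate):
    """Sample a continuous-time function `signal(t)` at `rate` samples per second."""
    return [signal(n / rate) for n in range(int(duration * rate))]


def quantize(samples, bits=8):
    """Uniform quantizer: map amplitudes in [-1.0, 1.0] to codes in [0, 2**bits - 1]."""
    levels = 2 ** bits
    codes = []
    for x in samples:
        x = max(-1.0, min(1.0, x))            # clip out-of-range amplitudes
        code = int((x + 1.0) / 2.0 * levels)  # index of the quantization interval
        codes.append(min(code, levels - 1))   # x == 1.0 falls into the top interval
    return codes
```

A larger `bits` value gives finer amplitude resolution at the cost of more storage per sample.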
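103. Retrieve the first digital signal and overwrite the first digital signal with the second digital signal to obtain a target audio file.
In this embodiment of the present invention, the first digital signal can be retrieved from the memory and overwritten with the second digital signal, so that the track of the first audio is recorded over.
Optionally, in step 103, overwriting the first digital signal with the second digital signal may include the following steps:
31. Obtain the starting position of the first digital signal;
32. Taking the starting position as the starting point, overwrite the first digital signal with the second digital signal.
The starting position of the first digital signal can be determined from the time header file of the first digital signal. Alternatively, the starting position can be recorded while the first digital signal is being saved in the memory in step 101. The second digital signal then overwrites the first digital signal from that starting position.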
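Steps 31 and 32 amount to writing one buffer over another from a known offset. The sketch below assumes the first digital signal is held in a Python `bytearray` and that `start` is the recorded starting position; the function name is hypothetical. The text does not say what happens to any tail of the first signal that the second signal does not reach, so this sketch simply leaves it in place.

```python
def overwrite_from(first, second, start):
    """Write `second` over `first` beginning at index `start`."""
    if not 0 <= start <= len(first):
        raise ValueError("start must lie within the first signal")
    # Slice assignment replaces the overlapped bytes and appends any remainder
    # of `second` that runs past the end of `first`.
    first[start:start + len(second)] = second
    return first
```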
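Optionally, in step 103, overwriting the first digital signal with the second digital signal may include the following steps:
B1. Clear the first digital signal to zero;
B2. Overwrite the cleared first digital signal with the second digital signal.
The ending position of the first digital signal can be recorded while the first digital signal is being saved in the memory in step 101. After the first digital signal is cleared, the second digital signal overwrites the first digital signal with that ending position as the starting point.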
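Steps B1 and B2 can be sketched as a clear followed by a write. The text leaves open how the recorded ending position maps to a write offset, so this sketch takes the offset as a `position` parameter rather than fixing one interpretation; the function name and the use of a `bytearray` are assumptions.

```python
def clear_then_overwrite(first, second, position=0):
    """Zero every byte of `first`, then write `second` into it at `position`."""
    if not 0 <= position <= len(first):
        raise ValueError("position must lie within the first signal")
    first[:] = bytes(len(first))                       # B1: same length, all zeros
    first[position:position + len(second)] = second    # B2: write the second signal
    return first
```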
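As can be seen, with this embodiment of the present invention, during recording of a first audio, if it is detected that the first audio needs to be re-recorded, the first audio is converted into a first digital signal and the first digital signal is saved in a memory; meanwhile, a second audio is acquired and converted into a second digital signal; the first digital signal is then retrieved and overwritten with the second digital signal to obtain a target audio file. Re-recording can therefore start directly while the first audio is being recorded, which simplifies the re-recording operation steps and shortens the delay before recording of the second audio begins.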
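Consistent with the above, refer to FIG. 2, which is a schematic flowchart of a second embodiment of an audio processing method according to an embodiment of the present invention. As shown in FIG. 2, the audio processing method described in this embodiment includes the following steps:
201. During recording of a first audio, receive a re-record instruction.
In this embodiment of the present invention, while the first audio is being recorded and the recording has not yet finished, a virtual re-record button added to the recording interface of the mobile terminal lets the user input a re-record instruction.
For example, with a virtual re-record button added to the recording interface, a user who wants to abandon the current recording and record a second audio can input the re-record instruction directly on the recording interface while the first audio is being recorded.
202. Convert the first audio into a first digital signal and save the first digital signal in a memory.
Optionally, if the recorded first audio is long, it may be divided into a plurality of audio segments, and analog-to-digital conversion may be performed on each segment separately, which shortens the time this step takes.
Optionally, in converting the first audio into the first digital signal, the first audio may be quantized, and the resulting first quantized audio may then be converted into the first digital signal.
Here, the first audio is the sound signal recorded by the mobile terminal. Specifically, the first audio may first be converted into an analog signal, then amplified by the processor, and then analog-to-digital converted by a converter to obtain the first digital signal.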
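203. Acquire a second audio.
Optionally, the second audio may be acquired in either of two ways: first, through the microphone of the mobile terminal; second, by reading an audio file from a specified location in the memory as the second audio, where the specified location may be set by the user or be the system default.
204. Perform noise reduction on the second audio.
In this embodiment of the present invention, noise reduction may be performed on the second audio to improve its quality. To do so, a waveform sample of the noise may be captured, and the second audio and the noise sample are then analyzed to remove the noise.
Optionally, performing noise reduction on the second audio may include the following steps:
41. Obtain a noise sample;
42. Analyze the second audio and the noise sample, and remove the noise from the second audio with a filter.
The noise reduction on the second audio may be implemented with the least mean square (LMS) adaptive filtering algorithm.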
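A textbook LMS noise canceller gives a feel for steps 41 and 42. The sketch below is a generic formulation, not necessarily the filter the applicant used: it assumes the noise sample is available as a reference signal aligned sample-by-sample with the second audio, and the function name, `taps`, and `mu` (the step size) are illustrative choices.

```python
def lms_denoise(noisy, noise_ref, taps=4, mu=0.01):
    """Adaptive noise cancellation: subtract the LMS estimate of the noise from `noisy`."""
    weights = [0.0] * taps
    history = [0.0] * taps              # most recent reference samples, newest first
    cleaned = []
    for d, x in zip(noisy, noise_ref):
        history = [x] + history[:-1]
        estimate = sum(w * h for w, h in zip(weights, history))
        error = d - estimate            # the error doubles as the cleaned output sample
        # LMS update: move each weight along the gradient of the squared error.
        weights = [w + 2 * mu * error * h for w, h in zip(weights, history)]
        cleaned.append(error)
    return cleaned
```

The step size trades convergence speed against stability: a larger `mu` adapts faster but can diverge when the reference signal is strong.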
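205. Convert the noise-reduced second audio into a second digital signal.
Optionally, the second audio may be converted into the second digital signal in the same way that the first audio is converted into the first digital signal in step 202. Specifically, the second audio may be divided into a plurality of audio segments, and analog-to-digital conversion may be performed on the segments using a plurality of processes to obtain the second digital signal, where the audio segments correspond one-to-one to the processes.
Optionally, to convert the second audio into the second digital signal, the second audio may alternatively first be quantized to obtain a second quantized audio, which is then converted into the second digital signal.
206. Retrieve the first digital signal and overwrite the first digital signal with the second digital signal to obtain a target audio file.
In this embodiment of the present invention, the first digital signal can be retrieved from the memory and overwritten with the second digital signal, so that the track of the first audio is recorded over.
The second digital signal may overwrite the first digital signal directly from the starting position of the first digital signal; alternatively, the first digital signal may first be cleared to zero, and the second digital signal then overwrites the cleared first digital signal.
As can be seen, with this embodiment of the present invention, the mobile terminal receives a re-record instruction; during recording of a first audio, if the mobile terminal detects that the first audio needs to be re-recorded, it converts the first audio into a first digital signal and saves the first digital signal in a memory; meanwhile, it acquires a second audio, performs noise reduction on the second audio, and converts the noise-reduced second audio into a second digital signal; it then retrieves the first digital signal and overwrites it with the second digital signal to obtain a target audio file. Re-recording can therefore start directly while the first audio is being recorded, which simplifies the re-recording operation steps and shortens the delay before recording of the second audio begins.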
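Consistent with the above, refer to FIG. 3, which is a schematic flowchart of a third embodiment of an audio processing method according to an embodiment of the present invention. The audio processing method described in this embodiment includes the following steps:
301. During recording of a first audio, receive a re-record instruction.
In this embodiment of the present invention, while the first audio is being recorded and the recording has not yet finished, a virtual re-record button added to the recording interface of the mobile terminal lets the user input a re-record instruction.
For example, with a virtual re-record button added to the recording interface, a user who wants to abandon the current recording and record a second audio can input the re-record instruction directly on the recording interface while the first audio is being recorded.
302. Convert the first audio into a first digital signal and save the first digital signal in a memory.
Optionally, if the recorded first audio is long, it may be divided into a plurality of audio segments, and analog-to-digital conversion may be performed on each segment separately, which shortens the time this step takes.
Optionally, in converting the first audio into the first digital signal, the first audio may be quantized, and the resulting first quantized audio may then be converted into the first digital signal.
Here, the first audio is the sound signal recorded by the mobile terminal. Specifically, the first audio may first be converted into an analog signal, then amplified by the processor, and then analog-to-digital converted by a converter to obtain the first digital signal.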
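303. Acquire a second audio and convert the second audio into a second digital signal.
Optionally, the second audio may be acquired in either of two ways: first, through the microphone of the mobile terminal; second, by reading an audio file from a specified location in the memory as the second audio, where the specified location may be set by the user or be the system default.
Optionally, the second audio may be converted into the second digital signal in the same way that the first audio is converted into the first digital signal in step 302. Specifically, the second audio may be divided into a plurality of audio segments, and analog-to-digital conversion may be performed on the segments using a plurality of processes to obtain the second digital signal, where the audio segments correspond one-to-one to the processes.
Optionally, to convert the second audio into the second digital signal, the second audio may alternatively first be quantized to obtain a second quantized audio, which is then converted into the second digital signal.
304. Retrieve the first digital signal and overwrite the first digital signal with the second digital signal to obtain a target audio file.
In this embodiment of the present invention, the first digital signal can be retrieved from the memory and overwritten with the second digital signal, so that the track of the first audio is recorded over.
The second digital signal may overwrite the first digital signal directly from the starting position of the first digital signal; alternatively, the first digital signal may first be cleared to zero, and the second digital signal then overwrites the cleared first digital signal.
As can be seen, with this embodiment of the present invention, the mobile terminal receives a re-record instruction; during recording of a first audio, if the mobile terminal detects that the first audio needs to be re-recorded, it converts the first audio into a first digital signal and saves the first digital signal in a memory; meanwhile, it acquires a second audio and converts the second audio into a second digital signal; it then retrieves the first digital signal and overwrites it with the second digital signal to obtain a target audio file. Re-recording can therefore start directly while the first audio is being recorded, which simplifies the re-recording operation steps and shortens the delay before recording of the second audio begins.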
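Consistent with the above, an apparatus for implementing the foregoing audio processing method is described below:
Refer to FIG. 4a, which is a schematic structural diagram of an embodiment of an audio processing apparatus according to an embodiment of the present invention. The audio processing apparatus described in this embodiment includes a first processing unit 401, a first recording unit 402, and a second processing unit 403, as follows:
The first processing unit 401 is configured to, during recording of a first audio, if it is detected that the first audio needs to be re-recorded, convert the first audio into a first digital signal and save the first digital signal in a memory;
The first recording unit 402 is configured to acquire a second audio and convert the second audio into a second digital signal;
The second processing unit 403 is configured to retrieve the first digital signal and overwrite the first digital signal with the second digital signal to obtain a target audio file.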
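Optionally, FIG. 4b shows a detailed structure of the first processing unit 401 of the audio processing apparatus described in FIG. 4a. The first processing unit 401 may include a dividing module 4011 and a conversion module 4012, as follows:
The dividing module 4011 is configured to divide the first audio into a plurality of audio segments;
The conversion module 4012 is configured to perform analog-to-digital conversion on the plurality of audio segments using a plurality of processes to obtain the first digital signal, where the audio segments correspond one-to-one to the processes.
Optionally, FIG. 4c shows another detailed structure of the first processing unit 401 of the audio processing apparatus described in FIG. 4a. The first processing unit 401 may include a quantization module 4013 and a conversion module 4014, as follows:
The quantization module 4013 is configured to quantize the second audio to obtain a second quantized audio;
The conversion module 4014 is configured to convert the second quantized audio into the second digital signal.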
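Optionally, FIG. 4d shows a detailed structure of the second processing unit 403 of the audio processing apparatus described in FIG. 4a. The second processing unit 403 may include an obtaining module 4031 and a first overwriting module 4032, as follows:
The obtaining module 4031 is configured to obtain the starting position of the first digital signal;
The first overwriting module 4032 is configured to overwrite the first digital signal with the second digital signal, taking the starting position as the starting point.
Optionally, FIG. 4e shows another detailed structure of the second processing unit 403 of the audio processing apparatus described in FIG. 4a. The second processing unit 403 may include a clearing module 4033 and a second overwriting module 4034, as follows:
The clearing module 4033 is configured to clear the first digital signal to zero;
The second overwriting module 4034 is configured to overwrite the cleared first digital signal with the second digital signal.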
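Optionally, FIG. 4f shows another variant structure of the audio processing apparatus described in FIG. 4a. Compared with FIG. 4a, the audio processing apparatus in FIG. 4f may further include a noise reduction unit 404, as follows:
The noise reduction unit 404 is configured to perform noise reduction on the second audio.
Optionally, FIG. 4g shows yet another variant structure of the audio processing apparatus described in FIG. 4a. Compared with FIG. 4a, the audio processing apparatus in FIG. 4g may further include a receiving unit 405, as follows:
The receiving unit 405 is configured to receive a re-record instruction.
As can be seen, with this embodiment of the present invention, the mobile terminal receives a re-record instruction; during recording of a first audio, if the mobile terminal detects that the first audio needs to be re-recorded, it converts the first audio into a first digital signal and saves the first digital signal in a memory; meanwhile, it acquires a second audio and converts the second audio into a second digital signal; it then retrieves the first digital signal and overwrites it with the second digital signal to obtain a target audio file. Re-recording can therefore start directly while the first audio is being recorded, which simplifies the re-recording operation steps and shortens the delay before recording of the second audio begins.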
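Note that the audio processing apparatus described in the apparatus embodiments of the present invention is presented in the form of functional units. The term "unit" as used here should be understood in the broadest possible sense. The object that implements the function described for each "unit" may be, for example, an application-specific integrated circuit (ASIC), a single circuit, a processor (shared, dedicated, or chipset) and memory that execute one or more software or firmware programs, a combinational logic circuit, and/or other suitable components that provide the functions described above.
For example, the function of the first processing unit 401, namely converting the first audio into a first digital signal and saving the first digital signal in a memory if it is detected during recording of the first audio that the first audio needs to be re-recorded, can be implemented by the mobile terminal shown in FIG. 5. Specifically, the processor 3000 may call executable program code in the memory 4000 to, during recording of the first audio, if it is detected that the first audio needs to be re-recorded, convert the first audio into the first digital signal and save the first digital signal in the memory.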
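Consistent with the above, refer to FIG. 5, which is a schematic structural diagram of an embodiment of a mobile terminal according to an embodiment of the present invention. The mobile terminal described in this embodiment includes: at least one input device 1000; at least one output device 2000; at least one processor 3000, such as a CPU; and a memory 4000. The input device 1000, the output device 2000, the processor 3000, and the memory 4000 are connected through a bus 5000.
The input device 1000 may specifically include a touch panel, physical buttons, a mouse, and a microphone.
The output device 2000 may specifically be a display screen.
The memory 4000 may be a high-speed RAM or a non-volatile memory such as a disk memory. The memory 4000 is configured to store a set of program code, and the input device 1000, the output device 2000, and the processor 3000 are configured to call the program code stored in the memory 4000 to perform the following operations: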
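The processor 3000 is configured to:
during recording of a first audio, if it is detected that the first audio needs to be re-recorded, convert the first audio into a first digital signal and save the first digital signal in a memory;
acquire a second audio and convert the second audio into a second digital signal; and
retrieve the first digital signal and overwrite the first digital signal with the second digital signal to obtain a target audio file.
Optionally, the processor 3000 converting the first audio into the first digital signal includes:
dividing the first audio into a plurality of audio segments; and
performing analog-to-digital conversion on the plurality of audio segments using a plurality of processes to obtain the first digital signal, where the audio segments correspond one-to-one to the processes.
Optionally, the processor 3000 dividing the first audio into a plurality of audio segments includes:
determining the duration of the first audio; and
dividing the first audio into the plurality of audio segments according to the duration.
Optionally, the processor 3000 converting the second audio into the second digital signal includes:
quantizing the second audio to obtain a second quantized audio; and
converting the second quantized audio into the second digital signal.
Optionally, the processor 3000 is further specifically configured to:
perform noise reduction on the second audio.
Optionally, the processor 3000 converting the second audio into the second digital signal includes:
converting the noise-reduced second audio into the second digital signal.
Optionally, the processor 3000 acquiring the second audio includes:
reading the second audio from a specified location in the memory; or acquiring the second audio through a microphone.
Optionally, the processor 3000 overwriting the first digital signal with the second digital signal includes:
obtaining the starting position of the first digital signal; and
taking the starting position as the starting point, overwriting the first digital signal with the second digital signal.
Optionally, the processor 3000 overwriting the first digital signal with the second digital signal includes:
clearing the first digital signal to zero; and
overwriting the cleared first digital signal with the second digital signal.
Optionally, the processor 3000 is further specifically configured to:
receive a re-record instruction, and perform the acquiring of the second audio.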
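An embodiment of the present invention further provides a computer storage medium that can store a program; when executed, the program performs some or all of the steps of any of the audio processing methods described in the foregoing method embodiments.
An embodiment of the present invention further provides a computer program product, where the computer program product includes a non-transitory computer-readable storage medium storing a computer program, and the computer program is operable to cause a computer to perform some or all of the steps of any of the audio processing methods described in the foregoing method embodiments.
Although the present invention has been described herein in connection with various embodiments, a person skilled in the art, in practicing the claimed invention, can understand and effect other variations of the disclosed embodiments by studying the drawings, the disclosure, and the appended claims. In the claims, the word "comprising" does not exclude other components or steps, and "a" or "an" does not exclude a plurality. A single processor or other unit may fulfill the functions of several items recited in the claims. The mere fact that certain measures are recited in mutually different dependent claims does not indicate that these measures cannot be combined to good effect.
A person skilled in the art should understand that embodiments of the present invention can be provided as a method, an apparatus (device), or a computer program product. Accordingly, the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware. Moreover, the present invention may take the form of a computer program product implemented on one or more computer-usable storage media (including but not limited to disk storage, CD-ROM, and optical storage) that contain computer-usable program code. The computer program is stored or distributed on a suitable medium, supplied together with other hardware or as part of the hardware, or distributed in other forms, such as over the Internet or other wired or wireless telecommunication systems.
The present invention is described with reference to flowcharts and/or block diagrams of methods, apparatuses (devices), and computer program products according to embodiments of the present invention. It should be understood that computer program instructions can implement each flow and/or block in the flowcharts and/or block diagrams, and combinations of flows and/or blocks in the flowcharts and/or block diagrams. These computer program instructions can be provided to a processor of a general-purpose computer, a special-purpose computer, an embedded processor, or another programmable data processing device to produce a machine, such that the instructions executed by the processor of the computer or other programmable data processing device produce an apparatus for implementing the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.
These computer program instructions can also be stored in a computer-readable memory that can direct a computer or other programmable data processing device to operate in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture that includes an instruction device, and the instruction device implements the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.
These computer program instructions can also be loaded onto a computer or other programmable data processing device, such that a series of operational steps is performed on the computer or other programmable device to produce computer-implemented processing, and the instructions executed on the computer or other programmable device thereby provide steps for implementing the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.
Although the present invention has been described in connection with specific features and embodiments thereof, it is evident that various modifications and combinations can be made without departing from the spirit and scope of the present invention. Accordingly, this specification and the drawings are merely exemplary illustrations of the present invention as defined by the appended claims, and are deemed to cover any and all modifications, variations, combinations, or equivalents within the scope of the present invention. A person skilled in the art can make various changes and variations to the present invention without departing from its spirit and scope. If these modifications and variations fall within the scope of the claims of the present invention and their equivalent technologies, the present invention is also intended to include them.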

Claims (10)

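  1. An audio processing method, characterized in that the method comprises:
    during recording of a first audio, if it is detected that the first audio needs to be re-recorded, converting the first audio into a first digital signal and saving the first digital signal in a memory;
    acquiring a second audio and converting the second audio into a second digital signal; and
    retrieving the first digital signal and overwriting the first digital signal with the second digital signal to obtain a target audio file.
  2. The method according to claim 1, characterized in that the converting the first audio into a first digital signal comprises:
    dividing the first audio into a plurality of audio segments; and
    performing analog-to-digital conversion on the plurality of audio segments using a plurality of processes to obtain the first digital signal, wherein the plurality of audio segments correspond one-to-one to the plurality of processes.
  3. The method according to claim 2, characterized in that the dividing the first audio into a plurality of audio segments comprises:
    determining a duration of the first audio; and
    dividing the first audio into the plurality of audio segments according to the duration.
  4. The method according to any one of claims 1 to 3, characterized in that the converting the second audio into a second digital signal comprises:
    quantizing the second audio to obtain a second quantized audio; and
    converting the second quantized audio into the second digital signal.
  5. The method according to any one of claims 1 to 3, characterized in that the method further comprises:
    performing noise reduction on the second audio;
    wherein the converting the second audio into a second digital signal comprises:
    converting the noise-reduced second audio into the second digital signal.
  6. The method according to any one of claims 1 to 3, characterized in that the acquiring a second audio comprises:
    reading the second audio from a specified location in the memory; or acquiring the second audio through a microphone.
  7. The method according to any one of claims 1 to 3, characterized in that the overwriting the first digital signal with the second digital signal comprises:
    obtaining a starting position of the first digital signal; and
    taking the starting position as a starting point, overwriting the first digital signal with the second digital signal.
  8. An audio processing apparatus, characterized in that the apparatus comprises:
    a first processing unit, configured to, during recording of a first audio, if it is detected that the first audio needs to be re-recorded, convert the first audio into a first digital signal and save the first digital signal in a memory;
    a first recording unit, configured to acquire a second audio and convert the second audio into a second digital signal; and
    a second processing unit, configured to retrieve the first digital signal and overwrite the first digital signal with the second digital signal to obtain a target audio file.
  9. A mobile terminal, characterized by comprising:
    a processor and a memory, wherein the processor calls code or instructions in the memory to perform the method according to any one of claims 1 to 8.
  10. A computer storage medium, characterized in that it is configured to store a computer program, wherein the computer program causes a computer to perform the method according to any one of claims 1 to 8.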
PCT/CN2017/104090 2017-09-28 2017-09-28 Audio processing method and related products WO2019061192A1 (zh)

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/CN2017/104090 WO2019061192A1 (zh) 2017-09-28 2017-09-28 Audio processing method and related products

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2017/104090 WO2019061192A1 (zh) 2017-09-28 2017-09-28 Audio processing method and related products

Publications (1)

Publication Number Publication Date
WO2019061192A1 true WO2019061192A1 (zh) 2019-04-04

Family

ID=65900229

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2017/104090 WO2019061192A1 (zh) 2017-09-28 2017-09-28 Audio processing method and related products

Country Status (1)

Country Link
WO (1) WO2019061192A1 (zh)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2021123949A1 (en) 2019-12-20 2021-06-24 Idorsia Pharmaceuticals Ltd Pharmaceutical compositions comprising n-[1-(5-cyano-pyridin-2-ylmethyl)-1h-pyrazol-3-yl]-2-[4-(1-trifluoromethyl-cyclopropyl)-phenyl]-acetamide

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070005812A1 (en) * 2005-06-29 2007-01-04 Intel Corporation Asynchronous communicative exchange
WO2014020723A1 (ja) * 2012-08-01 2014-02-06 株式会社コナミデジタルエンタテインメント Processing device, control method for processing device, and program for processing device
CN106710597A (zh) * 2017-01-04 2017-05-24 广东小天才科技有限公司 Method and device for recording voice data

Similar Documents

Publication Publication Date Title
US10785541B2 (en) Screencast recording method, screencast playing method, screen recording terminal, and playing terminal
US9754621B2 (en) Appending information to an audio recording
EP2991372B1 (en) Method and apparatus for managing audio signals
US20200219503A1 (en) Method and apparatus for filtering out voice instruction
US20140241702A1 (en) Dynamic audio perspective change during video playback
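TW201518933A (zh) Test device and test method thereof
CN109285556B (zh) Audio processing method, apparatus, device, and storage medium
CN104580888A (zh) Image processing method and terminal
US9466310B2 (en) Compensating for identifiable background content in a speech recognition device
US20200097528A1 (en) Method and Device for Quickly Inserting Text of Speech Carrier
WO2016197708A1 (zh) Recording method and terminal
CN111048093A (zh) Conference speaker, and conference recording method, device, system, and computer storage medium
US20120004913A1 (en) Method and apparatus for controlling operation of portable terminal using microphone
US10424299B2 (en) Voice command masking systems and methods
WO2014166230A1 (zh) Automatic recording method, apparatus, and mobile terminal
CN111508531A (zh) Audio processing method and device
CN111435600A (zh) Method and apparatus for processing audio
CN112397102B (zh) Audio processing method, device, and terminal
WO2019061192A1 (zh) Audio processing method and related products
WO2021212985A1 (zh) Acoustic network model training method, apparatus, and electronic device
JP6852478B2 (ja) Communication terminal, communication program, and communication method
WO2015131634A1 (zh) Sound noise reduction method and terminal
WO2016107104A1 (zh) Method and terminal for recording voice communication information, and computer storage medium
CN113157240A (zh) Speech processing method, apparatus, device, storage medium, and computer program product
CN105718174B (zh) Interface switching method and switching system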

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17927181

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 17927181

Country of ref document: EP

Kind code of ref document: A1