Video noise reduction method, mobile terminal and computer-readable storage medium
Technical field
The present invention relates to the technical field of video noise reduction, and more particularly to a video noise reduction method, a mobile terminal and a computer-readable storage medium.
Background art
With the rapid development of mobile terminal technology, mobile terminals integrate communication, photography, audio-visual playback and other functions, and have become an indispensable part of people's daily lives. Thanks to breakthroughs in high-definition camera technology, the pixel count of mobile terminal cameras keeps rising, so that photos taken with a mobile terminal can rival those taken with a dedicated camera. Because mobile terminals are also easy to carry, people increasingly use them in place of traditional cameras to take photos or record video in daily life and when travelling.
At present, with the development of social networks, people increasingly like to record what they see and hear with a mobile terminal while travelling or in daily life, and to share the recorded video on social networks. However, the recorded video contains various noisy sounds that easily drown out the user's own voice, so the user has to shoot repeatedly to obtain a video in which the human voice is sufficiently clear. Therefore, how to improve the clarity of the human voice in a video is a problem to be solved urgently.
Summary of the invention
The main purpose of the present invention is to provide a video noise reduction method, a mobile terminal and a computer-readable storage medium, with the aim of improving the clarity of the human voice in a video.
To achieve the above object, the present invention provides a video noise reduction method, comprising:
when a video noise reduction instruction is detected, obtaining a video file to be denoised according to the video noise reduction instruction;
separating human voice audio data and background sound audio data from the video file to be denoised;
performing a corresponding noise reduction operation on the video file to be denoised according to a preset noise reduction algorithm, the human voice audio data and the background sound audio data, to obtain a target video file.
Optionally, the step of performing a corresponding noise reduction operation on the video file to be denoised according to the preset noise reduction algorithm, the human voice audio data and the background sound audio data, to obtain the target video file, comprises:
removing the background sound audio data from the video file to be denoised, to obtain a target video file containing only the human voice audio data.
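As a minimal sketch of this option, assume the audio track is a list of samples and that the mix is the sample-wise sum of the voice and the background (an assumption made here for illustration; the source does not state a signal model). Removing the background then reduces to a sample-wise subtraction. The names `SeparatedAudio` and `reject_background` are hypothetical.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class SeparatedAudio:
    """Hypothetical container for the output of the separation step."""
    voice: List[float]
    background: List[float]


def reject_background(mixed: List[float], separated: SeparatedAudio) -> List[float]:
    """Subtract the separated background from the mix, sample by sample.

    Assumes an additive mix: mixed[i] == voice[i] + background[i].
    """
    if len(mixed) != len(separated.background):
        raise ValueError("mixed track and background track must have equal length")
    return [m - b for m, b in zip(mixed, separated.background)]
```

Under the additive assumption the result equals the separated voice track, which would then be written back as the only audio of the target video file.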
Optionally, the step of performing a corresponding noise reduction operation on the video file to be denoised according to the preset noise reduction algorithm, the human voice audio data and the background sound audio data, to obtain the target video file, comprises:
obtaining a preset voiceprint, and obtaining, from the human voice audio data, first human voice audio data that contains the preset voiceprint and second human voice audio data that does not contain the preset voiceprint;
removing the background sound audio data and the second human voice audio data from the video file to be denoised, to obtain a target video file containing only the first human voice audio data.
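The voiceprint option can be sketched as follows, assuming each voice segment has already been reduced to a fixed-length speaker embedding and that a segment "contains" the preset voiceprint when its cosine similarity to the preset embedding reaches a threshold. The embedding representation, the threshold of 0.8 and the function names are all illustrative assumptions; the source does not say how voiceprints are compared.

```python
import math
from typing import List, Sequence, Tuple


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def split_by_voiceprint(
    segments: List[Tuple[str, Sequence[float]]],
    preset_voiceprint: Sequence[float],
    threshold: float = 0.8,  # hypothetical match threshold
) -> Tuple[List[str], List[str]]:
    """Split segment ids into (matching preset voiceprint, not matching)."""
    first, second = [], []
    for segment_id, embedding in segments:
        if cosine_similarity(embedding, preset_voiceprint) >= threshold:
            first.append(segment_id)   # kept in the target video file
        else:
            second.append(segment_id)  # removed together with the background
    return first, second
```

The first list corresponds to the first human voice audio data and the second list to the second human voice audio data in the text above.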
Optionally, the step of performing a corresponding noise reduction operation on the video file to be denoised according to the preset noise reduction algorithm, the human voice audio data and the background sound audio data, to obtain the target video file, comprises:
labelling the various background sounds in the background sound audio data based on a preset background sound classification model, to obtain background sound audio data carrying several background sound labels;
determining whether the labelled background sound audio data contains first background sound audio data carrying a preset background sound label;
if the labelled background sound audio data contains first background sound audio data carrying the preset background sound label, obtaining, from the labelled background sound audio data, second background sound audio data whose background sound label is not the preset background sound label;
removing the second background sound audio data from the video file to be denoised, to obtain a target video file containing the first background sound audio data and the human voice audio data.
Optionally, after the step of determining whether the labelled background sound audio data contains first background sound audio data carrying a preset background sound label, the method further comprises:
if the labelled background sound audio data does not contain first background sound audio data carrying the preset background sound label, displaying a background sound rejection interface, and receiving a first background sound label selected via the background sound rejection interface;
obtaining, from the labelled background sound audio data, third background sound audio data whose background sound label is the first background sound label;
obtaining, from the labelled background sound audio data, fourth background sound audio data whose background sound label is not the first background sound label;
removing the third background sound audio data from the video file to be denoised, to obtain a target video file containing the fourth background sound audio data and the human voice audio data.
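The two label-based branches above can be sketched as one selection function. It assumes the classification model has already produced a mapping from background sound label to audio samples, and it models the rejection interface as a callback that returns the labels the user chose to remove. The mapping shape, the callback and the name `select_background_to_keep` are hypothetical.

```python
from typing import Callable, Dict, List, Set


def select_background_to_keep(
    labelled: Dict[str, List[float]],
    preset_labels: Set[str],
    ask_user_to_reject: Callable[[List[str]], Set[str]],
) -> Dict[str, List[float]]:
    """Return the labelled background sounds that stay in the target video."""
    present = preset_labels.intersection(labelled)
    if present:
        # Branch 1: a preset label is present. Keep it (first background sound)
        # and drop everything else (second background sound).
        return {label: track for label, track in labelled.items() if label in present}
    # Branch 2: no preset label is present. The user picks labels to reject
    # (third background sound); the remainder is kept (fourth background sound).
    rejected = ask_user_to_reject(sorted(labelled))
    return {label: track for label, track in labelled.items() if label not in rejected}
```

For example, with preset label "waves", a recording labelled with waves, traffic and wind keeps only the waves; without any preset label present, the user's selection decides what is removed.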
Optionally, after the step of performing a corresponding noise reduction operation on the video file to be denoised according to the preset noise reduction algorithm, the human voice audio data and the background sound audio data, to obtain the target video file, the method further comprises:
when a background sound configuration instruction is detected, displaying a background sound configuration interface, and receiving a second background sound label selected via the background sound configuration interface;
obtaining a preset background sound library, and obtaining, from the preset background sound library, fifth background sound audio data corresponding to the second background sound label;
inserting the fifth background sound audio data into the target video file.
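One plausible reading of "inserting" is mixing the chosen library sound under the voice track. The sketch below assumes normalized samples in [-1.0, 1.0], loops the library sound if it is shorter than the voice, and clips the sum; the gain value and the name `mix_in_background` are illustrative assumptions, not taken from the source.

```python
from typing import List


def mix_in_background(
    voice: List[float],
    background: List[float],
    gain: float = 0.5,  # hypothetical attenuation so the voice stays dominant
) -> List[float]:
    """Add an attenuated, looped background to the voice, clipped to [-1, 1]."""
    if not background:
        return list(voice)
    mixed = []
    for i, sample in enumerate(voice):
        value = sample + gain * background[i % len(background)]
        mixed.append(max(-1.0, min(1.0, value)))
    return mixed
```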
Optionally, the video noise reduction method further comprises:
when a background sound storage instruction is detected, obtaining a corresponding video file according to the background sound storage instruction, and separating background sound audio data from the video file;
labelling the various background sounds in the background sound audio data based on a preset background sound classification model, to obtain background sound audio data carrying several background sound labels;
displaying a background sound storage interface, and receiving a third background sound label selected via the background sound storage interface;
obtaining, from the labelled background sound audio data, sixth background sound audio data whose background sound label is the third background sound label;
storing the sixth background sound audio data in the preset background sound library.
Optionally, the video noise reduction method further comprises:
when a background sound deletion instruction is detected, obtaining a fourth background sound label from the background sound deletion instruction, and deleting the background sound audio data corresponding to the fourth background sound label from the preset background sound library.
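The storage, lookup and deletion operations on the preset background sound library can be sketched as a small in-memory store keyed by background sound label. The class name `BackgroundSoundLibrary` and its methods are hypothetical; a real terminal would presumably persist the audio to storage rather than hold it in a dictionary.

```python
from typing import Dict, List


class BackgroundSoundLibrary:
    """Hypothetical in-memory background sound library keyed by label."""

    def __init__(self) -> None:
        self._sounds: Dict[str, List[float]] = {}

    def store(self, label: str, samples: List[float]) -> None:
        """Store (or overwrite) the audio for a label."""
        self._sounds[label] = list(samples)

    def fetch(self, label: str) -> List[float]:
        """Return the audio for a label; raises KeyError if it is absent."""
        return self._sounds[label]

    def delete(self, label: str) -> bool:
        """Delete the audio for a label; return whether anything was removed."""
        return self._sounds.pop(label, None) is not None

    def labels(self) -> List[str]:
        """Labels currently available, e.g. for a configuration interface."""
        return sorted(self._sounds)
```

Storing corresponds to the storage instruction, `fetch` to the configuration step that retrieves a sound by label, and `delete` to the deletion instruction.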
In addition, to achieve the above object, the present invention also provides a mobile terminal, comprising: a memory, a processor, and a video noise reduction program that is stored in the memory and executable on the processor, wherein the video noise reduction program, when executed by the processor, implements the steps of the video noise reduction method described above.
In addition, to achieve the above object, the present invention also provides a computer-readable storage medium on which a video noise reduction program is stored, wherein the video noise reduction program, when executed by a processor, implements the steps of the video noise reduction method described above.
The present invention proposes a video noise reduction method, a mobile terminal and a computer-readable storage medium. When a video noise reduction instruction is detected, the invention obtains the video file to be denoised according to the instruction, separates human voice audio data and background sound audio data from that file, and then performs a corresponding noise reduction operation on the file according to a preset noise reduction algorithm, the human voice audio data and the background sound audio data, to obtain a target video file. Performing the noise reduction operation on the video file reduces the influence of background sound on the human voice in the recorded video and effectively improves the clarity of the human voice in the video.
Brief description of the drawings
Fig. 1 is a schematic diagram of the hardware structure of a mobile terminal for implementing the embodiments of the present invention;
Fig. 2 is an architecture diagram of a communication network system provided by an embodiment of the present invention;
Fig. 3 is a schematic flowchart of a first embodiment of the video noise reduction method of the present invention.
The realization of the object, the functional features and the advantages of the present invention will be further described with reference to the accompanying drawings in conjunction with the embodiments.
Detailed description of the embodiments
It should be understood that the specific embodiments described herein are merely intended to explain the present invention and are not intended to limit it.
In the following description, suffixes such as "module", "component" or "unit" used to denote elements serve only to facilitate the description of the present invention and have no specific meaning in themselves. Therefore, "module", "component" and "unit" may be used interchangeably.
A terminal may be implemented in various forms. For example, the terminals described in the present invention may include mobile terminals such as mobile phones, tablet computers, laptops, palmtop computers, personal digital assistants (PDA), portable media players (PMP), navigation devices, wearable devices, smart bracelets and pedometers, as well as fixed terminals such as digital TVs and desktop computers.
The following description takes a mobile terminal as an example. Those skilled in the art will understand that, apart from elements used specifically for mobile purposes, the construction according to the embodiments of the present invention can also be applied to fixed terminals.
Referring to Fig. 1, which is a schematic diagram of the hardware structure of a mobile terminal for implementing the embodiments of the present invention, the mobile terminal 100 may include components such as an RF (Radio Frequency) unit 101, a WiFi module 102, an audio output unit 103, an A/V (audio/video) input unit 104, a sensor 105, a display unit 106, a user input unit 107, an interface unit 108, a memory 109, a processor 110 and a power supply 111. Those skilled in the art will understand that the mobile terminal structure shown in Fig. 1 does not limit the mobile terminal; the mobile terminal may include more or fewer components than illustrated, combine certain components, or arrange the components differently.
The components of the mobile terminal are described in detail below with reference to Fig. 1:
The radio frequency unit 101 may be used to receive and send signals while transmitting and receiving information or during a call. Specifically, it receives downlink information from a base station and passes it to the processor 110 for processing, and it sends uplink data to the base station. In general, the radio frequency unit 101 includes, but is not limited to, an antenna, at least one amplifier, a transceiver, a coupler, a low-noise amplifier, a duplexer and the like. In addition, the radio frequency unit 101 may communicate with networks and other devices via wireless communication. The wireless communication may use any communication standard or protocol, including but not limited to GSM (Global System for Mobile Communications), GPRS (General Packet Radio Service), CDMA2000 (Code Division Multiple Access 2000), WCDMA (Wideband Code Division Multiple Access), TD-SCDMA (Time Division-Synchronous Code Division Multiple Access), FDD-LTE (Frequency Division Duplexing-Long Term Evolution) and TDD-LTE (Time Division Duplexing-Long Term Evolution).
WiFi is a short-range wireless transmission technology. Through the WiFi module 102, the mobile terminal can help the user send and receive e-mail, browse web pages and access streaming media, providing the user with wireless broadband Internet access. Although Fig. 1 shows the WiFi module 102, it will be understood that it is not an essential component of the mobile terminal and may be omitted as needed without changing the essence of the invention.
When the mobile terminal 100 is in a mode such as a call signal reception mode, a call mode, a recording mode, a speech recognition mode or a broadcast reception mode, the audio output unit 103 may convert audio data received by the radio frequency unit 101 or the WiFi module 102, or stored in the memory 109, into an audio signal and output it as sound. The audio output unit 103 may also provide audio output related to a specific function performed by the mobile terminal 100 (for example, a call signal reception sound or a message reception sound). The audio output unit 103 may include a loudspeaker, a buzzer and the like.
The A/V input unit 104 is used to receive audio or video signals. The A/V input unit 104 may include a graphics processing unit (GPU) 1041 and a microphone 1042. The graphics processor 1041 processes image data of still pictures or video obtained by an image capture device (such as a camera) in a video capture mode or an image capture mode. The processed image frames may be displayed on the display unit 106. Image frames processed by the graphics processor 1041 may be stored in the memory 109 (or another storage medium) or sent via the radio frequency unit 101 or the WiFi module 102. The microphone 1042 may receive sound (audio data) in operating modes such as a telephone call mode, a recording mode and a speech recognition mode, and can process such sound into audio data. In the telephone call mode, the processed audio (voice) data may be converted into a format that can be sent to a mobile communication base station via the radio frequency unit 101 and output. The microphone 1042 may implement various types of noise cancellation (or suppression) algorithms to cancel (or suppress) noise or interference generated while receiving and sending audio signals.
The mobile terminal 100 further includes at least one sensor 105, such as a light sensor, a motion sensor and other sensors. Specifically, the light sensor includes an ambient light sensor and a proximity sensor: the ambient light sensor can adjust the brightness of the display panel 1061 according to the ambient light level, and the proximity sensor can turn off the display panel 1061 and/or the backlight when the mobile terminal 100 is moved to the ear. As one kind of motion sensor, an accelerometer can detect the magnitude of acceleration in all directions (generally three axes) and, when stationary, the magnitude and direction of gravity; it can be used for applications that recognize the phone's attitude (such as portrait/landscape switching, related games and magnetometer attitude calibration) and for vibration-recognition functions (such as a pedometer or tap detection). The phone may also be equipped with other sensors such as a fingerprint sensor, pressure sensor, iris sensor, molecular sensor, gyroscope, barometer, hygrometer, thermometer and infrared sensor, which are not described in detail here.
The display unit 106 is used to display information input by the user or information provided to the user. The display unit 106 may include a display panel 1061, which may be configured in the form of a liquid crystal display (LCD), an organic light-emitting diode (OLED) display, or the like.
The user input unit 107 may be used to receive input numeric or character information and to generate key signal inputs related to user settings and function control of the mobile terminal. Specifically, the user input unit 107 may include a touch panel 1071 and other input devices 1072. The touch panel 1071, also called a touch screen, collects the user's touch operations on or near it (for example, operations performed on or near the touch panel 1071 with a finger, a stylus or any other suitable object or accessory) and drives the corresponding connected device according to a preset program. The touch panel 1071 may include two parts: a touch detection device and a touch controller. The touch detection device detects the position of the user's touch, detects the signal produced by the touch operation, and transmits the signal to the touch controller; the touch controller receives the touch information from the touch detection device, converts it into contact coordinates, sends them to the processor 110, and can receive and execute commands sent by the processor 110. The touch panel 1071 may be implemented in various types, such as resistive, capacitive, infrared and surface acoustic wave. Besides the touch panel 1071, the user input unit 107 may also include other input devices 1072. Specifically, the other input devices 1072 may include, but are not limited to, one or more of a physical keyboard, function keys (such as volume control keys and switch keys), a trackball, a mouse and a joystick, which are not specifically limited here.
Further, the touch panel 1071 may cover the display panel 1061. When the touch panel 1071 detects a touch operation on or near it, it sends the operation to the processor 110 to determine the type of touch event, and the processor 110 then provides a corresponding visual output on the display panel 1061 according to the type of touch event. Although in Fig. 1 the touch panel 1071 and the display panel 1061 are shown as two separate components implementing the input and output functions of the mobile terminal, in some embodiments the touch panel 1071 and the display panel 1061 may be integrated to implement the input and output functions of the mobile terminal, which is not specifically limited here.
The interface unit 108 serves as an interface through which at least one external device can be connected to the mobile terminal 100. For example, the external device may include a wired or wireless headset port, an external power supply (or battery charger) port, a wired or wireless data port, a memory card port, a port for connecting a device having an identification module, an audio input/output (I/O) port, a video I/O port, an earphone port and the like. The interface unit 108 may be used to receive input (for example, data information or power) from an external device and transfer the received input to one or more elements within the mobile terminal 100, or may be used to transfer data between the mobile terminal 100 and the external device.
The memory 109 may be used to store software programs and various data. The memory 109 may mainly include a program storage area and a data storage area. The program storage area may store an operating system, application programs required for at least one function (such as a sound playback function or an image playback function) and the like; the data storage area may store data created through use of the phone (such as audio data and a phone book) and the like. In addition, the memory 109 may include high-speed random access memory and may also include non-volatile memory, for example at least one magnetic disk storage device, a flash memory device, or another volatile solid-state storage device.
The processor 110 is the control center of the mobile terminal. It connects the various parts of the entire mobile terminal using various interfaces and lines, and performs the various functions of the mobile terminal and processes data by running or executing software programs and/or modules stored in the memory 109 and calling data stored in the memory 109, thereby monitoring the mobile terminal as a whole. The processor 110 may include one or more processing units. Preferably, the processor 110 may integrate an application processor and a modem processor, where the application processor mainly handles the operating system, user interface, application programs and the like, and the modem processor mainly handles wireless communication. It will be understood that the modem processor need not be integrated into the processor 110.
The mobile terminal 100 may also include a power supply 111 (such as a battery) that supplies power to the components. Preferably, the power supply 111 may be logically connected to the processor 110 through a power management system, so that functions such as charging management, discharging management and power consumption management are implemented through the power management system.
Although not shown in Fig. 1, the mobile terminal 100 may also include a Bluetooth module and the like, which are not described in detail here.
As shown in Fig. 1, the memory 109, as a computer storage medium, may contain an operating system, a network communication module, a user interface module and a video noise reduction program. The processor 110 may be used to call the video noise reduction program stored in the memory 109 and perform the following steps:
when a video noise reduction instruction is detected, obtaining a video file to be denoised according to the video noise reduction instruction;
separating human voice audio data and background sound audio data from the video file to be denoised;
performing a corresponding noise reduction operation on the video file to be denoised according to a preset noise reduction algorithm, the human voice audio data and the background sound audio data, to obtain a target video file.
Further, the processor 110 may be used to call the video noise reduction program stored in the memory 109 and also perform the following step:
removing the background sound audio data from the video file to be denoised, to obtain a target video file containing only the human voice audio data.
Further, the processor 110 may be used to call the video noise reduction program stored in the memory 109 and also perform the following steps:
obtaining a preset voiceprint, and obtaining, from the human voice audio data, first human voice audio data that contains the preset voiceprint and second human voice audio data that does not contain the preset voiceprint;
removing the background sound audio data and the second human voice audio data from the video file to be denoised, to obtain a target video file containing only the first human voice audio data.
Further, the processor 110 may be used to call the video noise reduction program stored in the memory 109 and also perform the following steps:
labelling the various background sounds in the background sound audio data based on a preset background sound classification model, to obtain background sound audio data carrying several background sound labels;
determining whether the labelled background sound audio data contains first background sound audio data carrying a preset background sound label;
if the labelled background sound audio data contains first background sound audio data carrying the preset background sound label, obtaining, from the labelled background sound audio data, second background sound audio data whose background sound label is not the preset background sound label;
removing the second background sound audio data from the video file to be denoised, to obtain a target video file containing the first background sound audio data and the human voice audio data.
Further, the processor 110 may be used to call the video noise reduction program stored in the memory 109 and also perform the following steps:
if the labelled background sound audio data does not contain first background sound audio data carrying the preset background sound label, displaying a background sound rejection interface, and receiving a first background sound label selected via the background sound rejection interface;
obtaining, from the labelled background sound audio data, third background sound audio data whose background sound label is the first background sound label;
obtaining, from the labelled background sound audio data, fourth background sound audio data whose background sound label is not the first background sound label;
removing the third background sound audio data from the video file to be denoised, to obtain a target video file containing the fourth background sound audio data and the human voice audio data.
Further, the processor 110 may be used to call the video noise reduction program stored in the memory 109 and also perform the following steps:
when a background sound configuration instruction is detected, displaying a background sound configuration interface, and receiving a second background sound label selected via the background sound configuration interface;
obtaining a preset background sound library, and obtaining, from the preset background sound library, fifth background sound audio data corresponding to the second background sound label;
inserting the fifth background sound audio data into the target video file.
Further, the processor 110 may be used to call the video noise reduction program stored in the memory 109 and also perform the following steps:
when a background sound storage instruction is detected, obtaining a corresponding video file according to the background sound storage instruction, and separating background sound audio data from the video file;
labelling the various background sounds in the background sound audio data based on a preset background sound classification model, to obtain background sound audio data carrying several background sound labels;
displaying a background sound storage interface, and receiving a third background sound label selected via the background sound storage interface;
obtaining, from the labelled background sound audio data, sixth background sound audio data whose background sound label is the third background sound label;
storing the sixth background sound audio data in the preset background sound library.
Further, the processor 110 may be used to call the video noise reduction program stored in the memory 109 and also perform the following step:
when a background sound deletion instruction is detected, obtaining a fourth background sound label from the background sound deletion instruction, and deleting the background sound audio data corresponding to the fourth background sound label from the preset background sound library.
The specific embodiments of the mobile terminal of the present invention are substantially the same as the embodiments of the video noise reduction method described below, and are not repeated here.
To facilitate understanding of the embodiments of the present invention, the communication network system on which the mobile terminal of the present invention is based is described below.
Referring to Fig. 2, Fig. 2 is an architecture diagram of a communication network system provided by an embodiment of the present invention. The communication network system is an LTE system of universal mobile communication technology, and the LTE system includes a UE (User Equipment) 201, an E-UTRAN (Evolved UMTS Terrestrial Radio Access Network) 202, an EPC (Evolved Packet Core) 203 and an operator's IP services 204, which are communicatively connected in sequence.
Specifically, the UE 201 may be the terminal 100 described above, which is not described again here.
The E-UTRAN 202 includes an eNodeB 2021, other eNodeBs 2022 and so on. The eNodeB 2021 may be connected to the other eNodeBs 2022 through a backhaul (for example, an X2 interface), the eNodeB 2021 is connected to the EPC 203, and the eNodeB 2021 can provide the UE 201 with access to the EPC 203.
The EPC 203 may include an MME (Mobility Management Entity) 2031, an HSS (Home Subscriber Server) 2032, other MMEs 2033, an SGW (Serving Gateway) 2034, a PGW (PDN Gateway) 2035, a PCRF (Policy and Charging Rules Function) 2036 and so on. The MME 2031 is a control node that handles signaling between the UE 201 and the EPC 203 and provides bearer and connection management. The HSS 2032 provides registers to manage functions such as a home location register (not shown) and stores user-specific information about service characteristics, data rates and the like. All user data may be sent through the SGW 2034. The PGW 2035 may provide IP address allocation for the UE 201 and other functions. The PCRF 2036 is the policy and charging control decision point for service data flows and IP bearer resources; it selects and provides available policy and charging control decisions for a policy and charging enforcement function unit (not shown).
The IP services 204 may include the Internet, an intranet, an IMS (IP Multimedia Subsystem) or other IP services.
Although the above description takes an LTE system as an example, those skilled in the art should know that the present invention is not only applicable to LTE systems but also to other wireless communication systems, such as GSM, CDMA2000, WCDMA, TD-SCDMA and future new network systems, which are not limited here.
Based on the above mobile terminal hardware structure and communication network system, the embodiments of the video noise reduction method of the present invention are proposed.
The present invention provides a video noise reduction method.
Referring to Fig. 3, Fig. 3 is a schematic flowchart of the first embodiment of the video noise reduction method of the present invention.
In this embodiment, the video noise reduction method includes:
Step S101: when a video noise reduction instruction is detected, obtaining a video file to be denoised according to the video noise reduction instruction;
In this embodiment, the video noise reduction method is applied to a mobile terminal in which an application program for video noise reduction is installed. When the desktop icon of the application is touched, the mobile terminal displays the corresponding video noise reduction interface, which shows a local video noise reduction control and a real-time video noise reduction control. When the local video noise reduction control is touched, the mobile terminal displays a local video selection interface, receives the video file to be denoised that is selected through that interface, and switches back to the video noise reduction interface; then, when the start noise reduction control in the video noise reduction interface is touched, the corresponding video noise reduction instruction is triggered. When the real-time video noise reduction control is touched, the mobile terminal calls the camera to start recording; after recording ends, it takes the recorded video file as the video file to be denoised and switches to the video noise reduction interface, and when the start noise reduction control is touched, the corresponding video noise reduction instruction is triggered. When the video noise reduction instruction is detected, the mobile terminal obtains the video file to be denoised according to the instruction; that is, it obtains the video file name from the video noise reduction instruction and retrieves the video file to be denoised that corresponds to that name from the video file library.
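The lookup at the end of step S101 can be sketched as follows. This is only an illustration under stated assumptions: the instruction is assumed to carry the video file name, and the video file library is modelled as a mapping from file names to storage paths. `NoiseReductionInstruction`, `VIDEO_LIBRARY`, and `get_file_to_denoise` are hypothetical names that do not appear in the original disclosure.

```python
from dataclasses import dataclass
from typing import Dict


@dataclass(frozen=True)
class NoiseReductionInstruction:
    """Hypothetical instruction object; the text only says it carries a file name."""
    video_file_name: str


def get_file_to_denoise(instruction: NoiseReductionInstruction,
                        library: Dict[str, str]) -> str:
    """Return the stored path of the video named in the instruction."""
    name = instruction.video_file_name
    if name not in library:
        raise FileNotFoundError(f"no video named {name!r} in the video file library")
    return library[name]


# Assumed library layout: file name -> storage path.
VIDEO_LIBRARY = {"trip.mp4": "/videos/trip.mp4", "party.mp4": "/videos/party.mp4"}
path = get_file_to_denoise(NoiseReductionInstruction("trip.mp4"), VIDEO_LIBRARY)
```

A missing name raises an error here; a real terminal would more likely prompt the user to choose another file.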
Step S102: separating human voice audio data and background sound audio data from the video file to be denoised;
In this embodiment, after obtaining the video file to be denoised, the mobile terminal separates human voice audio data and background sound audio data from it; that is, it reads the audio data from the video file to be denoised and separates the human voice audio data and the background sound audio data from that audio data by means of a blind source separation algorithm. Optionally, the mobile terminal may instead separate the human voice audio data and the background sound audio data from the video file to be denoised based on a preset audio separation model. The preset audio separation model is obtained by machine learning: a large volume of audio data is collected and used to train the audio separation model until the model converges, and the converged model is then stored in the mobile terminal.
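The text does not name a particular blind source separation algorithm. As one possible illustration, the sketch below uses a two-channel independent component analysis (ICA): the two recorded channels are whitened, and the rotation that maximises non-Gaussianity (measured by excess kurtosis) is selected by a grid search. It assumes a two-channel recording with two independent sources, and it uses a sine wave and a square wave as toy stand-ins for the voice and the background; all function names are hypothetical.

```python
import math
from typing import List, Tuple

Signal = List[float]


def center(x: Signal) -> Signal:
    """Subtract the mean so the signal has zero mean."""
    mean = sum(x) / len(x)
    return [v - mean for v in x]


def excess_kurtosis(x: Signal) -> float:
    """Excess kurtosis of a zero-mean signal (0 for Gaussian data)."""
    n = len(x)
    var = sum(v * v for v in x) / n
    return (sum(v ** 4 for v in x) / n) / (var * var) - 3.0


def rotate(a: Signal, b: Signal, angle: float) -> Tuple[Signal, Signal]:
    """Apply a 2-D rotation to the pair of channels."""
    c, s = math.cos(angle), math.sin(angle)
    return ([c * p + s * q for p, q in zip(a, b)],
            [-s * p + c * q for p, q in zip(a, b)])


def whiten(x1: Signal, x2: Signal) -> Tuple[Signal, Signal]:
    """Decorrelate two channels and scale each to unit variance."""
    x1, x2 = center(x1), center(x2)
    n = len(x1)
    var1 = sum(v * v for v in x1) / n
    var2 = sum(v * v for v in x2) / n
    cov = sum(p * q for p, q in zip(x1, x2)) / n
    # Rotating by this angle diagonalises the 2x2 covariance matrix.
    y1, y2 = rotate(x1, x2, 0.5 * math.atan2(2.0 * cov, var1 - var2))
    s1 = math.sqrt(sum(v * v for v in y1) / n)
    s2 = math.sqrt(sum(v * v for v in y2) / n)
    return [v / s1 for v in y1], [v / s2 for v in y2]


def separate(x1: Signal, x2: Signal, steps: int = 90) -> Tuple[Signal, Signal]:
    """Two-channel ICA: pick the rotation that maximises non-Gaussianity."""
    z1, z2 = whiten(x1, x2)
    best_angle, best_score = 0.0, -1.0
    for k in range(steps):
        angle = (math.pi / 2.0) * k / steps
        u, v = rotate(z1, z2, angle)
        score = abs(excess_kurtosis(u)) + abs(excess_kurtosis(v))
        if score > best_score:
            best_angle, best_score = angle, score
    return rotate(z1, z2, best_angle)


def correlation(a: Signal, b: Signal) -> float:
    """Pearson correlation, used only to check the separation quality."""
    a, b = center(a), center(b)
    num = sum(p * q for p, q in zip(a, b))
    return num / math.sqrt(sum(p * p for p in a) * sum(q * q for q in b))


# Toy stand-ins: a sine for the "voice" and a square wave for the "background".
n = 1000
voice = [math.sin(2.0 * math.pi * 5 * k / n) for k in range(n)]
background = [1.0 if (k // 50) % 2 == 0 else -1.0 for k in range(n)]
# Two microphones record different mixtures of the two sources.
mic1 = [0.6 * v + 0.4 * b for v, b in zip(voice, background)]
mic2 = [0.3 * v + 0.7 * b for v, b in zip(voice, background)]
est1, est2 = separate(mic1, mic2)
```

ICA recovers the sources only up to order, sign, and scale, so a further step (for example, a classifier) would still be needed to decide which output is the voice. Real speech separation from a single-channel recording would more plausibly rely on the trained separation model mentioned above.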
Step S103: performing a corresponding noise reduction operation on the video file to be denoised according to a preset noise reduction algorithm, the human voice audio data, and the background sound audio data, to obtain a target video file.
In this embodiment, after the human voice audio data and the background sound audio data have been separated, the mobile terminal performs the corresponding noise reduction operation on the video file to be denoised according to the preset noise reduction algorithm, the human voice audio data, and the background sound audio data, and obtains the target video file. Optionally, the mobile terminal removes the background sound audio data from the video file to be denoised to obtain a target video file that contains only the human voice audio data; that is, it reads the audio data from the video file to be denoised and removes the background sound audio data from that audio data. For example, if the human voice audio data and the background sound audio data separated from the video file to be denoised are A and B respectively, then after the noise reduction operation the target video file contains only the human voice audio data, i.e. only A.
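The A and B example can be illustrated with a few samples. The sketch assumes that the recorded audio is the sample-wise sum of the voice track A and the background track B, and that the separated background track is sample-aligned with the recording; `reject_background` is a hypothetical helper, not a function named in the original text.

```python
from typing import List


def reject_background(mixed: List[float], background: List[float]) -> List[float]:
    """Remove an aligned background track from the mixed audio, sample by sample."""
    if len(mixed) != len(background):
        raise ValueError("tracks must have the same number of samples")
    return [m - b for m, b in zip(mixed, background)]


voice_a = [0.5, -0.25, 0.75, 0.0]            # A: human voice audio data
background_b = [0.125, 0.125, -0.25, 0.5]    # B: background sound audio data
mixed = [a + b for a, b in zip(voice_a, background_b)]
target_audio = reject_background(mixed, background_b)
```

With these values `target_audio` equals `voice_a`, which corresponds to a target video file that contains only A.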
Specifically, the mobile terminal obtains a preset voiceprint and, from the human voice audio data, obtains first human voice audio data that contains the preset voiceprint and second human voice audio data that does not contain it. It then removes the background sound audio data and the second human voice audio data from the video file to be denoised to obtain a target video file that contains only the first human voice audio data. It should be noted that the preset voiceprint can be set by the user, which is not specifically limited in this embodiment; optionally, the preset voiceprint is the voiceprint of the owner of the mobile terminal. The resulting target video file contains only the human voice audio data that matches the preset voiceprint, without background sounds or other people's voices, which further improves the clarity of the human voice in the video.
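The voiceprint-based split might look like the sketch below. The text does not say how a voiceprint is represented or compared, so this example assumes each separated voice track comes with a fixed-length feature vector and that a match is decided by cosine similarity against the preset voiceprint with a threshold of 0.8. `VoiceTrack`, `split_by_voiceprint`, the vectors, and the threshold are all illustrative assumptions.

```python
import math
from dataclasses import dataclass
from typing import List, Sequence, Tuple


@dataclass
class VoiceTrack:
    speaker: str                 # label used only for this illustration
    embedding: Sequence[float]   # assumed voiceprint feature vector
    samples: List[float]


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    dot = sum(p * q for p, q in zip(a, b))
    norm = math.sqrt(sum(p * p for p in a)) * math.sqrt(sum(q * q for q in b))
    return dot / norm if norm else 0.0


def split_by_voiceprint(tracks: List[VoiceTrack], preset: Sequence[float],
                        threshold: float = 0.8) -> Tuple[List[VoiceTrack], List[VoiceTrack]]:
    """Return (first, second): tracks matching the preset voiceprint, and the rest."""
    first, second = [], []
    for track in tracks:
        matches = cosine_similarity(track.embedding, preset) >= threshold
        (first if matches else second).append(track)
    return first, second


preset_voiceprint = [1.0, 0.0, 0.0]
tracks = [
    VoiceTrack("owner", [0.9, 0.1, 0.0], [0.5, 0.25]),
    VoiceTrack("passer-by", [0.1, 0.9, 0.2], [0.125, -0.5]),
]
first_voice, second_voice = split_by_voiceprint(tracks, preset_voiceprint)
```

Only the tracks in `first_voice` would be kept in the target video file; the tracks in `second_voice` would be removed together with the background sound.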
Specifically, the mobile terminal may also label the various background sounds in the background sound audio data based on a preset background sound classification model, obtaining background sound audio data that carries several background sound labels, and determine whether the labelled background sound audio data contains first background sound audio data carrying a preset background sound label. If it does, the mobile terminal obtains, from the labelled background sound audio data, second background sound audio data whose background sound label is not the preset background sound label, and removes the second background sound audio data from the video file to be denoised to obtain a target video file that contains the first background sound audio data and the human voice audio data. The background sound labels include, but are not limited to, a label for wind, a label for sea waves, and a label for background music. The background sound classification model is obtained by machine learning: a large volume of background sound audio data is collected and used to train the model until it converges, and the converged model is then stored in the mobile terminal. It should be noted that the background sound audio data corresponding to the preset background sound label has little influence on the human voice; the preset background sound label can be set by the user according to actual conditions, which is not specifically limited in this embodiment. The target video file obtained after noise reduction contains the human voice audio data and the background sound audio data that has little influence on the human voice, so the clarity of the human voice in the video is improved while a certain amount of background sound is retained.
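Once the classification model has labelled the background sounds, the selection step reduces to a partition by label. The sketch below assumes the labelled data is available as a list of segments and leaves the classification itself out; `LabelledSegment` and `partition_by_preset_labels` are hypothetical names.

```python
from dataclasses import dataclass
from typing import List, Set, Tuple


@dataclass
class LabelledSegment:
    label: str            # background sound label, e.g. "wind"
    samples: List[float]


def partition_by_preset_labels(
    segments: List[LabelledSegment], preset_labels: Set[str]
) -> Tuple[List[LabelledSegment], List[LabelledSegment]]:
    """Return (first, second): segments to keep and segments to remove."""
    first = [s for s in segments if s.label in preset_labels]
    second = [s for s in segments if s.label not in preset_labels]
    return first, second


segments = [
    LabelledSegment("wind", [0.2, 0.1]),
    LabelledSegment("waves", [0.05, 0.05]),
    LabelledSegment("music", [0.3, -0.3]),
]
kept, removed = partition_by_preset_labels(segments, {"waves"})
```

An empty `kept` list corresponds to the case in which no background sound carries a preset label.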
In this embodiment, when a video noise reduction instruction is detected, the video file to be denoised is obtained according to the instruction, human voice audio data and background sound audio data are separated from it, and the corresponding noise reduction operation is then performed on the video file to be denoised according to the preset noise reduction algorithm, the human voice audio data, and the background sound audio data to obtain the target video file. Performing the noise reduction operation on the video file reduces the influence of background sound on the human voice in the recorded video and effectively improves the clarity of the human voice in the video.
Further, based on the above first embodiment, a second embodiment of the video noise reduction method of the present invention is proposed. It differs from the foregoing embodiment in that, if the labelled background sound audio data contains no first background sound audio data carrying a preset background sound label, the mobile terminal displays a background sound rejection interface so that the user can select, through that interface, the background sound label corresponding to the background sound audio data to be removed. The mobile terminal receives the first background sound label selected through the background sound rejection interface, obtains from the labelled background sound audio data third background sound audio data whose background sound label is the first background sound label and fourth background sound audio data whose background sound label is not the first background sound label, and finally removes the third background sound audio data from the video file to be denoised to obtain a target video file that contains the fourth background sound audio data and the human voice audio data.
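The fallback of the second embodiment can be sketched as a function that asks the user for a label only when no preset label is present. The callback `select_label` stands in for the background sound rejection interface and, like the other names here, is a hypothetical construct introduced for illustration.

```python
from typing import Callable, List, Set, Tuple

Segment = Tuple[str, List[float]]  # (background sound label, samples)


def reject_by_user_choice(
    segments: List[Segment],
    preset_labels: Set[str],
    select_label: Callable[[List[str]], str],
) -> Tuple[List[Segment], List[Segment]]:
    """Return (third, fourth): segments to remove and segments to keep."""
    labels_present = sorted({label for label, _ in segments})
    if preset_labels & set(labels_present):
        raise ValueError("a preset label is present; the first embodiment applies")
    chosen = select_label(labels_present)   # stands in for the rejection interface
    if chosen not in labels_present:
        raise ValueError(f"label {chosen!r} is not among the detected labels")
    third = [seg for seg in segments if seg[0] == chosen]
    fourth = [seg for seg in segments if seg[0] != chosen]
    return third, fourth


segments = [("traffic", [0.1]), ("wind", [0.2]), ("traffic", [0.3])]
third, fourth = reject_by_user_choice(segments, {"waves"}, lambda labels: "traffic")
```

Here `third` holds the two traffic segments that would be removed, and `fourth` holds the wind segment that stays in the target video file.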
In a specific implementation, after the target video file containing the fourth background sound audio data and the human voice audio data is obtained, a preset voiceprint may also be obtained, and one piece of human voice audio data that contains the preset voiceprint and another piece of human voice audio data that does not contain it are obtained from the human voice audio data. The human voice audio data that contains the preset voiceprint is then removed from the target video file to obtain a video file that contains the fourth background sound audio data and the human voice audio data that does not contain the preset voiceprint.
Further, after the target video file containing the fourth background sound audio data and the human voice audio data is obtained, the background sound label of the fourth background sound audio data may be obtained, and it is determined whether the preset background sound library contains background sound audio data corresponding to that label. If it does not, the fourth background sound audio data is stored in the preset background sound library; if it does, no processing is performed.
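The store-only-if-absent rule is small enough to show directly. The sketch models the preset background sound library as a dictionary keyed by background sound label, which is an assumption; `store_if_absent` is a hypothetical helper.

```python
from typing import Dict, List


def store_if_absent(library: Dict[str, List[float]], label: str,
                    samples: List[float]) -> bool:
    """Store the samples under the label unless the library already has that label.

    Returns True if the data was stored and False if nothing was done.
    """
    if label in library:
        return False
    library[label] = list(samples)
    return True


background_library = {"waves": [0.05, 0.05]}
stored_wind = store_if_absent(background_library, "wind", [0.2, 0.1])
stored_waves = store_if_absent(background_library, "waves", [0.9, 0.9])
```

After these two calls the library gains a `"wind"` entry, and the existing `"waves"` entry is left unchanged.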
In this embodiment, a background sound rejection interface is displayed so that the user can select the background sound label corresponding to the background sound audio data to be removed, and the mobile terminal then removes the background sound audio data corresponding to that label from the video file to be denoised. This improves the clarity of the human voice in the video while retaining a certain amount of background sound.
Further, based on the above first or second embodiment, a third embodiment of the video noise reduction method of the present invention is proposed. It differs from the foregoing embodiments in that, after the target video file is obtained, a background sound can also be configured in the target video file. Specifically, when a background sound configuration instruction is detected, the mobile terminal displays a background sound configuration interface and receives the second background sound label selected by the user through that interface; it then obtains the preset background sound library, obtains from it fifth background sound audio data corresponding to the second background sound label, and inserts the fifth background sound audio data into the target video file. Because a background sound can be added to the target video file, the clarity of the human voice in the video is improved while a certain amount of background sound is retained.
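The text does not specify how the fifth background sound audio data is "inserted". One plausible reading is that it is mixed under the voice at reduced volume; the sketch below does this by looping the library clip over the length of the target audio, applying a gain, and clamping the result to the range [-1, 1]. The gain value, the looping, the clamping, and the name `insert_background` are all assumptions made for illustration.

```python
from typing import Dict, List


def insert_background(target_audio: List[float], library: Dict[str, List[float]],
                      label: str, gain: float = 0.5) -> List[float]:
    """Mix the library clip for `label` into the target audio at the given gain."""
    background = library[label]
    if not background:
        raise ValueError(f"library entry {label!r} is empty")
    mixed = []
    for i, sample in enumerate(target_audio):
        # Loop the clip if it is shorter than the target audio.
        value = sample + gain * background[i % len(background)]
        mixed.append(max(-1.0, min(1.0, value)))  # keep samples in [-1, 1]
    return mixed


background_library = {"waves": [0.25, -0.25]}
target_audio = [0.5, 0.5, 0.5, 0.5]
with_waves = insert_background(target_audio, background_library, "waves")
```

Keeping the gain well below 1 reflects the stated goal of retaining some background sound without masking the human voice.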
Further, when a background sound storage instruction is detected, the mobile terminal obtains the corresponding video file according to the instruction and separates background sound audio data from it. It then labels the various background sounds in the background sound audio data based on the preset background sound classification model, obtaining background sound audio data that carries several background sound labels, displays a background sound storage interface, and receives the third background sound label selected through that interface. Finally, it obtains from the labelled background sound audio data sixth background sound audio data whose background sound label is the third background sound label, and stores the sixth background sound audio data in the preset background sound library. The background sound is thereby stored for later retrieval. In a specific implementation, when a background sound deletion instruction is detected, a fourth background sound label is obtained from the deletion instruction, and the background sound audio data corresponding to the fourth background sound label is deleted from the preset background sound library.
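The storage and deletion handling can be sketched in a few lines, assuming the labelled background sound data is a list of (label, samples) pairs and the library is a dictionary keyed by label. `store_selected` and `delete_label` are hypothetical helpers; joining several segments with the same label into one library entry is a design choice of this sketch, not something the text prescribes.

```python
from typing import Dict, List, Tuple

Segment = Tuple[str, List[float]]   # (background sound label, samples)
Library = Dict[str, List[float]]    # label -> stored samples


def store_selected(segments: List[Segment], chosen_label: str, library: Library) -> None:
    """Store every segment whose label equals the user-selected label."""
    selected = [samples for label, samples in segments if label == chosen_label]
    if not selected:
        raise KeyError(f"no background sound labelled {chosen_label!r}")
    # Concatenate all matching segments into a single library entry.
    library[chosen_label] = [v for samples in selected for v in samples]


def delete_label(label: str, library: Library) -> bool:
    """Delete the entry for the label; return False if it was not stored."""
    return library.pop(label, None) is not None


segments = [("wind", [0.1, 0.2]), ("waves", [0.3]), ("wind", [0.4])]
background_library: Library = {}
store_selected(segments, "wind", background_library)
snapshot_after_store = dict(background_library)
deleted_first = delete_label("wind", background_library)
deleted_again = delete_label("wind", background_library)
```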
In this embodiment, a background sound can also be added to the video file, so the clarity of the human voice in the video is improved while background sound is retained.
The present invention also provides a video noise reduction device, which includes:
an acquisition module, configured to obtain, when a video noise reduction instruction is detected, a video file to be denoised according to the video noise reduction instruction;
a data separation module, configured to separate human voice audio data and background sound audio data from the video file to be denoised; and
a noise reduction module, configured to perform a corresponding noise reduction operation on the video file to be denoised according to a preset noise reduction algorithm, the human voice audio data, and the background sound audio data, to obtain a target video file.
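The three modules can be pictured as stages of a small pipeline. The sketch below wires them together as injected callables so that each stage can be replaced independently; `VideoDenoiser` and the stub implementations are hypothetical and stand in for the real acquisition, separation, and noise reduction logic.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

Audio = List[float]


@dataclass
class VideoDenoiser:
    acquire: Callable[[str], Audio]                      # acquisition module
    separate: Callable[[Audio], Tuple[Audio, Audio]]     # data separation module
    reduce: Callable[[Audio, Audio, Audio], Audio]       # noise reduction module

    def run(self, file_name: str) -> Audio:
        mixed = self.acquire(file_name)
        voice, background = self.separate(mixed)
        return self.reduce(mixed, voice, background)


# Stub modules for illustration only: the "separation" uses a known voice track.
store = {"trip.mp4": [0.75, 0.0, 0.25]}   # file name -> mixed audio samples
known_voice = [0.5, -0.25, 0.25]

denoiser = VideoDenoiser(
    acquire=lambda name: store[name],
    separate=lambda mixed: (known_voice, [m - v for m, v in zip(mixed, known_voice)]),
    reduce=lambda mixed, voice, background: [m - b for m, b in zip(mixed, background)],
)
target_audio = denoiser.run("trip.mp4")
```

The further modules described below (display, background sound insertion, storage, and deletion) could be added as extra stages in the same way.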
Further, the noise reduction module is also configured to:
remove the background sound audio data from the video file to be denoised to obtain a target video file that contains only the human voice audio data.
Further, the noise reduction module is also configured to:
obtain a preset voiceprint, and obtain from the human voice audio data first human voice audio data that contains the preset voiceprint and second human voice audio data that does not contain the preset voiceprint; and
remove the background sound audio data and the second human voice audio data from the video file to be denoised to obtain a target video file that contains only the first human voice audio data.
Further, the noise reduction module is also configured to:
label the various background sounds in the background sound audio data based on a preset background sound classification model to obtain background sound audio data that carries several background sound labels;
determine whether the labelled background sound audio data contains first background sound audio data carrying a preset background sound label;
if the labelled background sound audio data contains first background sound audio data carrying a preset background sound label, obtain from the labelled background sound audio data second background sound audio data whose background sound label is not the preset background sound label; and
remove the second background sound audio data from the video file to be denoised to obtain a target video file that contains the first background sound audio data and the human voice audio data.
Further, the noise reduction module is also configured to:
if the labelled background sound audio data contains no first background sound audio data carrying a preset background sound label, display a background sound rejection interface and receive a first background sound label selected through the background sound rejection interface;
obtain from the labelled background sound audio data third background sound audio data whose background sound label is the first background sound label;
obtain from the labelled background sound audio data fourth background sound audio data whose background sound label is not the first background sound label; and
remove the third background sound audio data from the video file to be denoised to obtain a target video file that contains the fourth background sound audio data and the human voice audio data.
Further, the video noise reduction device further includes:
a display module, configured to display a background sound configuration interface when a background sound configuration instruction is detected, and to receive a second background sound label selected through the background sound configuration interface;
the acquisition module, further configured to obtain a preset background sound library and to obtain from the preset background sound library fifth background sound audio data corresponding to the second background sound label; and
a background sound insertion module, configured to insert the fifth background sound audio data into the target video file.
Further, the video noise reduction device further includes:
the acquisition module, further configured to obtain, when a background sound storage instruction is detected, the corresponding video file according to the background sound storage instruction;
the data separation module, further configured to separate background sound audio data from the video file;
the data separation module, further configured to label the various background sounds in the background sound audio data based on a preset background sound classification model to obtain background sound audio data that carries several background sound labels;
the display module, further configured to display a background sound storage interface and to receive a third background sound label selected through the background sound storage interface;
the acquisition module, further configured to obtain from the labelled background sound audio data sixth background sound audio data whose background sound label is the third background sound label; and
a storage module, configured to store the sixth background sound audio data in the preset background sound library.
Further, the video noise reduction device further includes:
a background sound deletion module, configured to obtain, when a background sound deletion instruction is detected, a fourth background sound label from the background sound deletion instruction, and to delete the background sound audio data corresponding to the fourth background sound label from the preset background sound library.
The specific implementations of the above video noise reduction device are substantially the same as the embodiments of the above video noise reduction method and are not repeated here.
In addition, an embodiment of the present invention also provides a computer readable storage medium on which a video noise reduction program is stored, and the video noise reduction program, when executed by a processor, implements the following steps:
when a video noise reduction instruction is detected, obtaining a video file to be denoised according to the video noise reduction instruction;
separating human voice audio data and background sound audio data from the video file to be denoised; and
performing a corresponding noise reduction operation on the video file to be denoised according to a preset noise reduction algorithm, the human voice audio data, and the background sound audio data, to obtain a target video file.
Further, the video noise reduction program, when executed by the processor, also implements the following step:
removing the background sound audio data from the video file to be denoised to obtain a target video file that contains only the human voice audio data.
Further, the video noise reduction program, when executed by the processor, also implements the following steps:
obtaining a preset voiceprint, and obtaining from the human voice audio data first human voice audio data that contains the preset voiceprint and second human voice audio data that does not contain the preset voiceprint; and
removing the background sound audio data and the second human voice audio data from the video file to be denoised to obtain a target video file that contains only the first human voice audio data.
Further, the video noise reduction program, when executed by the processor, also implements the following steps:
labelling the various background sounds in the background sound audio data based on a preset background sound classification model to obtain background sound audio data that carries several background sound labels;
determining whether the labelled background sound audio data contains first background sound audio data carrying a preset background sound label;
if the labelled background sound audio data contains first background sound audio data carrying a preset background sound label, obtaining from the labelled background sound audio data second background sound audio data whose background sound label is not the preset background sound label; and
removing the second background sound audio data from the video file to be denoised to obtain a target video file that contains the first background sound audio data and the human voice audio data.
Further, the video noise reduction program, when executed by the processor, also implements the following steps:
if the labelled background sound audio data contains no first background sound audio data carrying a preset background sound label, displaying a background sound rejection interface and receiving a first background sound label selected through the background sound rejection interface;
obtaining from the labelled background sound audio data third background sound audio data whose background sound label is the first background sound label;
obtaining from the labelled background sound audio data fourth background sound audio data whose background sound label is not the first background sound label; and
removing the third background sound audio data from the video file to be denoised to obtain a target video file that contains the fourth background sound audio data and the human voice audio data.
Further, the video noise reduction program, when executed by the processor, also implements the following steps:
when a background sound configuration instruction is detected, displaying a background sound configuration interface and receiving a second background sound label selected through the background sound configuration interface;
obtaining a preset background sound library, and obtaining from the preset background sound library fifth background sound audio data corresponding to the second background sound label; and
inserting the fifth background sound audio data into the target video file.
Further, the video noise reduction program, when executed by the processor, also implements the following steps:
when a background sound storage instruction is detected, obtaining the corresponding video file according to the background sound storage instruction, and separating background sound audio data from the video file;
labelling the various background sounds in the background sound audio data based on a preset background sound classification model to obtain background sound audio data that carries several background sound labels;
displaying a background sound storage interface, and receiving a third background sound label selected through the background sound storage interface;
obtaining from the labelled background sound audio data sixth background sound audio data whose background sound label is the third background sound label; and
storing the sixth background sound audio data in the preset background sound library.
Further, the video noise reduction program, when executed by the processor, also implements the following step:
when a background sound deletion instruction is detected, obtaining a fourth background sound label from the background sound deletion instruction, and deleting the background sound audio data corresponding to the fourth background sound label from the preset background sound library.
The specific implementations of the computer readable storage medium of the present invention are substantially the same as the embodiments of the above video noise reduction method and are not repeated here.
It should be noted that, in this document, the terms "include", "comprise", or any other variant thereof are intended to cover a non-exclusive inclusion, so that a process, method, article, or system that includes a series of elements includes not only those elements but also other elements not expressly listed, or elements inherent to such a process, method, article, or system. Without further limitation, an element defined by the phrase "including a ..." does not exclude the presence of other identical elements in the process, method, article, or system that includes that element.
The serial numbers of the above embodiments of the present invention are for description only and do not represent the relative merits of the embodiments.
Through the above description of the embodiments, those skilled in the art can clearly understand that the methods of the above embodiments can be implemented by means of software plus a necessary general-purpose hardware platform, and of course also by hardware, although in many cases the former is the better implementation. Based on this understanding, the technical solution of the present invention, in essence or in the part that contributes to the prior art, can be embodied in the form of a software product. The computer software product is stored in a storage medium as described above (such as a ROM/RAM, magnetic disk, or optical disc) and includes several instructions that cause a terminal device (which may be a mobile phone, computer, server, air conditioner, network device, or the like) to execute the methods described in the embodiments of the present invention.
The above are only preferred embodiments of the present invention and do not limit the scope of the invention. Any equivalent structure or equivalent process transformation made using the contents of the specification and drawings of the present invention, or any direct or indirect application in other related technical fields, is likewise included within the scope of patent protection of the present invention.