CN106971704A

CN106971704A - Audio processing method and mobile terminal

Info

Publication number: CN106971704A
Application number: CN201710288677.3A
Authority: CN
Inventors: 林雄周
Original assignee: Vivo Mobile Communication Co Ltd
Current assignee: Vivo Mobile Communication Co Ltd
Priority date: 2017-04-27
Filing date: 2017-04-27
Publication date: 2017-07-21
Anticipated expiration: 2037-04-27
Also published as: CN106971704B

Abstract

An embodiment of the invention discloses an audio processing method and a mobile terminal. The audio processing method includes: while a user is singing a song, collecting the user's vocal audio data, and judging whether the time period in the song to which the vocal audio data corresponds falls within a preset time period. If it does, it is judged whether the frequency of the vocal audio data reaches the frequency of the original singer; if the frequency of the vocal audio data does not reach the frequency of the original singer, the frequency of the collected vocal audio data is adjusted to the frequency of the original singer, and the frequency-adjusted vocal audio data is then output. The frequency of the user's singing is thereby adjusted effectively, so that an excessive difference between the frequency of the user's voice and that of the original singer's voice does not impair the singing effect, and a user without professional singing ability can still deliver a good performance.

Description

Audio processing method and mobile terminal

Technical field

The embodiments of the present invention relate to the field of communications, and in particular to an audio processing method and a mobile terminal.

Background art

Many terminals now integrate functions such as home theater and karaoke, which makes it convenient for users to sing karaoke. Karaoke users, however, are often amateur singers who, while singing, frequently cannot sing the high-pitched or low-pitched parts of a song at the frequency of the original singer, and therefore cannot deliver a high-quality singing effect. For example, when singing the high-pitched part of a song, a user often uses falsetto to reach the higher frequency; because falsetto is difficult to control accurately, the frequency is prone to drop suddenly, that is, distortion easily occurs.

It can be seen that, in the prior art, a high-quality singing effect can be achieved only by improving the user's singing ability; if the user's singing ability is insufficient, distortion easily occurs and impairs the singing effect.

Summary of the invention

The embodiments of the present invention provide an audio processing method and a mobile terminal, to solve the problem that distortion easily occurs and impairs the singing effect when the user's singing ability is insufficient.

In one aspect, an audio processing method is provided, the method including:

while a user is singing a song, collecting the user's vocal audio data;

judging whether the time period in the song to which the vocal audio data corresponds falls within a preset time period;

if the time period in the song to which the vocal audio data corresponds falls within the preset time period, judging whether the frequency of the vocal audio data reaches the frequency of the original singer;

if the frequency of the vocal audio data does not reach the frequency of the original singer, adjusting the frequency of the collected vocal audio data to the frequency of the original singer, and outputting the frequency-adjusted vocal audio data;

wherein the preset time period is the time period corresponding to a preset audio segment of the song, the preset audio segment is an audio segment in which the frequency of the song's original singer lies within a preset vocal frequency range, and the preset vocal frequency range includes a preset high-pitch vocal frequency range and a preset low-pitch vocal frequency range.

In another aspect, an embodiment of the present invention further provides a mobile terminal, including:

a sound collection module, configured to collect the user's vocal audio data while the user is singing a song;

an audio position determining module, configured to judge whether the time period in the song to which the vocal audio data corresponds falls within a preset time period;

an evaluation module, configured to judge, if the time period in the song to which the vocal audio data corresponds falls within the preset time period, whether the frequency of the vocal audio data reaches the frequency of the original singer;

an audio adjusting module, configured to adjust the frequency of the collected vocal audio data to the frequency of the original singer if the frequency of the vocal audio data does not reach the frequency of the original singer;

an output module, configured to output the frequency-adjusted vocal audio data.

In summary, in the embodiments of the present invention, the user's vocal audio data is collected while the user is singing a song, and it is judged whether the time period in the song to which the vocal audio data corresponds falls within a preset time period. If it does, it is judged whether the frequency of the vocal audio data reaches the frequency of the original singer; if it does not, the frequency of the collected vocal audio data is adjusted to the frequency of the original singer, and the frequency-adjusted vocal audio data is then output. The frequency of the user's singing is thereby adjusted effectively, so that an excessive difference between the frequency of the user's voice and that of the original singer's voice does not impair the singing effect. A user without professional singing ability can therefore still deliver a good performance, and the singer's singing effect is optimized.

Brief description of the drawings

To describe the technical solutions of the embodiments of the present invention more clearly, the accompanying drawings needed in the description of the embodiments are briefly introduced below. The drawings described below are evidently only some embodiments of the present invention; those of ordinary skill in the art can obtain other drawings from them without creative effort.

Fig. 1 is a flow chart of an audio processing method according to an embodiment of the present invention;

Fig. 2 is a flow chart of another audio processing method according to an embodiment of the present invention;

Fig. 3 is a first block diagram of a mobile terminal according to an embodiment of the present invention;

Fig. 4 is a second block diagram of a mobile terminal according to an embodiment of the present invention;

Fig. 5 is a third block diagram of a mobile terminal according to an embodiment of the present invention;

Fig. 6 is a fourth block diagram of a mobile terminal according to an embodiment of the present invention.

Detailed description of the embodiments

The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the accompanying drawings. The described embodiments are evidently some, rather than all, of the embodiments of the present invention. All other embodiments obtained by those of ordinary skill in the art on the basis of these embodiments without creative effort fall within the scope of protection of the present invention.

Referring to Fig. 1, a flow chart of an audio processing method in an embodiment of the present invention is shown. The method provided by this embodiment can be performed by a mobile terminal, and the audio processing method includes:

Step 101: while the user is singing a song, collect the user's vocal audio data.

The collected vocal audio data may be one or more audio frames of a set time length. To ensure that the frequency of the vocal audio data does not fluctuate noticeably within this time, the set time length may be determined according to how vocal audio typically varies.

In practical applications, the collected vocal audio data may be audio of any set time length input by the user. The set time length may be set empirically by those skilled in the art, for example to 5 milliseconds.

After the vocal audio data is collected, its frequency value can be obtained by analyzing it, so that it can be compared with the audio source file.
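The text does not say how the frequency value is obtained from a frame. One common option is an autocorrelation search, sketched below in plain Python. The function name, the search limits `f_min` and `f_max`, and the 40 ms test frame are assumptions of this sketch (the frame is deliberately longer than the 5 ms example so that it spans several pitch periods); they are not taken from the patent.

```python
import math


def estimate_frequency(frame, sample_rate, f_min=80.0, f_max=1100.0):
    """Estimate the fundamental frequency (Hz) of one audio frame.

    Uses a plain autocorrelation search: the lag at which the frame best
    matches a shifted copy of itself is taken as the pitch period.
    """
    lag_min = max(1, int(sample_rate / f_max))
    lag_max = min(len(frame) - 1, int(sample_rate / f_min))
    best_lag, best_score = None, 0.0
    for lag in range(lag_min, lag_max + 1):
        score = sum(frame[n] * frame[n + lag] for n in range(len(frame) - lag))
        if score > best_score:
            best_lag, best_score = lag, score
    if best_lag is None:
        return None  # silent or unvoiced frame: no usable pitch
    return sample_rate / best_lag


# Hypothetical test input: 40 ms of a 200 Hz tone sampled at 8 kHz.
SAMPLE_RATE = 8000
frame = [math.sin(2 * math.pi * 200 * n / SAMPLE_RATE) for n in range(320)]
estimated_hz = estimate_frequency(frame, SAMPLE_RATE)
```

A real implementation would likely use a more robust pitch detector, since raw autocorrelation is sensitive to noise and to octave errors.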

Step 102: judge whether the time period in the song to which the vocal audio data corresponds falls within a preset time period.

The human voice is the sound produced by vibration of the vocal cords. The more times the vocal cords vibrate in a given time, the higher the pitch, that is, the higher the vocal frequency. A voice whose frequency lies in the high-pitch vocal frequency range is generally called a high pitch, and a voice whose frequency lies in the low-pitch vocal frequency range is called a low pitch. Both high and low pitches are generally difficult to produce. In view of this, it can be judged whether the time period in the song to which the vocal audio data corresponds falls within the preset time period. If it does, step 103 is performed to judge whether the frequency of the vocal audio data reaches the frequency of the original singer. If it does not, the collected vocal audio data need not be modified and is output directly, which preserves and presents the user's real singing style.

The preset time period is the time period corresponding to a preset audio segment of the song. The preset audio segment is an audio segment in which the frequency of the song's original singer lies within a preset vocal frequency range, and the preset vocal frequency range includes a preset high-pitch vocal frequency range and a preset low-pitch vocal frequency range.

For example, when the time period in the song to which the vocal audio data corresponds falls within a high-pitch segment of the song, that is, when the user has reached the high-pitched part of the song, the singing effect is easily degraded because the user cannot reach the high notes. The embodiment of the present invention can adjust the collected vocal audio data during this time period, so that an excessively low vocal frequency does not impair the singing effect.

Step 103: judge whether the frequency of the vocal audio data reaches the frequency of the original singer.

When the time period in the song to which the vocal audio data corresponds falls within the preset time period, the user generally finds it difficult to sing this part of the song at the frequency of the original singer. It can therefore be judged whether the frequency of the vocal audio data reaches the frequency of the original singer during this time period.

Specifically, the frequency difference between the frequency of the vocal audio data and the frequency of the original singer can be calculated and compared with a threshold frequency. If the frequency difference is less than the threshold frequency, the frequency of the vocal audio data is determined to have reached the frequency of the original singer: the frequency at which the user sings is close to that of the original singer, and a good singing effect can be achieved. In this case the collected vocal audio data need not be adjusted; step 105 is performed directly and the collected vocal audio data is output.

Conversely, if the frequency of the vocal audio data does not reach the frequency of the original singer, step 104 can be performed: the frequency of the collected vocal audio data is adjusted to the frequency of the original singer and the frequency-adjusted vocal audio data is output, so as to ensure the user's singing effect.
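The branching of steps 102 to 105 can be sketched as a small decision function. This is only an illustration: the function names, the representation of preset time periods as (start, end) pairs in seconds, and the use of the absolute frequency difference are assumptions of this sketch rather than details stated in the text.

```python
def in_preset_period(t, preset_periods):
    """Return True if song time t (seconds) lies in any (start, end) period."""
    return any(start <= t < end for start, end in preset_periods)


def reaches_original(user_hz, original_hz, threshold_hz):
    """Step 103: the frequency difference must be below the threshold."""
    return abs(user_hz - original_hz) < threshold_hz


def decide(t, user_hz, original_hz, preset_periods, threshold_hz):
    """Return "output_as_is" (step 105) or "adjust_to_original" (step 104)."""
    if not in_preset_period(t, preset_periods):
        return "output_as_is"          # outside the difficult passages
    if reaches_original(user_hz, original_hz, threshold_hz):
        return "output_as_is"          # already close enough to the original
    return "adjust_to_original"
```

For instance, with a preset period from 10 s to 20 s and a 15 Hz threshold, a 400 Hz note sung at 12 s against a 440 Hz original would be sent to step 104, while the same note sung at 5 s would be output unchanged.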

Step 104: adjust the frequency of the collected vocal audio data to the frequency of the original singer, and output the frequency-adjusted vocal audio data.

When the frequency of the collected vocal audio data does not reach that of the original singer, the frequency of the collected vocal audio data can be adjusted so that it reaches the frequency of the original singer.

Specifically, when the frequency of the collected vocal audio data is adjusted, if the frequency of the original singer lies in the high-pitch vocal frequency range, the frequency of vocal audio data that is pitched too low can be raised. If the frequency of the original singer lies in the low-pitch vocal frequency range, the frequency of vocal audio data that is pitched too high can be lowered. The frequency of the collected vocal audio data is thereby adjusted to the frequency of the original singer, so that the collected vocal audio data, when played back by the mobile terminal, sounds closer to the original singer's performance. For example, a frequency-domain equalizer can be used to apply frequency-domain enhancement to the collected vocal audio data. A frequency-domain equalizer is an equalization compensation device that compensates for frequency-characteristic distortion in a data transmission channel, and can thereby adjust the frequency of the collected vocal audio data. For audio data with multiple tracks, each track needs to be adjusted separately.
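How far the voice has to be moved can be expressed as a frequency ratio, or equivalently as a number of equal-tempered semitones. The sketch below only computes that amount; it does not implement the frequency-domain equalizer mentioned in the text, and the function names are hypothetical.

```python
import math


def shift_ratio(user_hz, original_hz):
    """Factor by which the sung frequency must be multiplied."""
    return original_hz / user_hz


def shift_semitones(user_hz, original_hz):
    """The same shift expressed in equal-tempered semitones."""
    return 12.0 * math.log2(shift_ratio(user_hz, original_hz))
```

A positive semitone value means the voice must be raised (original singer in the high-pitch range), and a negative value means it must be lowered (original singer in the low-pitch range).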

Step 105: output the collected vocal audio data.

In summary, in this embodiment of the present invention, the user's vocal audio data is collected while the user is singing a song, and it is judged whether the time period in the song to which the vocal audio data corresponds falls within a preset time period. If it does, it is judged whether the frequency of the vocal audio data reaches the frequency of the original singer; if it does not, the frequency of the collected vocal audio data is adjusted to the frequency of the original singer, and the frequency-adjusted vocal audio data is then output. The frequency of the user's singing is thereby adjusted effectively, so that an excessive difference between the frequency of the user's voice and that of the original singer's voice does not impair the singing effect. A user without professional singing ability can therefore still deliver a good performance, and the singer's singing effect is optimized.

Referring to Fig. 2, a flow chart of another audio processing method according to an embodiment of the present invention is shown. The method provided by this embodiment can be performed by a mobile terminal, and the audio processing method includes:

Step 201: determine the preset audio segments in the audio source file of the song.

To determine whether the collected vocal audio data falls within a preset time period, the target time period in which the frequency in the song's audio source file lies within the preset vocal frequency range can first be determined according to the preset vocal frequency range and the data correspondence of the audio source file; the audio segment within that target time period is then determined to be a preset audio segment. This provides the basis for judging whether the time period in the song to which the vocal audio data corresponds falls within a preset time period. The data correspondence of the audio source file may be a spectrogram.

Specifically, because the vocal characteristics of the two sexes differ, the corresponding preset vocal frequency ranges also differ. The sex of the original singer corresponding to the audio data of each time period in the audio source file can therefore be determined first. Then, according to the sex of the original singer corresponding to each section of audio data and the preset vocal frequency range corresponding to each sex, the target time period in which the frequency of each section of audio data lies within the preset vocal frequency range is determined. A more accurate judgment can thus be made for each sex.

For example, suppose the preset vocal frequency range for a male singer is 164 Hz to 698 Hz and the preset vocal frequency range for a female singer is 220 Hz to 1.1 kHz. For a song whose original singer is male, the audio segments whose frequency in the data correspondence of the audio source file lies within 164 Hz to 698 Hz are the preset audio segments. Likewise, for a song whose original singer is female, the audio segments whose frequency in the data correspondence of the audio source file lies within 220 Hz to 1.1 kHz are the preset audio segments. When the audio source file is a song sung by a mixed chorus, each audio segment can be marked with the singer's sex, and the preset audio segments of each section of audio are determined according to the corresponding preset vocal frequency range.
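Step 201 can be sketched as a scan over a pitch track of the original vocal. The two frequency ranges are the example values quoted above; the pitch-track format (time-ordered pairs of time and frequency, with `None` for frames without vocals) and the function name are assumptions made for illustration.

```python
# Example ranges quoted in the text, in Hz.
PRESET_RANGES = {"male": (164.0, 698.0), "female": (220.0, 1100.0)}


def find_preset_periods(pitch_track, gender):
    """Return (start, end) periods whose original-vocal pitch is in range.

    pitch_track: list of (time_seconds, frequency_hz or None) samples of the
    original vocal, in time order; None marks frames without vocals.
    """
    low, high = PRESET_RANGES[gender]
    periods, start, last_t = [], None, None
    for t, hz in pitch_track:
        inside = hz is not None and low <= hz <= high
        if inside and start is None:
            start = t                      # a new period opens
        elif not inside and start is not None:
            periods.append((start, t))     # the open period closes here
            start = None
        last_t = t
    if start is not None:
        periods.append((start, last_t))    # track ended inside a period
    return periods
```

For a mixed chorus, the same function could be called once per marked section with that section's `gender` value.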

In practical applications, to make the analysis and judgment more efficient, preset audio segment division data can be produced from the audio source file in advance and stored on a cloud server or locally on the mobile terminal. Before the step of determining the preset audio segments in the audio source file of the song, the preset audio segment division data can then be obtained from the cloud server or from local storage on the mobile terminal. The preset audio segment division data characterizes the time periods in which the preset audio segments of the audio source file are located. When step 203 is performed, it is therefore only necessary to compare the time period in the song to which the vocal audio data corresponds directly against the preset audio segment division data, which saves computing power and prevents a large number of frequency calculations from slowing the system's response.
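With the division data already available, the check in step 203 reduces to a lookup. The sketch below assumes the division data is a list of non-overlapping (start, end) pairs in seconds, a format the text does not specify, and uses binary search from Python's standard `bisect` module.

```python
import bisect


def build_index(preset_periods):
    """Sort the (start, end) periods and extract the start times."""
    periods = sorted(preset_periods)
    starts = [start for start, _ in periods]
    return periods, starts


def is_in_preset_period(t, periods, starts):
    """Check by binary search whether song time t falls in a preset period."""
    i = bisect.bisect_right(starts, t) - 1   # last period starting at or before t
    return i >= 0 and t < periods[i][1]
```

The index is built once per song, so each collected frame costs only a logarithmic-time lookup instead of a fresh spectrum analysis.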

Step 202: while the user is singing a song, collect the user's vocal audio data.

In practical applications, a sound collection device such as a microphone can be used to collect, in real time, the audio signal produced while the user sings, and the audio signal is then processed to obtain the corresponding vocal audio data.

Step 203: judge whether the time period in the song to which the vocal audio data corresponds falls within a preset time period.

To judge whether the time period in the song to which the vocal audio data corresponds falls within a preset time period, the target time period in which the frequency in the song's audio source file lies within the preset vocal frequency range can be determined according to the preset vocal frequency range and the data correspondence of the audio source file, and the audio segment within the target time period is then determined to be a preset audio segment. Alternatively, the predetermined preset audio segment division data can be obtained directly from the cloud server or from local storage on the mobile terminal, so that only the time periods in which the preset audio segments of the audio source file are located are obtained, and these time periods are used to judge whether the time period in the song to which the vocal audio data corresponds falls within a preset time period. Specifically, when the spectrum of the audio source file is analyzed using the data correspondence of the audio source file, the analysis may be restricted to the vocal track of the audio source file to prevent interference from the accompaniment.

If the time period in the song to which the vocal audio data corresponds falls within a preset time period, step 204 is performed to judge whether the frequency of the vocal audio data reaches the frequency of the original singer. If it does not fall within a preset time period, the collected vocal audio data need not be modified and is output directly, which preserves and presents the user's real singing style.

Step 204: judge whether the frequency of the vocal audio data reaches the frequency of the original singer.

To judge accurately whether the frequency of the vocal audio data reaches the frequency of the original singer, the frequency difference between the frequency of the vocal audio data and the frequency of the original singer can be calculated and compared with a threshold frequency. If the frequency difference is less than the threshold frequency, the frequency of the vocal audio data is determined to have reached the frequency of the original singer. In this case, step 206 is performed directly and the collected vocal audio data is output. Otherwise, step 205 is performed: the frequency of the collected vocal audio data is adjusted to the frequency of the original singer and the frequency-adjusted vocal audio data is output.

Specifically, because different users have different requirements for frequency accuracy, the threshold frequency can be set by the user. It can also be set empirically by those skilled in the art. For example, if the user wants to be sure of delivering a good singing effect, the threshold frequency can be set lower, so that the user's singing in this time period comes closer to the original singer's performance. If the user would rather bring out a personal singing style, the threshold frequency can be set higher, so that the frequency of the vocal audio data collected while the user sings is adjusted only when it is significantly lower than the frequency of the original singer.

Step 205: adjust the frequency of the collected vocal audio data to the frequency of the original singer, and output the frequency-adjusted vocal audio data.

When the frequency of the collected vocal audio data is adjusted to the frequency of the original singer, a smooth adjustment duration can be determined according to the length of the preset audio segment. Within the smooth adjustment duration, the frequency of the collected vocal audio data is first adjusted gradually to the frequency of the original singer. Once it has been gradually adjusted to the frequency of the original singer, the frequency of the collected vocal audio data continues to be enhanced until the preset audio segment ends or the collected vocal audio data is interrupted. The frequency adjustment is thereby made smoother, so that the adjusted output does not sound abrupt and impair the singing effect.

Specifically, in determining the smooth adjustment duration according to the length of the preset audio segment: when the length of the preset audio segment exceeds a threshold time length, the threshold time length is taken as the smooth adjustment duration; when the length of the preset audio segment does not exceed the threshold time length, the length of the preset audio segment is taken as the smooth adjustment duration. That is, when the preset audio segment is long, there is sufficient time to adjust the frequency of the collected vocal audio data gradually to the frequency of the original singer; when the preset audio segment is short and there is not sufficient time for the adjustment, the whole of the preset audio segment can be used as the smooth adjustment duration for the gradual transition. Ample time is thus provided for the frequency adjustment.
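The rule for choosing the smooth adjustment duration amounts to capping the segment length at the threshold time length. A minimal sketch follows, with a hypothetical function name and both lengths assumed to be in the same unit:

```python
def smooth_adjust_duration(segment_length, threshold_length):
    """Return the smooth adjustment duration, in the same unit as the inputs."""
    if segment_length > threshold_length:
        return threshold_length   # long segment: cap the ramp at the threshold
    return segment_length         # short segment: ramp over the whole segment
```

This is equivalent to `min(segment_length, threshold_length)`; the explicit branches are kept only to mirror the two cases described in the text.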

During the frequency adjustment, within the smooth adjustment duration, the frequency of the collected vocal audio data can be adjusted by an adjustment amplitude δ = (f2 - f1) * t / T, giving an adjusted frequency f = f1 + δ, until t = T. Here t is the time elapsed from the start of the preset audio segment to the current time within the smooth adjustment duration, T is the smooth adjustment duration, f1 is the frequency of the collected vocal audio data, and f2 is the frequency of the original singer. This ensures that, in all cases, the frequency of the collected vocal audio data can be adjusted gradually to the frequency of the original singer.
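The linear ramp f = f1 + (f2 - f1) * t / T can be written out directly. In the sketch below the function name is hypothetical, and clamping t to the interval from 0 to T (so that the target stays at f2 once t reaches T) is an assumption added for safety rather than something the text states.

```python
def ramped_frequency(f1, f2, t, T):
    """Target frequency at time t into the preset audio segment.

    f1: frequency of the collected vocal audio data (Hz)
    f2: frequency of the original singer (Hz)
    T:  smooth adjustment duration; t is measured in the same unit
    """
    t = min(max(t, 0.0), T)          # hold the original's frequency once t >= T
    delta = (f2 - f1) * t / T        # adjustment amplitude
    return f1 + delta
```

With f1 = 400 Hz, f2 = 440 Hz and T = 2 s, the target is 400 Hz at the start of the segment, 420 Hz after one second, and 440 Hz from two seconds onward.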

In addition, some users do not stop singing at the moment an audio segment ends in the audio source file, but hold the note for a while longer. For example, suppose the original singer holds the word "原" (yuan) in the song "Qinghai-Tibet Plateau" for 4 seconds, while a certain user holds it for 5 seconds. If the adjustment of the collected vocal audio data stopped when the original singer's note ended, the frequency enhancement would be switched off while the user was still singing in the fifth second, the frequency of the vocal audio would drop suddenly, and the user would have a poor experience. To prevent the audio adjustment from being interrupted abruptly in this case, it can be detected, when the preset audio segment ends, whether the collected vocal audio data has been interrupted. If the collected vocal audio data has not been interrupted, the frequency of the collected vocal audio data continues to be adjusted so that it stays at the frequency of the original singer, or the amplitude of the frequency adjustment applied to the collected vocal audio data is reduced gradually.
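Both ways of handling a held note, continuing the adjustment or fading it out, can be sketched in one function. Everything here is illustrative: the function name, the `fade_time` parameter, and the linear fade are assumptions, since the text only says that the amplitude is reduced gradually.

```python
def tail_adjustment(delta_at_end, time_since_end, still_singing, fade_time=None):
    """Adjustment amplitude (Hz) to apply after the preset segment has ended.

    delta_at_end:   amplitude in effect when the segment ended
    time_since_end: seconds elapsed since the segment ended
    still_singing:  True while the collected vocal audio is uninterrupted
    fade_time:      hypothetical fade length in seconds; None means hold
    """
    if not still_singing:
        return 0.0                       # voice stopped: nothing to adjust
    if fade_time is None:
        return delta_at_end              # option 1: keep holding the note
    remaining = max(0.0, 1.0 - time_since_end / fade_time)
    return delta_at_end * remaining      # option 2: fade the adjustment out
```

In the 4-second versus 5-second example, a call made half a second after the segment ends with a 40 Hz amplitude would return 40 Hz when holding, or 20 Hz with a one-second fade.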

Step 206: output the collected vocal audio data.

In summary, in this embodiment of the present invention, the target time period in which the frequency of each section of audio data lies within the preset vocal frequency range is determined according to the sex of the original singer corresponding to each section of audio data and the preset vocal frequency range corresponding to each sex, so that a more accurate judgment can be made for each sex. Obtaining the predetermined preset audio segment division data from the cloud server or from local storage on the mobile terminal makes the calculation faster, so that the frequency of the collected vocal audio data is enhanced more promptly. In addition, enhancing the audio progressively, and continuing the enhancement at the end of a passage, makes the change smoother and less abrupt. The user experience is thereby substantially improved.

Referring to Fig. 3, a block diagram of a mobile terminal in an embodiment of the present invention is shown. The mobile terminal includes: a sound collection module 31, an audio position determining module 32, an evaluation module 33, an audio adjusting module 34 and an output module 35.

The sound collection module 31 is configured to collect the user's vocal audio data while the user is singing a song;

the audio position determining module 32 is configured to judge whether the time period in the song to which the vocal audio data corresponds falls within a preset time period;

the evaluation module 33 is configured to judge, if the time period in the song to which the vocal audio data corresponds falls within the preset time period, whether the frequency of the vocal audio data reaches the frequency of the original singer;

the audio adjusting module 34 is configured to adjust the frequency of the collected vocal audio data to the frequency of the original singer if the frequency of the vocal audio data does not reach the frequency of the original singer;

the output module 35 is configured to output the frequency-adjusted vocal audio data.

The preset time period is the time period corresponding to a preset audio segment of the song. The preset audio segment is an audio segment in which the frequency of the song's original singer lies within a preset vocal frequency range, and the preset vocal frequency range includes a preset high-pitch vocal frequency range and a preset low-pitch vocal frequency range.

In summary, in this embodiment of the present invention, the sound collection module 31 collects the user's vocal audio data while the user is singing a song, and the audio position determining module 32 judges whether the time period in the song to which the vocal audio data corresponds falls within a preset time period. If it does, the evaluation module 33 judges whether the frequency of the vocal audio data reaches the frequency of the original singer. If it does not, the audio adjusting module 34 adjusts the frequency of the collected vocal audio data to the frequency of the original singer, and the output module 35 then outputs the frequency-adjusted vocal audio data. The frequency of the user's singing is thereby adjusted effectively, so that an excessive difference between the frequency of the user's voice and that of the original singer's voice does not impair the singing effect, and a user without professional singing ability can still deliver a good performance.

Reference picture 4, in a preferred embodiment of the invention, on the basis of Fig. 3, mobile terminal also includes：In advance If audio fragment determining module 36 and acquisition module 37.

Wherein, preset audio fragment determining module 36, the preset audio fragment in audio source file for determining song.

Acquisition module 37, data are divided for obtaining preset audio fragment from cloud server；Or from mobile terminal sheet Ground obtains preset audio fragment and divides data；Wherein, preset audio fragment divides data and preset for characterizing in audio source file Period where audio fragment.

Specifically, preset audio fragment determining module 36, includes again：

Period determination sub-module 361, for being closed according to default voice frequency range is corresponding with the data of audio source file System, determines target time section of the audio source file intermediate frequency rate of song in default voice frequency range；

Preset audio fragment determination sub-module 362, for the audio fragment in target time section to be defined as into preset audio Fragment.

Wherein, period determination sub-module 361, including：

Sex determining unit 3611, for determining the corresponding original singer of voice data of each period in audio source file Sex；

Period determining unit 3612, it is corresponding with different sexes for the sex according to the corresponding original singer of each section audio data Default voice frequency range, the object time of frequency in each section audio data in default voice frequency range is determined respectively Section.

In addition, the audio adjusting module 34 includes:

A gentle adjustment duration determination sub-module 341, configured to determine the gentle adjustment duration according to the length of the preset audio fragment;

An adjustment sub-module 342, configured to gradually adjust, within the gentle adjustment duration, the frequency of the collected vocal audio data to the frequency of the original singer; and, once the frequency of the collected vocal audio data has been gently adjusted to the frequency of the original singer, to keep enhancing the frequency of the collected vocal audio data until the preset audio fragment ends or the collected vocal audio data is interrupted.

The gentle adjustment duration determination sub-module 341 is specifically configured to: when the length of the preset audio fragment exceeds a threshold time length, determine the threshold time length as the gentle adjustment duration; when the length of the preset audio fragment does not exceed the threshold time length, determine the length of the preset audio fragment as the gentle adjustment duration.

The adjustment sub-module 342 is specifically configured to adjust, within the gentle adjustment duration, the frequency of the collected vocal audio data by an adjustment amplitude δ = (f2 - f1) * t / T, the adjusted frequency being f = f1 + δ, until t = T; where t is the time elapsed from the start of the preset audio fragment to the current moment within the gentle adjustment duration, T is the gentle adjustment duration, f1 is the frequency of the collected vocal audio data, and f2 is the frequency of the original singer.
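The duration rule and the linear adjustment δ = (f2 - f1) * t / T described above can be sketched in Python as follows. The function names and the clamping of t to the interval [0, T] are assumptions added for illustration; the sketch only computes target frequencies and does not perform the actual pitch shifting of audio samples.

```python
def gentle_duration(fragment_len_s: float, threshold_s: float) -> float:
    """Gentle adjustment duration T: the fragment length, capped at the threshold."""
    return min(fragment_len_s, threshold_s)


def adjusted_frequency(f1: float, f2: float, t: float, T: float) -> float:
    """Adjusted frequency f = f1 + (f2 - f1) * t / T for 0 <= t <= T.

    f1: frequency of the collected vocal audio data (Hz)
    f2: frequency of the original singer (Hz)
    t:  time elapsed since the start of the preset audio fragment (s)
    T:  gentle adjustment duration (s)
    """
    if T <= 0:
        return f2  # assumption: with no ramp time, jump straight to the target
    t = max(0.0, min(t, T))  # assumption: clamp t so the ramp stops at t = T
    delta = (f2 - f1) * t / T
    return f1 + delta
```

For example, with f1 = 400 Hz, f2 = 440 Hz, and T = 2 s, the adjusted frequency is 400 Hz at t = 0, 420 Hz at t = 1 s, and 440 Hz at t = 2 s.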

Moreover, the adjustment sub-module 342 is further configured to detect whether the collected vocal audio data is interrupted; if the collected vocal audio data is detected not to be interrupted, to continue adjusting the frequency of the collected vocal audio data so that it remains stable at the frequency of the original singer, or to gently reduce the amplitude of the frequency adjustment applied to the collected vocal audio data.
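The two options for uninterrupted vocal audio (holding at the original singer's frequency, or gently reducing the adjustment amplitude) might look like the following Python sketch. The linear fade and the `fade_s` parameter are assumptions; the text does not say how the amplitude is reduced.

```python
from typing import Optional


def post_ramp_delta(f1: float, f2: float, elapsed_s: float,
                    fade_s: Optional[float] = None) -> float:
    """Adjustment amplitude applied after the gentle adjustment has completed.

    fade_s is None: hold the full adjustment, keeping the output at f2.
    fade_s given:   reduce the adjustment linearly to zero over fade_s seconds
                    (a hypothetical choice of fade shape).
    """
    full = f2 - f1
    if fade_s is None:
        return full
    if fade_s <= 0:
        return 0.0
    remaining = max(0.0, 1.0 - elapsed_s / fade_s)
    return full * remaining
```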

Specifically, the evaluation module 33 includes:

A frequency difference calculation sub-module 331, configured to calculate the frequency difference between the frequency of the vocal audio data and the frequency of the original singer;

A frequency difference evaluation sub-module 332, configured to judge whether the frequency difference is less than a threshold frequency; if the frequency difference is less than the threshold frequency, it is determined that the frequency of the vocal audio data reaches the frequency of the original singer.
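The evaluation step can be sketched in Python as a simple threshold comparison. Taking the absolute value of the difference is an assumption: the text does not say whether the difference is signed.

```python
def reaches_original(f_voice: float, f_original: float, threshold_hz: float) -> bool:
    """True if the vocal frequency is within threshold_hz of the original singer's.

    Assumption: the frequency difference is taken as an absolute value.
    """
    frequency_difference = abs(f_voice - f_original)
    return frequency_difference < threshold_hz
```

With a threshold of 5 Hz and an original frequency of 440 Hz, a sung frequency of 436 Hz counts as reaching the original, while 420 Hz does not.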

In summary, in this embodiment of the present invention, the audio fragment determining module 36 determines, according to the gender of the original singer corresponding to each piece of audio data and the preset vocal frequency ranges corresponding to the different genders, the target time period in which the frequency in each piece of audio data falls within the preset vocal frequency range, so that a more accurate judgment can be made for each gender. The acquisition module 37 obtains predetermined preset audio fragment division data from a cloud server or from local storage on the mobile terminal, which speeds up computation and makes the enhancement of the frequency of the collected vocal audio data more timely and effective. In addition, the audio adjusting module 34 enhances the audio gradually and keeps enhancing it through the end of the passage, which makes the change gentler and avoids an overly abrupt effect.

Fig. 5 is a block diagram of another mobile terminal according to an embodiment of the present invention. The mobile terminal 500 shown in Fig. 5 includes at least one processor 501, a memory 502, at least one network interface 504, and a user interface 503. The components of the mobile terminal 500 are coupled together by a bus system 505. It can be understood that the bus system 505 is used to implement connection and communication between these components. In addition to a data bus, the bus system 505 includes a power bus, a control bus, and a status signal bus. For clarity of description, however, the various buses are all labeled as the bus system 505 in Fig. 5.

The user interface 503 may include a display, a keyboard, or a pointing device (for example, a mouse, a trackball, a touch-sensitive pad, or a touch screen).

It can be understood that the memory 502 in this embodiment of the present invention may be a volatile memory or a non-volatile memory, or may include both volatile and non-volatile memory. The non-volatile memory may be a read-only memory (Read-Only Memory, ROM), a programmable read-only memory (Programmable ROM, PROM), an erasable programmable read-only memory (Erasable PROM, EPROM), an electrically erasable programmable read-only memory (Electrically EPROM, EEPROM), or a flash memory. The volatile memory may be a random access memory (Random Access Memory, RAM), which is used as an external cache. By way of example and not limitation, many forms of RAM are available, such as static random access memory (Static RAM, SRAM), dynamic random access memory (Dynamic RAM, DRAM), synchronous dynamic random access memory (Synchronous DRAM, SDRAM), double data rate synchronous dynamic random access memory (Double Data Rate SDRAM, DDRSDRAM), enhanced synchronous dynamic random access memory (Enhanced SDRAM, ESDRAM), synchlink dynamic random access memory (Synchlink DRAM, SLDRAM), and direct Rambus random access memory (Direct Rambus RAM, DRRAM). The memory 502 of the systems and methods described in the embodiments of the present invention is intended to include, but is not limited to, these and any other suitable types of memory.

In some embodiments, the memory 502 stores the following elements, executable modules or data structures, or a subset thereof, or an extended set thereof: an operating system 5021 and application programs 5022.

The operating system 5021 contains various system programs, such as a framework layer, a core library layer, and a driver layer, which are used to implement various basic services and to handle hardware-based tasks. The application programs 5022 contain various applications, such as a media player (Media Player) and a browser (Browser), which are used to implement various application services. A program implementing the method of the embodiments of the present invention may be included in the application programs 5022.

In this embodiment of the present invention, by calling a program or instructions stored in the memory 502, specifically a program or instructions stored in the application programs 5022, the processor 501 is configured to: while the user sings a song, collect the vocal audio data of the user and judge whether the time period in the song corresponding to the vocal audio data falls within a preset time period; if the time period in the song corresponding to the vocal audio data falls within the preset time period, judge whether the frequency of the vocal audio data reaches the frequency of the original singer; and if the frequency of the vocal audio data does not reach the frequency of the original singer, adjust the frequency of the collected vocal audio data to the frequency of the original singer and then output the frequency-adjusted vocal audio data.
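The decision flow performed by the processor can be summarized in a minimal Python sketch. It works on per-frame frequency values only and is an illustration under stated assumptions: the function names, the half-open period intervals, and the absolute-difference test are hypothetical, and the real pitch shifting of audio samples is not shown.

```python
from typing import List, Tuple


def in_preset_period(t_s: float, periods: List[Tuple[float, float]]) -> bool:
    """True if song time t_s falls inside any preset time period [start, end)."""
    return any(start <= t_s < end for start, end in periods)


def process_frame(t_s: float, f_voice: float, f_original: float,
                  periods: List[Tuple[float, float]], threshold_hz: float) -> float:
    """Return the output frequency for one frame of collected vocal audio."""
    if not in_preset_period(t_s, periods):
        return f_voice      # outside the preset time period: output unchanged
    if abs(f_voice - f_original) < threshold_hz:
        return f_voice      # already reaches the original singer's frequency
    return f_original       # otherwise adjust to the original singer's frequency
```

Returning `f_original` directly corresponds to the basic adjustment; the gradual adjustment over the gentle adjustment duration would replace that last step.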

The method disclosed in the above embodiments of the present invention may be applied to the processor 501 or implemented by the processor 501. The processor 501 may be an integrated circuit chip with signal processing capability. During implementation, each step of the above method may be completed by an integrated logic circuit of hardware in the processor 501 or by instructions in the form of software. The processor 501 may be a general-purpose processor, a digital signal processor (Digital Signal Processor, DSP), an application-specific integrated circuit (Application Specific Integrated Circuit, ASIC), a field-programmable gate array (Field Programmable Gate Array, FPGA) or other programmable logic device, a discrete gate or transistor logic device, or a discrete hardware component. It can implement or execute the methods, steps, and logical block diagrams disclosed in the embodiments of the present invention. The general-purpose processor may be a microprocessor, or the processor may be any conventional processor. The steps of the method disclosed in the embodiments of the present invention may be executed directly by a hardware decoding processor, or executed by a combination of hardware and software modules in a decoding processor. The software module may be located in a storage medium that is well established in the art, such as a random access memory, a flash memory, a read-only memory, a programmable read-only memory, an electrically erasable programmable memory, or a register. The storage medium is located in the memory 502; the processor 501 reads the information in the memory 502 and completes the steps of the above method in combination with its hardware.

It can be understood that the embodiments described herein may be implemented in hardware, software, firmware, middleware, microcode, or a combination thereof. For a hardware implementation, the processing unit may be implemented in one or more application-specific integrated circuits (Application Specific Integrated Circuits, ASIC), digital signal processors (Digital Signal Processing, DSP), digital signal processing devices (DSP Device, DSPD), programmable logic devices (Programmable Logic Device, PLD), field-programmable gate arrays (Field-Programmable Gate Array, FPGA), general-purpose processors, controllers, microcontrollers, microprocessors, other electronic units for performing the functions described in this application, or a combination thereof.

For a software implementation, the techniques of the embodiments of the present invention may be implemented by modules (such as procedures and functions) that perform the functions of the embodiments of the present invention. The software code may be stored in a memory and executed by a processor. The memory may be implemented inside the processor or outside the processor.

Optionally, the processor 501 is further configured to determine the preset audio fragment in the audio source file of the song.

Optionally, the processor 501 is further specifically configured to: determine, according to the correspondence between the preset vocal frequency range and the data of the audio source file, the target time period in which the frequency in the audio source file of the song falls within the preset vocal frequency range; and determine the audio fragment within the target time period as the preset audio fragment.

Optionally, the processor 501 is further specifically configured to: determine the gender of the original singer corresponding to the audio data of each time period in the audio source file; and determine, according to the gender of the original singer corresponding to each piece of audio data and the preset vocal frequency ranges corresponding to the different genders, the target time period in which the frequency in each piece of audio data falls within the preset vocal frequency range.

Optionally, the processor 501 is further configured to: obtain preset audio fragment division data from a cloud server; or obtain preset audio fragment division data from local storage on the mobile terminal; where the preset audio fragment division data is used to indicate the time period in which the preset audio fragment is located in the audio source file.

Optionally, the processor 501 is further configured to: determine the gentle adjustment duration according to the length of the preset audio fragment; within the gentle adjustment duration, gently adjust the frequency of the collected vocal audio data to the frequency of the original singer; and, once the frequency of the collected vocal audio data has been gently adjusted to the frequency of the original singer, keep enhancing the frequency of the collected vocal audio data until the preset audio fragment ends or the collected vocal audio data is interrupted.

Optionally, the processor 501 is further specifically configured to: when the length of the preset audio fragment exceeds a threshold time length, determine the threshold time length as the gentle adjustment duration; when the length of the preset audio fragment does not exceed the threshold time length, determine the length of the preset audio fragment as the gentle adjustment duration.

Optionally, the processor 501 is further specifically configured to adjust, within the gentle adjustment duration, the frequency of the collected vocal audio data by an adjustment amplitude δ = (f2 - f1) * t / T, the adjusted frequency being f = f1 + δ, until t = T; where t is the time elapsed from the start of the preset audio fragment to the current moment within the gentle adjustment duration, T is the gentle adjustment duration, f1 is the frequency of the collected vocal audio data, and f2 is the frequency of the original singer.

Optionally, the processor 501 is further specifically configured to: detect whether the collected vocal audio data is interrupted; and if the collected vocal audio data is detected not to be interrupted, continue adjusting the frequency of the collected vocal audio data so that it remains stable at the frequency of the original singer, or gently reduce the amplitude of the frequency adjustment applied to the collected vocal audio data.

Optionally, the processor 501 is further configured to: calculate the frequency difference between the frequency of the vocal audio data and the frequency of the original singer; judge whether the frequency difference is less than a threshold frequency; and if the frequency difference is less than the threshold frequency, determine that the frequency of the vocal audio data reaches the frequency of the original singer.

The mobile terminal 500 can implement each process implemented by the mobile terminal in the foregoing embodiments; to avoid repetition, details are not described here again.

In summary, in this embodiment of the present invention, while the user sings a song, the vocal audio data of the user is collected, and it is judged whether the time period in the song corresponding to the vocal audio data falls within a preset time period. If it does, it is judged whether the frequency of the vocal audio data reaches the frequency of the original singer; if the frequency of the vocal audio data does not reach the frequency of the original singer, the frequency of the collected vocal audio data is adjusted to the frequency of the original singer, and the frequency-adjusted vocal audio data is then output. The frequency of the user's singing is thereby adjusted effectively, which prevents an excessive difference between the frequency of the user's voice and that of the original singer from spoiling the singing effect, so that a user without professional singing ability can still show a good level of performance.

Fig. 6 is a block diagram of another mobile terminal according to an embodiment of the present invention. Specifically, the mobile terminal in Fig. 6 may be a mobile phone, a tablet computer, a personal digital assistant (Personal Digital Assistant, PDA), an in-vehicle computer, or the like.

The mobile terminal in Fig. 6 includes a radio frequency (Radio Frequency, RF) circuit 610, a memory 620, an input unit 630, a display unit 640, a processor 660, an audio circuit 670, a WiFi (Wireless Fidelity) module 680, and a power supply 690.

The input unit 630 may be used to receive numeric or character information input by the user, and to generate signal inputs related to user settings and function control of the mobile terminal. Specifically, in this embodiment of the present invention, the input unit 630 may include a touch panel 631. The touch panel 631, also called a touch screen, collects touch operations by the user on or near it (for example, operations performed on the touch panel 631 with a finger, a stylus, or any other suitable object or accessory), and drives the corresponding connected device according to a preset program. Optionally, the touch panel 631 may include two parts: a touch detection device and a touch controller. The touch detection device detects the position of the user's touch, detects the signal generated by the touch operation, and passes the signal to the touch controller; the touch controller receives the touch information from the touch detection device, converts it into contact coordinates, sends them to the processor 660, and can receive and execute commands sent by the processor 660. The touch panel 631 may be implemented in several types, such as resistive, capacitive, infrared, and surface acoustic wave. In addition to the touch panel 631, the input unit 630 may also include other input devices 632, which may include, but are not limited to, one or more of a physical keyboard, function keys (such as volume control keys and an on/off key), a trackball, a mouse, and a joystick.

The display unit 640 may be used to display information input by the user or information provided to the user, as well as the various menu interfaces of the mobile terminal. The display unit 640 may include a display panel 641; optionally, the display panel 641 may be configured in the form of an LCD, an organic light-emitting diode (Organic Light-Emitting Diode, OLED) display, or the like.

It should be noted that the touch panel 631 may cover the display panel 641 to form a touch display screen. When the touch display screen detects a touch operation on or near it, the operation is passed to the processor 660 to determine the type of the touch event, and the processor 660 then provides the corresponding visual output on the touch display screen according to the type of the touch event.

The touch display screen includes an application interface display area and a common control display area. The arrangement of the application interface display area and the common control display area is not limited; they may be arranged one above the other, side by side, or in any other way that distinguishes the two display areas. The application interface display area may be used to display the interface of an application. Each interface may contain interface elements such as the icon of at least one application and/or a widget desktop control. The application interface display area may also be an empty interface that contains no content. The common control display area is used to display frequently used controls, for example, a settings button, an interface number, a scroll bar, and application icons such as a phone book icon.

The processor 660 is the control center of the mobile terminal. It connects the various parts of the entire mobile phone through various interfaces and lines, and performs the various functions of the mobile terminal and processes data by running or executing the software programs and/or modules stored in the first memory 621 and calling the data stored in the second memory 622, thereby monitoring the mobile terminal as a whole. Optionally, the processor 660 may include one or more processing units.

In this embodiment of the present invention, by calling the software programs and/or modules stored in the first memory 621 and/or the data stored in the second memory 622, the processor 660 is configured to: while the user sings a song, collect the vocal audio data of the user and judge whether the time period in the song corresponding to the vocal audio data falls within a preset time period; if the time period in the song corresponding to the vocal audio data falls within the preset time period, judge whether the frequency of the vocal audio data reaches the frequency of the original singer; and if the frequency of the vocal audio data does not reach the frequency of the original singer, adjust the frequency of the collected vocal audio data to the frequency of the original singer and then output the frequency-adjusted vocal audio data.

Optionally, the processor 660 is further configured to determine the preset audio fragment in the audio source file of the song.

Optionally, the processor 660 is further specifically configured to: determine, according to the correspondence between the preset vocal frequency range and the data of the audio source file, the target time period in which the frequency in the audio source file of the song falls within the preset vocal frequency range; and determine the audio fragment within the target time period as the preset audio fragment.

Optionally, the processor 660 is further specifically configured to: determine the gender of the original singer corresponding to the audio data of each time period in the audio source file; and determine, according to the gender of the original singer corresponding to each piece of audio data and the preset vocal frequency ranges corresponding to the different genders, the target time period in which the frequency in each piece of audio data falls within the preset vocal frequency range.

Optionally, the processor 660 is further configured to: obtain preset audio fragment division data from a cloud server; or obtain preset audio fragment division data from local storage on the mobile terminal; where the preset audio fragment division data is used to indicate the time period in which the preset audio fragment is located in the audio source file.

Optionally, the processor 660 is further configured to: determine the gentle adjustment duration according to the length of the preset audio fragment; within the gentle adjustment duration, gently adjust the frequency of the collected vocal audio data to the frequency of the original singer; and, once the frequency of the collected vocal audio data has been gently adjusted to the frequency of the original singer, keep enhancing the frequency of the collected vocal audio data until the preset audio fragment ends or the collected vocal audio data is interrupted.

Optionally, the processor 660 is further specifically configured to: when the length of the preset audio fragment exceeds a threshold time length, determine the threshold time length as the gentle adjustment duration; when the length of the preset audio fragment does not exceed the threshold time length, determine the length of the preset audio fragment as the gentle adjustment duration.

Optionally, the processor 660 is further specifically configured to adjust, within the gentle adjustment duration, the frequency of the collected vocal audio data by an adjustment amplitude δ = (f2 - f1) * t / T, the adjusted frequency being f = f1 + δ, until t = T; where t is the time elapsed from the start of the preset audio fragment to the current moment within the gentle adjustment duration, T is the gentle adjustment duration, f1 is the frequency of the collected vocal audio data, and f2 is the frequency of the original singer.

Optionally, the processor 660 is further specifically configured to: detect whether the collected vocal audio data is interrupted; and if the collected vocal audio data is detected not to be interrupted, continue adjusting the frequency of the collected vocal audio data so that it remains stable at the frequency of the original singer, or gently reduce the amplitude of the frequency adjustment applied to the collected vocal audio data.

Optionally, the processor 660 is further configured to: calculate the frequency difference between the frequency of the vocal audio data and the frequency of the original singer; judge whether the frequency difference is less than a threshold frequency; and if the frequency difference is less than the threshold frequency, determine that the frequency of the vocal audio data reaches the frequency of the original singer.

The mobile terminal can implement each process implemented by the mobile terminal in the foregoing embodiments; to avoid repetition, details are not described here again.

It can be seen that, in the mobile terminal of this embodiment of the present invention, the processor 660 collects the vocal audio data of the user while the user sings a song, and judges whether the time period in the song corresponding to the vocal audio data falls within a preset time period. If it does, the processor judges whether the frequency of the vocal audio data reaches the frequency of the original singer; if the frequency of the vocal audio data does not reach the frequency of the original singer, the frequency of the collected vocal audio data is adjusted to the frequency of the original singer, and the frequency-adjusted vocal audio data is then output. The frequency of the user's singing is thereby adjusted effectively, which prevents an excessive difference between the frequency of the user's voice and that of the original singer from spoiling the singing effect, so that a user without professional singing ability can still show a good level of performance.

Those of ordinary skill in the art will appreciate that the units and algorithm steps of the examples described in connection with the embodiments disclosed herein can be implemented in electronic hardware, or in a combination of computer software and electronic hardware. Whether these functions are performed in hardware or in software depends on the specific application and the design constraints of the technical solution. Skilled practitioners may use different methods to implement the described functions for each specific application, but such implementations should not be considered beyond the scope of the present invention.

Those skilled in the art will clearly understand that, for convenience and brevity of description, the specific working processes of the systems, devices, and units described above may refer to the corresponding processes in the foregoing method embodiments, and are not repeated here.

In the embodiments provided in this application, it should be understood that the disclosed devices and methods may be implemented in other ways. For example, the device embodiments described above are merely illustrative. The division into units is only a division by logical function, and other divisions are possible in an actual implementation; for example, multiple units or components may be combined or integrated into another system, or some features may be omitted or not performed. In addition, the mutual couplings, direct couplings, or communication connections shown or discussed may be indirect couplings or communication connections through some interfaces, devices, or units, and may be electrical, mechanical, or of other forms.

The units described as separate components may or may not be physically separate, and the components shown as units may or may not be physical units; that is, they may be located in one place or distributed over multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.

In addition, the functional units in the embodiments of the present invention may be integrated into one processing unit, each unit may exist physically on its own, or two or more units may be integrated into one unit.

If the functions are implemented in the form of software functional units and sold or used as an independent product, they may be stored in a computer-readable storage medium. Based on this understanding, the technical solution of the present invention in essence, or the part that contributes to the prior art, or a part of the technical solution, may be embodied in the form of a software product. The computer software product is stored in a storage medium and includes several instructions that cause a computer device (which may be a personal computer, a server, a network device, or the like) to perform all or some of the steps of the methods of the embodiments of the present invention. The aforementioned storage medium includes various media that can store program code, such as a USB flash drive, a removable hard disk, a ROM, a RAM, a magnetic disk, or an optical disc.

The above are only specific embodiments of the present invention, but the protection scope of the present invention is not limited thereto. Any change or replacement that a person skilled in the art can readily conceive within the technical scope disclosed by the present invention shall fall within the protection scope of the present invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

Claims

1. An audio processing method, applied to a mobile terminal, characterized by comprising:

if the time period in the song corresponding to the vocal audio data falls within a preset time period, judging whether the frequency of the vocal audio data reaches the frequency of the original singer;

if the frequency of the vocal audio data does not reach the frequency of the original singer, adjusting the frequency of the collected vocal audio data to the frequency of the original singer, and outputting the frequency-adjusted vocal audio data;

wherein the preset time period is the time period corresponding to a preset audio fragment of the song, the preset audio fragment is an audio fragment in which the frequency of the original singer of the song falls within a preset vocal frequency range, and the preset vocal frequency range includes a preset high-pitch vocal frequency range and a preset low-pitch vocal frequency range.

2. The method according to claim 1, characterized in that, before the step of collecting the vocal audio data of the user while the user sings a song, the method further comprises:

determining the preset audio fragment in the audio source file of the song.

3. The method according to claim 2, characterized in that the step of determining the preset audio fragment in the audio source file of the song comprises:

determining, according to the correspondence between the preset vocal frequency range and the data of the audio source file, the target time period in which the frequency in the audio source file of the song falls within the preset vocal frequency range;

determining the audio fragment within the target time period as the preset audio fragment.

4. The method according to claim 3, characterized in that the step of determining, according to the correspondence between the preset vocal frequency range and the data of the audio source file, the target time period in which the frequency in the audio source file of the song falls within the preset vocal frequency range comprises:

determining the gender of the original singer corresponding to the audio data of each time period in the audio source file;

determining, according to the gender of the original singer corresponding to each piece of audio data and the preset vocal frequency ranges corresponding to the different genders, the target time period in which the frequency in each piece of audio data falls within the preset vocal frequency range.

5. The method according to claim 2, characterized in that, before the step of determining the preset audio fragment in the audio source file of the song, the method further comprises:

obtaining preset audio fragment division data from a cloud server; or

obtaining preset audio fragment division data from local storage on the mobile terminal;

wherein the preset audio fragment division data is used to indicate the time period in which the preset audio fragment is located in the audio source file.

6. The method according to claim 1, characterized in that the step of adjusting the frequency of the collected vocal audio data to the frequency of the original singer comprises:

determining a gentle adjustment duration according to the length of the preset audio fragment;

within the gentle adjustment duration, gently adjusting the frequency of the collected vocal audio data to the frequency of the original singer;

once the frequency of the collected vocal audio data has been gently adjusted to the frequency of the original singer, continuing to enhance the frequency of the collected vocal audio data until the preset audio fragment ends or the collected vocal audio data is interrupted.

7. The method according to claim 6, characterized in that the step of determining the gentle adjustment duration according to the length of the preset audio fragment comprises:

when the length of the preset audio fragment exceeds a threshold time length, determining the threshold time length as the gentle adjustment duration;

when the length of the preset audio fragment does not exceed the threshold time length, determining the length of the preset audio fragment as the gentle adjustment duration.

8. The method according to claim 6, characterized in that the step of gently adjusting, within the gentle adjustment duration, the frequency of the collected vocal audio data to the frequency of the original singer comprises:

within the gentle adjustment duration, adjusting the frequency of the collected vocal audio data by an adjustment amplitude δ = (f2 - f1) * t / T, the adjusted frequency being f = f1 + δ, until t = T;

wherein t is the time elapsed from the start of the preset audio fragment to the current moment within the gentle adjustment duration, T is the gentle adjustment duration, f1 is the frequency of the collected vocal audio data, and f2 is the frequency of the original singer.

9. The method according to claim 6, characterized in that, when the preset audio fragment ends, the method further comprises:

detecting whether the collected voice audio data is interrupted;

if it is detected that the collected voice audio data is not interrupted, continuing to adjust the frequency of the collected voice audio data so that it remains stable at the frequency of the original singer, or gently reducing the amplitude of the frequency adjustment applied to the collected voice audio data.

10. The method according to claim 1, characterized in that the step of judging whether the frequency of the voice audio data reaches the frequency of the original singer comprises:

calculating a frequency difference between the frequency of the voice audio data and the frequency of the original singer;

judging whether the frequency difference is less than a threshold frequency;

if the frequency difference is less than the threshold frequency, determining that the frequency of the voice audio data reaches the frequency of the original singer.
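The check in claim 10 can be sketched as follows. The claim does not state whether the frequency difference is signed; this sketch assumes the absolute difference is intended, and the function and parameter names are hypothetical.

```python
def reaches_original(voice_hz: float, original_hz: float, threshold_hz: float) -> bool:
    """Return True if the voice frequency counts as reaching the original singer's.

    Assumption: the frequency difference is taken as an absolute value.
    The comparison is strict ("less than"), as worded in the claim.
    """
    frequency_difference = abs(voice_hz - original_hz)
    return frequency_difference < threshold_hz
```

Under this reading, a voice at 435 Hz against an original at 440 Hz with a 10 Hz threshold is treated as having reached the original frequency, whereas a voice at 400 Hz is not and would be adjusted.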

11. A mobile terminal, characterized by comprising:

an audio position determining module, configured to judge whether the time period in the song corresponding to the voice audio data lies within a preset time period;

an evaluation module, configured to judge, if the time period in the song corresponding to the voice audio data lies within the preset time period, whether the frequency of the voice audio data reaches the frequency of the original singer;

an audio adjusting module, configured to adjust, if the frequency of the voice audio data does not reach the frequency of the original singer, the frequency of the collected voice audio data to the frequency of the original singer;

12. The mobile terminal according to claim 11, characterized in that the mobile terminal further comprises:

a preset audio fragment determining module, configured to determine the preset audio fragment in the audio source file of the song.

13. The mobile terminal according to claim 12, characterized in that the preset audio fragment determining module comprises:

a time period determination sub-module, configured to determine, according to a preset voice frequency range and a data correspondence of the audio source file, a target time period in which the frequency in the audio source file of the song lies within the preset voice frequency range;

a preset audio fragment determination sub-module, configured to determine the audio fragment within the target time period as the preset audio fragment.

14. The mobile terminal according to claim 13, characterized in that the time period determination sub-module comprises:

a gender determining unit, configured to determine the gender of the original singer corresponding to the audio data of each time period in the audio source file;

a time period determining unit, configured to determine, according to the gender of the original singer corresponding to each segment of audio data and the preset voice frequency ranges corresponding to different genders, the target time period in each segment of audio data in which the frequency lies within the preset voice frequency range.
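One possible reading of the two units in claim 14 is sketched below. It assumes each segment of the audio source file is already labelled with a time span, the original singer's gender, and a representative frequency. The `Segment` type, the function name, and the numeric ranges in `PRESET_RANGES_HZ` are all hypothetical placeholders and are not taken from the patent.

```python
from typing import Dict, List, NamedTuple, Tuple


class Segment(NamedTuple):
    start_s: float       # segment start time in the song, seconds
    end_s: float         # segment end time in the song, seconds
    gender: str          # gender of the original singer for this segment
    frequency_hz: float  # representative frequency of the segment


# Placeholder ranges for illustration only; the patent gives no values here.
PRESET_RANGES_HZ: Dict[str, Tuple[float, float]] = {
    "female": (700.0, 1100.0),
    "male": (350.0, 520.0),
}


def target_periods(
    segments: List[Segment],
    ranges: Dict[str, Tuple[float, float]] = PRESET_RANGES_HZ,
) -> List[Tuple[float, float]]:
    """Return (start, end) periods whose frequency lies in the gender's preset range."""
    periods = []
    for seg in segments:
        low, high = ranges[seg.gender]  # range chosen by the singer's gender
        if low <= seg.frequency_hz <= high:
            periods.append((seg.start_s, seg.end_s))
    return periods
```

Selecting the range per gender means the same frequency can fall inside the preset range for one singer and outside it for another, which is the point of determining the gender first.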

15. The mobile terminal according to claim 12, characterized in that the mobile terminal further comprises:

an acquisition module, configured to obtain preset audio fragment division data from a cloud server, or to obtain the preset audio fragment division data locally from the mobile terminal; wherein the preset audio fragment division data characterizes the time period in which the preset audio fragment is located in the audio source file.

16. The mobile terminal according to claim 11, characterized in that the audio adjusting module comprises:

a gentle adjustment duration determination sub-module, configured to determine a gentle adjustment duration according to the length of the preset audio fragment;

an adjustment sub-module, configured to gradually adjust, within the gentle adjustment duration, the frequency of the collected voice audio data to the frequency of the original singer; and, after the frequency of the collected voice audio data has been gently adjusted to the frequency of the original singer, to continue enhancing the frequency of the collected voice audio data until the preset audio fragment ends or the collected voice audio data is interrupted.

17. The mobile terminal according to claim 16, characterized in that

the gentle adjustment duration determination sub-module is specifically configured to: when the length of the preset audio fragment exceeds a threshold time length, determine the threshold time length as the gentle adjustment duration; and when the length of the preset audio fragment does not exceed the threshold time length, determine the length of the preset audio fragment as the gentle adjustment duration.

18. The mobile terminal according to claim 16, characterized in that

the adjustment sub-module is specifically configured to adjust, within the gentle adjustment duration, the frequency of the collected voice audio data by an adjustment amplitude δ = (f2 - f1) * t / T, the adjusted frequency being f = f1 + δ, until t = T; wherein t is the time elapsed from the start time of the preset audio fragment to the current time within the gentle adjustment duration, T is the gentle adjustment duration, f1 is the frequency of the collected voice audio data, and f2 is the frequency of the original singer.

19. The mobile terminal according to claim 16, characterized in that

the adjustment sub-module is further configured to detect whether the collected voice audio data is interrupted; and, if it is detected that the collected voice audio data is not interrupted, to continue adjusting the frequency of the collected voice audio data so that it remains stable at the frequency of the original singer, or to gently reduce the amplitude of the frequency adjustment applied to the collected voice audio data.

20. The mobile terminal according to claim 11, characterized in that the evaluation module comprises:

a frequency difference calculating sub-module, configured to calculate a frequency difference between the frequency of the voice audio data and the frequency of the original singer;

a frequency difference assessment sub-module, configured to judge whether the frequency difference is less than a threshold frequency, and, if the frequency difference is less than the threshold frequency, to determine that the frequency of the voice audio data reaches the frequency of the original singer.