CN108449504A

CN108449504A - Voice communication data detection method, device, storage medium and mobile terminal

Info

Publication number: CN108449504A
Application number: CN201810201668.0A
Authority: CN
Inventors: 郑志勇; 柳明; 李智豪
Original assignee: Guangdong Oppo Mobile Telecommunications Corp Ltd
Current assignee: Guangdong Oppo Mobile Telecommunications Corp Ltd
Priority date: 2018-03-12
Filing date: 2018-03-12
Publication date: 2018-08-24
Anticipated expiration: 2038-03-12
Also published as: CN108449504B

Abstract

The embodiments of the present application disclose a voice communication data detection method, device, storage medium and mobile terminal. The method includes: after a voice call group in a preset application is successfully established and a howling detection event is detected to have been triggered, obtaining downlink voice call data of a preset duration and dividing it into blocks; for each data block, obtaining in the frequency domain a first frequency point, which has the maximum energy value in the high-frequency region, and a second frequency point, which has the maximum energy value in the low-frequency region; when the first frequency point meets a preset suspected-howling condition, determining that the first frequency point is a suspected howling point in the current data block; and when there are multiple suspected howling point groups that exhibit a periodic feature, and the energy values corresponding to the suspected howling points show a rising trend in the order of the data blocks to which they belong, determining that howling is present in the downlink voice call data. By adopting the above technical solution, the embodiments of the present application can perform howling detection on downlink voice call data in a timely and accurate manner.

Description

Voice communication data detection method, device, storage medium and mobile terminal

Technical field

The present application relates to the technical field of voice communication, and in particular to a voice communication data detection method, device, storage medium and mobile terminal.

Background

At present, with the rapid popularization of mobile terminals, mobile terminals such as mobile phones and tablet computers have become one of people's indispensable means of communication. The ways in which mobile terminal users communicate are increasingly varied and are no longer limited to traditional services such as telephone calls and short messages provided by mobile network operators. In many scenarios, users prefer Internet-based communication, such as the voice chat and video chat functions in various social applications.

In addition, the functions of application programs (Application, APP) in mobile terminals are increasingly complete, and many applications have a built-in voice call function that makes it easy for users of the same application to communicate with one another. Taking game applications as an example, some games that require interaction between players have a built-in voice call function, so that a user can talk to other players while playing on a mobile terminal. However, during a voice call the voice call data contains many types of sound, such as each player's speech, sounds of the application itself (such as the game's background sound or sound effects), and other sounds in the environment of the mobile terminal. Because the sound is complex, howling is likely to occur, which seriously affects the user's experience.

Summary of the invention

The embodiments of the present application provide a voice communication data detection method, device, storage medium and mobile terminal, which can detect howling in a timely and accurate manner when the voice call function of an application on a mobile terminal is turned on.

In a first aspect, an embodiment of the present application provides a voice communication data detection method, including:

after a voice call group in a preset application is successfully established, detecting that a howling detection event is triggered;

obtaining downlink voice call data of a preset duration in the mobile terminal, and dividing the downlink voice call data into blocks;

for each data block, determining the suspected howling point present in the current data block by using a preset analysis method;

when there are multiple suspected howling point groups that exhibit a periodic feature, and the energy values corresponding to the suspected howling points show a rising trend in the order of the data blocks to which they belong, determining that howling is present in the downlink voice call data; wherein a suspected howling point group consists of suspected howling points in consecutive adjacent data blocks whose frequency difference is within a preset range, and the number of the consecutive adjacent data blocks reaches a preset continuity threshold.

In a second aspect, an embodiment of the present application provides a voice communication data detection device, including:

a detection trigger module, configured to detect that a howling detection event is triggered after a voice call group in a preset application is successfully established;

a downlink voice data acquisition module, configured to obtain downlink voice call data of a preset duration in the mobile terminal, and to divide the downlink voice call data into blocks;

a suspected howling point determination module, configured to determine, for each data block, the suspected howling point present in the current data block by using a preset analysis method;

a howling determination module, configured to determine that howling is present in the downlink voice call data when there are multiple suspected howling point groups that exhibit a periodic feature and the energy values corresponding to the suspected howling points show a rising trend in the order of the data blocks to which they belong; wherein a suspected howling point group consists of suspected howling points in consecutive adjacent data blocks whose frequency difference is within a preset range, and the number of the consecutive adjacent data blocks reaches a preset continuity threshold.

In a third aspect, an embodiment of the present application provides a computer-readable storage medium on which a computer program is stored; when the program is executed by a processor, the voice communication data detection method described in the embodiments of the present application is implemented.

In a fourth aspect, an embodiment of the present application provides a mobile terminal, including a memory, a processor, and a computer program stored in the memory and executable on the processor; when the processor executes the computer program, the voice communication data detection method described in the embodiments of the present application is implemented.

In the voice communication data detection solution provided in the embodiments of the present application, after a voice call group in a preset application is successfully established and a howling detection event is detected to have been triggered, downlink voice call data of a preset duration in the mobile terminal is obtained and divided into blocks; for each data block, it is determined whether a suspected howling point exists; and whether howling is present in the downlink voice call data is then determined quickly from the distribution of the suspected howling points. By adopting the above technical solution, after the voice call group of a preset application in the mobile terminal is successfully established, howling detection can be performed on the downlink voice call data in a timely and accurate manner, so that appropriate measures can subsequently be taken to reduce the inconvenience that howling causes to the user.

Description of the drawings

Fig. 1 is a schematic flowchart of a voice communication data detection method provided by an embodiment of the present application;

Fig. 2 is a schematic flowchart of another voice communication data detection method provided by an embodiment of the present application;

Fig. 3 is a structural block diagram of a voice communication data detection device provided by an embodiment of the present application;

Fig. 4 is a schematic structural diagram of a mobile terminal provided by an embodiment of the present application;

Fig. 5 is a schematic structural diagram of another mobile terminal provided by an embodiment of the present application.

Detailed description

The technical solution of the present application is further described below with reference to the accompanying drawings and specific embodiments. It should be understood that the specific embodiments described herein are only used to explain the present application and do not limit it. It should also be noted that, for ease of description, the drawings show only the parts related to the present application rather than the entire structure.

Before the exemplary embodiments are discussed in more detail, it should be mentioned that some exemplary embodiments are described as processes or methods depicted as flowcharts. Although a flowchart describes the steps as sequential processing, many of the steps can be performed in parallel, concurrently or simultaneously. In addition, the order of the steps can be rearranged. The processing may be terminated when its operations are completed, but it may also have additional steps not included in the drawings. The processing may correspond to a method, function, procedure, subroutine, subprogram, and so on.

Fig. 1 is a schematic flowchart of a voice communication data detection method provided by an embodiment of the present application. The method may be executed by a voice communication data detection device, where the device may be implemented in software and/or hardware and may generally be integrated in a mobile terminal. As shown in Fig. 1, the method includes:

Step 101: after a voice call group in a preset application is successfully established, it is detected that a howling detection event is triggered.

Illustratively, the mobile terminal in the embodiments of the present application may include mobile devices such as mobile phones and tablet computers. The preset application may be an application with a built-in group voice call function, such as an online game application, an online classroom application, a video conferencing application, or another application that requires multi-person collaboration.

Illustratively, a voice call group may contain 2 members, but in most cases it contains 3 or more members, that is, it enables voice calls among 3 or more mobile terminals. A voice call group may be initiated and established by a user who is using the preset application on a mobile terminal. After the voice call group is successfully established, all mobile terminals in the group can communicate with one another. In general, when a mobile terminal is neither in silent mode nor in earphone mode, it can be understood to be in speaker mode: the voice of each user in the voice call group is captured by the microphone of that user's own mobile terminal and, after network transmission and processing, is played by the loudspeakers of the other users' mobile terminals. Taking a game application as an example, when players need to team up for cooperative play, the team voice function can be turned on. Suppose there are 5 players in the team. After the voice call group is successfully established, these 5 people can talk to one another, and any player can hear what the other 4 players say at the same time, as if the other 4 players were talking right beside them, which makes in-game communication convenient. The executing entity of the technical solution of the present application, i.e. the current mobile terminal, may be any mobile terminal in the voice call group, or one or several designated mobile terminals in the voice call group. That is, the method provided by the embodiments of the present application may be executed by any one mobile terminal in the voice call group, by one or more designated mobile terminals, or by all of the mobile terminals.

In general, when a mobile terminal is in speaker mode, the sound captured by its microphone includes not only the user's own speech but may also include sound produced by the preset application itself and played by the loudspeaker, such as background music, as well as sound from the surrounding environment and the speech of other members of the voice call group played by the loudspeaker. Thus, when multiple mobile terminals each send the data they captured, containing these various sounds, over the network to the same mobile terminal (for example, if the voice call group contains 5 mobile terminals, 4 of them each send the sound they captured to the server, and the server sends the audio data of these 4 mobile terminals to the 5th mobile terminal), these sounds are mixed and played on that mobile terminal, and howling may occur.

In the embodiments of the present application, in order to perform howling detection at a suitable time, the condition under which the howling detection event is triggered can be set in advance. Optionally, in order to detect howling promptly and effectively in real time, the howling detection event may be triggered immediately after the voice call group in the preset application is successfully established. Optionally, in order to perform howling detection in a more targeted way while saving the extra power consumption caused by the detection operation, scenarios in which howling is likely to occur may be identified through theoretical analysis, investigation or the like, a reasonable preset scenario may be set, and the howling detection event is triggered when the mobile terminal is detected to be in the preset scenario.

Step 102: downlink voice call data of a preset duration in the mobile terminal is obtained, and the downlink voice call data is divided into blocks.

Illustratively, the downlink voice call data may be data that the server corresponding to the preset application, after receiving the voice data of the other mobile terminals in the voice call group, sends to the mobile terminal after processing such as audio mixing, or data that the server forwards directly to the mobile terminal; the present application does not limit how the server processes voice call data. In the related art, after receiving downlink voice call data from the server, the mobile terminal plays it through the loudspeaker without performing howling detection. In the present application, after the howling detection event is detected to have been triggered, the downlink voice call data is not played directly but is first analysed to determine whether howling is present in it.

In the embodiments of the present application, the preset duration may be determined according to factors such as the specific configuration of the mobile terminal, its data processing capability, and the timeliness required of the voice call, and is not limited by the embodiments of the present application. For example, it may be any duration between 1 and 2 seconds. The downlink voice call data may be divided into blocks according to a preset unit length, which may for example be 40 milliseconds. Assuming the preset duration is 1.2 seconds and the preset unit length is 40 milliseconds, the data can be divided into 30 data blocks.
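As a minimal sketch of the block division described above, the following Python fragment splits a buffer of samples into 40 ms blocks. The function name `split_into_blocks`, the 16 kHz sample rate and the choice to drop a trailing partial block are illustrative assumptions, not details taken from the application.

```python
def split_into_blocks(samples, sample_rate_hz=16000, block_ms=40):
    """Split a sample sequence into consecutive fixed-length blocks.

    Assumption: trailing samples that do not fill a whole block are dropped.
    """
    block_len = sample_rate_hz * block_ms // 1000   # samples per block
    n_blocks = len(samples) // block_len
    return [samples[i * block_len:(i + 1) * block_len] for i in range(n_blocks)]


# 1.2 s of (silent) audio at 16 kHz is 19200 samples.
samples = [0] * 19200
blocks = split_into_blocks(samples)
print(len(blocks), len(blocks[0]))
```

With the example values from the text (1.2 seconds, 40 milliseconds), this yields 30 blocks of 640 samples each.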

Step 103: for each data block, the suspected howling point present in the current data block is determined by using a preset analysis method.

The embodiments of the present application do not specifically limit the preset analysis method. For example, the preset analysis method includes: obtaining, in the frequency domain, a candidate frequency point in the high-frequency region whose energy value is higher than a preset energy threshold; calculating the energy difference value of a preset number of frequency points around the candidate frequency point; and, when the energy difference value is greater than a preset difference threshold, determining that the candidate frequency point is a suspected howling point. The high-frequency region is the frequency band whose frequency is higher than a preset frequency threshold.

Illustratively, the current data block may first be transformed from the time domain to the frequency domain to facilitate spectrum analysis. The embodiments of the present application do not limit the transform method; a Fourier transform may be used, for example the fast algorithm for the discrete Fourier transform (Fast Fourier Transform, FFT). Taking 40 ms as an example, 40 ms of audio data (16-bit, 16 kHz sampling rate) occupies 40*16*16/8 = 1280 bytes, which is suitable for spectrum analysis with a 1024-point FFT. The frequency range analysed after FFT processing is 0 to 16K/2, and the step is (16K/2)/1024, which is approximately 8 Hz.
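The arithmetic above can be checked with a short sketch. It assumes 16-bit mono samples at 16 kHz and, so that the bin spacing matches the stated step of (16K/2)/1024, zero-pads each 640-sample block to 2048 points, which gives 1024 bins between 0 and 8 kHz. The padding choice, the dB reference (a full-scale sine reads as 0 dB) and all function names are assumptions made for illustration only.

```python
import cmath
import math

SAMPLE_RATE_HZ = 16000
BLOCK_MS = 40
BLOCK_SAMPLES = SAMPLE_RATE_HZ * BLOCK_MS // 1000   # 640 samples per block
BLOCK_BYTES = BLOCK_SAMPLES * 16 // 8               # 1280 bytes of 16-bit audio
BIN_HZ = (SAMPLE_RATE_HZ / 2) / 1024                # 7.8125 Hz, i.e. about 8 Hz


def fft(x):
    """Recursive radix-2 FFT; len(x) must be a power of two."""
    n = len(x)
    if n == 1:
        return list(x)
    even = fft(x[0::2])
    odd = fft(x[1::2])
    out = [0j] * n
    for k in range(n // 2):
        t = cmath.exp(-2j * math.pi * k / n) * odd[k]
        out[k] = even[k] + t
        out[k + n // 2] = even[k] - t
    return out


def block_spectrum_db(block, n_fft=2048):
    """Per-bin level in dB for bins 0 .. n_fft/2 - 1 (assumed 0 dB = full-scale sine)."""
    padded = [complex(s) for s in block] + [0j] * (n_fft - len(block))
    spec = fft(padded)
    ref = len(block) / 2   # peak-bin magnitude of a full-scale sine
    return [20 * math.log10(max(abs(spec[k]) / ref, 1e-12))
            for k in range(n_fft // 2)]


# A synthetic 3 kHz tone standing in for one 40 ms block of call audio.
block = [math.sin(2 * math.pi * 3000 * n / SAMPLE_RATE_HZ)
         for n in range(BLOCK_SAMPLES)]
energies = block_spectrum_db(block)
peak_bin = max(range(len(energies)), key=energies.__getitem__)
print(BLOCK_BYTES, BIN_HZ, peak_bin * BIN_HZ)
```

In practice an optimized FFT library would replace the pure-Python transform; it is written out here only to keep the sketch self-contained.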

In the embodiments of the present application, the preset frequency threshold may be used as the boundary between the high-frequency region and the other regions. The preset frequency threshold may be set according to the actual situation, for example according to the frequency of the human voice and the frequency characteristics at which howling tends to occur, and may for example be 1 kHz, 1.5 kHz or 2 kHz. If the preset frequency threshold is 2 kHz, the part above 2 kHz is the high-frequency region. Howling generally occurs in the high-frequency region and is loud (i.e. its energy value is high), so the embodiments of the present application can quickly determine the suspected howling point in a data block from the distribution of energy values.

Illustratively, the energy value corresponding to each frequency point in the data block is obtained, the candidate frequency points whose energy value is higher than the preset energy threshold are found in the high-frequency region, and the energy difference value of the preset number of frequency points around each candidate frequency point is calculated. The preset energy threshold and the preset number may be set according to actual requirements; for example, the preset energy threshold may be -10 dB and the preset number may be 8 (4 before the candidate frequency point and 4 after it). Taking the step of about 8 Hz above as an example, if the frequency of the candidate frequency point is 3362 Hz, the frequencies of the preset number of surrounding frequency points are approximately 3330 Hz, 3338 Hz, 3346 Hz, 3354 Hz, 3370 Hz, 3378 Hz, 3386 Hz and 3394 Hz. The energy difference value measures how much the candidate frequency point differs from the preset number of surrounding frequency points; specifically, it may be the difference between the maximum and minimum energy values, or the energy variance, the energy mean square deviation, and so on, which the present application does not limit. The preset difference threshold corresponds to the energy difference value; for example, when the energy difference value is the energy variance, the preset difference threshold is a preset variance threshold. When the energy difference value is greater than the preset difference threshold, the candidate frequency point stands out and is very likely a howling point, so the candidate frequency point is determined to be a suspected howling point. This arrangement allows suspected howling points to be identified quickly and accurately, laying the foundation for improving howling detection efficiency.
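One way to read the preset analysis method is sketched below. It takes a list of per-bin energy values in dB and uses the max-minus-min variant of the energy difference value. The function name, the 20 dB difference threshold and the synthetic spectrum are hypothetical; the 2 kHz boundary, the -10 dB energy threshold and the 4 + 4 neighbouring frequency points come from the examples in the text.

```python
def find_suspected_howling_point(energies_db, bin_hz,
                                 freq_threshold_hz=2000.0,
                                 energy_threshold_db=-10.0,
                                 neighbors_per_side=4,
                                 diff_threshold_db=20.0):
    """Return (frequency_hz, energy_db) of a suspected howling point, or None."""
    # First bin whose frequency is strictly above the preset frequency threshold.
    first_high_bin = int(freq_threshold_hz / bin_hz) + 1
    candidates = [k for k in range(first_high_bin, len(energies_db))
                  if energies_db[k] > energy_threshold_db]
    # Examine candidates starting from the one with the highest energy.
    for k in sorted(candidates, key=lambda i: energies_db[i], reverse=True):
        lo = max(k - neighbors_per_side, 0)
        hi = min(k + neighbors_per_side, len(energies_db) - 1)
        window = energies_db[lo:hi + 1]          # candidate plus its neighbours
        if max(window) - min(window) > diff_threshold_db:
            return k * bin_hz, energies_db[k]
    return None


BIN_HZ = 7.8125
spectrum = [-60.0] * 1024     # quiet background (made-up data)
spectrum[100] = -1.0          # loud, but below 2 kHz, so it is ignored
spectrum[430] = -3.0          # narrow peak near 3359 Hz
result = find_suspected_howling_point(spectrum, BIN_HZ)
print(result)
```

Swapping the max-minus-min measure for a variance only changes the line that compares against `diff_threshold_db`, together with the threshold's units.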

Illustratively, a data block may contain multiple candidate frequency points; in the present application, the suspected howling point determination may start from the candidate frequency point with the highest energy.

Step 104: when there are multiple suspected howling point groups that exhibit a periodic feature, and the energy values corresponding to the suspected howling points show a rising trend in the order of the data blocks to which they belong, it is determined that howling is present in the downlink voice call data.

Here, a suspected howling point group consists of suspected howling points in consecutive adjacent data blocks whose frequency difference is within a preset range, and the number of the consecutive adjacent data blocks reaches a preset continuity threshold.

Illustratively, the presence of a suspected howling point in one data block does not mean that the whole segment of downlink voice call audio contains howling; some special sound may have been misidentified as howling. For example, the harsh sound produced when objects rub together generally has a high frequency and is loud, so it is likely to be identified as suspected howling, but such a sound is generally brief and does not constitute howling. Further judgment is therefore needed.

In the embodiments of the present application, the distribution of the suspected howling points present in the data blocks is analysed. When suspected howling points with a small frequency difference exist in several consecutive adjacent data blocks, these suspected howling points are referred to as a suspected howling point group. That is, a suspected howling point group consists of suspected howling points in consecutive adjacent data blocks whose frequency difference is within a preset range, and the number of the consecutive adjacent data blocks reaches a preset continuity threshold. The preset continuity threshold may be determined according to the actual situation, for example 3; the preset range for the frequency difference may also be determined according to the actual situation, for example 40 Hz. The inventors found that howling generally persists for a short time, recurs periodically, and becomes progressively louder. Therefore, in the embodiments of the present application, the decision condition is that multiple (which can be understood as 2 or more) suspected howling point groups exhibit a periodic feature and the energy values corresponding to the suspected howling points show a rising trend in the order of the data blocks to which they belong, and this condition is used to identify whether howling is present in the current downlink voice call data. If the condition is met, it is determined that howling is present; in this way howling can be identified quickly and accurately.
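The grouping rule can be sketched as a single pass over the per-block results. In this sketch each block contributes either one suspected howling point or `None`; comparing every point against the first point of the current run is one possible reading of "frequency difference within a preset range", and the function name and sample data are hypothetical.

```python
def find_suspected_groups(points, max_freq_diff_hz=40.0, min_run=3):
    """Group suspected howling points found in consecutive adjacent blocks.

    points[i] is (freq_hz, energy_db) for block i, or None if block i has no
    suspected howling point. Returns a list of groups, each a list of
    (block_index, freq_hz, energy_db) tuples.
    """
    groups, run = [], []
    for i, p in enumerate(points):
        if p is not None and run and abs(p[0] - run[0][1]) < max_freq_diff_hz:
            run.append((i, p[0], p[1]))      # extends the current run
            continue
        if len(run) >= min_run:              # run ended: keep it if long enough
            groups.append(run)
        run = [(i, p[0], p[1])] if p is not None else []
    if len(run) >= min_run:
        groups.append(run)
    return groups


points = [(3360.0, -8.0), (3368.0, -7.0), (3352.0, -6.0), None,
          (3360.0, -5.0), (3364.0, -5.0), None,
          (5000.0, -4.0), (5008.0, -4.0), (4992.0, -3.0)]
groups = find_suspected_groups(points)
print([[block for block, _, _ in g] for g in groups])
```

Blocks 4 and 5 are discarded because a run of two blocks does not reach the continuity threshold of 3.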

Illustratively, suppose the downlink voice call data is divided into 30 data blocks. For example, if suspected howling points with frequencies in the interval (A-40, A+40) are detected in the 15 data blocks numbered 1, 2, 3, 7, 8, 9, 13, 14, 15, 19, 20, 21, 25, 26 and 27, the suspected howling points of every 3 consecutive data blocks form one suspected howling point group, the 5 suspected howling point groups exhibit a periodic feature, and the energy values corresponding to the suspected howling points increase successively, then it is determined that the downlink voice call data contains howling. As another example, if suspected howling points with frequencies in the interval (B-40, B+40) are detected only in data blocks 1, 2 and 3, the suspected howling points of these 3 data blocks form one suspected howling point group, but it is the only group and no periodic feature is exhibited, so it can be determined that the downlink voice call data does not contain howling.
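The two worked examples above can be reproduced with a small decision function. The text does not define "periodic feature" or "rising trend" numerically, so this sketch assumes that periodic means equal spacing between the first blocks of successive groups and that rising means each group's mean energy exceeds that of the previous group. The function name, the 3360 Hz stand-in for A and the energy values are made up for illustration.

```python
def howling_present(groups, period_tolerance_blocks=0):
    """Decide whether groups of suspected howling points indicate howling.

    groups: list of groups in block order, each a list of
    (block_index, freq_hz, energy_db) tuples.
    """
    if len(groups) < 2:                      # "multiple" means 2 or more groups
        return False
    starts = [g[0][0] for g in groups]
    gaps = [b - a for a, b in zip(starts, starts[1:])]
    periodic = max(gaps) - min(gaps) <= period_tolerance_blocks
    mean_energy = [sum(e for _, _, e in g) / len(g) for g in groups]
    rising = all(b > a for a, b in zip(mean_energy, mean_energy[1:]))
    return periodic and rising


# Blocks 1-3, 7-9, 13-15, 19-21 and 25-27, with energy growing from group to group.
five_groups = [[(start + j, 3360.0, -9.0 + n) for j in range(3)]
               for n, start in enumerate((1, 7, 13, 19, 25))]
one_group = five_groups[:1]
print(howling_present(five_groups), howling_present(one_group))
```

A least-squares slope over the individual point energies would be an equally plausible reading of "rising trend"; the group-mean comparison is used here only because it is short.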

In the voice communication data detection method provided in the embodiments of the present application, after a voice call group in a preset application is successfully established and a howling detection event is detected to have been triggered, downlink voice call data of a preset duration in the mobile terminal is obtained and divided into blocks; for each data block, it is determined whether a suspected howling point exists; and whether howling is present in the downlink voice call data is then determined quickly from the distribution of the suspected howling points. By adopting the above technical solution, after the voice call group of a preset application in the mobile terminal is successfully established, howling detection can be performed on the downlink voice call data in a timely and accurate manner, so that appropriate measures can subsequently be taken to reduce the inconvenience that howling causes to the user.

In some embodiments, after it is determined that howling is present in the downlink voice call data, the method further includes: determining the suspected howling points to be howling points; and performing howling suppression processing on the downlink voice call data according to the howling points. Once it is determined that howling is present in the downlink voice call data, the previously identified suspected howling points that satisfy the howling decision condition are indeed howling points, so howling suppression needs to be applied to the downlink voice call data according to the howling points, to prevent howling from being played through the loudspeaker or receiver and affecting the user. Further, after the howling suppression processing, the processed downlink voice call data is played through the loudspeaker or receiver.

In some embodiments, performing howling suppression processing on the downlink voice call data according to the howling points includes: selecting the frequencies of a preset number of howling points with the highest energy values as target frequencies, and attenuating the audio signal corresponding to the target frequencies in the downlink voice call data. The preset number can be set freely, for example 1, 3 or even more, and may also be determined dynamically according to the number of howling points. The howling points may be sorted by energy value from high to low, the preset number of howling points at the top may be selected, and the frequencies of the selected howling points are determined to be the target frequencies. The higher the energy value, the louder the howling and the greater its effect on the user. The advantage of this arrangement is that howling suppression can be targeted at the frequencies with higher energy values, which improves suppression efficiency and preserves the timeliness of the voice call.
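The selection of target frequencies amounts to a sort followed by a slice, as in the following sketch. The function name and the sample howling points are hypothetical.

```python
def pick_target_frequencies(howling_points, count=3):
    """Return the frequencies of the `count` howling points with the highest energy.

    howling_points: iterable of (freq_hz, energy_db) tuples.
    """
    ranked = sorted(howling_points, key=lambda p: p[1], reverse=True)
    return [freq for freq, _ in ranked[:count]]


howling_points = [(3360.0, -6.0), (5000.0, -2.0), (4200.0, -9.0), (6100.0, -4.0)]
targets = pick_target_frequencies(howling_points, count=2)
print(targets)
```

Passing `count=len(howling_points)` selects every howling point.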

In some embodiments, performing howling suppression processing on the downlink voice call data according to the howling points may also include: attenuating the audio signals corresponding to the frequencies of all howling points in the downlink voice call data. The advantage of this arrangement is that all howling points are suppressed comprehensively, preventing howling from being played.

Illustratively, a notch filter may be used to attenuate the audio signal corresponding to the frequency of the howling point to be suppressed (i.e. the target frequency). A notch filter rapidly attenuates the input signal at a particular frequency, thereby blocking the signal at that frequency from passing. The present application does not limit the type or specific parameter values of the notch filter. In general, the target frequency is used as the centre frequency of the notch filter, and parameters such as the processing bandwidth and gain of the notch filter may be set according to actual requirements.
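Since the application leaves the filter type open, the sketch below picks one common option as an assumption: a second-order (biquad) notch using the coefficient formulas from the widely circulated Audio EQ Cookbook, with Q derived from the centre frequency and a chosen bandwidth. The function names, the 100 Hz bandwidth and the test tones are illustrative, and this simple form has no separate gain parameter.

```python
import math


def notch_coefficients(center_hz, sample_rate_hz, bandwidth_hz):
    """Biquad notch coefficients (b, a), normalized so that a[0] == 1."""
    q = center_hz / bandwidth_hz
    w0 = 2 * math.pi * center_hz / sample_rate_hz
    alpha = math.sin(w0) / (2 * q)
    a0 = 1 + alpha
    b = [1 / a0, -2 * math.cos(w0) / a0, 1 / a0]
    a = [1.0, -2 * math.cos(w0) / a0, (1 - alpha) / a0]
    return b, a


def apply_biquad(samples, b, a):
    """Filter samples with the direct-form I difference equation."""
    x1 = x2 = y1 = y2 = 0.0
    out = []
    for x in samples:
        y = b[0] * x + b[1] * x1 + b[2] * x2 - a[1] * y1 - a[2] * y2
        out.append(y)
        x2, x1 = x1, x
        y2, y1 = y1, y
    return out


def rms(values):
    return math.sqrt(sum(v * v for v in values) / len(values))


FS = 16000
b, a = notch_coefficients(3000.0, FS, bandwidth_hz=100.0)
howl = [math.sin(2 * math.pi * 3000 * n / FS) for n in range(3200)]   # target frequency
voice = [math.sin(2 * math.pi * 500 * n / FS) for n in range(3200)]   # far from the notch
# Measure the second half only, after the filter's start-up transient has decayed.
howl_rms = rms(apply_biquad(howl, b, a)[1600:])
voice_rms = rms(apply_biquad(voice, b, a)[1600:])
print(howl_rms, voice_rms)
```

A narrower bandwidth removes less of the neighbouring speech content but takes longer to settle, which is the kind of trade-off the text leaves to actual requirements.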

In some embodiments, after the suspected howling point is determined to be a howling point, the method may further include: setting a suppression mark for the howling point. After howling suppression processing is performed on the downlink voice call data according to the howling point, the method further includes: continuing to obtain downlink voice call data of the preset time length; when it is determined that the new downlink voice call data contains a suspected howling point, judging whether the suspected howling point has been set with a suppression mark; and if so, performing howling suppression processing on the new downlink voice call data according to the suspected howling point for which the suppression mark is set. The advantage of this arrangement is as follows: when suspected howling points continue to appear after a segment of downlink voice call data containing howling, a suspected howling point that already appeared in the preceding segment is very likely to be a howling point. It can therefore be suppressed directly without being judged again, which omits the judgment step, saves power, and improves the timeliness of the voice call. Optionally, if no suppression mark is set, whether the suspected howling point is a howling point continues to be judged in the manner of the above embodiment (i.e., step 104). Optionally, after a suppression mark is set for a howling point, the method further includes: updating a howling index according to the howling point for which the suppression mark is set. The benefit is that the time at which a howling point occurs is recorded promptly, which makes it convenient to subsequently determine the time difference between a suspected howling point and an existing marked howling point, and thus to judge more accurately whether the suspected howling point is a howling point. In addition, when a suspected howling point is judged to be a howling point according to step 104, a suppression mark is likewise set for the new howling point and the howling index is updated.
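The shortcut described above can be sketched as follows. This is an illustration under stated assumptions: the application does not define the data structure of the howling index, so the dictionary keyed by frequency, the matching tolerance `tolerance_hz`, and the `judge` callback standing in for the full judgment of step 104 are all hypothetical.

```python
from typing import Callable, Dict, Optional


class SuppressionMarks:
    """Remembers confirmed howling points so later matches skip re-judgment."""

    def __init__(self, tolerance_hz: float = 50.0) -> None:
        self.tolerance_hz = tolerance_hz  # assumed matching tolerance
        # Assumed form of the howling index: frequency -> time last observed.
        self.howling_index: Dict[float, float] = {}

    def mark(self, frequency_hz: float, timestamp_s: float) -> None:
        """Set (or refresh) the suppression mark and update the index."""
        self.howling_index[frequency_hz] = timestamp_s

    def find_marked(self, frequency_hz: float) -> Optional[float]:
        """Return the marked frequency matching frequency_hz, if any."""
        for marked in self.howling_index:
            if abs(marked - frequency_hz) <= self.tolerance_hz:
                return marked
        return None


def should_suppress(marks: SuppressionMarks, frequency_hz: float,
                    timestamp_s: float,
                    judge: Callable[[float], bool]) -> bool:
    """Decide whether a suspected howling point should be suppressed."""
    marked = marks.find_marked(frequency_hz)
    if marked is not None:
        marks.mark(marked, timestamp_s)  # already marked: skip the judgment
        return True
    if judge(frequency_hz):  # full judgment, as in step 104
        marks.mark(frequency_hz, timestamp_s)
        return True
    return False
```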

In some embodiments, detecting that a howling detection event is triggered includes: judging whether there is, in the voice call group, a target mobile terminal whose distance from the mobile terminal is less than a preset distance value, and if so, determining that the howling detection event is detected as triggered. In multi-party voice scenarios, the inventors found that howling occurs easily when two mobile terminals are close to each other. Suppose mobile terminal A and mobile terminal B in the voice call group are close together. The speaker of mobile terminal A amplifies and plays the received sound captured by the microphone of mobile terminal B. Because the two mobile terminals are close, this sound is captured again by the microphone of mobile terminal B and sent to mobile terminal A, where it is amplified and played again. Positive feedback amplification of the sound easily forms, producing a howling sound. Therefore, in the embodiments of the present application, it may first be judged whether any other mobile terminal in the voice call is close to the current mobile terminal; if so, the howling detection event is triggered and is thus detected as triggered. The preset distance value may be, for example, 20 meters or 10 meters, and can be set according to actual requirements.

In the embodiments of the present application, there may be many specific ways of judging whether there is, in the voice call group, a target mobile terminal whose distance from the mobile terminal is less than the preset distance value, and no limitation is imposed. Several ways are given below as illustrative examples.

1. Playing a preset sound segment in a preset manner, and receiving feedback information from the other mobile terminals in the voice call group, the feedback information including the result of the other mobile terminals attempting to capture a sound signal corresponding to the preset sound segment; and judging, according to the feedback information, whether there is in the voice call group a target mobile terminal whose distance from the mobile terminal is less than the preset distance value.

The advantage of this arrangement is that whether a target mobile terminal exists can be judged quickly and accurately, so that whether the howling detection event needs to be triggered can be determined quickly. Illustratively, a pre-recorded or pre-obtained sound segment may be played through the speaker at a preset volume; or an ultrasonic segment of a preset frequency and preset intensity may be played through an ultrasonic transmitter. Correspondingly, the other mobile terminals may capture the sound signal corresponding to the preset sound segment through a microphone or an ultrasonic receiver. The preset volume, or the preset frequency and preset intensity, can be set according to the preset distance value. The result included in the feedback information may indicate whether the other mobile terminal was able to capture the sound signal. When another mobile terminal can capture the sound signal corresponding to the preset sound segment, the distance between the two mobile terminals is less than the preset distance value. The feedback information may be forwarded by the server corresponding to the preset application. In addition, the feedback information may also include attribute information of the captured sound signal, such as sound intensity. Because the intensity of the sound played by the mobile terminal is known and sound attenuates as it propagates (the farther the propagation distance, the greater the attenuation), the distance between the other mobile terminal and the current mobile terminal can be determined from the intensity information of the sound signal in the feedback information, and whether that distance is less than the preset distance value can then be judged.
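One way to turn the reported sound intensity into a distance is sketched below. It assumes ideal free-field spreading, in which the sound level falls by 20·log10(d) dB relative to the level at 1 meter; real rooms, obstacles, and microphone differences will deviate from this. The function names and the convention of expressing the played level as a level at 1 meter are assumptions for the example.

```python
def estimate_distance_m(played_level_db_at_1m: float,
                        received_level_db: float) -> float:
    """Estimate distance from attenuation, assuming free-field spreading."""
    attenuation_db = played_level_db_at_1m - received_level_db
    return 10.0 ** (attenuation_db / 20.0)


def is_within_preset_distance(played_level_db_at_1m: float,
                              received_level_db: float,
                              preset_distance_m: float) -> bool:
    """True if the estimated distance is below the preset distance value."""
    distance = estimate_distance_m(played_level_db_at_1m, received_level_db)
    return distance < preset_distance_m
```

Under this model, a segment played at 80 dB (measured at 1 meter) and received at 60 dB corresponds to an estimated distance of about 10 meters.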

2. Obtaining first positioning information of the mobile terminal and second positioning information of the other mobile terminals in the voice call group; and judging, according to the first positioning information and the second positioning information, whether there is in the voice call group a target mobile terminal whose distance from the mobile terminal is less than the preset distance value.

The advantage of this arrangement is that mobile terminals generally have a positioning function, so positioning information can be used to judge quickly and accurately whether a target mobile terminal exists, and thus to determine quickly whether the howling detection event needs to be triggered. Illustratively, the mobile terminal may obtain positioning information through a positioning system such as the Global Positioning System (GPS) or BeiDou, or through base station positioning, network positioning, or the like. The positioning information may include latitude and longitude coordinates. The second positioning information of the other mobile terminals in the voice call group may be forwarded to the current mobile terminal by the server corresponding to the preset application. The current mobile terminal compares its own first positioning information one by one with the at least one piece of second positioning information forwarded by the server, and judges whether the distance between any second positioning information and the first positioning information is less than the preset distance value.
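The one-by-one comparison can be sketched with the haversine great-circle formula, as below. The application does not name a distance formula; haversine, the mean Earth radius constant, and the function names are assumptions for this example. In practice, positioning error can exceed a preset distance of 10 to 20 meters, so the result is only an approximation.

```python
import math
from typing import Iterable, Tuple

EARTH_RADIUS_M = 6371000.0  # mean Earth radius


def haversine_m(lat1: float, lon1: float, lat2: float, lon2: float) -> float:
    """Great-circle distance in meters between two latitude/longitude points."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlam = math.radians(lon2 - lon1)
    h = (math.sin(dphi / 2.0) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2.0) ** 2)
    return 2.0 * EARTH_RADIUS_M * math.asin(math.sqrt(h))


def has_target_terminal(own: Tuple[float, float],
                        others: Iterable[Tuple[float, float]],
                        preset_distance_m: float) -> bool:
    """True if any other terminal is closer than the preset distance value."""
    return any(haversine_m(own[0], own[1], lat, lon) < preset_distance_m
               for lat, lon in others)
```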

3. Obtaining first WiFi information of the WiFi to which the mobile terminal is connected and second WiFi information of the WiFi to which the other mobile terminals in the voice call group are connected; and judging, according to the first WiFi information and the second WiFi information, whether there is in the voice call group a target mobile terminal whose distance from the mobile terminal is less than the preset distance value.

The advantage of this arrangement is that, to save data charges, users generally conduct voice calls while connected to a Wi-Fi hotspot. This characteristic can be used to judge quickly and accurately whether a target mobile terminal exists, and thus to determine quickly whether the howling detection event needs to be triggered. Illustratively, the WiFi information may include attribute information of the Wi-Fi hotspot, such as the hotspot name or the media access control (MAC) address of the hotspot, and may also include the WiFi signal strength. In general, the effective signal range of a Wi-Fi hotspot is limited, typically about 50 meters (radius). If the preset distance value is greater than the effective signal range of the Wi-Fi hotspot, whether a target mobile terminal exists can be determined according to whether the Wi-Fi hotspot attribute information of any second WiFi information is identical to that of the first WiFi information. If the Wi-Fi hotspot attribute information of any piece of second WiFi information is identical to that of the first WiFi information, it is determined that a target mobile terminal exists in the voice call group. That is, when another mobile terminal in the voice call group is connected to the same Wi-Fi hotspot as the current mobile terminal, that mobile terminal is considered to be a target mobile terminal. In addition, if the preset distance value is less than the effective signal range of the Wi-Fi hotspot, for example 10 meters, the distance of each mobile terminal connected to the same Wi-Fi hotspot from the hotspot can be further estimated from the WiFi signal strength, the distance between the two mobile terminals can then be determined, and whether that distance is less than the preset distance value can be judged.
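The two WiFi cases can be sketched as follows. The application does not say how signal strength is converted into distance; this example assumes a log-distance path-loss model with a hypothetical reference strength of -40 dBm at 1 meter and a path-loss exponent of 3. Because two distances to the hotspot do not fix the distance between the terminals, the sketch uses their sum as a conservative upper bound (triangle inequality), which is also an assumption. All names are illustrative.

```python
from typing import NamedTuple


class WifiInfo(NamedTuple):
    """Hypothetical WiFi report: hotspot MAC address and signal strength."""
    bssid: str
    rssi_dbm: float


def distance_to_hotspot_m(rssi_dbm: float, rssi_at_1m_dbm: float = -40.0,
                          path_loss_exponent: float = 3.0) -> float:
    """Log-distance path-loss estimate of the distance to the hotspot."""
    return 10.0 ** ((rssi_at_1m_dbm - rssi_dbm) / (10.0 * path_loss_exponent))


def is_target_terminal(own: WifiInfo, other: WifiInfo,
                       preset_distance_m: float,
                       hotspot_range_m: float = 50.0) -> bool:
    """Judge whether `other` counts as a target mobile terminal."""
    if own.bssid.lower() != other.bssid.lower():
        return False  # different hotspots: no conclusion drawn from WiFi
    if preset_distance_m > hotspot_range_m:
        return True  # same hotspot is sufficient in this case
    # Upper bound on the terminal-to-terminal distance via the hotspot.
    upper_bound_m = (distance_to_hotspot_m(own.rssi_dbm)
                     + distance_to_hotspot_m(other.rssi_dbm))
    return upper_bound_m < preset_distance_m
```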

4. Obtaining first sound data captured by the microphone, and obtaining the downlink voice call data in the mobile terminal, wherein the first sound data does not contain the sound played by the speaker of the mobile terminal; and judging, according to whether the first sound data and the downlink voice call data contain the voice of the same person, whether there is in the voice call group a target mobile terminal whose distance from the mobile terminal is less than the preset distance value.

The advantage of this arrangement is that whether a target mobile terminal exists can be judged quickly and accurately without relying on other information (such as the above positioning information or WiFi information), so that whether the howling detection event needs to be triggered can be determined quickly. Illustratively, ensuring that the first sound data does not contain the sound played by the speaker of the mobile terminal may be achieved in either of the following ways: the speaker of the mobile terminal is turned off while the first sound data and the downlink voice call data are obtained; or the speaker of the mobile terminal is turned on while the first sound data and the downlink voice call data are obtained, and the first sound data is the sound data obtained by filtering the sound played by the speaker out of all the sound data captured by the microphone. Consider two users holding mobile terminals close to each other, with user A using mobile terminal A and user B using mobile terminal B. User A's speech is captured by the microphone of mobile terminal A and sent to mobile terminal B, so the downlink voice call data of mobile terminal B contains user A's speech. Because user A and user B are close together, user A's speech is also captured by the microphone of mobile terminal B. Therefore, for mobile terminal B, both the first sound data captured by the microphone and the obtained downlink voice call data contain the voice of the same person (user A). It is thus determined that the distance between mobile terminal A and mobile terminal B in the voice call group is less than the preset distance value; that is, for mobile terminal B, mobile terminal A is a target mobile terminal.

It can be understood that any one of the above ways, or a combination of several of them, can be chosen according to actual conditions to judge whether a target mobile terminal exists, and the embodiments of the present application impose no limitation. In addition, the steps for judging whether a target mobile terminal exists may also be completed by the server corresponding to the preset application. When the server judges that a target mobile terminal exists, it sends the judgment result to the mobile terminal, and the judgment result instructs the mobile terminal to trigger the howling detection event. Correspondingly, the method of the embodiments of the present application further includes receiving the judgment result sent by the server corresponding to the preset application, and triggering the howling detection event when the judgment result includes the following content: there is, in the voice call group, a target mobile terminal whose distance from the mobile terminal is less than the preset distance value. For the specific judgment process of the server, reference may be made to the several judgment ways provided above, which are not repeated here.

In the embodiments of the present application, when two mobile terminals in the voice call group are close to each other and howling occurs, howling is not avoided by muting the speaker; instead, howling suppression processing is performed on the downlink voice call data. This is determined by the particular application scenario addressed by the embodiments of the present application. Suppose the voice call group has 3 members a, b, and c, of whom a and b are close to each other. If the speaker of b's mobile terminal were turned off, a's speech would no longer be played on b's mobile terminal, but at the same time c's speech would not be played on b's mobile terminal either. Member b could not hear c, and the voice call group would lose its purpose. Therefore, given the requirements of this particular application scenario, the inventors chose to perform howling suppression processing on the downlink voice call data to solve the howling problem.

In some embodiments, after it is determined that a howling sound exists in the downlink voice call data, the method further includes: obtaining sound data captured by the mobile terminal; performing a voice and background sound separation operation on the sound data; weakening the separated background sound; and mixing the weakened background sound with the separated voice, and sending the result as uplink voice call data to the server corresponding to the preset application. The advantage of this arrangement is that howling caused by background sound can be effectively weakened. Illustratively, when the mobile terminal has a microphone array (the number of microphones is greater than or equal to 2), the sound source position can be determined, and sounds far from the mobile terminal (for example, more than 1 meter away) can be filtered out according to the sound source position and treated as background sound. Alternatively, voiceprint information of the mobile terminal user can be obtained in advance, the user's speech extracted from the sound data according to the voiceprint information and treated as voice, and the remaining sound treated as background sound. Illustratively, weakening the separated background sound may mean reducing the volume of the background sound by adjusting the gain, or filtering out the background sound entirely. After weakening, the volume of the background sound is lowered, the condition for positive feedback amplification of the sound is broken, and howling caused by background sound is thus effectively weakened.
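The weakening and mixing steps can be sketched as below, assuming the voice and background sound have already been separated into two sample sequences of equal length with values in the range -1.0 to 1.0. The separation itself (by sound source position or voiceprint) is not shown. The function name and the default gain of 0.2 are assumptions; a gain of 0.0 corresponds to filtering the background sound out entirely.

```python
from typing import List, Sequence


def mix_uplink(voice: Sequence[float], background: Sequence[float],
               background_gain: float = 0.2) -> List[float]:
    """Mix the voice with the weakened background sound, sample by sample."""
    if len(voice) != len(background):
        raise ValueError("voice and background must have the same length")
    mixed = []
    for v, b in zip(voice, background):
        sample = v + background_gain * b  # weaken background by the gain
        mixed.append(max(-1.0, min(1.0, sample)))  # clip to the valid range
    return mixed
```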

Fig. 2 is a schematic flowchart of another voice call data detection method provided by an embodiment of the present application. Taking the case where the preset application is an online game application as an example, the method includes the following steps:

Step 201: detecting that a voice call group in the preset game application is successfully established.

Illustratively, take a team battle game such as Honor of Kings as an example. Each team has 5 players, and the red and blue teams fight each other. The 5 players of each team need to communicate to discuss battle strategy, so many players choose to turn on the in-team voice call function. After a player requests that the in-team voice call function be turned on, the voice call group is successfully established. Thereafter, any one of the 5 players on the same team can hear the other 4 players speaking. In general, players set the mobile terminal to speaker (hands-free) mode for convenience while playing.

Step 202: judging whether there is, in the voice call group, a target mobile terminal whose distance from the mobile terminal is less than the preset distance value; if so, executing step 203; otherwise, repeating step 202.

If, among the 5 players, the mobile terminals of two players are close to each other (for example, two friends playing together at home) and both are set to speaker mode, howling is very likely to occur. Therefore, in the embodiments of the present application, it may first be judged whether any other mobile terminal in the voice call group is close to the current mobile terminal; if so, howling detection needs to be performed.

Optionally, any one of the ways described above, or a combination of several of them, may be used in the embodiments of the present application to judge whether a target mobile terminal exists, and the embodiments of the present application impose no limitation.

Step 203: obtaining downlink voice call data of a preset time length in the mobile terminal.

Illustratively, the downlink voice call data contains the sound captured by the microphones of the mobile terminals of the other 4 teammates. This sound generally includes not only the speech of the 4 teammates but also the sound played by the speakers of the 4 teammates' mobile terminals, other ambient sounds, and so on. Typically, the game server collects the uplink voice call data uploaded by the other 4 mobile terminals and sends the uplink voice call data of those 4 mobile terminals to the current mobile terminal.

Step 204: dividing the downlink voice call data into blocks.

Step 205: for each data block, determining the suspected howling points present in the current data block using a preset analysis mode.

The preset analysis mode includes: obtaining, in the frequency domain, a frequency point to be judged whose energy value in the high-frequency region is higher than a preset energy threshold; calculating the energy difference value between the frequency point to be judged and a preset number of frequency points around it; and when the energy difference value is greater than a preset difference threshold, determining that the frequency point to be judged is a suspected howling point. The high-frequency region is the frequency range in which the frequency is higher than a preset frequency threshold.

Step 206: judging whether there are multiple suspected howling point groups exhibiting a periodic characteristic and whether the energy values corresponding to the suspected howling points show a rising trend in the order of the data blocks to which they belong; if so, executing step 207; otherwise, returning to step 203.

Step 207: determining that a howling sound exists in the downlink voice call data, and determining the suspected howling points to be howling points.

Step 208: selecting a preset number of howling points with the highest corresponding energy values, taking their frequencies as target frequencies, and using a notch filter to attenuate the audio signal corresponding to the target frequencies in the downlink voice call data.

Step 209: obtaining sound data captured by the mobile terminal, performing a voice and background sound separation operation on the sound data, weakening the separated background sound, mixing the weakened background sound with the separated voice, and sending the result as uplink voice call data to the server corresponding to the preset game application.

In the embodiments of the present application, after the voice call group in the game application is successfully established, if a target mobile terminal close to the current mobile terminal is detected in the voice call group, howling detection is performed. When it is determined that a howling sound exists, howling suppression is applied to both the uplink voice call data and the downlink voice call data. This effectively weakens the howling sound, prevents it from interfering with the game, reduces a pain point for game players, and makes the functions of the mobile terminal more complete.

Fig. 3 is a structural block diagram of a voice call data detection device provided by an embodiment of the present application. The device may be implemented in software and/or hardware, is generally integrated in a mobile terminal, and can perform howling detection on voice call data by executing the voice call data detection method. As shown in Fig. 3, the device includes:

A detection trigger module 301, configured to detect that a howling detection event is triggered after a voice call group in a preset application is successfully established;

A downlink voice data acquisition module 302, configured to obtain downlink voice call data of a preset time length in the mobile terminal, and to divide the downlink voice call data into blocks;

A suspected howling point determination module 303, configured to determine, for each data block, the suspected howling points present in the current data block using a preset analysis mode;

A howling sound determination module 304, configured to determine that a howling sound exists in the downlink voice call data when there are multiple suspected howling point groups exhibiting a periodic characteristic and the energy values corresponding to the suspected howling points show a rising trend in the order of the data blocks to which they belong; wherein a suspected howling point group consists of suspected howling points in consecutive adjacent data blocks whose frequency differences are within a preset range, and the number of the consecutive adjacent data blocks reaches a preset continuity threshold.
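The group condition checked by module 304 can be sketched as follows. This is a simplified illustration that looks for a single qualifying group and assumes at most one suspected howling point per data block, given as a `(frequency, energy)` pair or `None`. It also reads "rising trend" as strictly increasing energy from block to block. These simplifications, the function name, and the default thresholds are assumptions, not requirements of the application.

```python
from typing import List, Optional, Tuple

Point = Tuple[float, float]  # (frequency in Hz, energy)


def has_rising_howling_group(block_points: List[Optional[Point]],
                             freq_range_hz: float = 100.0,
                             continuity_threshold: int = 3) -> bool:
    """Detect a run of adjacent blocks with close frequencies and rising energy."""
    run_length = 0
    previous: Optional[Point] = None
    for point in block_points:
        if point is None:  # a block without a suspected point breaks the run
            run_length, previous = 0, None
            continue
        freq, energy = point
        if (previous is not None
                and abs(freq - previous[0]) <= freq_range_hz
                and energy > previous[1]):
            run_length += 1  # same group, energy still rising
        else:
            run_length = 1  # start a new candidate group at this block
        if run_length >= continuity_threshold:
            return True
        previous = point
    return False
```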

With the voice call data detection device provided by the embodiments of the present application, after the voice call group in the preset application is successfully established and a howling detection event is detected as triggered, downlink voice call data of a preset time length in the mobile terminal is obtained and divided into blocks. For each data block, whether suspected howling points exist is determined. Whether a howling sound exists in the downlink voice call data is then determined quickly according to the distribution of the suspected howling points. With the above technical solution, after the voice call group of the preset application in the mobile terminal is successfully established, howling detection can be performed on the downlink voice call data promptly and accurately, so that appropriate measures can subsequently be taken and the inconvenience that howling causes to the user is reduced.

Optionally, the preset analysis mode includes: obtaining, in the frequency domain, a frequency point to be judged whose energy value in the high-frequency region is higher than a preset energy threshold; calculating the energy difference value between the frequency point to be judged and a preset number of frequency points around it; and when the energy difference value is greater than a preset difference threshold, determining that the frequency point to be judged is a suspected howling point. The high-frequency region is the frequency range in which the frequency is higher than a preset frequency threshold.
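A minimal sketch of this analysis for one data block is given below. It uses a plain discrete Fourier transform from the standard library so that the example is self-contained; a production implementation would use an FFT. The application does not fix how the energy difference is computed, so this example assumes it is the ratio, in decibels, of the candidate's energy to the mean energy of the surrounding frequency points. All function names and threshold defaults are assumptions.

```python
import cmath
import math
from typing import List, Sequence, Tuple


def power_spectrum(block: Sequence[float]) -> List[float]:
    """Energy per frequency bin (0 to Nyquist) from a direct DFT."""
    n = len(block)
    spectrum = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                  for t, x in enumerate(block))
        spectrum.append(abs(acc) ** 2 / n)
    return spectrum


def find_suspected_howling_points(block: Sequence[float],
                                  sample_rate_hz: float,
                                  freq_threshold_hz: float = 2000.0,
                                  energy_threshold: float = 1.0,
                                  neighbours: int = 4,
                                  diff_threshold_db: float = 10.0
                                  ) -> List[Tuple[float, float]]:
    """Return (frequency, energy) pairs of suspected howling points."""
    spectrum = power_spectrum(block)
    bin_hz = sample_rate_hz / len(block)
    points = []
    for k, energy in enumerate(spectrum):
        freq = k * bin_hz
        # Only high-frequency bins above the energy threshold are candidates.
        if freq <= freq_threshold_hz or energy <= energy_threshold:
            continue
        lo = max(0, k - neighbours)
        hi = min(len(spectrum), k + neighbours + 1)
        around = [spectrum[i] for i in range(lo, hi) if i != k]
        mean_around = sum(around) / len(around)
        # Assumed difference measure: candidate vs. mean of neighbours, in dB.
        diff_db = 10.0 * math.log10(energy / max(mean_around, 1e-12))
        if diff_db > diff_threshold_db:
            points.append((freq, energy))
    return points
```

A narrow, isolated peak such as a feedback tone stands far above its neighbouring bins, whereas speech energy is spread over many adjacent bins, which is why the difference from the surrounding frequency points is a useful cue.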

Optionally, the device further includes:

A howling point determination module, configured to determine the suspected howling points to be howling points after it is determined that a howling sound exists in the downlink voice call data.

A howling suppression module, configured to perform howling suppression processing on the downlink voice call data according to the howling points.

Optionally, the howling suppression module is specifically configured to:

Select a preset number of howling points with the highest corresponding energy values, take their frequencies as target frequencies, and attenuate the audio signal corresponding to the target frequencies in the downlink voice call data; or,

Attenuate the audio signals corresponding to the frequencies of all howling points in the downlink voice call data.

Optionally, detecting that a howling detection event is triggered includes:

Judging whether there is, in the voice call group, a target mobile terminal whose distance from the mobile terminal is less than a preset distance value, and if so, determining that the howling detection event is detected as triggered.

Optionally, judging whether there is, in the voice call group, a target mobile terminal whose distance from the mobile terminal is less than the preset distance value includes:

Playing a preset sound segment in a preset manner, and receiving feedback information from the other mobile terminals in the voice call group, the feedback information including the result of the other mobile terminals attempting to capture a sound signal corresponding to the preset sound segment; and judging, according to the feedback information, whether there is in the voice call group a target mobile terminal whose distance from the mobile terminal is less than the preset distance value;

Alternatively,

Obtaining first positioning information of the mobile terminal and second positioning information of the other mobile terminals in the voice call group; and judging, according to the first positioning information and the second positioning information, whether there is in the voice call group a target mobile terminal whose distance from the mobile terminal is less than the preset distance value;

Alternatively,

Obtaining first WiFi information of the WiFi to which the mobile terminal is connected and second WiFi information of the WiFi to which the other mobile terminals in the voice call group are connected; and judging, according to the first WiFi information and the second WiFi information, whether there is in the voice call group a target mobile terminal whose distance from the mobile terminal is less than the preset distance value;

Alternatively,

Obtaining first sound data captured by the microphone, and obtaining the downlink voice call data in the mobile terminal, wherein the first sound data does not contain the sound played by the speaker of the mobile terminal; and judging, according to whether the first sound data and the downlink voice call data contain the voice of the same person, whether there is in the voice call group a target mobile terminal whose distance from the mobile terminal is less than the preset distance value.

Optionally, which further includes：

Voice data acquisition module obtains after there is sound of uttering long and high-pitched sounds in determining the downlink voice communicating data The voice data of the mobile terminal acquisition；

Sound separation module, for carrying out voice and background sound lock out operation to the voice data；

Background sound weakens module, for carrying out weakening process to the background sound isolated；

An uplink data sending module, configured to mix the weakened background sound with the separated voice and send the result, as uplink voice communication data, to the server corresponding to the preset application.
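By way of non-limiting illustration, the weakening and mixing performed by the last two modules could be sketched as follows. The sketch assumes that the voice and the background sound have already been separated into two equal-length sequences of floating-point samples in the range [-1.0, 1.0]; the function name, the fixed gain and the clipping step are assumptions of this sketch and are not specified by the present application.

```python
def weaken_and_mix(voice, background, gain=0.2):
    """Attenuate the background samples by `gain` and mix them with the voice samples.

    voice and background are equal-length sequences of floats in [-1.0, 1.0].
    The mixed samples are clipped back into [-1.0, 1.0].
    """
    if len(voice) != len(background):
        raise ValueError("voice and background must have the same length")
    if not 0.0 <= gain <= 1.0:
        raise ValueError("gain must be between 0.0 and 1.0")
    mixed = []
    for v, b in zip(voice, background):
        sample = v + gain * b  # weakened background added to the voice
        mixed.append(max(-1.0, min(1.0, sample)))
    return mixed
```

The returned samples would then be encoded and sent as the uplink voice communication data; a gain of 0.0 removes the background sound entirely.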

Optionally, the preset application is an online game application.

An embodiment of the present application also provides a storage medium containing computer-executable instructions which, when executed by a computer processor, are used to perform a voice communication data detection method, the method including:

Storage medium: any of various types of memory devices or storage devices. The term "storage medium" is intended to include: installation media, such as a CD-ROM, floppy disk or tape device; computer system memory or random access memory, such as DRAM, DDR RAM, SRAM, EDO RAM, Rambus RAM, etc.; non-volatile memory, such as flash memory or magnetic media (for example, a hard disk or optical storage); and registers or other similar types of memory elements. The storage medium may further include other types of memory or combinations thereof. In addition, the storage medium may be located in a first computer system in which the program is executed, or may be located in a different, second computer system that is connected to the first computer system through a network (such as the Internet). The second computer system may provide program instructions to the first computer system for execution. The term "storage medium" may include two or more storage media that may reside in different locations (for example, in different computer systems connected through a network). The storage medium may store program instructions (for example, embodied as a computer program) executable by one or more processors.

Of course, in the storage medium containing computer-executable instructions provided by the embodiments of the present application, the computer-executable instructions are not limited to the voice communication data detection operations described above, and may also perform related operations in the voice communication data detection method provided by any embodiment of the present application.

An embodiment of the present application provides a mobile terminal, in which the voice communication data detection device provided by the embodiments of the present application may be integrated. Fig. 4 is a schematic structural diagram of a mobile terminal provided by an embodiment of the present application. The mobile terminal 400 may include: a memory 401, a processor 402, and a computer program stored on the memory 401 and executable on the processor 402, wherein the processor 402, when executing the computer program, implements the voice communication data detection method described in the embodiments of the present application.

The mobile terminal provided by the embodiments of the present application can, after a voice communication group of a preset application in the mobile terminal is successfully established, perform howling detection on the downlink voice communication data in a timely and accurate manner, so that appropriate measures can subsequently be taken to reduce the inconvenience that howling sound causes to the user.

Fig. 5 is a schematic structural diagram of another mobile terminal provided by an embodiment of the present application. The mobile terminal may include: a housing (not shown), a memory 501, a central processing unit (CPU) 502 (also called a processor, hereinafter referred to as the CPU), a circuit board (not shown) and a power supply circuit (not shown). The circuit board is placed inside the space enclosed by the housing; the CPU 502 and the memory 501 are arranged on the circuit board; the power supply circuit is used to supply power to each circuit or device of the mobile terminal; the memory 501 is used to store executable program code; and the CPU 502 runs a computer program corresponding to the executable program code by reading the executable program code stored in the memory 501, so as to implement the following steps:

The mobile terminal further includes: a peripheral interface 503, an RF (Radio Frequency) circuit 505, an audio circuit 506, a loudspeaker 511, a power management chip 508, an input/output (I/O) subsystem 509, other input/control devices 510, a touch screen 512 and an external port 504. These components communicate through one or more communication buses or signal lines 507.

It should be understood that the illustrated mobile terminal 500 is only one example of a mobile terminal, and that the mobile terminal 500 may have more or fewer components than shown in the drawings, may combine two or more components, or may have a different configuration of components. The various components shown in the drawings may be implemented in hardware, software, or a combination of hardware and software, including one or more signal processing and/or application-specific integrated circuits.

The mobile terminal for howling detection on voice communication data provided in this embodiment is described in detail below, taking a mobile phone as an example.

Memory 501: the memory 501 can be accessed by the CPU 502, the peripheral interface 503 and so on. The memory 501 may include high-speed random access memory, and may also include non-volatile memory, such as one or more magnetic disk storage devices, flash memory devices, or other volatile solid-state storage devices.

Peripheral interface 503: the peripheral interface 503 can connect the input and output peripherals of the device to the CPU 502 and the memory 501.

I/O subsystem 509: the I/O subsystem 509 can connect the input/output peripherals of the device, such as the touch screen 512 and the other input/control devices 510, to the peripheral interface 503. The I/O subsystem 509 may include a display controller 5091 and one or more input controllers 5092 for controlling the other input/control devices 510. The one or more input controllers 5092 receive electrical signals from, or send electrical signals to, the other input/control devices 510, which may include physical buttons (push buttons, rocker buttons, etc.), dials, slide switches, joysticks and click wheels. It is worth noting that an input controller 5092 may be connected to any of the following: a keyboard, an infrared port, a USB interface, and a pointing device such as a mouse.

Touch screen 512: the touch screen 512 is the input interface and output interface between the user's mobile terminal and the user, and displays visual output to the user. The visual output may include graphics, text, icons, video and the like.

The display controller 5091 in the I/O subsystem 509 receives electrical signals from, or sends electrical signals to, the touch screen 512. The touch screen 512 detects contact on the touch screen, and the display controller 5091 converts the detected contact into interaction with a user interface object displayed on the touch screen 512, thereby realizing human-computer interaction. The user interface object displayed on the touch screen 512 may be an icon for running a game, an icon for connecting to a corresponding network, and so on. It is worth noting that the device may also include an optical mouse, which is a touch-sensitive surface that does not display visual output, or an extension of the touch-sensitive surface formed by the touch screen.

The RF circuit 505 is mainly used to establish communication between the mobile phone and the wireless network (i.e. the network side), and to receive and send data between the mobile phone and the wireless network, for example sending and receiving short messages and e-mail. Specifically, the RF circuit 505 receives and sends RF signals, which are also called electromagnetic signals. The RF circuit 505 converts electrical signals into electromagnetic signals or electromagnetic signals into electrical signals, and communicates with communication networks and other devices through the electromagnetic signals. The RF circuit 505 may include known circuitry for performing these functions, including but not limited to an antenna system, an RF transceiver, one or more amplifiers, a tuner, one or more oscillators, a digital signal processor, a CODEC (COder-DECoder) chipset, a subscriber identity module (SIM) and so on.

The audio circuit 506 is mainly used to receive audio data from the peripheral interface 503, convert the audio data into an electrical signal, and send the electrical signal to the loudspeaker 511.

The loudspeaker 511 is used to restore to sound the voice signal that the mobile phone receives from the wireless network through the RF circuit 505, and to play the sound to the user.

The power management chip 508 is used to supply power to, and manage the power of, the hardware connected to the CPU 502, the I/O subsystem and the peripheral interface.

The voice communication data detection device, storage medium and mobile terminal provided in the above embodiments can perform the voice communication data detection method provided by any embodiment of the present application, and have the corresponding functional modules and beneficial effects for performing that method. For technical details not described in detail in the above embodiments, reference may be made to the voice communication data detection method provided by any embodiment of the present application.

Note that the above are only preferred embodiments of the present application and the technical principles applied. Those skilled in the art will understand that the present application is not limited to the specific embodiments described here, and that various obvious changes, readjustments and substitutions can be made by those skilled in the art without departing from the scope of protection of the present application. Therefore, although the present application has been described in further detail through the above embodiments, it is not limited to the above embodiments; it may include other equivalent embodiments without departing from the concept of the present application, and its scope is determined by the scope of the appended claims.

Claims

1. A voice communication data detection method, characterized by comprising:

Acquiring downlink voice communication data of a preset time length in a mobile terminal, and dividing the downlink voice communication data into data blocks;

When there are multiple suspected howling point groups exhibiting a periodic feature, and the energy values corresponding to the suspected howling points show a rising trend in the order of the data blocks to which they belong, determining that howling sound exists in the downlink voice communication data; wherein a suspected howling point group consists of the suspected howling points in consecutive adjacent data blocks whose frequency differences are within a preset range, and the number of the consecutive adjacent data blocks reaches a preset continuity threshold.
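By way of non-limiting illustration only, and not as part of the claim, the determination described above could be sketched as follows. The input is assumed to hold at most one suspected howling point per data block. The function names, the reading of "periodic feature" as roughly equal spacing between the first blocks of successive groups, and the reading of "rising trend" as strictly increasing peak energy from group to group are all assumptions of this sketch; the claim does not define them.

```python
def find_groups(points, max_freq_diff, min_run):
    """Collect suspected howling point groups.

    points holds one entry per data block, in order: (freq_hz, energy) or None.
    A group is a run of at least min_run consecutive blocks whose suspected
    points differ in frequency by at most max_freq_diff from block to block.
    Each group is a list of (block_index, freq_hz, energy) tuples.
    """
    groups, run = [], []
    for i, p in enumerate(points):
        if p is not None and run and abs(p[0] - run[-1][1]) <= max_freq_diff:
            run.append((i, p[0], p[1]))
            continue
        if len(run) >= min_run:
            groups.append(run)
        run = [(i, p[0], p[1])] if p is not None else []
    if len(run) >= min_run:
        groups.append(run)
    return groups


def is_periodic(groups, tolerance_blocks=1, min_groups=3):
    """Assumed test: the gaps between group start blocks are nearly equal."""
    if len(groups) < max(min_groups, 2):
        return False
    starts = [g[0][0] for g in groups]
    gaps = [b - a for a, b in zip(starts, starts[1:])]
    return max(gaps) - min(gaps) <= tolerance_blocks


def is_rising(groups):
    """Assumed test: the peak energy of each group exceeds that of the previous one."""
    peaks = [max(e for _, _, e in g) for g in groups]
    return all(b > a for a, b in zip(peaks, peaks[1:]))


def howling_present(points, max_freq_diff=50.0, min_run=3):
    groups = find_groups(points, max_freq_diff, min_run)
    return is_periodic(groups) and is_rising(groups)
```

Here `max_freq_diff` plays the role of the preset range and `min_run` the role of the preset continuity threshold; the default values are placeholders.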

2. The method according to claim 1, characterized in that the preset analysis mode comprises: obtaining, in the frequency domain, candidate frequency points in the high-frequency region whose energy values are higher than a preset energy threshold; calculating an energy difference value with respect to a preset number of frequency points around each candidate frequency point; and, when the energy difference value is greater than a preset difference threshold, determining the candidate frequency point to be a suspected howling point; wherein the high-frequency region is the frequency range in which the frequency is higher than a preset frequency threshold.
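By way of non-limiting illustration only, and not as part of the claim, the preset analysis mode could be sketched as follows for a single data block. The sketch assumes the block has already been transformed to the frequency domain (for example by an FFT) into a list of per-bin energies. The function name, the parameter names, and the reading of "energy difference value" as the candidate's energy minus the mean energy of its neighbouring bins are assumptions of this sketch.

```python
def suspected_howling_points(energy, bin_hz, freq_threshold_hz,
                             energy_threshold, neighbours, diff_threshold):
    """Return (frequency_hz, energy) for each suspected howling point in one block.

    energy[k] is the energy of frequency bin k, whose frequency is k * bin_hz.
    neighbours is the number of bins examined on each side of a candidate.
    """
    found = []
    for k, e in enumerate(energy):
        # Keep only high-frequency bins whose energy exceeds the energy threshold.
        if k * bin_hz <= freq_threshold_hz or e <= energy_threshold:
            continue
        lo = max(0, k - neighbours)
        hi = min(len(energy), k + neighbours + 1)
        around = [energy[j] for j in range(lo, hi) if j != k]
        if not around:
            continue
        # Assumed difference measure: candidate energy minus mean neighbour energy.
        if e - sum(around) / len(around) > diff_threshold:
            found.append((k * bin_hz, e))
    return found
```

A narrow peak that stands well above its neighbours is reported, whereas a broadband region of uniformly high energy is not, since its difference value stays small.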

3. The method according to claim 1, characterized in that, after determining that howling sound exists in the downlink voice communication data, the method further comprises:

Determining the suspected howling points to be howling points;

Performing howling suppression processing on the downlink voice communication data according to the howling points.

4. The method according to claim 3, characterized in that performing howling suppression processing on the downlink voice communication data according to the howling points comprises:

Selecting the frequencies of a preset number of howling points having higher corresponding energy values as target frequencies, and attenuating the audio signal corresponding to the target frequencies in the downlink voice communication data; or,

5. The method according to claim 1, characterized in that detecting that the howling detection event is triggered comprises:

Determining whether the voice communication group contains a target mobile terminal whose distance from the mobile terminal is less than a preset distance value, and if so, determining that the howling detection event is detected as triggered.

6. The method according to claim 5, characterized in that determining whether the voice communication group contains a target mobile terminal whose distance from the mobile terminal is less than a preset distance value comprises:

Playing a preset sound segment in a preset manner, and receiving feedback information from the other mobile terminals in the voice communication group, the feedback information including the result of each other mobile terminal's attempt to capture a sound signal corresponding to the preset sound segment; and determining, according to the feedback information, whether the voice communication group contains a target mobile terminal whose distance from the mobile terminal is less than the preset distance value;

Alternatively,

Acquiring first positioning information of the mobile terminal and second positioning information of the other mobile terminals in the voice communication group; and determining, according to the first positioning information and the second positioning information, whether the voice communication group contains a target mobile terminal whose distance from the mobile terminal is less than the preset distance value;

Alternatively,

Acquiring first WiFi information of the WiFi to which the mobile terminal is connected and second WiFi information of the WiFi to which the other mobile terminals in the voice communication group are connected; and determining, according to the first WiFi information and the second WiFi information, whether the voice communication group contains a target mobile terminal whose distance from the mobile terminal is less than the preset distance value;

Alternatively,

Acquiring first sound data collected by the microphone, and acquiring the downlink voice communication data in the mobile terminal, wherein the first sound data does not contain sound played by the loudspeaker of the mobile terminal; and determining, according to whether the first sound data and the downlink voice communication data contain the voice of the same person, whether the voice communication group contains a target mobile terminal whose distance from the mobile terminal is less than the preset distance value.

7. The method according to claim 1, characterized in that, after determining that howling sound exists in the downlink voice communication data, the method further comprises:

Acquiring the sound data collected by the mobile terminal;

Performing a voice and background sound separation operation on the sound data;

Performing weakening processing on the separated background sound;

Mixing the weakened background sound with the separated voice, and sending the result, as uplink voice communication data, to the server corresponding to the preset application.

8. The method according to claim 1, characterized in that the preset application is an online game application.

9. A voice communication data detection device, characterized by comprising:

A detection trigger module, configured to detect, after a voice communication group in a preset application is successfully established, that a howling detection event is triggered;

A downlink voice data acquisition module, configured to acquire downlink voice communication data of a preset time length in a mobile terminal, and to divide the downlink voice communication data into data blocks;

A suspected howling point determining module, configured to determine, for each data block, the suspected howling points present in the current data block using a preset analysis mode;

A howling sound determining module, configured to determine that howling sound exists in the downlink voice communication data when there are multiple suspected howling point groups exhibiting a periodic feature and the energy values corresponding to the suspected howling points show a rising trend in the order of the data blocks to which they belong; wherein a suspected howling point group consists of the suspected howling points in consecutive adjacent data blocks whose frequency differences are within a preset range, and the number of the consecutive adjacent data blocks reaches a preset continuity threshold.

10. A computer-readable storage medium on which a computer program is stored, characterized in that the program, when executed by a processor, implements the voice communication data detection method according to any one of claims 1-8.

11. A mobile terminal, characterized by comprising a memory, a processor, and a computer program stored on the memory and executable on the processor, wherein the processor, when executing the computer program, implements the voice communication data detection method according to any one of claims 1-8.