CN108494954B - Voice communication data detection method, device, storage medium and mobile terminal - Google Patents

Publication number
CN108494954B
Authority
CN
China
Prior art keywords
pitched sounds
data
long
mobile terminal
little
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201810201127.8A
Other languages
Chinese (zh)
Other versions
CN108494954A (en
Inventor
郑志勇
柳明
李智豪
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong Oppo Mobile Telecommunications Corp Ltd
Original Assignee
Guangdong Oppo Mobile Telecommunications Corp Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangdong Oppo Mobile Telecommunications Corp Ltd
Priority to CN201810201127.8A
Publication of CN108494954A
Priority to PCT/CN2019/076978 (WO2019174492A1)
Application granted
Publication of CN108494954B
Active
Anticipated expiration

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/72Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72403User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality
    • H04M1/72406User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality by software upgrading or downloading
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • G10L21/0232Processing in the frequency domain
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/21Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being power information
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • G10L25/60Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination for measuring the quality of voice signals
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L12/00Data switching networks
    • H04L12/02Details
    • H04L12/16Arrangements for providing special services to substations
    • H04L12/18Arrangements for providing special services to substations for broadcast or conference, e.g. multicast
    • H04L12/189Arrangements for providing special services to substations for broadcast or conference, e.g. multicast in combination with wireless systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/40Support for services or applications
    • H04L65/403Arrangements for multi-party communication, e.g. for conferences
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/72Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72448User interfaces specially adapted for cordless or mobile telephones with means for adapting the functionality of the device according to specific conditions
    • H04M1/72454User interfaces specially adapted for cordless or mobile telephones with means for adapting the functionality of the device according to specific conditions according to context-related or environment-related conditions
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/72Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72484User interfaces specially adapted for cordless or mobile telephones wherein functions are triggered by incoming communication events
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/56Arrangements for connecting several subscribers to a common circuit, i.e. affording conference facilities
    • H04M3/568Arrangements for connecting several subscribers to a common circuit, i.e. affording conference facilities audio processing specific to telephonic conferencing, e.g. spatial distribution, mixing of participants
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W4/00Services specially adapted for wireless communication networks; Facilities therefor
    • H04W4/06Selective distribution of broadcast services, e.g. multimedia broadcast multicast service [MBMS]; Services to user groups; One-way selective calling services
    • H04W4/10Push-to-Talk [PTT] or Push-On-Call services
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W4/00Services specially adapted for wireless communication networks; Facilities therefor
    • H04W4/16Communication-related supplementary services, e.g. call-transfer or call-hold
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/18Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M19/00Current supply arrangements for telephone systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/18Automatic or semi-automatic exchanges with means for reducing interference or noise; with means for reducing effects due to line faults with means for protecting lines
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/22Arrangements for supervision, monitoring or testing
    • H04M3/2236Quality of speech transmission monitoring

Abstract

The embodiments of the present application disclose a voice communication data detection method, device, storage medium and mobile terminal. The method comprises: after a voice call group in a preset application is successfully established, detecting that a howling detection event is triggered; obtaining downlink voice call data of a preset duration and dividing it into M data blocks; analyzing the data blocks one at a time using a preset analysis mode to determine whether the current data block contains a suspected howling point, and taking the first data block in which a suspected howling point appears as the starting data block; starting from the starting data block, successively taking n data blocks as the data segment to be analyzed and using the preset analysis mode to find the suspected howling point contained in the current data segment; and, when the frequency differences between the suspected howling points contained in N data segments fall within a preset range, determining that howling is present in the downlink voice call data. With this technical solution, the embodiments of the present application can accurately perform howling detection on downlink voice call data.

Description

Voice communication data detection method, device, storage medium and mobile terminal
Technical field
The present application relates to the technical field of voice communication, and in particular to a voice communication data detection method, device, storage medium and mobile terminal.
Background art
At present, with the rapid spread of mobile terminals, devices such as mobile phones and tablet computers have become one of the indispensable communication tools. The ways in which mobile terminal users communicate are increasingly varied and are no longer limited to the traditional telephone and short message services provided by mobile network operators. In many scenarios, users prefer Internet-based communication, such as the voice chat and video chat functions of various social applications.
In addition, the functions of applications (Application, APP) on mobile terminals are improving steadily, and many applications provide a built-in voice call function to make it easier for users of the same application to communicate. Taking game applications as an example, some games that require interaction between players have added a built-in voice call function, so that a user can talk with other players while playing the game on a mobile terminal. However, during a voice call the voice call data contains many types of sound, for example each player's speech, sounds from the application itself (such as the game's background music or sound effects) and other sounds in the environment of the mobile terminal. Because the sound is complex, howling occurs easily and seriously affects the user experience.
Summary of the invention
The embodiments of the present application provide a voice communication data detection method, device, storage medium and mobile terminal that can detect howling promptly and accurately when the voice call function of an application on a mobile terminal is enabled.
In a first aspect, an embodiment of the present application provides a voice communication data detection method, comprising:
after a voice call group in a preset application is successfully established, detecting that a howling detection event is triggered;
obtaining downlink voice call data of a preset duration in a mobile terminal, and dividing the downlink voice call data into blocks to obtain M data blocks;
analyzing the data blocks one at a time using a preset analysis mode to determine whether the current data block contains a suspected howling point, and taking the first data block in which a suspected howling point appears as the starting data block;
starting from the starting data block, successively taking n data blocks as the data segment to be analyzed, analyzing the suspected howling point contained in the current data segment using the preset analysis mode, and, when the frequency differences between the suspected howling points contained in N data segments are within a preset range, determining that howling is present in the downlink voice call data; wherein n = 2, 3, ..., N; N is less than or equal to M and greater than or equal to 2; the starting point of each data segment is the same as the starting point of the starting data block, and the starting data block is the first data segment.
In a second aspect, an embodiment of the present application provides a voice communication data detection device, comprising:
a detection trigger module, configured to detect that a howling detection event is triggered after a voice call group in a preset application is successfully established;
a downlink voice data obtaining module, configured to obtain downlink voice call data of a preset duration in a mobile terminal and divide the downlink voice call data into blocks to obtain M data blocks;
a suspected howling point determining module, configured to analyze the data blocks one at a time using a preset analysis mode to determine whether the current data block contains a suspected howling point, and to take the first data block in which a suspected howling point appears as the starting data block;
a howling determining module, configured to, starting from the starting data block, successively take n data blocks as the data segment to be analyzed, analyze the suspected howling point contained in the current data segment using the preset analysis mode, and, when the frequency differences between the suspected howling points contained in N data segments are within a preset range, determine that howling is present in the downlink voice call data; wherein n = 2, 3, ..., N; N is less than or equal to M and greater than or equal to 2; the starting point of each data segment is the same as the starting point of the starting data block, and the starting data block is the first data segment.
In a third aspect, an embodiment of the present application provides a computer-readable storage medium on which a computer program is stored; when the program is executed by a processor, it implements the voice communication data detection method described in the embodiments of the present application.
In a fourth aspect, an embodiment of the present application provides a mobile terminal comprising a memory, a processor and a computer program stored in the memory and executable by the processor; when the processor executes the computer program, it implements the voice communication data detection method described in the embodiments of the present application.
In the voice communication data detection scheme provided by the embodiments of the present application, after a voice call group in a preset application is successfully established, it is detected that a howling detection event is triggered, and downlink voice call data of a preset duration in the mobile terminal is obtained and divided into blocks. The data blocks are analyzed one at a time using a preset analysis mode to determine whether the current data block contains a suspected howling point, and the first data block in which a suspected howling point appears is taken as the starting data block. Starting from the starting data block, n data blocks are successively taken as the data segment to be analyzed, and the suspected howling point contained in the current data segment is analyzed using the preset analysis mode; when the frequency differences between the suspected howling points contained in N data segments are within a preset range, it is determined that howling is present in the downlink voice call data. With this technical solution, once the voice call group of the preset application on the mobile terminal has been established, howling in the downlink voice call data can be detected accurately, so that appropriate measures can be taken afterwards to reduce the inconvenience that howling causes the user.
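The segment-by-segment decision summarised above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the claimed implementation: the function name `howling_present`, the callable `analyse_segment` (standing in for the preset analysis mode), the parameter `max_freq_diff_hz` (standing in for the preset range) and the rule that a segment without a suspected howling point ends the check are all hypothetical choices made for the example.

```python
def howling_present(blocks, start_index, analyse_segment, n_segments, max_freq_diff_hz):
    """Check N growing segments that all begin at the starting data block.

    analyse_segment is a hypothetical callable that returns the frequency (Hz)
    of the suspected howling point in a segment, or None if there is none.
    """
    points = []
    for n in range(1, n_segments + 1):      # segment 1 is the starting block itself
        end = start_index + n
        if end > len(blocks):
            return False                    # assumption: too little data means no decision
        # Concatenate n consecutive blocks into one segment to analyse.
        segment = [s for block in blocks[start_index:end] for s in block]
        point = analyse_segment(segment)
        if point is None:
            return False                    # assumption: a missing point rejects howling
        points.append(point)
    # "Within a preset range" is read here as: the spread of frequencies is small.
    return max(points) - min(points) <= max_freq_diff_hz


# Toy data: six two-sample blocks, starting block at index 2, N = 3.
blocks = [[0, 0] for _ in range(6)]
steady = howling_present(blocks, 2, lambda seg: 3360.0 + len(seg), 3, 16.0)
drifting = howling_present(blocks, 2, lambda seg: 3360.0 + 100 * len(seg), 3, 16.0)
too_short = howling_present(blocks, 5, lambda seg: 3360.0, 3, 16.0)
```

In the toy run, the steady case keeps the suspected frequency within a few hertz across all three segments, while the drifting case moves by hundreds of hertz and is rejected.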
Brief description of the drawings
Fig. 1 is a schematic flowchart of a voice communication data detection method provided by an embodiment of the present application;
Fig. 2 is a schematic flowchart of another voice communication data detection method provided by an embodiment of the present application;
Fig. 3 is a structural block diagram of a voice communication data detection device provided by an embodiment of the present application;
Fig. 4 is a schematic structural diagram of a mobile terminal provided by an embodiment of the present application;
Fig. 5 is a schematic structural diagram of another mobile terminal provided by an embodiment of the present application.
Detailed description of the embodiments
The technical solution of the present application is further described below with reference to the drawings and specific embodiments. It should be understood that the specific embodiments described here are intended only to explain the present application, not to limit it. It should also be noted that, for ease of description, the drawings show only the parts relevant to the present application rather than the entire structure.
Before the exemplary embodiments are discussed in more detail, it should be mentioned that some exemplary embodiments are described as processes or methods depicted as flowcharts. Although a flowchart describes the steps as a sequential process, many of the steps can be performed in parallel, concurrently or simultaneously. In addition, the order of the steps can be rearranged. A process may be terminated when its operations are completed, but it may also have additional steps not shown in the drawings. A process may correspond to a method, function, procedure, subroutine, subprogram and so on.
Fig. 1 is a schematic flowchart of a voice communication data detection method provided by an embodiment of the present application. The method can be performed by a voice communication data detection device, which can be implemented in software and/or hardware and can generally be integrated in a mobile terminal. As shown in Fig. 1, the method comprises:
Step 101, after a voice call group in a preset application is successfully established, detect that a howling detection event is triggered.
By way of example, the mobile terminal in the embodiments of the present application may be a mobile device such as a mobile phone or tablet computer. The preset application may be an application with a built-in group voice call function, such as an online game application, an online class application, a video conferencing application or any other application that requires collaboration among several people.
By way of example, a voice call group may contain 2 members, but in most cases it contains 3 or more, so that voice calls can be held among 3 or more mobile terminals. A voice call group can be initiated and established by a user of the preset application on a mobile terminal; once the group is successfully established, all mobile terminals in it can communicate with one another. In general, a mobile terminal that is neither in silent mode nor in earphone mode can be regarded as being in loudspeaker (external playback) mode: the voice of each user in the voice call group is captured by the microphone of that user's own mobile terminal and, after network transmission and processing, is played through the loudspeakers of the other users' mobile terminals. Taking a game application as an example, when players need to team up for cooperative play, the team voice function can be enabled. Suppose the team has 5 players; after the voice call group is successfully established, these 5 players can talk to one another, and any one player can hear what the other 4 say at the same time, as if they were speaking right beside him or her, which makes communication during the game easier. The executing entity of the technical solution of the present application, that is, the current mobile terminal, may be any mobile terminal in the voice call group, or one or several designated mobile terminals in the group. In other words, the method provided by the embodiments of the present application may be executed by any one mobile terminal in the voice call group, by one or more designated mobile terminals, or by all of the mobile terminals.
In general, when a mobile terminal is in loudspeaker mode, the sound captured by its microphone includes not only the user's own speech but possibly also sound from the preset application itself played through the loudspeaker (such as background music), ambient sound, and the speech of other members of the voice call group played through the loudspeaker. When several mobile terminals each send such mixed audio over the network to the same mobile terminal (for example, if the voice call group contains 5 mobile terminals, 4 of them send the sound they have captured to a server, and the server sends the audio data of those 4 mobile terminals to the 5th), the sounds are mixed and played on that mobile terminal, and howling may result.
In the embodiments of the present application, the conditions under which the howling detection event is triggered can be set in advance so that howling detection is performed at an appropriate time. Optionally, to detect howling promptly and effectively in real time, the howling detection event can be triggered immediately after the voice call group in the preset application is successfully established. Optionally, to make howling detection more targeted and to save the extra power that detection consumes, scenarios in which howling is likely can be identified through theoretical analysis or investigation and configured as preset scenarios; the howling detection event is then triggered when the mobile terminal is detected to be in a preset scenario.
Step 102, obtain downlink voice call data of a preset duration in the mobile terminal, and divide the downlink voice call data into blocks to obtain M data blocks.
By way of example, downlink voice call data may be data that the server corresponding to the preset application sends to the mobile terminal after receiving the voice data of the other mobile terminals in the voice call group and processing it, for example by mixing, or data that the server forwards directly to the mobile terminal; the present application places no limit on how the server processes voice call data. In the related art, after a mobile terminal receives downlink voice call data from the server, it plays the data through the loudspeaker without performing howling detection. In the present application, after detecting that the howling detection event has been triggered, the mobile terminal does not play the downlink voice call data directly but first analyzes it to determine whether it contains howling.
In the embodiments of the present application, the preset duration can be determined from factors such as the configuration of the mobile terminal, its data processing capability and the timeliness requirements of the voice call; the embodiments place no limit on it. For example, it can be any duration between 1 and 2 seconds. The downlink voice call data can be divided into blocks of a preset unit length, for example 40 milliseconds. Assuming a preset duration of 1.2 seconds and a preset unit length of 40 milliseconds, the data is divided into 30 data blocks, that is, M = 30.
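The block division in this example can be illustrated with a short Python sketch. It is a minimal illustration only: the function name `split_into_blocks` is hypothetical, the 16 kHz sample rate is borrowed from the audio format mentioned later in the description, and dropping a trailing partial block is an assumption the text does not address.

```python
def split_into_blocks(samples, sample_rate_hz, block_ms):
    """Split a list of PCM samples into consecutive blocks of block_ms milliseconds."""
    block_len = sample_rate_hz * block_ms // 1000          # samples per block
    usable = len(samples) - len(samples) % block_len       # assumption: drop a partial tail
    return [samples[i:i + block_len] for i in range(0, usable, block_len)]


# 1.2 s of silent 16 kHz audio, divided into 40 ms blocks.
SAMPLE_RATE_HZ = 16000
samples = [0] * (SAMPLE_RATE_HZ * 1200 // 1000)
blocks = split_into_blocks(samples, SAMPLE_RATE_HZ, 40)
print(len(blocks), len(blocks[0]))
```

With these values the data is split into M = 30 blocks of 640 samples each.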
Step 103, analyze the data blocks one at a time using a preset analysis mode to determine whether the current data block contains a suspected howling point, and take the first data block in which a suspected howling point appears as the starting data block.
The present application does not specifically limit the preset analysis mode. For example, the preset analysis mode comprises: in the frequency domain, obtaining frequency points to be determined in the high-frequency region whose energy value is higher than a preset energy threshold; calculating the energy difference value of a preset number of frequency points around the frequency point to be determined; and, when the energy difference value is greater than a preset difference threshold, determining that the frequency point to be determined is a suspected howling point. The high-frequency region is the frequency range above a preset frequency threshold.
By way of example, the current data block can first be transformed from the time domain to the frequency domain to make spectrum analysis easier. The embodiments of the present application place no limit on the transform; a Fourier transform can be used, for example the fast algorithm for the discrete Fourier transform (Fast Fourier Transformation, FFT). Taking 40 ms as an example, 40 ms of audio data (16-bit, 16 kHz sample rate) occupies 40 × 16 × 16 / 8 = 1280 bytes, which suits a 1024-point FFT for spectrum analysis. After FFT processing, the frequency range available for analysis is 0 to 16K/2, with a step of (16K/2)/1024, about 8 Hz.
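The figures in this example can be checked with simple arithmetic, sketched below in Python. One point is worth hedging: under the usual DFT convention an N-point transform at sample rate fs has bins spaced fs/N apart, so a 1024-point transform at 16 kHz gives about 15.6 Hz per bin, and a step of about 8 Hz (1024 bins between 0 and 8 kHz) corresponds to a 2048-point transform of the zero-padded block. The constant and function names are illustrative only.

```python
SAMPLE_RATE_HZ = 16_000
BITS_PER_SAMPLE = 16
BLOCK_MS = 40

samples_per_block = SAMPLE_RATE_HZ * BLOCK_MS // 1000        # samples in one 40 ms block
bytes_per_block = samples_per_block * BITS_PER_SAMPLE // 8   # 16-bit samples are 2 bytes each
nyquist_hz = SAMPLE_RATE_HZ / 2                              # highest analysable frequency


def bin_spacing_hz(fft_size):
    """Spacing between adjacent DFT bins for a transform of fft_size points."""
    return SAMPLE_RATE_HZ / fft_size


spacing_1024 = bin_spacing_hz(1024)   # 1024-point transform
spacing_2048 = bin_spacing_hz(2048)   # 2048-point transform, 1024 bins below Nyquist
print(samples_per_block, bytes_per_block, nyquist_hz, spacing_1024, spacing_2048)
```

A 40 ms block therefore holds 640 samples (1280 bytes), and the roughly 8 Hz step quoted above matches `spacing_2048`.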
In the embodiment of the present application, high-frequency region and other regions can be divided using predeterminated frequency threshold value as cut off value.In advance If frequency threshold can be configured according to the actual situation, such as can according to voice frequency and be easy to appear the frequency feature of howling into Row setting, such as can be 1KHz, 1.5KHz or 2KHz etc..Such as predeterminated frequency threshold value is 2KHz, that is, is greater than the portion of 2KHz It is divided into high-frequency region.The frequency of general howling appears in high-frequency region, and sound is larger (i.e. energy value is higher), the application Embodiment can quickly determine doubtful uttering long and high-pitched sounds a little in a data block according to energy value characteristic distributions.
Illustratively, the energy value corresponding to each frequency point (frequency bin) in the data block is obtained; frequency points to be determined, whose energy value is higher than a preset energy threshold, are then found in the high-frequency region; and the energy difference value of a preset number of frequency points around each frequency point to be determined is calculated. The preset energy threshold and the preset number may be set according to actual needs; for example, the preset energy threshold may be -10 dB and the preset number may be 8 (4 before the frequency point to be determined and 4 after it). Taking the step size of about 8 Hz mentioned above as an example, if the frequency of the frequency point to be determined is 3362 Hz, the frequencies of the surrounding preset number of frequency points are approximately 3330 Hz, 3338 Hz, 3346 Hz, 3354 Hz, 3370 Hz, 3378 Hz, 3386 Hz and 3394 Hz. The energy difference value measures the degree of difference between the frequency point to be determined and the surrounding preset number of frequency points. Specifically, it may be the difference between the maximum energy value and the minimum energy value, or it may be the energy variance or the mean square deviation of the energy, which the present application does not limit. The preset difference threshold corresponds to the energy difference value; for example, when the energy difference value is an energy variance, the preset difference threshold is a preset variance threshold. When the energy difference value is greater than the preset difference threshold, the frequency point to be determined stands out from its neighbors and is very likely a howling point; it is therefore determined to be a suspected howling point. This arrangement identifies suspected howling points quickly and accurately and lays the foundation for improving howling detection efficiency.
Illustratively, a data block may contain multiple frequency points to be determined; the present application may carry out the suspected howling point judgment starting from the frequency point to be determined that has the highest energy.
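The preset analysis mode described above can be illustrated with a minimal Python sketch. It assumes the spectrum of one data block is already available as two parallel lists (frequencies in Hz and energy values in dB) and uses the maximum-minus-minimum form of the energy difference value. The function name, the parameter names and the 15 dB difference threshold are hypothetical choices made only for this illustration; the source gives no value for the difference threshold.

```python
def find_suspected_howling_point(freqs_hz, energies_db,
                                 freq_threshold_hz=2000.0,
                                 energy_threshold_db=-10.0,
                                 neighbors_each_side=4,
                                 diff_threshold_db=15.0):
    """Return the frequency of a suspected howling point, or None.

    diff_threshold_db is an assumed value chosen for illustration only.
    """
    # Frequency points to be determined: in the high-frequency region and
    # above the preset energy threshold.
    candidates = [i for i, (f, e) in enumerate(zip(freqs_hz, energies_db))
                  if f > freq_threshold_hz and e > energy_threshold_db]
    # Start the judgment from the candidate with the highest energy.
    candidates.sort(key=lambda i: energies_db[i], reverse=True)
    for i in candidates:
        lo = max(0, i - neighbors_each_side)
        hi = min(len(freqs_hz), i + neighbors_each_side + 1)
        window = energies_db[lo:hi]  # the candidate plus its neighbors
        # Energy difference value: maximum minus minimum energy.
        if max(window) - min(window) > diff_threshold_db:
            return freqs_hz[i]
    return None
```

With bins spaced 8 Hz apart, a single loud bin standing out from a quiet neighborhood is reported, while a uniformly loud spectrum is not, because no bin is prominent relative to its neighbors.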
Illustratively, the first data block is analyzed using the above preset analysis mode to determine whether a suspected howling point exists. If one exists, the suspected howling point has appeared for the first time, and the first data block is determined to be the initial data block. If none exists, the data block following the current data block is taken as the new current data block and is analyzed in the same way. This continues until the data block in which a suspected howling point first appears is determined to be the initial data block. If none of the M data blocks contains a suspected howling point, the current downlink voice call data may be considered to contain no howling sound.
Step 104, starting from the initial data block, successively taking n data blocks as the data segment to be analyzed, analyzing the suspected howling point contained in the current data segment using the preset analysis mode, and, when the frequency differences between the suspected howling points contained in the N data segments are within a preset range, determining that howling sound exists in the downlink voice call data.
Here, n = 2, 3, ..., N; N is less than or equal to M and greater than or equal to 2; the starting point of each data segment is the same as the starting point of the initial data block, and the initial data block is the first data segment. Taking the above M = 30 as an example, 2 ≤ N ≤ 30. In spectrum analysis, the length of the data being analyzed affects the result: when there are few data points, the precision may be poor. Analyzing again with longer data therefore acts as a correction and allows howling to be determined more accurately. The present application does not limit the specific value of N. Assume N = 4 and a data block length of 40 ms. The time range of the initial data block can then be written as 0 to 40 ms. Because the initial data block has already been analyzed and serves as the first data segment, analysis continues from n = 2 with the second data segment, whose time range can be written as 0 to 80 ms. By analogy, the time range of the third data segment can be written as 0 to 120 ms, and the time range of the fourth data segment can be written as 0 to 160 ms.
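The growing data segments in this example can be enumerated with a short Python sketch. The function name and its parameters are hypothetical; the sketch only reproduces the arithmetic of the N = 4, 40 ms example, with time measured from the start of the initial data block.

```python
def segment_time_ranges(block_ms, n_segments, start_ms=0):
    """List the (start, end) time range in ms of data segments 1..N.

    Every segment starts where the initial data block starts; segment n
    covers n consecutive data blocks.
    """
    return [(start_ms, start_ms + n * block_ms)
            for n in range(1, n_segments + 1)]


print(segment_time_ranges(block_ms=40, n_segments=4))
```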
Illustratively, the preset range may be set according to actual conditions and may be, for example, 40 Hz (equivalent to 5 step sizes in the example above). Assume the frequencies of the suspected howling points found in the 4 data segments are A, B, C and D respectively. If the differences among A, B, C and D are all within 40 Hz, it can be determined that howling sound exists in the downlink voice call data.
Optionally, if the frequency difference between the suspected howling point contained in the current data segment and a suspected howling point contained in an earlier data segment is not within the preset range, downlink voice call data of the preset time length is acquired starting from the data block following the current data segment, and the operations of dividing the downlink voice call data into blocks and so on are repeated. The advantage of this arrangement is that, when the suspected howling points contained in any two data segments are far apart in frequency, the earlier suspected howling point may not be a real howling point and detection needs to continue; there is no need to perform suspected howling detection on the subsequent data segments, which saves power and improves the efficiency and accuracy of howling detection. For example, when the difference between C and A, or between C and B, exceeds 40 Hz, downlink voice call data of the preset time length is reacquired in the mobile terminal starting from 120 ms and divided into M data blocks, a new initial data block is determined, and the above procedure is used again to determine whether howling sound exists in the downlink voice call data.
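The search for the initial data block, the growing data segments of step 104 and the optional early exit can be put together in a minimal Python sketch. The `analyze` callable stands in for the preset analysis mode and is assumed to take a list of data blocks and return a suspected howling frequency or `None`; that interface, the function name and the returned restart index are assumptions made for illustration. Treating a segment with no suspected howling point as a failed check is also an assumption, since the source does not cover that case.

```python
def detect_howling(blocks, analyze, n_segments=4, freq_range_hz=40.0):
    """Return (howling_found, next_block_index).

    next_block_index is where a fresh round of detection would start.
    """
    # Find the initial data block: the first block with a suspected point.
    start, first_freq = None, None
    for i, block in enumerate(blocks):
        first_freq = analyze([block])
        if first_freq is not None:
            start = i
            break
    if start is None:
        return False, len(blocks)  # no suspected howling point in M blocks

    found = [first_freq]  # the initial data block is the first data segment
    for n in range(2, n_segments + 1):
        end = start + n
        if end > len(blocks):
            return False, len(blocks)  # not enough data left (assumption)
        freq = analyze(blocks[start:end])
        # Early exit: the new point must agree with every earlier point.
        if freq is None or any(abs(freq - f) > freq_range_hz for f in found):
            return False, end
        found.append(freq)
    return True, start + n_segments
```

In the example from the text (40 ms blocks, N = 4, initial data block at 0 ms), a mismatch at the third data segment returns index 3, which corresponds to restarting from 120 ms.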
In the voice communication data detection method provided in the embodiments of the present application, after a voice call group in a preset application is successfully established and it is detected that a howling detection event has been triggered, downlink voice call data of a preset time length is acquired in the mobile terminal and divided into blocks. The preset analysis mode is used to analyze, block by block, whether a suspected howling point exists in the current data block, and the data block in which a suspected howling point first appears is determined to be the initial data block. Starting from the initial data block, n data blocks are successively taken as the data segment to be analyzed, and the suspected howling point contained in the current data segment is analyzed using the preset analysis mode. When the frequency differences between the suspected howling points contained in the N data segments are within the preset range, it is determined that howling sound exists in the downlink voice call data. With this technical solution, howling detection can be carried out accurately on the downlink voice call data after the voice call group of the preset application in the mobile terminal has been successfully established, so that appropriate measures can be taken afterwards and the inconvenience that howling causes to the user can be reduced.
In some embodiments, after it is determined that howling sound exists in the downlink voice call data, the method further includes: determining the suspected howling points to be howling points; and performing howling suppression processing on the downlink voice call data according to the howling points. Once it is determined that howling sound exists in the downlink voice call data, the previously identified suspected howling points that satisfy the howling decision condition are real howling points. Howling suppression processing then needs to be performed on the downlink voice according to the howling points, to prevent the howling sound from being played through the loudspeaker or earpiece and affecting the user. Further, after the howling suppression processing, the processed downlink voice call data is played through the loudspeaker or earpiece.
In some embodiments, performing howling suppression processing on the downlink voice call data according to the howling points includes: selecting the frequencies of a preset number of howling points with the highest energy values as target frequencies, and attenuating the audio signal corresponding to the target frequencies in the downlink voice call data. The preset number may be set freely, for example 1, 3 or more, and may also be determined dynamically according to the number of howling points. The howling points may be sorted by energy value from high to low, the preset number of howling points at the front of the ranking selected, and the frequencies of the selected howling points determined to be the target frequencies. The higher the energy value, the louder the howling and the greater its impact on the user. The advantage of this arrangement is that howling suppression can be targeted at the frequencies with the highest energy values, which improves suppression efficiency and preserves the timeliness of the voice call.
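The selection of target frequencies can be sketched in a few lines of Python. Howling points are assumed here to be `(frequency_hz, energy_db)` pairs; that representation and the function name are hypothetical.

```python
def choose_target_frequencies(howling_points, preset_count=3):
    """Pick the frequencies of the preset_count highest-energy howling points.

    howling_points: iterable of (frequency_hz, energy_db) pairs.
    """
    ranked = sorted(howling_points, key=lambda point: point[1], reverse=True)
    return [freq for freq, _energy in ranked[:preset_count]]


points = [(3360.0, -5.0), (2800.0, -2.0), (5000.0, -8.0), (4100.0, -3.0)]
print(choose_target_frequencies(points, preset_count=2))
```

If fewer howling points exist than the preset number, all of them are returned.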
In some embodiments, performing howling suppression processing on the downlink voice call data according to the howling points may also include: attenuating the audio signal corresponding to the frequencies of all the howling points in the downlink voice call data. The advantage of this arrangement is that all howling points are suppressed comprehensively, which prevents the howling sound from being played.
Illustratively, a notch filter may be used to attenuate the audio signal corresponding to the frequency of the howling point to be suppressed (that is, the target frequency). A notch filter sharply attenuates the input signal at a particular frequency, thereby blocking signals of that frequency from passing. The present application does not limit the type or the specific parameter values of the notch filter. In general, the target frequency is used as the center frequency of the notch filter, and parameters such as the processing bandwidth and gain of the notch filter may be set according to actual needs.
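One common way to realize such a filter is a second-order (biquad) notch. The Python sketch below uses the widely published audio "cookbook" biquad notch formulas; the source does not prescribe a filter type, so the design, the 16 kHz sample rate in the usage lines and the quality factor `q` are assumptions chosen only for illustration.

```python
import math


def notch_coefficients(center_hz, sample_rate_hz, q=30.0):
    """Biquad notch coefficients (b, a), normalized so that a[0] == 1."""
    w0 = 2.0 * math.pi * center_hz / sample_rate_hz
    alpha = math.sin(w0) / (2.0 * q)  # larger q gives a narrower notch
    a0 = 1.0 + alpha
    b = [1.0 / a0, -2.0 * math.cos(w0) / a0, 1.0 / a0]
    a = [1.0, -2.0 * math.cos(w0) / a0, (1.0 - alpha) / a0]
    return b, a


def apply_biquad(samples, b, a):
    """Filter samples with the direct-form I difference equation."""
    x1 = x2 = y1 = y2 = 0.0
    out = []
    for x in samples:
        y = b[0] * x + b[1] * x1 + b[2] * x2 - a[1] * y1 - a[2] * y2
        x2, x1 = x1, x
        y2, y1 = y1, y
        out.append(y)
    return out


# Usage: suppress a 3360 Hz target frequency in audio sampled at 16 kHz.
b, a = notch_coefficients(3360.0, 16000)
tone = [math.sin(2.0 * math.pi * 3360.0 * n / 16000) for n in range(8000)]
filtered = apply_biquad(tone, b, a)
```

After the start-up transient, a tone at the center frequency is almost completely removed, while frequencies well away from the notch pass nearly unchanged. To suppress several target frequencies, one such filter per frequency can be applied in series.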
In some embodiments, after the suspected howling points are determined to be howling points, the method may further include: setting a suppression flag for each howling point. After howling suppression processing is performed on the downlink voice call data according to the howling points, the method further includes: continuing to acquire downlink voice call data of the preset time length; when it is determined that the new downlink voice call data contains a suspected howling point, judging whether the suspected howling point has a suppression flag set; and, if it does, performing howling suppression processing on the new downlink voice call data according to the flagged suspected howling point. The advantage of this arrangement is as follows: when suspected howling points continue to appear after a segment of downlink voice call data that contained howling sound, a suspected howling point that already occurred in the previous segment is very likely a howling point. The howling point judgment can therefore be skipped and suppression carried out directly, which saves the judgment steps, saves power and improves the timeliness of the voice call. Optionally, if the flag is not set, whether the point is a howling point continues to be judged in the manner of the above embodiment (that is, step 104). Optionally, after the suppression flag is set for a howling point, the method further includes: updating a howling index according to the howling point for which the suppression flag has been set. The benefit is that the time at which the howling point occurs is recorded promptly, which makes it easy to later determine the time difference between a suspected howling point and an existing flagged howling point, and thus to judge more accurately whether the suspected howling point is a howling point. In addition, after step 104 determines that a suspected howling point is a howling point, a suppression flag may also be set for the new howling point and the howling index updated.
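The suppression flag and howling index logic can be sketched as follows. The source does not say how a new suspected howling point is matched against a flagged one, nor what the howling index stores; this sketch assumes a dictionary mapping each flagged frequency to the time it was last seen, and assumes a match when the frequencies differ by at most a tolerance. All names and the 40 Hz tolerance are hypothetical.

```python
def should_suppress(freq_hz, now_s, howling_index, confirm, tolerance_hz=40.0):
    """Decide whether a suspected howling point should be suppressed.

    howling_index: dict {flagged_frequency_hz: last_seen_time_s} (assumed layout).
    confirm: callable running the full judgment (step 104) for one frequency.
    """
    # A point that already carries a suppression flag skips the full judgment.
    for flagged_hz in howling_index:
        if abs(freq_hz - flagged_hz) <= tolerance_hz:
            howling_index[flagged_hz] = now_s  # record the latest occurrence
            return True
    # Otherwise fall back to the full judgment and flag newly confirmed points.
    if confirm(freq_hz):
        howling_index[freq_hz] = now_s
        return True
    return False
```

Keeping the time of the last occurrence in the index would also allow stale flags to be expired, although the source does not describe such a policy.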
In some embodiments, detecting that the howling detection event is triggered includes: judging whether the voice call group contains a target mobile terminal whose distance from the mobile terminal is less than a preset distance value, and, if so, determining that the howling detection event is triggered. In multi-party voice scenarios, the inventors found that howling occurs easily when two mobile terminals are close to each other. Assume mobile terminal A and mobile terminal B in the voice call group are close together. The loudspeaker of mobile terminal A amplifies and plays the sound captured by the microphone of mobile terminal B. Because the two terminals are close, this sound is captured again by the microphone of mobile terminal B and sent to mobile terminal A, where it is amplified and played again. A positive feedback loop forms easily, producing howling sound. Therefore, in the embodiments of the present application, it may first be judged whether any other mobile terminal in the voice call is close to the current mobile terminal; if so, the howling detection event is triggered, and it is then detected that the howling detection event has been triggered. The preset distance value may be, for example, 20 meters or 10 meters, and may be set according to actual needs.
In the embodiments of the present application, there may be many specific ways of judging whether the voice call group contains a target mobile terminal whose distance from the mobile terminal is less than the preset distance value, and the present application does not limit them. Several ways are given below as illustrative examples.
1. Playing a preset sound clip in a preset manner, and receiving feedback information from the other mobile terminals in the voice call group, the feedback information including the result of the other mobile terminals attempting to capture the sound signal corresponding to the preset sound clip; and judging, according to the feedback information, whether the voice call group contains a target mobile terminal whose distance from the mobile terminal is less than the preset distance value.
The advantage of this arrangement is that whether a target mobile terminal exists can be judged quickly and accurately, and it can then be quickly determined whether the howling detection event needs to be triggered. Illustratively, a pre-recorded or pre-obtained sound clip may be played through the loudspeaker at a preset volume; or an ultrasonic clip of preset frequency and preset intensity may be played through an ultrasonic transmitter. Correspondingly, the other mobile terminals may capture the sound signal corresponding to the preset sound clip through a microphone or an ultrasonic receiver. The preset volume, or the preset frequency and preset intensity, may be set according to the preset distance value. The result included in the feedback information may indicate whether the other mobile terminal was able to capture the sound signal. When another mobile terminal can capture the sound signal corresponding to the preset sound clip, the distance between the two mobile terminals is less than the preset distance value. The feedback information may be forwarded by the server corresponding to the preset application. In addition, the feedback information may also include attribute information of the captured sound signal, such as sound intensity. The intensity of the sound played by the mobile terminal is known, and sound attenuates as it propagates, with greater attenuation at greater distances. The distance between the other mobile terminal and the current mobile terminal can therefore be determined from information such as the intensity of the sound signal in the feedback information, and it can then be judged whether this distance is less than the preset distance value.
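The source does not state how distance is derived from sound intensity. As one possible illustration, the Python sketch below assumes ideal free-field propagation, in which the sound level falls by about 6 dB for each doubling of distance, and assumes that the played level is known at a reference distance of 1 meter. The function names and parameters are hypothetical, and a real room with reflections and obstacles would deviate from this model.

```python
def estimate_distance_m(played_level_db, received_level_db,
                        reference_distance_m=1.0):
    """Estimate distance from level drop, assuming free-field attenuation.

    played_level_db is the level at reference_distance_m from the loudspeaker.
    """
    drop_db = played_level_db - received_level_db
    return reference_distance_m * 10.0 ** (drop_db / 20.0)


def is_target_terminal(played_level_db, received_level_db,
                       preset_distance_m=20.0):
    """True if the estimated distance is below the preset distance value."""
    return estimate_distance_m(played_level_db, received_level_db) < preset_distance_m
```

For example, a clip played at 80 dB (at 1 meter) and received at 60 dB corresponds to an estimated distance of 10 meters under these assumptions.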
2. Obtaining first location information of the mobile terminal and second location information of the other mobile terminals in the voice call group; and judging, according to the first location information and the second location information, whether the voice call group contains a target mobile terminal whose distance from the mobile terminal is less than the preset distance value.
The advantage of this arrangement is that mobile terminals generally have a positioning function, so location information can be used to judge quickly and accurately whether a target mobile terminal exists, and it can then be quickly determined whether the howling detection event needs to be triggered. Illustratively, the mobile terminal may obtain location information through positioning systems such as the Global Positioning System (GPS) or BeiDou, or through base station positioning or network positioning. The location information may include latitude and longitude coordinates and the like. The second location information of the other mobile terminals in the voice call group may be forwarded to the current mobile terminal by the server corresponding to the preset application. The current mobile terminal compares its own first location information, one by one, with the at least one piece of second location information forwarded by the server, and judges whether the distance between any second location and the first location is less than the preset distance value.
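When the location information consists of latitude and longitude coordinates, the one-by-one comparison can be sketched in Python with the haversine great-circle formula. The source does not specify a distance formula, so the choice of haversine, the mean Earth radius constant and the function names are assumptions. Positioning error is ignored here, and in practice it can be comparable to a 10 or 20 meter preset distance value.

```python
import math

EARTH_RADIUS_M = 6371000.0  # mean Earth radius, an approximation


def haversine_m(lat1, lon1, lat2, lon2):
    """Great-circle distance in meters between two points given in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    h = (math.sin(dphi / 2.0) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2.0) ** 2)
    return 2.0 * EARTH_RADIUS_M * math.asin(math.sqrt(h))


def has_target_terminal(first_location, second_locations,
                        preset_distance_m=20.0):
    """Compare the first location with each second location in turn."""
    lat, lon = first_location
    return any(haversine_m(lat, lon, other_lat, other_lon) < preset_distance_m
               for other_lat, other_lon in second_locations)
```

Two points that differ by 0.0001 degrees of latitude are roughly 11 meters apart, so such a pair counts as close with a 20 meter preset distance value but not with a 10 meter one.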
3. Obtaining first WiFi information of the WiFi connection of the mobile terminal and second WiFi information of the WiFi connections of the other mobile terminals in the voice call group; and judging, according to the first WiFi information and the second WiFi information, whether the voice call group contains a target mobile terminal whose distance from the mobile terminal is less than the preset distance value.
The advantage of this arrangement is that users, in order to save mobile data charges, generally make voice calls over a WiFi hotspot connection; this can be exploited to judge quickly and accurately whether a target mobile terminal exists, and then to quickly determine whether the howling detection event needs to be triggered. Illustratively, the WiFi information may include attribute information of the WiFi hotspot, such as the hotspot name or the Media Access Control (MAC) address of the hotspot, and may also include the WiFi signal strength and the like. In general, the effective signal range of a WiFi hotspot is limited, typically a radius of about 50 meters. If the preset distance value is greater than the effective signal range of the WiFi hotspot, whether the voice call group contains a target mobile terminal whose distance from the mobile terminal is less than the preset distance value can be determined by checking whether the hotspot attribute information of any second WiFi information is the same as that of the first WiFi information. If the hotspot attribute information of any second WiFi information is the same as that of the first WiFi information, it is determined that a target mobile terminal exists in the voice call group. In other words, when another mobile terminal in the voice call group is connected to the same WiFi hotspot as the current mobile terminal, that mobile terminal is considered a target mobile terminal. In addition, if the preset distance value is smaller than the effective signal range of the WiFi hotspot, for example 10 meters, the distance of each mobile terminal connected to the same WiFi hotspot from the hotspot can further be estimated from the WiFi signal strength, the distance between the two mobile terminals can then be determined, and it can be judged whether that distance is less than the preset distance value.
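The same-hotspot check can be sketched in Python as a comparison of hotspot MAC addresses. The data layout (a dictionary from a terminal identifier to the MAC address of its connected hotspot) and the function name are hypothetical. The signal-strength refinement for small preset distance values is not shown, because the source does not give a model for converting signal strength to distance.

```python
def find_target_terminals(first_hotspot_mac, second_hotspot_macs):
    """Return the ids of terminals connected to the same WiFi hotspot.

    second_hotspot_macs: dict {terminal_id: hotspot_mac_address} (assumed layout).
    """
    own = first_hotspot_mac.strip().lower()  # MAC text may differ in case
    return [terminal_id
            for terminal_id, mac in second_hotspot_macs.items()
            if mac.strip().lower() == own]


others = {"b": "AA:BB:CC:DD:EE:01", "c": "aa:bb:cc:dd:ee:02"}
print(find_target_terminals("aa:bb:cc:dd:ee:01", others))
```

A non-empty result means the voice call group contains at least one target mobile terminal, under the condition stated above that the preset distance value exceeds the effective signal range of the hotspot.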
4. Obtaining first sound data captured by the microphone, and obtaining the downlink voice call data in the mobile terminal, where the first sound data does not contain the sound played by the loudspeaker of the mobile terminal; and judging, according to whether the first sound data and the downlink voice call data contain the voice of the same person, whether the voice call group contains a target mobile terminal whose distance from the mobile terminal is less than the preset distance value.
The advantage of this arrangement is that whether a target mobile terminal exists can be judged quickly and accurately without relying on other information (such as the location information or WiFi information above), and it can then be quickly determined whether the howling detection event needs to be triggered. Illustratively, the first sound data can be kept free of the sound played by the loudspeaker of the mobile terminal in either of the following ways: the loudspeaker of the mobile terminal is turned off while the first sound data and the downlink voice call data are being obtained; or the loudspeaker of the mobile terminal is turned on while the first sound data and the downlink voice call data are being obtained, and the first sound data is the sound data that remains after the sound played by the loudspeaker has been filtered out of all the sound data captured by the microphone. Consider two users holding mobile terminals close to each other: user A uses mobile terminal A and user B uses mobile terminal B. User A's speech is captured by the microphone of mobile terminal A and sent to mobile terminal B, so the downlink voice call data of mobile terminal B contains user A's speech. Because user A and user B are close together, user A's speech is also captured by the microphone of mobile terminal B. For mobile terminal B, therefore, both the first sound data captured by its microphone and the downlink voice call data it obtains contain the voice of the same person (user A). It is thus determined that the distance between mobile terminal A and mobile terminal B in the voice call group is less than the preset distance value; that is, for mobile terminal B, mobile terminal A is a target mobile terminal.
It can be understood that any one of the above ways, or a combination of several of them, may be chosen according to actual conditions to judge whether a target mobile terminal exists, and the embodiments of the present application do not limit this. Moreover, the steps for judging whether a target mobile terminal exists may also be completed by the server corresponding to the preset application. When the server determines that a target mobile terminal exists, it sends the judgment result to the mobile terminal, and the judgment result instructs the mobile terminal to trigger the howling detection event. Correspondingly, the method of the embodiments of the present application further includes receiving the judgment result sent by the server corresponding to the preset application, and triggering the howling detection event when the judgment result indicates that the voice call group contains a target mobile terminal whose distance from the mobile terminal is less than the preset distance value. For the server's specific judgment process, reference may be made to the judgment ways provided above, which the embodiments of the present application do not repeat here.
In the embodiments of the present application, when two mobile terminals in the voice call group are close together and howling occurs, howling is not avoided by muting the loudspeaker; instead, howling suppression processing is performed on the downlink voice call data. This is determined by the particular application scenario addressed by the embodiments of the present application. Assume the voice call group has 3 members a, b and c, and that members a and b are close together. If the loudspeaker of b's mobile terminal were turned off, a's speech would no longer be played on b's mobile terminal, but c's speech would not be played on b's mobile terminal either. b could not hear c, and the voice call group would lose its purpose. Given the requirements of this particular application scenario, the inventors therefore chose to perform howling suppression processing on the downlink voice call data to solve the howling problem.
In some embodiments, after it is determined that howling sound exists in the downlink voice call data, the method further includes: obtaining the sound data captured by the mobile terminal; separating the human voice from the background sound in the sound data; weakening the separated background sound; and mixing the weakened background sound with the separated human voice and sending the result, as uplink voice call data, to the server corresponding to the preset application. The advantage of this arrangement is that howling caused by background sound can be effectively reduced. Illustratively, when the mobile terminal has a microphone array (two or more microphones), the sound source position can be determined, and sound originating far from the mobile terminal (for example, more than 1 meter away) can be selected as background sound according to the sound source position. Alternatively, the voiceprint information of the mobile terminal's user may be obtained in advance, the user's speech extracted from the sound data according to the voiceprint information as the human voice, and the remaining sound treated as background sound. Illustratively, weakening the separated background sound may consist of lowering its volume by adjusting the gain, or of filtering the background sound out. After weakening, the volume of the background sound is reduced, the conditions for the sound to build up are broken, and howling caused by background sound is effectively reduced.
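The weakening and mixing step can be sketched in Python, assuming the human voice and the background sound have already been separated into two equally long lists of samples in the range -1.0 to 1.0. The separation itself is not shown. The function name, the -20 dB background gain and the clamping of the mixed samples are assumptions made for illustration.

```python
def weaken_and_mix(voice, background, background_gain_db=-20.0):
    """Attenuate the background by a gain in dB and mix it with the voice."""
    gain = 10.0 ** (background_gain_db / 20.0)  # dB to linear amplitude
    mixed = []
    for v, b in zip(voice, background):
        sample = v + gain * b
        mixed.append(max(-1.0, min(1.0, sample)))  # keep within sample range
    return mixed
```

A gain of -20 dB scales the background amplitude by 0.1. Passing a very large negative gain approximates filtering the background out entirely.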
Fig. 2 is a flow diagram of another voice communication data detection method provided by the embodiments of the present application. Taking an online game application as the preset application, the method includes the following steps:
Step 201, detecting that a voice call group in a preset game application is successfully established.
Illustratively, take a team battle game such as Honor of Kings as an example. Each team has 5 players, and the red and blue teams fight each other. The 5 players of each team need to communicate and discuss battle strategy, so many players choose to turn on the in-team voice call function. When a player requests to turn on the in-team voice call function, the voice call group is successfully established. From then on, any one of the 5 players on the same team can hear the speech of the other 4 players. In general, players set the mobile terminal to speaker mode to make playing more convenient.
Step 202, judging whether the voice call group contains a target mobile terminal whose distance from the mobile terminal is less than the preset distance value; if so, executing step 203; otherwise, repeating step 202.
If the mobile terminals of two of the 5 players are close together, for example two friends playing together at home, and both mobile terminals are set to speaker mode, howling is very likely to occur. Therefore, in the embodiments of the present application, it may first be judged whether the voice call group contains other mobile terminals close to the current mobile terminal; if so, howling detection needs to be carried out.
Optionally, in the embodiments of the present application, any one of the ways described above, or a combination of several of them, may be used to judge whether a target mobile terminal exists, and the embodiments of the present application do not limit this.
Step 203, obtaining downlink voice call data of a preset time length in the mobile terminal.
Illustratively, the downlink voice call data contains the sound captured by the microphones of the mobile terminals of the other 4 teammates. This sound generally includes not only the speech of the 4 teammates but also the sound played by the loudspeakers of their mobile terminals and other ambient sounds. In general, the game server collects the uplink voice call data uploaded by the other 4 mobile terminals and sends it to the current mobile terminal.
Step 204 carries out piecemeal processing to the downlink voice communicating data, obtains M data block.
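As an illustration of step 204, the following minimal sketch splits a buffer of samples into M consecutive blocks. The function name `split_into_blocks`, the fixed block size, and the choice to use non-overlapping blocks and drop a trailing partial block are assumptions made for this example; the text does not specify them.

```python
def split_into_blocks(samples, block_size):
    """Split samples into consecutive, non-overlapping blocks.

    Assumption for this sketch: a trailing partial block is dropped.
    """
    count = len(samples) // block_size  # this is M
    return [samples[i * block_size:(i + 1) * block_size] for i in range(count)]


if __name__ == "__main__":
    demo_blocks = split_into_blocks(list(range(10)), 4)
    print(len(demo_blocks), demo_blocks)
```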
Step 205: Using a preset analysis mode, analyze in turn whether a suspected howling point exists in the current data block, and determine the first data block in which a suspected howling point appears as the starting data block.
The preset analysis mode includes: in the frequency domain, obtaining a frequency point to be determined in the high-frequency region whose energy value is higher than a preset energy threshold; calculating an energy difference value with respect to a preset number of frequency points around the frequency point to be determined; and, when the energy difference value is greater than a preset difference threshold, determining that the frequency point to be determined is a suspected howling point. The high-frequency region is the frequency range above a preset frequency threshold.
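The preset analysis mode can be sketched roughly as follows. This is only an illustration under stated assumptions: the energy difference is taken to be the candidate bin's power minus the mean power of its neighbouring bins, a naive DFT stands in for whatever transform a real implementation would use, and all names and threshold values are hypothetical.

```python
import cmath
import math


def power_spectrum(block):
    """Naive DFT power for bins 0..n/2; adequate for short demonstration blocks."""
    n = len(block)
    powers = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(block))
        powers.append(abs(acc) ** 2 / n)
    return powers


def suspected_howling_points(block, sample_rate, freq_threshold_hz,
                             energy_threshold, neighbor_count, diff_threshold):
    """Return (frequency_hz, energy) pairs for suspected howling points."""
    powers = power_spectrum(block)
    n = len(block)
    points = []
    for k, energy in enumerate(powers):
        freq = k * sample_rate / n
        # Only bins in the high-frequency region with enough energy are candidates.
        if freq <= freq_threshold_hz or energy <= energy_threshold:
            continue
        lo = max(0, k - neighbor_count)
        hi = min(len(powers), k + neighbor_count + 1)
        neighbors = [powers[j] for j in range(lo, hi) if j != k]
        if not neighbors:
            continue
        # Assumed definition: candidate energy minus mean neighbour energy.
        diff = energy - sum(neighbors) / len(neighbors)
        if diff > diff_threshold:
            points.append((freq, energy))
    return points


if __name__ == "__main__":
    rate = 8000
    # A 3000 Hz tone (above the threshold) plus a 500 Hz tone (below it).
    demo = [math.sin(2 * math.pi * 3000 * t / rate) + math.sin(2 * math.pi * 500 * t / rate)
            for t in range(64)]
    print(suspected_howling_points(demo, rate, 2000.0, 1.0, 2, 1.0))
```

A narrow, isolated spectral peak is what distinguishes howling from speech here: speech energy tends to spread over neighbouring bins, so the difference from the neighbours stays small.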
Step 206: Starting from the starting data block, take n data blocks in turn as the data segment to be analyzed, and use the preset analysis mode to analyze the suspected howling points contained in the current data segment. Judge whether the frequency difference between the suspected howling points contained in each current data segment and those contained in the preceding data segments is within a preset range. If so, execute step 207; otherwise, return to step 203.
Here, n = 2, 3, ..., N; N is less than or equal to M and greater than or equal to 2. The starting point of each data segment is the same as the starting point of the starting data block, and the starting data block is the first data segment. When step 203 is executed again, the starting point of the preset time length is the end point of the current data segment.
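The growing-segment check of step 206 might look like the sketch below. It assumes the suspected howling frequencies of each segment have already been extracted, and it interprets "within the preset range" as: a frequency found in the starting data block must reappear, within a tolerance, in every later segment. The helper names and the tolerance parameter are hypothetical.

```python
def growing_segments(blocks, start, n_max):
    """Return segments made of the first 1, 2, ..., n_max blocks from index start.

    Every segment shares the starting point of the starting data block.
    """
    segments = []
    for n in range(1, n_max + 1):
        segment = []
        for block in blocks[start:start + n]:
            segment.extend(block)
        segments.append(segment)
    return segments


def confirm_howling(freqs_per_segment, max_diff_hz):
    """Keep starting-block frequencies that persist in every later segment.

    freqs_per_segment[0] holds the suspected frequencies of the starting block;
    an empty result means howling is not confirmed.
    """
    if not freqs_per_segment:
        return []
    confirmed = list(freqs_per_segment[0])
    for freqs in freqs_per_segment[1:]:
        confirmed = [f for f in confirmed
                     if any(abs(f - g) <= max_diff_hz for g in freqs)]
        if not confirmed:
            return []
    return confirmed


if __name__ == "__main__":
    print(growing_segments([[1], [2], [3], [4]], 1, 3))
    print(confirm_howling([[3000.0, 1200.0], [3010.0], [2995.0, 800.0]], 20.0))
```

Requiring the same frequency across segments of increasing length reflects the idea that howling is sustained at a stable frequency, whereas a transient peak in speech or music drifts or disappears.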
Step 207: Determine that howling sound exists in the downlink voice call data, and determine the suspected howling points as howling points.
Step 208: Select the frequencies of a preset number of howling points with the highest corresponding energy values as target frequencies, and use a notch filter to attenuate the audio signals corresponding to the target frequencies in the downlink voice call data.
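Step 208 can be illustrated with a second-order (biquad) notch filter. The text only says that a notch filter is used; the specific coefficient formulas below follow the widely used Audio EQ Cookbook notch design, and the quality factor `q`, the function names and the demo values are assumptions for this sketch.

```python
import math


def select_target_frequencies(howling_points, count):
    """Pick the frequencies of the `count` howling points with the highest energy."""
    ranked = sorted(howling_points, key=lambda p: p[1], reverse=True)
    return [freq for freq, _energy in ranked[:count]]


def notch_filter(samples, sample_rate, center_hz, q=30.0):
    """Attenuate a narrow band around center_hz with a biquad notch (direct form I)."""
    w0 = 2.0 * math.pi * center_hz / sample_rate
    alpha = math.sin(w0) / (2.0 * q)
    cos_w0 = math.cos(w0)
    a0 = 1.0 + alpha
    b0, b1, b2 = 1.0 / a0, -2.0 * cos_w0 / a0, 1.0 / a0
    a1, a2 = -2.0 * cos_w0 / a0, (1.0 - alpha) / a0
    out = []
    x1 = x2 = y1 = y2 = 0.0
    for x in samples:
        y = b0 * x + b1 * x1 + b2 * x2 - a1 * y1 - a2 * y2
        out.append(y)
        x2, x1 = x1, x
        y2, y1 = y1, y
    return out


def rms(samples):
    """Root-mean-square level of a sample list."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))


if __name__ == "__main__":
    rate = 8000
    targets = select_target_frequencies([(3000.0, 16.0), (2500.0, 4.0), (3500.0, 9.0)], 2)
    tone = [math.sin(2 * math.pi * 3000 * t / rate) for t in range(4000)]
    filtered = tone
    for freq in targets:
        filtered = notch_filter(filtered, rate, freq)
    print(targets, rms(tone[-1000:]), rms(filtered[-1000:]))
```

Cascading one notch per target frequency keeps the rest of the speech band largely untouched, which is presumably why only a preset number of the strongest howling points are selected.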
Step 209: Obtain the sound data collected by the mobile terminal, separate the sound data into human voice and background sound, attenuate the separated background sound, mix the attenuated background sound with the separated human voice, and send the result as uplink voice call data to the server corresponding to the preset game application.
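The attenuate-and-mix part of step 209 can be sketched as below. The voice/background separation itself is not specified in the text and is not implemented here; the sketch assumes two already separated, equal-length sample lists in the range -1.0 to 1.0. The name `mix_uplink` and the gain value are hypothetical.

```python
def mix_uplink(voice, background, background_gain):
    """Scale the background by background_gain, add the voice, and clip to [-1, 1]."""
    if len(voice) != len(background):
        raise ValueError("voice and background must have the same length")
    mixed = []
    for v, b in zip(voice, background):
        sample = v + background_gain * b
        mixed.append(max(-1.0, min(1.0, sample)))
    return mixed


if __name__ == "__main__":
    print(mix_uplink([0.5, -0.5], [0.5, 0.5], 0.25))
```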
In the embodiment of the present application, after the voice call group in the game application is successfully established, howling detection is performed if it is detected that the voice call group contains a target mobile terminal close to the current mobile terminal. When howling sound is determined to exist, howling suppression is applied to the uplink voice call data and the downlink voice call data respectively. This can effectively weaken the howling sound, prevent it from interfering with the game, reduce pain points for game players, and improve the functionality of the mobile terminal.
Fig. 3 is a structural block diagram of a voice call data detection apparatus provided by an embodiment of the present application. The apparatus may be implemented in software and/or hardware, is generally integrated in a mobile terminal, and can perform howling detection on voice call data by executing the voice call data detection method. As shown in Fig. 3, the apparatus includes:
a detection trigger module 301, configured to detect that a howling detection event is triggered after a voice call group in a preset application is successfully established;
a downlink voice data acquisition module 302, configured to obtain downlink voice call data of a preset time length in the mobile terminal, and to divide the downlink voice call data into blocks to obtain M data blocks;
a suspected howling point determination module 303, configured to analyze in turn, using a preset analysis mode, whether a suspected howling point exists in the current data block, and to determine the first data block in which a suspected howling point appears as the starting data block;
a howling sound determination module 304, configured to take, starting from the starting data block, n data blocks in turn as the data segment to be analyzed, to analyze the suspected howling points contained in the current data segment using the preset analysis mode, and to determine that howling sound exists in the downlink voice call data when the frequency difference between the suspected howling points contained in the N data segments is within a preset range; where n = 2, 3, ..., N; N is less than or equal to M and greater than or equal to 2; the starting point of each data segment is the same as the starting point of the starting data block, and the starting data block is the first data segment.
With the voice call data detection apparatus provided in the embodiment of the present application, after the voice call group in the preset application is successfully established and a howling detection event is detected to be triggered, downlink voice call data of a preset time length in the mobile terminal is obtained and divided into blocks. The preset analysis mode is used to analyze in turn whether a suspected howling point exists in the current data block, and the first data block in which a suspected howling point appears is determined as the starting data block. Starting from the starting data block, n data blocks are taken in turn as the data segment to be analyzed, and the suspected howling points contained in the current data segment are analyzed using the preset analysis mode. When the frequency difference between the suspected howling points contained in the N data segments is within a preset range, it is determined that howling sound exists in the downlink voice call data. With this technical solution, after the voice call group of the preset application in the mobile terminal is successfully established, howling detection can be performed accurately on the downlink voice call data so that appropriate measures can be taken subsequently, reducing the inconvenience that howling sound causes to the user.
Optionally, the preset analysis mode includes: in the frequency domain, obtaining a frequency point to be determined in the high-frequency region whose energy value is higher than a preset energy threshold; calculating an energy difference value with respect to a preset number of frequency points around the frequency point to be determined; and, when the energy difference value is greater than a preset difference threshold, determining that the frequency point to be determined is a suspected howling point. The high-frequency region is the frequency range above a preset frequency threshold.
Optionally, the howling sound determination module is further configured to: if the frequency difference between the suspected howling points contained in the current data segment and those contained in the preceding data segments is not within the preset range, obtain downlink voice call data of the preset time length starting from the data block following the current data segment, and repeat the operations of dividing the downlink voice call data into blocks and the subsequent analysis.
Optionally, the apparatus further includes:
a howling point determination module, configured to determine the suspected howling points as howling points after it is determined that howling sound exists in the downlink voice call data; and
a howling suppression module, configured to perform howling suppression processing on the downlink voice call data according to the howling points.
Optionally, the howling suppression module is specifically configured to:
select the frequencies of a preset number of howling points with the highest corresponding energy values as target frequencies, and attenuate the audio signals corresponding to the target frequencies in the downlink voice call data; or
attenuate the audio signals corresponding to the frequencies of all howling points in the downlink voice call data.
Optionally, detecting that a howling detection event is triggered includes:
judging whether the voice call group contains a target mobile terminal whose distance from the mobile terminal is less than a preset distance value, and if so, determining that a howling detection event is detected to be triggered.
Optionally, judging whether the voice call group contains a target mobile terminal whose distance from the mobile terminal is less than the preset distance value includes:
playing a preset sound clip in a preset manner, and receiving feedback information from the other mobile terminals in the voice call group, where the feedback information includes the result of each other mobile terminal's attempt to capture a sound signal corresponding to the preset sound clip; and judging, according to the feedback information, whether the voice call group contains a target mobile terminal whose distance from the mobile terminal is less than the preset distance value;
or,
obtaining first positioning information of the mobile terminal and second positioning information of the other mobile terminals in the voice call group; and judging, according to the first positioning information and the second positioning information, whether the voice call group contains a target mobile terminal whose distance from the mobile terminal is less than the preset distance value;
or,
obtaining first WiFi information of the WiFi connected to the mobile terminal and second WiFi information of the WiFi connected to the other mobile terminals in the voice call group; and judging, according to the first WiFi information and the second WiFi information, whether the voice call group contains a target mobile terminal whose distance from the mobile terminal is less than the preset distance value;
or,
obtaining first sound data collected by the microphone, and obtaining the downlink voice call data in the mobile terminal, where the first sound data does not contain the sound played by the loudspeaker of the mobile terminal; and judging, according to whether the first sound data and the downlink voice call data contain the voice of the same person, whether the voice call group contains a target mobile terminal whose distance from the mobile terminal is less than the preset distance value.
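Two of the alternatives above, the positioning-based and the WiFi-based checks, can be sketched as follows. The text does not say how positions or WiFi information are compared, so this example makes two explicit assumptions: positioning information is a (latitude, longitude) pair in degrees compared by great-circle (haversine) distance, and WiFi information is the access point's BSSID, with an identical BSSID taken to mean the terminals are nearby. All names are hypothetical.

```python
import math

EARTH_RADIUS_M = 6371000.0  # mean Earth radius, used by the haversine formula


def haversine_m(lat1, lon1, lat2, lon2):
    """Great-circle distance in metres between two points given in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlambda = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlambda / 2) ** 2
    return 2 * EARTH_RADIUS_M * math.asin(math.sqrt(a))


def has_nearby_terminal(own_position, other_positions, max_distance_m):
    """True if any other terminal is closer than max_distance_m to own_position."""
    return any(haversine_m(own_position[0], own_position[1], lat, lon) < max_distance_m
               for lat, lon in other_positions)


def same_access_point(first_bssid, second_bssid):
    """Assumed WiFi check: identical BSSID (case-insensitive) means same access point."""
    return first_bssid.strip().lower() == second_bssid.strip().lower()


if __name__ == "__main__":
    me = (22.5431, 114.0579)
    others = [(22.5431, 114.0580), (23.1291, 113.2644)]
    print(has_nearby_terminal(me, others, 50.0))
    print(same_access_point("AA:BB:CC:00:11:22", "aa:bb:cc:00:11:22"))
```

Consumer positioning accuracy is often coarser than the distances at which howling occurs, which may be one reason the text allows several detection approaches to be combined.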
Optionally, the apparatus further includes:
a sound data acquisition module, configured to obtain the sound data collected by the mobile terminal after it is determined that howling sound exists in the downlink voice call data;
a sound separation module, configured to separate the sound data into human voice and background sound;
a background sound attenuation module, configured to attenuate the separated background sound; and
an uplink data sending module, configured to mix the attenuated background sound with the separated human voice and send the result as uplink voice call data to the server corresponding to the preset application.
Optionally, the preset application is an online game application.
The embodiment of the present application also provides a storage medium containing computer-executable instructions which, when executed by a computer processor, are used to perform a voice call data detection method, the method including:
after a voice call group in a preset application is successfully established, detecting that a howling detection event is triggered;
obtaining downlink voice call data of a preset time length in the mobile terminal, and dividing the downlink voice call data into blocks to obtain M data blocks;
analyzing in turn, using a preset analysis mode, whether a suspected howling point exists in the current data block, and determining the first data block in which a suspected howling point appears as the starting data block;
starting from the starting data block, taking n data blocks in turn as the data segment to be analyzed, analyzing the suspected howling points contained in the current data segment using the preset analysis mode, and determining that howling sound exists in the downlink voice call data when the frequency difference between the suspected howling points contained in the N data segments is within a preset range; where n = 2, 3, ..., N; N is less than or equal to M and greater than or equal to 2; the starting point of each data segment is the same as the starting point of the starting data block, and the starting data block is the first data segment.
Storage medium: any of various types of memory devices or storage devices. The term "storage medium" is intended to include: installation media, such as CD-ROM, floppy disks or tape devices; computer system memory or random access memory, such as DRAM, DDR RAM, SRAM, EDO RAM, Rambus RAM, etc.; non-volatile memory, such as flash memory or magnetic media (for example a hard disk or optical storage); registers or other similar types of memory elements, etc. The storage medium may also include other types of memory or combinations thereof. In addition, the storage medium may be located in a first computer system in which the program is executed, or may be located in a different, second computer system that is connected to the first computer system through a network (such as the Internet). The second computer system may provide program instructions to the first computer for execution. The term "storage medium" may include two or more storage media that may reside in different locations (for example, in different computer systems connected through a network). The storage medium may store program instructions (for example, embodied as a computer program) executable by one or more processors.
Of course, in the storage medium containing computer-executable instructions provided by the embodiment of the present application, the computer-executable instructions are not limited to the voice call data detection operations described above, and may also perform related operations in the voice call data detection method provided by any embodiment of the present application.
The embodiment of the present application provides a mobile terminal in which the voice call data detection apparatus provided by the embodiment of the present application may be integrated. Fig. 4 is a structural schematic diagram of a mobile terminal provided by an embodiment of the present application. The mobile terminal 400 may include: a memory 401, a processor 402, and a computer program stored in the memory 401 and executable on the processor 402, where the processor 402 implements the voice call data detection method described in the embodiment of the present application when executing the computer program.
With the mobile terminal provided by the embodiment of the present application, after the voice call group of the preset application in the mobile terminal is successfully established, howling detection can be performed accurately on the downlink voice call data so that appropriate measures can be taken subsequently, reducing the inconvenience that howling sound causes to the user.
Fig. 5 is a structural schematic diagram of another mobile terminal provided by an embodiment of the present application. The mobile terminal may include: a housing (not shown), a memory 501, a central processing unit (CPU) 502 (also called a processor, hereinafter referred to as the CPU), a circuit board (not shown) and a power circuit (not shown). The circuit board is placed inside the space enclosed by the housing; the CPU 502 and the memory 501 are arranged on the circuit board; the power circuit supplies power to each circuit or device of the mobile terminal; the memory 501 stores executable program code; and the CPU 502 runs a computer program corresponding to the executable program code by reading the executable program code stored in the memory 501, so as to perform the following steps:
after a voice call group in a preset application is successfully established, detecting that a howling detection event is triggered;
obtaining downlink voice call data of a preset time length in the mobile terminal, and dividing the downlink voice call data into blocks to obtain M data blocks;
analyzing in turn, using a preset analysis mode, whether a suspected howling point exists in the current data block, and determining the first data block in which a suspected howling point appears as the starting data block;
starting from the starting data block, taking n data blocks in turn as the data segment to be analyzed, analyzing the suspected howling points contained in the current data segment using the preset analysis mode, and determining that howling sound exists in the downlink voice call data when the frequency difference between the suspected howling points contained in the N data segments is within a preset range; where n = 2, 3, ..., N; N is less than or equal to M and greater than or equal to 2; the starting point of each data segment is the same as the starting point of the starting data block, and the starting data block is the first data segment.
The mobile terminal further includes: a peripheral interface 503, an RF (Radio Frequency) circuit 505, an audio circuit 506, a loudspeaker 511, a power management chip 508, an input/output (I/O) subsystem 509, a touch screen 512, other input/control devices 510 and an external port 504. These components communicate through one or more communication buses or signal lines 507.
It should be understood that the illustrated mobile terminal 500 is only one example of a mobile terminal, and that the mobile terminal 500 may have more or fewer components than shown in the drawings, may combine two or more components, or may have a different configuration of components. The various components shown in the drawings may be implemented in hardware, software, or a combination of hardware and software, including one or more signal processing and/or application-specific integrated circuits.
The mobile terminal for howling detection of voice call data provided in this embodiment is described in detail below, taking a mobile phone as an example.
Memory 501: the memory 501 can be accessed by the CPU 502, the peripheral interface 503, and so on. The memory 501 may include high-speed random access memory, and may also include non-volatile memory, such as one or more magnetic disk storage devices, flash memory devices, or other volatile solid-state storage devices.
Peripheral interface 503: the peripheral interface 503 can connect the input and output peripherals of the device to the CPU 502 and the memory 501.
I/O subsystem 509: the I/O subsystem 509 can connect the input/output peripherals of the device, such as the touch screen 512 and the other input/control devices 510, to the peripheral interface 503. The I/O subsystem 509 may include a display controller 5091 and one or more input controllers 5092 for controlling the other input/control devices 510. The one or more input controllers 5092 receive electrical signals from, or send electrical signals to, the other input/control devices 510, which may include physical buttons (push buttons, rocker buttons, etc.), dials, slide switches, joysticks and click wheels. It is worth noting that an input controller 5092 may be connected to any of the following: a keyboard, an infrared port, a USB interface, and a pointing device such as a mouse.
Touch screen 512: the touch screen 512 is the input and output interface between the user terminal and the user, and displays visual output to the user. The visual output may include graphics, text, icons, video, and so on.
The display controller 5091 in the I/O subsystem 509 receives electrical signals from, or sends electrical signals to, the touch screen 512. The touch screen 512 detects contact on the touch screen, and the display controller 5091 converts the detected contact into interaction with user interface objects displayed on the touch screen 512, thereby realizing human-computer interaction. The user interface objects displayed on the touch screen 512 may be icons for running games, icons for connecting to the corresponding network, and so on. It is worth noting that the device may also include an optical mouse, which is a touch-sensitive surface that does not display visual output, or an extension of the touch-sensitive surface formed by the touch screen.
RF circuit 505: mainly used to establish communication between the mobile phone and the wireless network (that is, the network side), so as to receive and send data between the mobile phone and the wireless network, for example sending and receiving short messages and e-mail. Specifically, the RF circuit 505 receives and sends RF signals, which are also called electromagnetic signals; the RF circuit 505 converts electrical signals into electromagnetic signals or electromagnetic signals into electrical signals, and communicates with communication networks and other devices through the electromagnetic signals. The RF circuit 505 may include known circuits for performing these functions, including but not limited to an antenna system, an RF transceiver, one or more amplifiers, a tuner, one or more oscillators, a digital signal processor, a CODEC (COder-DECoder) chipset, a Subscriber Identity Module (SIM), and so on.
Audio circuit 506: mainly used to receive audio data from the peripheral interface 503, convert the audio data into an electrical signal, and send the electrical signal to the loudspeaker 511.
Loudspeaker 511: used to restore the voice signal that the mobile phone receives from the wireless network through the RF circuit 505 to sound and play the sound to the user.
Power management chip 508: used to supply power to, and manage the power of, the hardware connected to the CPU 502, the I/O subsystem and the peripheral interface.
The voice call data detection apparatus, storage medium and mobile terminal provided in the above embodiments can execute the voice call data detection method provided by any embodiment of the present application, and have the corresponding functional modules and beneficial effects for executing the method. For technical details not described in detail in the above embodiments, reference may be made to the voice call data detection method provided by any embodiment of the present application.
Note that the above are only preferred embodiments of the present application and the technical principles applied. Those skilled in the art will understand that the present application is not limited to the specific embodiments described here, and that various obvious changes, readjustments and substitutions can be made without departing from the protection scope of the present application. Therefore, although the present application has been described in further detail through the above embodiments, it is not limited to the above embodiments and may include further equivalent embodiments without departing from the concept of the present application; the scope of the present application is determined by the scope of the appended claims.

Claims (10)

1. A voice call data detection method, characterized by comprising:
detecting that a voice call group in a preset game application in a mobile terminal is successfully established, judging whether the voice call group contains a target mobile terminal whose distance from the mobile terminal is less than a preset distance value, and if so, determining that a howling detection event is detected to be triggered;
obtaining downlink voice call data of a preset time length in the mobile terminal, and dividing the downlink voice call data into blocks to obtain M data blocks;
analyzing in turn, using a preset analysis mode, whether a suspected howling point exists in the current data block, and determining the first data block in which a suspected howling point appears as the starting data block;
starting from the starting data block, taking n data blocks in turn as the data segment to be analyzed, analyzing the suspected howling points contained in the current data segment using the preset analysis mode, and determining that howling sound exists in the downlink voice call data when the frequency difference between the suspected howling points contained in the N data segments is within a preset range; wherein n = 2, 3, ..., N; N is less than or equal to M and greater than or equal to 2; the starting point of each data segment is the same as the starting point of the starting data block, and the starting data block is the first data segment;
determining the suspected howling points as howling points, setting a suppression flag for the howling points, and updating a howling index according to the howling points for which the suppression flag has been set;
performing howling suppression processing on the downlink voice call data according to the howling points, continuing to obtain downlink voice call data of the preset time length, and, when it is determined that the new downlink voice call data contains a suspected howling point, judging whether a suppression flag has been set for the suspected howling point; if so, performing howling suppression processing on the new downlink voice call data according to the suspected howling point for which the suppression flag has been set.
2. The method according to claim 1, characterized in that the preset analysis mode comprises: in the frequency domain, obtaining a frequency point to be determined in the high-frequency region whose energy value is higher than a preset energy threshold; calculating an energy difference value with respect to a preset number of frequency points around the frequency point to be determined; and, when the energy difference value is greater than a preset difference threshold, determining that the frequency point to be determined is a suspected howling point; the high-frequency region being the frequency range above a preset frequency threshold.
3. The method according to claim 1, characterized by further comprising:
if the frequency difference between the suspected howling points contained in the current data segment and those contained in the preceding data segments is not within the preset range, obtaining downlink voice call data of the preset time length starting from the data block following the current data segment, and repeating the operations of dividing the downlink voice call data into blocks and the subsequent analysis.
4. The method according to claim 1, characterized in that performing howling suppression processing on the downlink voice call data according to the howling points comprises:
selecting the frequencies of a preset number of howling points with the highest corresponding energy values as target frequencies, and attenuating the audio signals corresponding to the target frequencies in the downlink voice call data; or
attenuating the audio signals corresponding to the frequencies of all howling points in the downlink voice call data.
5. The method according to claim 1, wherein judging whether there is, in the voice call group, a target mobile terminal whose distance from the mobile terminal is less than the preset distance value comprises:
Playing a preset sound clip in a preset manner, and receiving feedback information from the other mobile terminals in the voice call group, the feedback information including the result of each of the other mobile terminals attempting to capture the sound signal corresponding to the preset sound clip; and judging, according to the feedback information, whether there is, in the voice call group, a target mobile terminal whose distance from the mobile terminal is less than the preset distance value;
Alternatively,
Acquiring first positioning information of the mobile terminal and second positioning information of the other mobile terminals in the voice call group; and judging, according to the first positioning information and the second positioning information, whether there is, in the voice call group, a target mobile terminal whose distance from the mobile terminal is less than the preset distance value;
Alternatively,
Acquiring first WiFi information of the WiFi to which the mobile terminal is connected and second WiFi information of the WiFi to which the other mobile terminals in the voice call group are connected; and judging, according to the first WiFi information and the second WiFi information, whether there is, in the voice call group, a target mobile terminal whose distance from the mobile terminal is less than the preset distance value;
Alternatively,
Acquiring first sound data captured by the microphone, and acquiring the downlink voice call data in the mobile terminal, wherein the first sound data does not contain sound played by the loudspeaker of the mobile terminal; and judging, according to whether the first sound data and the downlink voice call data contain the voice of the same person, whether there is, in the voice call group, a target mobile terminal whose distance from the mobile terminal is less than the preset distance value.
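The positioning-based alternative of claim 5 could look roughly like the sketch below. It assumes that the positioning information is a latitude/longitude pair in degrees and uses the haversine great-circle distance; the claim does not prescribe either, and the function names and terminal identifiers are hypothetical.

```python
import math
from typing import Dict, List, Tuple

EARTH_RADIUS_M = 6371000.0  # mean Earth radius, adequate for short distances

Position = Tuple[float, float]  # (latitude, longitude) in degrees


def haversine_m(lat1: float, lon1: float, lat2: float, lon2: float) -> float:
    """Great-circle distance in metres between two latitude/longitude points."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    return 2 * EARTH_RADIUS_M * math.asin(math.sqrt(a))


def find_target_terminals(own: Position, others: Dict[str, Position],
                          preset_distance_m: float) -> List[str]:
    """Identifiers of group members closer than the preset distance value."""
    return [
        terminal_id
        for terminal_id, (lat, lon) in others.items()
        if haversine_m(own[0], own[1], lat, lon) < preset_distance_m
    ]
```

A non-empty result would correspond to the howling detection event being triggered. In practice satellite positioning error can exceed the distances at which acoustic feedback occurs, which may be why the claim also lists acoustic and WiFi-based alternatives.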
6. The method according to claim 1, further comprising, after determining that howling exists in the downlink voice call data:
Acquiring the sound data captured by the mobile terminal;
Separating the human voice from the background sound in the sound data;
Attenuating the separated background sound;
Mixing the attenuated background sound with the separated human voice, and sending the result as uplink voice call data to the server corresponding to the preset application.
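The last two steps of claim 6 (attenuating the background and mixing it back with the voice) can be illustrated with the sketch below. It assumes the voice/background separation has already been done by some other component and that samples are floats in [-1.0, 1.0]; `build_uplink_frame` and `background_gain` are hypothetical names, and the gain value is arbitrary.

```python
from typing import List, Sequence


def build_uplink_frame(voice: Sequence[float], background: Sequence[float],
                       background_gain: float = 0.2) -> List[float]:
    """Mix the separated voice with an attenuated copy of the background."""
    if len(voice) != len(background):
        raise ValueError("voice and background must have the same length")
    mixed = []
    for v, b in zip(voice, background):
        sample = v + background_gain * b
        # Clip to the valid range so the mix cannot overflow after summation.
        mixed.append(max(-1.0, min(1.0, sample)))
    return mixed
```

Lowering the background level in the uplink reduces the chance that sound played by a nearby terminal is re-sent into the call, which is the feedback path that produces howling.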
7. The method according to claim 1, wherein the preset application is an online game application.
8. A voice call data detection device, comprising:
A detection trigger module, configured to, upon detecting that a voice call group in a preset game application on a mobile terminal has been successfully established, judge whether there is, in the voice call group, a target mobile terminal whose distance from the mobile terminal is less than a preset distance value, and if so, determine that a howling detection event is triggered;
A downlink voice data acquisition module, configured to acquire downlink voice call data of a preset time length in the mobile terminal, and to divide the downlink voice call data into blocks to obtain M data blocks;
A suspected howling point determination module, configured to analyze in turn, using a preset analysis mode, whether a suspected howling point exists in the current data block, and to determine the data block in which a suspected howling point first appears as the initial data block; wherein the preset analysis mode comprises: obtaining, in the frequency domain, a candidate frequency point in the high-frequency region whose energy value is higher than a preset energy threshold; calculating the energy difference value with respect to a preset number of frequency points around the candidate frequency point; and, when the energy difference value is greater than a preset difference threshold, determining the candidate frequency point to be a suspected howling point; the high-frequency region being the frequency range above a preset frequency threshold;
A howling determination module, configured to, starting from the initial data block, take n data blocks in turn as the data segment to be analyzed, analyze the suspected howling points contained in the current data segment using the preset analysis mode, and, when the frequency differences between the suspected howling points contained in N data segments are within a preset range, determine that howling exists in the downlink voice call data; wherein n = 2, 3, ..., N; N is less than or equal to M and greater than or equal to 2; the starting point of each data segment is the same as the starting point of the initial data block, and the initial data block is the first data segment;
A howling point determination module, configured to, after it is determined that howling exists in the downlink voice call data, determine the suspected howling points to be howling points, set a suppression flag for the howling points, and update a howling index according to the howling points for which the suppression flag has been set;
A howling suppression module, configured to perform howling suppression processing on the downlink voice call data according to the howling points, continue to acquire downlink voice call data of the preset time length, and, when it is determined that the new downlink voice call data contains a suspected howling point, judge whether the suppression flag has been set for that suspected howling point, and if so, perform howling suppression processing on the new downlink voice call data according to the suspected howling point for which the suppression flag has been set.
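The "preset analysis mode" used by the suspected howling point determination module might be sketched as follows. This is not the patented implementation: the naive DFT is used only to keep the example dependency-free, the energy difference is taken as the candidate's energy minus the mean energy of its neighbouring bins (one possible reading of the claim), and all names and thresholds are hypothetical.

```python
import cmath
import math
from typing import List, Sequence, Tuple


def power_spectrum(samples: Sequence[float]) -> List[float]:
    """Per-bin energy from a naive DFT (fine for short illustrative blocks)."""
    n = len(samples)
    spectrum = []
    for k in range(n // 2 + 1):
        acc = sum(samples[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        spectrum.append(abs(acc) ** 2 / n)
    return spectrum


def suspected_howling_points(
    samples: Sequence[float],
    sample_rate: float,
    min_freq_hz: float,       # preset frequency threshold (start of high-frequency region)
    energy_threshold: float,  # preset energy threshold
    neighbours: int,          # preset number of bins examined on each side
    diff_threshold: float,    # preset difference threshold
) -> List[Tuple[float, float]]:
    """Return (frequency, energy) pairs for bins that look like howling."""
    spectrum = power_spectrum(samples)
    n = len(samples)
    points = []
    for k, energy in enumerate(spectrum):
        freq = k * sample_rate / n
        if freq <= min_freq_hz or energy <= energy_threshold:
            continue  # not a candidate frequency point
        lo = max(0, k - neighbours)
        hi = min(len(spectrum), k + neighbours + 1)
        around = [spectrum[j] for j in range(lo, hi) if j != k]
        if not around:
            continue
        # A howling tone stands far above the bins immediately around it.
        if energy - sum(around) / len(around) > diff_threshold:
            points.append((freq, energy))
    return points
```

Comparing a candidate against its neighbours, rather than against an absolute level alone, is what separates a narrow feedback tone from loud but broadband speech or game audio.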
9. A computer-readable storage medium storing a computer program, wherein the program, when executed by a processor, implements the voice call data detection method according to any one of claims 1-7.
10. A mobile terminal, comprising a memory, a processor, and a computer program stored in the memory and executable on the processor, wherein the processor, when executing the computer program, implements the voice call data detection method as claimed in claim 1.
CN201810201127.8A 2018-03-12 2018-03-12 Voice communication data detection method, device, storage medium and mobile terminal Active CN108494954B (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201810201127.8A CN108494954B (en) 2018-03-12 2018-03-12 Voice communication data detection method, device, storage medium and mobile terminal
PCT/CN2019/076978 WO2019174492A1 (en) 2018-03-12 2019-03-05 Voice call data detection method, device, storage medium and mobile terminal

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810201127.8A CN108494954B (en) 2018-03-12 2018-03-12 Voice communication data detection method, device, storage medium and mobile terminal

Publications (2)

Publication Number Publication Date
CN108494954A CN108494954A (en) 2018-09-04
CN108494954B true CN108494954B (en) 2019-10-25

Family

ID=63338520

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810201127.8A Active CN108494954B (en) 2018-03-12 2018-03-12 Voice communication data detection method, device, storage medium and mobile terminal

Country Status (2)

Country Link
CN (1) CN108494954B (en)
WO (1) WO2019174492A1 (en)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108494954B (en) * 2018-03-12 2019-10-25 Oppo广东移动通信有限公司 Voice communication data detection method, device, storage medium and mobile terminal
CN113225657B (en) * 2021-04-16 2022-09-30 深圳木芯科技有限公司 Multi-channel squeal suppression method based on double-microphone architecture
CN113596662B (en) * 2021-07-30 2024-04-02 北京小米移动软件有限公司 Method for suppressing howling, device for suppressing howling, earphone, and storage medium
CN113473304B (en) * 2021-08-17 2024-01-23 北京小米移动软件有限公司 Howling suppression method, device, earphone and storage medium
CN113593518A (en) * 2021-08-25 2021-11-02 歌尔科技有限公司 Howling suppression method and device, in-ear earphone and storage medium
CN113749620B (en) * 2021-09-27 2024-03-12 广州医科大学附属第一医院(广州呼吸中心) Sleep apnea detection method, system, equipment and storage medium

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2013125257A1 (en) * 2012-02-20 2013-08-29 株式会社Jvcケンウッド Noise signal suppression apparatus, noise signal suppression method, special signal detection apparatus, special signal detection method, informative sound detection apparatus, and informative sound detection method
JP2013236272A (en) * 2012-05-09 2013-11-21 Sony Corp Voice processing device and voice processing method and program
CN104580699B (en) * 2014-12-15 2017-06-30 广东欧珀移动通信有限公司 Method and device for voice control of an intelligent terminal in standby
CN106488052A (en) * 2015-08-27 2017-03-08 成都鼎桥通信技术有限公司 Howling scene recognition method and device
CN106878533B (en) * 2015-12-10 2021-03-19 北京奇虎科技有限公司 Communication method and device of mobile terminal
CN105895115A (en) * 2016-04-01 2016-08-24 北京小米移动软件有限公司 Squeal determining method and squeal determining device
CN106100676A (en) * 2016-06-07 2016-11-09 海能达通信股份有限公司 Audio output control method, user terminal and interphone terminal
CN107645696B (en) * 2016-07-20 2019-04-19 腾讯科技(深圳)有限公司 Howling detection method and device
CN107566658A (en) * 2017-10-13 2018-01-09 广东欧珀移动通信有限公司 Call method, device, storage medium and mobile terminal
CN108494954B (en) * 2018-03-12 2019-10-25 Oppo广东移动通信有限公司 Voice communication data detection method, device, storage medium and mobile terminal

Also Published As

Publication number Publication date
WO2019174492A1 (en) 2019-09-19
CN108494954A (en) 2018-09-04

Similar Documents

Publication Publication Date Title
CN108494954B (en) Voice communication data detection method, device, storage medium and mobile terminal
CN108449493A (en) Voice communication data processing method, device, storage medium and mobile terminal
CN108449496A (en) Voice communication data detection method, device, storage medium and mobile terminal
CN108449507A (en) Voice communication data processing method, device, storage medium and mobile terminal
CN108449503A (en) Voice communication data processing method, device, storage medium and mobile terminal
CN108172237A (en) Voice communication data processing method, device, storage medium and mobile terminal
CN108449502A (en) Voice communication data processing method, device, storage medium and mobile terminal
CN108449506A (en) Voice communication data processing method, device, storage medium and mobile terminal
CN108418968A (en) Voice communication data processing method, device, storage medium and mobile terminal
CN108449499A (en) Voice communication data processing method, device, storage medium and mobile terminal
WO2014117722A1 (en) Speech processing method, device and terminal apparatus
CN108449497A (en) Voice communication data processing method, device, storage medium and mobile terminal
CN108449495A (en) Voice communication data processing method, device, storage medium and mobile terminal
CN109218535A (en) Method, apparatus, storage medium and terminal for intelligently adjusting volume
CN109151789A (en) Interpretation method, device, system and bluetooth headset
CN108429955A (en) Intelligent device and method for letting ambient sound into an earphone
CN108449504B (en) Voice communication data detection method, device, storage medium and mobile terminal
CN108418982A (en) Voice communication data processing method, device, storage medium and mobile terminal
CN108449492A (en) Voice communication data processing method, device, storage medium and mobile terminal
CN108449508A (en) Voice communication processing method, device, storage medium and mobile terminal
CN107977187B (en) Reverberation adjusting method and electronic equipment
CN108429858A (en) Voice communication data processing method, device, storage medium and mobile terminal
CN108449505A (en) Voice communication data detection method, device, storage medium and mobile terminal
CN108449498B (en) Voice call data processing method and device, storage medium and mobile terminal
CN108449494A (en) voice communication data processing method, device, storage medium and server

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information

Address after: Changan town in Guangdong province Dongguan 523860 usha Beach Road No. 18

Applicant after: OPPO Guangdong Mobile Communications Co., Ltd.

Address before: Changan town in Guangdong province Dongguan 523860 usha Beach Road No. 18

Applicant before: Guangdong OPPO Mobile Communications Co., Ltd.

GR01 Patent grant