CN110942783B

CN110942783B - Group call type crank call classification method based on audio multistage clustering

Info

Publication number: CN110942783B
Application number: CN201910978660.XA
Authority: CN
Inventors: 高圣翔; 黄远; 杨晶超; 宁珊; 李娅强; 戚梦苑; 孙旭东; 陈海鹏; 王宪法; 鲍尚策; 王文重; 王瑞杰
Original assignee: National Computer Network and Information Security Management Center; Zhuhai Comleader Information Technology Co Ltd
Current assignee: National Computer Network and Information Security Management Center; Zhuhai Comleader Information Technology Co Ltd
Priority date: 2019-10-15
Filing date: 2019-10-15
Publication date: 2022-06-17
Anticipated expiration: 2039-10-15
Also published as: CN110942783A

Abstract

The invention relates to a group call type crank call classification method based on audio multistage clustering, which comprises the following steps: s100, dividing an audio pool comprising a plurality of audio data into a plurality of equal groups, sequentially performing feature extraction and feature comparison on each group, and further performing cluster analysis to obtain audio clusters; s200, voice transcription is carried out on the audio, and keyword library retrieval comparison is carried out on the text subjected to voice transcription to obtain a keyword comparison result; s300, performing audio library retrieval comparison on the audio clustering to obtain an audio clustering result; and S400, merging and analyzing the keyword comparison result and the audio clustering result to obtain the automatically classified group call type crank call. The invention has the beneficial effects that: the group call type harassing call can be effectively detected and found; the method combines the means of key words, text transcription and the like, realizes automatic classification of the crank calls, saves labor cost and improves efficiency.

Description

Group call type crank call classification method based on audio multistage clustering

Technical Field

The invention relates to the field of audio identification, in particular to a group call type harassing call classification method based on audio multistage clustering.

Background

At present, various technical means are provided in China to realize the detection of harassing calls, wherein aiming at group call type calls, the detection and the discovery are carried out through audio characteristic comparison and audio clustering technology. However, along with the continuous deepening of the disturbance audio data treatment work, lawless persons continuously update the techniques to confront the disturbance audio data simultaneously, and great challenges are brought to the current treatment work.

In addition, the conventional audio feature comparison and audio clustering technology is to perform clustering analysis once in a group after audio data are coarsely grouped, so that the clustering analysis of the audio data is completed. The quality of the clustering analysis result completely depends on the quality of the same audio data grouping, and as the audio grouping is random grouping, part of the same harassing audios cannot be grouped into one group and cannot be aggregated together, so that the difficulty in harassing audio treatment is objectively increased.

With the increasingly severe form of countermeasure, the content of the group call type telephone is changeable, the treatment workload is increased, and at present, no content-based classification algorithm is used for the detection of the group call type telephone process. And the prior art has the following defects:

(1) according to the traditional clustering algorithm, certain defects of group call type crank calls are found, and all crank calls cannot be detected and found;

(2) the traditional crank call discovery does not combine score judgment such as key words and text transcription, and crank calls cannot be effectively classified.

Disclosure of Invention

The invention aims to solve at least one of technical problems in the prior art, and provides a group call type harassing call classification method based on audio multistage clustering, which realizes automatic classification, saves labor cost and improves efficiency.

The technical scheme of the invention comprises a method for classifying group call type crank calls based on audio multistage clustering, which is characterized by comprising the following steps: s100, dividing an audio pool comprising a plurality of audio data into a plurality of equivalent groups, sequentially performing feature extraction and feature comparison on each group, and further performing clustering analysis to obtain audio clusters; s200, voice transcription is carried out on the audio, and keyword library retrieval comparison is carried out on the text subjected to voice transcription to obtain a keyword comparison result; s300, performing audio library retrieval comparison on the audio clustering to obtain an audio clustering result; and S400, combining and analyzing the keyword comparison result and the audio clustering result to obtain the automatically classified group call type harassing calls.

According to the method for classifying the group call type crank calls based on the audio multistage clustering, S100 specifically comprises the following steps: s110, inputting an audio pool comprising a plurality of audio data, and randomly dividing the audio pool into N groups, wherein the maximum number of each group is M audio; s120, after feature extraction and feature comparison, each group of audio is subjected to clustering analysis according to a threshold value; s130, aggregating the clustering results of each group again to form a final clustering result; and S140, circularly executing the steps S110 to S130 until the clustering processing of all the audio data is completed.

According to the method for classifying the group call type crank calls based on the audio multistage cluster, S120 further comprises the following steps: if the audio clustering is successful, the secondary analysis is not participated; and if the audio clustering is unsuccessful, putting the audio into an audio pool again, and executing the steps S110-140 again.

According to the method for classifying the group call type crank calls based on the audio multistage cluster, M, N and the cycle execution times can be set in a user-defined mode.

According to the method for classifying the group call type crank calls based on the audio multistage clustering, the S200 specifically comprises the following steps: s210, comparing the audio with a harassment audio library to obtain an audio similarity score; s220, audio clustering is carried out on the audio to obtain audio clustering result information

According to the method for classifying the group call type crank calls based on the audio multistage cluster, the S300 specifically comprises the following steps: and (4) transcribing the audio files in the audio clusters into texts according to a set proportion, and judging whether the texts are harassing calls or not by combining a keyword identification method to obtain a keyword comparison result.

According to the method for classifying the group call type crank calls based on the audio multistage clustering, S400 specifically comprises the following steps: s410, summarizing and associating the results of S200 and S300 to obtain the audio comparison similarity score and the audio cluster number of each audio file; and S420, aiming at each audio cluster, judging and marking the corresponding harassment type according to the success of the audio comparison result, the comparison of the audio comparison similarity score with the set threshold value and the keyword comparison result, and automatically classifying according to the harassment type judgment and marking.

The beneficial effects of the invention are as follows: the group call type harassing call can be effectively detected and found; the method combines the means of key words, text transcription and the like, realizes automatic classification of the crank calls, saves labor cost and improves efficiency.

Drawings

The invention is further described below with reference to the accompanying drawings and examples;

FIG. 1 illustrates an overall flow diagram according to an embodiment of the invention;

FIG. 2 is a flow diagram illustrating packet clustering according to an embodiment of the present invention;

fig. 3 is a general flow chart of the classification of a group call type crank call according to the embodiment of the invention.

Detailed Description

Reference will now be made in detail to the present preferred embodiments of the present invention, examples of which are illustrated in the accompanying drawings, wherein like reference numerals refer to like elements throughout.

In the description of the present invention, the meaning of several is one or more, and the meaning of a plurality is more than two, and cannot be understood as indicating or implying relative importance or implicitly indicating the number of technical features indicated or implicitly indicating the precedence of the technical features indicated.

In the description of the present invention, unless otherwise explicitly defined, such as set forth, a person skilled in the art can reasonably determine the specific meaning of the above-mentioned words in the present invention in combination with the details of the technical solution.

The noun explains:

audio library: and intercepting the audio clips with fixed lengths, and converting the audio clips into a model library for audio matching after feature extraction.

Clustering: the same feature audio is aggregated into one category.

FIG. 1 shows a general flow diagram according to an embodiment of the invention. The process includes steps S100 to S400 as follows: s100, dividing an audio pool comprising a plurality of audio data into a plurality of equal groups, sequentially performing feature extraction and feature comparison on each group, and further performing cluster analysis to obtain audio clusters; s200, voice transcription is carried out on the audio, and keyword library retrieval comparison is carried out on the text subjected to voice transcription to obtain a keyword comparison result; s300, performing audio library retrieval comparison on the audio clustering to obtain an audio clustering result; and S400, merging and analyzing the keyword comparison result and the audio clustering result to obtain the automatically classified group call type crank call.

FIG. 2 is a flow chart illustrating packet clustering according to an embodiment of the present invention. The method specifically comprises the following steps:

the method comprises the steps of finding group call type crank calls in mass data by utilizing the characteristics of batch call of the group call type crank calls and comprehensively utilizing relevant technologies such as audio comparison, audio clustering and voice-to-text conversion, then comprehensively scoring by combining and combining a keyword analysis technology, a voice-to-text technology and the like, and finally outputting crank call classification information.

In the analysis process, an audio multistage clustering method is adopted in the audio clustering process. The central idea is as follows: averagely grouping the audio data, carrying out cluster analysis on samples in a group, and storing the analysis result; on the basis of the last clustering analysis result, recombining and randomly grouping the audio data which are not successfully clustered, clustering and analyzing the samples in the groups, and merging the analysis result into the analysis result of the last time; and cycling sequentially until no new clusters are generated.

The specific process is described as follows:

(1) audio data is input and randomly divided into N groups, each group having a maximum of M audio frequencies.

(2) After feature extraction and feature comparison, each group is subjected to clustering analysis according to a threshold value, and if clustering is successful, secondary analysis is not involved; otherwise, the audio pool is put again to wait for the next analysis.

(3) And aggregating the clustering results of each group again to form a final clustering result.

(4) And (4) placing the audio failed in clustering into an audio pool for scattering, and executing the step (1) again after recombining.

(5) And repeating iteration until no new clustering result is generated.

After the method, all the group call type crank calls can be found and classified. In fact, the desired effect may be achieved over an infinite number of iterations, but at the expense of a significant amount of time. The grouping, the size of the group and the number of iterations are usually set for limitation, so that the harassing group call can be found as much as possible, and the analysis time can be prolonged.

Fig. 3 is a general flow chart of the classification of a group call type crank call according to the embodiment of the invention. The concrete steps are summarized as follows:

(1) and comparing the audio file with a harassment audio library to obtain an audio similarity score.

(2) And carrying out audio clustering on the audio files to obtain audio clustering result information.

(3) And (4) transferring the audio files in the audio cluster into texts according to a certain proportion, and judging whether the texts are harassing calls or not by combining a keyword recognition technology.

(4) And comprehensively analyzing the results of the steps and automatically classifying the crank calls.

Based on the embodiment of fig. 3, the invention further discloses a classification counting scheme of group call type harassing calls, which comprises the following steps:

and the classification module collects and associates the results of the audio comparison module and the audio clustering module to obtain the audio comparison similarity score and the audio clustering number of each audio file.

For each audio cluster, three cases can be classified according to whether the audio comparison result is successful, namely: comparing all the audio files in the category with the harassment audio library successfully, comparing partial audio files in the category with the harassment audio library successfully, and comparing all the audio files in the category with the harassment audio library unsuccessfully, and aiming at the former two conditions, the audio files in the category can be summarized as containing harassment audio files.

And aiming at each audio cluster, comparing the similarity score with a set threshold value according to audio comparison. If the audio similarity is greater than or equal to the threshold value, marking the audio similarity as a disturbance type I; conversely, if the audio similarity is less than the threshold, the audio is marked as disturbance type II; in addition, other audio files which are not successfully compared with the harassment audio library in the class are marked as harassment type III.

For each audio cluster, if the cluster does not contain a harassment audio file, the judgment needs to be carried out through the text content of the harassment audio file, currently, a keyword analysis technology is adopted to judge whether the harassment audio file is a harassment audio file, and if the harassment audio file is a harassment type IV.

By integrating the flows, it can be seen that the disturbance type I, the disturbance type II and the disturbance type III have strong relevance, and the disturbance type IV may be a new means for lawbreakers.

The present invention is further described below with reference to the above figures and flow. The following embodiments are merely used to more clearly illustrate the flow scheme of data analysis, and should not be taken as limiting the scope of the present invention.

The embodiments of the present invention have been described in detail with reference to the accompanying drawings, but the present invention is not limited to the above embodiments, and various changes can be made within the knowledge of those skilled in the art without departing from the gist of the present invention.

Claims

1. A method for classifying group call type crank calls based on audio multistage clustering is characterized by comprising the following steps:

s100, dividing an audio pool comprising a plurality of audio data into a plurality of equal groups, sequentially performing feature extraction and feature comparison on each group, and further performing cluster analysis to obtain audio clusters; the S100 specifically includes: s110, inputting an audio pool comprising a plurality of audio data, and randomly dividing the audio pool into N groups, wherein the maximum number of each group is M audio; s120, after feature extraction and feature comparison, each group of audio is subjected to clustering analysis according to a threshold value; s130, aggregating the clustering results of each group again to form a final clustering result; s140, circularly executing the steps S110-S130 until all audio data are clustered;

s200, voice transcription is carried out on the audio, and keyword library retrieval comparison is carried out on the text subjected to voice transcription to obtain a keyword comparison result;

s300, performing audio library retrieval comparison on the audio clustering to obtain an audio clustering result;

and S400, merging and analyzing the keyword comparison result and the audio clustering result to obtain the automatically classified group call type crank call.

2. The method for classifying group call-type harassing calls based on audio multistage clustering as claimed in claim 1, wherein said S120 further comprises:

if the audio clustering is successful, the secondary analysis is not participated;

and if the audio clustering is unsuccessful, putting the audio into an audio pool again, and executing the steps S110-140 again.

3. The method for classifying group call type crank calls based on audio multistage clustering according to claim 1, wherein: wherein M, N and the number of loop executions can be set by user.

4. The method for classifying group call type crank calls based on audio multistage clustering according to claim 1, wherein the S200 specifically comprises: and (4) transcribing the audio files in the audio clusters into texts according to a set proportion, and judging whether the texts are harassing calls or not by combining a keyword identification method to obtain a keyword comparison result.

5. The method for classifying group-call-type harassing calls based on audio multi-level clustering according to claim 1, wherein the S300 specifically comprises:

s310, comparing the audio with a harassment audio library to obtain an audio similarity score;

and S320, carrying out audio clustering on the audio to obtain audio clustering result information.

6. The method for classifying group call type crank calls based on audio multistage clustering according to claim 1, wherein the S400 specifically comprises:

s410, summarizing and associating the results of S200 and S300 to obtain the audio comparison similarity score and the audio cluster number of each audio file;

and S420, aiming at each audio cluster, judging and marking the corresponding harassment type according to the success of the audio comparison result, the comparison of the audio comparison similarity score with the set threshold value and the keyword comparison result, and automatically classifying according to the harassment type judgment and marking.