CN113066496A

CN113066496A - Method for analyzing call robbing of two conversation parties in audio

Info

Publication number: CN113066496A
Application number: CN202110284452.7A
Authority: CN
Inventors: 董金杰
Original assignee: Zhejiang Baiying Technology Co Ltd
Current assignee: Zhejiang Baiying Technology Co Ltd
Priority date: 2021-03-17
Filing date: 2021-03-17
Publication date: 2021-07-02

Abstract

The invention relates to a method for analyzing the call robbing of two conversation parties in audio, which comprises the following steps: (1) processing and analyzing the audio file into dialogue text information; (2) respectively gathering the speaking contents of the two parties of the conversation into two lists; (3) and (4) judging the call robbing situation according to the comparison of the speaking start time and the speaking end time of the two parties of the conversation. The method of the invention converts the audio file into text information, and can quickly, conveniently and accurately judge the call robbing situation of the two parties in the conversation in the audio by comparing the speaking time nodes of the two parties.

Description

Method for analyzing call robbing of two conversation parties in audio

Technical Field

The invention relates to the technical field of dialogue analysis, in particular to a method for analyzing the call robbing of two dialogue parties in audio.

Background

With the need of the client for analyzing the two parties of the conversation in the audio, whether the two parties of the conversation in the audio are robbed or not needs to be analyzed and judged, and the existing manual analysis scheme is not efficient, but easy to leak.

Disclosure of Invention

Aiming at the defects of the existing scheme, the invention discloses a method for analyzing the call robbing of two conversation parties in audio.

The technical scheme of the invention is as follows:

a method for analyzing the double-party call robbing of a conversation in audio is characterized by comprising the following steps: (1) acquiring audio files of two-party conversation, and analyzing the audio files into conversation text information through asr preprocessing; (2) respectively gathering the talking contents of the two parties A and B in the dialogue text information into an a list and a B list; (3) traversing the b list with each dialog block of the a list; (4) judging whether the speaking ending time of a [ i ] block of speaking content in the a list is smaller than the speaking starting time of b [ i ] block of speaking content in the b list; (5) if the judgment in the step (4) is positive, the judgment is finished if the speech is not robbed, and a [ i +1] blocks of speech contents in the a list are obtained;

(6) if the judgment in the step (4) is negative, then judging whether the ending time of the a [ i ] block of speaking content is greater than the starting time of the b [ i ] block of speaking content, and the starting time of the a [ i ] block of speaking content is less than the ending time of the b [ i ] block of speaking content;

(7) if the judgment in the step (6) is negative, the judgment is that no robbery exists, the judgment is finished, and a [ i +1] block speaking content in the a list is obtained;

(8) if the judgment in the step (6) is yes, then judging whether the starting time of the a [ i ] block of speaking content is less than the starting time of the b [ i ] block of speaking content;

(9) if the judgment in the step (8) is yes, the call robbing situation is that when A speaks, B talks, the judgment is finished, and a [ i +1] block speaking content in the a list is obtained;

(10) if the judgment in the step (8) is negative, then judging whether the starting time of b [ i ] block of speaking content is less than the starting time of a [ i ] block of speaking content;

(11) if the judgment in the step (10) is yes, the call robbing situation is that A carries out call robbing when B speaks, the judgment is finished, and a [ i +1] block speaking content in the a list is obtained;

(12) if the acquisition of a [ i +1] block of the speaking content fails, namely the traversal of the list a to the list b is completed, the dialogue analysis is finished.

Preferably, the dialog text information includes a content of a speech, a start time of the speech, and an end time of the speech.

The invention has the beneficial effects that:

the method of the invention converts the audio file into text information, and can quickly, conveniently and accurately judge the call robbing situation of the two parties in the conversation in the audio by comparing the speaking time nodes of the two parties.

Drawings

FIG. 1 is a flow chart of the method of the present invention;

FIG. 2 is a diagram illustrating the conversion of an audio file into text followed by call grabbing in the embodiment.

Detailed Description

For further understanding of the present invention, the present invention will be described in detail with reference to examples, which are provided for illustration of the present invention but are not intended to limit the scope of the present invention.

As shown in fig. 1, the present embodiment relates to a method for analyzing the double-party robbing of a conversation in audio, which is characterized in that the method comprises the following steps: (1) acquiring audio files of two-party conversation, and analyzing the audio files into conversation text information through asr preprocessing; (2) respectively gathering the talking contents of the two parties A and B in the dialogue text information into an a list and a B list; (3) traversing the b list with each dialog block of the a list; (4) judging whether the speaking ending time of a [ i ] block of speaking content in the a list is smaller than the speaking starting time of b [ i ] block of speaking content in the b list; (5) if the judgment in the step (4) is positive, the judgment is finished if the speech is not robbed, and a [ i +1] blocks of speech contents in the a list are obtained;

As shown in fig. 2, the dialog text information includes the contents of the utterance, the utterance start time, and the utterance end time.

The present invention and its embodiments have been described above schematically, without limitation, and the embodiments of the present invention are shown in the drawings, and the actual structures are not limited thereto. Therefore, those skilled in the art should understand that they can easily and effectively design and modify the structure and embodiments of the present invention without departing from the spirit and scope of the present invention.

Claims

1. A method for analyzing the double-party call robbing of a conversation in audio is characterized by comprising the following steps: (1) acquiring audio files of two-party conversation, and analyzing the audio files into conversation text information through asr preprocessing; (2) respectively gathering the talking contents of the two parties A and B in the dialogue text information into an a list and a B list; (3) traversing the b list with each dialog block of the a list; (4) judging whether the speaking ending time of a [ i ] block of speaking content in the a list is smaller than the speaking starting time of b [ i ] block of speaking content in the b list; (5) if the judgment in the step (4) is positive, the judgment is finished if the speech is not robbed, and a [ i +1] blocks of speech contents in the a list are obtained;

2. The method as claimed in claim 1, wherein the dialog text information includes the content of the utterance, the start time of the utterance, and the end time of the utterance.