CN105895116A - Dual track voice break-in and interruption analysis method - Google Patents


Info

Publication number
CN105895116A
Authority
CN
China
Prior art keywords
endpoint
voice
time
interruption
analysis method
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201610209686.4A
Other languages
Chinese (zh)
Other versions
CN105895116B (en)
Inventor
刘郁松
何国涛
李全忠
蒲瑶
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Puqiang times (Zhuhai Hengqin) Information Technology Co., Ltd
Original Assignee
Universal Information Technology (beijing) Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Universal Information Technology (beijing) Co Ltd filed Critical Universal Information Technology (beijing) Co Ltd
Priority to CN201610209686.4A priority Critical patent/CN105895116B/en
Publication of CN105895116A publication Critical patent/CN105895116A/en
Application granted granted Critical
Publication of CN105895116B publication Critical patent/CN105895116B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical


Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78 Detection of presence or absence of voice signals
    • G10L25/87 Detection of discrete points within a voice signal
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Telephonic Communication Services (AREA)

Abstract

The present invention discloses a dual-track voice break-in and interruption analysis method. The method comprises the steps of: performing effective-voice endpoint detection on the recording streams of the two channels using voice activity detection, finding every span of speech within the whole recording; according to the effective-voice endpoints of the two channel recordings, normalizing the endpoint times of each segment and laying all endpoints out on a time axis, with every endpoint described uniformly by three attributes (time point, channel, and endpoint type); and traversing all time points from front to back, analyzing whether each endpoint is a begin-position endpoint or an end-position endpoint. The method can promptly capture break-ins and interruptions whenever they occur between two or more roles and perform subsequent processing, avoiding these impolite conversational patterns and providing a high-quality guarantee for customer service.

Description

A dual-track voice break-in and interruption analysis method
Technical field
The invention belongs to the technical field of customer-service communication, and in particular relates to a dual-track voice break-in and interruption analysis method.
Background technology
Voice customer service refers to customer service conducted mainly by telephone. During a service call, break-ins and interruptions often occur between two or more roles. A break-in occurs between two roles when one role has just finished speaking and the other starts immediately, with no interval in between; in conversation this is an impolite manner that the other party may perceive as rude or perfunctory. An interruption occurs when one role is still speaking and the other cuts in directly to state their own opinion, which is an even less polite conversational manner. Break-ins and interruptions seriously affect the quality of customer service.
Summary of the invention
The object of the present invention is to provide a dual-track voice break-in and interruption analysis method, intended to solve the problem of break-ins and interruptions occurring during customer service.
The present invention is achieved as follows: the break-in and interruption analysis method for dual-track voice comprises the following steps:
Step 1: Using voice activity detection, perform effective-voice endpoint detection on the recording streams of the two channels, finding from which second to which second speech occurs in the whole recording;
Step 2: According to the effective-voice endpoints of the two channel recordings, normalize the endpoint times of each segment, describe every endpoint uniformly by three attributes (time point, channel, and endpoint type), and lay all endpoints out on a time axis;
Step 3: For two adjacent endpoints, if the earlier endpoint is a begin endpoint of role A's speech and the later endpoint is an end endpoint of role B's speech, this is an interruption.
Step 4: For two adjacent endpoints, if the earlier endpoint is an end endpoint of role A's speech, the later endpoint is a begin endpoint of role B's speech, and the time difference between the two endpoints is less than 200 ms, this is a break-in.
The present invention also adopts the following technical measures:
The effective-voice endpoints in Step 1 carry three attributes: start time, end time, and speaker.
In Step 2, the endpoint types are begin and end.
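The three endpoint attributes (time point, channel, endpoint type) map naturally onto a small data structure. Below is a minimal Python sketch of the flattening in Step 2, assuming the per-channel speech segments have already been produced by some VAD front end; the function name and tuple layout are illustrative assumptions, not the patented implementation.

```python
# Sketch of Step 2: turn per-channel VAD segments into one time-ordered
# endpoint list. Segments are (start_ms, end_ms) pairs; every endpoint
# carries the three attributes: time point, channel, endpoint type.
def flatten_endpoints(segments_a, segments_b):
    points = []
    for channel, segments in (("A", segments_a), ("B", segments_b)):
        for start_ms, end_ms in segments:
            points.append((start_ms, channel, "begin"))
            points.append((end_ms, channel, "end"))
    # Sort by time; at equal times put "end" before "begin" so a
    # zero-gap hand-over still reads as end-then-begin.
    points.sort(key=lambda p: (p[0], p[2] == "begin"))
    return points
```

With role A speaking over 0-1000 ms and role B over 1100-2000 ms, this yields the four endpoints in strict time order, ready for the traversal described next.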
The analysis of endpoint types comprises the following steps:
Step 1: Check the endpoint type;
Step 2: If it is a begin endpoint, check whether the top of the stack holds a begin endpoint;
Step 3: If the top of the stack holds a begin endpoint, check whether its role is the same as the role of the current begin endpoint;
Step 4: If the roles are the same, the data is corrupt: one person cannot start speaking again without having finished;
Step 5: If the roles are different, an interruption has occurred; record the interruption and pop the top of the stack;
Step 6: If the top of the stack holds no begin endpoint, push the begin endpoint onto the stack, advance the endpoint index by 1, and continue the loop;
Step 7: If it is an end endpoint, check whether the top of the stack holds a begin endpoint;
Step 8: If the top of the stack holds a begin endpoint, check whether its role is the same as the role of the current end endpoint;
Step 9: If the roles are the same, this is a normal endpoint with no interruption; record the time point of this end position;
Step 10: If the roles are different, the data is erroneous: an interruption occurred earlier but was not recorded;
Step 11: If the top of the stack holds no begin endpoint, check whether the end position of the previous endpoint is within 200 ms of the start position; if so, it is a break-in; record the time at which the break-in occurred and pop the top of the stack;
Step 12: Record all break-in and interruption information, where each break-in or interruption event comprises a start time, an end time, a type, and a direction.
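The twelve steps above can be sketched as a single sweep over the sorted endpoint list. The patent's step description is ambiguous in places (for instance, what the stack holds when the 200 ms test of Step 11 fires), so the following is one interpretation under stated assumptions, not the patented implementation; the function name, tuple layout, and event shape are invented for illustration, and for brevity each event records only the moment it begins.

```python
# One interpretation of the Steps 1-12 sweep. Endpoints are
# (time_ms, channel, kind) tuples, already sorted in time,
# with kind either "begin" or "end".
def analyze(points, threshold_ms=200):
    events = []
    open_begins = []   # stack of channels whose "begin" is still open
    last_end = None    # (time_ms, channel) of the latest "end" endpoint
    for time_ms, channel, kind in points:
        if kind == "begin":
            if open_begins and open_begins[-1] == channel:
                # Same role begins twice without ending: corrupt data (Step 4).
                events.append(("data_error", time_ms, channel))
            elif open_begins:
                # The other role is still mid-speech: interruption (Step 5).
                events.append(("interruption", time_ms, channel))
            elif (last_end is not None and last_end[1] != channel
                  and time_ms - last_end[0] < threshold_ms):
                # Hand-over gap under the threshold: break-in (Step 11).
                events.append(("break_in", time_ms, channel))
            open_begins.append(channel)
        else:  # "end"
            if channel in open_begins:
                open_begins.remove(channel)   # normal close (Step 9)
            last_end = (time_ms, channel)
    return events
```

On two adjacent segments with a 100 ms hand-over gap this reports a break-in; on overlapped segments it reports an interruption; gaps of 200 ms or more produce no event.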
Advantages and positive effects of the present invention: this dual-track voice break-in and interruption analysis method can promptly capture break-ins and interruptions whenever they occur between two or more roles and perform subsequent processing, avoiding the impolite conversational patterns of breaking in and interrupting and providing a high-quality guarantee for customer service.
Brief description of the drawings
Fig. 1 is a flow diagram of the dual-track voice break-in and interruption analysis method provided by an embodiment of the present invention;
Fig. 2 is a flow diagram of the endpoint-type analysis method provided by an embodiment of the present invention.
Detailed description of the invention
To make the purpose, technical scheme, and advantages of the present invention clearer, the present invention is further elaborated below in conjunction with embodiments. It should be understood that the specific embodiments described herein serve only to explain the present invention and are not intended to limit it.
The application principle of the present invention is further described below in conjunction with Figs. 1 and 2 and a specific embodiment.
The break-in and interruption analysis method for dual-track voice comprises the following steps:
S101: Using voice activity detection, perform effective-voice endpoint detection on the recording streams of the two channels, finding from which second to which second speech occurs in the whole recording;
S102: According to the effective-voice endpoints of the two channel recordings, normalize the endpoint times of each segment, describe every endpoint uniformly by three attributes (time point, channel, and endpoint type), and lay all endpoints out on the time axis;
S103: For two adjacent endpoints, if the earlier endpoint is a begin endpoint of role A's speech and the later endpoint is an end endpoint of role B's speech, this is an interruption.
S104: For two adjacent endpoints, if the earlier endpoint is an end endpoint of role A's speech, the later endpoint is a begin endpoint of role B's speech, and the time difference between the two endpoints is less than 200 ms, this is a break-in.
The effective-voice endpoints in S101 carry three attributes: start time, end time, and speaker.
In S102, the endpoint types are begin and end.
The analysis of endpoint types comprises the following steps:
S201: Check the endpoint type;
S202: If it is a begin endpoint, check whether the top of the stack holds a begin endpoint;
S203: If the top of the stack holds a begin endpoint, check whether its role is the same as the role of the current begin endpoint;
S204: If the roles are the same, the data is corrupt: one person cannot start speaking again without having finished;
S205: If the roles are different, an interruption has occurred; record the interruption and pop the top of the stack;
S206: If the top of the stack holds no begin endpoint, push the begin endpoint onto the stack, advance the endpoint index by 1, and continue the loop;
S207: If it is an end endpoint, check whether the top of the stack holds a begin endpoint;
S208: If the top of the stack holds a begin endpoint, check whether its role is the same as the role of the current end endpoint;
S209: If the roles are the same, this is a normal endpoint with no interruption; record the time point of this end position;
S210: If the roles are different, the data is erroneous: an interruption occurred earlier but was not recorded;
S211: If the top of the stack holds no begin endpoint, check whether the end position of the previous endpoint is within 200 ms of the start position; if so, it is a break-in; record the time at which the break-in occurred and pop the top of the stack;
S212: Record all break-in and interruption information, where each break-in or interruption event comprises a start time, an end time, a type (break-in or interruption), and a direction (who broke in on, or interrupted, whom).
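One possible shape for a single record produced by S212; the patent names only the four attributes, so the field names and values below are illustrative assumptions for how such a record might be stored.

```python
# Illustrative shape of one recorded event from S212. Field names are
# assumptions; the patent lists only the four attributes themselves.
event = {
    "start_ms": 1100,        # when the break-in/interruption begins
    "end_ms": 1700,          # when it ends
    "type": "break_in",      # "break_in" or "interruption"
    "direction": "B->A",     # who broke in on, or interrupted, whom
}
```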
This dual-track voice break-in and interruption analysis method can promptly capture break-ins and interruptions whenever they occur between two or more roles and perform subsequent processing, avoiding these impolite conversational patterns and providing a high-quality guarantee for customer service.
The foregoing is only a preferred embodiment of the present invention and is not intended to limit the present invention; any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention shall fall within the scope of protection of the present invention.

Claims (4)

1. A dual-track voice break-in and interruption analysis method, characterized in that the break-in and interruption analysis method for dual-track voice comprises the following steps:
Step 1: Using voice activity detection, perform effective-voice endpoint detection on the recording streams of the two channels, finding from which second to which second speech occurs in the whole recording;
Step 2: According to the effective-voice endpoints of the two channel recordings, normalize the endpoint times of each segment, describe every endpoint uniformly by three attributes (time point, channel, and endpoint type), and lay all endpoints out on a time axis;
Step 3: For two adjacent endpoints, if the earlier endpoint is a begin endpoint of role A's speech and the later endpoint is an end endpoint of role B's speech, this is an interruption.
Step 4: For two adjacent endpoints, if the earlier endpoint is an end endpoint of role A's speech, the later endpoint is a begin endpoint of role B's speech, and the time difference between the two endpoints is less than 200 ms, this is a break-in.
2. The dual-track voice break-in and interruption analysis method of claim 1, characterized in that the effective-voice endpoints in Step 1 carry three attributes: start time, end time, and speaker.
3. The dual-track voice break-in and interruption analysis method of claim 1, characterized in that in Step 2, the endpoint types are begin and end.
4. The dual-track voice break-in and interruption analysis method of claim 1, characterized in that the analysis of endpoint types comprises the following steps:
Step 1: Check the endpoint type;
Step 2: If it is a begin endpoint, check whether the top of the stack holds a begin endpoint;
Step 3: If the top of the stack holds a begin endpoint, check whether its role is the same as the role of the current begin endpoint;
Step 4: If the roles are the same, the data is corrupt: one person cannot start speaking again without having finished;
Step 5: If the roles are different, an interruption has occurred; record the interruption and pop the top of the stack;
Step 6: If the top of the stack holds no begin endpoint, push the begin endpoint onto the stack, advance the endpoint index by 1, and continue the loop;
Step 7: If it is an end endpoint, check whether the top of the stack holds a begin endpoint;
Step 8: If the top of the stack holds a begin endpoint, check whether its role is the same as the role of the current end endpoint;
Step 9: If the roles are the same, this is a normal endpoint with no interruption; record the time point of this end position;
Step 10: If the roles are different, the data is erroneous: an interruption occurred earlier but was not recorded;
Step 11: If the top of the stack holds no begin endpoint, check whether the end position of the previous endpoint is within 200 ms of the start position; if so, it is a break-in; record the time at which the break-in occurred and pop the top of the stack;
Step 12: Record all break-in and interruption information, where each break-in or interruption event comprises a start time, an end time, a type, and a direction.
CN201610209686.4A 2016-04-06 2016-04-06 Double-track voice break-in analysis method Active CN105895116B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610209686.4A CN105895116B (en) 2016-04-06 2016-04-06 Double-track voice break-in analysis method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610209686.4A CN105895116B (en) 2016-04-06 2016-04-06 Double-track voice break-in analysis method

Publications (2)

Publication Number Publication Date
CN105895116A true CN105895116A (en) 2016-08-24
CN105895116B CN105895116B (en) 2020-01-03

Family

ID=57012984

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610209686.4A Active CN105895116B (en) 2016-04-06 2016-04-06 Double-track voice break-in analysis method

Country Status (1)

Country Link
CN (1) CN105895116B (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109600526A (en) * 2019-01-08 2019-04-09 上海上湖信息技术有限公司 Customer service quality determining method and device, readable storage medium storing program for executing
CN111147669A (en) * 2019-12-30 2020-05-12 科讯嘉联信息技术有限公司 Full real-time automatic service quality inspection system and method
CN112511698A (en) * 2020-12-03 2021-03-16 普强时代(珠海横琴)信息技术有限公司 Real-time call analysis method based on universal boundary detection
CN113066496A (en) * 2021-03-17 2021-07-02 浙江百应科技有限公司 Method for analyzing call robbing of two conversation parties in audio

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2001265368A (en) * 2000-03-17 2001-09-28 Omron Corp Voice recognition device and recognized object detecting method
CN102522081A (en) * 2011-12-29 2012-06-27 北京百度网讯科技有限公司 Method for detecting speech endpoints and system
CN103811009A (en) * 2014-03-13 2014-05-21 华东理工大学 Smart phone customer service system based on speech analysis
CN104052610A (en) * 2014-05-19 2014-09-17 国家电网公司 Informatization intelligent conference dispatching management device and using method
WO2015001492A1 (en) * 2013-07-02 2015-01-08 Family Systems, Limited Systems and methods for improving audio conferencing services
US20150100316A1 (en) * 2011-09-01 2015-04-09 At&T Intellectual Property I, L.P. System and method for advanced turn-taking for interactive spoken dialog systems
US20150255087A1 (en) * 2014-03-07 2015-09-10 Fujitsu Limited Voice processing device, voice processing method, and computer-readable recording medium storing voice processing program

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2001265368A (en) * 2000-03-17 2001-09-28 Omron Corp Voice recognition device and recognized object detecting method
US20150100316A1 (en) * 2011-09-01 2015-04-09 At&T Intellectual Property I, L.P. System and method for advanced turn-taking for interactive spoken dialog systems
CN102522081A (en) * 2011-12-29 2012-06-27 北京百度网讯科技有限公司 Method for detecting speech endpoints and system
WO2015001492A1 (en) * 2013-07-02 2015-01-08 Family Systems, Limited Systems and methods for improving audio conferencing services
US20150255087A1 (en) * 2014-03-07 2015-09-10 Fujitsu Limited Voice processing device, voice processing method, and computer-readable recording medium storing voice processing program
CN103811009A (en) * 2014-03-13 2014-05-21 华东理工大学 Smart phone customer service system based on speech analysis
CN104052610A (en) * 2014-05-19 2014-09-17 国家电网公司 Informatization intelligent conference dispatching management device and using method

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109600526A (en) * 2019-01-08 2019-04-09 上海上湖信息技术有限公司 Customer service quality determining method and device, readable storage medium storing program for executing
CN111147669A (en) * 2019-12-30 2020-05-12 科讯嘉联信息技术有限公司 Full real-time automatic service quality inspection system and method
CN112511698A (en) * 2020-12-03 2021-03-16 普强时代(珠海横琴)信息技术有限公司 Real-time call analysis method based on universal boundary detection
CN113066496A (en) * 2021-03-17 2021-07-02 浙江百应科技有限公司 Method for analyzing call robbing of two conversation parties in audio

Also Published As

Publication number Publication date
CN105895116B (en) 2020-01-03

Similar Documents

Publication Publication Date Title
US10701482B2 (en) Recording meeting audio via multiple individual smartphones
CN105895116A (en) Dual track voice break-in and interruption analysis method
US9258425B2 (en) Method and system for speaker verification
US10498886B2 (en) Dynamically switching communications to text interactions
US20220303502A1 (en) Leveraging a network of microphones for inferring room location and speaker identity for more accurate transcriptions and semantic context across meetings
KR102349985B1 (en) Detect and suppress voice queries
US11570217B2 (en) Switch controller for separating multiple portions of call
US8588111B1 (en) System and method for passive communication recording
US8086461B2 (en) System and method for tracking persons of interest via voiceprint
US20150154961A1 (en) Methods and apparatus for identifying fraudulent callers
US20100208605A1 (en) Method and device for processing network time delay characteristics
CA3001839C (en) Call detail record analysis to identify fraudulent activity and fraud detection in interactive voice response systems
US20090028310A1 (en) Automatic contextual media recording and processing utilizing speech analytics
US20150310863A1 (en) Method and apparatus for speaker diarization
EP3158719A1 (en) Method and system for filtering undesirable incoming telephone calls
EP3504861B1 (en) Audio transmission with compensation for speech detection period duration
US10652396B2 (en) Stream server that modifies a stream according to detected characteristics
DE602007008602D1 (en) SELECTION OF ACCESS PROCEDURES DURING THE PERFORMANCE OF HANDOVERS IN A MOBILE COMMUNICATION SYSTEM
US20210092223A1 (en) Robocall detection using acoustic profiling
US20150032515A1 (en) Quality Inspection Processing Method and Device
CN101202040A (en) An efficient voice activity detactor to detect fixed power signals
US8437266B2 (en) Flow through call control
CN103002108B (en) Call recording processing method and system and mobile terminal
US9257117B2 (en) Speech analytics with adaptive filtering
WO2018038989A1 (en) Audio compensation techniques for network outages

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20200309

Address after: 519000 room 105-58115, No. 6, Baohua Road, Hengqin New District, Zhuhai City, Guangdong Province (centralized office area)

Patentee after: Puqiang times (Zhuhai Hengqin) Information Technology Co., Ltd

Address before: 100085 cloud base 4 / F, tower C, Software Park Plaza, building 4, No. 8, Dongbei Wangxi Road, Haidian District, Beijing

Patentee before: Puqiang Information Technology (Beijing) Co., Ltd.

TR01 Transfer of patent right