US20080154579A1 - Method of analyzing conversational transcripts - Google Patents
Method of analyzing conversational transcripts Download PDFInfo
- Publication number
- US20080154579A1 US20080154579A1 US11/614,200 US61420006A US2008154579A1 US 20080154579 A1 US20080154579 A1 US 20080154579A1 US 61420006 A US61420006 A US 61420006A US 2008154579 A1 US2008154579 A1 US 2008154579A1
- Authority
- US
- United States
- Prior art keywords
- transcripts
- call
- conversational
- sequences
- corpus
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 238000000034 method Methods 0.000 title claims description 42
- KFZUDNZQQCWGKF-UHFFFAOYSA-M sodium;4-methylbenzenesulfinate Chemical compound [Na+].CC1=CC=C(S([O-])=O)C=C1 KFZUDNZQQCWGKF-UHFFFAOYSA-M 0.000 description 14
- 238000005065 mining Methods 0.000 description 8
- 230000000699 topical effect Effects 0.000 description 6
- 238000013459 approach Methods 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 230000006978 adaptation Effects 0.000 description 1
- 230000003416 augmentation Effects 0.000 description 1
- 230000001427 coherent effect Effects 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 230000007423 decrease Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 230000003203 everyday effect Effects 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 238000003064 k means clustering Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/60—Information retrieval; Database structures therefor; File system structures therefor of audio data
- G06F16/68—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/683—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
- G06F16/685—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using automatically derived transcript of audio data, e.g. lyrics
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
Definitions
- This invention relates to a method of analyzing conversational transcripts, preferably textual conversational transcripts, and in particular to analyzing and processing of unstructured data in such conversational transcripts.
- Call centers are centralized offices for the purpose of receiving and transmitting a large volume of requests by various communication channels like telephone, emails, and instant messages.
- a call centre is operated by a company to administer various types of support or information inquiries from customers.
- This call center model is becoming popular because it allows companies to be in direct touch with their customers.
- a typical call center agent handles tens to hundreds of calls per day depending on the complexity of the issues that the agent is required to addresses.
- relatively large volumes of data are produced everyday in the form of call transcripts. This is in addition to the data produced as a part of the process which include e-mail, instant messages (IM), reports from the agent, customer satisfaction surveys etc.
- the conversation (call) transcripts includes transcripts of conversations through various media like audio calls and IM/web-chat.
- a typical call that an agent would receive and respond to at a call center consists of various phases.
- a call to a call centre agent broadly consists of the following three phases: introduction part between the agent and the customer, seeking information about the problem, and providing solution to the problem of the customer.
- Agents in call centers typically use keyword search over manually authored knowledge bases to obtain useful information that could help them to solve the problem faced by the customer. The utility of such a system is limited by the smartness of the agent to enter the right keywords.
- Knowledge authoring, or the act of populating the knowledge base is done manually. Because they are manually authored, these knowledge bases can not quickly adapt to the various kinds of new problems that arise during the course of time. Moreover, manual augmentation of such knowledge bases would consume tremendous manpower and may not be exhaustive. Identification of undocumented and effective sequences of steps would be impossible in the case of manual authoring of knowledge bases.
- the call center agent is configured to receive input information from first user (customer, etc.) via a voice call.
- conversational transcripts are created from the information received from the first user.
- Once the conversational transcripts have been created at least one defined situation from a list of defined situations is selected. The selected situation is identified in the conversational transcripts.
- a set of procedural sequences is identified by comparing the at least one identified situation in the conversational transcript with knowledge derived from a corpus of historical conversational transcripts, which are preferably available in a corpus.
- the set of procedural sequence is provided to the first user.
- the set of procedural sequence may be provided to the second user (call center agent) and the second user can provide the procedural sequence to the first user.
- a second aspect of the invention is a method for creating a document of procedural sequences from information received via an input call.
- the call center agent is configured to receive input information from first user (customer, etc.) via a voice call.
- conversational transcripts are created from the information received from the first user.
- at least one defined situation from a list of defined situations is selected.
- the selected situation is identified in the conversational transcripts.
- a set of procedural sequences is identified by comparing the at least one identified situation in the conversational transcript with knowledge derived from a corpus of historical conversational transcripts, which are preferably available in a corpus.
- the second user is prompted to create a document comprising a set of procedural sequences and add the set of procedural sequences into a repository.
- the document created is authored by the second user and such a document that is added into the repository can be advantageously used in identifying situations and providing procedural sequences to the identified sequences from the input call of the first user.
- FIG. 1 illustrates an exemplary embodiment of a method of analyzing call transcripts in accordance with this invention.
- FIG. 2 illustrates an exemplary embodiment of analyzing conversational transcripts and generating documents.
- FIG. 3 illustrates an exemplary embodiment an analyzing conversational transcripts.
- FIG. 4 illustrates an exemplary embodiment of call processing using a clustering algorithm.
- FIG. 5 illustrates an exemplary embodiment an algorithm used in sequence mining on call corpora.
- FIG. 6 illustrates an exemplary embodiment of authoring new knowledge and updating the knowledge base.
- call transcripts Collections and analysis of call transcripts are typically very diverse in the kind of predicament they address.
- the call transcripts are clustered initially to arrive at topical collections, which are collections of calls addressing a specific issue, where such topical collections are more homogeneous in nature and can be advantageously used for analysis of the call transcripts.
- topical collections of calls is further split into two subsets, that of second user (hereinafter referred to as call center agent or simply agent) sentences and first user (also referred to as a customer) sentences.
- the two subsets are further clustered to build clusters which possibly containing sentences representative of similar procedural steps, where each cluster thus describes a sub-procedural step.
- An entropy-based measure to quantify the quality of a sub-procedural cluster is defined, and the entropy is used to measure the cluster quality and to refine clusters until good quality clusters are found. Once these clusters are obtained, each call in the topical collection can now be represented as a sequence of such clusters.
- the collection of calls, represented as sequences, are subjected to frequent pattern mining to discover frequent procedural sequences for a topical collection. Representative distinctive procedural sequences are determined from these frequent sequences, using for example, techniques such as leader clustering algorithms.
- FIG. 1 illustrates an exemplary embodiment of a method 100 which is the integrated approach of analyzing call transcripts.
- the method involves performing clustering of the corpus of call transcripts in order to group related text segments. Each group of segments discovered in 120 is represented by a representative syntax in 130 . These groups of segments are also referred to as the sub-procedure text segments (STPS).
- STPS sub-procedure text segments
- the calls are represented as a sequence of STPS.
- mining is performed on the sequence to determine the frequent sequences across the calls.
- a clustering algorithm such as CAARD is used to get distinct and long sequences from the sequences that are mined and the results are used in subsequent embodiments.
- FIG. 4 illustrates, in more details, an exemplary embodiment of a method 400 which is the integrated approach of analyzing call transcripts.
- the method involves performing clustering of the corpus of call transcripts in order to find topic-specific clusters S i of calls received.
- each cluster S i collects the set of agent (second users) sentences A i and the Customer sentences (first users) C i .
- STPS sub-procedure text segments
- the entropy which defines the quality of the STPS is computed and in 445 a check is made on the entropy value, which gives the quality of the STPS.
- the clustering parameters are changed and the 430 is repeated until the entropy value of the STPS computed is good, i.e., above a predefined threshold.
- the calls are represented as a sequence of cluster labels.
- mining is performed on the sequence to determine the frequent sequences across the calls S i .
- a clustering algorithm such as CAARD is used to get distinct sequences from the sequences that are mined and the results are output to the first user.
- the output from the algorithm may be provided directly to the customer or via the agent to the customer.
- While clustering the textual segments in the corpus of the historical conversational transcripts the historical transcripts are organized into groups and conversational transcripts in accordance with predefined criteria, and the groups of conversational transcripts to obtain groups of related textual segments. While clustering groups of conversational transcripts to obtain groups of related documents, at least two set of documents from the groups of documents corresponding each belonging to the customer and the agent are generated. Typically, a K-means clustering is performed on the corpus to obtain groups of conversations on a single topic, to extract coherent and meaningful SPTS from the conversation transcripts. For each such topical cluster, the set of agent sentences and customer sentences are collected separately, and they are clustered adaptively in accordance with 145 , until good quality SPTS are obtained.
- each SPTS cluster is represented by sentences that characterize the cluster. These characteristic sentences can be obtained using one of text summarization algorithms.
- the descriptive and discriminative words from the sentences in the cluster are used to describe the cluster. Descriptive words are those that contribute most to the average similarity among the calls in the cluster. Discriminative words are those that are more prevalent in the calls in the cluster as compared to the rest of the calls.
- FIG. 2 illustrates an exemplary embodiment of a method 200 of analyzing call transcripts in accordance with the claimed invention.
- an input call is received from a customer by an agent.
- the call for example an input voice call, call via an instant messenger, call via email or other means known to a person skilled in the art, preferably a voice call
- conversational transcripts are created from the information received from the call of the customer.
- at least one defined situation is selected from a list of defined situations, where each situations is uniquely defined by a set of parameters.
- at least one situation is selected from the set of defined situations that are derived from the conversational transcript. This selection can be based on a threshold value which has been predefined.
- a set of procedural sequences is identified by comparing the at least one identified situation in the conversational transcript with knowledge derived from a corpus in 100 , for example a set of distinct and long frequently occurring subsequence of STPS as in 160 .
- the customer is provided with the identified procedural sequences, either directly or via the agent.
- FIG. 3 illustrates an exemplary embodiment of a method 300 of analyzing the call transcripts in order to determine and author new information to incorporate in an already existing knowledge-base.
- conversational transcript of the received call is created. From the conversational transcript, at least one defined situation in the list of defined situations is identified in 330 .
- the identified situation is compared with the knowledge repository derived from the historical conversation as in 100 . Once identified situation is not found in knowledge repository, a new document comprising a set of procedural sequences is created and added to the knowledge repository.
- a typical call consists of a sequence of information exchanges and assumes that there exist some sentences in a conversation that reflect sub-procedures steps.
- the goodness measure (hereinafter referred to as goodness) of the SPTS cluster can be partially judged by the frequency of calls that contains sentences from each SPTS cluster, and the number of calls the sentences in each SPTS cluster are scattered into. This intuition is quantitatively captured by defining entropy of a call with respect to the SPTS clusters.
- Call 1 be represented by the sequence ⁇ C 2 , C 1 , C 5 , C 6 , C 4 > and Call 2 by the sequence ⁇ C 3 , C 5 , C 5 , C 3 , C 5 >.
- Call 1 is more scattered than Call 2 .
- the entropy measure is used to measure the scatter of a call.
- E C (c) represent entropy with respect to a given set C of SPTS clusters.
- the entropy measure is computed using Eq. (1):
- d i is the fraction of the call C in a cluster i.
- E(Call 1 ) 0.6989
- E(Call 2 ) 0.29.
- Entropy is satisfactory measure to compare calls of the same cardinality.
- Call 3 which is ⁇ C 1 , C 2 > and Call 4 which is ⁇ C 1 , C 1 , C 1 , C 1 , C 2 , C 2 , C 2 >, which have the same entropy. Therefore, Call 3 should have a higher score since it is scattered across as many clusters as it can be.
- a Normalized Entropy (NE) is proposed which is a better measure of scatter of the call. NE of a call c with respect to a set C of clusters is defined as
- a quality measure for a given set C of clusters is defined using the normalized entropy of all the calls in the corpus R, as the cardinality weighted average of NE values for calls as defined in Eq. (3):
- a corpus of call collections would contain calls pertaining to diverse issues.
- a clustering of agent sentences and customer sentences of such a corpus would lead to topic-type clusters than SPTS clusters.
- a homogeneous corpus could be expected to give better SPTS clusters than a more diverse collection, based on the assumption that the larger dissimilarities (which are topic-level) have to be eliminated in order to expose the lesser (SPTS) dissimilarities to the clustering algorithm. Therefore, in this invention it is hypothesized that NE of the clusters increases with the homogeneity of the corpus of calls to be clustered.
- ⁇ C 1 , . . . , C n ⁇ be the clusters of sentences corresponding to customer and ⁇ A 1 , . . . , A m ⁇ be that to agents.
- Each call is represented by a sequence of those clusters to which sentences in the call belong to.
- ⁇ START, A 2 , C 1 , A 4 , C 4 , END> represent a call where the first sentence in the call, which is in A 2 , was spoken by agent, second sentence, which is in C 1 , is spoken by customer, and so on.
- the call consists of sentences belonging to As and Cs alternatively as illustrated in FIG. 4 .
- FIG. 4 illustrates an exemplary embodiment 400 of splitting calls received as inputs into agent sentences and customer sentences, clustering these sentences and representing calls as a sequence of clusters. Consecutive sentences by the same person can be combined into a single sentence. Once the distinct sequences of SPTS clusters, these sequences are added to the procedure repository
- an agent prompting application is an online process as illustrated in FIG. 5 , consisting of extraction of procedures from a voice call.
- An ASR Automatic Speech Recognition
- a call transcript to the procedure matcher happens periodically in the call (say, once in 30 seconds or so), where the procedure matcher finds the closest matching procedure(s) from the repository of procedures, as described by the method in accordance with this invention previously, to the call transcript.
- the matched procedure(s) is (are) displayed to the agent. The agent would be able to take clues from the displayed matching procedure(s) to decide on actions to be taken in the remaining part of the call.
- the Knowledge Authoring application is an offline process aimed at capturing any new knowledge which has not been recorded in the procedure DB for possible inclusion into the knowledge base.
- FIG. 3 illustrates an exemplary embodiment of use of extracted procedures in knowledge authoring. This is a semi-supervised process where whether or not the suggested new knowledge is worthy of inclusion into the knowledge base is left to the discretion of the knowledge author.
- the person who is responsible for maintaining the knowledge base could generate a document related to this call, capturing the new knowledge and store the new knowledge in the knowledge base.
- the call transcript is fed into the procedure matcher that finds procedure that match with the call (based on a threshold). If matching procedure for the call is not found, the knowledge author is prompted to possibly include the new knowledge captured into a document format and then updated into the knowledge base.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Theoretical Computer Science (AREA)
- Library & Information Science (AREA)
- Audiology, Speech & Language Pathology (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Artificial Intelligence (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Machine Translation (AREA)
Abstract
Description
- This invention relates to a method of analyzing conversational transcripts, preferably textual conversational transcripts, and in particular to analyzing and processing of unstructured data in such conversational transcripts.
- Call centers are centralized offices for the purpose of receiving and transmitting a large volume of requests by various communication channels like telephone, emails, and instant messages. A call centre is operated by a company to administer various types of support or information inquiries from customers. This call center model is becoming popular because it allows companies to be in direct touch with their customers. In such call centers a typical call center agent handles tens to hundreds of calls per day depending on the complexity of the issues that the agent is required to addresses. However, with advances in the area of speech recognition and the widespread deployment of speech recognition technology in call centers, relatively large volumes of data are produced everyday in the form of call transcripts. This is in addition to the data produced as a part of the process which include e-mail, instant messages (IM), reports from the agent, customer satisfaction surveys etc. In the sequel, the conversation (call) transcripts includes transcripts of conversations through various media like audio calls and IM/web-chat.
- A typical call that an agent would receive and respond to at a call center consists of various phases. For example, a call to a call centre agent broadly consists of the following three phases: introduction part between the agent and the customer, seeking information about the problem, and providing solution to the problem of the customer. Agents in call centers typically use keyword search over manually authored knowledge bases to obtain useful information that could help them to solve the problem faced by the customer. The utility of such a system is limited by the smartness of the agent to enter the right keywords.
- Knowledge authoring, or the act of populating the knowledge base is done manually. Because they are manually authored, these knowledge bases can not quickly adapt to the various kinds of new problems that arise during the course of time. Moreover, manual augmentation of such knowledge bases would consume tremendous manpower and may not be exhaustive. Identification of undocumented and effective sequences of steps would be impossible in the case of manual authoring of knowledge bases.
- Without a way to improve the method of analyzing call transcripts, the promise of this technology may never be fully achieved.
- According to a first aspect of the invention thereof, is described a method for analyzing transcripts of conversation between at least two users. Typically the call center agent is configured to receive input information from first user (customer, etc.) via a voice call. After receiving the call, conversational transcripts are created from the information received from the first user. Once the conversational transcripts have been created at least one defined situation from a list of defined situations is selected. The selected situation is identified in the conversational transcripts. After the situations has been identified, a set of procedural sequences is identified by comparing the at least one identified situation in the conversational transcript with knowledge derived from a corpus of historical conversational transcripts, which are preferably available in a corpus. The set of procedural sequence is provided to the first user. In a further embodiment, the set of procedural sequence may be provided to the second user (call center agent) and the second user can provide the procedural sequence to the first user.
- According to a second aspect of the invention thereof, is a method for creating a document of procedural sequences from information received via an input call. Typically the call center agent is configured to receive input information from first user (customer, etc.) via a voice call. After receiving the call, conversational transcripts are created from the information received from the first user. Once the conversational transcripts have been created, at least one defined situation from a list of defined situations is selected. The selected situation is identified in the conversational transcripts. After the situations have been identified, a set of procedural sequences is identified by comparing the at least one identified situation in the conversational transcript with knowledge derived from a corpus of historical conversational transcripts, which are preferably available in a corpus. On negative determination, i.e., when the identified situation is not found in the historical conversational transcripts, the second user (call center agent) is prompted to create a document comprising a set of procedural sequences and add the set of procedural sequences into a repository. The document created is authored by the second user and such a document that is added into the repository can be advantageously used in identifying situations and providing procedural sequences to the identified sequences from the input call of the first user.
-
FIG. 1 illustrates an exemplary embodiment of a method of analyzing call transcripts in accordance with this invention. -
FIG. 2 illustrates an exemplary embodiment of analyzing conversational transcripts and generating documents. -
FIG. 3 illustrates an exemplary embodiment an analyzing conversational transcripts. -
FIG. 4 illustrates an exemplary embodiment of call processing using a clustering algorithm. -
FIG. 5 illustrates an exemplary embodiment an algorithm used in sequence mining on call corpora. -
FIG. 6 illustrates an exemplary embodiment of authoring new knowledge and updating the knowledge base. - Collections and analysis of call transcripts are typically very diverse in the kind of predicament they address. The call transcripts are clustered initially to arrive at topical collections, which are collections of calls addressing a specific issue, where such topical collections are more homogeneous in nature and can be advantageously used for analysis of the call transcripts. Each such topical collection of calls is further split into two subsets, that of second user (hereinafter referred to as call center agent or simply agent) sentences and first user (also referred to as a customer) sentences. The two subsets are further clustered to build clusters which possibly containing sentences representative of similar procedural steps, where each cluster thus describes a sub-procedural step. An entropy-based measure to quantify the quality of a sub-procedural cluster is defined, and the entropy is used to measure the cluster quality and to refine clusters until good quality clusters are found. Once these clusters are obtained, each call in the topical collection can now be represented as a sequence of such clusters. The collection of calls, represented as sequences, are subjected to frequent pattern mining to discover frequent procedural sequences for a topical collection. Representative distinctive procedural sequences are determined from these frequent sequences, using for example, techniques such as leader clustering algorithms.
-
FIG. 1 illustrates an exemplary embodiment of amethod 100 which is the integrated approach of analyzing call transcripts. In 120, the method involves performing clustering of the corpus of call transcripts in order to group related text segments. Each group of segments discovered in 120 is represented by a representative syntax in 130. These groups of segments are also referred to as the sub-procedure text segments (STPS). In 140, the calls are represented as a sequence of STPS. Once the calls have been represented as a sequence of STPS, in 150 mining is performed on the sequence to determine the frequent sequences across the calls. After this, in 160 a clustering algorithm such as CAARD is used to get distinct and long sequences from the sequences that are mined and the results are used in subsequent embodiments. - Reference is now made to
FIG. 4 which illustrates, in more details, an exemplary embodiment of amethod 400 which is the integrated approach of analyzing call transcripts. In 410, the method involves performing clustering of the corpus of call transcripts in order to find topic-specific clusters Si of calls received. In 420, each cluster Si collects the set of agent (second users) sentences Ai and the Customer sentences (first users) Ci. In 430, the sentences Ai and Ci p clustered to find the sub-procedure text segments (STPS). In 440 the entropy which defines the quality of the STPS is computed and in 445 a check is made on the entropy value, which gives the quality of the STPS. If the entropy value computed is not good, i.e., does not satisfy a predefined criteria, in 450 the clustering parameters are changed and the 430 is repeated until the entropy value of the STPS computed is good, i.e., above a predefined threshold. In 445, if the entropy value is determined to be good, i.e., above a predefined threshold then, in 460, the calls are represented as a sequence of cluster labels. Once the calls have been represented as a sequence of cluster labels, in 470 mining is performed on the sequence to determine the frequent sequences across the calls Si. After this, in 480 a clustering algorithm such as CAARD is used to get distinct sequences from the sequences that are mined and the results are output to the first user. The output from the algorithm may be provided directly to the customer or via the agent to the customer. - While clustering the textual segments in the corpus of the historical conversational transcripts, the historical transcripts are organized into groups and conversational transcripts in accordance with predefined criteria, and the groups of conversational transcripts to obtain groups of related textual segments. While clustering groups of conversational transcripts to obtain groups of related documents, at least two set of documents from the groups of documents corresponding each belonging to the customer and the agent are generated. Typically, a K-means clustering is performed on the corpus to obtain groups of conversations on a single topic, to extract coherent and meaningful SPTS from the conversation transcripts. For each such topical cluster, the set of agent sentences and customer sentences are collected separately, and they are clustered adaptively in accordance with 145, until good quality SPTS are obtained.
- In a further embodiment, in order to make the output more readable, each SPTS cluster is represented by sentences that characterize the cluster. These characteristic sentences can be obtained using one of text summarization algorithms. The descriptive and discriminative words from the sentences in the cluster are used to describe the cluster. Descriptive words are those that contribute most to the average similarity among the calls in the cluster. Discriminative words are those that are more prevalent in the calls in the cluster as compared to the rest of the calls.
-
FIG. 2 illustrates an exemplary embodiment of amethod 200 of analyzing call transcripts in accordance with the claimed invention. In 210, an input call is received from a customer by an agent. On receiving the call, for example an input voice call, call via an instant messenger, call via email or other means known to a person skilled in the art, preferably a voice call, in 220 conversational transcripts are created from the information received from the call of the customer. After creating conversational transcripts, in 230 at least one defined situation is selected from a list of defined situations, where each situations is uniquely defined by a set of parameters. In 240, from the set of defined situations that are derived from the conversational transcript, at least one situation is selected. This selection can be based on a threshold value which has been predefined. Once a selected situations in the conversational transcripts has been identified in 240, a set of procedural sequences is identified by comparing the at least one identified situation in the conversational transcript with knowledge derived from a corpus in 100, for example a set of distinct and long frequently occurring subsequence of STPS as in 160. Once the set of procedural sequences has been identified, in 170, the customer is provided with the identified procedural sequences, either directly or via the agent. -
FIG. 3 illustrates an exemplary embodiment of amethod 300 of analyzing the call transcripts in order to determine and author new information to incorporate in an already existing knowledge-base. In 320, conversational transcript of the received call is created. From the conversational transcript, at least one defined situation in the list of defined situations is identified in 330. In 340, the identified situation is compared with the knowledge repository derived from the historical conversation as in 100. Once identified situation is not found in knowledge repository, a new document comprising a set of procedural sequences is created and added to the knowledge repository. - A typical call consists of a sequence of information exchanges and assumes that there exist some sentences in a conversation that reflect sub-procedures steps. The goodness measure (hereinafter referred to as goodness) of the SPTS cluster can be partially judged by the frequency of calls that contains sentences from each SPTS cluster, and the number of calls the sentences in each SPTS cluster are scattered into. This intuition is quantitatively captured by defining entropy of a call with respect to the SPTS clusters. Let, Call1 be represented by the sequence <C2, C1, C5, C6, C4> and Call2 by the sequence <C3, C5, C5, C3, C5>. As is obvious from the representation, Call1 is more scattered than Call2. The entropy measure is used to measure the scatter of a call. Let, EC(c) represent entropy with respect to a given set C of SPTS clusters. The entropy measure is computed using Eq. (1):
-
E C(c)=Σi d i log(d i), (1) - where in Eq. (1), di is the fraction of the call C in a cluster i. For example, for the clusters numbered C1 to C6, E(Call1)=0.6989, and E(Call2)=0.29.
- Entropy is satisfactory measure to compare calls of the same cardinality. Consider the case of Call3 which is <C1, C2> and Call4 which is <C1, C1, C1, C1, C2, C2, C2, C2>, which have the same entropy. Therefore, Call3 should have a higher score since it is scattered across as many clusters as it can be. Thus, a Normalized Entropy (NE) is proposed which is a better measure of scatter of the call. NE of a call c with respect to a set C of clusters is defined as
-
NE C(c)=−(Σi d i log(d i))/log(|c|), (2) - where in Eq (2), |c| is the length of the call c. Normalized Entropy would lie between a value between 0 and 1 since log(|c|) is the maximum value that Normalized Entropy takes. Call3 would have an NE value of 1.0 and Call4 would have an NE value of 0.333.
- Thus, a quality measure for a given set C of clusters is defined using the normalized entropy of all the calls in the corpus R, as the cardinality weighted average of NE values for calls as defined in Eq. (3):
-
NE R(C)=(Σc ε R NE C(c)*|c|)/(Σc ε R |c|). (3) - Important properties of the NE measure:
-
- NE increases with the number of clusters because, there are more clusters for a given call to get scattered into.
- For a given number of clusters and an approximately equal number of data elements, NE decreases with the increase in average call length. This is due to the increased probability of two steps (in the call) getting mapped to the same cluster with increase in the length of a call.
NE is parameterized by both the clusters and the corpus. Hence, a comparison can be made between the clustering algorithms by their NE values on various corpora, and different corpora by their NE values when subjected to the same clustering technique. However, the characteristics of the NE function outlined previously leads to two important requirements for NE value based comparison. The clusters to be compared should have roughly same values for the following ratios - Sentences ratio: Number of sentences to be clustered/Number of clusters
- Call length ratio: Average Call Length/Number of clusters
- A corpus of call collections would contain calls pertaining to diverse issues. A clustering of agent sentences and customer sentences of such a corpus, would lead to topic-type clusters than SPTS clusters. A homogeneous corpus could be expected to give better SPTS clusters than a more diverse collection, based on the assumption that the larger dissimilarities (which are topic-level) have to be eliminated in order to expose the lesser (SPTS) dissimilarities to the clustering algorithm. Therefore, in this invention it is hypothesized that NE of the clusters increases with the homogeneity of the corpus of calls to be clustered.
- This describes the technique of extracting useful information from the call sequences by means of frequent sequence mining. Mining sequential patterns from data is well known to a person skilled in the art. Algorithms available in the art can be used to find frequent sequences from the set of call sequences. However, these algorithms result in many redundant sequences. Hence, summarization techniques like CAARD to find long and distinct sequences can be advantageously used. Such a
sequence mining approach 300 has been illustrated inFIG. 3 where in accordance with this invention distinct frequent sequences can be extracted. - Let, {C1, . . . , Cn} be the clusters of sentences corresponding to customer and {A1, . . . , Am} be that to agents. Each call is represented by a sequence of those clusters to which sentences in the call belong to. For example, <START, A2, C1, A4, C4, END> represent a call where the first sentence in the call, which is in A2, was spoken by agent, second sentence, which is in C1, is spoken by customer, and so on. Without any loss of generality it can be assumed that the call consists of sentences belonging to As and Cs alternatively as illustrated in
FIG. 4 .FIG. 4 illustrates anexemplary embodiment 400 of splitting calls received as inputs into agent sentences and customer sentences, clustering these sentences and representing calls as a sequence of clusters. Consecutive sentences by the same person can be combined into a single sentence. Once the distinct sequences of SPTS clusters, these sequences are added to the procedure repository - Consider, an agent prompting application is an online process as illustrated in
FIG. 5 , consisting of extraction of procedures from a voice call. An ASR (Automatic Speech Recognition) system at any time during the call has the transcript of the call up to that point in time. A call transcript to the procedure matcher happens periodically in the call (say, once in 30 seconds or so), where the procedure matcher finds the closest matching procedure(s) from the repository of procedures, as described by the method in accordance with this invention previously, to the call transcript. The matched procedure(s) is (are) displayed to the agent. The agent would be able to take clues from the displayed matching procedure(s) to decide on actions to be taken in the remaining part of the call. - Assume that a laptop help-desk agent receives a call on resetting a user password. There could be two ways of resetting the user password in Windows OS:
-
- 1. pressing Ctrl+Alt+Delete and press on change password button to change the password, and
- 2. opening control panel, clicking on “user accounts” icon to change the password.
- The Knowledge Authoring application is an offline process aimed at capturing any new knowledge which has not been recorded in the procedure DB for possible inclusion into the knowledge base.
FIG. 3 illustrates an exemplary embodiment of use of extracted procedures in knowledge authoring. This is a semi-supervised process where whether or not the suggested new knowledge is worthy of inclusion into the knowledge base is left to the discretion of the knowledge author. - For example, suppose a laptop helpdesk agent, while conversing with a customer, has diagnosed and fixed an unknown problem or diagnosed a known problem but fixed the problem in a way that is not captured in the knowledge base that he typically uses. If this type of the call is discovered then, the person who is responsible for maintaining the knowledge base, could generate a document related to this call, capturing the new knowledge and store the new knowledge in the knowledge base. After completion of a call, the call transcript is fed into the procedure matcher that finds procedure that match with the call (based on a threshold). If matching procedure for the call is not found, the knowledge author is prompted to possibly include the new knowledge captured into a document format and then updated into the knowledge base.
- Although the invention has been described with reference to the embodiments described above, it will be evident that other embodiments may be alternatively used to achieve the same object. The scope of the invention is not limited to the embodiments described above, but can also be applied to software programs and computer program products in general. It should be noted that the above-mentioned embodiments illustrate rather than limit the invention and that those skilled in the art will be able to design alternative embodiments without departing from the scope of the appended claims. In the claims, any reference signs should not limit the scope of the claim. The invention can be implemented by means of hardware and software comprising several distinct elements.
Claims (3)
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US11/614,200 US20080154579A1 (en) | 2006-12-21 | 2006-12-21 | Method of analyzing conversational transcripts |
US12/061,705 US8135579B2 (en) | 2006-12-21 | 2008-04-03 | Method of analyzing conversational transcripts |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US11/614,200 US20080154579A1 (en) | 2006-12-21 | 2006-12-21 | Method of analyzing conversational transcripts |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US12/061,705 Continuation US8135579B2 (en) | 2006-12-21 | 2008-04-03 | Method of analyzing conversational transcripts |
Publications (1)
Publication Number | Publication Date |
---|---|
US20080154579A1 true US20080154579A1 (en) | 2008-06-26 |
Family
ID=39544154
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US11/614,200 Abandoned US20080154579A1 (en) | 2006-12-21 | 2006-12-21 | Method of analyzing conversational transcripts |
US12/061,705 Active 2029-09-30 US8135579B2 (en) | 2006-12-21 | 2008-04-03 | Method of analyzing conversational transcripts |
Family Applications After (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US12/061,705 Active 2029-09-30 US8135579B2 (en) | 2006-12-21 | 2008-04-03 | Method of analyzing conversational transcripts |
Country Status (1)
Country | Link |
---|---|
US (2) | US20080154579A1 (en) |
Cited By (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20090157641A1 (en) * | 2007-12-17 | 2009-06-18 | Frank-Uwe Andersen | Query routing in distributed database system |
US20100332477A1 (en) * | 2009-06-24 | 2010-12-30 | Nexidia Inc. | Enhancing Call Center Performance |
US20110055699A1 (en) * | 2009-08-28 | 2011-03-03 | International Business Machines Corporation | Intelligent self-enabled solution discovery |
US20120278159A1 (en) * | 2011-04-27 | 2012-11-01 | Kumar Gangadharan | Method and apparatus for enhancing customer service experience |
US20150058006A1 (en) * | 2013-08-23 | 2015-02-26 | Xerox Corporation | Phonetic alignment for user-agent dialogue recognition |
US10482132B2 (en) | 2016-03-16 | 2019-11-19 | Microsoft Technology Licensing, Llc | Contact creation and utilization |
US10642889B2 (en) * | 2017-02-20 | 2020-05-05 | Gong I.O Ltd. | Unsupervised automated topic detection, segmentation and labeling of conversations |
US20200167604A1 (en) * | 2018-11-28 | 2020-05-28 | International Business Machines Corporation | Creating compact example sets for intent classification |
US11094320B1 (en) * | 2014-12-22 | 2021-08-17 | Amazon Technologies, Inc. | Dialog visualization |
US11170173B2 (en) | 2019-02-05 | 2021-11-09 | International Business Machines Corporation | Analyzing chat transcript data by classifying utterances into products, intents and clusters |
Families Citing this family (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8494995B2 (en) * | 2010-11-17 | 2013-07-23 | International Business Machines Corporation | Systems and methods for dynamically reconfiguring predictive analytics within workflows using selective user feedback |
US10061822B2 (en) * | 2013-07-26 | 2018-08-28 | Genesys Telecommunications Laboratories, Inc. | System and method for discovering and exploring concepts and root causes of events |
WO2015013554A1 (en) * | 2013-07-26 | 2015-01-29 | Genesys Telecommunications Laboratories, Inc. | System and method for discovering and exploring concepts |
US9971764B2 (en) | 2013-07-26 | 2018-05-15 | Genesys Telecommunications Laboratories, Inc. | System and method for discovering and exploring concepts |
US10666792B1 (en) * | 2016-07-22 | 2020-05-26 | Pindrop Security, Inc. | Apparatus and method for detecting new calls from a known robocaller and identifying relationships among telephone calls |
JP6567128B1 (en) * | 2018-05-08 | 2019-08-28 | ベクスト株式会社 | Conversation support system and conversation support method |
US12113934B1 (en) | 2024-04-04 | 2024-10-08 | Fmr Llc | Systems and methods for intelligent call agent evaluations |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6748353B1 (en) * | 1993-12-29 | 2004-06-08 | First Opinion Corporation | Authoring language translator |
US20040218751A1 (en) * | 2003-04-29 | 2004-11-04 | International Business Machines Corporation | Automated call center transcription services |
US6922466B1 (en) * | 2001-03-05 | 2005-07-26 | Verizon Corporate Services Group Inc. | System and method for assessing a call center |
US20050286705A1 (en) * | 2004-06-16 | 2005-12-29 | Matsushita Electric Industrial Co., Ltd. | Intelligent call routing and call supervision method for call centers |
US7306560B2 (en) * | 1993-12-29 | 2007-12-11 | Clinical Decision Support, Llc | Computerized medical diagnostic and treatment advice system including network access |
US7389229B2 (en) * | 2002-10-17 | 2008-06-17 | Bbn Technologies Corp. | Unified clustering tree |
US7487095B2 (en) * | 2003-02-11 | 2009-02-03 | Microsoft Corporation | Method and apparatus for managing user conversations |
US7606714B2 (en) * | 2003-02-11 | 2009-10-20 | Microsoft Corporation | Natural language classification within an automated response system |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20090112588A1 (en) * | 2007-10-31 | 2009-04-30 | International Business Machines Corporation | Method for segmenting communication transcripts using unsupervsed and semi-supervised techniques |
-
2006
- 2006-12-21 US US11/614,200 patent/US20080154579A1/en not_active Abandoned
-
2008
- 2008-04-03 US US12/061,705 patent/US8135579B2/en active Active
Patent Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6748353B1 (en) * | 1993-12-29 | 2004-06-08 | First Opinion Corporation | Authoring language translator |
US7306560B2 (en) * | 1993-12-29 | 2007-12-11 | Clinical Decision Support, Llc | Computerized medical diagnostic and treatment advice system including network access |
US6922466B1 (en) * | 2001-03-05 | 2005-07-26 | Verizon Corporate Services Group Inc. | System and method for assessing a call center |
US7389229B2 (en) * | 2002-10-17 | 2008-06-17 | Bbn Technologies Corp. | Unified clustering tree |
US7487095B2 (en) * | 2003-02-11 | 2009-02-03 | Microsoft Corporation | Method and apparatus for managing user conversations |
US7606714B2 (en) * | 2003-02-11 | 2009-10-20 | Microsoft Corporation | Natural language classification within an automated response system |
US20040218751A1 (en) * | 2003-04-29 | 2004-11-04 | International Business Machines Corporation | Automated call center transcription services |
US20050286705A1 (en) * | 2004-06-16 | 2005-12-29 | Matsushita Electric Industrial Co., Ltd. | Intelligent call routing and call supervision method for call centers |
Cited By (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20090157641A1 (en) * | 2007-12-17 | 2009-06-18 | Frank-Uwe Andersen | Query routing in distributed database system |
US8166063B2 (en) * | 2007-12-17 | 2012-04-24 | Nokia Siemens Networks Oy | Query routing in distributed database system |
US20100332477A1 (en) * | 2009-06-24 | 2010-12-30 | Nexidia Inc. | Enhancing Call Center Performance |
US8051086B2 (en) * | 2009-06-24 | 2011-11-01 | Nexidia Inc. | Enhancing call center performance |
US20110055699A1 (en) * | 2009-08-28 | 2011-03-03 | International Business Machines Corporation | Intelligent self-enabled solution discovery |
US8291319B2 (en) * | 2009-08-28 | 2012-10-16 | International Business Machines Corporation | Intelligent self-enabled solution discovery |
EP2702477A4 (en) * | 2011-04-27 | 2014-11-05 | 24 7 Customer Inc | Method and apparatus for enhancing customer service experience |
EP2702477A1 (en) * | 2011-04-27 | 2014-03-05 | 24/7 Customer, Inc. | Method and apparatus for enhancing customer service experience |
US20120278159A1 (en) * | 2011-04-27 | 2012-11-01 | Kumar Gangadharan | Method and apparatus for enhancing customer service experience |
US20150058006A1 (en) * | 2013-08-23 | 2015-02-26 | Xerox Corporation | Phonetic alignment for user-agent dialogue recognition |
US11094320B1 (en) * | 2014-12-22 | 2021-08-17 | Amazon Technologies, Inc. | Dialog visualization |
US10482132B2 (en) | 2016-03-16 | 2019-11-19 | Microsoft Technology Licensing, Llc | Contact creation and utilization |
US10642889B2 (en) * | 2017-02-20 | 2020-05-05 | Gong I.O Ltd. | Unsupervised automated topic detection, segmentation and labeling of conversations |
US20200167604A1 (en) * | 2018-11-28 | 2020-05-28 | International Business Machines Corporation | Creating compact example sets for intent classification |
US11748393B2 (en) * | 2018-11-28 | 2023-09-05 | International Business Machines Corporation | Creating compact example sets for intent classification |
US11170173B2 (en) | 2019-02-05 | 2021-11-09 | International Business Machines Corporation | Analyzing chat transcript data by classifying utterances into products, intents and clusters |
Also Published As
Publication number | Publication date |
---|---|
US20080181390A1 (en) | 2008-07-31 |
US8135579B2 (en) | 2012-03-13 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US8135579B2 (en) | Method of analyzing conversational transcripts | |
US10387410B2 (en) | Method and system of classification in a natural language user interface | |
CN108197282B (en) | File data classification method and device, terminal, server and storage medium | |
CN108153800B (en) | Information processing method, information processing apparatus, and recording medium | |
US8781880B2 (en) | System, method and apparatus for voice analytics of recorded audio | |
WO2006085661A1 (en) | Question answering data edition device, question answering data edition method, and question answering data edition program | |
CN110516057B (en) | Petition question answering method and device | |
EP3232336A1 (en) | Method and device for recognizing stop word | |
US20220058213A1 (en) | Systems and methods for identifying dynamic types in voice queries | |
JPWO2019156103A1 (en) | FAQ maintenance support device, FAQ maintenance support method, and program | |
CN109508441A (en) | Data analysing method, device and electronic equipment | |
CN114722191A (en) | Automatic call clustering method and system based on semantic understanding processing | |
CN110311943A (en) | The inquiry of data and methods of exhibiting in a kind of electric power enterprise big data platform | |
CN113886545A (en) | Knowledge question answering method, knowledge question answering device, computer readable medium and electronic equipment | |
WO2012047955A1 (en) | Language dictation recognition systems and methods for using the same | |
CN110413770B (en) | Method and device for classifying group messages into group topics | |
CN114491010A (en) | Training method and device of information extraction model | |
CN111949777A (en) | Intelligent voice conversation method and device based on crowd classification and electronic equipment | |
CN116501844A (en) | Voice keyword retrieval method and system | |
CN116303951A (en) | Dialogue processing method, device, electronic equipment and storage medium | |
CN116166771A (en) | Speech service content recognition method based on behavior text processing | |
CN114580407A (en) | Intelligent consulting method and system for industry recruitment in park based on maximum entropy | |
CN117972160B (en) | Multi-mode information processing method and device | |
CN115878775B (en) | Method and device for generating cross-type dialogue data | |
JP7231171B1 (en) | Processing operation support device and program |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: INTERNATIONAL BUSINESSS MACHINES CORPORATION, NEW Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:KUMMAMURU, KRISHNA;PADMANABHAN, DEEPAK S;REEL/FRAME:018666/0736 Effective date: 20061218 |
|
STCB | Information on status: application discontinuation |
Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION |
|
AS | Assignment |
Owner name: MIDWAY TECHNOLOGY COMPANY LLC, MICHIGAN Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:INTERNATIONAL BUSINESS MACHINES CORPORATION;REEL/FRAME:037704/0257 Effective date: 20151231 |
|
AS | Assignment |
Owner name: SERVICENOW, INC., CALIFORNIA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:MIDWAY TECHNOLOGY COMPANY LLC;REEL/FRAME:038324/0816 Effective date: 20160324 |