WO2004099934A2 - Apparatus and method for processing service interactions - Google Patents

Apparatus and method for processing service interactions Download PDF

Info

Publication number
WO2004099934A2
WO2004099934A2 PCT/US2004/013946 US2004013946W WO2004099934A2 WO 2004099934 A2 WO2004099934 A2 WO 2004099934A2 US 2004013946 W US2004013946 W US 2004013946W WO 2004099934 A2 WO2004099934 A2 WO 2004099934A2
Authority
WO
WIPO (PCT)
Prior art keywords
free form
form input
human
accordance
workflow
Prior art date
Application number
PCT/US2004/013946
Other languages
French (fr)
Other versions
WO2004099934A3 (en
Inventor
Michael Eric Cloran
Original Assignee
Interactions, Llc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Interactions, Llc filed Critical Interactions, Llc
Priority to CA002524591A priority Critical patent/CA2524591A1/en
Priority to NZ543885A priority patent/NZ543885A/en
Priority to EP04751363A priority patent/EP1620777A4/en
Priority to AU2004237227A priority patent/AU2004237227B2/en
Publication of WO2004099934A2 publication Critical patent/WO2004099934A2/en
Publication of WO2004099934A3 publication Critical patent/WO2004099934A3/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/50Centralised arrangements for answering calls; Centralised arrangements for recording messages for absent or busy subscribers ; Centralised arrangements for recording messages
    • H04M3/51Centralised call answering arrangements requiring operator intervention, e.g. call or contact centers for telemarketing
    • H04M3/5166Centralised call answering arrangements requiring operator intervention, e.g. call or contact centers for telemarketing in combination with interactive voice response systems or voice portals, e.g. as front-ends
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063Operations research, analysis or management
    • G06Q10/0631Resource planning, allocation, distributing or scheduling for enterprises or organisations
    • G06Q10/06311Scheduling, planning or task assignment for a person or group
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/10Office automation; Time management
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q20/00Payment architectures, schemes or protocols
    • G06Q20/22Payment schemes or models
    • G06Q20/24Credit schemes, i.e. "pay after"
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/01Customer relationship services
    • G06Q30/015Providing customer assistance, e.g. assisting a customer within a business location or via helpdesk
    • G06Q30/016After-sales
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/487Arrangements for providing information services, e.g. recorded voice services or time announcements
    • H04M3/493Interactive information services, e.g. directory enquiries ; Arrangements therefor, e.g. interactive voice response [IVR] systems or voice portals

Definitions

  • This invention relates to the field of interactive response communication systems, and, more particularly to an interactive response communications systems that use human interpretation of customer intent and data as input to a workflow on a computer.
  • INR interactive voice response
  • An INR system typically communicates with customers using a set of prerecorded phrases, responds to some spoken input and touch-tone signals, and can route or transfer calls.
  • a drawback to such INR systems is that they are normally built around a "menu" structure, which presents callers with just a few valid options at a time and require a narrow range of responses from callers.
  • the typical model of customer service interaction is for one agent to assist a customer for the duration of the customer's interaction.
  • one agent for example, a technical support representative
  • may transfer the customer to another agent such as a sales representative
  • another agent such as a sales representative
  • one agent spends his or her time assisting that one customer for the full duration of the customer's call or chat session, or is occupied resolving the customer's issue via e-mail.
  • Most call centers also expect the agent to take the time to log (document) the call.
  • Deficiencies in this heavy agent interface model is (1) there is a high agent turnover rate and (2) a great deal of initial and ongoing agent training is usually required, which all add up to making customer service a significant expense for these customer service providers.
  • the interactive response system handles outgoing communication to the end user, including speaking (using text-to-speech or professionally recorded clips) so the human agent's voice would never be heard by the customer, eliminating any concern of an agent's accent.
  • the actual interaction with the customer is managed by a software-based router whose front end is email, interactive data or speech capable.
  • a router in accordance with this invention seamlessly blends both human customer service agents (for interpretation of written or spoken input) and software speech recognition (for spoken word input) to interpret customer input in real-time for intelligent interpretation.
  • New agents can act in the customer service agent role without their response being weighted by the router, but the agent can still receive feedback on their performance.
  • An objective performance measure then exists to decide when to promote a new hire from trainee status.
  • all responses to customers used by the system are scripted and engineered, removing the huge requirements of training customer service agents in how to speak to customers.
  • workflows are updated any time by business analysts, process engineer, or company appointed personnel.
  • Workflows may be advantageously designed so that the automated portion of the system can handle tasks involving sensitive data such as social security numbers, credit cards, etc., whereby the human agents never have access to this data.
  • the workflow may be engineered to distribute the components of the input in a manner such that no one agent handles the whole of the customer data. For example, one agent might see or hear a full customer name, while another has access to the customer's social security number, and neither learns the customer's home address.
  • This invention provides a system and method of blending human interpretation, speech recognition technology, text parsing and lexical analysis, text-to-speech capabilities and other resources in a system for the automated processing of customer-company interactions.
  • the central element of the system is a software-based router that manages the conversation with the end user either in real-time (voice, online text chats) or correspondence (e-mail).
  • the router follows rules (herein called "workflows") developed and tweaked over time by business analysts. These rules form a script for the router to follow when interacting with end users.
  • the router draws on both text-to- speech capabilities and prerecorded responses when replying to an end user.
  • the router employs both speech recognition technology and the interpretive abilities of human customer service agents, seamlessly blending the two. This blending can be performed in real-time or near real-time to allow the router to carry on a conversation-like interaction with an end user.
  • the system integrates human agents in an innovative way. Because user input is digitized, the router can direct only those portions of the input that require human interpretation to human agents. No one customer service agent is tied to a customer conversation for its entire duration; the interaction is managed by the router itself. Also, the router is able to send the digitized input to more than one human agent for simultaneous interpretation, which provides double and triple checking of each answer from the agents. Such double and triple check also provides an objective measure and ranking of the speed and accuracy of the agents.
  • the system is designed to work over a TCP/IP network, so that the customer service agents can be located virtually anywhere in the world.
  • the system comprises off-the-shelf hardware and software, some customized software (as noted below), and can be integrated with existing company resources, such as databases and telephone networks.
  • FIG. 1 is a block diagram illustrating one embodiment of an architecture of an interactive response system according to an exemplary embodiment of this invention
  • FIG. 2 is a flow chart illustrating an embodiment of a method of the present invention for communication among a customer, the interactive response system and a human interface;
  • FIG. 3 A is a chart illustrating one embodiment of a customer/interactive response system interaction in the context of FIG. 2;
  • FIG. 3B is a computer screen illustrating one embodiment for capturing customer intent and data in the context of FIG. 2;
  • FIG. 4A is a chart illustrating one embodiment of a customer/interactive response system interaction in the context of FIG. 2;
  • FIG. 4B is a computer screen illustrating one embodiment for capturing customer intent and data in the context of FIG. 2;
  • FIG. 5 A is a chart illustrating one embodiment of a customer/interactive response system interaction in the context of FIG. 2;
  • FIG. 5B is a computer screen illustrating one embodiment for capturing customer intent and data in the context of FIG. 2;
  • FIG. 6 is a flow chart of processing an email in the context of an interactive response system in accordance with another aspect of this invention. Detailed Description
  • FIG. 1 illustrates one embodiment of an architecture, of the type in which the present invention can be used, for connecting an interactions platform 102 to an interactive response system 100 through an interactive router 101 (herein referred to as an "iRouter").
  • interactions platform 102 is connected to a customer 103 through communications link 104.
  • Interactions platform 102 is also connected to interactive response system 100 at iRouter 101 via a datalink, which comprises a TCP/IP data link in this exemplary embodiment.
  • Interactions platform 102 in this exemplary embodiment comprises a computer server.
  • Interactions platform 102 can also be an e-mail gateway or web server.
  • customer input enters interactive response system 100 via telephone or intercom and text is entered via email or an interactive chatting interface (e.g., a web page or a stand-alone application such as AOL Instant Messenger).
  • an interactive chatting interface e.g., a web page or a stand-alone application such as AOL Instant Messenger.
  • a number of different types of devices can be used to implement each of the interactions platform 102 and communications links 104.
  • Interactions platform 102 may be implemented by any device capable of communicating with the customer 103.
  • interactions platform 102 may be a telephony server in interactive response system 100 where the customer is calling by telephone.
  • the telephony server handles answering, transferring and disconnecting incoming calls.
  • the telephony server is also a storehouse for prerecorded audio clips so that it can play any welcome prompt and as other audio clips as directed by iRouter 101.
  • a telephony server in accordance with this embodiment is assembled from off- the-shelf components, for example Windows XP Professional for an operating system, a central processor, such as a Pentium processor, and an Intel Dialogic voice board.
  • the communications link 104 may be implemented by any means of providing an interface between the customer's telephone and the telephony server.
  • communications link 104 may be a dial-up connection or a two-way wireless communication link.
  • interactions platform 102 may be a gateway server in interactive response systems 100.
  • the customer interacts with the interactive response server by e-mail or by interactive text chats.
  • the gateway server runs customized open source e-mail or www server software.
  • a gateway server in accordance with this exemplary embodiment is designed to conduct e-mail and interactive text chat transactions with customers, while also forwarding and receiving data to other elements of the system.
  • the communications link 104 maybe implemented by any means of providing an interface between the customer's computer and the gateway server.
  • communications link 104 may be a dedicated interface, a single network, a combination of networks, a dial-up connection or a cable modem.
  • an interactive response system may communicate via voice and text data with a customer.
  • multiple customer bases may be accommodated by a dedicated interactions platform 102 for each of the customer bases.
  • a workflow (as will be described further, below) can be selected by determining which of the multiple interactions platforms 102 initiated the interaction.
  • the iRouter 101 comprises software to control interactive response system 100.
  • iRouter 101 "owns” the interaction with customer 103 from beginning to end by coordinating activity among other components and managing the transaction.
  • iRouter 101 manages interactions with customer 103 according to one or more programmable scripts, called, according to this exemplary embodiment, "workflows.”
  • a workflow comprises an interaction flow wherein the path through the workflow depends upon intent and data input from the customer. Workflows are preprogrammed by system engineers and, advantageously, periodically "tweaked” in order to improve customer satisfaction, speed, accuracy, etc.
  • iRouter 101 is almost always "in charge” of selecting the next step or path in the workflow.
  • iRouter 101 receives interaction input from interactions platform 102 in the form of audio clips, email, text data or other interaction type ⁇ depending on the form of customer communication - and forwards the input to one or more human agents 105, speech recognition engines or expert systems (collectively 108) and uses the responses to advance its current workflow.
  • human interpretation (or translation) of the input is necessary, iRouter 101 directs human agent desktop software to display an appropriate visual context of the current workflow.
  • iRouter 101 advances through the workflow and directs interactions platform 102 to respond appropriately to customer 103.
  • interactions platform 102 comprises a telephony server
  • iRouter 101 may deliver sound clips to play back to a customer, send text-to- speech clips or both.
  • interactions platform 102 may store sound clips, have text- to-speech capability or both
  • iRouter directs interactions platform 102 as to what to play to a customer and when.
  • iRouter 101 comprises, in this exemplary embodiment, a networked, off-the-shelf commercially available processor running an operating system such as Windows XP or Linux.
  • iRouter 101 software includes a modified open NXML browser and voice XML script incorporating objects appropriate to the specific application. One skilled in the art will understand how to construct these objects after studying this specification.
  • interactive response system 100 includes at least one pool of human agents 105.
  • a pool of human agents 105 is often located at a contact center site.
  • Human agents 105 use specialized desktop software specific to system 100 (as will be described further, below, in connection with FIGs. 3B, 4B and 5B) that presents a given workflow on their screen —along with a history or context of the customer interaction to that point.
  • the human agent or agents 105 interpret the input and select an appropriate customer intent, data or both in the workflow.
  • human agents 105 wear headphones and hear sound clips streamed from the telephony server 102 at the direction of iRouter 101.
  • a single human agent 105 will not handle the entire transaction for customer 103. Rather, human agent 105 handles some piece of the transaction that has been designated by the workflow designer as requiring human interpretation of customer's 103 utterance.
  • IRouter 101 can send the same customer 103 interaction to any number of human agents 105, and may distribute pieces of a given interaction to many different human agents 105.
  • human agents 105 are preferably off-site. Further, human agents 105 maybe in diverse geographic areas of the world, such as India, the Philippines and Mexico.
  • Human agents 105 may be in groups in a building or may be working from home. In applications that require 24/7 human agent support, human agents may be disposed around the world so that each human agent may work during suitable business hours.
  • Interactive response system 100 of the present invention employs custom human agent application software. Human agents 105 use a custom application developed in Java and running on a standard call center computer network workstation. Generally speaking, interactive response system 100 applies human intelligence towards interpretation of customer 103 input into "intent" (what the customer wants) and data (any input requires to determine what the customer wants). The interpretation normally comprises selecting the most-correct interpretation of what was said from a list of choices, in this exemplary embodiment.
  • Workflow server 106 of the present invention is an archive of the workflows used by the Interactions router.
  • Workflow server 106 can be built with off-the-shelf hardware using a commercially available processor running a standard server operating system, with the workflow documents written in XML in this exemplary embodiment.
  • Workflow server 106 maintains a compilation of business rules that govern the behavior of iRouter 101.
  • Interactive response system 100 employs a workflow designer used by a business analyst or process engineer to map out workflows.
  • a workflow serves as the map that iRouter 100 follows in a given interaction, with speech recognition or human agents.
  • the workflow "steers" iRouter 100 along a path in the workflow in response to customer input.
  • a place in the workflow, along with data collected to that point is called a "context.”
  • Performance and interactions archive 107 of the present invention comprises a database that can be maintained on any common computer server hardware. Performance and interactions archive 107 contains both archival data of system transactions with customers 103 (i.e., a repository of sound clips, e-mails, chats, etc. from interactions with customer 103) as well as performance data for human agents 105.
  • the present invention employs "reporter" software to generate statistics about a group of interactions or to display performance ranking for human agent 105.
  • Reporter software can also reconstruct an interaction with customer 103 from sound clips, e-mails, or chat text that constituted customer's 103 contact stored in interactions archive 107.
  • Reporter software is a series of simple scripts, and can run on any common server hardware.
  • the present invention also comprises, in this exemplary embodiment, manager/administrator software, usually run from the same station as reporter software.
  • Manager/administrator software sets operating parameters for interactive response system 100.
  • Such operating parameters include, but are not limited to, business rules for load balancing, uploading changes in workflow, and other administrative changes.
  • Manager/administrator software is often a small custom Java application running on a standard call center computer workstation.
  • Support system 108 of the present invention consist of numerous databases and customer proprietary systems (also including off-the-shelf speech recognition software such as
  • support system 108 may include a database for customer information or a knowledge base.
  • Speech recognition software is, in this exemplary embodiment, an off-the-shelf component used to interpret customer 103 utterances.
  • Support system 108 may also include a text-to-speech capability, often off-the-shelf software that reads text to customer 103.
  • Company agents 109 of the present invention consist of human agents that handle customer 103 requests not relevant to interactive response system 100 matters. For example, should customer 103 require specific assistance with a company matter that an outsourced human agent 105 is not capable of handling, interactive response system 100 transfers the call to company agent 109.
  • the elements of interactive response system 100 communicate over a TCP/IP network in this exemplary embodiment. Communication is driven by the workflow that iRouter 101 follows.
  • Database in the present embodiment can be a flat file database, a relational database, an object database, or some combination thereof.
  • a database is searchable using any database searching language, such as Structured Query Language (SQL).
  • SQL Structured Query Language
  • Server and “workstation” refer to any general purpose computer system which is programmable using a computer programming language, such as C++, Java, or other language, such as a scripting language or assembly language. These computer systems may also include specially programmed, special purpose hardware, for example Intel Dialogic voice boards.
  • the invention is not limited to a particular computer platform, particular processor, particular operating system, or particular high-level programming language. Additionally, the computer system may be a multiprocessor computer system or may include multiple computers connected over a computer network. The invention is not limited to any particular implementation using software or hardware or firmware, or any combination thereof.
  • FIG.'s 2 through 5 these figures illustrate an example of how information is retrieved and handled by interactive response system 100 when a customer interacts with the interactive response system 100 via telephone.
  • the example shown in FIG. 2 presupposes that all required hardware, software, networking and system integration is complete, and that a business analyst has mapped out the possible steps in a customer interaction using the graphic workflow designer.
  • the business analyst also has scripted the text for anything that the interactive response system may say to a customer, including, but not limited to, the initial prompt (e.g., "Thank you for calling, how can I help you today?"), response(s) to a customer, requests for additional information, "stutter speech” (sounds sent to the customer while the iRouter is determining a response), and a closing statement.
  • Either text-to-speech software or voice talent records the server-side speech pieces as written by the business analyst. This workflow is then loaded into the interactive response system where it is available to the iRouter.
  • the interaction begins with the customer calling the customer service telephone number of a company.
  • the interactions platform in this case a telephony server, answers the telephone call and retrieves the appropriate workflow stored in the workflow database, based on either (1) ANI/DNIS information of the caller or (2) other business rules (e.g., line or trunk the call came in on), as illustrated at 202.
  • the telephony server then plays the appropriate welcome prompt as illustrated at 203 and the customer then responses to that prompt (block 204).
  • an imaginary airline, iterair provides customer service via an interactive response system in accordance with a call center embodiment of this invention.
  • the interaction platform is therefore a telephony interface and iRouter selects a workflow appropriate to Interair.
  • a first point or context in the workflow is shown in the illustrative workflow of FIG. 3 A.
  • the telephony server begins digitizing the customer's spoken input and connects to the iRouter.
  • workflow or business rules determine if the interactive response to the customer needs to be handled by a human agent or speech recognition software. That is, the iRouter selects the appropriate workflow for the call from the workflow repository and follows the workflow rules to conduct a conversation with the customer.
  • iRouter uses software-based speech recognition from the support systems or has the customer's audio streamed to human agents in contact centers as appropriate, as illustrated in block 205. If human agents are required by the workflow, iRouter identifies available human agents by applying a load balancing algorithm, triggers workflow pop-up on their screens (as illustrated in the initially blank pop-up screen, FIG. 3B), and begins streaming customer audio to the one or more identified human agents, as shown at block 207. The human agent(s) hear the customer utterance in headphones, and computer software prompts for an interpretation of the utterance as shown in blocks 210 and 211. In accordance with the exemplary workflow of FIG.
  • the customer utterance that the human agent or agents hear is "I need to check my flight from Chicago to London this afternoon.”
  • the agents' screen indicates the current context (or point in the workflow) as illustrated in FIG. 4B.
  • Such multiplicity of selection allows the agents interpretive flexibility, which enables the iRouter to jump around in its workflow according to the interpreted intent.
  • the iRouter can respond appropriately even if the customer changes subjects in midstream.
  • each agent selects what he or she feels is the best fit interpretation of the customer utterance in the current context of the workflow.
  • the human agent(s) selects "CFT" (Check Flight Time) and enters or selects from drop down menus the departure and arrival cities (or other, preprogrammed information that the customer could possibly utter).
  • CFT Check Flight Time
  • human agents can elect to apply acceleration to the customer audio clip(s) received at the station in order to compensate for any response delay (usually due to lag time in application set-up - the time it will take for human agent desktop software to accept the streaming audio and display the appropriate workflow).
  • Network latency might be around 0.2 seconds, where application delay could be more in the 1+ second range.
  • the interactive response system accelerates the voice clip (although not to the point of discernible distortion). The purpose is to strive for a more "real- time" conversational interaction, so that the customer does not experience a notable delay while awaiting a response.
  • the acceleration is applied to the speech as it is streaming from the telephony server.
  • acceleration can never overcome the inherent latency of the link but will allow human agents to "recover" any application set-up time and reduce the amount of lag time in the interaction, ideally up to the limits imposed by latency in the network.
  • acceleration is optional, wherein a novice agent may need a slower playback, while a more experienced agent may apply acceleration.
  • the iRouter evaluates the accuracy, in real time, of the customer audio interpretation and updates each agent's speed/accuracy profile.
  • the iRouter processes the interpretation and performs the next step(s) in the workflow (e.g., database lookup based on input data) and then forwards an appropriate response 218 to the customer through the telephony server (if the interpretation is deemed accurate). If the iRouter determines the interpretation is accurate, it directs the playback of responses to the customer from the telephony server based on the interpretation of either the speech recognition software or by applying key algorithms to the responses of one or more human agents. In this example, the response is given in the last block of screen 2, FIG. 4A. .
  • the iRouter compares the interpretation of two human agents, and, if no consensus is reached, plays the customer audio clip for a third human agent for a further interpretation (i.e., "majority rule" determines which is the accurate response).
  • Other business rules may also be used to determine the accurate interpretation. For example, an interpretation from the agent with the best accuracy score may be selected. Alternatively, one of the interpretations may be selected and, played back to the customer ("I understood you to say ") and the customer response determines whether the interpretation was correct. Further, the interpretations may be selected from known data (e.g., two interpretations of an email address could be compared against a database of customer email addresses, only one of two interpretations of a credit card number will pass a checksum algorithm, etc).
  • the interactive response system allows for virtually any number of human agents to handle to same customer interaction at once. That is, an interactive response system could have two agents listening during a busy time or have seven human agents listening during a more idle time. Moreover, during times of high call volume, accuracy can be decreased by removing the "double-checking" rule to maintain high response time. An agent assigned a high trust ranking based on the agent's speed/accuracy profile may be asked to work without the double- checking. In addition to trading off accuracy for quicker system availability, a steady flow of audio clips is flowing by each agent, thereby decreasing human agent "slack" time.
  • the call will be transferred (if so directed by a step in the workflow or by business rules), or the customer terminates the call, as shown in block 215. If the interpretation is deemed inaccurate in block 213, the iRouter plays a stall speech to the customer (block 216) and send the audio clip to another human agent for another interpretation (block 217) and then reevaluate its accuracy.
  • the iRouter manages interaction with the customer to call completion, using the workflow as its guide.
  • the iRouter may stream customer utterances to human agents for interpretation at numerous points in the call.
  • a snapshot of the customer interaction is preserved in the archive database. Human agents' speed/accuracy profiles are constantly updated and maintained. If human intervention is not needed to interpret customer's request, speech recognition software interprets the audio clip and the iRouter determines the appropriate response as shown in blocks 206 and 214.
  • the captured customer utterance has two requests: food and entertainment queries.
  • the human agent captures two intents: meal and movie.
  • the interactive response system already knows the flight information from the previous data entered in FIG. 4B (this data is visible in FIG. 5B).
  • the human agent enters "General” and “Meal.”
  • the human agent also enters "Movie.”
  • the interactive response system then provides the appropriate response.
  • the customer requests further information regarding the meal or movie such as: "what meal is offered?", "Are their special meals?", "What is the movie rated?”, the appropriate human agent interpretation options are located on the computer screen.
  • FIG. 6 illustrates an example of how information is retrieved and handled by the interactive response system when a customer interacts via electronic mail (email, as it is commonly l ⁇ iown in the art).
  • the interaction begins with the customer emailing to the customer service email of a company.
  • the interactions platform in this exemplary embodiment, a gateway server, opens the email and retrieves the appropriate workflow stored in the workflow database based on either (1) the to/from information of the customer or (2) other business rules, as illustrated at 602.
  • the gateway server then sends the appropriate response acknowledgement as illustrated at 602.
  • the iRouter identifies available human agent(s) to handle the email by applying a load balancing algorithm, triggers workflow pop-up on their screens, and begins streaming customer audio to the or those human agents, as shown at block 603.
  • the human agent(s) interpret the email as shown in blocks 604 and 605.
  • the iRouter evaluates the accuracy, in real time, of the customer email interpretation and updates each agent's speed/accuracy profile
  • the iRouter processes the interpretation and performs the next steps in the workflow accordingly.
  • the iRouter forwards an appropriate email response to the customer through the gateway server (if the interpretation is deemed accurate) as seen in block 607.
  • the emails are then archived in the appropriate database as illustrated in block 608. If the interpretation is deemed inaccurate, the iRouter sends the email to another human agent for another interpretation (block 609) and then reevaluate its accuracy.
  • the iRouter manages interaction with the customer to email response, using the workflow as its guide.
  • a seamless blend of speech recognition software and human agent interaction to provide added customer privacy and security such that human access to confidential customer data is minimized.
  • customer personal data such as credit card information, social security number and address
  • the present invention uses a software-based front-end that incorporates speech recognition technology to allow a computer to capture, verify, or update a customer's sensitive data.
  • the software manages the customer interaction so that confidential data is stored in a database and not passed to any human agent.
  • the software can stream audio clips of the customer's utterances over a TCP/IP network to client software being used by human agents any time that human intervention is required.
  • the transaction is portioned into discrete, logical units allows business analysts to engineer the process so that the same human agent never sees more than one element of a given set of customer data. For example, if two agents see a particular customer's credit card number, two different agents see the customer's name. No one agent sees a full record or profile for a given customer. This helps call center operations, which often experience high agent turnover, minimize the problem of identity theft.
  • interactions platform 102 may also accommodate still pictures in any format (e.g., jpeg, tiff), motion pictures, scanned data, facsimiles, web pages, etc., which can be forwarded to a human agent's station. Such facility is useful, for example, for monitoring alarms, parsing faxes, etc. The human agent's interpretation is then delivered to the iRouter in the context of the workflow, as above.
  • jpeg, tiff motion pictures
  • scanned data e.g., scanned data
  • facsimiles e.g.
  • web pages e.g., etc.
  • Such facility is useful, for example, for monitoring alarms, parsing faxes, etc.
  • the human agent's interpretation is then delivered to the iRouter in the context of the workflow, as above.

Abstract

An interactive voice and data response system then directs input to a voice, text, and web-capable software-based router, which is able to intelligently respond to the input by drawing on a combination of human agents, advanced speech recognition and expert systems, connected to the router via a TCP/IP network. The digitized input is broken down into components so that the customer interaction is managed as a series of small tasks rather than one ongoing conversation. The router manages the interactions and keeps pace with a real-time conversation. The system utilizes both speech recognition and human intelligence for purposes of interpreting customer utterance or customer text. The system may use more than one human agent, or both human agents and speech recognition software, to interpret simultaneously the same component for error-checking and interpretation accuracy.

Description

APPARATUS AND METHOD FOR PROCESSING SERVICE INTERACTIONS
Cross-Reference to Related Application
This application is related to and claims the benefit of U.S. Provisional Application No. 60/467,935, filed May 5, 2003, the entire disclosure of which is hereby incorporated herein by reference. Field of the Invention
This invention relates to the field of interactive response communication systems, and, more particularly to an interactive response communications systems that use human interpretation of customer intent and data as input to a workflow on a computer. Background of the Invention
Many companies interact with their customers via electronic means (most commonly via telephone, e-mail, and online text chat). Such electronic systems save the companies a large amount of money by limiting the number of customer service or support agents needed. These electronic systems, however, generally provide a less than satisfactory customer experience. The customer experience may be acceptable for simple transactions, but are frequently inconsistent or downright frustrating if the customer is not adept at talking to or interacting with a computer.
Such interactive response systems are well known in the art. For example, providing customer service via telephone using an interactive voice response (INR) system is one such system. An example of customer service systems utilizing IVR technology is described in U.S. Patent No. 6,411,686. An INR system typically communicates with customers using a set of prerecorded phrases, responds to some spoken input and touch-tone signals, and can route or transfer calls. A drawback to such INR systems is that they are normally built around a "menu" structure, which presents callers with just a few valid options at a time and require a narrow range of responses from callers.
Many of these INR systems now incorporate speech recognition technology. An example of a system incorporating speech recognition technology is described in U.S. Patent No. 6,499,013. The robustness of the speech recognition technology used by IVR systems vary, but at present all have a predetermined range of responses that they listen for and can understand, which limits the ability of the end user to interact with the system in everyday language.
Therefore, the caller will often feel that they are being forced to speak to the system "as though
05/05/2004 1547887.01 they are talking to a computer." Moreover, even when interacting with a system that utilizes speech recognition, customer input is often either not recognized or incorrectly determined, causing the customer to seek a connection to a human customer service agent as soon as possible. Human customer service agents continue to be used for more involved customer service requests. These agents may speak to the customer over the phone, respond to customer e-mails, and chat with customers online. Agents normally answer customer questions or respond to customer requests. Companies have customer service groups, which are sometimes outsourced to businesses that specialize in "customer relations management." Such businesses run centers staffed by hundreds of agents who spend their entire working day on the phone or otherwise interacting with customers. An example of such system is described in U.S. Patent No. 5,987,116.
The typical model of customer service interaction is for one agent to assist a customer for the duration of the customer's interaction. At times, one agent (for example, a technical support representative) may transfer the customer to another agent (such as a sales representative) if the customer needs help with multiple requests. But in general, one agent spends his or her time assisting that one customer for the full duration of the customer's call or chat session, or is occupied resolving the customer's issue via e-mail. Most call centers also expect the agent to take the time to log (document) the call. Deficiencies in this heavy agent interface model is (1) there is a high agent turnover rate and (2) a great deal of initial and ongoing agent training is usually required, which all add up to making customer service a significant expense for these customer service providers.
In order to alleviate some of the expenses associated with agents, some organizations outsource their customer service needs. One trend in the United States in recent years, as high-speed fiber optic voice and data networks have proliferated, is to locate customer service centers overseas to take advantage of lower labor costs. Such outsourcing requires that the overseas customer service agents be fluent in English. In cases where these agents are used for telephone-based support, the agent's ability to understand and speak clearly in English is often an issue. An unfortunate result of off shore outsourcing is misunderstanding and a less than satisfactory customer service experience for the person seeking service. Therefore, there is a need in the art for an interactive system that provides a consistently high-quality experience without the expense of a large staff of dedicated, highly trained agents. Summary of the Invention It is therefore an object of the invention to provide an interactive response system with interactions portioning. That is, a human agent would be able to interact intermittently through a customer call by hearing only those portions of the call requiring his or her interpretation so no one customer service agent is tied to the customer's conversation for its full duration. It is an additional object of the invention to provide an interactive response system with multiple-agent checking so that a customer's intent, input (data) or both, is accurately determined. Using double, triple or more checking, more than one human agent evaluates and chooses an interpretation for an instance of customer input, thus improving accuracy of the call and providing an objective measure of each human agent's speed and accuracy.
It is also an object of the invention to provide an interactive response system with agent portability so that customer service agents can be located nearly anywhere in the world. If a human agent is needed to interpret intent, data, or both, one, or advantageously two or more, agents hear or see only the component of the interaction need to be interpreted or translated into a context that the interactive response system understands. The interactive response system handles outgoing communication to the end user, including speaking (using text-to-speech or professionally recorded clips) so the human agent's voice would never be heard by the customer, eliminating any concern of an agent's accent. The actual interaction with the customer is managed by a software-based router whose front end is email, interactive data or speech capable. It is a further object of the invention to provide an interactive response system in which the customer can speak in a conversational tone instead of responding as if "speaking to a computer." A router in accordance with this invention seamlessly blends both human customer service agents (for interpretation of written or spoken input) and software speech recognition (for spoken word input) to interpret customer input in real-time for intelligent interpretation. It is an even further object of the invention to provide an interactive response system that allows for simplified human agent training and evaluation. This invention provides the ability for multiple agents to evaluate the same component of customer input simultaneously. Further, this invention provides a means to objectively rate the speed and accuracy of an agent's response to input, which greatly simplifies the hiring and training process. New agents can act in the customer service agent role without their response being weighted by the router, but the agent can still receive feedback on their performance. An objective performance measure then exists to decide when to promote a new hire from trainee status. In addition, all responses to customers used by the system are scripted and engineered, removing the huge requirements of training customer service agents in how to speak to customers.
It is another object of the invention to provide an interactive response system that allows for workload balancing by dynamically adjusting the number of agents assigned to each component of customer interaction for purposes of multiple agent checking. For example, in times of heavier end-user traffic, the system advantageously evaluates and executes a tradeoff between agent accuracy and availability. To effect such balancing, some components of customer input are single-checked by the most accurate agents - thereby maintaining 100% availability of the system. At times of lower traffic, accuracy is increased through triple or quadruple checking, which also creates a steady pace of work for human agents. Being able to ramp up availability without severely degrading accuracy is a significant enhancement over current call center models.
It is yet another object of the invention to provide an interactive response system that provides speech acceleration to enable faster customer service and response time.
Acceleration applied to audio being streamed across a TCP/IP network to help overcome delays introduced by application setup times.
It is even yet another object of the invention to provide an interactive response system with interaction control such that interactive steps with customers are determined by choices in a workflow. Advantageously, workflows are updated any time by business analysts, process engineer, or company appointed personnel.
It is a still further object of the invention to provide an interactive response system with end user security so that customer confidential data is kept secure. Workflows may be advantageously designed so that the automated portion of the system can handle tasks involving sensitive data such as social security numbers, credit cards, etc., whereby the human agents never have access to this data. Even if a workflow requires that customer service agents do handle sensitive data, the workflow may be engineered to distribute the components of the input in a manner such that no one agent handles the whole of the customer data. For example, one agent might see or hear a full customer name, while another has access to the customer's social security number, and neither learns the customer's home address. These and other objects of the invention are accomplished in accordance with the principles of the invention by providing an interactive response system that uses human agents to interpret and input customer intent and data from customer utterances or written text. This invention provides a system and method of blending human interpretation, speech recognition technology, text parsing and lexical analysis, text-to-speech capabilities and other resources in a system for the automated processing of customer-company interactions.
This system is a solution for customer relations management. The central element of the system is a software-based router that manages the conversation with the end user either in real-time (voice, online text chats) or correspondence (e-mail). The router follows rules (herein called "workflows") developed and tweaked over time by business analysts. These rules form a script for the router to follow when interacting with end users. The router draws on both text-to- speech capabilities and prerecorded responses when replying to an end user. For interpretation of user utterances, the router employs both speech recognition technology and the interpretive abilities of human customer service agents, seamlessly blending the two. This blending can be performed in real-time or near real-time to allow the router to carry on a conversation-like interaction with an end user. The incorporation of human interpretation of user utterances or written text allows the router to use open-ended, conversational prompts, and to respond in context to user input that software might find ambiguous. Users are thus able to interact with the system using everyday language, and are not forced into a narrow range of responses.
The system integrates human agents in an innovative way. Because user input is digitized, the router can direct only those portions of the input that require human interpretation to human agents. No one customer service agent is tied to a customer conversation for its entire duration; the interaction is managed by the router itself. Also, the router is able to send the digitized input to more than one human agent for simultaneous interpretation, which provides double and triple checking of each answer from the agents. Such double and triple check also provides an objective measure and ranking of the speed and accuracy of the agents. The system is designed to work over a TCP/IP network, so that the customer service agents can be located virtually anywhere in the world. Advantageously, the system comprises off-the-shelf hardware and software, some customized software (as noted below), and can be integrated with existing company resources, such as databases and telephone networks. Brief Description of the Drawings Further features of the invention, its nature and various advantages will be more apparent from the following detailed description of the preferred embodiment, taken in conjunction with the accompanying drawings, in which like reference characters refer to like parts throughout, and in which:
FIG. 1 is a block diagram illustrating one embodiment of an architecture of an interactive response system according to an exemplary embodiment of this invention;
FIG. 2 is a flow chart illustrating an embodiment of a method of the present invention for communication among a customer, the interactive response system and a human interface;
FIG. 3 A is a chart illustrating one embodiment of a customer/interactive response system interaction in the context of FIG. 2;
FIG. 3B is a computer screen illustrating one embodiment for capturing customer intent and data in the context of FIG. 2;
FIG. 4A is a chart illustrating one embodiment of a customer/interactive response system interaction in the context of FIG. 2; FIG. 4B is a computer screen illustrating one embodiment for capturing customer intent and data in the context of FIG. 2;
FIG. 5 A is a chart illustrating one embodiment of a customer/interactive response system interaction in the context of FIG. 2;
FIG. 5B is a computer screen illustrating one embodiment for capturing customer intent and data in the context of FIG. 2; and
FIG. 6 is a flow chart of processing an email in the context of an interactive response system in accordance with another aspect of this invention. Detailed Description
FIG. 1 illustrates one embodiment of an architecture, of the type in which the present invention can be used, for connecting an interactions platform 102 to an interactive response system 100 through an interactive router 101 (herein referred to as an "iRouter"). As shown in FIG. 1, interactions platform 102 is connected to a customer 103 through communications link 104. Interactions platform 102 is also connected to interactive response system 100 at iRouter 101 via a datalink, which comprises a TCP/IP data link in this exemplary embodiment. Interactions platform 102 in this exemplary embodiment comprises a computer server. The exact configuration of the computer server varies with the implementation but typically consists of a Pentium-based server running an operating system such as Windows XP Professional or Linux, using a voice board from a vendor such as Dialogic. Interactions platform 102 can also be an e-mail gateway or web server. Thus, customer input enters interactive response system 100 via telephone or intercom and text is entered via email or an interactive chatting interface (e.g., a web page or a stand-alone application such as AOL Instant Messenger). In this architecture of FIG. 1, a number of different types of devices can be used to implement each of the interactions platform 102 and communications links 104. Interactions platform 102 may be implemented by any device capable of communicating with the customer 103. For example, interactions platform 102 may be a telephony server in interactive response system 100 where the customer is calling by telephone. The telephony server handles answering, transferring and disconnecting incoming calls. The telephony server is also a storehouse for prerecorded audio clips so that it can play any welcome prompt and as other audio clips as directed by iRouter 101.
A telephony server in accordance with this embodiment is assembled from off- the-shelf components, for example Windows XP Professional for an operating system, a central processor, such as a Pentium processor, and an Intel Dialogic voice board. Using this architecture, the communications link 104 may be implemented by any means of providing an interface between the customer's telephone and the telephony server. For example, communications link 104 may be a dial-up connection or a two-way wireless communication link.
In another exemplary embodiment, interactions platform 102 may be a gateway server in interactive response systems 100. h accordance with this exemplary embodiment, the customer interacts with the interactive response server by e-mail or by interactive text chats. The gateway server runs customized open source e-mail or www server software. Further, a gateway server in accordance with this exemplary embodiment is designed to conduct e-mail and interactive text chat transactions with customers, while also forwarding and receiving data to other elements of the system. Using this architecture, the communications link 104 maybe implemented by any means of providing an interface between the customer's computer and the gateway server. For example, communications link 104 may be a dedicated interface, a single network, a combination of networks, a dial-up connection or a cable modem. While only one interactions platform 102 is illustrated in FIG.1 , one skilled in the art will appreciate that multiple interactions platforms 102 may be used in this system after studying this specification. With multiple interactions platforms 102, an interactive response system may communicate via voice and text data with a customer. Further, multiple customer bases may be accommodated by a dedicated interactions platform 102 for each of the customer bases. In this manner, a workflow (as will be described further, below) can be selected by determining which of the multiple interactions platforms 102 initiated the interaction.
In the architecture of FIG. 1, the iRouter 101 comprises software to control interactive response system 100. iRouter 101 "owns" the interaction with customer 103 from beginning to end by coordinating activity among other components and managing the transaction. iRouter 101 manages interactions with customer 103 according to one or more programmable scripts, called, according to this exemplary embodiment, "workflows." i general, a workflow comprises an interaction flow wherein the path through the workflow depends upon intent and data input from the customer. Workflows are preprogrammed by system engineers and, advantageously, periodically "tweaked" in order to improve customer satisfaction, speed, accuracy, etc. In accordance with this exemplary embodiment and in contrast to the prior art, iRouter 101 is almost always "in charge" of selecting the next step or path in the workflow. iRouter 101 receives interaction input from interactions platform 102 in the form of audio clips, email, text data or other interaction type ~ depending on the form of customer communication - and forwards the input to one or more human agents 105, speech recognition engines or expert systems (collectively 108) and uses the responses to advance its current workflow. When human interpretation (or translation) of the input is necessary, iRouter 101 directs human agent desktop software to display an appropriate visual context of the current workflow. Once iRouter 101 understands the input, iRouter 101 advances through the workflow and directs interactions platform 102 to respond appropriately to customer 103. In an exemplary embodiment wherein interactions platform 102 comprises a telephony server, iRouter 101 may deliver sound clips to play back to a customer, send text-to- speech clips or both. Alternatively, interactions platform 102 may store sound clips, have text- to-speech capability or both, this embodiment, iRouter directs interactions platform 102 as to what to play to a customer and when. iRouter 101 comprises, in this exemplary embodiment, a networked, off-the-shelf commercially available processor running an operating system such as Windows XP or Linux. Further, iRouter 101 software includes a modified open NXML browser and voice XML script incorporating objects appropriate to the specific application. One skilled in the art will understand how to construct these objects after studying this specification.
In accordance with the exemplary architecture of FIG. 1, interactive response system 100 includes at least one pool of human agents 105. A pool of human agents 105 is often located at a contact center site. Human agents 105, in accordance with the present embodiment of this invention, use specialized desktop software specific to system 100 (as will be described further, below, in connection with FIGs. 3B, 4B and 5B) that presents a given workflow on their screen —along with a history or context of the customer interaction to that point. The human agent or agents 105 interpret the input and select an appropriate customer intent, data or both in the workflow.
For telephone interactions, human agents 105 wear headphones and hear sound clips streamed from the telephony server 102 at the direction of iRouter 101. i accordance with one aspect of this invention, a single human agent 105 will not handle the entire transaction for customer 103. Rather, human agent 105 handles some piece of the transaction that has been designated by the workflow designer as requiring human interpretation of customer's 103 utterance. IRouter 101 can send the same customer 103 interaction to any number of human agents 105, and may distribute pieces of a given interaction to many different human agents 105. In accordance with the exemplary embodiment of this invention, human agents 105 are preferably off-site. Further, human agents 105 maybe in diverse geographic areas of the world, such as India, the Philippines and Mexico. Human agents 105 may be in groups in a building or may be working from home. In applications that require 24/7 human agent support, human agents may be disposed around the world so that each human agent may work during suitable business hours. Interactive response system 100 of the present invention employs custom human agent application software. Human agents 105 use a custom application developed in Java and running on a standard call center computer network workstation. Generally speaking, interactive response system 100 applies human intelligence towards interpretation of customer 103 input into "intent" (what the customer wants) and data (any input requires to determine what the customer wants). The interpretation normally comprises selecting the most-correct interpretation of what was said from a list of choices, in this exemplary embodiment.
Workflow server 106 of the present invention, an off-the-shelf component, is an archive of the workflows used by the Interactions router. Workflow server 106 can be built with off-the-shelf hardware using a commercially available processor running a standard server operating system, with the workflow documents written in XML in this exemplary embodiment. Workflow server 106 maintains a compilation of business rules that govern the behavior of iRouter 101.
Interactive response system 100 employs a workflow designer used by a business analyst or process engineer to map out workflows. A workflow serves as the map that iRouter 100 follows in a given interaction, with speech recognition or human agents. The workflow "steers" iRouter 100 along a path in the workflow in response to customer input. A place in the workflow, along with data collected to that point is called a "context."
A visual version of the workflow is seen by human agent 105. The workflow designer builds instructions for human agent 105 into the workflow in order to guide human agent 105 in choosing the next appropriate step. The workflow designer preferably consists of a version of Eclipse software development environment customized to focus on building XML documents. However, one skilled in the art will be able to develop a workflow designer after studying this specification. Performance and interactions archive 107 of the present invention comprises a database that can be maintained on any common computer server hardware. Performance and interactions archive 107 contains both archival data of system transactions with customers 103 (i.e., a repository of sound clips, e-mails, chats, etc. from interactions with customer 103) as well as performance data for human agents 105. The present invention employs "reporter" software to generate statistics about a group of interactions or to display performance ranking for human agent 105. Reporter software can also reconstruct an interaction with customer 103 from sound clips, e-mails, or chat text that constituted customer's 103 contact stored in interactions archive 107. Reporter software is a series of simple scripts, and can run on any common server hardware.
The present invention also comprises, in this exemplary embodiment, manager/administrator software, usually run from the same station as reporter software.
Manager/administrator software sets operating parameters for interactive response system 100.
Such operating parameters include, but are not limited to, business rules for load balancing, uploading changes in workflow, and other administrative changes. Manager/administrator software is often a small custom Java application running on a standard call center computer workstation.
Support system 108 of the present invention consist of numerous databases and customer proprietary systems (also including off-the-shelf speech recognition software such as
Speechworks) that may be employed in responding to customer 103 requests. For example, support system 108 may include a database for customer information or a knowledge base. Speech recognition software is, in this exemplary embodiment, an off-the-shelf component used to interpret customer 103 utterances. Support system 108 may also include a text-to-speech capability, often off-the-shelf software that reads text to customer 103.
Company agents 109 of the present invention consist of human agents that handle customer 103 requests not relevant to interactive response system 100 matters. For example, should customer 103 require specific assistance with a company matter that an outsourced human agent 105 is not capable of handling, interactive response system 100 transfers the call to company agent 109.
The elements of interactive response system 100 communicate over a TCP/IP network in this exemplary embodiment. Communication is driven by the workflow that iRouter 101 follows. "Database" in the present embodiment can be a flat file database, a relational database, an object database, or some combination thereof. A database is searchable using any database searching language, such as Structured Query Language (SQL).
"Server" and "workstation" refer to any general purpose computer system which is programmable using a computer programming language, such as C++, Java, or other language, such as a scripting language or assembly language. These computer systems may also include specially programmed, special purpose hardware, for example Intel Dialogic voice boards. The invention is not limited to a particular computer platform, particular processor, particular operating system, or particular high-level programming language. Additionally, the computer system may be a multiprocessor computer system or may include multiple computers connected over a computer network. The invention is not limited to any particular implementation using software or hardware or firmware, or any combination thereof. Turning now to FIG.'s 2 through 5, these figures illustrate an example of how information is retrieved and handled by interactive response system 100 when a customer interacts with the interactive response system 100 via telephone. The example shown in FIG. 2 presupposes that all required hardware, software, networking and system integration is complete, and that a business analyst has mapped out the possible steps in a customer interaction using the graphic workflow designer. The business analyst also has scripted the text for anything that the interactive response system may say to a customer, including, but not limited to, the initial prompt (e.g., "Thank you for calling, how can I help you today?"), response(s) to a customer, requests for additional information, "stutter speech" (sounds sent to the customer while the iRouter is determining a response), and a closing statement. Either text-to-speech software or voice talent records the server-side speech pieces as written by the business analyst. This workflow is then loaded into the interactive response system where it is available to the iRouter.
As shown in block 201, the interaction begins with the customer calling the customer service telephone number of a company. The interactions platform, in this case a telephony server, answers the telephone call and retrieves the appropriate workflow stored in the workflow database, based on either (1) ANI/DNIS information of the caller or (2) other business rules (e.g., line or trunk the call came in on), as illustrated at 202. The telephony server then plays the appropriate welcome prompt as illustrated at 203 and the customer then responses to that prompt (block 204). For purpose of example, an imaginary airline, iterair, provides customer service via an interactive response system in accordance with a call center embodiment of this invention. The interaction platform is therefore a telephony interface and iRouter selects a workflow appropriate to Interair.
A first point or context in the workflow is shown in the illustrative workflow of FIG. 3 A. There is no customer utterance, thus no intent or data to capture (and respond to). The only response is the greeting and the prompt for customer input. Processing proceeds to box 204 in the flowchart of FIG. 2. The telephony server begins digitizing the customer's spoken input and connects to the iRouter. At this point, workflow or business rules determine if the interactive response to the customer needs to be handled by a human agent or speech recognition software. That is, the iRouter selects the appropriate workflow for the call from the workflow repository and follows the workflow rules to conduct a conversation with the customer.
To interpret customer speech, iRouter uses software-based speech recognition from the support systems or has the customer's audio streamed to human agents in contact centers as appropriate, as illustrated in block 205. If human agents are required by the workflow, iRouter identifies available human agents by applying a load balancing algorithm, triggers workflow pop-up on their screens (as illustrated in the initially blank pop-up screen, FIG. 3B), and begins streaming customer audio to the one or more identified human agents, as shown at block 207. The human agent(s) hear the customer utterance in headphones, and computer software prompts for an interpretation of the utterance as shown in blocks 210 and 211. In accordance with the exemplary workflow of FIG. 4 A, the customer utterance that the human agent or agents hear is "I need to check my flight from Chicago to London this afternoon." The agents' screen indicates the current context (or point in the workflow) as illustrated in FIG. 4B. In this illustrative screen shot, there are 12 possible requests (including unanswerable and terminate) that the human agent can select. In operation, there are several hundred possible interpretations available to the agents. Such multiplicity of selection allows the agents interpretive flexibility, which enables the iRouter to jump around in its workflow according to the interpreted intent. Thus, in accordance with one aspect of this invention, the iRouter can respond appropriately even if the customer changes subjects in midstream.
In each case, each agent selects what he or she feels is the best fit interpretation of the customer utterance in the current context of the workflow. In example of FIG. 4B, the human agent(s) selects "CFT" (Check Flight Time) and enters or selects from drop down menus the departure and arrival cities (or other, preprogrammed information that the customer could possibly utter).
Note that, in blocks 208 and 209, human agents can elect to apply acceleration to the customer audio clip(s) received at the station in order to compensate for any response delay (usually due to lag time in application set-up - the time it will take for human agent desktop software to accept the streaming audio and display the appropriate workflow). Network latency might be around 0.2 seconds, where application delay could be more in the 1+ second range. To compensate for the application delay, the interactive response system accelerates the voice clip (although not to the point of discernible distortion). The purpose is to strive for a more "real- time" conversational interaction, so that the customer does not experience a notable delay while awaiting a response. The acceleration is applied to the speech as it is streaming from the telephony server. The acceleration can never overcome the inherent latency of the link but will allow human agents to "recover" any application set-up time and reduce the amount of lag time in the interaction, ideally up to the limits imposed by latency in the network. However, acceleration is optional, wherein a novice agent may need a slower playback, while a more experienced agent may apply acceleration.
In test 213, the iRouter evaluates the accuracy, in real time, of the customer audio interpretation and updates each agent's speed/accuracy profile. Next, in block 214, the iRouter processes the interpretation and performs the next step(s) in the workflow (e.g., database lookup based on input data) and then forwards an appropriate response 218 to the customer through the telephony server (if the interpretation is deemed accurate). If the iRouter determines the interpretation is accurate, it directs the playback of responses to the customer from the telephony server based on the interpretation of either the speech recognition software or by applying key algorithms to the responses of one or more human agents. In this example, the response is given in the last block of screen 2, FIG. 4A. .
To determine accuracy, the iRouter compares the interpretation of two human agents, and, if no consensus is reached, plays the customer audio clip for a third human agent for a further interpretation (i.e., "majority rule" determines which is the accurate response). Other business rules may also be used to determine the accurate interpretation. For example, an interpretation from the agent with the best accuracy score may be selected. Alternatively, one of the interpretations may be selected and, played back to the customer ("I understood you to say ...") and the customer response determines whether the interpretation was correct. Further, the interpretations may be selected from known data (e.g., two interpretations of an email address could be compared against a database of customer email addresses, only one of two interpretations of a credit card number will pass a checksum algorithm, etc). The interactive response system allows for virtually any number of human agents to handle to same customer interaction at once. That is, an interactive response system could have two agents listening during a busy time or have seven human agents listening during a more idle time. Moreover, during times of high call volume, accuracy can be decreased by removing the "double-checking" rule to maintain high response time. An agent assigned a high trust ranking based on the agent's speed/accuracy profile may be asked to work without the double- checking. In addition to trading off accuracy for quicker system availability, a steady flow of audio clips is flowing by each agent, thereby decreasing human agent "slack" time.
Returning to the flowchart of FIG. 2, either the customer will respond again as seen in block 204, the call will be transferred (if so directed by a step in the workflow or by business rules), or the customer terminates the call, as shown in block 215. If the interpretation is deemed inaccurate in block 213, the iRouter plays a stall speech to the customer (block 216) and send the audio clip to another human agent for another interpretation (block 217) and then reevaluate its accuracy. The iRouter manages interaction with the customer to call completion, using the workflow as its guide. The iRouter may stream customer utterances to human agents for interpretation at numerous points in the call. Once the call has concluded, a snapshot of the customer interaction is preserved in the archive database. Human agents' speed/accuracy profiles are constantly updated and maintained. If human intervention is not needed to interpret customer's request, speech recognition software interprets the audio clip and the iRouter determines the appropriate response as shown in blocks 206 and 214.
Continuing with the Interair example, the captured customer utterance, as seen in FIG. 5A, has two requests: food and entertainment queries. In accordance with another aspect of this invention, the human agent captures two intents: meal and movie. There is no relevant data to enter because the interactive response system already knows the flight information from the previous data entered in FIG. 4B (this data is visible in FIG. 5B). As seen in FIG. 5B, the human agent enters "General" and "Meal." The human agent also enters "Movie." As seen in FIG. 5A, the interactive response system then provides the appropriate response. As seen in FIG. 5B, if the customer requests further information regarding the meal or movie such as: "what meal is offered?", "Are their special meals?", "What is the movie rated?", the appropriate human agent interpretation options are located on the computer screen.
FIG. 6 illustrates an example of how information is retrieved and handled by the interactive response system when a customer interacts via electronic mail (email, as it is commonly lαiown in the art). As shown in block 601, the interaction begins with the customer emailing to the customer service email of a company. The interactions platform, in this exemplary embodiment, a gateway server, opens the email and retrieves the appropriate workflow stored in the workflow database based on either (1) the to/from information of the customer or (2) other business rules, as illustrated at 602. The gateway server then sends the appropriate response acknowledgement as illustrated at 602. Then the iRouter identifies available human agent(s) to handle the email by applying a load balancing algorithm, triggers workflow pop-up on their screens, and begins streaming customer audio to the or those human agents, as shown at block 603. The human agent(s) interpret the email as shown in blocks 604 and 605. After test 606, where the iRouter evaluates the accuracy, in real time, of the customer email interpretation and updates each agent's speed/accuracy profile, the iRouter processes the interpretation and performs the next steps in the workflow accordingly. Eventually, the iRouter forwards an appropriate email response to the customer through the gateway server (if the interpretation is deemed accurate) as seen in block 607. The emails are then archived in the appropriate database as illustrated in block 608. If the interpretation is deemed inaccurate, the iRouter sends the email to another human agent for another interpretation (block 609) and then reevaluate its accuracy. The iRouter manages interaction with the customer to email response, using the workflow as its guide.
Other features of the present invention include: a seamless blend of speech recognition software and human agent interaction to provide added customer privacy and security such that human access to confidential customer data is minimized. In a customer contact center environment, customer personal data, such as credit card information, social security number and address, is routinely made available to human agents interacting with customers. The present invention uses a software-based front-end that incorporates speech recognition technology to allow a computer to capture, verify, or update a customer's sensitive data. The software manages the customer interaction so that confidential data is stored in a database and not passed to any human agent. The software can stream audio clips of the customer's utterances over a TCP/IP network to client software being used by human agents any time that human intervention is required. In cases where the workflow does require that a human agent handle sensitive customer information, the transaction is portioned into discrete, logical units allows business analysts to engineer the process so that the same human agent never sees more than one element of a given set of customer data. For example, if two agents see a particular customer's credit card number, two different agents see the customer's name. No one agent sees a full record or profile for a given customer. This helps call center operations, which often experience high agent turnover, minimize the problem of identity theft.
Other features of the present invention include: interactions platform 102 may also accommodate still pictures in any format (e.g., jpeg, tiff), motion pictures, scanned data, facsimiles, web pages, etc., which can be forwarded to a human agent's station. Such facility is useful, for example, for monitoring alarms, parsing faxes, etc. The human agent's interpretation is then delivered to the iRouter in the context of the workflow, as above.
It will be understood that the foregoing is only illustrative of the principles of the invention and that various modifications can be made by those skilled in the art without departing form the scope of the invention, which is limited only by the claims that follow.

Claims

WHAT IS CLAIMED IS:
1. A method for operating an interactive response system comprising: selecting a computer-operated workflow depending upon how input is received; receiving free form input; interpreting said free form input into intent and data in the context of a current point the workflow; determining automatically a next point in said workflow based on the interpreted intent and data at the current point in the workflow; and taking further action responsive to the next point in the workflow.
2. A method in accordance with claim 1 wherein interpreting said free form input compπses: deteπnining a best fit of said free form input in the current context of said workflow.
3. A method in accordance with claim 2 wherein determining a best fit comprises: interpreting said free form input automatically.
4. A method in accordance with claim 2 wherein determining a best fit comprises: interpreting said free form input by a human agent.
5. A method in accordance with claim 2 wherein determining a best fit comprises: interpreting said free form input by a plurality of human agents.
6. A method in accordance with claim 5 wherein determining a best fit comprises: interpreting said free form input by said plurality of human agents; and interpreting said free form input by another human agent if said plurality of human agents do not agree.
7. A method in accordance with claim 5 wherein determining a best fit comprises: interpreting said free form input by said plurality of human agents, and, if said plurality of human operators do not agree; acting on one interpretation of said free form input of the majority of said human operators.
8. A method in accordance with claim 5 wherein determining a best fit comprises: interpreting said free form input by said plurality of human agents, and, if said plurality of human agents do not agree; acting on one interpretation of said free form input based on past performance of the plurality of human agents.
9. A method in accordance with claim 5 wherein determining a best fit comprises: interpreting said free form input by said plurality of human agents, and, if said plurality of human agents do not agree; selecting an interpretation of said input and transmitting the interpretation to a customer for verification.
10. A method in accordance with claim 5 wherein determining a best fit comprises: interpreting said free form input by said plurality of human agents, and, if said plurality of human agents do not agree; acting on one interpretation of said free form input based on known data or algorithms.
11. A method in accordance with claim 1 wherein taking further action comprises: changing said next point to said current point in the workflow; prompting for further input; receiving further free form input; interpreting said further free form input into said content and data in said context of said current point in said workflow; determining said next point in said workflow based on the interpreted intent and data at said current point in said workflow; and taking further action responsive to said next point in said workflow.
12. A method in accordance with claim 11 wherein interpreting said free form input comprises interpreting said free form input at a first means for interpretation and wherein interpreting said further free form input comprises interpreting said further free form input at a second means for interpretation.
13. A method in accordance with claim 11 wherein interpreting said free form input comprises interpreting said free form input at a first human agent station and wherein interpreting said further free form input comprises interpreting said further free form input at a second human agent station.
14. A method in accordance with claim 11 wherein said interactive response system includes a plurality of human agent stations further including: delivering said free form input to one or more of said plurality of human agent stations such that each of said plurality of human agent stations only receives selected free form input during said workflow.
15. A method in accordance with claim 14 wherein delivering said free form input comprises delivering said free form input so that each of said plurality of human agents stations receives limited access to confidential data.
16. An interactive response system comprising: one or more interactions platforms connected to a network configured to receive free form input over the network; a plurality of human agent position systems configured to interpret said free form input into intention and data; and an interactions router connected to said interactions platform and said plurality of human agent positions configured to receive said free form input from said one or more interactions platforms, forward said free form input to one or more of said plurality of human agent position systems, receive said intention and data from said one or more of said plurality of human agent position systems and take further action based on said intention and data from said free form input.
17. An interactive response system in accordance with claim 16 wherein one or more of said plurality of human agent position systems comprises automatic speech recognition systems.
18. An interactive response system in accordance with claimlό wherein one or more of said plurality of human agent position systems comprises human agent stations.
19. An interactive response system in accordance with claim 18 wherein said interactions router is further configured to forward said free form input to one or more of said plurality of human agent position systems depending on a load factor.
20. An interactive response system in accordance with claim 19 wherein said interactions router is configured to forward said free form input to one of said plurality of human agent position systems when said load factor is high.
21. An interactive response system of claim 19 wherein said interactions router is configured to forward said free form input to a set of said plurality of human agent position systems when said load factor is low.
22. An interactive response system in accordance with claim 19 wherein said interactions router is configured to forward said free form input to two of said plurality of human agent position systems and, if said intention and data is not the same from said two of said plurality of human agent positions systems, forwarding said free form input to a third one of said plurality of human agent position systems.
23. An interactive response system in accordance with claim 19 wherein said interactions router is further configured to deliver contextual information with said free form input to said human agent position systems.
24. An interactive response system in accordance with claim 23 wherein said contextual information comprises one or more display screens.
25. An interactive response system in accordance with claim 19 wherein said human agent position systems are geographically diverse.
26. An interactive response system in accordance with claim 16 wherein said free form input comprises voice input.
27. An interactive response system in accordance with claim 16 wherein said free form input comprises email.
28. An interactive response system in accordance with claim 16 wherein said free form input comprises textural data.
29. A method for operating an interactive response system, said interactive response system comprising an interactions platform connected to a network, a plurality of human agent position systems configured to interpret free form input into intention and data and an interactions router comiected to said interactions platform and said plurality of human agent positions, said method comprising: selecting a computer-operated workflow and a context in said workflow at said interactions router responsive to data received at said interactions platform; receiving said free form input from said network at said interactions platform; forwarding said free form input from said interactions platform to said interactions router; sending said free form input and data representative of context in said workflow from said interactions router to one or more of said plurality of human agent positions; interpreting said free form input into intent and data in said context of said workflow; sending said intent and data to said interactions router; determining automatically a further context in said workflow at said interactions router based on said interpreted intent and said data at said current context in said workflow; and taking further action responsive to said further context in said workflow.
30. A method in accordance with claim 29 wherein said plurality of human agent positions comprise human agents and position systems and wherem interpreting said free form input comprises interpreting said free form input by said human agent and entering said intent and data into said position system by said human agent.
31. A method in accordance with claim 29 where forwarding said free form input comprises forwarding said free form input to said plurality of human agent positions, and wherein said method further includes: determining whether said intent and data received from said human agent positions agree; and forwarding said free form input to another of said plurality of human agent positions if said intent and data received from said human agent positions does not agree.
32. A method in accordance with claim 29 wherein' said free form input is only sent to one of said plurality of human agent positions once during a workflow.
33. A method in accordance with claim 29 wherein said free form input comprises human utterances, and wherein sending said free form input includes accelerating said human utterances.
PCT/US2004/013946 2003-05-05 2004-05-05 Apparatus and method for processing service interactions WO2004099934A2 (en)

Priority Applications (4)

Application Number Priority Date Filing Date Title
CA002524591A CA2524591A1 (en) 2003-05-05 2004-05-05 Apparatus and method for processing service interactions
NZ543885A NZ543885A (en) 2003-05-05 2004-05-05 Apparatus and method for processing service interactions
EP04751363A EP1620777A4 (en) 2003-05-05 2004-05-05 Apparatus and method for processing service interactions
AU2004237227A AU2004237227B2 (en) 2003-05-05 2004-05-05 Apparatus and method for processing service interactions

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US46793503P 2003-05-05 2003-05-05
US60/467,935 2003-05-05

Publications (2)

Publication Number Publication Date
WO2004099934A2 true WO2004099934A2 (en) 2004-11-18
WO2004099934A3 WO2004099934A3 (en) 2009-04-09

Family

ID=33435145

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2004/013946 WO2004099934A2 (en) 2003-05-05 2004-05-05 Apparatus and method for processing service interactions

Country Status (6)

Country Link
US (5) US7606718B2 (en)
EP (1) EP1620777A4 (en)
AU (1) AU2004237227B2 (en)
CA (1) CA2524591A1 (en)
NZ (1) NZ543885A (en)
WO (1) WO2004099934A2 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8666040B2 (en) 2006-09-22 2014-03-04 International Business Machines Corporation Analyzing Speech Application Performance
CN104883299A (en) * 2015-06-24 2015-09-02 上海斐讯数据通信技术有限公司 Router configuration method, system and router

Families Citing this family (81)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6603835B2 (en) 1997-09-08 2003-08-05 Ultratec, Inc. System for text assisted telephony
US8416925B2 (en) 2005-06-29 2013-04-09 Ultratec, Inc. Device independent text captioned telephone service
US20050144143A1 (en) * 2003-09-03 2005-06-30 Steven Freiberg Method and system for identity theft prevention, detection and victim assistance
US7783513B2 (en) * 2003-10-22 2010-08-24 Intellisist, Inc. Business performance and customer care quality measurement
US8515024B2 (en) 2010-01-13 2013-08-20 Ultratec, Inc. Captioned telephone service
US7912206B2 (en) * 2004-07-30 2011-03-22 Miller John S Technique for providing a personalized electronic messaging service through an information assistance provider
US7593962B2 (en) * 2005-02-18 2009-09-22 American Tel-A-Systems, Inc. System and method for dynamically creating records
US11258900B2 (en) 2005-06-29 2022-02-22 Ultratec, Inc. Device independent text captioned telephone service
US8265260B2 (en) * 2005-07-28 2012-09-11 At&T Intellectual Property I, L.P. Methods, systems, and computer program products for providing human-assisted natural language call routing
US8565409B1 (en) 2005-10-28 2013-10-22 At&T Intellectual Property Ii, L.P. Method and apparatus for routing a call to a subject matter expert via a packet network
US8027457B1 (en) * 2005-12-01 2011-09-27 Cordell Coy Process for automated deployment of natural language
US20070162282A1 (en) * 2006-01-09 2007-07-12 Gilad Odinak System and method for performing distributed speech recognition
US20080109489A1 (en) * 2006-11-03 2008-05-08 Adrian Sherwood Method For Generating Reports
US20080133346A1 (en) * 2006-11-30 2008-06-05 Jyh-Herng Chow Human responses and rewards for requests at web scale
US20080232570A1 (en) * 2007-03-20 2008-09-25 Avaya Technology Llc Automatic Reconstitution of Telecommunications Sessions
EP1976255B1 (en) 2007-03-29 2015-03-18 Intellisist, Inc. Call center with distributed speech recognition
US20080270142A1 (en) * 2007-04-25 2008-10-30 Find 1-4-U Inc. Remote Interactive Information Delivery System
US8005198B2 (en) * 2007-06-29 2011-08-23 Avaya Inc. Methods and apparatus for defending against telephone-based robotic attacks using permutation of an IVR menu
US7978831B2 (en) * 2007-06-29 2011-07-12 Avaya Inc. Methods and apparatus for defending against telephone-based robotic attacks using random personal codes
US8005197B2 (en) * 2007-06-29 2011-08-23 Avaya Inc. Methods and apparatus for defending against telephone-based robotic attacks using contextual-based degradation
US8635069B2 (en) 2007-08-16 2014-01-21 Crimson Corporation Scripting support for data identifiers, voice recognition and speech in a telnet session
CA2665009C (en) * 2008-05-23 2018-11-27 Accenture Global Services Gmbh System for handling a plurality of streaming voice signals for determination of responsive action thereto
CA2665014C (en) 2008-05-23 2020-05-26 Accenture Global Services Gmbh Recognition processing of a plurality of streaming voice signals for determination of responsive action thereto
CA2665055C (en) * 2008-05-23 2018-03-06 Accenture Global Services Gmbh Treatment processing of a plurality of streaming voice signals for determination of responsive action thereto
US20100217603A1 (en) * 2009-02-26 2010-08-26 Hammond Daniel D Method, System, and Apparatus for Enabling Adaptive Natural Language Processing
US8995423B2 (en) 2009-10-21 2015-03-31 Genesys Telecommunications Laboratories, Inc. Multimedia routing system for securing third party participation in call consultation or call transfer of a call in Progress
US20110258126A1 (en) * 2010-04-14 2011-10-20 Lg Chem, Ltd. Systems and methods for determining a warranty obligation of a supplier to an original equipment manufacturer for a vehicle battery pack
US8898723B2 (en) * 2010-08-20 2014-11-25 Sony Corporation Virtual channel declarative script binding
CA2748080A1 (en) * 2010-08-20 2012-02-20 Advanis Inc. System and method for conducting a computer-aided telephone interview
US9245525B2 (en) 2011-01-05 2016-01-26 Interactions Llc Automated speech recognition proxy system for natural language understanding
US9472185B1 (en) 2011-01-05 2016-10-18 Interactions Llc Automated recognition system for natural language understanding
US8560321B1 (en) * 2011-01-05 2013-10-15 Interactions Corportion Automated speech recognition system for natural language understanding
US20130006874A1 (en) * 2011-06-30 2013-01-03 Avaya Inc System and method for preserving context across multiple customer service venues
US8923501B2 (en) * 2011-07-29 2014-12-30 Avaya Inc. Method and system for managing contacts in a contact center
US9064259B2 (en) 2012-12-19 2015-06-23 Genesys Telecomminucations Laboratories, Inc. Customer care mobile application
US9984374B2 (en) 2013-02-25 2018-05-29 Genesys Telecommunications Laboratories Inc. Mobile expert desktop
US9088656B2 (en) 2012-12-12 2015-07-21 Genesys Telecommunications Laboratories, Inc. System and method for access number distribution in a contact center
US8649501B1 (en) 2012-12-28 2014-02-11 Convergent Resources Holdings, LLC Interactive dialing system
WO2014130499A1 (en) * 2013-02-21 2014-08-28 Edward Fredkin Personalized agent pool for interaction with multiple automated providers
US8767948B1 (en) 2013-03-15 2014-07-01 Genesys Telecommunications Laboratories, Inc. Back office services of an intelligent automated agent for a contact center
US9710800B2 (en) * 2013-05-03 2017-07-18 Oracle International Corporation Using voice input at a mobile point of sale
US10389876B2 (en) 2014-02-28 2019-08-20 Ultratec, Inc. Semiautomated relay method and apparatus
US20180270350A1 (en) 2014-02-28 2018-09-20 Ultratec, Inc. Semiautomated relay method and apparatus
US10748523B2 (en) 2014-02-28 2020-08-18 Ultratec, Inc. Semiautomated relay method and apparatus
US10878721B2 (en) 2014-02-28 2020-12-29 Ultratec, Inc. Semiautomated relay method and apparatus
US20180034961A1 (en) 2014-02-28 2018-02-01 Ultratec, Inc. Semiautomated Relay Method and Apparatus
US11205181B2 (en) * 2014-03-07 2021-12-21 Transform Sr Brands Llc Merchandise return and/or exchange systems, methods, and media
US9947342B2 (en) 2014-03-12 2018-04-17 Cogito Corporation Method and apparatus for speech behavior visualization and gamification
US9325849B1 (en) * 2014-03-14 2016-04-26 Directly, Inc. Customer service routing
US9569751B2 (en) * 2014-05-29 2017-02-14 Avaya Inc. Mechanism for creation and utilization of an attribute tree in a contact center
US9516167B2 (en) 2014-07-24 2016-12-06 Genesys Telecommunications Laboratories, Inc. Media channel management apparatus for network communications sessions
US10033797B1 (en) 2014-08-20 2018-07-24 Ivanti, Inc. Terminal emulation over HTML
US9866649B2 (en) * 2014-11-07 2018-01-09 Iac Search & Media, Inc. Automatic scaling of system for providing answers to requests
US10276188B2 (en) 2015-09-14 2019-04-30 Cogito Corporation Systems and methods for identifying human emotions and/or mental health states based on analyses of audio inputs and/or behavioral data collected from computing devices
US11100278B2 (en) 2016-07-28 2021-08-24 Ivanti, Inc. Systems and methods for presentation of a terminal application screen
US10586188B2 (en) 2016-11-08 2020-03-10 Wipro Limited Method and system for dynamic recommendation of experts for resolving queries
US10380516B1 (en) * 2017-01-16 2019-08-13 Directly Software, Inc. CRM including multi-thread messaging
US10410626B1 (en) * 2017-01-16 2019-09-10 Directly Software, Inc. Progressive classifier
US10440187B1 (en) * 2017-01-16 2019-10-08 Directly Software, Inc. Bootstrapped predicative routing in CRM
US10176808B1 (en) * 2017-06-20 2019-01-08 Microsoft Technology Licensing, Llc Utilizing spoken cues to influence response rendering for virtual assistants
KR102348904B1 (en) 2017-07-25 2022-01-07 삼성에스디에스 주식회사 Method for providing chatting service with chatbot assisted by human counselor
KR102338618B1 (en) 2017-07-25 2021-12-10 삼성에스디에스 주식회사 Method for providing chatting service with chatbot assisted by human agents
US10621282B1 (en) * 2017-10-27 2020-04-14 Interactions Llc Accelerating agent performance in a natural language processing system
KR101914583B1 (en) 2018-02-12 2018-11-05 주식회사 머니브레인 Interactive ai agent system and method for actively providing a security related service based on monitoring of a dialogue session among users via the dialogue session or a separate session, computer readable recording medium
US10789943B1 (en) 2018-08-31 2020-09-29 Interactions Llc Proxy for selective use of human and artificial intelligence in a natural language understanding system
KR20200059558A (en) 2018-11-21 2020-05-29 삼성전자주식회사 Device for generating user profile and system comprising the device
US10623572B1 (en) * 2018-11-21 2020-04-14 N3, Llc Semantic CRM transcripts from mobile communications sessions
US11017778B1 (en) 2018-12-04 2021-05-25 Sorenson Ip Holdings, Llc Switching between speech recognition systems
US10388272B1 (en) 2018-12-04 2019-08-20 Sorenson Ip Holdings, Llc Training speech recognition systems using word sequences
US11170761B2 (en) 2018-12-04 2021-11-09 Sorenson Ip Holdings, Llc Training of speech recognition systems
US10573312B1 (en) 2018-12-04 2020-02-25 Sorenson Ip Holdings, Llc Transcription generation from multiple speech recognition systems
US20210064984A1 (en) * 2019-08-29 2021-03-04 Sap Se Engagement prediction using machine learning in digital workplace
US11443264B2 (en) 2020-01-29 2022-09-13 Accenture Global Solutions Limited Agnostic augmentation of a customer relationship management application
US11539900B2 (en) 2020-02-21 2022-12-27 Ultratec, Inc. Caption modification and augmentation systems and methods for use by hearing assisted user
US11481785B2 (en) 2020-04-24 2022-10-25 Accenture Global Solutions Limited Agnostic customer relationship management with browser overlay and campaign management portal
US11392960B2 (en) * 2020-04-24 2022-07-19 Accenture Global Solutions Limited Agnostic customer relationship management with agent hub and browser overlay
US11588900B2 (en) * 2020-07-02 2023-02-21 Jpmorgan Chase Bank, N.A. Systems and methods multi-tenant and omni-channel routing
US11488604B2 (en) 2020-08-19 2022-11-01 Sorenson Ip Holdings, Llc Transcription of audio
WO2022141142A1 (en) * 2020-12-30 2022-07-07 浙江核新同花顺网络信息股份有限公司 Method and system for determining target audio and video
US11936812B2 (en) 2021-12-22 2024-03-19 Kore.Ai, Inc. Systems and methods for handling customer conversations at a contact center
US11889022B2 (en) 2021-12-22 2024-01-30 Kore.Ai, Inc. Systems and methods for handling customer conversations at a contact center

Family Cites Families (71)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US629830A (en) * 1896-10-31 1899-08-01 Lucius W Winchester Feeding-machine attachment for printing-presses.
US5199062A (en) * 1988-01-20 1993-03-30 Phone Base Systems Inc. Telephone communications system including a digital telephone switch, a voice response unit and a stored program sequence for controlling both the switch and the voice response unit
US5033088A (en) 1988-06-06 1991-07-16 Voice Processing Corp. Method and apparatus for effectively receiving voice input to a voice recognition system
WO1995027360A1 (en) 1994-03-31 1995-10-12 Citibank, N.A. Interactive voice response system
US5483588A (en) * 1994-12-23 1996-01-09 Latitute Communications Voice processing interface for a teleconference system
US5822727A (en) * 1995-03-30 1998-10-13 At&T Corp Method for automatic speech recognition in telephony
US5740240A (en) * 1995-04-10 1998-04-14 Edify Corporation Computer telephony integration system and method
US5884032A (en) 1995-09-25 1999-03-16 The New Brunswick Telephone Company, Limited System for coordinating communications via customer contact channel changing system using call centre for setting up the call between customer and an available help agent
AU714336B2 (en) * 1996-07-25 1999-12-23 Clearway Acquisition, Inc. Web serving system with primary and secondary servers
US5855003A (en) 1996-10-11 1998-12-29 Motorola, Inc. Method and apparatus for establishing a link in a wireless communication system
US5987115A (en) 1996-12-03 1999-11-16 Northern Telecom Limited Systems and methods for servicing calls by service agents connected via standard telephone lines
US6058435A (en) 1997-02-04 2000-05-02 Siemens Information And Communications Networks, Inc. Apparatus and methods for responding to multimedia communications based on content analysis
US6292830B1 (en) 1997-08-08 2001-09-18 Iterations Llc System for optimizing interaction among agents acting on multiple levels
US6038293A (en) 1997-09-03 2000-03-14 Mci Communications Corporation Method and system for efficiently transferring telephone calls
US6370508B2 (en) 1998-09-11 2002-04-09 Genesys Telecommunications Laboratories, Inc. Interface engine for managing business processes within a multimedia communication-center
US6366658B1 (en) 1998-05-07 2002-04-02 Mci Communications Corporation Telecommunications architecture for call center services using advanced interactive voice responsive service node
US6377921B1 (en) * 1998-06-26 2002-04-23 International Business Machines Corporation Identifying mismatches between assumed and actual pronunciations of words
US6499013B1 (en) 1998-09-09 2002-12-24 One Voice Technologies, Inc. Interactive user interface using speech recognition and natural language processing
US6161087A (en) 1998-10-05 2000-12-12 Lernout & Hauspie Speech Products N.V. Speech-recognition-assisted selective suppression of silent and filled speech pauses during playback of an audio recording
US6473505B1 (en) 1999-04-27 2002-10-29 Sprint Communications Company L.P. Call processing system for handling calls to a call center
US6401061B1 (en) 1999-05-13 2002-06-04 Yuri L. Zieman Combinatorial computational technique for transformation phrase text-phrase meaning
US6934759B2 (en) * 1999-05-26 2005-08-23 Enounce, Inc. Method and apparatus for user-time-alignment for broadcast works
US6535848B1 (en) 1999-06-08 2003-03-18 International Business Machines Corporation Method and apparatus for transcribing multiple files into a single document
US6553113B1 (en) 1999-07-09 2003-04-22 First Usa Bank, Na System and methods for call decisioning in a virtual call center integrating telephony with computers
US6442269B1 (en) * 1999-08-23 2002-08-27 Aspect Communications Method and apparatus for integrating business data and transaction data in a transaction processing environment
US6314176B1 (en) 1999-09-23 2001-11-06 Mci Worldcom, Inc. Third party call control
GB9930720D0 (en) * 1999-12-29 2000-02-16 Ibm Call centre agent automated assistance
US7068774B1 (en) 2000-02-25 2006-06-27 Harris Corporation Integrated acd and ivr scripting for call center tracking of calls
US6442247B1 (en) 2000-03-29 2002-08-27 Genesys Telecommunications Laboratories, Inc. Method and apparatus for recording and automated playback of personal agent greetings in a communication-center environment
US7509266B2 (en) * 2000-05-31 2009-03-24 Quality Data Management Inc. Integrated communication system and method
US6831966B1 (en) 2000-06-30 2004-12-14 Qwest Communications International, Inc. Multi-tenant, multi-media call center services platform system
US20020032591A1 (en) * 2000-09-08 2002-03-14 Agentai, Inc. Service request processing performed by artificial intelligence systems in conjunctiion with human intervention
US20030002651A1 (en) 2000-12-29 2003-01-02 Shires Glen E. Data integration with interactive voice response systems
AU2002225272A1 (en) 2001-01-23 2002-08-06 Ericsson Inc. Call center with ivr, whereby customer information and call control functions are presented to the agents in web documents
US6587558B2 (en) 2001-01-29 2003-07-01 Immequire, Llc System and method for virtual interactive response unit
US6925167B2 (en) 2001-02-01 2005-08-02 Estech Systems, Inc. Service observing in a voice over IP telephone system
US6922466B1 (en) 2001-03-05 2005-07-26 Verizon Corporate Services Group Inc. System and method for assessing a call center
EP1241863A3 (en) 2001-03-13 2005-06-22 Siemens Aktiengesellschaft Call distribution in a call center using the dialled number
US20020152071A1 (en) 2001-04-12 2002-10-17 David Chaiken Human-augmented, automatic speech recognition engine
US20030009352A1 (en) * 2001-06-15 2003-01-09 Andy Bolotinikov Interpreter certification system
AU2002355066B2 (en) * 2001-07-19 2007-03-01 Nice Systems Ltd. Method, apparatus and system for capturing and analyzing interaction based content
US20030046210A1 (en) * 2001-08-31 2003-03-06 Vora Poorvi L. Anonymous acquisition of digital products based on secret splitting
US8468027B2 (en) 2001-09-04 2013-06-18 Kombea Corporation Systems and methods for deploying and utilizing a network of conversation control systems
US20030046102A1 (en) 2001-09-04 2003-03-06 Kombia, L.L.C. Systems and methods for maintaining consistency in interpersonal communications related to marketing operations
US20030110023A1 (en) * 2001-12-07 2003-06-12 Srinivas Bangalore Systems and methods for translating languages
US20030179876A1 (en) * 2002-01-29 2003-09-25 Fox Stephen C. Answer resource management system and method
US7292689B2 (en) 2002-03-15 2007-11-06 Intellisist, Inc. System and method for providing a message-based communications infrastructure for automated call center operation
US8170197B2 (en) 2002-03-15 2012-05-01 Intellisist, Inc. System and method for providing automated call center post-call processing
US7167899B2 (en) 2002-03-26 2007-01-23 Matsushita Electric Industrial Co., Ltd. Web-content aware automatic call transfer system and process for mobile users and operators
US20030185380A1 (en) * 2002-04-01 2003-10-02 Pablo Garin Interactive telephone reply system
US20030212558A1 (en) 2002-05-07 2003-11-13 Matula Valentine C. Method and apparatus for distributed interactive voice processing
US6771746B2 (en) * 2002-05-16 2004-08-03 Rockwell Electronic Commerce Technologies, Llc Method and apparatus for agent optimization using speech synthesis and recognition
US7539086B2 (en) 2002-10-23 2009-05-26 J2 Global Communications, Inc. System and method for the secure, real-time, high accuracy conversion of general-quality speech into text
US20040162724A1 (en) * 2003-02-11 2004-08-19 Jeffrey Hill Management of conversations
US7606714B2 (en) * 2003-02-11 2009-10-20 Microsoft Corporation Natural language classification within an automated response system
US7466812B1 (en) * 2003-10-22 2008-12-16 Cisco Technology, Inc. Connecting an endpoint to a conference call
US20050259803A1 (en) * 2004-05-19 2005-11-24 Nokia Corporation Managing a conference session
US7133513B1 (en) 2004-07-21 2006-11-07 Sprint Spectrum L.P. Method and system for transcribing voice content of an on-going teleconference into human-readable notation
US7908141B2 (en) 2004-09-29 2011-03-15 International Business Machines Corporation Extracting and utilizing metadata to improve accuracy in speech to text conversions
US8442197B1 (en) 2006-03-30 2013-05-14 Avaya Inc. Telephone-based user interface for participating simultaneously in more than one teleconference
US20090124272A1 (en) 2006-04-05 2009-05-14 Marc White Filtering transcriptions of utterances
CA2644666A1 (en) 2006-04-17 2007-10-25 Vovision Llc Methods and systems for correcting transcribed audio files
US8326927B2 (en) * 2006-05-23 2012-12-04 Cisco Technology, Inc. Method and apparatus for inviting non-rich media endpoints to join a conference sidebar session
US8306816B2 (en) 2007-05-25 2012-11-06 Tigerfish Rapid transcription by dispersing segments of source material to a plurality of transcribing stations
US20090052646A1 (en) * 2007-08-24 2009-02-26 Mcgowan John T Automatic Conferencing System
US8407049B2 (en) 2008-04-23 2013-03-26 Cogi, Inc. Systems and methods for conversation enhancement
US8332212B2 (en) 2008-06-18 2012-12-11 Cogi, Inc. Method and system for efficient pacing of speech for transcription
US20100268534A1 (en) 2009-04-17 2010-10-21 Microsoft Corporation Transcription, archiving and threading of voice communications
US8335689B2 (en) 2009-10-14 2012-12-18 Cogi, Inc. Method and system for efficient management of speech transcribers
US8326624B2 (en) 2009-10-26 2012-12-04 International Business Machines Corporation Detecting and communicating biometrics of recorded voice during transcription process
US8370142B2 (en) 2009-10-30 2013-02-05 Zipdx, Llc Real-time transcription of conference calls

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See references of EP1620777A4 *

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8666040B2 (en) 2006-09-22 2014-03-04 International Business Machines Corporation Analyzing Speech Application Performance
US8929519B2 (en) 2006-09-22 2015-01-06 International Business Machines Corporation Analyzing speech application performance
CN104883299A (en) * 2015-06-24 2015-09-02 上海斐讯数据通信技术有限公司 Router configuration method, system and router

Also Published As

Publication number Publication date
US20100061529A1 (en) 2010-03-11
US8484042B2 (en) 2013-07-09
EP1620777A4 (en) 2009-11-25
US8332231B2 (en) 2012-12-11
US7606718B2 (en) 2009-10-20
US20140112460A1 (en) 2014-04-24
US20130096924A1 (en) 2013-04-18
US20050002502A1 (en) 2005-01-06
EP1620777A2 (en) 2006-02-01
US8626520B2 (en) 2014-01-07
AU2004237227B2 (en) 2011-07-14
CA2524591A1 (en) 2004-11-18
AU2004237227A1 (en) 2004-11-18
WO2004099934A3 (en) 2009-04-09
US20130297496A1 (en) 2013-11-07
NZ543885A (en) 2009-09-25

Similar Documents

Publication Publication Date Title
US8626520B2 (en) Apparatus and method for processing service interactions
US10171659B2 (en) Customer portal of an intelligent automated agent for a contact center
US9571650B2 (en) Method and system for generating a responsive communication based on behavioral assessment data
JP4057785B2 (en) A storage media interface engine that provides summary records for multimedia files stored in a multimedia communication center
US9992336B2 (en) System for analyzing interactions and reporting analytic results to human operated and system interfaces in real time
CA2917294C (en) Intelligent automated agent for a contact center
US8094803B2 (en) Method and system for analyzing separated voice data of a telephonic communication between a customer and a contact center by applying a psychological behavioral model thereto
JP2002528824A (en) Method and apparatus for building multimedia applications using an interactive multimedia viewer
JP2002537594A (en) Method and apparatus for providing a media-independent self-help module within a multimedia communication center customer interface
CA2760146A1 (en) Secure customer service proxy portal
CN114730357A (en) System and method for dialog management
WO2023196363A1 (en) Modular technologies for servicing telephony systems
KR20230156599A (en) A system that records and manages calls in the contact center

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
WWE Wipo information: entry into national phase

Ref document number: 2524591

Country of ref document: CA

Ref document number: 2004237227

Country of ref document: AU

WWE Wipo information: entry into national phase

Ref document number: 2875/CHENP/2005

Country of ref document: IN

WWE Wipo information: entry into national phase

Ref document number: 2004751363

Country of ref document: EP

NENP Non-entry into the national phase

Ref country code: DE

ENP Entry into the national phase

Ref document number: 2004237227

Country of ref document: AU

Date of ref document: 20040505

Kind code of ref document: A

WWP Wipo information: published in national office

Ref document number: 2004237227

Country of ref document: AU

WWE Wipo information: entry into national phase

Ref document number: 543885

Country of ref document: NZ

WWP Wipo information: published in national office

Ref document number: 2004751363

Country of ref document: EP