SYSTEM AND METHOD FOR SPEECH-ENABLED AUTOMATED AGENT ASSISTANCE WITHIN A CLOUD-BASED CONTACT CENTER
Methods to reduce agent effort and improve customer experience quality through artificial intelligence. The Agent Assist tool provides contact centers with an innovative tool designed to reduce agent effort, improve quality and reduce costs by minimizing search and data entry tasks The Agent Assist tool is natively built and fully unified within the agent interface while keeping all data internally protected from third-party sharing.
This application claims priority to U.S. Provisional Patent Application No. 62/870,913, filed Jul. 5, 2019, entitled “SYSTEM AND METHOD FOR AUTOMATION WITHIN A CLOUD-BASED CONTACT CENTER,” which is incorporated herein by reference in its entirety.
BACKGROUNDToday, contact centers are primarily on-premise software solutions. This requires an enterprise to make a substantial investment in hardware, installation and regular maintenance of such solutions. Using on-premise software, agents and supervisors are stationed in an on-site call center. In addition, a dedicated IT staff is required because on-site software may be too complicated for supervisors and agents to handle on their own. Another drawback of on-premise solutions is that such solutions cannot be easily enhanced to include capabilities to that meet the current demands of technology, such as automation. Thus, there is a need for a solution to enhance the agent experience to enhance the interactions with customers who interact with contact centers.
SUMMARYDisclosed herein are systems and methods for providing a cloud-based contact center solution providing agent automation through the use of e.g., artificial intelligence and the like.
In accordance with an aspect, there is disclosed a method, comprising receiving a speech communication from a customer; converting the speech communication to text to perform inference processing on the text to determine a customer intent; automatically analyzing the text to determine a subject of the speech communication and key terms associated with the subject; automatically parsing a knowledgebase using the key terms for at least one responsive answer associated with the subject; and providing the solution to an agent in a unified interface during the communication with the customer. In accordance with another aspect, a cloud-based software platform is disclosed in which the example method above is performed.
Other systems, methods, features and/or advantages will be or may become apparent to one with skill in the art upon examination of the following drawings and detailed description. It is intended that all such additional systems, methods, features and/or advantages be included within this description and be protected by the accompanying claims.
The components in the drawings are not necessarily to scale relative to each other. Like reference numerals designate corresponding parts throughout the several views.
Assist;
Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art. Methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present disclosure. While implementations will be described within a cloud-based contact center, it will become evident to those skilled in the art that the implementations are not limited thereto.
The present disclosure is generally directed to a cloud-based contact center and, more particularly, methods and systems for proving intelligent, automated services within a cloud-based contact center. With the rise of cloud-based computing, contact centers that take advantage of this infrastructure are able to quickly add new features and channels. Cloud-based contact centers improve the customer experience by leveraging application programming interfaces (APIs) and software development kits (SDKs) to allow the contact center to change in in response to an enterprise's needs. For example, communications channels may be easily added as the APIs and SDKs enable adding channels, such as SMS/MMS, social media, web, etc. Cloud-based contact centers provide a platform that enables frequent updates. Yet another advantage of cloud-based contact centers is increased reliability, as cloud-based contact centers may be strategically and geographically distributed around the world to optimally route calls to reduce latency and provide the highest quality experience. As such, customers are connected to agents faster and more efficiently.
Example Cloud-Based Contact Center Architecture
The contact center 150 may be cloud-based and distributed over a plurality of locations. The contact center 150 may include servers, databases, and other components. In particular, the contact center 150 may include, but is not limited to, a routing server, a SIP server, an outbound server, automated call distribution (ACD), a computer telephony integration server (CTI), an email server, an IM server, a social server, a SMS server, and one or more databases for routing, historical information and campaigns.
The routing server may serve as an adapter or interface between the switch and the remainder of the routing, monitoring, and other communication-handling components of the contact center. The routing server may be configured to process PSTN calls, VoIP calls, and the like. For example, the routing server may be configured with the CTI server software for interfacing with the switch/media gateway and contact center equipment. In other examples, the routing server may include the SIP server for processing SIP calls. The routing server may extract data about the customer interaction such as the caller's telephone number (often known as the automatic number identification (ANI) number), or the customer's internet protocol (IP) address, or email address, and communicate with other contact center components in processing the interaction.
The ACD is used by inbound, outbound and blended contact centers to manage the flow of interactions by routing and queuing them to the most appropriate agent. Within the CTI, software connects the ACD to a servicing application (e.g., customer service, CRM, sales, collections, etc.), and looks up or records information about the caller. CTI may display a customer's account information on the agent desktop when an interaction is delivered.
For inbound SIP messages, the routing server may use statistical data from the statistics server and a routing database to the route SIP request message. A response may be sent to the media server directing it to route the interaction to a target agent 120. The routing database may include: customer relationship management (CRM) data; data pertaining to one or more social networks (including, but not limited to network graphs capturing social relationships within relevant social networks, or media updates made by members of relevant social networks); agent skills data; data extracted from third party data sources including cloud-based data sources such as CRM; or any other data that may be useful in making routing decisions.
Customers 110 may initiate inbound communications (e.g., telephony calls, emails, chats, video chats, social media posts, etc.) to the contact center 150 via an end user device. End user devices may be a communication device, such as, a telephone, wireless phone, smart phone, personal computer, electronic tablet, etc., to name some non-limiting examples. Customers 110 operating the end user devices may initiate, manage, and respond to telephone calls, emails, chats, text messaging, web-browsing sessions, and other multi-media transactions. Agent(s) 120 and customers 110 may communicate with each other and with other services over the network 130. For example, a customer calling on telephone handset may connect through the PSTN and terminate on a private branch exchange (PBX). A video call originating from a tablet may connect through the network 130 terminate on the media server. The channels 140 are coupled to the communications network 130 for receiving and transmitting telephony calls between customers 110 and the contact center 150. A media gateway may include a telephony switch or communication switch for routing within the contact center. The switch may be a hardware switching system or a soft switch implemented via software. For example, the media gateway may communicate with an automatic call distributor (ACD), a private branch exchange (PBX), an IP-based software switch and/or other switch to receive Internet-based interactions and/or telephone network-based interactions from a customer 110 and route those interactions to an agent 120. More detail of these interactions is provided below.
As another example, a customer smartphone may connect via the WAN and terminate on an interactive voice response (IVR)/intelligent virtual agent (IVA) components. IVR are self-service voice tools that automate the handling of incoming and outgoing calls. Advanced IVRs use speech recognition technology to enable customers 110 to interact with them by speaking instead of pushing buttons on their phones. IVR applications may be used to collect data, schedule callbacks and transfer calls to live agents. IVA systems are more advanced and utilize artificial intelligence (AI), machine learning (ML), advanced speech technologies (e.g., natural language understanding (NLU)/natural language processing (NLP)/natural language generation (NLG)) to simulate live and unstructured cognitive conversations for voice, text and digital interactions. IVA systems may cover a variety of media channels in addition to voice, including, but not limited to social media, email, SMS/MMS, IM, etc. and they may communicate with their counterpart's application (not shown) within the contact center 150. The IVA system may be configured with a script for querying customers on their needs. The IVA system may ask an open-ended questions such as, for example, “How can I help you?” and the customer 110 may speak or otherwise enter a reason for contacting the contact center 150. The customer's response may then be used by a routing server to route the call or communication to an appropriate contact center resource.
In response, the routing server may find an appropriate agent 120 or automated resource to which an inbound customer communication is to be routed, for example, based on a routing strategy employed by the routing server, and further based on information about agent availability, skills, and other routing parameters provided, for example, by the statistics server. The routing server may query one or more databases, such as a customer database, which stores information about existing clients, such as contact information, service level agreement requirements, nature of previous customer contacts and actions taken by contact center to resolve any customer issues, etc. The routing server may query the customer information from the customer database via an ANI or any other information collected by the IVA system.
Once an appropriate agent and/or automated resource is identified as being available to handle a communication, a connection may be made between the customer 110 and an agent device of the identified agent 120 and/or the automate resource. Collected information about the customer and/or the customer's historical information may also be provided to the agent device for aiding the agent in better servicing the communication. In this regard, each agent device may include a telephone adapted for regular telephone calls, VoIP calls, etc. The agent device may also include a computer for communicating with one or more servers of the contact center and performing data processing associated with contact center operations, and for interfacing with customers via voice and other multimedia communication mechanisms.
The contact center 150 may also include a multimedia/social media server for engaging in media interactions other than voice interactions with the end user devices and/or other web servers 160. The media interactions may be related, for example, to email, vmail (voice mail through email), chat, video, text-messaging, web, social media, co-browsing, etc. In this regard, the multimedia/social media server may take the form of any IP router conventional in the art with specialized hardware and software for receiving, processing, and forwarding multi-media events.
The web servers 160 may include, for example, social media sites, such as, Facebook, Twitter, Instagram, etc. In this regard, the web servers 160 may be provided by third parties and/or maintained outside of the contact center 160 that communicate with the contact center 150 over the network 130. The web servers 160 may also provide web pages for the enterprise that is being supported by the contact center 150. End users may browse the web pages and get information about the enterprise's products and services. The web pages may also provide a mechanism for contacting the contact center, via, for example, web chat, voice call, email, WebRTC, etc.
The integration of real-time and non-real-time communication services may be performed by unified communications (UC)/presence sever. Real-time communication services include Internet Protocol (IP) telephony, call control, instant messaging (IM)/chat, presence information, real-time video and data sharing. Non-real-time applications include voicemail, email, SMS and fax services. The communications services are delivered over a variety of communications devices, including IP phones, personal computers (PCs), smartphones and tablets. Presence provides real-time status information about the availability of each person in the network, as well as their preferred method of communication (e.g., phone, email, chat and video).
Recording applications may be used to capture and play back audio and screen interactions between customers and agents. Recording systems should capture everything that happens during interactions and what agents do on their desktops. Surveying tools may provide the ability to create and deploy post-interaction customer feedback surveys in voice and digital channels. Typically, the IVR/IVA development environment is leveraged for survey development and deployment rules. Reporting/dashboards are tools used to track and manage the performance of agents, teams, departments, systems and processes within the contact center.
Automation
As shown in
With respect to the cloud-based contact center,
The intent inference module automatically infers the customer's 110 intent from the text of the user input using artificial intelligence or machine learning techniques. These artificial intelligence techniques may include, for example, identifying one or more keywords from the user input and searching a database of potential intents (e.g., call reasons) corresponding to the given keywords. The database of potential intents and the keywords corresponding to the intents may be automatically mined from a collection of historical interaction recordings, in which a customer may provide a statement of the issue, and in which the intent is explicitly encoded by an agent.
Some aspects of the present disclosure relate to automatically navigating an IVR system of a contact center on behalf of a user using, for example, the loaded script. In some implementations of the present disclosure, the script includes a set of fields (or parameters) of data that are expected to be required by the contact center in order to resolve the issue specified by the customer's 110 intent. In some implementations of the present disclosure, some of the fields of data are automatically loaded from a stored user profile. These stored fields may include, for example, the customer's 110 full name, address, customer account numbers, authentication information (e.g., answers to security questions) and the like.
Some aspects of the present disclosure relate to the automatic authentication of the customer 110 with the provider. For example, in some implementations of the present disclosure, the user profile may include authentication information that would typically be requested of users accessing customer support systems such as usernames, account identifying information, personal identification information (e.g., a social security number), and/or answers to security questions. As additional examples, the automation infrastructure 200 may have access to text messages and/or email messages sent to the customer's 110 account on the end user device in order to access one-time passwords sent to the customer 110, and/or may have access to a one-time password (OTP) generator stored locally on the end user device. Accordingly, implementations of the present disclosure may be capable of automatically authenticating the customer 110 with the contact center prior to an interaction.
In some implementations of the present disclosure an application programming interface (API) is used to interact with the provider directly. The provider may define a protocol for making commonplace requests to their systems. This API may be implemented over a variety of standard protocols such as Simple Object Access Protocol (SOAP) using Extensible Markup Language (XML), a Representational State Transfer (REST) API with messages formatted using XML or JavaScript Object Notation (JSON), and the like. Accordingly, a customer experience automation system 200 according to one implementation of the present disclosure automatically generates a formatted message in accordance with an API define by the provider, where the message contains the information specified by the script in appropriate portions of the formatted message.
Some aspects of the present disclosure relate to systems and methods for automating and augmenting aspects of an interaction between the customer 110 and a live agent of the contact center. In an implementation, once a interaction, such as through a phone call, has been initiated with the agent 120, metadata regarding the conversation is displayed to the customer 110 and/or agent 120 in the UI throughout the interaction. Information, such as call metadata, may be presented to the customer 110 through the UI 205 on the customer's 110 mobile device 105. Examples of such information might include, but not be limited to, the provider, department call reason, agent name, and a photo of the agent.
According to some aspects of implementations of the present disclosure, both the customer 110 and the agent 120 can share relevant content with each other through the application (e.g., the application running on the end user device). The agent may share their screen with the customer 110 or push relevant material to the customer 110.
In yet another implementation, the automation infrastructure 200 may also “listen” in on the conversation and automatically push relevant content from a knowledge base to the customer 110 and/or agent 120. For example, the application may use a real-time transcription of the customer's input (e.g., speech) to query a knowledgebase to provide a solution to the agent 120. The agent may share a document describing the solution with the customer 110. The application may include several layers of intelligence where it gathers customer intelligence to learn everything it can about why the customer 110 is calling. Next, it may perform conversation intelligence, which is extracting more context about the customer's intent. Next, it may perform interaction intelligence to pull information from other sources about customer 100. The automation infrastructure 200 may also perform contact center intelligence to implement WFM/WFO features of the contact center 150.
Agent Assist Overview
Thus, in the context of
Agent Assist is powered by artificial intelligence (Al) to provide real-time guidance for frontline employees to respond to customer needs quickly and accurately. For example, as a customer 110 states a need, agents 120 are provided answers or supporting information immediately to expedite the conversation and simplify tasks. Agent Assist determines why customers are calling and what their intent is. Similarly, IVR assist makes recommendations to a supervisor to optimize IVR for a better customer experience, for example, Agent Assist helps optimize IVR questions to match customers' reasons for calling and what their intent is.
By leveraging automated assistance and reducing agent-supervisor ad-hoc interactions, Agent Assist gives supervisors more time to focus on workforce engagement activities. Agent Assist reduces manual supervision and assistance. Agent Assist improves agent proficiency and accuracy. Agent Assist reduces short and long term training efforts through real-time error identification, eliminates busy work with smart note technology (the ability to systematically recognize and enter all key aspects of an interaction into the conversation notes); and improved handle time with in-app automations.
With reference to
With reference to
The operational flow continues at 410, wherein the customer voice and/or agent voice may be analyzed before transcription to extract one or more of the following non-limiting features:
Pain
Agony
Empathy
Being sarcastic
Speech speed
Tone
Frustration
Enthusiasm
Interest
Engagement
Understanding these features helps the agent 120 better understand the customer 110. The agent 120 will be better able to understand the customer's problem or issues so a resolution can be more easily achieved.
At 412, the conversation between the agent and the customer is transcribed in either real-time or post-call. This may be performed by the speech-to-text component of the automation infrastructure 200 and saved to a database. At 414, the agent voice channel and the customer voice channel are separated. At 416, the automation infrastructure 200 determines information about the customer and agent, such as, intent, entities (e.g., names, locations, times, etc.) sentiment, sentence phrases (e.g. verb, noun, adjective, etc.). At 418, from the information determined at 416, Agent Assist provides useful insight to the agent 120. This information, as shown in
Thus, in accordance with the operational flow of
Smart Notes
In accordance with the operations performed in
Automatic Data Entry
In accordance with aspects of the disclosure, when Agent Assist detects the participants in a conversation it may automatically fill out any forms that pop-up after such conversations. With reference to
Such automated data entry includes but not limited to:
Date
Time
Day of the week
First name
Last name
Gender
Address
Object e.g., Samsung Galaxy
Type of the Object—e.g. Galaxy S9
Time of the day (e.g. morning, afternoon)
After the information is populated, the process ends at 806.
Real-time analytics and error detection
With reference to
Compliance—words that should not say in the conversation.
Competitors—if agent says the name of competitors.
A set of “do's and don'ts”—words that agent should not say.
If the agent is angry, curse etc.
If the agent is making fun of the caller.
If the agent talks too fast, too slow, or if there is a delay between words.
If the agent shows empathy.
If the agent violates any policy.
If the agent markets other products.
If the agent talks about personal issues.
If the agent is politically motivated.
If the agent promotes violence.
The process monitors the agent in real-time and expands upon the current state of the art, which is monitoring is at word level to monitor the transcript of the conversation and look for certain words or a variation of such words. For instance, if the agent is talking about pricing, the system may look for words such as “our pricing.” “our price list,” “do you want to know how much our product is,” etc. As another example, the agent may say “our product is beating everybody else,” which means the price is very affordable. Other examples such as these are possible.
Artificial Intelligence (AI) Processing/Learning
In accordance with the present disclosure, a layer of deep learning 1002 is applied to create a large set of all potential of sentences and instances (natural language understanding 1004) where the agent:
Said X and meant A.
Said Y and meant A.
Said Z but did not mean A.
Said W and meant B.
This sets have several positive and negative examples around concepts, such as “cursing,” “being frustrated,” “rude attitude,” “too pushy for sale,” “soft attitude,” as well as word level examples, such as “shut up.” Deep learning 1002 does not need to extract features, rather deep learning takes a set of sentences and classes (class is positive/negative, good bad, cursing/not cursing). Deep learning 1002 learns and builds a model out of all of these examples. For example, audio files of conversations 1006 between agents 120 and customers 110 may be input to the deep learning module 1002. Alternatively, transcribed words may be input to the deep learning module 1002. Next, the system uses the learned model to listen to any conversation in real time and to identify the class such “cursing/not cursing.” As soon as the system identifies a class, and if it is negative or positive, it can do the following:
Send an alert to manager
Make an indicator red on the screen
Send a note to the agent to be reviewed in real-time or after the call
Update some data files for reporting and visualization.
As part of the above, the natural language understanding 1004 may be used for intent spotting 1008 to determine intent 1010, which may be used for IVR analysis 1012 and/or agent performance 1014.
In this approach words are not important, rather the combination of all of words, the order of words and al potential variations of them have relevance. Deep learning 1002 considers all of the potential signals that could describe and hint toward a class. This approach is also language agnostic. It does not matter what language agent or caller speaks as long as there are a set of words and a set of classes, deep learning 1002 will learn and the model can be applied to the same language. In addition to the above, metadata may be added to every call, such as the time of the call, the duration of the call, the number of times the agent talked over the caller could be added to the data, etc.
Listening to Other Agents Conversation in Real-Time
As described above, Agent Assist may periodically perform the following to classify conversations of other agents. With reference to
Time of the call
Duration of the call
Topic of the call
Frequency of words in the customer transcription (e.g. Ticket 2, Delay 4, etc.)
Frequency of words in the agent transcription (e.g. rebook 3, etc.)
Cluster conversations based on these features
At 1106, for the conversation happening in within a predetermined period (e.g., one month), the following are performed:
Calculate the point wise mutual information between all of the calls in one cluster
Make a graph of all calls in which the strength of the link is the weight of the point wise mutual information.
At 1108, for the current file:
Extract features
Find the cluster
Calculate the point wise mutual information
Find the closest call to the current call
Show the content of the call to the agent.
At 1110, the process ends.
Learning Module
While the process 1100 analyzes calls, Agent Assist learns and improves by analyzing user clicks. As relevant conversations are presented to the agent (see, e.g., 306), if the agent clicks on a conversation and spends time on it, then it means that the conversation is relevant. Further, if the conversation is located, e.g., third on the list, but the agent clicks on the first conversation and moves forward, Agent Assist does not make any assumptions about the conversation. Hence, the rank of the conversation may be of importance depending on the agent's actions. For the sake of simplicity, Agent Assist shows the top three conversations to the agent. If some conversations ranked equally, Agent Assist picks one based on heuristics, for instance any conversation that has not been picked recently will be picked.
Escalation Assistance
With reference to
Thus, the present disclosure described an Agent Assist tool within a cloud-based contact center environment that is a conversational guide that proactively delivers real-time contextualized next best actions, in-app, to enhance the customer and agent experience. Talkdesk Agent Assist uses Al to empower agents with a personalized assistant that listens, learns and provides intelligent recommendations in every conversation to help resolve complex customer issues faster
General Purpose Computer Description
Numerous other general purpose or special purpose computing system environments or configurations may be used. Examples of well-known computing systems, environments, and/or configurations that may be suitable for use include, but are not limited to, personal computers, servers, handheld or laptop devices, multiprocessor systems, microprocessor-based systems, network personal computers (PCs), minicomputers, mainframe computers, embedded systems, distributed computing environments that include any of the above systems or devices, and the like.
Computer-executable instructions, such as program modules, being executed by a computer may be used. Generally, program modules include routines, programs, objects, components, data structures, etc. that perform particular tasks or implement particular abstract data types. Distributed computing environments may be used where tasks are performed by remote processing devices that are linked through a communications network or other data transmission medium. In a distributed computing environment, program modules and other data may be located in both local and remote computer storage media including memory storage devices.
With reference to
Computing device 1300 may have additional features/functionality. For example, computing device 1300 may include additional storage (removable and/or non-removable) including, but not limited to, magnetic or optical disks or tape. Such additional storage is illustrated in
Computing device 1300 typically includes a variety of tangible computer readable media. Computer readable media can be any available tangible media that can be accessed by device 1300 and includes both volatile and non-volatile media, removable and non-removable media.
Tangible computer storage media include volatile and non-volatile, and removable and non-removable media implemented in any method or technology for storage of information such as computer readable instructions, data structures, program modules or other data. Memory 1304, removable storage 1308, and non-removable storage 1310 are all examples of computer storage media. Tangible computer storage media include, but are not limited to, RAM, ROM, electrically erasable program read-only memory (EEPROM), flash memory or other memory technology, CD-ROM, digital versatile disks (DVD) or other optical storage, magnetic cassettes, magnetic tape, magnetic disk storage or other magnetic storage devices, or any other medium which can be used to store the desired information and which can be accessed by computing device 1300. Any such computer storage media may be part of computing device 1300.
Computing device 1300 may contain communications connection(s) 1312 that allow the device to communicate with other devices. Computing device 1300 may also have input device(s) 1314 such as a keyboard, mouse, pen, voice input device, touch input device, etc. Output device(s) 1316 such as a display, speakers, printer, etc. may also be included. All these devices are well known in the art and need not be discussed at length here.
It should be understood that the various techniques described herein may be implemented in connection with hardware or software or, where appropriate, with a combination of both. Thus, the methods and apparatus of the presently disclosed subject matter, or certain aspects or portions thereof, may take the form of program code (i.e., instructions) embodied in tangible media, such as floppy diskettes, CD-ROMs, hard drives, or any other machine-readable storage medium wherein, when the program code is loaded into and executed by a machine, such as a computer, the machine becomes an apparatus for practicing the presently disclosed subject matter. In the case of program code execution on programmable computers, the computing device generally includes a processor, a storage medium readable by the processor (including volatile and non-volatile memory and/or storage elements), at least one input device, and at least one output device. One or more programs may implement or utilize the processes described in connection with the presently disclosed subject matter, e.g., through the use of an application programming interface (API), reusable controls, or the like. Such programs may be implemented in a high level procedural or object-oriented programming language to communicate with a computer system. However, the program(s) can be implemented in assembly or machine language, if desired. In any case, the language may be a compiled or interpreted language and it may be combined with hardware implementations.
Although the subject matter has been described in language specific to structural features and/or methodological acts, it is to be understood that the subject matter defined in the appended claims is not necessarily limited to the specific features or acts described above. Rather, the specific features and acts described above are disclosed as example forms of implementing the claims.
Claims
1. A method for automated services in a geographically-distributed cloud-based contact center, the method comprising:
- receiving, by the contact center, a speech communication from a customer;
- converting, by the contact center, the speech communication to text to perform inference processing on the text to determine a customer intent;
- automatically analyzing, by the contact center, the text to determine a subject of the speech communication and key terms associated with the subject;
- automatically parsing, by the contact center, a knowledgebase using the key terms for at least one responsive answer associated with the subject; and
- providing, by the contact center, the solution to an agent in a unified interface during the communication with the customer.
2. The method of 1, further comprising:
- querying a customer relationship management (CRM) platform/a customer service management (CSM) platform using the key terms; and
- displaying responsive results from the CRM/CSM in the second field in the unified interface.
3. The method of claim 1, further comprising:
- querying a database of customer-agent transcripts using the key terms; and
- displaying responsive results from the database of customer-agent transcripts in the second field in the unified interface.
4. The method of claim 1, wherein the method is performed in real-time as the customer communication progresses with the agent.
5. The method of claim 1, further comprising concurrently displaying to the agent the text in a first field of the unified interface and the solution in a second field of the unified interface.
6. The method of claim 1, further comprising separating the customer voice from the agent voice.
7. The method of claim 6, further comprising applying unsupervised methods to the separated customer voice.
8. The method of claim 7, wherein the unsupervised methods comprise applying biometrics to authenticate the customer, predicting a customer gender, predicting a customer age category, predicting a customer accent, and predict customer demographics.
9. The method of claim 6, further comprising extracting features associated with the customer.
10. The method of claim 9, wherein the features comprise pain, agony, empathy, sarcasm, speech speed, tone, frustration, enthusiasm, interest and engagement.
11. A geographically-distributed cloud-based software platform comprising: one or more computer processors; and one or more computer-readable mediums storing instructions that, when executed by the one or more computer processors, cause the cloud-based software platform to perform automated-services operations comprising:
- receiving, by the contact center, a speech communication from a customer;
- converting, by the contact center, the speech communication to text to perform inference processing on the text to determine a customer intent;
- automatically analyzing, by the contact center, the text to determine a subject of the speech communication and key terms associated with the subject;
- automatically parsing, by the contact center, a knowledgebase using the key terms for at least one responsive answer associated with the subject; and
- providing, by the contact center, the solution to an agent in a unified interface during the communication with the customer.
12. The cloud-based software platform of 11, further comprising instructions to cause operations comprising:
- querying a customer relationship management (CRM) platform/a customer service management (CSM) platform using the key terms; and
- displaying responsive results from the CRM/CSM in the second field in the unified interface.
13. The cloud-based software platform of claim 11, further comprising instructions to cause operations comprising:
- querying a database of customer-agent transcripts using the key terms; and
- displaying responsive results from the database of customer-agent transcripts in the second field in the unified interface.
14. The cloud-based software platform of claim 11, wherein the operations are performed in real-time as the customer communication progresses with the agent.
15. The cloud-based software platform of claim 11, further comprising instructions to cause operations comprising concurrently displaying to the agent the text in a first field of the unified interface and the solution in a second field of the unified interface.
16. The cloud-based software platform of claim 11, further comprising instructions to cause operations comprising separating the customer voice from the agent voice.
17. The cloud-based software platform of claim 16, further comprising instructions to cause operations comprising applying unsupervised methods to the separated customer voice.
18. The cloud-based software platform of claim 17, wherein the unsupervised methods comprise applying biometrics to authenticate the customer, predicting a customer gender, predicting a customer age category, predicting a customer accent, and predict customer demographics.
19. The cloud-based software platform of claim 16, further comprising instructions to cause operations comprising extracting features associated with the customer.
20. The cloud-based software platform of claim 19, wherein the features comprise pain, agony, empathy, sarcasm, speech speed, tone, frustration, enthusiasm, interest and engagement.
Type: Application
Filed: Oct 30, 2019
Publication Date: Jan 7, 2021
Inventors: Jafar Adibi (Los Angeles, CA), Tiago Paiva (San Francisco, CA), Charanya Kannan (Oakland, CA), Bruno Antunes (Sao Silvestre), Joao Carmo (Porto), Marco Costa (Lisboa)
Application Number: 16/668,158