KR102193307B1

KR102193307B1 - Artificial intelligence automated Response system with customer verification function using speech verification and method thereof

Info

Publication number: KR102193307B1
Application number: KR1020200049276A
Authority: KR
Inventors: 김종엽; 이형엽; 이재혁
Original assignee: 주식회사 예스피치
Priority date: 2020-03-06
Filing date: 2020-04-23
Publication date: 2020-12-23

Abstract

The present invention relates to an artificial intelligence automated response system and a method thereof. More specifically, the present invention relates to the artificial intelligence automated response system having a customer authentication function using speaker authentication and the method thereof, wherein a call, control data, and voice of the automated response system can be integrated and relayed, and the voice recognition for the customer′s spoken voice and the speaker authentication are performed at the same time by using the integrated relay so as to provide a service responding to a verified speaker.

Description

Artificial intelligence automated response system with customer verification function using speech verification and method thereof

본 발명은 인공지능 자동응답시스템 및 방법에 관한 것으로, 더욱 상세하게는 자동응답시스템의 호, 제어 데이터 및 음성을 통합하여 중계할 수 있고, 통합 중계를 활용하여 고객의 발화 음성에 대한 음성인식과 동시에 화자인증을 수행하여 검증된 화자에 대응하는 서비스를 제공하는 화자인증을 이용한 고객인증 기능을 가지는 인공지능 자동응답시스템 및 방법에 관한 것이다.The present invention relates to an artificial intelligence automatic response system and method, and more particularly, it is possible to integrate and relay the call, control data and voice of the automatic response system, and use the integrated relay to recognize the voice of the customer's spoken voice and It relates to an artificial intelligence automatic response system and method having a customer authentication function using speaker authentication that simultaneously performs speaker authentication and provides a service corresponding to a verified speaker.

일반적으로, 고객이 어떤 제품의 구매, 또는 서비스의 만족을 결정하는 중요 요소 중 하나는 제품의 AS(After Service), 고객의 불만 해소 및 빠르고 신속한 사후 서비스 제공을 위한 기업, 관공서 등의 대고객 서비스이다. 최근 대고객 서비스를 최전방에서 전담하는 창구역할을 수행하는 것이 자동응답시스템(Automatic Response System: ARS, 또는 Interactive Voice Response: IVR)이다.In general, one of the important factors that determines the customer's purchase of a certain product or satisfaction of a service is the customer service of a company, government office, etc. to resolve customer complaints and provide quick and quick after service. . Recently, the automatic response system (ARS, or Interactive Voice Response: IVR) is performing the window area that is in charge of customer service at the forefront.

자동응답시스템은 교환기와 자동응답시스템 기능에 의존했던 2세대 및 전화와 컴퓨터를 결합한 3세대, CTI(Computer Telephony Integration) 자동응답시스템을 거쳐 사용자의 음성을 음성인식하고 인식된 음성에 대한 최적의 응답을 자동으로 선택하여 음성, 문자 등으로 응답하는 4세대 인공지능(Artificial Intelligence: AI) 자동응답시스템으로 진화하고 있다. The automatic answering system recognizes the user's voice through the second generation that relied on the function of the switchboard and the automatic answering system, and the third generation that combines telephone and computer, and a computer telephony integration (CTI) automatic answering system, and recognizes the user's voice and responds optimally to the recognized voice. It is evolving into a 4th generation artificial intelligence (AI) automatic response system that automatically selects and responds with voice or text.

자동응답시스템은 기업 입장에서 고객서비스 처리비용을 절감할 수 있고, 지점의 역할을 수행할 뿐만 아니라 고객정보를 신속하고 효율적으로 쉽게 입수할 수 있으며, 신속한 서비스를 제공할 수 있는 이점을 제공하므로 급속하게 발전하고 있다.The automatic response system can reduce customer service processing costs from the standpoint of a company, not only play the role of a branch office, but also quickly and efficiently obtain customer information, and provides the advantage of providing rapid service. Is developing.

현재 전화와 컴퓨터가 통합되고 인공지능이 탑재된 자동응답시스템이 주류를 이루고 있으며, 이러한 인공지능 자동응답시스템은 교환기(PBX), CTI 등의 콜 인프라, STT(Speech to Text), SV(Speech Verification), TTS(Text to Speech), 통계, AI 분석 등의 AI 인프라, 상기 콜 인프라와 AI 인프라를 연동시키고 서비스 시나리오 및 단위 통계의 통합을 위해 IVR, 실시간 음성처리, 제3자 연동부를 개발하여 적용하고 있다.Currently, automatic answering systems in which phone and computer are integrated and artificial intelligence are installed are the mainstream, and such artificial intelligence automatic answering systems include call infrastructure such as switchboard (PBX), CTI, STT (Speech to Text), SV (Speech Verification). ), TTS (Text to Speech), statistics, AI infrastructure such as AI analysis, call infrastructure and AI infrastructure are linked, and IVR, real-time voice processing, and third-party linkage are developed and applied for integration of service scenarios and unit statistics. Are doing.

상술한 바와 같이 종래 인공지능 자동응답시스템은 적용되는 고객의 분야에 따른 시나리오 및 통계에 대응하도록 IVR, 실시간 음성처리 및 제3자 연동을 매번 개발하여야 하므로 서비스 구현을 위한 시간이 많이 소요되며, 이에 따른 투자 비용이 증가하는 문제점이 있었다.As described above, the conventional artificial intelligence automatic response system requires a lot of time to implement the service because IVR, real-time voice processing, and third-party interworking must be developed each time to respond to scenarios and statistics according to the applied customer's field. There was a problem in that the investment cost increased.

또한, 이러한 인공지능 자동응답시스템은 고객에게 고객에 대응하는 정확한 서비스 및 정보를 제공하고 정당한 고객에게만 해당 고객에 대응하는 정보를 제공하기 위해 고객이 누구인지를 식별하고 인증하는 것이 중요하다.In addition, it is important to identify and authenticate who the customer is in order to provide the customer with accurate services and information corresponding to the customer, and to provide information corresponding to the customer only to the legitimate customer.

종래 인공지능 자동응답시스템은 고객을 식별 및 인증하기 위해 고객의 주민등록번호, 카드번호 등의 식별정보를 입력하도록 하거나, 상담원 연결 시에는 상담원이 고객으로부터 직접 고객을 식별할 수 있는 식별정보를 음성으로 듣고 고객을 식별 및 인증하고 있다.In the conventional artificial intelligence automatic response system, identification information such as a customer's resident registration number and card number is input to identify and authenticate the customer, or when connecting to a consultant, the consultant listens to the customer through voice identification information that can directly identify the customer. Identifying and authenticating customers.

상술한 바와 같이 종래 인공지능 자동응답시스템은 고객이 말하거나 입력하는 식별정보에 의해서만 고객을 식별하고 인증하므로 고객의 식별정보를 불법적으로 획득한 제3자가 고객의 식별정보를 가지는 고객임을 사칭할 수 있고, 이로 인해 제3자에게 서비스 및 정보를 제공할 수 있는 문제점이 있었다.As described above, the conventional AI automatic response system identifies and authenticates customers only by the identification information spoken or input by the customer, so a third party who illegally obtained the customer's identification information can pretend to be a customer with the customer's identification information. There is a problem in that, due to this, services and information can be provided to third parties.

최근 이러한 문제점을 방지하기 위해 대한민국 등록특허 제10-1209033호 [자동응답시스템의 화자인식을 이용한 고객인증 시스템 및 방법](이하 "선행기술"이라 함)은 화자인식을 통해 고객을 인증하는 기술을 개시한다.In order to prevent such problems recently, Korean Patent Registration No. 10-1209033 [Customer authentication system and method using speaker recognition of an automatic response system] (hereinafter referred to as "priority technology") is a technology that authenticates customers through speaker recognition. Start.

그러나 선행기술은 고객 및 고객의 음성특징에 대한 음성정보를 시스템에 보관하므로 고객의 음성 및 음성정보가 유출되는 경우 상기 고객의 음성을 모방할 수 있어 제3자의 고객 사칭이 가능한 문제점이 있었다.However, in the prior art, since the voice information on the customer and the customer's voice characteristics is stored in the system, if the customer's voice and voice information is leaked, the voice of the customer can be imitated, and thus, there is a problem that a third party can impersonate a customer.

대한민국 등록특허 제10-1209033호(2012.12.06.공고)Korean Patent Registration No. 10-1209033 (2012.12.06.announcement)

따라서 본 발명의 목적은 자동응답시스템의 호, 제어 데이터 및 음성을 통합하여 중계할 수 있고, 통합 중계를 활용하여 고객의 발화 음성에 대한 음성인식과 동시에 화자인증을 수행하여 검증된 화자에 대응하는 서비스를 제공하는 화자인증을 이용한 고객인증 기능을 가지는 인공지능 자동응답시스템 및 방법을 제공함에 있다.Accordingly, an object of the present invention is to be able to relay the call, control data and voice of the automatic response system by integrating, and to respond to the verified speaker by performing speaker authentication and voice recognition for the spoken voice of the customer using the integrated relay. It is to provide an artificial intelligence automatic response system and method having a customer authentication function using speaker authentication that provides a service.

상기와 같은 목적을 달성하기 위한 본 발명에 따른 화자인증을 이용한 고객인증 기능을 가지는 인공지능 자동응답시스템은: 수신되는 음성에 대한 음성인식 및 고객별 음성프린트 등록 및 등록된 음성프린트에 의한 화자인증을 수행하고, 그 인증 결과정보를 전송하는 음성인식/화자인증부(STT/SV); 상기 음성인식된 텍스트 데이터에 대한 응답 음성을 전송하는 응답부; 및 교환기(PBX)로부터 호의 인입 시 통화로를 형성하고, 설정된 시나리오에 따른 자동응답서비스를 제공하되, 음성프린트가 등록된 고객으로부터 가상상담서비스의 선택 시 상기 음성인식/화자인증부로 실시간 입력되는 음성을 제공하고, 상기 고객의 실시간 음성에 대한 음성프린트와 등록된 등록 음성프린트의 비교에 의해 상기 음성인식/화자인증부로부터 수신되는 상기 인증 결과정보가 인증 성공인 것으로 판단되면 상기 응답부를 통해 상기 시나리오 및 고객의 선택에 따른 가상상담서비스를 제공하는 인공지능 음성 허브부(AI VH)를 포함하는 것을 특징으로 한다.An artificial intelligence automatic response system having a customer authentication function using speaker authentication according to the present invention for achieving the above object includes: voice recognition for received voice, registration of voice prints for each customer, and speaker authentication by registered voice prints. A voice recognition/speaker authentication unit (STT/SV) for performing and transmitting the authentication result information; A response unit for transmitting a response voice to the voice-recognized text data; And, when a call is received from the switchboard (PBX), a call path is formed, and an automatic response service is provided according to the set scenario, but when a virtual counseling service is selected from a registered customer, a voice input in real time to the voice recognition/speaker authentication unit And, when it is determined that the authentication result information received from the voice recognition/speaker authentication unit is successful by comparing the voice print for the real-time voice of the customer and the registered voice print, the scenario through the response unit And an artificial intelligence voice hub (AI VH) that provides a virtual counseling service according to a customer's selection.

상기 응답부는, 상기 인공지능 음성 허브부를 통해 상기 음성인식된 텍스트 데이터를 수신받고, 상기 텍스트 데이터에 대응하는 응답 텍스트를 검색하여 상기 인공지능 음성 허브부로 전송하는 인공지능 응답부; 및 상기 인공지능 음성 허브부를 통해 상기 응답 텍스트를 수신받아 음성으로 변환한 응답 음성을 상기 인공지능 음성 허브부로 전송하는 음성변환부(TTS)를 포함하는 것을 특징으로 한다.The response unit may include an artificial intelligence response unit that receives the speech-recognized text data through the artificial intelligence speech hub unit, searches for a response text corresponding to the text data, and transmits it to the artificial intelligence speech hub unit; And a voice conversion unit (TTS) for receiving the response text through the artificial intelligence voice hub unit and transmitting a response voice converted into voice to the artificial intelligence voice hub unit.

상기 응답부는, 상기 인공지능 음성 허브부를 통해 상기 음성인식된 텍스트 데이터를 수신받고, 상기 텍스트 데이터에 대응하는 응답 텍스트를 검색하고, 검색된 응답 텍스트를 음성으로 변환하여 상기 인공지능 음성 허브부로 전송하는 음성 인공지능 응답부인 것을 특징으로 한다.The response unit receives the speech-recognized text data through the artificial intelligence speech hub unit, searches for a response text corresponding to the text data, converts the searched response text into speech, and transmits the speech to the artificial intelligence speech hub unit. It is characterized by being an artificial intelligence response unit.

상기 인공지능 음성 허브부는, 상기 음성인식/화자인증부 및 응답부와 데이터통신 및 음성통신을 수행할 수 있도록 하는 허브 통신부; 자동응답서비스에 따른 적어도 하나 이상의 시나리오를 저장하는 시나리오 DB를 포함하는 허브 저장부; 및 자동응답서비스에 따른 시나리오 설정 및 상기 음성인식/화자인증부 및 응답부의 연동 설정을 수행하고, 교환기(PBX)로부터 호의 인입 시 통화로를 형성하고, 설정된 시나리오에 따른 자동응답서비스를 제공하되, 음성프린트가 등록된 고객으로부터 가상상담서비스의 선택 시 상기 음성인식/화자인증부로 실시간 입력되는 음성을 제공하고, 상기 고객의 실시간 음성에 대한 음성프린트와 등록된 등록 음성프린트의 비교에 의해 상기 음성인식/화자인증부로부터 수신되는 상기 인증 결과정보가 인증 성공인 것으로 판단되면 상기 응답부 및 음성변환부를 통해 상기 시나리오 및 고객의 선택에 따른 가상상담서비스를 제공하는 허브 제어부를 포함하는 것을 특징으로 한다.The artificial intelligence voice hub unit may include a hub communication unit configured to perform data communication and voice communication with the voice recognition/speaker authentication unit and the response unit; A hub storage unit including a scenario DB for storing at least one scenario according to the automatic response service; And setting the scenario according to the automatic response service and setting the interlocking setting of the voice recognition/speaker authentication unit and the response unit, forming a call path when a call is received from the switchboard (PBX), and providing an automatic response service according to the set scenario, Provides voice input in real time to the voice recognition/speaker authentication unit when a virtual counseling service is selected from a customer whose voice print is registered, and recognizes the voice by comparing the voice print for the real-time voice of the customer with the registered voice print. / If it is determined that the authentication result information received from the speaker authentication unit is authentication success, a hub control unit that provides a virtual consultation service according to the scenario and the customer's selection through the response unit and the voice conversion unit.

상기 허브 제어부는, 호의 인입 시 통화로를 형성하고, 호 전환 요청에 따른 호 전환 처리를 수행하는 호 처리부; 자동응답서비스에 따른 시나리오 설정 및 상기 음성인식/화자인증부 및 응답부의 연동 설정을 수행하고, 상기 호 처리부를 통해 통화로 형성 시 형성된 통화를 통해 설정된 시나리오에 따른 안내 음성을 고객의 사용자 단말부로 제공하는 시나리오 처리부; 상기 통화로 형성 시 수신되는 고객의 음성과 상기 시나리오에 따른 안내 음성 및 응답 음성을 분리하여 고객의 음성만 출력하는 화자 분리부; 상기 화자 분리부로부터 출력되는 고객의 음성을 미러링하여 상기 음성인식/화자인증부 및 응답부로 제공하는 스위치 미러링부; 및 상기 음성인식/화자인증부 및 응답부와 연동하여 음성프린트가 등록된 고객으로부터 가상상담서비스의 선택 시 상기 음성인식/화자인증부로 실시간 입력되는 음성을 제공하고, 상기 고객의 실시간 음성에 대한 음성프린트와 등록된 등록 음성프린트의 비교에 의해 상기 음성인식/화자인증부로부터 수신되는 상기 인증 결과정보가 인증 성공인 것으로 판단되면 상기 응답부를 통해 상기 시나리오 및 고객의 선택에 따른 가상상담서비스를 제공하는 연동 처리부를 포함하는 것을 특징으로 한다.The hub control unit may include a call processing unit configured to form a call path when a call is received and perform call transfer processing according to a call transfer request; Scenario setting according to the automatic response service and interlocking setting of the voice recognition/speaker authentication unit and response unit are performed, and guidance voice according to the set scenario through the call formed when the call is formed through the call processing unit is provided to the user terminal of the customer A scenario processing unit; A speaker separating unit for separating the customer's voice received during the call and the guidance voice and the response voice according to the scenario and outputting only the customer's voice; A switch mirroring unit that mirrors the customer's voice output from the speaker separation unit and provides it to the voice recognition/speaker authentication unit and the response unit; And the voice recognition/speaker authentication unit and the response unit are interlocked to provide real-time voice input to the voice recognition/speaker authentication unit when a virtual counseling service is selected from a customer registered with a voice print, and the voice for the real-time voice of the customer If it is determined that the authentication result information received from the voice recognition/speaker authentication unit is successful by comparing the print and the registered voice print, the response unit provides a virtual counseling service according to the scenario and the customer's selection. It characterized in that it comprises a link processing unit.

상기 연동 처리부는, 상기 시나리오 처리부를 통해 가상상담서비스가 선택된 것으로 판단되면 상기 스위치 미러링부를 제어하여 상기 음성인식/화자인증부로 실시간 입력되는 음성을 제공하고, 상기 음성인식/화자인증부로부터 상기 고객의 실시간 음성에 대한 음성프린트와 등록된 등록 음성프린트의 비교에 의한 상기 인증 결과정보가 수신되고 상기 인증 결과정보가 인증 성공인 것으로 판단되면 가상상담서비스 개시명령을 출력하는 인증 연동부; 상기 인증 연동부로부터 가상상담서비스 개시명령이 입력되면 상기 응답부를 통해 상기 시나리오 및 고객의 선택에 따른 가상상담서비스를 제공하는 응답 연동부를 포함하는 것을 특징으로 한다.When it is determined that the virtual counseling service is selected through the scenario processing unit, the interlocking processing unit controls the switch mirroring unit to provide a real-time voice input to the speech recognition/speaker authentication unit, and An authentication linking unit for outputting a virtual counseling service start command when the authentication result information is received by comparing the voice print for real-time voice and the registered voice print and it is determined that the authentication result information is successful; And a response linking unit that provides a virtual counseling service according to the scenario and the customer's selection through the response unit when a command to start the virtual counseling service is input from the authentication linking unit.

상기 시스템은, 통화로가 형성된 호 및 해당 고객에 대한 가상상담서비스 이용 이력, 가상상담서비스에 대한 상담 내용 정보를 저장하고, 저장된 정보에 대한 통계 정보 및 분석정보를 저장하는 정보 분석부를 더 포함하되, 상기 연동 처리부는, 상기 호 처리부를 통한 호 종료 시 상기 호에 대한 정보 및 가상상담서비스 시나리오 정보 및 그에 따른 상담 내용정보를 상기 정보 분석부로 제공하는 제3자 연동부를 더 포함하는 것을 특징으로 한다.The system further includes an information analysis unit that stores a call with a call path, a history of use of a virtual counseling service for a corresponding customer, and counseling content information about the virtual counseling service, and stores statistical information and analysis information on the stored information. , The interworking processing unit further comprises a third-party interworking unit that provides information on the call, virtual counseling service scenario information, and consultation content information according to the call upon termination of the call through the call processing unit to the information analysis unit. .

상기 인공지능 음성 허브부는, 상기 응답부 및 인공지능 음성 허브부와 데이터통신 및 음성통신을 수행할 수 있도록 하는 인증 통신부; 고객의 음성을 저장하는 고객 음성 저장부 및 고객별 음성프린트를 저장하는 음성프린트 DB를 포함하는 인증 저장부; 및 상기 인증 통신부를 통해 인공지능 음성 허브부로부터 수신되는 각 고객의 음성에 대한 음성인식 및 고객별 음성프린트 등록 및 등록된 음성프린트에 의한 화자인증을 수행하고, 그 인증 결과정보를 상기 인증 통신부를 통해 상기 인공지능 음성 허브부로 전송하는 인증 제어부를 포함하는 것을 특징으로 한다.The artificial intelligence voice hub unit includes: an authentication communication unit configured to perform data communication and voice communication with the response unit and the artificial intelligence voice hub unit; An authentication storage unit including a customer voice storage unit for storing customer voices and a voice print DB for storing customer-specific voice prints; And voice recognition for each customer's voice received from the artificial intelligence voice hub unit through the authentication communication unit, registration of a voice print for each customer, and speaker authentication by a registered voice print, and the authentication result information is transmitted to the authentication communication unit. It characterized in that it comprises an authentication control unit for transmitting to the artificial intelligence voice hub through.

상기 인증 제어부는, 상기 인공지능 음성 허브부의 고객에 대한 음성프린트 등록 확인 요청에 의해 상기 음성프린트 DB를 검색하여 해당 고객의 음성프린트 등록 여부를 확인하고, 음성프린트의 등록 여부에 따라 음성프린트 등록 모드 및 음성인증 모드 중 하나를 설정하는 모드 식별부; 상기 모드 식별부에 의해 식별된 모드에 따라 인증 통신부를 통해 상기 인공지능 음성 허브부로부터 제공되는 음성을 수집하는 음성 수집부; 상기 음성 수집부를 통해 획득되는 음성에 대해 음성인식을 수행하여 상기 음성에 대응하는 텍스트 데이터를 출력하는 음성 인식부; 상기 음성프린트 등록 모드에서 상기 음성 수집부를 통해 수집되는 음성으로부터 음성프린트를 생성하여 상기 음성프린트 DB에 저장하여 등록하는 등록부; 및 상기 음성인증 모드에서 상기 음성 수집부를 통해 수집되는 음성으로부터 음성프린트를 생성하고, 생성된 음성프린트와 상기 음성프린트 DB의 해당 고객의 등록 음성프린트를 비교하여 일치 여부에 따른 화자인증을 수행하고, 그 인증 결과정보를 상기 인증통신부를 통해 상기 인공지능 음성 허브부로 전송하는 인증부를 포함하는 것을 특징으로 한다.The authentication control unit searches the voice print DB in response to a voice print registration confirmation request for the customer of the artificial intelligence voice hub to check whether the customer has registered the voice print, and the voice print registration mode according to whether the voice print is registered. And a mode identification unit for setting one of voice authentication modes. A voice collection unit for collecting the voice provided from the artificial intelligence voice hub unit through an authentication communication unit according to the mode identified by the mode identification unit; A speech recognition unit configured to perform speech recognition on the speech acquired through the speech collection unit and output text data corresponding to the speech; A registration unit that generates a voice print from the voice collected through the voice collection unit in the voice print registration mode, stores it in the voice print DB, and registers it; And generating a voice print from the voice collected through the voice collecting unit in the voice authentication mode, comparing the generated voice print with the registered voice print of the corresponding customer in the voice print DB, and performing speaker authentication according to whether or not they match, And an authentication unit for transmitting the authentication result information to the artificial intelligence voice hub unit through the authentication communication unit.

상기와 같은 목적을 달성하기 위한 본 발명에 따른 화자인증을 이용한 고객인증 기능을 가지는 인공지능 자동응답방법은: 인공지능 음성 허브부가 교환기(PBX)로부터 호의 인입 시 통화로를 형성하고, 설정된 시나리오에 따른 자동응답서비스를 개시하는 자동응답서비스 개시 과정; 고객이 가상상담서비스의 선택 시 상기 인공지능 음성 허브부가 상기 호를 발신한 고객이 음성프린트를 등록한 고객이면 음성인식/화자인증부로 실시간 입력되는 음성을 제공하고, 상기 고객의 실시간 음성에 대한 음성프린트와 등록된 등록 음성프린트의 비교에 의해 음성인식/화자인증부로부터 수신되는 인증 결과정보를 수신하는 화자인증 과정; 및 상기 화자인증이 인증 성공인 것으로 판단되면 상기 인공지능 음성 허브부가 응답부 및 음성변환부를 통해 상기 시나리오 및 고객의 선택에 따른 가상상담서비스를 제공하는 가상상담서비스 과정을 포함하는 것을 특징으로 한다.An artificial intelligence automatic response method having a customer authentication function using speaker authentication according to the present invention for achieving the above object is: The artificial intelligence voice hub forms a call path when incoming calls from the exchange (PBX), and An automatic response service initiation process of starting the automatic response service according to the method; When the customer selects the virtual consultation service, the artificial intelligence voice hub unit provides real-time voice input to the voice recognition/speaker authentication unit if the customer who sent the call is a customer who has registered a voice print, and a voice print for the real-time voice of the customer A speaker authentication process of receiving authentication result information received from the voice recognition/speaker authentication unit by comparing the registered voice print with the registered voice print; And a virtual consultation service process in which the artificial intelligence voice hub unit provides a virtual consultation service according to the scenario and the customer's selection through a response unit and a speech conversion unit when it is determined that the speaker authentication is successful.

상기 자동응답서비스 개시 과정은, 인공지능 음성 허브부가 교환기(PBX)로부터 호의 인입 시 통화로를 형성하고, 상기 통화로를 통해 상기 시나리오에 따른 음성안내를 제공하고, 상기 시나리오에 따른 고객의 음성을 수신하여 상기 음성인식/화자인증부로 미러링하는 음성 미러링 단계; 상기 인공지능 음성 허브부가 음성인식/화자인증부와 연동하여 상기 미러링된 고객의 음성에 대한 음성인식 결과인 텍스트 데이터를 응답부로 제공하는 음성인식 단계; 상기 인공지능 음성 허브부가 응답부와 연동하여 상기 텍스트 데이터에 대한 응답 텍스트를 획득하는 응답 획득 단계; 및 상기 인공지능 음성 허브부가 상기 획득된 응답 텍스트를 음성으로 변환하여 상기 고객의 사용자 단말로 제공하는 응답 단계를 포함하는 것을 특징으로 한다.In the process of starting the automatic response service, the artificial intelligence voice hub forms a call path when a call is received from the switchboard (PBX), provides voice guidance according to the scenario through the call path, and transmits the customer's voice according to the scenario. A voice mirroring step of receiving and mirroring to the voice recognition/speaker authentication unit; A voice recognition step of the artificial intelligence voice hub unit interworking with a voice recognition/speaker authentication unit to provide text data, which is a voice recognition result of the mirrored customer's voice, to a response unit; A response obtaining step of acquiring a response text for the text data by the artificial intelligence voice hub unit interworking with a response unit; And a response step of converting the obtained response text into speech by the artificial intelligence voice hub unit and providing it to the user terminal of the customer.

상기 화자인증 과정은, 상기 인공지능 음성 허브부는 음성인식/화자인증부와 연동하여 상기 호를 발신한 고객에 대한 음성프린트 등록 여부를 확인하는 음성프린트 등록 여부 확인 단계; 상기 인공지능 음성 허브부가 연동하는 음성인식/화자인증부에 의해 상기 음성프린트의 등록 여부에 따라 음성프린트 등록 모드 및 음성인증 모드를 식별하여 설정하는 모드 식별 단계; 상기 인공지능 음성 허브부가 모드 식별부에 의해 식별된 모드에 따라 상기 음성인식/화자인증부로 고객의 음성을 미러링하는 음성 미러링 단계; 설정된 모드가 음성프린트 등록 모드이면 상기 인공지능 음성 허브부가 연동하는 음성인식/화자인증부에 의해 상기 음성으로부터 음성프린트를 생성하여 음성프린트 DB에 저장하여 등록하도록 하는 음성프린트 등록 단계; 및 상기 설정된 모드가 인증 모드이면, 상기 인공지능 음성 허브부가 연동하는 음성인식/화자인증부에 의해 음성으로부터 음성프린트를 생성하고, 생성된 음성프린트와 상기 음성프린트 DB의 해당 고객의 등록 음성프린트를 비교하여 일치 여부에 따른 화자인증을 수행하도록 하여 그 인증 결과정보를 하는 인증 단계를 포함하는 것을 특징으로 한다.The speaker authentication process may include a step of checking whether or not to register a voice print in which the artificial intelligence voice hub unit interworks with a voice recognition/speaker authentication unit to check whether a voice print is registered for the customer who has sent the call; A mode identification step of identifying and setting a voice print registration mode and a voice authentication mode according to whether or not the voice print is registered by a voice recognition/speaker authentication unit linked by the artificial intelligence voice hub unit; A voice mirroring step in which the artificial intelligence voice hub mirrors the voice of the customer to the voice recognition/speaker authentication unit according to the mode identified by the mode identification unit; A voice print registration step of generating a voice print from the voice by a voice recognition/speaker authentication unit interlocked with the artificial intelligence voice hub unit and storing and registering it in a voice print DB when the set mode is a voice print registration mode; And if the set mode is an authentication mode, a voice print is generated from a voice by a voice recognition/speaker authentication unit linked to the artificial intelligence voice hub unit, and the generated voice print and a registered voice print of the corresponding customer in the voice print DB are It characterized by including the authentication step of performing the speaker authentication according to the match or not by comparing the authentication result information.

본 발명은 호, 제어 데이터, 음성을 통합하여 중계할 수 있는 인공지능 음성 허브를 구비하므로 고객사에 따른 시나리오의 변경 시 간단하게 설정만으로 변경된 시나리오를 적용할 수 있는 효과가 있다.The present invention has an artificial intelligence voice hub capable of integrating and relaying calls, control data, and voice, so that when a scenario is changed according to a customer company, the changed scenario can be applied by simply setting.

본 발명은 간단한 설정만으로 변경된 시나리오를 적용할 수 있으므로 IVR, 실시간 음성처리 및 제3자 연동을 매번 개발하지 않아도 되므로 개발에 소요되는 시간 및 비용을 절약할 수 있는 효과가 있다.In the present invention, since a changed scenario can be applied with only a simple setting, it is not necessary to develop IVR, real-time voice processing, and third-party interworking every time, thereby saving time and cost required for development.

또한, 본 발명은 고객에 대한 화자인증을 수행하되, 고객의 최초 이용 시 고객에 대한 화자 음성프린트를 자동 생성하여 등록하므로 고객 및 관리자에게 편리성을 제공할 수 있는 효과가 있다.In addition, the present invention performs speaker authentication for a customer, but automatically generates and registers a speaker voice print for the customer upon initial use by the customer, thereby providing convenience to customers and managers.

또한, 본 발명은 수집된 고객의 음성을 음성프린트의 생성 후 삭제하고, 생성된 음성프린트를 영수증과 티켓으로 분할하여 티켓만을 보유하고, 영수증은 신뢰할 수 있는 외부 저장수단에 저장하여 관리하므로, 보안성을 향상시킬 수 있는 효과가 있다.In addition, the present invention deletes the collected customer's voice after creation of a voice print, divides the generated voice print into a receipt and a ticket to hold only tickets, and stores and manages the receipt in a reliable external storage means. There is an effect that can improve sex.

도 1은 본 발명에 따른 화자인증을 이용한 고객인증 기능을 가지는 인공지능 자동응답시스템을 포함하는 통신시스템의 구성을 나타낸 도면이다.
도 2는 본 발명에 따른 인공지능 자동응답시스템의 인공지능 음성 허브의 구성을 나타낸 도면이다.
도 3은 본 발명에 따른 인공지능 자동응답시스템의 음성인식/화자인증부의 구성을 나타낸 도면이다.
도 4는 본 발명에 따른 화자인증을 이용한 고객인증 기능을 가지는 인공지능 자동응답 방법 중 고객의 음성프린트 등록 방법을 나타낸 절차도이다.
도 5는 본 발명에 따른 화자인증을 이용한 고객인증 기능을 가지는 인공지능 자동응답 방법 중 고객인증에 따른 자동응답 방법을 나타낸 절차도이다.1 is a diagram showing the configuration of a communication system including an artificial intelligence automatic response system having a customer authentication function using speaker authentication according to the present invention.
2 is a diagram showing the configuration of an artificial intelligence voice hub of the artificial intelligence automatic response system according to the present invention.
3 is a diagram showing the configuration of a voice recognition/speaker authentication unit of an artificial intelligence automatic response system according to the present invention.
4 is a flowchart illustrating a method of registering a voice print of a customer among artificial intelligence automatic response methods having a customer authentication function using speaker authentication according to the present invention.
5 is a flowchart showing an automatic response method according to customer authentication among artificial intelligence automatic response methods having a customer authentication function using speaker authentication according to the present invention.

이하 첨부된 도면을 참조하여 본 발명에 따른 화자인증을 이용한 고객인증 기능을 가지는 인공지능 자동응답시스템의 구성 및 동작을 상세히 설명하고, 상기 시스템에서의 화자인증에 의한 고객인증을 통한 자동응답 방법을 설명한다.Hereinafter, the configuration and operation of an artificial intelligence automatic response system having a customer authentication function using speaker authentication according to the present invention will be described in detail with reference to the accompanying drawings, and an automatic response method through customer authentication by speaker authentication in the system is described. Explain.

도 1은 본 발명에 따른 화자인증을 이용한 고객인증 기능을 가지는 인공지능 자동응답시스템을 포함하는 통신시스템의 구성을 나타낸 도면이다.1 is a diagram showing the configuration of a communication system including an artificial intelligence automatic response system having a customer authentication function using speaker authentication according to the present invention.

도 1을 참조하면, 본 발명에 따른 화자인증을 이용한 고객인증 기능을 가지는 인공지능 자동응답시스템(200)은 고객이 사용하는 사용자 단말부(100)와 유무선 데이터통신망(10)을 통해 연결되어 상호 데이터통신 및 음성통신을 수행한다.Referring to FIG. 1, an artificial intelligence automatic response system 200 having a customer authentication function using speaker authentication according to the present invention is connected to a user terminal unit 100 used by a customer through a wired/wireless data communication network 10, Data communication and voice communication are performed.

상기 유무선 데이터통신망(10)은 와이파이(WiFi)망을 포함하는 인터넷망, 2세대(2 Generation: 2G), 3G, 4G, 5G 등을 포함하는 이동통신망, 일반전화공중망(PSTN)을 상기 인터넷망 및 이동통신망에 결합시키는 패킷 기반의 차세대 통신망(NGN) 등 중 하나 이상이 결합되어 있는 통신망일 수 있을 것이다.The wired/wireless data communication network 10 includes an Internet network including a Wi-Fi network, a mobile communication network including a 2nd generation (2G), 3G, 4G, 5G, etc., and a general telephone public network (PSTN). And a packet-based next-generation communication network (NGN) coupled to a mobile communication network.

사용자 단말부(100)는 유무선 데이터통신망(10)에 무선으로 접속하는 스마트폰, 스마트패드 등의 모바일 단말기(100-1) 및 상기 유무선 데이터통신망(10)에 유선으로 연결되는 유선 전화기(100-2) 중 어느 하나일 수 있을 것이다.The user terminal unit 100 includes a mobile terminal 100-1 such as a smart phone or a smart pad that wirelessly connects to the wired/wireless data communication network 10, and a wired telephone 100- connected to the wired/wireless data communication network 10 by wire. It could be either of 2).

자동응답시스템(200)은 컴퓨터 전화 통합 시스템(Computer Telephony Integration: CTI)(300), 교환기(PBX)(400), 인공지능 음성 허브부(Artificial Intelligence Voice HUB: AI VH)(500), 응답부(600) 및 음성인식/화자인증부(STT/SV: Speech to Text/Speech Verification)(700)를 포함하고, 실시예에 따라 정보 분석부(800) 및 외부저장부(1000)를 더 포함할 수 있을 것이다.The automatic answering system 200 includes a computer telephony integration (CTI) 300, a switchboard (PBX) 400, an artificial intelligence voice hub (AI VH) 500, a response unit. 600 and a speech recognition/speaker authentication unit (STT/SV: Speech to Text/Speech Verification) 700, and further include an information analysis unit 800 and an external storage unit 1000 according to an embodiment. I will be able to.

상기 CTI(300), 교환기(400), AR VH(500), 응답부(600), STT/SV(700), 정보 분석부(800)는 내부 근거리통신망(Local Area Network: LAN)(20)을 통해 연결되는 것이 바람직하고, 외부저장부(1000)는 유무선 데이터통신망(10)을 통해 연결되는 것이 바람직할 것이다.The CTI 300, the switch 400, the AR VH 500, the response unit 600, the STT/SV 700, the information analysis unit 800 is an internal local area network (LAN) 20 It is preferable to be connected through, and the external storage unit 1000 is preferably connected through a wired/wireless data communication network 10.

상기 CTI(300)는 고객의 사용자 단말부(100)의 전화번호 및 사용자 단말부(100)로부터 입력되는 고객 식별정보 등의 정보를 획득하고, 획득된 정보를 활용한 다양한 정보들을 상담원의 단말기인 상담원 단말기(미도시)에 표시하고, 다른 장치들로 필요한 정보들을 제공한다.The CTI 300 acquires information such as the phone number of the user terminal unit 100 of the customer and customer identification information input from the user terminal unit 100, and provides various information using the obtained information, which is the terminal of the consultant. It is displayed on a counselor terminal (not shown) and provides necessary information to other devices.

PBX(400)는 인입되는 다수의 호들을 상담원 단말기(미도시) 및 AI VH(500)로 스위칭하여 연결하거나 연결된 호를 다른 곳으로 연결하는 호 전환을 수행한다.The PBX 400 switches a plurality of incoming calls to a counselor terminal (not shown) and an AI VH 500 for connection or performs call switching to connect a connected call to another location.

AI VH(500)는 교환기(PBX)(400)로부터 호의 인입 시 통화로를 형성하고, CTI(300), PBX(400) 등의 호 인프라 구성과 응답부(600), 음성인식/화자인증부(700), 정보 분석부(800) 등의 인공지능 인프라 구성들 사이에 구성되어, 호 인프라 구성으로 수신되는 음성을 인공지능 인프라 구성으로 미러링하고, 제어 데이터의 송수신에 의해 호 인프라 구성 및 인공지능 인프라 구성들과 연동하여 설정된 시나리오에 따른 자동응답서비스를 제공한다.AI VH (500) forms a call path when a call is received from the switch (PBX) (400), configures call infrastructure such as CTI (300), PBX (400), and responds (600), voice recognition/speaker authentication unit It is configured between artificial intelligence infrastructure components such as 700 and information analysis unit 800, and mirrors the voice received by the call infrastructure configuration to the artificial intelligence infrastructure configuration, and configures the call infrastructure and artificial intelligence by transmitting and receiving control data. In conjunction with infrastructure configurations, it provides automatic response service according to the set scenario.

상기 AI VH(500)는 상기 시나리오의 개시 후 시나리오 중 가상상담서비스가 선택되면 호를 발신한 고객에 대한 고객 식별정보 및 전화번호 중 어느 하나 이상을 CTI를 통해 획득하고 획득된 고객 식별정보 및 전화번호에 대응하는 상기 고객에 대한 음성프린트가 등록되어 있는지를 확인한다.The AI VH 500 acquires one or more of customer identification information and phone number for the customer who sent the call through CTI, and acquired customer identification information and phone number when the virtual consultation service is selected among the scenarios after the start of the scenario. It is checked whether a voice print for the customer corresponding to the number is registered.

상기 고객에 대한 음성프린트가 등록되어 있지 않으면 AI VH(500)는 CTI(300)를 통한 일반적인 고객인증정보를 획득하여 인증 후 고객이 선택한 가상상담서비스에 대한 가상상담서비스 시나리오를 고객에게 제공하고, 상기 음성인식/화자인증부(STT/SV)(700)와 연동하여 가상상담서비스 시나리오에 따른 음성을 수집하고 수집한 음성에 대한 음성프린트를 생성하여 등록한다(이하 등록된 음성프린트를 "등록 음성프린트"라 한다).If the voice print for the customer is not registered, the AI VH 500 obtains general customer authentication information through the CTI 300 and provides a virtual consultation service scenario for the virtual consultation service selected by the customer after authentication to the customer, In conjunction with the voice recognition/speaker authentication unit (STT/SV) 700, voices according to the virtual counseling service scenario are collected, and voice prints for the collected voices are generated and registered (hereinafter, the registered voice print is referred to as “registered voice”). It is called "print").

또한 상기 고객이 음성프린트가 등록된 고객이면 AI VH(500)는 교환기(400)로 수신되는 고객의 음성을 실시간 미러링하여 상기 STT/SV(700)로 제공하고, 상기 고객의 실시간 음성에 대한 음성프린트와 등록된 등록 음성프린트의 비교에 의해 상기 STT/SV(700)로부터 수신되는 상기 인증 결과정보가 인증 성공인 것으로 판단되면 응답부(600)를 통해 상기 시나리오 및 고객의 선택에 따른 가상상담서비스를 제공한다.In addition, if the customer is a registered customer with voice print, the AI VH 500 mirrors the customer's voice received through the switch 400 in real time and provides it to the STT/SV 700, and provides the voice for the customer's real-time voice. When it is determined that the authentication result information received from the STT/SV 700 is successful by comparing the print and the registered voice print, the virtual counseling service according to the scenario and the customer's selection through the response unit 600 Provides.

응답부(600)는 상기 AI VH(500)로부터 음성 인식된 텍스트 데이터를 입력받고, 상기 텍스트 데이터에 대응하는 응답 음성을 상기 AI VH(500)로 전송한다.The response unit 600 receives speech-recognized text data from the AI VH 500 and transmits a response voice corresponding to the text data to the AI VH 500.

상기 응답부(600)는 인공지능 응답부(610) 및 음성변환부(Text to Speech: TTS)(620)를 포함할 수 있을 것이다.The response unit 600 may include an artificial intelligence response unit 610 and a text to speech (TTS) 620.

상기 인공지능 응답부(610)는 채팅 인공지능 응답부(또는 "채팅 봇(로봇)"이라 함), 음성 인공지능 응답부(또는 "음성 봇(로봇)"이라 함) 등의 인공지능 시스템으로, 입력되는 상기 텍스트 데이터에 대응하는 응답 텍스트를 추출하고, 추출된 응답 텍스트 AI VH(500)로 제공하거나 응답 음성을 AI VH(500)로 제공한다.The artificial intelligence response unit 610 is an artificial intelligence system such as a chat artificial intelligence response unit (or "chat bot (robot)") and a voice artificial intelligence response unit (or "voice bot (robot)"). , The response text corresponding to the input text data is extracted, and the extracted response text is provided to the AI VH 500 or the response voice is provided to the AI VH 500.

상기 음성 봇은 채팅 봇에 TTS(620)가 결합된 구성일 수 있을 것이다.The voice bot may be a configuration in which the TTS 620 is combined with a chat bot.

상기 TTS(620)는 상기 인공지능 응답부(610)가 챗봇인 경우 응답 텍스트를 입력받아 응답 음성으로 변환하여 VI AH(500)로 전송한다.When the artificial intelligence response unit 610 is a chatbot, the TTS 620 receives a response text, converts it into a response voice, and transmits it to the VI AH 500.

STT/SV(700)는 AI VH(500)와 연동하여 미러링되는 고객의 음성을 입력받아 음성인식을 수행하여 인식된 텍스트 데이터를 출력하고, 음성프린트 등록 모드에서 등록 음성프린트를 생성하여 등록하고, 인증 모드에서 인증 음성프린트와 해당 고객에 대해 등록되어 있는 등록 음성프린트를 비교하여 인증을 수행하고, 그 인증 결과정보를 AI VH(500)로 전송한다.The STT/SV 700 interworks with the AI VH 500 to receive the mirrored customer's voice, perform voice recognition, output the recognized text data, and create and register a registered voice print in the voice print registration mode, In the authentication mode, authentication is performed by comparing the authentication voice print and the registered voice print registered for the customer, and the authentication result information is transmitted to the AI VH 500.

상기 STT/SV(700)는 실시예에 따라 상기 등록 음성프린트를 분할하여 티켓과 영수증을 생성하고, 티켓은 내부 저장수단에 저장하고, 영수증은 외부저장부(1000)에 저장하여 등록 음성프린트를 등록하고, 인증 모드에서 외부저장부(1000)로부터 해당 고객의 영수증을 획득하여 티켓과 결합하여 등록 음성프린트를 복원하고, 복원된 등록 음성프린트와 인증 음성프린트를 비교하여 화자인증을 수행한다.The STT/SV 700 divides the registered voice print according to an embodiment to generate a ticket and a receipt, stores the ticket in an internal storage means, and stores the receipt in the external storage unit 1000 to store the registered voice print. After registration, the receipt of the customer is obtained from the external storage unit 1000 in the authentication mode, combined with the ticket, and restored the registered voice print, and speaker authentication is performed by comparing the restored registered voice print and the authenticated voice print.

상기 외부저장부(1000)는 외부의 공신력 있고 보안성이 높은 감독기관의 서버일 수도 있고, 블록체인 네트워크 등이 될 수도 있을 것이다.The external storage unit 1000 may be a server of an external supervisory authority having high trust and security, or may be a block chain network.

정보 분석부(800)는 통화로가 형성된 호 및 해당 고객에 대한 가상상담서비스 이용 이력, 가상상담서비스에 대한 상담 내용 정보를 저장하고, 저장된 정보에 대한 통계 정보를 생성하여 저장하고, 해당 고객이 주로 이용하는 서비스에 대한 정보 등과 같은 다양한 분석정보를 생성하여 저장한다.The information analysis unit 800 stores a call with a call path, a history of using a virtual counseling service for a corresponding customer, and counseling content information for a virtual counseling service, generates and stores statistical information about the stored information, and It creates and stores various analysis information, such as information on services that are mainly used.

도 2는 본 발명에 따른 인공지능 자동응답시스템의 인공지능 음성 허브의 구성을 나타낸 도면이다. 이하 도 2를 참조하여 본 발명에 따른 인공지능 음성 허브, AI VH(500)의 상세 구성 및 동작을 설명한다.2 is a diagram showing the configuration of an artificial intelligence voice hub of the artificial intelligence automatic response system according to the present invention. Hereinafter, a detailed configuration and operation of the AI VH 500 according to the present invention will be described with reference to FIG. 2.

AI VH(500)는 허브 저장부(510), 허브 통신부(520) 및 허브 제어부(511)를 포함한다.The AI VH 500 includes a hub storage unit 510, a hub communication unit 520 and a hub control unit 511.

허브 저장부(510)는 본 발명에 따른 AI VH(500)의 전반적인 동작을 제어하기 위한 제어프로그램을 저장하는 프로그램영역, 상기 제어프로그램 수행 중에 발생되는 데이터를 일시 저장하는 임시영역, 상기 제어프로그램 수행에 필요한 데이터 및 제어프로그램 수행 중에 발생하는 데이터를 반영구적으로 저장하기 위한 데이터영역을 포함한다. 상기 데이터영역에는 본 발명에 따른 시나리오 DB(511)가 구성될 수 있을 것이다.The hub storage unit 510 is a program area for storing a control program for controlling the overall operation of the AI VH 500 according to the present invention, a temporary area for temporarily storing data generated during execution of the control program, and executing the control program It includes a data area for semi-permanently storing data necessary for the system and data generated while executing a control program. The scenario DB 511 according to the present invention may be configured in the data area.

상기 시나리오 DB(511)는 본 발명에 따른 가상상담서비스 시나리오를 포함하는 자동응답서비스 시나리오를 저장한다.The scenario DB 511 stores an automatic response service scenario including a virtual counseling service scenario according to the present invention.

허브 통신부(520)는 상기 STT/SV(700) 및 응답부(600)와 데이터통신 및 음성통신을 수행할 수 있도록 한다.The hub communication unit 520 enables data communication and voice communication with the STT/SV 700 and the response unit 600.

허브 제어부(530)는 호 처리부(540), 시나리오 처리부(550), 스위치 미러링부(560), 화자 분리부(570), 연동 처리부(580)를 포함하여 본 발명에 따른 AI VH(500)의 전반적인 동작을 제어한다.The hub control unit 530 includes a call processing unit 540, a scenario processing unit 550, a switch mirroring unit 560, a speaker separation unit 570, and an interlocking processing unit 580, including the AI VH 500 according to the present invention. Control the overall operation.

구체적으로 설명하면, 호 처리부(540)는 PBX(400)로의 호의 인입 시 PBX(400)를 제어하여 자동응답시스템(200)과 통화로를 형성하고, 호 전환 요청에 따라 PBX(400)를 제어하여 호 전환 처리를 수행한다.Specifically, the call processing unit 540 controls the PBX 400 when incoming calls to the PBX 400 to form a call route with the automatic answering system 200, and controls the PBX 400 according to the call transfer request. To perform call transfer processing.

시나리오 처리부(550)는 상기 호 처리부(540)를 통해 통화로 형성 시 형성된 통화를 통해 설정된 시나리오에 따른 안내 음성을 고객의 사용자 단말부(100)로 제공하는 시나리오 제공부(551) 및 자동응답서비스에 따른 시나리오 변경, 설정 및 상기 STT/SV(700) 및 응답부(600)의 연동 설정을 수행하는 시나리오 관리부(552)를 포함한다.The scenario processing unit 550 provides a scenario providing unit 551 for providing a guide voice according to a scenario set through a call formed through a call formed through the call processing unit 540 to the user terminal unit 100 of the customer, and an automatic response service. It includes a scenario management unit 552 that performs the scenario change and setting according to the STT / SV (700) and the interlocking setting of the response unit (600).

화자 분리부(570)는 상기 통화로 형성 시 수신되는 고객의 음성과 상기 시나리오에 따른 안내 음성 및 응답 음성을 분리하여 고객의 음성만 출력한다.The speaker separation unit 570 separates the customer's voice received during the call and the guidance voice and response voice according to the scenario, and outputs only the customer's voice.

스위치 미러링부(560)는 상기 화자 분리부(570)로부터 출력되는 고객의 음성을 미러링하여 상기 STT/SV(700) 및 응답부(600)로 제공한다.The switch mirroring unit 560 mirrors the customer's voice output from the speaker separation unit 570 and provides it to the STT/SV 700 and the response unit 600.

연동 처리부(580)는 인증 연동부(581), 응답 연동부(582) 및 제3자 연동부(585)를 포함하여, 상기 STT/SV(700) 및 응답부(600)와 연동하여 음성프린트가 등록된 고객으로부터 가상상담서비스의 선택 시 상기 STT/SV(700)로 실시간 입력되는 음성을 제공하고, 상기 고객의 실시간 음성에 대한 음성프린트와 등록된 등록 음성프린트의 비교에 의해 상기 STT/SV(700)로부터 수신되는 상기 인증 결과정보가 인증 성공인 것으로 판단되면 상기 응답부(600)를 통해 상기 시나리오 및 고객의 선택에 따른 가상상담서비스를 제공한다.The interlocking processing unit 580 includes an authentication linking unit 581, a response linking unit 582, and a third-party linking unit 585, and interlocking with the STT/SV 700 and the response unit 600 to print voice. When selecting a virtual counseling service from a registered customer, a voice input in real time is provided to the STT/SV 700, and the STT/SV is performed by comparing the voice print for the real-time voice of the customer and the registered voice print. When it is determined that the authentication result information received from 700 is authentication success, a virtual counseling service according to the scenario and the customer's selection is provided through the response unit 600.

상기 인증 연동부(581)는 상기 시나리오 처리부(550)를 통해 가상상담서비스가 선택된 것으로 판단되면 상기 스위치 미러링부(560)를 제어하여 상기 STT/SV(700)로 실시간 입력되는 음성을 제공하고, 상기 STT/SV(700)로부터 상기 고객의 실시간 음성에 대한 음성프린트와 등록된 등록 음성프린트의 비교에 의한 상기 인증 결과정보가 수신되고 상기 인증 결과정보가 인증 성공인 것으로 판단되면 상기 가상상담서비스 개시명령을 출력한다.When it is determined that the virtual counseling service is selected through the scenario processing unit 550, the authentication linking unit 581 controls the switch mirroring unit 560 to provide a voice input in real time to the STT/SV 700, When the authentication result information is received from the STT/SV 700 by comparing the voice print for the real-time voice of the customer and the registered voice print and the authentication result information is determined to be successful, the virtual counseling service is started. Print the command.

상기 응답 연동부(582)는 상기 인증 연동부(581)로부터 가상상담서비스 개시명령이 입력되면 상기 응답부(500)를 통해 상기 시나리오 및 고객의 선택에 따른 가상상담서비스를 제공한다.The response linking unit 582 provides a virtual counseling service according to the scenario and a customer's selection through the response unit 500 when a command to start a virtual counseling service is input from the authentication linking unit 581.

상기 응답 연동부(582)는 인공지능 응답부(610)와 연동 동작을 처리하는 인공지능 연동부(583) 및 음성변환부(620)와 연동 동작을 처리하는 음성합성 연동부(584)를 포함한다.The response linking unit 582 includes an artificial intelligence linking unit 583 for processing an interlocking operation with the artificial intelligence response unit 610 and a speech synthesis linking unit 584 for processing an interlocking operation with the voice conversion unit 620 do.

상기 제3자 연동부(585)는 상기 호 처리부(540)를 통한 호 종료 시 상기 호에 대한 정보 및 가상상담서비스 시나리오 정보 및 그에 따른 상담 내용정보를 상기 정보 분석부(800)로 제공하여, 상기 정보 분석부(800)가 통화로가 형성된 호 및 해당 고객에 대한 가상상담서비스 이용 이력, 가상상담서비스에 대한 상담 내용 정보를 저장하고, 저장된 정보에 대한 통계 정보 및 분석정보를 저장하도록 한다.The third-party linking unit 585 provides information on the call, virtual consultation service scenario information, and consultation content information according to the call when the call is terminated through the call processing unit 540 to the information analysis unit 800, The information analysis unit 800 stores a call with a call path, a history of using a virtual counseling service for a corresponding customer, and information about counseling contents for a virtual counseling service, and stores statistical information and analysis information about the stored information.

도 3은 본 발명에 따른 인공지능 자동응답시스템의 음성인식/화자인증부의 구성을 나타낸 도면이다.3 is a diagram showing the configuration of a voice recognition/speaker authentication unit of an artificial intelligence automatic response system according to the present invention.

도 3을 참조하면, 상기 STT/SV(700)는 인증 저장부(710), 인증 통신부(720) 및 인증 제어부(730)를 포함한다.Referring to FIG. 3, the STT/SV 700 includes an authentication storage unit 710, an authentication communication unit 720, and an authentication control unit 730.

상기 인증 저장부(710)는 본 발명에 따른 STT/SV(700)의 전반적인 동작을 제어하기 위한 제어프로그램을 저장하는 프로그램영역, 상기 제어프로그램 수행 중에 발생되는 데이터를 일시 저장하는 임시영역, 상기 제어프로그램 수행에 필요한 데이터 및 상기 제어프로그램 수행 중에 발생되는 데이터를 반영구적으로 저장하는 데이터영역을 포함한다. 상기 데이터영역에는 본 발명에 따라 고객 음성 저장부(711) 및 음성프린트 DB(712)가 구성될 수 있을 것이다.The authentication storage unit 710 is a program area for storing a control program for controlling the overall operation of the STT/SV 700 according to the present invention, a temporary area for temporarily storing data generated during execution of the control program, and the control It includes a data area for semi-permanently storing data required for program execution and data generated during execution of the control program. In the data area, a customer voice storage unit 711 and a voice print DB 712 may be configured according to the present invention.

상기 고객 음성 저장부(711)는 음성프린트 등록 모드에서 AI VH(500)로부터 미러링되는 음성을 제1시간동안 수집하여 저장하고, 인증 모드에서 AI VH(500)로부터 미러링되는 음성을 제2시간동안 수집하여 저장한다. 상기 각 시간 이후 활용된 음성은 고객 음성 저장부(711)에서 삭제되는 것이 바람직할 것이다. 상기 제1시간은 제2시간보다 큰 시간인 것이 바람직할 것이다. 일 예로 상기 제1시간은 40초 이상이고, 제2시간은 5초 등이 될 수 있을 것이다.The customer voice storage unit 711 collects and stores the voice mirrored from the AI VH 500 for a first time in the voice print registration mode, and stores the voice mirrored from the AI VH 500 for a second time in the authentication mode. Collect and store. It is preferable that the voice used after each time is deleted from the customer voice storage unit 711. It will be preferable that the first time is a time greater than the second time. For example, the first time may be 40 seconds or more, and the second time may be 5 seconds or the like.

음성프린트 DB(712)는 고객별 등록 음성프린트를 저장하며, 다른 실시예에 따라 상기 고객별 등록 음성프린트를 분할한 고객별 티켓을 저장한다.The voice print DB 712 stores a registered voice print for each customer, and stores a ticket for each customer obtained by dividing the registered voice print for each customer according to another embodiment.

인증 제어부(730)는 모드 식별부(740), 음성 수집부(750), 음성 인식부(760), 등록부(770) 및 인증부(780)를 포함하여, 본 발명에 따른 STT/SV(700)의 전반적인 동작을 제어한다.The authentication control unit 730 includes a mode identification unit 740, a voice collection unit 750, a voice recognition unit 760, a registration unit 770, and an authentication unit 780, and STT/SV 700 according to the present invention. ) To control the overall operation.

구체적으로, 상기 모드 식별부(740)는 상기 AI VH(500)의 고객에 대한 음성프린트 등록 확인 요청에 의해 상기 음성프린트 DB(712)를 검색하여 해당 고객의 음성프린트 등록 여부를 확인하고, 음성프린트의 등록 여부에 따라 음성프린트 등록 모드 및 음성인증 모드 중 하나를 설정한다.Specifically, the mode identification unit 740 searches the voice print DB 712 in response to a voice print registration confirmation request for the customer of the AI VH 500 to check whether the customer has registered the voice print, and One of a voice print registration mode and a voice authentication mode is set according to whether or not a print is registered.

음성 수집부(750)는 상기 모드 식별부(740)에 의해 식별된 모드에 따라 인증 통신부(720)를 통해 상기 AI VH(500)로부터 제공되는 음성을 수집하여 상기 고객 음성 저장부(711)에 저장한다. 상기 음성 수집부(750)는 저장된 음성에 대한 음성프린트가 생성되거나, 등록 완료되거나, 인증 완료되는 경우 상기 고객 음성 저장부(711)에 저장된 해당 고객의 음성을 삭제한다.The voice collection unit 750 collects the voice provided from the AI VH 500 through the authentication communication unit 720 according to the mode identified by the mode identification unit 740 and stores the voice in the customer voice storage unit 711. Save it. The voice collection unit 750 deletes the customer's voice stored in the customer voice storage unit 711 when a voice print of the stored voice is generated, registration is completed, or authentication is completed.

음성 인식부(760)는 상기 음성 수집부(750)를 통해 획득되는 음성에 대해 음성인식을 수행하여 상기 음성에 대응하는 텍스트 데이터를 출력한다.The voice recognition unit 760 performs voice recognition on the voice acquired through the voice collection unit 750 and outputs text data corresponding to the voice.

상기 등록부(770)는 상기 음성프린트 등록 모드에서 상기 음성 수집부(750)를 통해 수집되는 음성으로부터 등록 음성프린트를 생성하여 상기 음성프린트 DB(712)에 저장하여 등록한다.The registration unit 770 generates a registered voice print from the voice collected through the voice collection unit 750 in the voice print registration mode, stores it in the voice print DB 712 and registers it.

인증부(780)는 상기 인증 모드에서 상기 음성 수집부(750)를 통해 수집되는 음성으로부터 인증 음성프린트를 생성하고, 생성된 인증 음성프린트와 상기 음성프린트 DB(712)의 해당 고객의 등록 음성프린트를 비교하여 일치 여부에 따른 화자인증을 수행하고, 그 인증 결과정보를 상기 인증 통신부(720)를 통해 상기 AI VH(500)로 전송한다.The authentication unit 780 generates an authentication voice print from the voice collected through the voice collection unit 750 in the authentication mode, and the generated authentication voice print and the registered voice print of the corresponding customer in the voice print DB 712 The speaker authentication is performed according to whether or not they match, and the authentication result information is transmitted to the AI VH 500 through the authentication communication unit 720.

다른 실시예에 따라 상기 등록부(770)는 등록 음성 프린트 생성부(771), 음성프린트 분리부(772), 티켓 등록부(773) 및 영수증 등록부(774)를 포함하여 구성될 수 있을 것이다.According to another embodiment, the registration unit 770 may include a registered voice print generation unit 771, a voice print separation unit 772, a ticket registration unit 773, and a receipt registration unit 774.

상기 등록 음성프린트 생성부(771)는 상기 음성프린트 등록 모드에서 상기 음성 수집부(750)를 통해 수집되는 음성으로부터 등록할 음성프린트인 등록 음성프린트를 생성한 후 암호화한다.The registered voice print generation unit 771 generates and encrypts a registered voice print, which is a voice print to be registered, from the voice collected through the voice collection unit 750 in the voice print registration mode.

음성프린트 분리부(772)는 상기 등록 음성프린트를 티켓과 영수증으로 분리한다. 상기 음성프린트를 티켓과 영수증을 분리하는 방법으로는 단순 크기(1/2 비율, 1/3 비율 등등)로 분리하는 방법 등과 같이 다양하게 적용될 수 있을 것이다.The voice print separation unit 772 separates the registered voice print into a ticket and a receipt. A method of separating the voice print from a ticket and a receipt may be applied in various ways, such as a method of separating a simple size (1/2 ratio, 1/3 ratio, etc.).

티켓 등록부(773)는 분할된 상기 티켓을 상기 음성프린트 DB(712)에 저장하여 등록한다.The ticket registration unit 773 stores and registers the divided tickets in the voice print DB 712.

영수증 등록부(774)는 상기 영수증을 상기 인증 통신부(720)를 통해 상기 외부저장부(1000)로 전송하여 감독기관 서버(1000) 및 블록체인 네트워크 등의 외부저장부에 등록한다. The receipt registration unit 774 transmits the receipt to the external storage unit 1000 through the authentication communication unit 720 and registers the receipt in an external storage unit such as the supervisory authority server 1000 and a blockchain network.

상기 실시예의 경우, 상기 인증부(780)는 인증 음성프린트 생성부(781), 티켓 획득부(782), 영수증 획득부(783), 등록 음성프린트 결합부(784), 복호부(785), 비교부(786) 및 인증결과 통지부(787)를 포함할 수 있을 것이다.In the case of the above embodiment, the authentication unit 780 includes an authentication voice print generation unit 781, a ticket acquisition unit 782, a receipt acquisition unit 783, a registration voice print combining unit 784, a decoding unit 785, A comparison unit 786 and an authentication result notification unit 787 may be included.

상기 인증 음성프린트 생성부(781)는 상기 인증 모드에서 상기 음성 수집부(750)로부터 수집되는 음성으로부터 인증할 음성프린트인 인증 음성프린트를 생성한다.The authentication voice print generation unit 781 generates an authentication voice print that is a voice print to be authenticated from the voice collected from the voice collection unit 750 in the authentication mode.

티켓 획득부(782)는 상기 인증할 음성을 제공한 고객에 대응하는 티켓을 상기 음성프린트 DB(712)로부터 획득한다.The ticket acquisition unit 782 obtains a ticket corresponding to the customer who provided the voice to be authenticated from the voice print DB 712.

상기 영수증 획득부(783)는 상기 고객에 대한 영수증을 상기 감독기관 서버(1000), 블록체인 네트워크 등으로부터 획득한다.The receipt acquisition unit 783 obtains a receipt for the customer from the supervisory authority server 1000, a blockchain network, and the like.

등록 음성프린트 결합부(784)는 상기 획득된 티켓 및 영수증을 결합하여 암호화된 등록 음성프린트를 생성한다.The registered voice print combining unit 784 combines the obtained ticket and receipt to generate an encrypted registered voice print.

상기 복호부(785)는 암호화된 상기 등록 음성프린트를 복호하여 출력한다.The decryption unit 785 decodes and outputs the encrypted registered voice print.

비교부(786)는 상기 복호된 등록 음성프린트와 상기 인증 음성프린트를 비교하여 일치 여부에 따른 화자인증을 수행하여 인증 결과정보를 출력한다.The comparison unit 786 compares the decoded registered voice print and the authenticated voice print, performs speaker authentication according to whether they match, and outputs authentication result information.

인증결과 통지부(787)는 상기 인증 결과정보를 상기 인증 통신부(720)를 통해 상기 AI VH(500)로 전송한다.The authentication result notification unit 787 transmits the authentication result information to the AI VH 500 through the authentication communication unit 720.

도 4는 본 발명에 따른 화자인증을 이용한 고객인증 기능을 가지는 인공지능 자동응답 방법 중 고객의 음성프린트 등록 방법을 나타낸 절차도이다.4 is a flowchart showing a method of registering a voice print of a customer among artificial intelligence automatic response methods having a customer authentication function using speaker authentication according to the present invention.

도 4를 참조하면, 우선 AI VH(500)는 교환기(400)를 통해 호가 착신되는지를 모니터링하고(S111), 호가 착신되면 교환기(400)를 제어하여 통화로를 형성시킨 후, 서비스 시나리오에 따른 안내 음성을 형성된 상기 통화로를 통해 고객에게 제공한다(S113). 이때의 안내 음성은 서비스 메뉴의 선택을 요청하는 안내 음성일 것이다.Referring to FIG. 4, first, the AI VH 500 monitors whether a call is received through the switch 400 (S111), and when the call is received, the switch 400 is controlled to form a channel, and then according to the service scenario. Guidance voice is provided to the customer through the formed call path (S113). The guide voice at this time will be a guide voice requesting selection of a service menu.

상기 안내 음성 제공 후 AI VH(500)는 가상상담서비스 신청이 선택되는지를 모니터링하고(S115), 가상상담서비스가 아닌 다른 서비스가 선택되면 해당 서비스에 대응하는 서비스 시나리오에 따라 해당 서비스를 제공한다(S117).After providing the guidance voice, the AI VH 500 monitors whether a virtual consultation service application is selected (S115), and when a service other than the virtual consultation service is selected, provides the corresponding service according to the service scenario corresponding to the service ( S117).

반면, 가상상담서비스 신청이 선택되면 AI VH(500)는 STT/SV(700)와 연동하여 화자인증 등록, 즉 상기 호를 발신한 고객에 대한 음성프린트가 등록되어 있는지를 확인하고(S119) 등록 여부를 판단한다(S121).On the other hand, when the virtual consultation service application is selected, the AI VH 500 interlocks with the STT/SV 700 to check whether speaker authentication registration, that is, a voice print for the customer who sent the call, is registered (S119). Whether or not it is determined (S121).

등록되어 있으면 AI VH(500)는 후술할 다음의 도 5에 따른 화자인증 가상상담 서비스를 실행한다(S123). 상기 화자인증 가상상담서비스는 다음의 도 5를 참조하여 상세히 설명한다.If registered, the AI VH 500 executes the speaker authentication virtual counseling service according to FIG. 5, which will be described later (S123). The speaker authentication virtual counseling service will be described in detail with reference to FIG. 5 below.

반면, 미등록되어 있으면 AI VH(500)는 다음과 같이 등록 음성프린트 등록 과정을 수행한다.On the other hand, if it is not registered, the AI VH 500 performs a registered voice print registration process as follows.

AI VH(500)는 STT/SV(700) 및 STT(620)로 고객으로부터 수신되는 음성을 미러링하여 제공함에 의해 가상상담 서비스를 개시한다(S125, S129, S131).The AI VH 500 initiates a virtual consultation service by mirroring and providing the voice received from the customer to the STT/SV 700 and STT 620 (S125, S129, S131).

STT/SV(700)는 가상상담 서비스가 개시되면 고객의 음성을 수집하기 시작한다(S127).STT/SV 700 starts to collect the customer's voice when the virtual consultation service is started (S127).

상기 등록 음성프린트 등록을 위한 가상상담 서비스의 개시 시 STT/SV(700)는 시나리오에 따라 현재 등록 음성프린트 등록을 위한 가상상담 서비스를 제공 중임을 안내하고 제1시간 이상의 음성을 수집함을 고객에게 안내하는 것이 바람직할 것이다.Upon the start of the virtual counseling service for registering the registered voice print, the STT/SV 700 informs the customer that the virtual counseling service for registering the registered voice print is currently being provided according to the scenario and collects voices for more than one hour. It would be desirable to guide.

AI VH(500)는 가상상담 서비스의 개시 후 서비스가 종료되는지를 검사하고(S133), 서비스가 종료되면 가상상담 서비스가 종료되었음을 STT/SV(700)로 통지하여(S135), 상기 STT/SV(700)가 상기 서비스 종료 시까지 수집된 고객의 음성에 대한 등록 음성프린트를 생성하여 등록하도록 제어한다(S137). 이때, STT/SV(700)는 실시예에 따라 상기 등록 음성프린트를 티켓 및 영수증으로 분할할 수 있을 것이다.The AI VH 500 checks whether the service is terminated after the start of the virtual consultation service (S133), and when the service is terminated, it notifies the STT/SV 700 that the virtual consultation service is terminated (S135), and the STT/SV The 700 generates and registers a registered voice print for the customer's voice collected until the service is terminated (S137). At this time, the STT/SV 700 may divide the registered voice print into tickets and receipts according to an embodiment.

티켓 및 영수증으로 분할한 경우 STT/SV(700)는 영수증을 포함하는 영수증 저장 요청 정보를 외부저장부(1000)로 전송한다(S139).When divided into tickets and receipts, the STT/SV 700 transmits the receipt storage request information including the receipt to the external storage unit 1000 (S139).

그러면 상기 외부저장부(1000)는 상기 영수증을 내부 저장수단에 고객별로 저장하여 관리할 것이다(S141).Then, the external storage unit 1000 will store and manage the receipt for each customer in the internal storage means (S141).

반면, STT/SV(700)는 티켓을 내부 인증 저장부(710)의 음성프린트 DB(712)에 저장하고, 고객 음성 저장부(711)에 저장된 고객의 음성을 삭제한다(S143). On the other hand, the STT/SV 700 stores the ticket in the voice print DB 712 of the internal authentication storage unit 710, and deletes the customer's voice stored in the customer voice storage unit 711 (S143).

도 5는 본 발명에 따른 화자인증을 이용한 고객인증 기능을 가지는 인공지능 자동응답 방법 중 고객인증에 따른 자동응답 방법을 나타낸 절차도이다.5 is a flowchart showing an automatic response method according to customer authentication among artificial intelligence automatic response methods having a customer authentication function using speaker authentication according to the present invention.

도 5를 참조하면, AI VH(500)는 가상상담서비스 시나리오에 따라 고객에게 안내 음성을 제공하고, STT/SV(700) 및 TTS(620)로 고객으로부터 수신되는 음성을 미러링하여 제공함에 의해 가상상담서비스를 개시한다(S211, S213, S215).5, the AI VH 500 provides a guide voice to the customer according to the virtual counseling service scenario, and mirrors the voice received from the customer to the STT/SV 700 and the TTS 620, thereby providing a virtual Start counseling service (S211, S213, S215).

가상상담서비스가 개시되면, AI VH(500)와 연동하는 ST/SV(700)는 고객의 음성을 제2시간 단위로 실시간 획득하고, 상기 획득된 고객의 음성에 대응하는 인증 음성프린트를 생성한다(S217). 상기 설명에서는 인증모드에서 상기 시간단위로 음성을 획득하여 인증 음성프린트를 생성하는 경우를 나타내었으나, 시간단위가 아닌 단어단위, 문장단위로 음성을 획득하여 인증 음성프린트를 생성할 수도 있을 것이다.When the virtual counseling service is started, the ST/SV 700 interworking with the AI VH 500 acquires the customer's voice in real time every second time, and generates an authentication voice print corresponding to the acquired customer's voice. (S217). In the above description, the authentication voice print is generated by acquiring the voice in the time unit in the authentication mode. However, it may be possible to generate the authenticated voice print by acquiring the voice in a word unit or a sentence unit instead of a time unit.

인증 음성프린트가 생성되면 STT/SV(700)는 외부저장부(1000)로 상기 호를 발신한 고객에 대한 영수증을 요청하여 수신하고(S219, S221), 상기 고객에 대한 티켓과 영수증을 결합하여 등록 음성프린트를 생성한 후 복호한다(S223).When the authentication voice print is generated, the STT/SV 700 requests and receives a receipt for the customer who sent the call to the external storage unit 1000 (S219, S221), and combines the ticket and the receipt for the customer. After generating a registered voice print, it is decoded (S223).

등록 음성프린트가 획득되면 STT/SV(700)는 인증 음성프린트와 등록 음성프린트를 비교하여 검증을 수행하고, 검증 성공인지를 판단한다(S225, S227).When the registered voice print is acquired, the STT/SV 700 performs verification by comparing the authenticated voice print and the registered voice print, and determines whether the verification is successful (S225 and S227).

검증에 실패하면 STT/SV(700)는 검증실패에 따른 인증 결과정보를 AI VH(500)로 통지하고 종료한다(S229). 상기 검증실패를 통지받은 AI VH(500)는 검증실패에 대응하는 시나리오상의 안내 음성을 고객에게 제공하는 것이 바람직할 것이다.If the verification fails, the STT/SV 700 notifies the authentication result information according to the verification failure to the AI VH 500 and ends (S229). It would be desirable for the AI VH 500 to receive notification of the verification failure to provide a guide voice in a scenario corresponding to the verification failure to the customer.

반면, 검증에 성공하면 STT/SV(700)는 인증 결과정보를 AI VH(500)로 전송하여 검증 성공을 통지한다(S230).On the other hand, if the verification is successful, the STT/SV 700 transmits the authentication result information to the AI VH 500 to notify the verification success (S230).

검증 성공을 통지받은 AI VH(500)는 정상적으로 고객이 서비스를 종료할 때까지(S237) 인공지능 응답부(610) 및 TTS(620)와 연동하여 가상상담서비스를 제공한다(S231, S233, S235).The AI VH 500, which is notified of the success of the verification, provides a virtual consultation service in conjunction with the artificial intelligence response unit 610 and the TTS 620 until the customer normally terminates the service (S237) (S231, S233, S235). ).

한편, 본 발명은 전술한 전형적인 바람직한 실시예에만 한정되는 것이 아니라 본 발명의 요지를 벗어나지 않는 범위 내에서 여러 가지로 개량, 변경, 대체 또는 부가하여 실시할 수 있는 것임은 당해 기술분야에서 통상의 지식을 가진 자라면 용이하게 이해할 수 있을 것이다. 이러한 개량, 변경, 대체 또는 부가에 의한 실시가 이하의 첨부된 특허청구범위의 범주에 속하는 것이라면 그 기술사상 역시 본 발명에 속하는 것으로 보아야 한다.Meanwhile, it is common knowledge in the art that the present invention is not limited to the above-described typical preferred embodiment, but can be implemented in various ways without departing from the gist of the present invention. Anyone who has a will be able to understand it easily. If implementation by such improvement, change, substitution or addition falls within the scope of the following appended claims, the technical idea should also be considered to belong to the present invention.

100: 사용자 단말부 200: 인공지능 자동응답시스템
300: 컴퓨터 전화 통합 시스템(CTI) 400: 교환기(PBX)
500: 인공지능 음성 허브부(AI VH) 510: 허브 저장부
511: 시나리오 DB 520: 허브 통신부
530: 허브 제어부 540: 호 처리부
550: 시나리오 처리부 560: 스위치 미러링부
570: 화자 분리부 580: 연동 처리부
600: 응답부 610: 인공지능 응답부
620: 음성 변환부 700: 음성인식/화자인증부(STT/SV)
710: 인증 저장부 711: 고객 음성 저장부
712: 음성프린트 DB 720: 인증 통신부
730: 인증 제어부 740: 모드 식별부
750: 음성 수집부 760: 음성 인식부
770: 등록부 780: 인증부
800: 정보 분석부 1000: 감독기관 서버100: user terminal 200: artificial intelligence automatic response system
300: Computer Telephony Integration System (CTI) 400: Switchboard (PBX)
500: artificial intelligence voice hub unit (AI VH) 510: hub storage unit
511: scenario DB 520: hub communication unit
530: hub control unit 540: call processing unit
550: scenario processing unit 560: switch mirroring unit
570: speaker separation unit 580: interlocking processing unit
600: response unit 610: artificial intelligence response unit
620: voice conversion unit 700: voice recognition/speaker authentication unit (STT/SV)
710: authentication storage unit 711: customer voice storage unit
712: voice print DB 720: authentication communication unit
730: authentication control unit 740: mode identification unit
750: voice collection unit 760: voice recognition unit
770: register 780: certification
800: information analysis unit 1000: supervisory authority server

Claims

A voice recognition/speaker authentication unit (STT/SV) for performing voice recognition for the received voice, registering a voice print for each customer, and performing speaker authentication based on the registered voice print, and transmitting the authentication result information;
A response unit for transmitting a response voice to the voice-recognized text data; And
When a call is received from a switchboard (PBX), a call path is formed, and an automatic response service is provided according to the set scenario, but when a virtual counseling service is selected from a customer registered with a voice print, the voice input in real time to the voice recognition/speaker authentication unit is And, when the authentication result information received from the voice recognition/speaker authentication unit is determined to be successful by comparing the voice print for the real-time voice of the customer and the registered voice print, the scenario and An artificial intelligence automatic response system having a customer authentication function using speaker authentication, characterized in that it includes an artificial intelligence voice hub (AI VH) that provides a virtual consultation service according to the customer's selection.

The method of claim 1,
The response unit,
An artificial intelligence response unit that receives the speech-recognized text data through the artificial intelligence speech hub unit, searches for a response text corresponding to the text data, and transmits it to the artificial intelligence speech hub unit; And
Having a customer authentication function using speaker authentication, characterized in that it comprises a voice conversion unit (TTS) that receives the response text through the artificial intelligence voice hub and transmits the response voice converted into voice to the artificial intelligence voice hub. Artificial intelligence automatic response system.

The method of claim 1,
The response unit,
A voice artificial intelligence response unit that receives the speech-recognized text data through the artificial intelligence voice hub unit, searches for a response text corresponding to the text data, converts the searched response text into speech, and transmits the result to the artificial intelligence speech hub unit. Artificial intelligence automatic response system having a customer authentication function using speaker authentication, characterized in that.

The method of claim 1,
The artificial intelligence voice hub unit,
A hub communication unit for performing data communication and voice communication with the voice recognition/speaker authentication unit and the response unit;
A hub storage unit including a scenario DB for storing at least one scenario according to the automatic response service; And
Scenario setting according to the automatic response service and the interlocking setting of the voice recognition/speaker authentication unit and the response unit are performed, a call path is formed when a call is received from the switchboard (PBX), and an automatic response service according to the set scenario is provided, When selecting a virtual counseling service from a registered customer, a voice inputted in real time is provided to the voice recognition/speaker authentication unit, and the voice recognition / by comparing the voice print of the real-time voice of the customer with the registered voice print. Speaker authentication comprising a hub control unit that provides a virtual consultation service according to the scenario and customer selection through the response unit and the voice conversion unit when the authentication result information received from the speaker authentication unit is determined to be authentication success. Artificial intelligence automatic response system with customer authentication function using.

The method of claim 4,
The hub control unit,
A call processing unit that forms a call path when a call is received and performs call transfer processing according to a call transfer request;
Scenario setting according to the automatic response service and interlocking setting of the voice recognition/speaker authentication unit and response unit are performed, and guidance voice according to the set scenario through the call formed when the call is formed through the call processing unit is provided to the user terminal of the customer A scenario processing unit;
A speaker separating unit for separating a customer's voice received during the call and a guide voice and a response voice according to the scenario and outputting only the customer's voice;
A switch mirroring unit that mirrors the customer's voice output from the speaker separation unit and provides it to the voice recognition/speaker authentication unit and the response unit; And
When a virtual counseling service is selected from a customer for whom a voice print is registered in connection with the voice recognition/speaker authentication unit and the response unit, a voice inputted in real time to the voice recognition/speaker authentication unit is provided, and a voice print for the real-time voice of the customer When it is determined that the authentication result information received from the voice recognition/speaker authentication unit is successful by comparing the registered voice print with the registered voice print, the linkage providing a virtual consultation service according to the scenario and the customer's selection through the response unit Artificial intelligence automatic response system having a customer authentication function using speaker authentication, characterized in that it comprises a processing unit.

The method of claim 5,
The interlocking processing unit,
When it is determined that the virtual counseling service is selected through the scenario processing unit, the switch mirroring unit is controlled to provide a real-time voice input to the voice recognition/speaker authentication unit, and the voice for the real-time voice of the customer from the voice recognition/speaker authentication unit. An authentication linking unit configured to output a virtual counseling service start command when the authentication result information is received by comparing the print and the registered voice print and the authentication result information is determined to be successful;
An artificial having a customer authentication function using speaker authentication, characterized in that it includes a response linking unit that provides a virtual counseling service according to the scenario and the customer's selection through the response unit when a command to start a virtual counseling service is input from the authentication linking unit. Intelligent automatic response system.

The method of claim 6,
Further comprising an information analysis unit for storing the call with a call path, the history of use of the virtual consultation service for the customer, and information on the consultation contents for the virtual consultation service, and storing statistical information and analysis information on the stored information,
The interlocking processing unit,
Customer authentication using speaker authentication, characterized in that it further comprises a third-party linking unit that provides information on the call, virtual counseling service scenario information, and consultation content information according to the call to the information analysis unit when the call is terminated through the call processing unit. Artificial intelligence automatic response system with functions.

The method of claim 1,
The artificial intelligence voice hub unit,
An authentication communication unit for performing data communication and voice communication with the response unit and the artificial intelligence voice hub unit;
An authentication storage unit including a customer voice storage unit for storing customer voices and a voice print DB for storing customer-specific voice prints; And
Voice recognition for each customer's voice received from the artificial intelligence voice hub unit through the authentication communication unit, registration of customer-specific voice prints, and speaker authentication by registered voice prints, and the authentication result information through the authentication communication unit. An artificial intelligence automatic response system having a customer authentication function using speaker authentication, characterized in that it comprises an authentication control unit transmitting to the artificial intelligence voice hub unit.

The method of claim 8,
The authentication control unit,
In response to a voice print registration confirmation request for the customer of the artificial intelligence voice hub, the voice print DB is searched to check whether the customer has registered the voice print, and depending on whether the voice print is registered, the voice print registration mode and the voice authentication mode are selected. A mode identification unit to set one;
A voice collection unit collecting voice provided from the artificial intelligence voice hub unit through an authentication communication unit according to the mode identified by the mode identification unit;
A speech recognition unit configured to perform speech recognition on the speech acquired through the speech collection unit and output text data corresponding to the speech;
A registration unit that generates a voice print from the voice collected through the voice collection unit in the voice print registration mode, stores it in the voice print DB, and registers it; And
In the voice authentication mode, a voice print is generated from the voice collected through the voice collection unit, and speaker authentication is performed according to whether or not they match by comparing the generated voice print with the registered voice print of the corresponding customer in the voice print DB. An artificial intelligence automatic response system having a customer authentication function using speaker authentication, comprising: an authentication unit transmitting authentication result information to the artificial intelligence voice hub unit through the authentication communication unit.

An automatic answering service initiation process in which the artificial intelligence voice hub forms a call path when a call is received from a switchboard (PBX) and initiates an automatic answering service according to a set scenario;
When the customer selects the virtual consultation service, the artificial intelligence voice hub unit provides real-time voice input to the voice recognition/speaker authentication unit if the customer who sent the call is a customer who has registered a voice print, and a voice print for the real-time voice of the customer A speaker authentication process of receiving authentication result information received from the voice recognition/speaker authentication unit by comparing the registered voice print with the registered voice print; And
When it is determined that the speaker authentication is successful, the artificial intelligence voice hub unit provides a virtual consultation service according to the scenario and the customer's selection through a response unit and a voice conversion unit. Artificial intelligence automatic response method with customer authentication function using.

The method of claim 10,
The process of starting the automatic response service,
The artificial intelligence voice hub part forms a call path when a call is received from the switchboard (PBX), provides voice guidance according to the scenario through the call path, and receives the customer's voice according to the scenario to recognize the voice/speaker authentication A voice mirroring step of mirroring to a part;
A voice recognition step of the artificial intelligence voice hub unit interworking with a voice recognition/speaker authentication unit to provide text data, which is a voice recognition result of the mirrored customer's voice, to a response unit;
A response acquiring step of acquiring a response text for the text data by interworking with a response unit by the artificial intelligence voice hub unit; And
And a response step of converting the obtained response text into voice by the artificial intelligence voice hub unit and providing it to the user terminal of the customer. 15. An artificial intelligence automatic response method having a customer authentication function using speaker authentication.

The method of claim 10,
The speaker authentication process,
The artificial intelligence voice hub unit interworking with the voice recognition/speaker authentication unit to check whether or not to register the voice print for the customer who sent the call;
A mode identification step of identifying and setting a voice print registration mode and a voice authentication mode according to whether or not the voice print is registered by a voice recognition/speaker authentication unit linked by the artificial intelligence voice hub unit;
A voice mirroring step in which the artificial intelligence voice hub mirrors the customer's voice to the voice recognition/speaker authentication unit according to the mode identified by the mode identification unit;
A voice print registration step of generating a voice print from the voice by a voice recognition/speaker authentication unit interlocked with the artificial intelligence voice hub unit and storing and registering it in a voice print DB when the set mode is a voice print registration mode; And
If the set mode is an authentication mode, a voice print is generated from the voice by the voice recognition/speaker authentication unit linked to the artificial intelligence voice hub unit, and the generated voice print is compared with the registered voice print of the corresponding customer in the voice print DB. Thus, the artificial intelligence automatic response method having a customer authentication function using speaker authentication, characterized in that it comprises an authentication step of performing speaker authentication according to the match or not, and the authentication result information.

The method of claim 10,
The artificial intelligence voice hub unit interlocks with the information analysis unit to store the call for which the call path is formed, the history of using the virtual counseling service for the customer, and information about the counseling content for the virtual counseling service, and store statistical information and analysis information on the stored information. Artificial intelligence automatic response method having a customer authentication function using speaker authentication, characterized in that it further comprises a storing information analysis process.