KR102424848B1

KR102424848B1 - Realtime consultation quality evaluation method of electronic apparatus for utilizing consultation pattern model of best consultant based on artificial intelligence, and system

Info

Publication number: KR102424848B1
Application number: KR1020210133238A
Authority: KR
Inventors: 손소열; 김종찬
Original assignee: 주식회사 두유비
Priority date: 2021-10-07
Filing date: 2021-10-07
Publication date: 2022-07-25

Abstract

A counseling quality evaluation method of an electronic device is disclosed. The counseling quality evaluation method comprises the steps of: providing a voice call between a customer terminal and a counselor terminal; obtaining audio data on the basis of a spoken voice of the counselor input through the counselor terminal while the voice call is provided; converting the audio data to obtain a text; analyzing the audio data to obtain feature information of the audio data, and generating voice quality information by comparing the obtained feature information with pre-stored feature information; analyzing the text to obtain rule information of the text, and generating text quality information by comparing the obtained rule information with pre-stored rule information; and providing voice quality information and text quality information through the counselor terminal while the voice call is provided.

Description

REALTIME CONSULTATION QUALITY EVALUATION METHOD OF ELECTRONIC APPARATUS FOR UTILIZING CONSULTATION PATTERN MODEL OF BEST CONSULTANT BASED ON ARTIFICIAL INTELLIGENCE, AND SYSTEM }

본 개시는 전자 장치의 실시간 상담 품질 평가 방법에 관한 것으로, 보다 상세하게는, 상담사들의 실시간 상담 진행 시 상담 품질 평가를 자동으로 진행하여 상담사의 자발적인 상담품질 관리를 지원하는 전자 장치 및 시스템에 관한 것이다.The present disclosure relates to a method for evaluating real-time counseling quality of an electronic device, and more particularly, to an electronic device and system for supporting voluntary counseling quality management of counselors by automatically performing counseling quality evaluation when counselors perform real-time counseling. .

콜센터는 기업 입장에서 고객을 응대하기 위한 최우선 채널이며, 상담사의 상담 응대 품질은 기업의 제품 판매, 계약 체결, 고객 불만 해소, 고객 요구 접수 등 기업 마케팅 및 브랜드 경쟁력 제고에 결정적인 요소이다.The call center is the top priority channel for responding to customers from a company's point of view, and the quality of response by counselors is a decisive factor in corporate marketing and enhancing brand competitiveness, such as product sales, contract signing, customer complaint resolution, and customer request reception.

최근 지속되는 코로나 19의 장기화로 인하여 고객과 콜센터 상담사 간 전화 상담이 증가하고 있으며, 특히 콜센터 업무는 한정된 공간 내 다수의 상담사가 직접적인 음성발화를 수행한다는 점에서 대표적인 코로나 취약 업종이다.Due to the prolonged COVID-19 outbreak, phone consultations between customers and call center counselors are increasing, and in particular, call center work is a representative industry vulnerable to Corona in that a large number of counselors in a limited space perform direct voice communication.

이에 현 시국에서는 비대면 환경 하에서 원활한 고객 응대를 수행할 수 있으면서도 상담사의 상담 품질에 대한 관리를 지원할 수 있는 시스템이 필요하다.Therefore, in the current state, a system that can support the management of the counseling quality of counselors while being able to perform smooth customer service in a non-face-to-face environment is needed.

상담사 교육의 경우 현재 대부분의 콜센터는 상담 관리자들이 일정기간 동안의 상담사 통화한 이력 중 소수의 일부 콜만 랜덤으로 선별하여 청취하고, 이를 정해진 상담 품질 관리 규칙에 의거하여 말 빠르기, 밝은 어조, 규정된 발화 스크립트의 누락 여부 등을 청취 후 수기로 평가하는 형태로 진행하고 있으며, 이는 상담 진행시간 이상의 녹취 청취 평가 시간이 소요되는 바 전수 검사가 현실적으로 불가한 실정이다.In the case of counselor training, at present, most call centers allow counseling managers to randomly select and listen to only a small number of calls from the counselor’s call history for a certain period of time, and according to the set counseling quality management rules, fast-talk, bright tone, and prescribed utterance. It is conducted in the form of hand-evaluating whether or not the script is omitted after listening, and since it takes more time to evaluate the recording and listening than the consultation process time, it is practically impossible to conduct a complete inspection.

또한 상담 품질 평가 결과에 따라 상담 관리자들은 상담사들을 대상으로 대면 환경 내에서 개인 또는 집체 교육을 시행하기에 이 또한 코로나 시국에서의 위험 요소로 작용하고 있다.In addition, according to the results of the counseling quality evaluation, counseling managers conduct individual or group training for counselors in a face-to-face environment, which is also acting as a risk factor in the Corona crisis.

이러한 콜센터 품질 관리의 인적, 시간적 비용 환경을 개선하고 전수 분석이 아닌 랜덤 콜 청취 분석의 문제점을 개선하기 위하여, 콜센터 시장에서는 음성인식 및 텍스트 분석 기술을 적용하여 음성을 자동으로 텍스트로 변환하고, 변환된 텍스트를 분석하여 상담의 품질을 평가하고 있다. 다만, 이 경우 텍스트만으로는 판단할 수 없는 요소, 즉 말 빠르기, 음성의 고조, 어감 등은 평가가 불가한 단점을 가진다.In order to improve the human and time cost environment of such call center quality management and to improve the problem of random call listening analysis rather than full analysis, in the call center market, speech recognition and text analysis technology are applied to automatically convert speech into text and convert The quality of counseling is evaluated by analyzing the text. However, in this case, it is not possible to evaluate factors that cannot be determined by text alone, ie, speed of speech, tone of voice, and tone.

상술한 종래 기술의 문제점을 해결하기 위하여, 본 개시는 우수 상담사의 상담 사례에 따라 인공지능 모델(엔진)을 훈련시켜 표준 품질 평가 툴을 구축하고, 이를 활용하여 개별 상담사의 실시간 상담이 진행되는 동안 실시간으로 상담 품질에 대한 평가를 제공하는 전자 장치 및 시스템을 제공한다.In order to solve the problems of the prior art described above, the present disclosure trains an artificial intelligence model (engine) according to the counseling case of an excellent counselor to build a standard quality evaluation tool, and utilizes it to train an artificial intelligence model (engine) during real-time counseling of individual counselors. An electronic device and system are provided that provide an assessment of consultation quality in real time.

구체적으로, 본 개시는 상담사의 발화 음성의 특징 정보 및 발화 텍스트의 규칙 정보를 별도로 분석하여 상담 품질에 대한 평가를 수행하며, 상담 중인 상담사가 평가 내용을 곧바로 확인할 수 있도록 실시간 피드백을 제공하는 전자 장치 및 시스템을 제공한다.Specifically, the present disclosure provides an electronic device that separately analyzes characteristic information of a counselor's spoken voice and rule information of a spoken text to evaluate the counseling quality, and provides real-time feedback so that the counselor in counseling can immediately check the evaluation details and systems.

또한, 본 개시는, 상담 품질 평가 표준 모델을 구축하기 위한 우수 상담 사례의 선별을 통한 인공 지능 학습 데이터의 구축과 모델 구축 시 기존 텍스트 데이터 단일 모델만을 사용하는 것이 아닌 오디오 데이터의 분석을 포함하여 더 정교한 품질 평가 모델을 구축하고, 구축된 표준 모델을 실시간으로 상담에 적용하여 품질 평가 결과가 상담사에게 즉시 제공되도록 하는 상담 품질 평가 방법을 제공한다.In addition, the present disclosure further includes analysis of audio data rather than using only a single model of existing text data when constructing artificial intelligence learning data through selection of excellent counseling cases to build a standard model for counseling quality evaluation and building a model. It provides a counseling quality evaluation method that builds a sophisticated quality evaluation model and applies the established standard model to counseling in real time so that the quality evaluation result is immediately provided to the counselor.

본 개시의 목적들은 이상에서 언급한 목적으로 제한되지 않으며, 언급되지 않은 본 개시의 다른 목적 및 장점들은 하기의 설명에 의해서 이해될 수 있고, 본 개시의 실시 예에 의해 보다 분명하게 이해될 것이다. 또한, 본 개시의 목적 및 장점들은 특허 청구 범위에 나타낸 수단 및 그 조합에 의해 실현될 수 있음을 쉽게 알 수 있을 것이다.Objects of the present disclosure are not limited to the above-mentioned purposes, and other objects and advantages of the present disclosure that are not mentioned may be understood by the following description, and will be more clearly understood by examples of the present disclosure. Moreover, it will be readily apparent that the objects and advantages of the present disclosure may be realized by the means and combinations thereof indicated in the claims.

본 개시의 일 실시 예에 따른 전자 장치의 상담 품질 평가 방법은, 고객 단말과 상담사 단말 간의 음성 통화를 제공하는 단계, 상기 음성 통화가 제공되는 동안 상기 상담사 단말을 통해 입력되는 상담사의 발화 음성을 기반으로, 오디오 데이터를 획득하는 단계, 상기 오디오 데이터를 변환하여 텍스트를 획득하는 단계, 상기 오디오 데이터를 분석하여 상기 오디오 데이터의 특징 정보를 획득하고, 상기 획득된 특징 정보를 기저장된 특징 정보와 비교하여 음성 품질 정보를 생성하는 단계, 상기 텍스트를 분석하여 상기 텍스트의 규칙 정보를 획득하고, 상기 획득된 규칙 정보를 기저장된 규칙 정보와 비교하여 텍스트 품질 정보를 생성하는 단계, 상기 음성 통화가 제공되는 동안, 상기 상담사 단말을 통해 상기 음성 품질 정보 및 상기 텍스트 품질 정보를 제공하는 단계를 포함한다.A method for evaluating consultation quality of an electronic device according to an embodiment of the present disclosure includes providing a voice call between a customer terminal and a counselor terminal, based on the counselor's speech voice input through the counselor terminal while the voice call is being provided obtaining audio data, converting the audio data to obtain text, analyzing the audio data to obtain characteristic information of the audio data, and comparing the obtained characteristic information with pre-stored characteristic information. generating voice quality information; analyzing the text to obtain rule information of the text; and comparing the obtained rule information with pre-stored rule information to generate text quality information; while the voice call is provided , providing the voice quality information and the text quality information through the counselor terminal.

상기 전자 장치의 상담 품질 평가 방법은, 오디오 데이터의 특징 정보를 추출하여 일치율을 판단하기 위한 제1 인공지능 모델을 생성하는 단계, 우수 상담사의 발화 음성에 매칭되는 오디오 데이터 및 열위 상담사의 발화 음성에 매칭되는 오디오 데이터를 기반으로, 상기 제1 인공지능 모델을 훈련시키는 단계를 포함할 수 있다. 그리고, 상기 음성 품질 정보를 획득하는 단계는, 상기 인공지능 모델을 통해, 상기 획득된 특징 정보를 상기 기저장된 특징 정보와 비교하여 일치율을 획득할 수 있다. 이 경우, 상기 기저장된 특징 정보는, 적어도 하나의 우수 상담사의 발화 음성에 매칭되는 오디오 데이터의 특징 정보일 수 있다.The method for evaluating the counseling quality of the electronic device includes generating a first artificial intelligence model for determining a match rate by extracting feature information of audio data; It may include training the first artificial intelligence model based on the matching audio data. In addition, the acquiring of the voice quality information may include comparing the acquired feature information with the pre-stored feature information to obtain a matching rate through the artificial intelligence model. In this case, the pre-stored characteristic information may be characteristic information of audio data that matches the speech voice of at least one excellent counselor.

또한, 상기 전자 장치의 상담 품질 평가 방법은, 텍스트의 규칙 정보를 추출하여 일치율을 판단하기 위한 제2 인공지능 모델을 생성하는 단계, 우수 상담사의 발화 음성이 변환된 텍스트 및 열위 상담사의 발화 음성이 변환된 텍스트를 기반으로, 상기 제2 인공지능 모델을 훈련시키는 단계를 포함할 수 있다. 이 경우, 상기 텍스트 품질 정보를 획득하는 단계는, 상기 인공지능 모델을 통해, 상기 획득된 규칙 정보를 상기 기저장된 규칙 정보와 비교하여 일치율을 획득할 수 있다. 그리고, 상기 기저장된 규칙 정보는, 적어도 하나의 우수 상담사의 발화 음성이 변환된 텍스트의 규칙 정보일 수 있다.In addition, the method for evaluating the counseling quality of the electronic device includes the steps of extracting rule information from text and generating a second artificial intelligence model for determining a match rate, text from which the spoken voice of the excellent counselor is converted, and the spoken voice of the inferior counselor It may include training the second artificial intelligence model based on the converted text. In this case, the obtaining of the text quality information may include comparing the obtained rule information with the pre-stored rule information to obtain a matching rate through the artificial intelligence model. In addition, the pre-stored rule information may be rule information of a text in which the speech voice of at least one excellent counselor is converted.

상기 특징 정보는, 음성의 높낮이, 공명, 강도, 및 빠르기 중 적어도 하나의 항목을 포함할 수 있다. 이 경우, 상기 음성 품질 정보는, 상기 특징 정보와 상기 기등록된 특징 정보의 항목 별 일치율에 대한 정보를 포함할 수 있다.The feature information may include at least one of pitch, resonance, intensity, and speed of a voice. In this case, the voice quality information may include information on a matching rate for each item of the feature information and the previously registered feature information.

상기 규칙 정보는, 기설정된 스크립트의 발화 여부, 기설정된 스크립트의 발화 횟수, 및 복수의 스크립트의 발화 순서 중 적어도 하나와 관련된 항목을 하나 이상 포함할 수 있다. 이 경우, 상기 텍스트 품질 정보는, 상기 규칙 정보와 상기 기등록된 규칙 정보의 항목 별 일치율에 대한 정보를 포함할 수 있다.The rule information may include one or more items related to at least one of whether a preset script is uttered, a preset number of utterances of the script, and an utterance order of a plurality of scripts. In this case, the text quality information may include information on a matching rate for each item of the rule information and the previously registered rule information.

한편, 상기 전자 장치의 상담 품질 평가 방법은, 상기 상담사 단말을 통해 상담사의 제1 발화 음성이 수신되면, 상기 제1 발화 음성이 변환된 제1 텍스트를 상기 상담사 단말을 통해 표시하는 단계, 상기 제1 발화 음성을 기반으로 제1 음성 품질 정보를 생성하는 단계, 상기 제1 텍스트를 기반으로 제1 텍스트 품질 정보를 생성하는 단계, 상기 상담사 단말을 통해, 상기 제1 음성 품질 정보 및 상기 제1 텍스트 품질 정보를 표시하는 단계, 상기 제1 텍스트가 표시된 상태에서, 상기 상담사 단말을 통해 상담사의 제2 발화 음성이 수신되면, 상기 상담사 단말을 통해 상기 제2 발화 음성이 변환된 제2 텍스트를 상기 제1 텍스트와 함께 표시하는 단계, 상기 제1 발화 음성 및 상기 제2 발화 음성을 기반으로 제2 음성 품질 정보를 생성하는 단계, 상기 제1 텍스트 및 상기 제2 텍스트를 기반으로 제2 텍스트 품질 정보를 생성하는 단계, 상기 상담사 단말을 통해, 상기 제2 음성 품질 정보 및 상기 제2 텍스트 품질 정보를 표시하는 단계를 포함할 수 있다.On the other hand, the method for evaluating the counseling quality of the electronic device includes: when a first spoken voice of a counselor is received through the counselor terminal, displaying a first text converted from the first spoken voice through the counselor terminal; Generating first voice quality information based on a first spoken voice, generating first text quality information based on the first text, through the counselor terminal, the first voice quality information and the first text displaying quality information; when the counselor's second spoken voice is received through the counselor's terminal in a state in which the first text is displayed, the second text converted from the second spoken's voice through the counselor's terminal is displayed as the second text Displaying together with 1 text, generating second speech quality information based on the first spoken voice and the second spoken voice, and generating second text quality information based on the first text and the second text generating, and displaying the second voice quality information and the second text quality information through the counselor terminal.

이 경우, 상기 전자 장치의 상담 품질 평가 방법은, 상기 제1 음성 품질 정보 및 상기 제2 음성 품질 정보를 비교하여, 상기 상담사의 상담 품질이 실시간으로 개선되고 있는지 여부를 판단하는 단계, 상기 상담사의 상담 품질이 실시간으로 개선되고 있는 것으로 판단된 경우, 이후 입력되는 상기 상담사의 발화 음성을 제1 주기로 획득하여 주기 별로 음성 품질 정보를 업데이트하는 단계, 상기 상담사의 상담 품질이 실시간으로 개선되고 있지 않은 것으로 판단된 경우, 이후 입력되는 상기 상담사의 발화 음성을 상기 제1 주기보다 짧은 제2 주기로 획득하여 주기 별로 음성 품질 정보를 업데이트하는 단계를 포함할 수 있다.In this case, the method for evaluating the counseling quality of the electronic device includes comparing the first voice quality information and the second voice quality information to determine whether the counseling quality of the counselor is improving in real time; When it is determined that the consultation quality is being improved in real time, acquiring the uttered voice of the counselor input thereafter in a first cycle and updating the voice quality information for each cycle, it is determined that the counseling quality of the counselor is not improving in real time If it is determined, the method may include acquiring the voice of the counselor's utterance input thereafter with a second period shorter than the first period, and updating the voice quality information for each period.

본 개시에 따른 전자 장치의 상담 품질 평가 방법은, 상담사의 단말을 통해 고객과의 음성 상담을 제공할 뿐만 아니라, 현재 수행 중인 상담에 대한 실시간 평가 정보를 제공할 수 있다.The method for evaluating the counseling quality of an electronic device according to the present disclosure may provide not only voice counseling with a customer through a counselor's terminal, but also real-time evaluation information on counseling currently being performed.

즉, 본 개시에 따른 전자 장치의 상담 품질 평가 방법은, 각 상담사로 하여금 본인의 상담 진행 중 실시간으로 우수 사례와의 비교를 통한 본인의 문제점을 즉시 피드백 받고 개선할 수 있도록 하여, 상담 관리자의 사후 평가 또는 교육 없이도 상담사가 실시간으로 자기 주도적 품질 개선을 성취할 수 있는 환경을 제공할 수 있으며, 이를 통해 비대면 환경 내에서의 상담 품질 관리 및 전수 분석이 수행되도록 지원할 수 있다. 이를 통해, 별도의 랜덤콜 분석이나 상담사 교육 등에 소요되는 인적, 시간적 비용을 최소화하여 관리 인건비 절감과 비대면 기반 자기 주도적 상담 품질 관리가 가능하게 된다.That is, the counseling quality evaluation method of the electronic device according to the present disclosure enables each counselor to immediately receive feedback on his/her problem through comparison with best practices in real time during his/her counseling and improve it, It can provide an environment in which counselors can achieve self-directed quality improvement in real time without evaluation or training, and this can support counseling quality management and holistic analysis within a non-face-to-face environment. Through this, it is possible to minimize the human and time costs required for separate random call analysis or counselor training, thereby reducing administrative labor costs and enabling non-face-to-face self-directed counseling quality management.

도 1은 본 개시의 일 실시 예에 따라 고객 단말 및 상담사 단말과 통신을 수행하는 전자 장치의 동작을 설명하기 위한 도면,
도 2는 본 개시의 일 실시 예에 따른 전자 장치의 상담 품질 평가 방법을 설명하기 위한 흐름도,
도 3은 본 개시의 일 실시 예에 따른 전자 장치가 상담사 단말을 통해 실시간으로 평가 정보(품질 정보)를 제공하는 동작을 설명하기 위한 도면,
도 4는 본 개시의 일 실시 예에 따른 전자 장치의 구성을 설명하기 위한 블록도, 그리고
도 5는 본 개시의 일 실시 예에 따른 전자 장치의 기능적 구성 및 동작을 설명하기 위한 블록도이다.1 is a view for explaining an operation of an electronic device performing communication with a customer terminal and a counselor terminal according to an embodiment of the present disclosure;
2 is a flowchart illustrating a method for evaluating consultation quality of an electronic device according to an embodiment of the present disclosure;
3 is a diagram for explaining an operation in which an electronic device provides evaluation information (quality information) in real time through a counselor terminal according to an embodiment of the present disclosure;
4 is a block diagram illustrating a configuration of an electronic device according to an embodiment of the present disclosure; and
5 is a block diagram illustrating a functional configuration and operation of an electronic device according to an embodiment of the present disclosure.

본 개시에 대하여 구체적으로 설명하기에 앞서, 본 명세서 및 도면의 기재 방법에 대하여 설명한다.Prior to describing the present disclosure in detail, a description will be given of the description of the present specification and drawings.

먼저, 본 명세서 및 청구범위에서 사용되는 용어는 본 개시의 다양한 실시 예들에서의 기능을 고려하여 일반적인 용어들을 선택하였다. 하지만, 이러한 용어들은 당해 기술 분야에 종사하는 기술자의 의도나 법률적 또는 기술적 해석 및 새로운 기술의 출현 등에 따라 달라질 수 있다. 또한, 일부 용어는 출원인이 임의로 선정한 용어도 있다. 이러한 용어에 대해서는 본 명세서에서 정의된 의미로 해석될 수 있으며, 구체적인 용어 정의가 없으면 본 명세서의 전반적인 내용 및 당해 기술 분야의 통상적인 기술 상식을 토대로 해석될 수도 있다. First, terms used in the present specification and claims have been selected in consideration of functions in various embodiments of the present disclosure. However, these terms may vary depending on the intention or legal or technical interpretation of a person skilled in the art, and the emergence of new technology. Also, some terms are arbitrarily selected by the applicant. These terms may be interpreted in the meaning defined in the present specification, and if there is no specific term definition, it may be interpreted based on the general content of the present specification and common technical knowledge in the art.

또한, 본 명세서에 첨부된 각 도면에 기재된 동일한 참조번호 또는 부호는 실질적으로 동일한 기능을 수행하는 부품 또는 구성요소를 나타낸다. 설명 및 이해의 편의를 위해서 서로 다른 실시 예들에서도 동일한 참조번호 또는 부호를 사용하여 설명한다. 즉, 복수의 도면에서 동일한 참조 번호를 가지는 구성요소를 모두 도시되어 있다고 하더라도, 복수의 도면들이 하나의 실시 예를 의미하는 것은 아니다. Also, the same reference numerals or reference numerals in each drawing attached to this specification indicate parts or components that perform substantially the same functions. For convenience of description and understanding, the same reference numerals or reference numerals are used in different embodiments. That is, even though all components having the same reference number are illustrated in a plurality of drawings, the plurality of drawings do not mean one embodiment.

또한, 본 명세서 및 청구범위에서는 구성요소들 간의 구별을 위하여 "제1", "제2" 등과 같이 서수를 포함하는 용어가 사용될 수 있다. 이러한 서수는 동일 또는 유사한 구성요소들을 서로 구별하기 위하여 사용하는 것이며 이러한 서수 사용으로 인하여 용어의 의미가 한정 해석되어서는 안 된다. 일 예로, 이러한 서수와 결합된 구성요소는 그 숫자에 의해 사용 순서나 배치 순서 등이 제한되어서는 안 된다. 필요에 따라서는, 각 서수들은 서로 교체되어 사용될 수도 있다. In addition, in this specification and claims, terms including an ordinal number, such as "first" and "second", may be used to distinguish between elements. This ordinal number is used to distinguish the same or similar elements from each other, and the meaning of the term should not be construed as limited due to the use of the ordinal number. For example, the components combined with such an ordinal number should not be limited in the order of use or arrangement by the number. If necessary, each ordinal number may be used interchangeably.

본 명세서에서 단수의 표현은 문맥상 명백하게 다르게 뜻하지 않는 한, 복수의 표현을 포함한다. 본 출원에서, "포함하다" 또는 "구성되다" 등의 용어는 명세서상에 기재된 특징, 숫자, 단계, 동작, 구성요소, 부품 또는 이들을 조합한 것이 존재함을 지정하려는 것이지, 하나 또는 그 이상의 다른 특징들이나 숫자, 단계, 동작, 구성요소, 부품 또는 이들을 조합한 것들의 존재 또는 부가 가능성을 미리 배제하지 않는 것으로 이해되어야 한다.In this specification, the singular expression includes the plural expression unless the context clearly dictates otherwise. In the present application, terms such as "comprises" or "consisting of" are intended to designate that the features, numbers, steps, operations, components, parts, or combinations thereof described in the specification exist, and are intended to indicate that one or more other It is to be understood that this does not preclude the possibility of addition or presence of features or numbers, steps, operations, components, parts, or combinations thereof.

본 개시의 실시 예에서 "모듈", "유닛", "부(part)" 등과 같은 용어는 적어도 하나의 기능이나 동작을 수행하는 구성요소를 지칭하기 위한 용어이며, 이러한 구성요소는 하드웨어 또는 소프트웨어로 구현되거나 하드웨어 및 소프트웨어의 결합으로 구현될 수 있다. 또한, 복수의 "모듈", "유닛", "부(part)" 등은 각각이 개별적인 특정한 하드웨어로 구현될 필요가 있는 경우를 제외하고는, 적어도 하나의 모듈이나 칩으로 일체화되어 적어도 하나의 프로세서로 구현될 수 있다.In an embodiment of the present disclosure, terms such as “module”, “unit”, “part”, etc. are terms for designating a component that performs at least one function or operation, and such component is hardware or software. It may be implemented or implemented as a combination of hardware and software. In addition, a plurality of "modules", "units", "parts", etc. are integrated into at least one module or chip, except when each needs to be implemented as individual specific hardware, and thus at least one processor. can be implemented as

또한, 본 개시의 실시 예에서, 어떤 부분이 다른 부분과 연결되어 있다고 할 때, 이는 직접적인 연결뿐 아니라, 다른 매체를 통한 간접적인 연결의 경우도 포함한다. 또한, 어떤 부분이 어떤 구성요소를 포함한다는 의미는, 특별히 반대되는 기재가 없는 한 다른 구성요소를 제외하는 것이 아니라 다른 구성요소를 더 포함할 수 있는 것을 의미한다.In addition, in an embodiment of the present disclosure, when a part is connected to another part, this includes not only direct connection but also indirect connection through another medium. In addition, the meaning that a certain part includes a certain component means that other components may be further included without excluding other components unless otherwise stated.

도 1은 본 개시의 일 실시 예에 따라 고객 단말 및 상담사 단말과 통신을 수행하는 전자 장치의 동작을 설명하기 위한 도면이다.1 is a diagram for explaining an operation of an electronic device that communicates with a customer terminal and a counselor terminal according to an embodiment of the present disclosure.

도 1을 참조하면, 전자 장치(100)는 고객 단말(200), 상담사 단말(300)과 각각 통신을 수행할 수 있다.Referring to FIG. 1 , the electronic device 100 may perform communication with a customer terminal 200 and a counselor terminal 300 , respectively.

일 실시 예로, 전자 장치(100)는 서버로 구현될 수 있다. 서버는, 하나 이상의 컴퓨터를 포함하는 시스템으로 구성될 수 있다. 전자 장치(100)는 상담 시스템을 운영하기 위한 서버에 해당할 수도 있다.As an embodiment, the electronic device 100 may be implemented as a server. The server may be configured as a system including one or more computers. The electronic device 100 may correspond to a server for operating a consultation system.

이 밖에, 전자 장치(100)는 데스크탑 PC, 스마트폰, 태블릿 PC 등 다양한 전자 기기로 구현될 수도 있다.In addition, the electronic device 100 may be implemented as various electronic devices such as a desktop PC, a smart phone, and a tablet PC.

고객 단말(200)은 상담을 받는 고객의 단말 장치에 해당한다. 고객 단말(200)은 스마트폰, 태블릿 PC, 웨어러블 장치, VR/AR 기기, 노트북 PC, 데스크탑 PC 등 다양한 단말 기기에 해당할 수 있다. The customer terminal 200 corresponds to a terminal device of a customer receiving a consultation. The customer terminal 200 may correspond to various terminal devices such as a smart phone, a tablet PC, a wearable device, a VR/AR device, a notebook PC, and a desktop PC.

고객 단말(200)은 음성 통화를 수행하기 위한 오디오 입력부(ex. 마이크) 및/또는 오디오 출력부(ex. 스피커, 이어폰/헤드폰 단자, 무선 이어폰/헤드폰과 통신을 수행하기 위한 통신부)를 포함할 수 있다. 또한, 고객 단말(200)은 별도의 오디오 입력 장치 및/또는 오디오 출력 장치와 연결될 수도 있다.The customer terminal 200 may include an audio input unit (ex. a microphone) and/or an audio output unit (ex. a speaker, an earphone/headphone terminal, a communication unit for communicating with a wireless earphone/headphone) for performing a voice call. can Also, the customer terminal 200 may be connected to a separate audio input device and/or an audio output device.

상담사 단말(300)은 상담을 제공하는 상담사의 단말 장치에 해당한다. 상담사 단말(300)은 스마트폰, 태블릿 PC, 웨어러블 장치, VR/AR 기기, 노트북 PC, 데스크탑 PC 등 다양한 단말 기기에 해당할 수 있다.The counselor terminal 300 corresponds to a counselor's terminal device that provides counseling. The counselor terminal 300 may correspond to various terminal devices, such as a smart phone, a tablet PC, a wearable device, a VR/AR device, a notebook PC, and a desktop PC.

상담사 단말(300)은 음성 통화를 수행하기 위한 오디오 입력부(ex. 마이크) 및/또는 오디오 출력부(ex. 스피커, 이어폰/헤드폰 단자, 무선 이어폰/헤드폰과 통신을 수행하기 위한 통신부)를 포함할 수 있다. 또한, 상담사 단말(300)은 별도의 오디오 입력 장치 및/또는 오디오 출력 장치와 연결될 수도 있다.The counselor terminal 300 may include an audio input unit (eg, a microphone) and/or an audio output unit (eg, a speaker, an earphone/headphone terminal, and a communication unit for communicating with a wireless earphone/headphone) for performing a voice call. can Also, the counselor terminal 300 may be connected to a separate audio input device and/or an audio output device.

전자 장치(100)는 고객 단말(200)과 상담사 단말(300) 간의 전화 상담을 위한 음성 통화를 제공할 수 있다. The electronic device 100 may provide a voice call for phone consultation between the customer terminal 200 and the counselor terminal 300 .

이 경우, 전자 장치(100)는 고객 단말(200)을 통해 입력된 고객의 발화 음성을 오디오 데이터로 변환하여 상담사 단말(300)로 전송할 수 있다. 마찬가지로, 전자 장치(100)는 상담사 단말(300)을 통해 입력된 상담사의 발화 음성을 오디오 데이터로 변환하여 고객 단말(300)로 전송할 수 있다.In this case, the electronic device 100 may convert the customer's uttered voice input through the customer terminal 200 into audio data and transmit it to the counselor terminal 300 . Similarly, the electronic device 100 may convert the counselor's utterance voice input through the counselor terminal 300 into audio data and transmit it to the customer terminal 300 .

그리고, 전자 장치(100)는 음성 통화가 제공되는 동안 상담사의 상담 내용을 분석하여 상담에 대한 품질 정보를 생성할 수 있는 바, 이하 도면들을 통해 보다 상세하게 설명한다.In addition, the electronic device 100 may generate quality information about the consultation by analyzing the counselor's consultation content while the voice call is being provided, which will be described in more detail below with reference to the drawings.

도 2는 본 개시의 일 실시 예에 따른 전자 장치의 상담 품질 평가 방법을 설명하기 위한 흐름도이다.2 is a flowchart illustrating a method for evaluating consultation quality of an electronic device according to an embodiment of the present disclosure.

도 2를 참조하면, 전자 장치(100)는 고객 단말(200)과 상담사 단말(300) 간의 음성 통화를 제공할 수 있으며(S210), 음성 통화가 제공되는 동안 상담사 단말(300)을 통해 입력되는 상담사의 발화 음성을 기반으로, 오디오 데이터를 획득할 수 있다(S220).Referring to FIG. 2 , the electronic device 100 may provide a voice call between the customer terminal 200 and the counselor terminal 300 ( S210 ), and is input through the counselor terminal 300 while the voice call is provided. Audio data may be acquired based on the counselor's spoken voice (S220).

여기서, 전자 장치(100)는 오디오 데이터의 특징 정보를 기저장된 특징 정보와 비교하여 음성 품질 정보를 생성할 수 있다(S230).Here, the electronic device 100 may generate voice quality information by comparing the characteristic information of the audio data with the previously stored characteristic information ( S230 ).

특징 정보는, 음성의 높낮이(pitch), 공명(Formant Frequency), 강도(Intensity), 및 빠르기(Speed) 중 적어도 하나의 항목에 대한 값을 포함할 수 있다.The feature information may include a value for at least one of pitch, resonance (Formant Frequency), intensity, and speed (Speed) of a voice.

기저장된 특징 정보는, 과거 상담 사례에 해당하는 적어도 한 명의 상담사의 발화 음성으로부터 추출된 특징 정보일 수 있다. 예를 들어, 기저장된 특징 정보는, 우수 상담 사례의 상담사의 발화 음성으로부터 추출된 특징 정보(ex. 높낮이, 공명, 강도, 빠르기 등)일 수 있으나, 이에 한정되지 않는다.The pre-stored feature information may be feature information extracted from the speech voice of at least one counselor corresponding to the past counseling case. For example, the pre-stored feature information may be feature information (eg, height, resonance, intensity, speed, etc.) extracted from the utterance voice of a counselor in an excellent counseling case, but is not limited thereto.

음성 품질 정보는, 상담사의 발화 음성의 품질을 나타내는 정보이다. 전자 장치(100)는 오디오 데이터의 특징 정보를 기저장된 특징 정보와 항목 별로 비교하여 일치율을 획득하고, 항목 별 일치율에 대한 정보를 포함하는 음성 품질 정보를 생성할 수 있다.The voice quality information is information indicating the quality of the counselor's spoken voice. The electronic device 100 may compare feature information of audio data with pre-stored feature information for each item to obtain a match rate, and generate voice quality information including information on the match rate for each item.

전자 장치(100)는 적어도 하나의 인공지능 모델을 활용하여 오디오 데이터의 특징 정보를 추출 및 비교할 수 있다.The electronic device 100 may extract and compare feature information of audio data by using at least one artificial intelligence model.

관련하여, 전자 장치(100)는 오디오 데이터의 특징 정보를 추출하여 일치율을 판단하기 위한 제1 인공지능 모델을 생성할 수 있다.In relation to this, the electronic device 100 may generate a first artificial intelligence model for determining a match rate by extracting feature information of the audio data.

여기서, 전자 장치(100)는 과거 상담 사례를 기반으로 우수 상담사의 발화 음성에 매칭되는 오디오 데이터 및 열위 상담사의 발화 음성에 매칭되는 오디오 데이터를 추출할 수 있으며, 추출된 오디오 데이터를 활용하여 제1 인공지능 모델을 훈련시킬 수 있다.Here, the electronic device 100 may extract audio data matching the uttered voice of the excellent counselor and the uttered voice of the inferior counselor based on past counseling cases, and may extract audio data matching the uttered voice of the inferior counselor, and utilize the extracted audio data to extract the first You can train artificial intelligence models.

일 예로, 제1 인공지능 모델은 신경망 모델로 구현될 수 있다. 이 경우, 제1 인공지능 모델은 서로 다른 레이어에 포함된 노드 간의 가중치에 따라 업데이트되는 방식으로 훈련될 수 있다. 예를 들어, 제1 인공지능 모델은, 오디오 데이터를 입력 받아 특징 정보를 추출하기 위한 적어도 하나의 레이어를 포함할 수 있으며, 서로 다른 오디오 데이터 간의 특징 정보를 비교하여 일치율을 산출하기 위한 적어도 하나의 레이어를 포함할 수 있다.As an example, the first artificial intelligence model may be implemented as a neural network model. In this case, the first AI model may be trained in such a way that it is updated according to weights between nodes included in different layers. For example, the first AI model may include at least one layer for receiving audio data and extracting feature information, and at least one layer for calculating a matching rate by comparing feature information between different audio data. It can contain layers.

이렇듯 제1 인공지능 모델이 훈련된 상태에서, 전자 장치(100)는 S220 단계를 통해 획득된 오디오 데이터를 제1 인공지능 모델에 입력할 수 있다.In this way, in a state in which the first artificial intelligence model is trained, the electronic device 100 may input the audio data obtained in step S220 to the first artificial intelligence model.

이 경우, 제1 인공지능 모델은 오디오 데이터의 특징 정보를 추출할 수 있으며, 추출된 특징 정보를 기저장된 특징 정보(ex. 우수 상담사의 발화 음성의 특징 정보) 비교하여 항목 별로 일치율을 출력할 수 있다.In this case, the first artificial intelligence model can extract characteristic information of the audio data, and compare the extracted characteristic information with pre-stored characteristic information (eg, characteristic information of an excellent counselor's spoken voice) to output a matching rate for each item. have.

한편, 전자 장치(100)는 상담사의 발화 음성에 해당하는 오디오 데이터를 변환하여 텍스트를 획득할 수 있다(S240). 이 경우, 전자 장치(100)는 STT(Speech to Text) 변환을 수행하기 위한 적어도 하나의 엔진을 사용할 수 있다.Meanwhile, the electronic device 100 may obtain text by converting audio data corresponding to the counselor's uttered voice ( S240 ). In this case, the electronic device 100 may use at least one engine for performing speech to text (STT) conversion.

여기서, 전자 장치(100)는 텍스트를 분석하여 텍스트의 규칙 정보를 획득하고, 획득된 규칙 정보를 기저장된 규칙 정보와 비교하여 텍스트 품질 정보를 생성할 수 있다(S250).Here, the electronic device 100 may analyze the text to obtain rule information of the text, and compare the obtained rule information with pre-stored rule information to generate text quality information ( S250 ).

규칙 정보는, 기설정된 스크립트의 발화 여부, 기설정된 스크립트의 발화 횟수, 및 복수의 스크립트의 발화 순서 중 적어도 하나와 관련된 항목을 하나 이상 포함할 수 있다.The rule information may include one or more items related to at least one of whether a preset script is uttered, the number of utterances of the preset script, and an utterance order of a plurality of scripts.

여기서, 스크립트는, 상담 내용 전체에 해당하는 텍스트일 수 있고, 상담 내용을 구성하는 적어도 하나의 단어 또는 문장에 해당하는 텍스트일 수도 있다.Here, the script may be text corresponding to the entire consultation content, or text corresponding to at least one word or sentence constituting the consultation content.

예를 들어, 규칙 정보는, 인사말의 유무/수, 양해어의 유무/수, 지양어의 유무/수, 대기 멘트의 유무/수 등 다양한 항목에 대한 값을 포함할 수 있으나, 이에 한정되지 않는다.For example, the rule information may include, but is not limited to, values for various items such as presence/number of greetings, presence/number of understanding words, presence/number of non-repudiation words, and presence/number of waiting comments. .

기저장된 규칙 정보는, 과거 상담 사례에 해당하는 적어도 한 명의 상담사의 발화 음성이 변환된 텍스트로부터 추출된 규칙 정보일 수 있다. 예를 들어, 기저장된 규칙 정보는, 우수 상담 사례의 상담사의 발화 음성이 변환된 텍스트의 규칙 정보일 수 있으나, 이에 한정되지 않는다.The pre-stored rule information may be rule information extracted from a text in which the speech voice of at least one counselor corresponding to the past counseling case is converted. For example, the pre-stored rule information may be rule information of a text in which the speech voice of a counselor of an excellent counseling case is converted, but is not limited thereto.

텍스트 품질 정보는, 상담사의 발화 음성이 변환된 텍스트(: 상담 내용)의 품질을 나타내는 정보이다. 전자 장치(100)는 오디오 데이터가 변환된 텍스트의 규칙 정보를 기저장된 규칙 정보(ex. 우수 상담사의 텍스트의 규칙 정보)와 항목 별로 비교하여 일치율을 획득하고, 항목 별 일치율에 대한 정보를 포함하는 텍스트 품질 정보를 생성할 수 있다.The text quality information is information indicating the quality of text (consultation content) in which the counselor's spoken voice is converted. The electronic device 100 compares rule information of text converted from audio data with pre-stored rule information (eg, rule information of text of excellent counselors) for each item to obtain a match rate, and includes information on the match rate for each item. You can generate text quality information.

전자 장치(100)는 적어도 하나의 인공지능 모델을 활용하여 텍스트의 규칙 정보를 추출 및 비교할 수 있다.The electronic device 100 may extract and compare rule information of the text by using at least one artificial intelligence model.

관련하여, 전자 장치(100)는 오디오 데이터의 특징 정보를 추출하여 일치율을 판단하기 위한 제2 인공지능 모델을 생성할 수 있다.In relation to this, the electronic device 100 may generate a second artificial intelligence model for determining a match rate by extracting feature information of the audio data.

여기서, 전자 장치(100)는 과거 상담 사례를 기반으로 우수 상담사의 발화 음성이 변환된 텍스트 및 열위 상담사의 발화 음성이 변환된 텍스트를 추출할 수 있으며, 추출된 텍스트를 활용하여 제2 인공지능 모델을 훈련시킬 수 있다.Here, the electronic device 100 may extract the text in which the speech voice of the excellent counselor is converted and the text in which the speech voice of the inferior counselor is converted based on the past counseling case, and the second artificial intelligence model is used by using the extracted text. can be trained

예를 들어, 제2 인공지능 모델은, 텍스트를 벡터 형태로 변환하기 위한 적어도 하나의 모듈, 변환된 벡터의 의미를 단어 또는 문장 단위로 이해하기 위한 적어도 하나의 모듈, 단어 또는 문장의 존재 여부/수에 따라 텍스트의 규칙 정보를 획득하기 위한 적어도 하나의 모듈 등을 포함할 수 있으나, 이에 한정되지 않는다.For example, the second artificial intelligence model includes at least one module for converting text into a vector form, at least one module for understanding the meaning of the converted vector in word or sentence units, whether a word or sentence exists / It may include at least one module for acquiring rule information of text according to the number, but is not limited thereto.

이렇듯 제2 인공지능 모델이 훈련된 상태에서, 전자 장치(100)는 S240 단계를 통해 획득된 텍스트를 제2 인공지능 모델에 입력할 수 있다.In this way, in a state in which the second artificial intelligence model is trained, the electronic device 100 may input the text obtained in step S240 into the second artificial intelligence model.

이 경우, 제2 인공지능 모델은 입력된 텍스트의 규칙 정보를 추출할 수 있으며, 추출된 규칙 정보를 기저장된 규칙 정보(ex. 우수 상담사의 발화 음성의 규칙 정보)와 비교하여 항목 별로 일치율을 출력할 수 있다.In this case, the second artificial intelligence model can extract rule information from the input text, and compare the extracted rule information with pre-stored rule information (eg, rule information of an excellent counselor's utterance) and output the matching rate for each item. can do.

상술한 바와 같이 음성 품질 정보 및 텍스트 품질 정보가 생성되면, 전자 장치(100)는 상담사 단말(300)을 통해 음성 품질 정보 및 텍스트 품질 정보를 각각 제공할 수 있다(S260). When the voice quality information and the text quality information are generated as described above, the electronic device 100 may provide the voice quality information and the text quality information through the counselor terminal 300, respectively (S260).

구체적으로, 고객 단말(200)과 상담사 단말(300) 간의 음성 통화가 제공되는 동안, 전자 장치(100)는 상담사 단말(300)을 통해 음성 품질 정보 및 텍스트 품질 정보를 표시할 수 있다. 예를 들어, 음성 품질 정보를 구성하는 항목 별 일치율 및 텍스트 품질 정보를 구성하는 항목 별 일치율이 각각 상담사 단말(300)의 화면 상에 디스플레이 될 수 있다.Specifically, while a voice call between the customer terminal 200 and the counselor terminal 300 is provided, the electronic device 100 may display voice quality information and text quality information through the counselor terminal 300 . For example, the matching rate for each item constituting the voice quality information and the matching rate for each item constituting the text quality information may be displayed on the screen of the counselor terminal 300 , respectively.

관련하여, 도 3은 본 개시의 일 실시 예에 따른 전자 장치가 상담사 단말을 통해 실시간으로 평가 정보(품질 정보)를 제공하는 동작을 설명하기 위한 도면이다.3 is a diagram for explaining an operation in which an electronic device provides evaluation information (quality information) in real time through a counselor terminal according to an embodiment of the present disclosure.

도 3은 고객 단말(200)과 상담사 단말(300) 간의 음성 통화가 진행되고 있는 중의 상담사 단말(300)의 화면이 도시된 것이다. 도 3을 참조하면, 전자 장치(100)는 상담 내용을 구성하는 텍스트(301, 302, 303, 304), 실시간으로 음성 품질 정보를 나타내기 위한 UI(User Interface. 310), 실시간으로 텍스트 품질 정보를 나타내기 위한 UI(320) 등을 표시할 수 있다.3 illustrates a screen of the counselor terminal 300 while a voice call between the customer terminal 200 and the counselor terminal 300 is in progress. Referring to FIG. 3 , the electronic device 100 includes texts 301 , 302 , 303 , and 304 constituting consultation contents, a user interface (UI) 310 for displaying voice quality information in real time, and text quality information in real time. A UI 320 for representing , etc. may be displayed.

구체적으로, 고객 단말(200)을 통해 입력된 고객의 발화 음성 및 상담사 단말(300)을 통해 입력된 상담사의 발화 음성이 각각 텍스트로 변환되어 표시될 수 있다(도 3의 301, 302, 303, 304).Specifically, the customer's uttered voice input through the customer terminal 200 and the counselor's uttered voice input through the counselor terminal 300 may be converted into text and displayed (301, 302, 303 in FIG. 3). 304).

예를 들어, 상담사 단말(300)을 통해 상담사의 제1 발화 음성(ex. “아 네 안녕하세요”)이 수신되면, 전자 장치(100)는 제1 발화 음성이 변환된 제1 텍스트(302)를 상담사 단말(300)을 통해 표시할 수 있다.For example, when the counselor's first spoken voice (eg, “Oh yes, hello”) is received through the counselor terminal 300 , the electronic device 100 transmits the first text 302 in which the first spoken voice is converted. It may be displayed through the counselor terminal 300 .

여기서, 전자 장치(100)는 제1 발화 음성을 기반으로 제1 음성 품질 정보를 생성할 수 있다. Here, the electronic device 100 may generate the first voice quality information based on the first spoken voice.

구체적으로, 전자 장치(100)는 제1 발화 음성을 제1 인공지능 모델에 입력하여 특징 정보를 추출하고, 추출된 특징 정보를 기저장된 특징 정보(ex. 우수 상담사의 발화 음성의 특징 정보)와 비교하여 항목 별 일치율을 산출할 수 있다. 그 결과, 높낮이, 공명, 강도, 빠르기 등 각 항목에 대한 일치율을 포함하는 제1 음성 품질 정보가 상담사 단말(300)의 UI(310) 내에 표시될 수 있다.Specifically, the electronic device 100 inputs the first spoken voice to the first artificial intelligence model to extract feature information, and combines the extracted feature information with pre-stored feature information (eg, feature information of an excellent counselor's speech voice) and By comparison, the matching rate for each item can be calculated. As a result, the first voice quality information including the matching rate for each item, such as height, resonance, intensity, and speed, may be displayed in the UI 310 of the counselor terminal 300 .

또한, 전자 장치(100)는 제1 발화 음성이 변환된 텍스트(302)를 기반으로 제1 텍스트 품질 정보를 생성할 수 있다.Also, the electronic device 100 may generate the first text quality information based on the text 302 in which the first spoken voice is converted.

구체적으로, 전자 장치(100)는 텍스트(302)를 제2 인공지능 모델에 입력하여 규칙 정보를 추출하고, 추출된 규칙 정보를 기저장된 규칙 정보(ex. 우수 상담사의 발화 음성이 변환된 텍스트의 규칙 정보)와 비교하여 항목 별 일치율을 산출할 수 있다. 그 결과, 인사말의 수, 양해어의 수, 지양어의 수, 대기 멘트의 수 등 각 항목에 대한 일치율을 포함하는 제1 텍스트 품질 정보가 상담사 단말(300)의 UI(320) 내에 표시될 수 있다.Specifically, the electronic device 100 inputs the text 302 into the second artificial intelligence model to extract rule information, and uses the extracted rule information as pre-stored rule information (eg, of the text in which the excellent counselor's speech is converted). rule information) to calculate the matching rate for each item. As a result, the first text quality information including the matching rate for each item, such as the number of greetings, the number of understanding words, the number of swear words, and the number of waiting comments, may be displayed in the UI 320 of the counselor terminal 300. have.

이후, 상담사 단말(300)을 통해 상담사의 제2 발화 음성(ex. “네네”)이 수신되면, 전자 장치(100)는 제2 발화 음성이 변환된 제2 텍스트(304)를 상담사 단말(300)을 통해 표시할 수 있다.Thereafter, when the counselor's second spoken voice (eg, “nene”) is received through the counselor terminal 300 , the electronic device 100 transmits the second text 304 in which the second spoken voice is converted to the counselor terminal 300 . ) can be indicated by

이 경우, 전자 장치(100)는 제1 발화 음성 및 제2 발화 음성을 기반으로 제2 음성 품질 정보를 생성할 수 있다. In this case, the electronic device 100 may generate second voice quality information based on the first uttered voice and the second uttered voice.

예를 들어, 제1 발화 음성 및 제2 발화 음성을 모두 포함하는 오디오 데이터가 제1 인공지능 모델에 입력된 결과, 제2 음성 품질 정보가 생성될 수 있다.For example, as a result of inputting audio data including both the first uttered voice and the second uttered voice to the first artificial intelligence model, second voice quality information may be generated.

다른 예로, 제1 발화 음성에 대해 생성된 제1 음성 품질 정보, 및 제2 발화 음성에 대해 생성된 음성 품질 정보가 조합된 결과 제2 음성 품질 정보가 생성될 수도 있다.As another example, the second voice quality information may be generated as a result of combining the first voice quality information generated for the first uttered voice and the voice quality information generated for the second uttered voice.

즉, 현재까지 제공된 상담사의 모든 발화 음성이 반영된 제2 음성 품질 정보가 생성되어, UI(310)를 통해 제공될 수 있다.That is, the second voice quality information in which all spoken voices of the counselor provided so far are reflected may be generated and provided through the UI 310 .

마찬가지로, 전자 장치(100)는 제2 발화 음성이 변환된 제2 텍스트(304) 및 상술한 제1 텍스트(302)를 기반으로, 제2 텍스트 품질 정보를 생성할 수 있다.Similarly, the electronic device 100 may generate the second text quality information based on the second text 304 from which the second spoken voice is converted and the above-described first text 302 .

예를 들어, 제1 텍스트(302) 및 제2 텍스트(302)를 모두 포함하는 텍스트가 제2 인공지능 모델에 입력된 결과, 제2 텍스트 품질 정보가 생성될 수 있다.For example, as a result of inputting text including both the first text 302 and the second text 302 into the second artificial intelligence model, second text quality information may be generated.

다른 예로, 제1 텍스트(302)에 대하여 생성된 제1 텍스트 품질 정보, 및 제2 텍스트(304)에 대하여 생성된 텍스트 품질 정보가 조합된 결과 제2 텍스트 품질 정보가 생성될 수도 있다.As another example, the second text quality information may be generated as a result of combining the first text quality information generated with respect to the first text 302 and the text quality information generated with respect to the second text 304 .

이 경우, 현재까지 제공된 상담사의 모든 상담 내용(텍스트)이 반영된 제2 텍스트 품질 정보가 생성되어 UI(320)를 통해 제공될 수 있다.In this case, second text quality information in which all consultation contents (texts) of the counselor provided so far are reflected may be generated and provided through the UI 320 .

이렇듯 도 3의 실시 예에 따라, 상담이 진행되면서(새로운 발화 음성이 계속해서 입력됨) UI(310, 320) 상에 표시되는 품질 정보(음성 품질 정보, 텍스트 품질 정보)가 실시간으로 업데이트되는 바, 상담사는 실시간으로 제공되는 품질 정보를 확인할 수 있으며, 피드백 과정이 자연스럽게 진행될 수 있다.As such, according to the embodiment of FIG. 3 , the quality information (voice quality information, text quality information) displayed on the UIs 310 and 320 is updated in real time while the consultation is in progress (a new spoken voice is continuously input). , the counselor can check the quality information provided in real time, and the feedback process can proceed naturally.

한편, 전자 장치(100)는 실시간으로 업데이트되는 품질 정보를 모니터링하여, 상담사의 상담 품질이 실시간으로 개선되고 있는지 여부를 판단할 수 있다.Meanwhile, the electronic device 100 may monitor the quality information updated in real time to determine whether the counseling quality of the counselor is being improved in real time.

일 실시 예로, 상술한 도 3의 실시 예를 참조하면, 전자 장치(100)는 다른 시간대에 생성된 제1 음성 품질 정보와 제2 음성 품질 정보를 비교하여 상담사의 상담 품질이 실시간으로 개선되고 있는지 여부를 판단할 수 있다.As an example, referring to the above-described embodiment of FIG. 3 , the electronic device 100 compares the first voice quality information and the second voice quality information generated at different times to determine whether the counselor's consultation quality is improving in real time. can determine whether

예를 들어, 제1 음성 품질 정보와 제2 음성 품질 정보를 비교했을 때 항목 별 일치율이 점차 증가하는 경우, 상담 품질이 실시간으로 개선되고 있는 것으로 판단될 수 있다.For example, when the match rate for each item gradually increases when the first voice quality information and the second voice quality information are compared, it may be determined that the consultation quality is improving in real time.

만약, 상담사의 상담 품질이 실시간으로 개선되고 있는 것으로 판단된 경우, 전자 장치(100)는 이후 (새롭게) 입력되는 상담사의 발화 음성을 제1 주기로 획득하여 주기 별로 음성 품질 정보(ex. UI(310)에 표시되는 음성 품질 정보)를 업데이트할 수 있다.If it is determined that the counselor's counseling quality is improving in real time, the electronic device 100 acquires the counselor's uttered voice input later (newly) in the first cycle, and provides voice quality information (eg, UI 310 ) for each cycle. ) can be updated.

구체적으로, 전자 장치(100)는 제1 주기에 따라 구분되는 복수의 시간 구간 중 제1 시간 구간 동안 입력된 발화 음성을 분석하여 음성 품질 정보를 업데이트하고, 업데이트된 음성 품질 정보를 제공할 수 있다. 마찬가지로, 전자 장치(100)는 복수의 시간 구간 중 제1 시간 구간 다음의 제2 시간 구간 동안 입력된 발화 음성을 분석하여 다시 음성 품질 정보를 업데이트 및 제공할 수 있다.Specifically, the electronic device 100 may update the voice quality information by analyzing the spoken voice input during a first time period among a plurality of time periods divided according to the first period, and may provide the updated voice quality information. . Similarly, the electronic device 100 may update and provide voice quality information again by analyzing the spoken voice input during a second time section following the first time section among the plurality of time sections.

반면, 상담사의 상담 품질이 실시간으로 개선되고 있지 않은 것으로 판단된 경우, 전자 장치(100)는 이후 (새롭게) 입력되는 상담사의 발화 음성을 제1 주기보다 짧은 제2 주기로 더 자주 획득하여 주기 별로 음성 품질 정보(ex. UI(310)에 표시되는 음성 품질 정보)를 업데이트할 수 있다.On the other hand, when it is determined that the counselor's counseling quality is not improving in real time, the electronic device 100 acquires the counselor's uttered voice, which is input later (newly) more frequently in a second cycle shorter than the first cycle, Quality information (eg, voice quality information displayed on the UI 310) may be updated.

이 경우, 상담 품질의 개선이 더딜수록 품질 정보가 더 빨리 업데이트되므로, 상담사에게 더 강하게 피드백을 제공할 수 있다는 효과가 있다.In this case, the slower the improvement of the consultation quality, the faster the quality information is updated, so that it is possible to provide stronger feedback to the counselor.

다만, 더 짧은 제2 주기로 음성 품질 정보가 제공됨에도 상담 품질이 개선되지 않고 있는 것으로 판단되는 경우, 전자 장치(100)는 제1 주기보다 더 긴 제3 주기에 따라 발화 음성을 획득하여 주기 별로 음성 품질 정보를 업데이트할 수 있다. 즉, 강한 피드백의 효과가 없는 상담사에게는 오히려 느린 피드백이 제공될 수 있다.However, if it is determined that the consultation quality is not improved even though the voice quality information is provided in the second shorter period, the electronic device 100 acquires the spoken voice according to the third period longer than the first period, You can update quality information. In other words, slow feedback may be provided to counselors who do not have the effect of strong feedback.

한편, 상술한 실시 예에 따라 주기가 다양하게 변경되었음에도 상담 품질이 개선되지 않는 경우, 전자 장치(100)는 음성 품질 정보를 포함하는 UI(310)의 크기/비중을 실시간으로 변경할 수도 있다. 이 경우, 음성 품질 정보를 구성하는 이미지/문자의 크기 역시 변경될 수 있다.Meanwhile, according to the above-described embodiment, when the consultation quality is not improved even though the period is variously changed, the electronic device 100 may change the size/weight of the UI 310 including the voice quality information in real time. In this case, the size of the image/text constituting the voice quality information may also be changed.

예를 들어, 전자 장치(100)는 상담사 단말(300)의 화면에 표시되는 UI(310)의 크기(: 화면 내 UI(310)의 비중)가 확대되도록 상담사 단말(300)을 제어할 수 있다.For example, the electronic device 100 may control the counselor terminal 300 to enlarge the size of the UI 310 displayed on the screen of the counselor terminal 300 (: the specific gravity of the UI 310 in the screen). .

만약, UI(310)의 크기가 확대됨에 따라 상담 품질이 개선되는 것으로 판단되는 경우, 확대된 UI(310)가 유지될 수 있다.If it is determined that the consultation quality is improved as the size of the UI 310 is enlarged, the enlarged UI 310 may be maintained.

반면, UI(310)의 크기가 확대되었음에도 상담 품질이 개선되지 않는 것으로 판단되는 경우, 전자 장치(100)는 상담사 단말(300)의 화면에 표시되는 UI(310)의 크기(: 화면 내 UI(310)의 비중)가 기존의 크기보다도 더 축소되도록 상담사 단말(300)을 제어할 수 있다. 여기서, 상담 품질이 개선되는 경우, 축소된 UI(310)가 유지될 수 있다. 반면, UI(310)가 축소되었음에도 상담 품질이 개선되지 않는 경우, 전자 장치(100)는 일정 시간 동안 UI(310)를 표시하지 않도록 상담사 단말(300)을 제어할 수 있다.On the other hand, when it is determined that the consultation quality is not improved even though the size of the UI 310 is enlarged, the electronic device 100 displays the size of the UI 310 displayed on the screen of the counselor terminal 300 (: UI in the screen) 310) may be controlled to reduce the size of the counselor terminal 300 more than the existing size. Here, when the consultation quality is improved, the reduced UI 310 may be maintained. On the other hand, when the consultation quality is not improved even though the UI 310 is reduced, the electronic device 100 may control the counselor terminal 300 not to display the UI 310 for a predetermined period of time.

한편, 음성 품질 정보를 통해 상담 품질의 개선 여부가 판단되는 상술한 실시 예와 유사하게, 텍스트 품질 정보를 통해 실시간 상담 품질의 개선 여부가 판단될 수도 있다.Meanwhile, similar to the above-described embodiment in which it is determined whether the consultation quality is improved through the voice quality information, it may be determined whether the real-time consultation quality is improved through the text quality information.

일 실시 예로, 상술한 도 3의 실시 예를 참조하면, 전자 장치(100)는 다른 시간대에 생성된 제1 텍스트 품질 정보와 제2 텍스트 품질 정보를 비교하여 상담사의 상담 품질이 실시간으로 개선되고 있는지 여부를 판단할 수 있다.As an example, referring to the above-described embodiment of FIG. 3 , the electronic device 100 compares the first text quality information and the second text quality information generated at different time periods to determine whether the counselor's consultation quality is improving in real time. can determine whether

예를 들어, 제1 텍스트 품질 정보와 제2 텍스트 품질 정보를 비교했을 때 항목 별 일치율이 점차 증가하는 경우, 상담 품질이 실시간으로 개선되고 있는 것으로 판단될 수 있다.For example, when the matching rate for each item gradually increases when the first text quality information and the second text quality information are compared, it may be determined that the consultation quality is improving in real time.

만약, 상담사의 상담 품질이 실시간으로 개선되고 있는 것으로 판단된 경우, 전자 장치(100)는 이후 (새롭게) 입력되는 상담사의 발화 음성을 제1 주기로 획득하여 변환함으로써 텍스트를 획득하고, 획득된 텍스트를 활용하여 주기 별로 텍스트 품질 정보(ex. UI(320)에 표시되는 텍스트 품질 정보)를 업데이트할 수 있다.If it is determined that the counselor's counseling quality is improving in real time, the electronic device 100 acquires the text by converting the counselor's uttered voice inputted later (newly) in the first cycle, and converts the obtained text. By using it, text quality information (eg, text quality information displayed on the UI 320 ) may be updated for each cycle.

반면, 상담사의 상담 품질이 실시간으로 개선되고 있지 않은 것으로 판단된 경우, 전자 장치(100)는 이후 (새롭게) 입력되는 상담사의 발화 음성을 제1 주기보다 짧은 제2 주기로 획득하여 변환함으로써 텍스트를 획득하고, 획득된 텍스트를 활용하여 주기 별로 텍스트 품질 정보(ex. UI(320)에 표시되는 텍스트 품질 정보)를 업데이트할 수 있다.On the other hand, when it is determined that the counselor's counseling quality is not improving in real time, the electronic device 100 acquires and converts the counselor's uttered voice input later (newly) with a second cycle shorter than the first cycle, thereby acquiring text And, by using the acquired text, text quality information (eg, text quality information displayed on the UI 320 ) may be updated for each cycle.

다만, 더 짧은 제2 주기로 텍스트 품질 정보가 제공됨에도 상담 품질이 개선되지 않고 있는 것으로 판단되는 경우, 전자 장치(100)는 제1 주기보다 더 긴 제3 주기에 따라 발화 음성을 획득하여 주기 별로 텍스트 품질 정보를 업데이트할 수 있다.However, if it is determined that the consultation quality is not improved even though the text quality information is provided in the second shorter period, the electronic device 100 acquires the spoken voice according to the third period longer than the first period and provides text for each period. You can update quality information.

또는, 일 실시 예로, 전자 장치(100)는 제1 음성 품질 정보 및 제2 음성 품질 정보에 따라 음성 품질이 개선되고 있는지 여부를 판단하고, 제1 텍스트 품질 정보 및 제2 텍스트 품질 정보에 따라 텍스트 품질이 개선되고 있는지 여부를 판단할 수 있다.Alternatively, according to an embodiment, the electronic device 100 determines whether the voice quality is being improved according to the first voice quality information and the second voice quality information, and uses the text according to the first text quality information and the second text quality information. It can be judged whether the quality is improving or not.

여기서, 음성 품질 및 텍스트 품질이 모두 개선되고 있는 것으로 판단된 경우, 전자 장치(100)는 이후 (새롭게) 입력되는 상담사의 발화 음성을 제1 주기로 획득하여 음성 품질 정보 및 텍스트 품질 정보를 주기마다 업데이트할 수 있다. 그리고, 업데이트 된 음성 품질 정보 및 텍스트 품질 정보는 UI(310, 320)를 통해 표시될 수 있다.Here, when it is determined that both the voice quality and the text quality are being improved, the electronic device 100 acquires the (new) input counselor's utterance voice in the first cycle and updates the voice quality information and the text quality information every cycle. can do. In addition, the updated voice quality information and text quality information may be displayed through the UIs 310 and 320 .

반면, 음성 품질 및 텍스트 품질 중 적어도 하나가 개선되고 있지 않은 것으로 판단된 경우, 전자 장치(100)는 이후 (새롭게) 입력되는 상담사의 발화 음성을 제2 주기(<제1 주기)로 획득하여 음성 품질 정보 및 텍스트 품질 정보를 주기마다 업데이트할 수 있다. 그리고, 업데이트 된 음성 품질 정보 및 텍스트 품질 정보는 UI(310, 320)를 통해 표시될 수 있다.On the other hand, when it is determined that at least one of the voice quality and the text quality is not improving, the electronic device 100 acquires the (newly) inputted voice of the counselor in the second period (<first period) and provides a voice Quality information and text quality information may be updated every cycle. In addition, the updated voice quality information and text quality information may be displayed through the UIs 310 and 320 .

한편, 일 실시 예로, 전자 장치(100)는 상담사 단말(300)을 통해 진행되는 각 상담 건마다 음성 품질 정보 및 텍스트 품질 정보를 업데이트하는 주기를 다르게 설정할 수 있다.Meanwhile, as an embodiment, the electronic device 100 may set a different period for updating the voice quality information and the text quality information for each counseling case performed through the counselor terminal 300 .

예를 들어, 상담사 단말(300)을 통해 첫 번째 상담이 진행되는 동안, 전자 장치(100)는 특정 주기(: A 주기)마다 발화 음성을 수집하여 주기 별로 음성 품질 정보 및 텍스트 품질 정보를 업데이트하여 (상담사 단말(300)의 화면 상에) 제공할 수 있다. 여기서, 전자 장치(100)는 첫 번째 상담이 종료됨에 따라 최종적으로 상담의 제1 품질 정보(음성 품질 정보 및 텍스트 품질 정보)를 식별할 수 있다.For example, while the first consultation is in progress through the counselor terminal 300 , the electronic device 100 collects the spoken voice at a specific cycle (: cycle A) and updates the voice quality information and text quality information for each cycle. (on the screen of the counselor terminal 300) may be provided. Here, as the first consultation ends, the electronic device 100 may finally identify first quality information (voice quality information and text quality information) of the consultation.

또한, 상담사 단말(300)을 통해 두 번째 상담이 진행되는 동안, 전자 장치(100)는 특정 주기(: B 주기)마다 발화 음성을 수집하여 주기 별로 음성 품질 정보 및 텍스트 품질 정보를 업데이트하여 제공할 수 있다. 여기서, 전자 장치(100)는 두 번째 상담이 종료됨에 따라 최종적으로 상담의 제2 품질 정보(음성 품질 정보 및 텍스트 품질 정보)를 식별할 수 있다.In addition, while the second consultation is in progress through the counselor terminal 300 , the electronic device 100 collects the spoken voice at a specific cycle (: cycle B) and updates the voice quality information and the text quality information for each cycle. can Here, as the second consultation ends, the electronic device 100 may finally identify second quality information (voice quality information and text quality information) of the consultation.

또한, 상담사 단말(300)을 통해 세 번째 상담이 진행되는 동안, 전자 장치(100)는 특정 주기(: C 주기)마다 발화 음성을 수집하여 주기 별로 음성 품질 정보 및 텍스트 품질 정보를 업데이트하여 제공할 수 있다. 여기서, 전자 장치(100)는 세 번째 상담이 종료됨에 따라 최종적으로 상담의 제3 품질 정보(음성 품질 정보 및 텍스트 품질 정보)를 식별할 수 있다.In addition, while the third consultation is in progress through the counselor terminal 300 , the electronic device 100 collects the spoken voice at a specific cycle (: cycle C) and updates the voice quality information and text quality information for each cycle. can Here, as the third consultation ends, the electronic device 100 may finally identify third quality information (voice quality information and text quality information) of the consultation.

이때, 전자 장치(100)는 제1 내지 제3 품질 정보를 비교함으로써, 첫 번째 내지 세 번째 상담 중 가장 품질 정보가 우수한 상담을 식별할 수 있다. 그리고, 전자 장치(100)는 품질 정보가 가장 우수했던 상담에 대하여 품질 정보가 업데이트되어 제공되었던 주기를 선택할 수 있다.In this case, by comparing the first to third quality information, the electronic device 100 may identify a consultation having the best quality information among the first to third consultations. Then, the electronic device 100 may select a period in which the quality information is updated and provided for the consultation in which the quality information is the best.

예를 들어, 제1 내지 제3 품질 정보 중 제2 품질 정보가 가장 뛰어난 경우, 전자 장치(100)는 상담사 단말(300)의 상담사에게 B 주기에 따른 품질 정보의 업데이트가 가장 잘 맞는 것으로 식별할 수 있다.For example, when the second quality information among the first to third quality information is the best, the electronic device 100 identifies the counselor of the counselor terminal 300 as the best fit for the update of the quality information according to the B cycle. can

그리고, 상담사 단말(300)의 이후 상담 건에 대해서는, 상담(음성 통화)이 제공되는 동안, 전자 장치(100)는 해당 주기(ex. B 주기)에 따라 음성 품질 정보 및 텍스트 품질 정보를 업데이트하여 실시간으로 상담사 단말(300)의 화면 상에 표시할 수 있다.In addition, with respect to a subsequent consultation case of the counselor terminal 300 , while a consultation (voice call) is provided, the electronic device 100 updates the voice quality information and the text quality information according to the corresponding cycle (eg, the B cycle). It can be displayed on the screen of the counselor terminal 300 in real time.

이 경우, 각 상담사에게 가장 적당한 속도의 피드백 주기(: 품질 정보가 업데이트되어 UI(310, 320) 상에 표시되는 주기)가 선택되어 활용됨으로써, 맞춤형 피드백이 수행될 수 있다.In this case, a feedback cycle (a cycle in which the quality information is updated and displayed on the UIs 310 and 320) is selected and utilized for each counselor, so that customized feedback can be performed.

한편, 도 4는 본 개시의 일 실시 예에 따른 전자 장치의 구성을 설명하기 위한 블록도이다.Meanwhile, FIG. 4 is a block diagram illustrating a configuration of an electronic device according to an embodiment of the present disclosure.

도 4를 참조하면, 전자 장치(100)는 적어도 하나의 메모리(110), 통신부(120), 및 프로세서(130)를 포함할 수 있다.Referring to FIG. 4 , the electronic device 100 may include at least one memory 110 , a communication unit 120 , and a processor 130 .

메모리(110)는 전자 장치(100)의 구성요소들의 전반적인 동작을 제어하기 위한 운영체제(OS: Operating System) 및 전자 장치(100)의 구성요소와 관련된 적어도 하나의 인스트럭션 또는 데이터를 저장하기 위한 구성이다.The memory 110 is a configuration for storing an operating system (OS) for controlling overall operations of the components of the electronic device 100 and at least one instruction or data related to the components of the electronic device 100 . .

메모리(110)는 ROM, 플래시 메모리 등의 비휘발성 메모리를 포함할 수 있으며, DRAM 등으로 구성된 휘발성 메모리를 포함할 수 있다. 또한, 메모리(110)는 하드 디스크, SSD(Solid state drive) 등을 포함할 수도 있다.The memory 110 may include non-volatile memory such as ROM and flash memory, and may include volatile memory such as DRAM. In addition, the memory 110 may include a hard disk, a solid state drive (SSD), or the like.

메모리(110)는 상술한 다양한 실시 예와 관련된 인공지능 모델(ex. 제1 인공지능 모델, 제2 인공지능 모델)을 포함할 수 있다.The memory 110 may include an artificial intelligence model (eg, a first artificial intelligence model, a second artificial intelligence model) related to the various embodiments described above.

본 개시에 따른 인공지능 모델은, 다양한 개념, 조건, 관계 내지는 합의된 지식이 컴퓨터가 처리할 수 있는 형태로 표현된 온톨로지 기반의 데이터 구조로 구성될 수 있다.The artificial intelligence model according to the present disclosure may be composed of an ontology-based data structure in which various concepts, conditions, relationships, or agreed knowledge is expressed in a computer-processable form.

인공지능 모델은, 다양한 학습 알고리즘을 통해 전자 장치(100) 또는 별도의 서버/시스템을 통해 학습된 것일 수 있다. The artificial intelligence model may be learned through the electronic device 100 or a separate server/system through various learning algorithms.

학습 알고리즘의 예로는, 지도형 학습(supervised learning), 비지도형 학습(unsupervised learning), 준지도형 학습(semi-supervised learning) 또는 강화 학습(reinforcement learning)이 있으며, 본 개시에서의 학습 알고리즘은 명시한 경우를 제외하고 전술한 예에 한정되지 않는다.Examples of the learning algorithm include supervised learning, unsupervised learning, semi-supervised learning, or reinforcement learning, and the learning algorithm in the present disclosure is specified when It is not limited to the above-described example except for.

인공지능 모델은 신경망(Neural Network)을 기반으로 하는 네트워크 모델(신경망 모델)에 해당할 수 있다. 네트워크 모델은 가중치를 가지는 복수의 네트워크 노드들을 포함할 수 있다. 복수의 네트워크 노드들은 서로 다른 레이어의 노드 간 가중치를 기반으로 연결 관계를 형성할 수 있다.The artificial intelligence model may correspond to a network model (neural network model) based on a neural network. The network model may include a plurality of network nodes having weights. A plurality of network nodes may form a connection relationship based on weights between nodes of different layers.

신경망 모델은, CNN (Convolutional Neural Network), DNN (Deep Neural Network), RNN (Recurrent Neural Network), RBM (Restricted Boltzmann Machine), DBN (Deep Belief Network), BRDNN(Bidirectional Recurrent Deep Neural Network), GAN(Generative Adversarial Network) 및 심층 Q-네트워크(Deep Q-Networks) 등으로 구성될 수 있으나, 이에 한정되지 않는다.Neural network models include Convolutional Neural Network (CNN), Deep Neural Network (DNN), Recurrent Neural Network (RNN), Restricted Boltzmann Machine (RBM), Deep Belief Network (DBN), Bidirectional Recurrent Deep Neural Network (BRDNN), and GAN (GAN). Generative Adversarial Network) and deep Q-networks (Deep Q-Networks), etc. may be configured, but is not limited thereto.

통신부(120)는 다양한 유무선 통신방식으로 적어도 하나의 외부 장치와 통신을 수행하기 위한 회로, 모듈, 칩 등을 포함할 수 있다.The communication unit 120 may include circuits, modules, chips, etc. for performing communication with at least one external device in various wired/wireless communication methods.

통신부(120)는 다양한 네트워크를 통해 외부 장치들(ex. 고객 단말(200), 상담사 단말(300))과 연결될 수 있다.The communication unit 120 may be connected to external devices (eg, the customer terminal 200 and the counselor terminal 300) through various networks.

네트워크는 영역 또는 규모에 따라 개인 통신망(PAN; Personal Area Network), 근거리 통신망(LAN; Local Area Network), 광역 통신망(WAN; Wide Area Network) 등일 수 있으며, 네트워크의 개방성에 따라 인트라넷(Intranet), 엑스트라넷(Extranet), 또는 인터넷(Internet) 등일 수 있다.The network may be a personal area network (PAN), a local area network (LAN), a wide area network (WAN), etc. depending on the area or size, and depending on the openness of the network, an intranet, It may be an extranet or the Internet.

통신부(120)는 LTE(long-term evolution), LTE-A(LTE Advance), 5G(5th Generation) 이동통신, CDMA(code division multiple access), WCDMA(wideband CDMA), UMTS(universal mobile telecommunications system), WiBro(Wireless Broadband), GSM(Global System for Mobile Communications), DMA(Time Division Multiple Access), WiFi(Wi-Fi), WiFi Direct, Bluetooth, NFC(near field communication), Zigbee 등 다양한 무선 통신 방식을 통해 외부 장치들과 연결될 수 있다. The communication unit 120 is a long-term evolution (LTE), LTE Advance (LTE-A), 5th generation (5G) mobile communication, code division multiple access (CDMA), wideband CDMA (WCDMA), universal mobile telecommunications system (UMTS) , WiBro (Wireless Broadband), GSM (Global System for Mobile Communications), DMA (Time Division Multiple Access), WiFi (Wi-Fi), WiFi Direct, Bluetooth, NFC (near field communication), Zigbee, etc. It can be connected to external devices through

또한, 통신부(120)는 이더넷(Ethernet), 광 네트워크(optical network), USB(Universal Serial Bus), 선더볼트(ThunderBolt) 등의 유선 통신 방식을 통해 외부 장치들과 연결될 수도 있다.Also, the communication unit 120 may be connected to external devices through a wired communication method such as Ethernet, an optical network, a Universal Serial Bus (USB), or a ThunderBolt.

프로세서(130)는 전자 장치(100)를 전반적으로 제어하기 위한 구성이다. 구체적으로, 프로세서(130)는 메모리(110)와 연결되는 한편 메모리(110)에 저장된 적어도 하나의 인스트럭션을 실행함으로써 본 개시의 다양한 실시 예들에 따른 동작을 수행할 수 있다.The processor 130 is a configuration for overall controlling the electronic device 100 . Specifically, the processor 130 may perform operations according to various embodiments of the present disclosure by executing at least one instruction stored in the memory 110 while being connected to the memory 110 .

프로세서(130)는 CPU, AP, DSP(Digital Signal Processor) 등과 같은 범용 프로세서, GPU, VPU(Vision Processing Unit) 등과 같은 그래픽 전용 프로세서 또는 NPU와 같은 인공지능 전용 프로세서 등을 포함할 수 있다. 인공지능 전용 프로세서는, 특정 인공지능 모델의 훈련 내지는 이용에 특화된 하드웨어 구조로 설계될 수 있다.The processor 130 may include a general-purpose processor such as a CPU, an AP, a digital signal processor (DSP), or the like, a graphics-only processor such as a GPU, a vision processing unit (VPU), or the like, or an artificial intelligence-only processor such as an NPU. The AI-only processor may be designed with a hardware structure specialized for training or use of a specific AI model.

도 5는 본 개시의 일 실시 예에 따른 전자 장치의 기능적 구성 및 동작을 설명하기 위한 블록도이다.5 is a block diagram illustrating a functional configuration and operation of an electronic device according to an embodiment of the present disclosure.

도 5를 참조하면, 프로세서(130)는 입력부(510), 학습부(520), 분석부(530), 및 표출부(540)를 포함하는 다양한 기능적 구성을 제어할 수 있다. 도 5의 구성들(510, 520, 530, 540)은 소프트웨어 및/또는 하드웨어로 구성된 모듈 내지는 엔진에 해당할 수 있다.Referring to FIG. 5 , the processor 130 may control various functional configurations including the input unit 510 , the learning unit 520 , the analysis unit 530 , and the display unit 540 . The components 510 , 520 , 530 , and 540 of FIG. 5 may correspond to a module or an engine composed of software and/or hardware.

입력부(510)는 과거의 상담 사례를 입력 받고 인공지능 모델의 훈련을 위한 훈련 데이터(학습 데이터)를 구축하기 위한 구성이다.The input unit 510 is configured to receive past counseling cases and build training data (learning data) for training the artificial intelligence model.

구체적으로, 입력부(510)는 과거의 상담 사례에 해당하는 기존의 상담콜을 조회할 수 있다(S511). 구체적으로, 입력부(510)는 과거의 상담 데이터를 구성하는 다양한 상담사의 발화 음성에 대한 오디오 데이터 및 텍스트를 획득할 수 있다.Specifically, the input unit 510 may inquire about an existing consultation call corresponding to a past consultation case (S511). Specifically, the input unit 510 may acquire audio data and texts for voices uttered by various counselors constituting past counseling data.

여기서, 과거의 상담 데이터는 전자 장치(100)의 메모리(120)에 기저장되어 있거나, 또는 전자 장치(100)와 통신 가능한 적어도 하나의 외부 장치(데이터 베이스)에 저장되어 있을 수 있다.Here, the past consultation data may be stored in advance in the memory 120 of the electronic device 100 or may be stored in at least one external device (database) capable of communicating with the electronic device 100 .

입력부(510)는 과거의 상담 데이터 중 우수 상담 및/또는 열위 상담의 상담콜과 관련된 상담 데이터를 선택하여 수집할 수 있다(S512).The input unit 510 may select and collect counseling data related to counseling calls of excellent counseling and/or inferior counseling among past counseling data (S512).

이 경우, 사용자 입력에 따라 선택된 상담 데이터가 우수 상담 또는 열위 상담으로 식별될 수 있으나 이에 한정되지 않는다.In this case, the counseling data selected according to the user input may be identified as excellent counseling or poor counseling, but is not limited thereto.

한편, 입력부(510)는 상담 스크립트 및 규칙 정보를 등록할 수 있다(S513). 여기서, 상담 스크립트 및 규칙 정보는, 표준이 되는 상담 스크립트 및 규칙 정보로서, 사용자 입력에 따라 등록될 수 있으나, 이에 한정되지 않는다.Meanwhile, the input unit 510 may register the consultation script and rule information (S513). Here, the consultation script and rule information is standard consultation script and rule information, and may be registered according to a user input, but is not limited thereto.

학습부(520)는 품질 평가에 이용되는 인공지능 모델을 훈련시키기 위한 구성이다.The learning unit 520 is configured to train an artificial intelligence model used for quality evaluation.

학습부(520)는 선택된 상담 데이터로부터 녹취 데이터(발화 음성에 해당하는 오디오 데이터)를 수집하고(S521), 오디오 데이터로부터 특징 정보를 추출할 수 있다(S522).The learning unit 520 may collect recorded data (audio data corresponding to spoken voice) from the selected consultation data ( S521 ), and extract characteristic information from the audio data ( S522 ).

이 경우, 오디오 데이터로부터 특징 정보를 추출하도록 훈련된 제1 인공지능 모델이 활용될 수 있다.In this case, a first artificial intelligence model trained to extract feature information from audio data may be utilized.

그리고, 학습부(520)는 우수 상담 및/또는 열위 상담에 따른 오디오 데이터를 활용하여 제1 인공지능 모델을 훈련시킬 수 있다(S523). 아울러, 학습부(520)는 우수 상담에 매칭되는 오디오 데이터의 특징 정보를 저장할 수 있다.In addition, the learning unit 520 may train the first AI model by using audio data according to excellent counseling and/or inferior counseling ( S523 ). In addition, the learning unit 520 may store characteristic information of audio data matched with excellent counseling.

한편, 학습부(520)는 선택된 상담 데이터 내 오디오 데이터를 텍스트로 변환하여(S524), 규칙 정보를 추출할 수 있다(S525).Meanwhile, the learning unit 520 may convert the audio data in the selected consultation data into text (S524) and extract rule information (S525).

이 경우, 텍스트로부터 규칙 정보를 추출하도록 훈련된 제2 인공지능 모델이 활용될 수 있다.In this case, the second artificial intelligence model trained to extract rule information from the text may be utilized.

그리고, 학습부(520)는 우수 상담 및/또는 열위 상담에 따른 텍스트를 활용하여 제2 인공지능 모델을 훈련시킬 수 있다(S526). 아울러, 학습부(520)는 우수 상담에 매칭되는 텍스트의 규칙 정보를 저장할 수 있다.Then, the learning unit 520 may train the second artificial intelligence model by using the text according to the excellent counseling and/or the inferior counseling (S526). In addition, the learning unit 520 may store rule information of texts matching excellent counseling.

분석부(530)는 실시간으로 상담(음성 통화)이 진행되는 동안 상담의 품질을 분석하기 위한 구성이다.The analysis unit 530 is configured to analyze the quality of a consultation while a consultation (voice call) is in progress in real time.

분석부(530)는 실시간 통화 음성에 포함되는 상담사의 발화 음성에 해당하는 오디오 데이터를 수집할 수 있으며(S531), 수집된 오디오 데이터로부터 특징 정보를 추출할 수 있다(S532).The analysis unit 530 may collect audio data corresponding to the counselor's uttered voice included in the real-time call voice (S531), and extract characteristic information from the collected audio data (S532).

그리고, 분석부(530)는 추출된 특징 정보를 기저장된 특징 정보(ex. 우수 상담/열위 상담의 오디오 데이터의 특징 정보)와 비교하여 일치율을 획득할 수 있다(S533). Then, the analysis unit 530 may obtain a matching rate by comparing the extracted characteristic information with pre-stored characteristic information (eg, characteristic information of audio data of excellent counseling/poor counseling) ( S533 ).

일 예로, 우수 상담의 오디오 데이터의 특징 정보에 대한 일치율이 클수록, 현재 실시간으로 제공되고 있는 상담의 품질이 더 좋은 것으로 식별될 수 있다.For example, it may be identified that the higher the matching rate with respect to the characteristic information of audio data of excellent counseling is, the better the quality of counseling currently being provided in real time.

다른 예로, 열위 상담의 오디오 데이터의 특징 정보에 대한 일치율이 작을수록, 현재 실시간으로 제공되고 있는 상담의 품질이 더 좋은 것으로 식별될 수 있다.As another example, the lower the matching rate with respect to the characteristic information of the audio data of the inferior counseling, the better the quality of the counseling currently being provided in real time may be identified.

한편, 분석부(530)는 실시간으로 수집된 오디오 데이터를 텍스트로 변환하고(S534), 변환된 텍스트의 규칙 정보를 기저장된 규칙 정보(ex. 우수 상담/열위 상담의 텍스트의 규칙 정보)와 비교하여 일치율을 획득할 수 있다(S535).On the other hand, the analysis unit 530 converts the audio data collected in real time into text (S534), and compares rule information of the converted text with pre-stored rule information (eg, rule information of text of excellent counseling/poor counseling). to obtain a matching rate (S535).

일 예로, 우수 상담의 텍스트의 규칙 정보에 대한 일치율이 클수록, 현재 실시간으로 제공되고 있는 상담의 품질이 더 좋은 것으로 식별될 수 있다.For example, it may be identified that the higher the matching rate of the text of the excellent consultation with the rule information, the better the quality of the consultation currently being provided in real time.

다른 예로, 열위 상담의 텍스트의 규칙 정보에 대한 일치율이 작을수록, 현재 실시간으로 제공되고 있는 상담의 품질이 더 좋은 것으로 식별될 수 있다.As another example, it may be identified that the lower the matching rate with respect to the rule information of the text of the inferior consultation is, the better the quality of the consultation currently being provided in real time.

표출부(540)는 실시간으로 제공되는 상담의 품질에 대한 정보를 제공하기 위한 구성이다(S541).The display unit 540 is configured to provide information on the quality of counseling provided in real time (S541).

표출부(540)는 고객 단말(200)과 상담사 단말(300) 간에 주고받는 발화 음성이 변환된 텍스트를 표시하도록 상담사 단말(300)을 제어할 수 있다(S541).The display unit 540 may control the counselor terminal 300 to display a text in which the spoken voice exchanged between the customer terminal 200 and the counselor terminal 300 is converted (S541).

아울러, 표출부(540)는 앞서 분석부(530)를 통해 획득된 일치율에 대한 정보(: 음성 품질 정보, 텍스트 품질 정보)를 표시하도록 상담사 단말(300)을 제어할 수 있다(S542, S543).In addition, the display unit 540 may control the counselor terminal 300 to display information (: voice quality information, text quality information) about the matching rate obtained through the analysis unit 530 previously (S542, S543). .

한편, 이상에서 설명된 다양한 실시 예들은 서로 저촉되거나 모순되지 않는 한 두 개 이상의 실시 예가 서로 결합되어 구현될 수 있다.Meanwhile, the various embodiments described above may be implemented by combining two or more embodiments as long as they do not conflict with or contradict each other.

한편, 이상에서 설명된 다양한 실시 예들은 소프트웨어(software), 하드웨어(hardware) 또는 이들의 조합된 것을 이용하여 컴퓨터(computer) 또는 이와 유사한 장치로 읽을 수 있는 기록 매체 내에서 구현될 수 있다.Meanwhile, various embodiments described above may be implemented in a recording medium readable by a computer or a similar device using software, hardware, or a combination thereof.

하드웨어적인 구현에 의하면, 본 개시에서 설명되는 실시 예들은 ASICs(Application Specific Integrated Circuits), DSPs(digital signal processors), DSPDs(digital signal processing devices), PLDs(programmable logic devices), FPGAs(field programmable gate arrays), 프로세서(processors), 제어기(controllers), 마이크로 컨트롤러(micro-controllers), 마이크로 프로세서(microprocessors), 기타 기능 수행을 위한 전기적인 유닛(unit) 중 적어도 하나를 이용하여 구현될 수 있다. According to the hardware implementation, the embodiments described in the present disclosure are ASICs (Application Specific Integrated Circuits), DSPs (digital signal processors), DSPDs (digital signal processing devices), PLDs (programmable logic devices), FPGAs (field programmable gate arrays) ), processors, controllers, micro-controllers, microprocessors, and other electrical units for performing other functions may be implemented using at least one.

일부의 경우에 본 명세서에서 설명되는 실시 예들이 프로세서 자체로 구현될 수 있다. 소프트웨어적인 구현에 의하면, 본 명세서에서 설명되는 절차 및 기능과 같은 실시 예들은 별도의 소프트웨어 모듈들로 구현될 수 있다. 상술한 소프트웨어 모듈들 각각은 본 명세서에서 설명되는 하나 이상의 기능 및 작동을 수행할 수 있다.In some cases, the embodiments described herein may be implemented by the processor itself. According to the software implementation, embodiments such as the procedures and functions described in this specification may be implemented as separate software modules. Each of the above-described software modules may perform one or more functions and operations described herein.

한편, 상술한 본 개시의 다양한 실시 예들에 따른 전자 장치(100)에서의 처리동작을 수행하기 위한 컴퓨터 명령어(computer instructions) 또는 컴퓨터 프로그램은 비일시적 컴퓨터 판독 가능 매체(non-transitory computer-readable medium)에 저장될 수 있다. 이러한 비일시적 컴퓨터 판독 가능 매체에 저장된 컴퓨터 명령어 또는 컴퓨터 프로그램은 특정 기기의 프로세서에 의해 실행되었을 때 상술한 다양한 실시 예에 따른 전자 장치(100)에서의 처리 동작을 상술한 특정 기기가 수행하도록 한다. On the other hand, the computer instructions or computer program for performing the processing operation in the electronic device 100 according to various embodiments of the present disclosure described above is a non-transitory computer-readable medium. can be stored in When the computer instructions or computer program stored in such a non-transitory computer-readable medium are executed by the processor of the specific device, the specific device performs the processing operation in the electronic device 100 according to the various embodiments described above.

비일시적 컴퓨터 판독 가능 매체란 레지스터, 캐쉬, 메모리 등과 같이 짧은 순간 동안 데이터를 저장하는 매체가 아니라 반영구적으로 데이터를 저장하며, 기기에 의해 판독(reading)이 가능한 매체를 의미한다. 비일시적 컴퓨터 판독 가능 매체의 구체적인 예로는, CD, DVD, 하드 디스크, 블루레이 디스크, USB, 메모리카드, ROM 등이 있을 수 있다.The non-transitory computer-readable medium refers to a medium that stores data semi-permanently, rather than a medium that stores data for a short moment, such as a register, a cache, a memory, and the like, and can be read by a device. Specific examples of the non-transitory computer-readable medium may include a CD, DVD, hard disk, Blu-ray disk, USB, memory card, ROM, and the like.

이상에서는 본 개시의 바람직한 실시 예에 대하여 도시하고 설명하였지만, 본 개시는 상술한 특정의 실시 예에 한정되지 아니하며, 청구범위에서 청구하는 본 개시의 요지를 벗어남이 없이 당해 개시에 속하는 기술분야에서 통상의 지식을 가진 자에 의해 다양한 변형실시가 가능한 것은 물론이고, 이러한 변형실시들은 본 개시의 기술적 사상이나 전망으로부터 개별적으로 이해되어서는 안될 것이다.In the above, preferred embodiments of the present disclosure have been illustrated and described, but the present disclosure is not limited to the specific embodiments described above, and is commonly used in the technical field pertaining to the present disclosure without departing from the gist of the present disclosure as claimed in the claims. Of course, various modifications may be made by those having the knowledge of

100: 전자 장치 200: 고객 단말
300: 상담사 단말100: electronic device 200: customer terminal
300: agent terminal

Claims

A method for evaluating consultation quality of an electronic device, the method comprising:
providing a voice call between the customer terminal and the counselor terminal;
acquiring audio data based on the counselor's spoken voice input through the counselor's terminal while the voice call is being provided;
converting the audio data to obtain text;
acquiring characteristic information of the audio data by analyzing the audio data, and generating voice quality information by comparing the acquired characteristic information with pre-stored characteristic information;
obtaining rule information of the text by analyzing the text, and generating text quality information by comparing the obtained rule information with pre-stored rule information; and
providing the voice quality information and the text quality information through the counselor terminal while the voice call is provided;
The method for evaluating the consultation quality of the electronic device,
when the counselor's first spoken voice is received through the counselor's terminal, displaying the first text converted from the first spoken voice through the counselor's terminal;
generating first voice quality information based on the first spoken voice;
generating first text quality information based on the first text;
displaying the first voice quality information and the first text quality information together with the first text through the counselor terminal;
displaying the second text converted from the second spoken voice together with the first text through the counselor's terminal when the counselor's second spoken voice is received through the counselor's terminal while the first text is displayed ;
generating second voice quality information based on the first uttered voice and the second uttered voice;
generating second text quality information based on the first text and the second text; and
Displaying the second voice quality information and the second text quality information together with the second text through the counselor terminal;
The method for evaluating the consultation quality of the electronic device,
comparing the first voice quality information and the second voice quality information to determine whether the counseling quality of the counselor is being improved in real time;
when it is determined that the counselor's counseling quality is not improving in real time, controlling the counselor's terminal so that the size of a user interface (UI) including voice quality information is larger than the existing size;
when it is determined that the counseling quality of the counselor is not improving in real time, controlling the counselor terminal so that the size of the UI is reduced compared to the existing size; and
When it is determined that the counseling quality of the counselor is not improving in real time, controlling the counselor terminal not to display the UI for a predetermined period of time.

According to claim 1,
The method for evaluating the consultation quality of the electronic device,
generating a first artificial intelligence model for determining a match rate by extracting feature information of audio data; and
Training the first artificial intelligence model based on audio data matching the speech voice of the excellent counselor and the audio data matching the speech voice of the poor counselor;
The step of obtaining the voice quality information includes:
Through the artificial intelligence model, the obtained characteristic information is compared with the pre-stored characteristic information to obtain a matching rate,
The pre-stored feature information is,
A method for evaluating counseling quality of an electronic device, which is characteristic information of audio data that matches the speech voice of at least one excellent counselor.

According to claim 1,
The method for evaluating the consultation quality of the electronic device,
generating a second artificial intelligence model for determining a match rate by extracting rule information from the text; and
Training the second artificial intelligence model based on the converted text of the excellent counselor's speech and the converted text of the inferior counselor's speech;
The step of obtaining the text quality information includes:
Through the artificial intelligence model, the obtained rule information is compared with the pre-stored rule information to obtain a matching rate,
The pre-stored rule information is
A method for evaluating counseling quality of an electronic device, wherein the speech voice of at least one excellent counselor is rule information of the converted text.

According to claim 1,
The characteristic information is
Containing at least one item of pitch, resonance, intensity, and speed of the voice;
The voice quality information is
A method for evaluating counseling quality of an electronic device, comprising information on a matching rate for each item of the feature information and the pre-stored feature information.

According to claim 1,
The rule information is
including one or more items related to at least one of whether a preset script is uttered, the number of utterances of a preset script, and an utterance order of a plurality of scripts,
The text quality information is
The method for evaluating consultation quality of an electronic device, comprising information on a matching rate for each item of the rule information and the pre-stored rule information.

delete

According to claim 1,
Comparing the first voice quality information and the second voice quality information to determine whether the counseling quality of the counselor is improving in real time,
when it is determined that the counseling quality of the counselor is improving in real time, acquiring the uttered voice of the counselor input thereafter in a first cycle and updating the voice quality information for each cycle; and
When it is determined that the counseling quality of the counselor is not improving in real time, acquiring the uttered voice of the counselor, which is subsequently input, in a second cycle shorter than the first cycle, and updating the voice quality information for each cycle; , a method for evaluating the quality of consultation in electronic devices.

In an electronic device,
a memory in which at least one instruction is stored; and
The electronic device comprising a; processor for executing the instruction to perform the consultation quality evaluation method of claim 1 .

A computer program stored in a computer readable medium,
A computer program stored in a computer-readable medium, which is executed by a processor of an electronic device to cause the electronic device to perform the consultation quality evaluation method of claim 1 .