KR20040039586A

KR20040039586A - System and method for extracting ARS information using speech recognition

Info

Publication number: KR20040039586A
Application number: KR1020020067704A
Authority: KR
Inventors: 곽태수
Original assignee: (주)포이시스
Priority date: 2002-11-04
Filing date: 2002-11-04
Publication date: 2004-05-12

Abstract

PURPOSE: A system and method for extracting ARS(Automatic Response System) information using voice recognition is provided to achieve the information collection of an ARS system through the Internet using voice recognition. CONSTITUTION: An ARS voice information collection unit comprises a voice transmit-receive part(220), a scenario storage part(260), a DTMF generation part(240), an ASR(Automatic Speech Recognition) server(300), and a control part(280). The voice transmit-receive part(220) receives an announcement, outputted from an ARS server, through a PSTN, converts it into a digital signal, and transmits a DTMF signal for the connection to the ARS server. The scenario storage part(260) stores dialing numbers to be inputted according to the announcements outputted from the ARS server. The DTMF generation part(240) creates a DTMF signal equivalent to a dialing number outputted from the scenario storage part(260). The ASR server(300) extracts health insurance premium information data from a digital voice signal outputted from the voice transmit-receive part(220) and converts the extracted data into a text signal.

Description

ARS information extraction system and method using speech recognition {SYSTEM AND METHOD FOR EXTRACTING FROM ARS-INFORMATION USING SPEECH RECOGNITION}

The present invention relates to a system and method for extracting ARS information using speech recognition, which access an ARS system that provides information to users, cause the desired information to be output, and convert the output information into text through speech recognition.

In general, an ARS (Automatic Response System) is an automated answering system: when a user places a call from a telephone, a pre-stored announcement is played, and the user enters digits as the announcement directs in order to hear the desired information.

ARS systems are used in many fields, particularly by companies and government agencies that cannot handle a large volume of customer calls with their existing staff. That is, a user dials the company or government agency from a landline or mobile telephone, an announcement is played, and once the user enters the required information as the announcement instructs, the requested information is automatically output as speech.

This process is described below with reference to FIG. 1.

FIG. 1 is a schematic diagram illustrating a method of collecting information using a conventional ARS system.

Referring to FIG. 1, the conventional method of collecting information through an ARS system involves: a response information DB (100a) that stores the announcements the ARS system provides to customers and information about those customers; an ARS server (100) that searches the response information DB (100a) using the personal information entered by the customer, reads out the required information, and outputs it as speech; a landline telephone (10) through which the customer connects to the ARS server (100) over the public switched telephone network (PSTN); and a mobile telephone (20) through which the customer connects to the ARS server (100) via a base station (30) and a mobile switching center (MSC) connected to the PSTN.

In operation, the user dials the ARS telephone number from the landline telephone (10) or the mobile telephone (20) to collect the desired information. A call from the landline telephone (10) reaches the ARS server (100) through the PSTN, while a call from the mobile telephone (20) passes through the base station (30) to the MSC and then through the PSTN to the ARS server (100).

On receiving the call, the ARS server (100) reads the announcement data stored in the response information DB (100a) and sends it to the user's landline telephone (10) or mobile telephone (20). The user listens to the announcement and enters the requested information on the keypad; for example, if a resident registration number is requested, the user enters it using the telephone keypad. The entered resident registration number is transmitted to the ARS server (100), which reads the customer information corresponding to that number from the response information DB (100a). The ARS server (100) converts the data it has read into speech and outputs it to the user's landline telephone (10) or mobile telephone (20) via the PSTN and the MSC. The user then picks out the desired information from the speech that is played.

With this conventional method, the user must personally dial in by telephone to collect the desired information, and so has the inconvenience of relying on listening to collect it.

Moreover, although interfacing with other parties by computer has become the norm as the Internet has spread, an ARS system can be reached only through the telephone network, and so cannot make efficient use of the Internet.

Accordingly, the technical problem addressed by the present invention is to provide a system and method for extracting ARS information using speech recognition that allow information from an ARS system, which provides information to customers by voice, to be collected over the Internet by means of speech recognition.

Another technical problem addressed by the present invention is to provide a system and method for extracting ARS information using speech recognition that improve the accuracy of the information by obtaining the output of the ARS system as text rather than by listening.

FIG. 1 is a schematic diagram illustrating a method of collecting information using a conventional ARS system.

FIG. 2 is a schematic diagram of an ARS information extraction system using speech recognition according to the present invention.

FIG. 3 is a block diagram of an ARS voice information collecting device according to the present invention.

FIG. 4 is a flowchart illustrating a method of extracting ARS information using speech recognition according to the present invention.

FIG. 5 is a detailed flowchart illustrating the step of determining whether a target voice signal is present according to the present invention.

To achieve the above technical problem, the ARS information extraction system using speech recognition is a system that recognizes speech output from an ARS system and converts it into a signal that a computer can process, and is characterized by comprising: ARS voice information collecting means that inputs a telephone number and input information to the ARS system as DTMF signals, recognizes the voice signal output from the ARS system, extracts the target voice signal to be collected, and converts it into a text signal; a web server that transmits the text signal output from the ARS voice information collecting means to a customer computer over the Internet and stores the ARS input information transmitted from the customer; and a customer information database, provided in the web server, that stores the ARS input information transmitted from the customer computer and outputs the stored ARS input information to the ARS voice information collecting means when the ARS system is accessed.

In the present invention, the ARS voice information collecting means is characterized by comprising: a voice transmitter/receiver that receives the announcement output from the ARS system over the PSTN, converts it from an analog signal to a digital signal, and transmits DTMF signals for connecting to the ARS server; a scenario storage unit that stores the dialing numbers to be entered in response to the announcements received by the voice transmitter/receiver; a DTMF generator that generates DTMF signals corresponding to the dialing numbers output from the scenario storage unit; an ASR server that extracts the target voice signal requested by the customer from the digital voice signal received by the voice transmitter/receiver and converts it into a text signal; and a controller that controls the extraction of the target voice signal from the voice signal received by the voice transmitter/receiver and its conversion into a text signal.

In the present invention, the ASR server is characterized by comprising: a voice recognition unit that recognizes the digital voice signal supplied by the voice transmitter/receiver; a sample voice storage unit that stores, as digital data, the voice signals that come immediately before and immediately after the target voice signal within the supplied digital voice signal; a voice comparison unit that reads the voice signal output from the voice recognition unit and the voice signals stored in the sample voice storage unit and compares them to locate the target voice signal; a voice extraction unit that extracts the target voice signal from the voice signal received by the voice recognition unit according to the result of the comparison; and a TEXT converter that converts the target voice signal output from the voice extraction unit into a text signal.

In the present invention, the voice extraction unit preferably extracts, as the target voice signal, the portion of the voice signal output from the voice recognition unit that lies between the front voice and the rear voice.

Meanwhile, to achieve the above technical problem, the method of extracting ARS information using speech recognition is a method of recognizing speech output from an ARS system and converting it into a signal that a computer can process, and is characterized by comprising the steps of:
(a) the ARS voice information collecting device receiving a client's ARS input information read from the customer information database of the web server;
(b) the ARS voice information collecting device storing the received ARS input information in the scenario storage unit and generating a DTMF signal to connect to the ARS server;
(c) the ARS voice information collecting device receiving the announcement that the ARS server, having received the DTMF signal, reads from its response database and outputs;
(d) the ARS voice information collecting device, having received the announcement, transmitting the response stored in the scenario storage unit to the ARS server as a DTMF signal;
(e) the ARS voice information collecting device receiving the response announcement that the ARS server reads from its response database in reply to that response;
(f) the ASR server of the ARS voice information collecting device receiving the voice signal and comparing it with the voice signals stored in the sample voice storage unit to determine whether the current voice contains the target voice signal; and
(g) if the target voice signal is present, the ASR server extracting it from the transmitted voice signal and converting it into a TEXT signal.

In the present invention, step (f) is characterized by comprising the steps of:
(f-1) the voice recognition unit of the ASR server recognizing the input of the digital voice signal transmitted from the voice transmitter/receiver;
(f-2) the voice comparison unit reading the sample voices stored in the sample voice storage unit once the voice recognition unit has recognized the input of the digital voice signal; and
(f-3) the voice comparison unit comparing the voice signal transmitted from the voice recognition unit with the sample voices read from the sample voice storage unit to determine whether the sample voice signals are contained in it.
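The patent does not say how the voice comparison unit performs the comparison in step (f-3). As a minimal sketch only, assuming the digital voice signal and the stored samples are plain sequences of integer sample values and that a match means "every sample agrees within a tolerance", the presence check could look like the following. The names `find_sample` and `contains_target` and the `tolerance` parameter are hypothetical, and a practical system would more likely compare acoustic features than raw samples.

```python
from typing import Optional, Sequence


def find_sample(signal: Sequence[int], sample: Sequence[int],
                tolerance: int = 0) -> Optional[int]:
    """Return the first index at which `sample` occurs in `signal`, else None.

    Assumption: two windows match when every pair of sample values
    differs by at most `tolerance`.
    """
    n, m = len(signal), len(sample)
    if m == 0 or m > n:
        return None
    for start in range(n - m + 1):
        if all(abs(signal[start + i] - sample[i]) <= tolerance for i in range(m)):
            return start
    return None


def contains_target(signal: Sequence[int], front: Sequence[int],
                    rear: Sequence[int], tolerance: int = 0) -> bool:
    """Report whether `front` occurs, followed later by `rear`,
    with at least one sample of candidate target signal in between."""
    front_at = find_sample(signal, front, tolerance)
    if front_at is None:
        return False
    rest = signal[front_at + len(front):]
    rear_at = find_sample(rest, rear, tolerance)
    return rear_at is not None and rear_at > 0
```

With this sketch, a menu announcement that contains neither stored sample yields `False`, which corresponds to the case where the controller goes on to send the next scripted DTMF input instead of extracting anything.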

Hereinafter, the present invention is described in detail by way of embodiments and with reference to the accompanying drawings to aid understanding. The embodiments may, however, be modified in many different forms, and the scope of the present invention should not be construed as limited to the embodiments described below. The embodiments are provided to explain the present invention more clearly and easily to those of ordinary skill in the art. In the drawings, like reference numerals refer to like elements.

In this embodiment, the ARS information is described using the example of a service that provides information on the health insurance premiums a customer has paid.

FIG. 2 is a schematic diagram of an ARS information extraction system using speech recognition according to the present invention.

Referring to FIG. 2, the ARS information extraction system using speech recognition according to the present invention comprises: a response information DB (100a) that stores the announcements output through the ARS server (100) and the health insurance premium data corresponding to each customer's resident registration number; an ARS server (100) that, when a request for health insurance premium information arrives over the PSTN, plays an announcement for delivering that information and, once the caller enters the required information on the keypad in response, reads the health insurance premium data from the response information DB (100a), converts it into speech, and outputs it; an ARS voice information collecting device (200) that connects to the ARS server (100) by generating DTMF (Dual Tone Multi Frequency) signals, extracts only the information the customer needs from the health insurance voice signal that is output, and converts it into text; a web server (150) that accepts customer connections over the Internet, receives a resident registration number so that health insurance premium information can be collected, and stores in the customer information DB (150a) both the health insurance premium text data output from the ARS voice information collecting device (200) and the resident registration number submitted by the requesting customer; a customer information DB (150a) that stores the resident registration number data and the health insurance premium text data; and a customer computer (50) that connects to the web server (150) and enters the resident registration number for which health insurance premium information is to be collected. Here the customer is preferably a credit institution or the like that submits the resident registration numbers of various people to collect their health insurance premium data, estimates income from that data, and determines loan limits, credit limits, and so on.

The ARS voice information collecting device is described in detail below with reference to the block diagram of FIG. 3.

FIG. 3 is a block diagram of an ARS voice information collecting device according to the present invention.

Referring to FIG. 3, the device comprises: a voice transmitter/receiver (220) that receives the announcement output from the ARS server (100) over the PSTN, converts it from an analog signal to a digital signal, and transmits DTMF signals for connecting to the ARS server (100); a scenario storage unit (260) that stores the dialing numbers to be entered, matched to the points in time at which the announcements output from the ARS server (100) call for input; a DTMF generator (240) that generates DTMF signals corresponding to the dialing numbers output from the scenario storage unit (260); an ASR (Automatic Speech Recognition) server (300) that extracts the health insurance premium information requested by the customer from the digital voice signal output from the voice transmitter/receiver (220) and converts it into a text signal; and a controller (280) that controls all of the above functions and, in particular, controls the extraction of the health insurance premium information as a text signal from the voice signal output from the ARS server (100).
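The patent does not describe the internals of the DTMF generator (240). As an illustration only, the sketch below maps keypad keys to the standard DTMF tone pairs (one low-group and one high-group frequency per key) and synthesizes a key press as the sum of two sine waves. The function names, the 0.1 s tone duration, and the 8000 Hz sampling rate are assumptions made for the example, not values taken from the patent.

```python
import math
from typing import List, Tuple

# Standard DTMF keypad layout: each row shares a low tone, each column a high tone.
LOW_HZ = (697, 770, 852, 941)
HIGH_HZ = (1209, 1336, 1477, 1633)
KEYPAD = ("123A", "456B", "789C", "*0#D")


def dtmf_frequencies(key: str) -> Tuple[int, int]:
    """Return the (low, high) tone pair in Hz for a single keypad key."""
    if len(key) == 1:
        for row, keys in enumerate(KEYPAD):
            col = keys.find(key)
            if col != -1:
                return LOW_HZ[row], HIGH_HZ[col]
    raise ValueError(f"not a DTMF key: {key!r}")


def dtmf_samples(key: str, duration_s: float = 0.1,
                 rate_hz: int = 8000) -> List[float]:
    """Synthesize one key press as the sum of its two sine tones, within [-1, 1].

    duration_s and rate_hz are illustrative defaults (assumptions).
    """
    low, high = dtmf_frequencies(key)
    count = round(duration_s * rate_hz)
    return [
        0.5 * (math.sin(2 * math.pi * low * n / rate_hz)
               + math.sin(2 * math.pi * high * n / rate_hz))
        for n in range(count)
    ]


def dial(keys: str) -> List[Tuple[int, int]]:
    """Map a dialing string such as "1*" to its sequence of tone pairs."""
    return [dtmf_frequencies(k) for k in keys]
```

For instance, `dial("1*")` returns the tone pairs for the menu selection described in this embodiment, key 1 followed by the * key.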

The configuration of the ASR server (300) is described in more detail below.

The ASR server (300) comprises: a voice recognition unit (360) that recognizes the digital voice signal supplied by the voice transmitter/receiver (220); a sample voice storage unit (380) that stores, as digital data, the voice signals that come immediately before and immediately after the target voice signal in the output speech, so that only the speech corresponding to the health insurance premium can be extracted from the supplied digital voice signal; a voice comparison unit (400) that reads the voice signal output from the voice recognition unit (360) and the voice signals stored in the sample voice storage unit (380) and compares them to locate the target voice signal; a voice extraction unit (340) that extracts the target voice signal from the voice signal received by the voice recognition unit (360) according to the result of the comparison by the voice comparison unit (400); and a TEXT converter (320) that converts the target voice signal output from the voice extraction unit (340), that is, the digital voice signal corresponding to the health insurance premium, into a text signal and outputs it to the web server (150).

The operation of the ARS information extraction system using speech recognition configured as above is described below with reference to FIGS. 2 and 3.

For example, suppose the customer holds the personal information of a client applying for a loan; the customer must assess the client's credit before granting the loan. The customer therefore uses the computer (50) to connect to the web server (150) over the Internet, and then transmits the client's resident registration number, taken from the personal information stored in the storage device of the computer (50), to the web server (150). The web server (150) receives the resident registration number and stores it in the customer information DB (150a). To obtain the health insurance premium information needed for the credit assessment, the web server (150) connects to the ARS voice information collecting device (200), reads the client's resident registration number from the customer information DB (150a), and transmits it. The controller (280) receives the number and, to obtain the client's health insurance premium information, reads the telephone number data stored in the scenario storage unit (260). The controller (280) then directs the DTMF generator (240) to generate the telephone number as DTMF signals, dialing the ARS server (100) of the National Health Care Corporation, which provides health insurance information.

The DTMF signal generated by the DTMF generator (240) is transmitted by the voice transmitter/receiver (220) and reaches the ARS server (100) of the National Health Care Corporation through the PSTN; in response, the ARS server (100) sends the announcement stored in the response information DB (100a) to the ARS voice information collecting device (200) over the PSTN. Suppose, for example, that the first announcement is "Hello, this is the National Health Insurance Corporation. For a health insurance premium inquiry, press 1; for guidance on how premiums are calculated, press 2; ...; enter the number you want and then press the * button."; that the announcement played after 1 is selected is "Enter the 13-digit resident registration number of the person you are inquiring about, and then press the * button."; and that, when the resident registration number 700000-1234567 is entered, the announcement is "홍길동님의 10월 건강보험료는 50000원 입니다." ("Hong Gil-dong's October health insurance premium is 50000 won."). In that case the scenario storage unit (260) must store the timing information for when each announcement is played, together with the data to be entered at each moment a DTMF signal is required.

That is, the scenario storage unit (260) must hold data such that 1 and * are entered a set time after the first announcement begins, and the client's resident registration number and * are entered a set time after the next announcement begins. Because the resident registration number changes with each client, the controller (280), on receiving resident registration number data from the web server (150), replaces the number previously stored in the scenario storage unit (260) with the newly received one.
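The contents of the scenario storage unit (260) described above can be pictured as an ordered list of timed inputs. The following is a hedged sketch of one possible representation, not the patent's data format: the `ScenarioStep` type, the `build_scenario` helper, and the delay parameters are hypothetical, and the patent gives no concrete delay values, so the caller must supply them.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class ScenarioStep:
    delay_s: float  # wait this long after the announcement starts (assumed unit: seconds)
    keys: str       # keys to send as DTMF once the delay has elapsed


def build_scenario(resident_number: str, menu_delay_s: float,
                   id_delay_s: float) -> List[ScenarioStep]:
    """Build the two-step script for one client.

    Step 1 answers the menu announcement with "1*"; step 2 answers the
    next announcement with the 13-digit resident registration number
    followed by "*". Calling this again with a new number models the
    controller replacing the stored number for the next client.
    """
    digits = resident_number.replace("-", "")
    if len(digits) != 13 or not (digits.isascii() and digits.isdigit()):
        raise ValueError("resident registration number must have 13 digits")
    return [
        ScenarioStep(delay_s=menu_delay_s, keys="1*"),
        ScenarioStep(delay_s=id_delay_s, keys=digits + "*"),
    ]
```

For the example number used in this embodiment, `build_scenario("700000-1234567", 12.0, 8.0)` yields a second step whose keys are `"7000001234567*"`; the delays 12.0 and 8.0 are placeholder values chosen only for illustration.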

On receiving the DTMF input, the ARS server (100) has a TTS (Text To Speech) server (not shown) synthesize into speech the announcement stored as text in the response information DB (100a), "Hello, this is the National Health Insurance Corporation. For a health insurance premium inquiry, press 1; for guidance on how premiums are calculated, press 2; ...; enter the number you want and then press the * button.", and sends it to the ARS voice information collecting device (200) through the PSTN. The voice transmitter/receiver (220) receives the analog voice signal, converts it into a digital voice signal, and passes it to the voice recognition unit (360). The voice recognition unit (360) forwards the signal to the voice comparison unit (400), which compares it with the voice signals stored in the sample voice storage unit (380) to determine whether the current voice signal contains the health insurance premium information, that is, the target voice signal, and reports the result to the controller (280). Recognizing that the current voice signal does not contain the target voice signal, the controller (280) reads from the scenario storage unit (260) the time at which the current announcement ends and, when that time arrives, sends the stored data for 1 and * to the DTMF generator (240). The DTMF generator (240) converts that data into DTMF signals and transmits them through the voice transmitter/receiver (220) to the ARS server (100) over the PSTN.

On receiving the DTMF signals for 1 and *, the ARS server (100) has the TTS server read the next announcement data from the response information DB (100a) and sends the announcement "Enter the 13-digit resident registration number of the person you are inquiring about, and then press the * button." to the ARS voice information collecting device (200). As before, the controller (280) recognizes that this voice signal does not contain the target voice signal, reads from the scenario storage unit (260) the time at which the current announcement ends, and, when that time arrives, has the DTMF generator (240) generate the stored "7000001234567" as DTMF signals and transmit them to the ARS server (100).

Upon receiving the DTMF signals for 7000001234567, the ARS server 100 reads from the response DB 100a the name and health insurance premium information of the customer matching the received resident registration number, and the TTS server outputs it as a voice announcement over the PSTN to the ARS voice information collecting device 200. For example, if the announcement is "Hong Gil-dong's health insurance premium for October is 50000 won.", the voice transmitter/receiver 220 converts the voice signal from analog to digital and applies it to the voice recognition unit 360. The voice recognition unit 360 recognizes the voice signal and forwards it to the voice comparator 400, which compares it with the voice signals stored in the sample voice storage unit 380.

The sample voice storage unit 380 stores the voice signals used to pick out the target voice signal. To pick out the target voice signal "50000 won" (50000원) from the announcement above, the guide voice immediately in front of it and the guide voice immediately behind it must be known. In the Korean announcement "홍길동님의 10월 건강보험료는 50000원 입니다.", the front guide voice is "건강보험료는" ("the health insurance premium is") and the rear guide voice is "입니다" (the sentence-final "is"). That is, the sample voice storage unit 380 stores voice data corresponding to "건강보험료는" and "입니다". The voice comparator 400 therefore compares the announcement with the voice signals stored in the sample voice storage unit 380 and, when it finds voice signals matching the front and rear guide voices, sends the result to the control unit 280 and the voice extraction unit 340 so that the voice extraction unit 340 extracts the voice signal lying between the front and rear guide voices.
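The front/rear extraction rule can be illustrated with a short Python sketch. As a simplifying assumption it operates on already recognized text rather than on voice signals, whereas the patent compares voice data and converts only the extracted segment to text; the function name `extract_between` is hypothetical.

```python
def extract_between(transcript, front, rear):
    """Return the text between the first `front` marker and the next `rear` marker.

    Returns None when either marker is missing, which corresponds to an
    announcement that does not contain the target.
    """
    start = transcript.find(front)
    if start == -1:
        return None
    start += len(front)
    end = transcript.find(rear, start)
    if end == -1:
        return None
    return transcript[start:end].strip()


FRONT = "건강보험료는"  # front guide voice from the example announcement
REAR = "입니다"         # rear guide voice from the example announcement
```

Applied to the premium announcement quoted above, the function returns "50000원"; applied to the initial menu announcement, which never contains "건강보험료는", it returns None, so the scenario would simply move on to the next DTMF response.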

Once the voice signal "50000 won" has been extracted in this way, the voice extraction unit 340 sends it to the text conversion unit 320. The text conversion unit 320 converts the voice signal into a text signal and sends it to the web server 150. The web server 150 receives the text signal for the health insurance premium and forwards it over the Internet to the customer computer 50 that requested the information. The customer receives the health insurance premium text data and uses it as a reference in the credit evaluation for the client's loan. The web server 150 and the ARS voice information collecting device 200 preferably communicate over TCP/IP sockets.

Hereinafter, the method of extracting ARS information using speech recognition according to the configuration described above will be explained with reference to the flowcharts of FIGS. 4 and 5.

FIG. 4 is a flowchart illustrating the method of extracting ARS information using speech recognition according to the present invention.

Referring to FIG. 4, the web server 150 receives over the Internet the resident registration number from the client's personal information stored on the customer computer 50 and stores it in the customer information DB 150a (S100). The web server 150 reads the client's resident registration number from the customer information DB 150a and sends it to the ARS voice information collecting device 200 (S200). On receiving the number, the ARS voice information collecting device 200 stores it in the scenario storage unit 260 and generates a DTMF signal to connect to the ARS server 100 over the PSTN (S300). This DTMF signal corresponds to the access telephone number of the ARS server 100 and is generated by the DTMF generation unit 240 under the control of the control unit 280. On receiving the DTMF signal, the ARS server 100 has the TTS server convert the guide announcement stored as text in the response DB 100a into voice and outputs it over the PSTN to the ARS voice information collecting device 200 (S400). The control unit 280 of the ARS voice information collecting device 200 receives the voice output and, in order to collect the target voice signal corresponding to the health insurance premium, sends the response signal stored in the scenario storage unit 260 to the ARS server 100 as a DTMF signal (S500). The data in the response signal includes the selection number that causes the target voice signal to be output and the user's resident registration number. On receiving the response signal from the ARS voice information collecting device 200, the ARS server 100 reads the health insurance premium data for the resident registration number from the response DB 100a, has the TTS server convert it into a voice signal, and outputs it over the PSTN to the ARS voice information collecting device 200 (S600). The ASR server 300 of the ARS voice information collecting device 200 receives the voice signal and compares it with the voice signals stored in the sample voice storage unit 380 to determine whether the current voice contains the target voice signal (S700). If it does not, the above process is repeated; if it does, the voice extraction unit 340 of the ASR server 300 extracts the target voice signal from the received voice signal (S800). The text conversion unit 320 of the ASR server 300 converts the voice signal extracted by the voice extraction unit 340 into a text signal (S900). The ARS voice information collecting device 200 sends the text data for the health insurance premium, which is the target voice signal, over the Internet to the customer computer 50 (S1000).
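The control flow of steps S300 to S900 can be sketched in Python with a stand-in for the ARS server. Everything here is hypothetical scaffolding: `FakeArsServer`, its English announcements, the marker phrases "premium is" and "won", and the text-only interface replace the PSTN, TTS, and speech recognition stages, which are outside the scope of this sketch.

```python
class FakeArsServer:
    """Text-only stand-in for ARS server 100 (hypothetical, for illustration)."""

    def __init__(self, premiums):
        self.premiums = premiums  # resident number -> (name, premium)
        self.state = "menu"

    def connect(self):
        self.state = "menu"
        return "Press 1 for premium inquiry, then press *."

    def send_dtmf(self, digits):
        if self.state == "menu" and digits == "1*":
            self.state = "ask_id"
            return "Enter the 13-digit resident number, then press *."
        if self.state == "ask_id":
            self.state = "done"
            record = self.premiums.get(digits.rstrip("*"))
            if record is None:
                return "No record found."
            name, premium = record
            return f"{name}: premium is {premium} won."
        return "Invalid input."


def extract(text, front, rear):
    """Return the text between `front` and the following `rear`, or None."""
    start = text.find(front)
    if start == -1:
        return None
    start += len(front)
    end = text.find(rear, start)
    return None if end == -1 else text[start:end].strip()


def collect(server, resident_number, front="premium is", rear="won"):
    """Walk the scenario until an announcement contains the target."""
    scenario = ["1*", resident_number + "*"]        # responses stored at S300
    announcement = server.connect()                 # S300 to S400
    for response in [*scenario, None]:
        target = extract(announcement, front, rear)  # S700
        if target is not None:
            return target                            # S800 to S900
        if response is None:
            break
        announcement = server.send_dtmf(response)    # S500 to S600
    return None
```

Calling `collect` with a server that holds a record for 7000001234567 returns the premium text; with an unknown number the scenario runs out of responses and the function returns None. A real device would instead keep listening on the line, which this sketch does not model.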

Step S700, in which the ASR server determines whether the received voice signal contains the target voice signal, is described in detail below with reference to FIG. 5.

FIG. 5 is a detailed flowchart illustrating the step of determining whether the target voice signal is present according to the present invention.

Referring to FIG. 5, the voice transmitter/receiver 220 receives the voice signal output from the ARS server 100 and converts it from an analog voice signal into a digital voice signal (S700-1). The voice recognition unit 360 of the ASR server 300 recognizes the input of the digital voice signal sent from the voice transmitter/receiver 220 (S700-2). When the voice recognition unit 360 recognizes the input of a digital voice signal, the voice comparator 400 reads the sample voice stored in the sample voice storage unit 380 (S700-3). The voice comparator 400 compares the voice signal sent from the voice recognition unit 360 with the sample voice read from the sample voice storage unit 380 to determine whether the sample voice signal is contained in it; if a match is found, processing continues from step S800 described above, and if no part matches, the comparison is repeated (S700-4).
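The comparison in steps S700-3 and S700-4 can be illustrated with a deliberately simple Python sketch that searches a digitized signal for the stored front and rear samples. Sample-by-sample matching with a fixed tolerance is an assumption chosen for brevity; the patent does not state which comparison technique the voice comparator 400 uses, and practical speech matching would work on acoustic features rather than raw samples.

```python
def find_sample(signal, sample, tolerance=0):
    """Return the first index where `sample` matches `signal`, else -1.

    Two windows match when every pair of corresponding values differs by
    at most `tolerance`.
    """
    n, m = len(signal), len(sample)
    if m == 0:
        return -1
    for i in range(n - m + 1):
        if all(abs(signal[i + j] - sample[j]) <= tolerance for j in range(m)):
            return i
    return -1


def locate_target(signal, front, rear, tolerance=0):
    """Return (start, end) indices of the span between front and rear samples.

    Returns None when either stored sample is absent, in which case the
    comparison would be repeated on the next incoming signal.
    """
    f = find_sample(signal, front, tolerance)
    if f == -1:
        return None
    start = f + len(front)
    r = find_sample(signal[start:], rear, tolerance)
    if r == -1:
        return None
    return start, start + r
```

The returned span is what a voice extraction stage would cut out and hand to text conversion; `None` corresponds to the "no matching part" branch of S700-4.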

Through the above process, health insurance premium information is collected from a client's resident registration number. By repeating the same process each time a new client's resident registration number is sent to the ARS server, health insurance premium information can be collected for many clients.

Although the embodiment above describes the process of collecting a client's health insurance premium information as an example, in other embodiments of the present invention information can be collected from ARS systems that provide various kinds of information simply by supplying the input information needed to collect the desired information.

With all of the above components and steps in place, the ARS information extraction system and method using speech recognition according to the present invention allow information that a customer would otherwise have to gather by calling the ARS system directly and listening to be collected as text through a computer.

The preferred embodiments of the present invention have been described above in detail with reference to the accompanying drawings. However, those of ordinary skill in the art to which the present invention pertains may modify or apply these embodiments in various ways, and the scope of the technical idea of the present invention should be determined by the claims that follow.

According to one aspect of the present invention, a customer need only enter on a computer, over the Internet, the information that the ARS system requires in order to receive the desired information as text, which is convenient.

According to another aspect of the present invention, an ARS system that previously had to be reached by telephone becomes accessible over the Internet, which has the effect of absorbing the ARS system into the Internet.

According to yet another aspect of the present invention, information that was previously obtained by listening to the output of the ARS system is obtained as text, which improves the accuracy of the information.

Claims

In a system for recognizing voice output from an ARS system and converting it into a signal that a computer can recognize, an ARS information extraction system using speech recognition, comprising:

ARS voice information collecting means for inputting a telephone number and input information into the ARS system as DTMF signals, recognizing the voice signal output from the ARS system to extract the target voice signal to be collected, and converting the target voice signal into a text signal;

a web server for transmitting the text signal output from the ARS voice information collecting means to a customer computer over the Internet, and for storing the ARS input information transmitted from the customer; and

a customer information database, provided in the web server, for storing the ARS input information transmitted from the customer computer and outputting the stored ARS input information to the ARS voice information collecting means when connecting to the ARS system.

The system of claim 1, wherein the ARS voice information collecting means comprises:

a voice transmitter/receiver that receives the guide voice output from the ARS system over the PSTN, converts it from an analog signal into a digital signal, and transmits a DTMF signal for connecting to the ARS server;

a scenario storage unit that stores the dialing numbers to be input in response to the guide voice received by the voice transmitter/receiver;

a DTMF generation unit that generates a DTMF signal corresponding to the dialing number output from the scenario storage unit;

an ASR server that extracts the target voice signal requested by a customer from the digital voice signal received by the voice transmitter/receiver and converts the target voice signal into a text signal; and

a control unit that controls the extraction of the target voice signal from the voice signal received by the voice transmitter/receiver and its conversion into a text signal.

The system of claim 2, wherein the ASR server comprises:

a voice recognition unit that recognizes the digital voice signal applied from the voice transmitter/receiver;

a sample voice storage unit that stores, as digital data, the voice signals located in front of and behind the target voice signal in the applied digital voice signal;

a voice comparator that reads the voice signal output from the voice recognition unit and the voice signals stored in the sample voice storage unit and compares them with each other to pick out the target voice signal;

a voice extraction unit that extracts the target voice signal from the voice signal received by the voice recognition unit according to the result of the comparison by the voice comparator; and

a text conversion unit that converts the target voice signal output from the voice extraction unit into a text signal.

The system of claim 3, wherein the voice extraction unit extracts, as the target voice signal, the voice signal lying between the front voice and the rear voice in the voice signal output from the voice recognition unit.

In a method for recognizing voice output from an ARS system and converting it into a signal that a computer can recognize, an ARS information extraction method using speech recognition, comprising:

(a) receiving, by the ARS voice information collecting device, the client's ARS input information read from the customer information database of the web server;

(b) storing, by the ARS voice information collecting device, the received ARS input information of the client in a scenario storage unit and generating a DTMF signal to connect to the ARS server;

(c) receiving, by the ARS voice information collecting device, a guide voice read from the response database of the ARS server that received the DTMF signal;

(d) upon receiving the guide voice, transmitting, by the ARS voice information collecting device, a response signal stored in the scenario storage unit to the ARS server as a DTMF signal;

(e) receiving, by the ARS voice information collecting device, a response guide voice read from the response database of the ARS server in reply to the response signal;

(f) determining, by the ASR server of the ARS voice information collecting device, whether the current voice contains the target voice signal by comparing the voice signal with the voice signals stored in the sample voice storage unit; and

(g) if the target voice signal is present, extracting, by the ASR server, the target voice signal from the received voice signal and converting it into a text signal.

The method of claim 5, wherein step (f) comprises:

(f-1) recognizing, by the voice recognition unit of the ASR server, the input of the digital voice signal sent from the voice transmitter/receiver;

(f-2) reading, by the voice comparator, the sample voice stored in the sample voice storage unit when the voice recognition unit recognizes the input of the digital voice signal; and

(f-3) comparing the voice signal sent from the voice recognition unit with the sample voice read from the sample voice storage unit to determine whether the sample voice signal is contained in it.