KR19990015749A

KR19990015749A - Apparatus and method for sending e-mail by recognizing voice

Info

Publication number: KR19990015749A
Application number: KR1019970038047A
Authority: KR
Inventors: 한몽주
Original assignee: 구자홍; 엘지전자 주식회사
Priority date: 1997-08-09
Filing date: 1997-08-09
Publication date: 1999-03-05

Abstract

The present invention relates to a technique for converting content input by voice into text information and transmitting it, and more particularly to an apparatus and method for sending e-mail by recognizing voice.

With a conventional e-mail processing apparatus, e-mail can be composed and sent only on a system, such as a PC or workstation, on which an e-mail server or client program is installed.

In the present invention, when the user speaks the content of the e-mail to be sent, following a predetermined sequence, the apparatus recognizes the voice, composes a mail with that content, and transmits the composed mail to the recipient system.

In particular, the invention provides an apparatus and method that let a user dictate the content of an e-mail over a telephone line, even from a remote location that has no system with an e-mail server or client program installed, and have the composed e-mail sent.

Description

Apparatus and method for sending e-mails by recognizing voices

FIG. 1 is a block diagram of a conventional e-mail processing apparatus, which is provided in a PC or other terminal that includes PC-communication components.

The communication connection unit 101 is a circuit, such as a modem or LAN card, that connects to a server over a telephone line and passes the data received from the server to the terminal. The microcomputer unit 102, connected to the communication connection unit 101, performs the control related to e-mail transmission: it runs the e-mail client program and communicates with the server through the communication connection unit to send and receive e-mail. The memory unit 103 is connected to the microcomputer 102 and stores the e-mail received from the server. The video unit 104 is connected to the microcomputer 102 and converts the received e-mail into an analog signal. The display unit 105 displays the e-mail output by the video unit 104 on the screen as a video signal.

With the conventional e-mail processing apparatus shown in FIG. 1, e-mail can be composed and sent only on a system, such as a PC or workstation, on which an e-mail server or client program is installed.

FIG. 1 is a block diagram of a conventional e-mail processing apparatus.

FIG. 2 is a diagram illustrating the concept of sending e-mail by voice according to the present invention.

FIG. 3 is a block diagram of the e-mail transmission apparatus of the present invention.

FIG. 4 is a flowchart of the voice recognition pattern generation process of the present invention.

FIG. 5 is a waveform diagram of the voice signal processing used to generate voice recognition patterns in the present invention.

FIG. 6 is a diagram showing the e-mail voice data format of the present invention.

FIGS. 7a and 7b are flowcharts of the process of sending e-mail by voice in the present invention.

FIG. 8 is a flowchart showing subroutine A of FIG. 7 in detail.

FIG. 9 is a flowchart showing subroutine B of FIG. 8 in detail.

FIG. 2 illustrates the concept of the system of the present invention, which recognizes voice and sends e-mail. The user inputs a voice signal using a telephone or a microphone; the user system recognizes the input voice and composes a text mail corresponding to the recognized content. The e-mail data composed in this way is sent by connecting to an e-mail server, and the recipient system connects to the e-mail server to receive and read the e-mail that was converted from voice and sent.

The composed e-mail data is shown as a text e-mail document (file) containing the recipient's e-mail address, a subject, and a body.

FIG. 3 is a block diagram of the apparatus of the present invention for sending e-mail by recognizing voice. It includes a communication connection unit 201, which establishes the communication line between the telephone network and the system and handles the sending and receiving of e-mail information; a voice processing unit 202, which converts voice signals arriving from the communication connection unit or from an externally connected microphone into digital data; a microcomputer 203, which controls the communication connection unit 201 to send and receive e-mail information and recognizes the voice signal to compose the e-mail; a memory unit 204, which under the control of the microcomputer 203 stores the voice data from the voice processing unit together with the e-mail messages composed from recognized voice data, the user's address book, and so on; and a video unit 205, which under the control of the microcomputer 203 shows sent and received e-mail data on the display unit 206.

The communication connection unit 201, controlled by the microcomputer 203, is connected to a telephone line or a LAN line. It includes a switching unit for switching between the telephone line and the LAN line, a tone generator for generating dial tones, a tone detector for detecting incoming calls, a digital-to-analog converter and an analog-to-digital converter for data transmission and reception, a communication modem, a controller that controls these appropriately, and an interface to the microcomputer 203. Under the control of the microcomputer 203 it establishes a communication line to a telephone number or an IP address.

The voice processing unit 202 includes an A/D converter for converting the input analog voice signal into a digital signal, and a filter circuit that removes noise from the voice signal or filters it into a frequency band suited to the voice transmission band of the telephone network.

The microcomputer 203 includes means for executing a known voice recognition algorithm and means for writing the recognized voice into a file in e-mail format.

The memory unit 204 stores information such as the digitized voice data, the e-mail message (data) that the microcomputer 203 composes by recognizing that voice data, the euc-kr codes with their associated voice pattern data, and the user's address book (each recipient's name with the associated e-mail address).

As shown in FIGS. 2 and 3, the user calls in from outside and connects, through the communication connection unit 201, to the voice-recognition e-mail service system of the present invention or to a user system equipped with this function. The microcomputer 203 then receives the user's voice signal as digital data through the voice processing unit 202, recognizes the user's voice using the previously stored voice recognition pattern information, composes the e-mail that the recognized voice specifies, and sends it to the recipient system.

The recognized voice data and the composed e-mail message are stored in the memory unit 204. If the user requests it, the composed e-mail is shown on the display unit 206 through the video unit 205.

FIG. 4 shows the pattern generation process for recognizing the user's voice, FIG. 5 shows an end point detection interval as an example of a voice recognition pattern, FIG. 6 shows the format of the e-mail voice data produced as a result of voice recognition, and FIGS. 7 to 9 show the process of composing and sending an e-mail.

The process by which the voice processing unit 202 and the microcomputer 203 generate the user's voice recognition patterns is described first, with reference to FIGS. 4 and 5.

The reference patterns for voice recognition are generated using well-known techniques.

The 'pattern matching' voice recognition method converts the input voice into a specific pattern, for example by frequency transformation, and recognizes it by comparing it with voice patterns previously produced in the same way. The 'neural network' voice recognition method uses the patterns to train a neural network, which then recognizes input voice automatically.
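The pattern matching idea can be illustrated with a minimal Python sketch. It assumes each voice pattern has already been reduced to a fixed-length feature vector; the patent does not specify the features, the distance measure, or a rejection threshold, so `match_pattern`, the Euclidean distance, and `max_distance` are illustrative assumptions only.

```python
from math import dist


def match_pattern(features, templates, max_distance=None):
    """Return the label of the stored template closest to `features`.

    `templates` maps a label (for example a Hangul syllable) to a feature
    vector of the same length as `features`. When `max_distance` is given
    (a hypothetical rejection threshold) and no template is close enough,
    None is returned to signal "no matching pattern".
    """
    best_label, best_distance = None, float("inf")
    for label, reference in templates.items():
        d = dist(features, reference)  # Euclidean distance (assumed measure)
        if d < best_distance:
            best_label, best_distance = label, d
    if max_distance is not None and best_distance > max_distance:
        return None
    return best_label
```

Returning `None` for an unmatched input is a convenient hook for the space handling described later in the text, where input that matches no pattern is treated as a blank.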

The present invention applies voice recognition technology that is widely available as software. When there are several users, voice characteristics differ from person to person, so averaging over them lowers the recognition rate, whereas recognition for one specific person is comparatively accurate. For this reason the invention assumes that one service system recognizes a single user, so that a high recognition rate can be achieved.

The recognition patterns may be based on the user's own voice; alternatively, the standard pronunciation of a well-known announcer may be built into a recognition database so that the voices of various users can be recognized.

In FIG. 4, to build voice patterns for the entire set of precomposed (Wansung) Hangul codes, the service system of the present invention has the user speak each Hangul code into the voice processing unit 202 through a microphone.

When the voice for a precomposed Hangul code is input, the voice processing unit 202 filters it appropriately and converts it into a digital signal; the start and end of each input are signaled with a specific key.

The digitized voice data then goes through the end point detection process shown in FIG. 5.

End point detection extracts only the actual voiced interval from the stored voice data. It scans the change in amplitude along the time axis to separate voiced from unvoiced intervals, removes the unvoiced intervals, and processes the data of the remaining voiced interval into a voice pattern, which is stored in the memory unit 204.
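A minimal sketch of amplitude-based end point detection follows. The patent gives no threshold value or windowing scheme, so the per-sample comparison against a fixed `threshold` is an assumption chosen for brevity; practical detectors usually work on short-time energy rather than single samples.

```python
def detect_endpoints(samples, threshold):
    """Return (start, end) indices of the voiced interval, or None.

    A sample counts as voiced when its absolute amplitude reaches
    `threshold` (an assumed, caller-supplied level). `end` is exclusive,
    so samples[start:end] is the voiced interval.
    """
    voiced = [i for i, s in enumerate(samples) if abs(s) >= threshold]
    if not voiced:
        return None  # nothing above the threshold: no interval to delimit
    return voiced[0], voiced[-1] + 1
```

For example, `detect_endpoints([0, 1, 0, 8, -9, 7, 0, 1], 5)` brackets the three loud samples in the middle, while a recording of pure silence yields `None`, which is the case the space pattern has to handle.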

After this process has been carried out for every Hangul code, a voice pattern for a space (one blank character) is generated.

That is, silence is recorded for about 1 to 2 seconds with nothing spoken into the microphone. End point detection on this recording finds no amplitude above the set level and so cannot delimit an interval; instead, an interval of about 3/4 of the stored voice data, centered on the middle of that data, is taken arbitrarily and stored as the voice pattern.
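Taking the central 3/4 of a silent recording can be sketched as below. The function name and the integer rounding are assumptions; the patent only states that roughly three quarters of the stored data, centered on its middle, is used.

```python
def central_segment(samples, fraction=0.75):
    """Return the middle `fraction` of `samples` (default: about 3/4)."""
    keep = int(len(samples) * fraction)   # number of samples to retain
    start = (len(samples) - keep) // 2    # trim equally from both ends
    return samples[start:start + keep]
```

With eight samples, six are kept and one is trimmed from each end.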

With this pattern, voice data arriving over an actual telephone call that contains no speech, or that contains a signal caused by noise but matches no pattern, can be handled as a space.

Through the above process, the patterns for all of the user's voice inputs are stored in association with their Hangul codes.

The Hangul code used is euc-kr, the precomposed (Wansung) encoding named according to the convention for naming character codes used on the Internet.

Finally, an address book is created by pairing each recipient's name with that recipient's e-mail address, and it is stored in the memory unit 204.

The contents stored in the memory unit 204 are shown inside the memory unit 204 in FIG. 3.

Once the voice recognition patterns have been generated and stored, the process of FIGS. 7 to 9 is carried out: the voice that the user inputs remotely by telephone is recognized using the stored patterns, an e-mail is composed as the recognized voice specifies, and it is sent to the specified recipient system.

That is, voice input through the telephone network is stored by the voice processing unit 202 in the memory unit 204 in the e-mail voice data format of FIG. 6; this e-mail voice data is then turned into e-mail data as shown in FIG. 2 and mailed to the recipient.

As FIG. 6 shows, each item in the e-mail voice data is preceded by its size in bytes or words. The data consists of the ASCII value of the recipient, already recognized by the system; the number of subject voice data items followed by those items; and the number of body voice data items followed by the body voice data items, which are the actual content of the mail. In addition, a value designated as carriage-return (line break) information, produced when a dial tone (for example '#') is detected, is stored within the body.
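The size-prefixed layout described for FIG. 6 can be sketched as a small serializer. The figure itself is not reproduced here, so the concrete choices are assumptions: 16-bit big-endian size and count fields, a reserved size value `LINE_BREAK` standing for the '#' line break, and a body count that includes line-break markers. The function names are hypothetical.

```python
import struct

LINE_BREAK = 0xFFFF  # hypothetical reserved size value marking a '#' line break


def _chunk(data: bytes) -> bytes:
    """Prefix one data item with its size (assumed 16-bit big-endian)."""
    if len(data) >= LINE_BREAK:
        raise ValueError("item too large for the assumed 16-bit size field")
    return struct.pack(">H", len(data)) + data


def pack_voice_mail(recipient: str, subject_items, body_items) -> bytes:
    """Serialize recipient, subject items and body items.

    `body_items` may contain None, which stands for a line break.
    """
    out = _chunk(recipient.encode("ascii"))
    out += struct.pack(">H", len(subject_items))
    for item in subject_items:
        out += _chunk(item)
    out += struct.pack(">H", len(body_items))
    for item in body_items:
        out += struct.pack(">H", LINE_BREAK) if item is None else _chunk(item)
    return out


def unpack_voice_mail(buf: bytes):
    """Inverse of pack_voice_mail: returns (recipient, subject, body)."""
    pos = 0

    def u16() -> int:
        nonlocal pos
        (value,) = struct.unpack_from(">H", buf, pos)
        pos += 2
        return value

    def take(n: int) -> bytes:
        nonlocal pos
        data = buf[pos:pos + n]
        pos += n
        return data

    recipient = take(u16()).decode("ascii")
    subject = []
    for _ in range(u16()):
        subject.append(take(u16()))
    body = []
    for _ in range(u16()):
        size = u16()
        body.append(None if size == LINE_BREAK else take(size))
    return recipient, subject, body
```

Using a reserved value in the size position, rather than an ordinary carriage-return byte, keeps the marker from being confused with a legitimate item size, which matches the concern the description raises about storing the '#' key among the voice data.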

The e-mail data produced by voice recognition is formed of euc-kr codes, as shown in FIG. 2. For use when the e-mail is sent, the recipient's e-mail address is stored after the 'To' code, the subject of the mail after 'Subject', and the content of the mail after 'Body'.

When sending this to the recipient system, the service system of the present invention takes the composed e-mail and adds the other elements of an e-mail, for example the sender, the date, and Hangul encoding information, before transmitting it.

Refer to FIG. 7.

The microcomputer 203 controls the communication connection unit 201 and uses its automatic answering function. When the user calls the system and enters the agreed key (password) for sending mail, the microcomputer verifies the key and activates the function for sending mail by voice.

If the key entered by the user is valid, the system returns an OK tone and begins building the voice data format for composing the e-mail according to the agreed dial tones.

The agreed dial tones are defined as follows: the '*' key ends a voice segment, the '#' key is a line break, the '1' key is the recipient, the '2' key is the subject, the '3' key is the body, and the '0' key is the end.
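The key assignments above can be sketched as a small dispatcher that sorts incoming voice items into recipient, subject (title), and body. The event representation, the function name, and the choice to let '*' close only the recipient and subject fields are assumptions made for illustration; tone detection, OK and error tones, and password checking are left out.

```python
def collect_fields(events):
    """Sort voice items into fields according to the agreed keys.

    `events` is an iterable of ("key", char) or ("voice", data) tuples
    (a hypothetical representation of what the tone detector and the
    voice processing unit deliver). A None entry in the body marks a
    line break entered with '#'.
    """
    fields = {"recipient": [], "subject": [], "body": []}
    field_for_key = {"1": "recipient", "2": "subject", "3": "body"}
    current = None
    for kind, value in events:
        if kind == "key":
            if value in field_for_key:
                current = field_for_key[value]   # start a new field
            elif value == "*" and current != "body":
                current = None                   # recipient/subject closed
            elif value == "#" and current == "body":
                fields["body"].append(None)      # line break marker
            elif value == "0":
                break                            # end of message
        elif kind == "voice" and current is not None:
            fields[current].append(value)
    return fields
```

Voice that arrives outside any field, or after the '0' key, is simply ignored in this sketch.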

When the '1' key arrives through the communication connection unit 201, the microcomputer 203 recognizes it, returns an OK tone, and treats the voice input until the '*' key as the 'recipient' (digitizing it and storing it temporarily).

Alternatively, specific recipients can be assigned to specific key numbers; instead of having the recipient's name recognized by voice, the user enters the agreed key number and the recipient corresponding to that number is looked up.

When the '2' key is entered, the voice input until the '*' key is treated as the 'subject'.

More precisely, when the '2' key is entered, the system first recognizes the stored voice signal in which the recipient's name was spoken and determines whether the recognized recipient exists in the address book in the memory unit 204. If the recipient does not exist, it sends an error tone and waits for re-entry. If the recipient exists, it stores the recipient (ASCII code) together with the recipient data size in the e-mail voice data, and then receives the voice and stores the received voice data.

When the '3' key is entered, the voice signals that follow are taken as the body of the mail. The number of subject items stored so far is written to memory, and each incoming voice item is then stored in order together with its size.

From then on, body content is received and stored until the '0' key is entered; when the '#' key is entered, a line break (CRLF) is recorded.

When the '0' key is entered, the system takes the mail body as complete, calculates and stores the number of voice items received, and sends an OK tone. It then recognizes the stored voice data, composes the e-mail, and sends it.

This process is shown in detail in FIGS. 8 and 9.

The microcomputer 203 reads the data size at the head of the voice data, reads that many units of the 'recipient' code that follows, looks up the corresponding e-mail address in the address book, and stores it in the e-mail data together with the 'To' code.

Next, the subject voice data, which is stored as voice, is loaded using the subject item count and each item's size and compared with the recognition patterns stored in the memory unit 204 to perform voice recognition. The recognized subject is converted to euc-kr codes and stored in the e-mail file together with the 'Subject' code.

The recognition process is shown in FIG. 9.

The routine that recognizes the subject uses the same subroutine (B) as body recognition, so the next item to be read can be either of two things: the size of a voice data item or carriage-return information (CRLF).

For the subject, a voice data size is read, so the voice data is loaded according to that size and end point detection is run to find the actual voiced interval. The end-point-detected voice data is recognized using the stored voice recognition pattern information, and the resulting euc-kr code is stored in the e-mail.

If there is no voice, the code for a space is stored in the e-mail.

After as many codes as there are subject voice items have been recognized and stored, control returns to the routine of FIG. 8, which reads, recognizes, and writes the body.

Before the body is read, the 'Body' code is stored in the e-mail data so that the parts can be told apart when the mail is actually sent. Then, in the same sequence as the FIG. 9 routine used for the subject, the body item count is read and the items are loaded and recognized one by one.

If the '#' key appears in place of a data size, it is carriage-return/line-feed information, so a carriage-return code is stored in the e-mail data.

The original carriage-return code is not used when the '#' key received over the telephone line is stored among the voice data, because the system could mistake that code for a data size.
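Subroutine B as described above can be sketched as a loop that turns stored items into text. The `recognize` callable is hypothetical: it stands for end point detection plus pattern matching and is assumed to return one character, or `None` when nothing matches. Line breaks are represented by `None` items, which is also an assumption of this sketch.

```python
def items_to_text(items, recognize):
    """Convert stored voice items to text.

    A None item is a line break and becomes CRLF; an item that
    `recognize` cannot match (it returns None) becomes a space.
    """
    parts = []
    for item in items:
        if item is None:
            parts.append("\r\n")          # '#' key: carriage return / line feed
            continue
        label = recognize(item)
        parts.append(label if label is not None else " ")
    return "".join(parts)
```

The resulting string can then be encoded with Python's built-in `euc-kr` codec, for example `items_to_text(items, recognize).encode("euc-kr")`, to obtain the codes that the description says are stored in the e-mail data.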

In this way all of the voice data becomes e-mail data as shown in FIG. 2.

The next operation is to send the mail to the recipient: the system builds the actual e-mail file from the information in the e-mail data.

It uses the recipient, subject, and body as they are, adds the date, the sender, the encoding type of the mail, and so on, and then connects to the e-mail server through the communication connection unit 201 and sends the mail.
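As a present-day illustration of this final step, the sketch below builds a message with Python's standard `email` package. The patent names neither a mail protocol nor a library, so the use of `EmailMessage`, the function name `build_email`, and the example addresses are assumptions; actually submitting the message to a server (for instance with `smtplib`) is omitted.

```python
from email.message import EmailMessage
from email.utils import formatdate


def build_email(sender, to_addr, subject, body):
    """Assemble a message from the recognized fields plus added headers."""
    msg = EmailMessage()
    msg["From"] = sender                        # added by the service system
    msg["To"] = to_addr                         # looked up in the address book
    msg["Subject"] = subject
    msg["Date"] = formatdate(localtime=True)    # added by the service system
    msg.set_content(body, charset="euc-kr")     # declares the Hangul encoding
    return msg


message = build_email("user@example.com", "friend@example.com",
                      "제목", "안녕하세요")
```

Here the recipient, subject, and body come from recognition, while the sender, date, and charset declaration correspond to the additional elements the description says are added before sending.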

The service system of the present invention can also readily be applied to processing the recognized voice data into a format conforming to the facsimile transmission standard, creating a facsimile file, and sending it automatically to a designated destination.

More broadly, it can readily be applied to editing voice-recognized data into a file conforming to a given transmission standard and sending it automatically to a designated destination.

According to the present invention, a user can call in from a remote location, compose a mail by voice over the telephone, and send it; with only a user ID, e-mail can be sent regardless of location or equipment.

Claims

1. An apparatus for sending e-mail by recognizing voice, comprising: means for recognizing a voice signal that a user inputs in accordance with a predetermined transmission standard; means for composing e-mail data conforming to the predetermined transmission standard from the recognized voice signal; and means for automatically transmitting the composed e-mail data to a recognized destination.

2. A method of sending e-mail by recognizing voice, comprising: a first step of checking a user's e-mail transmission request made over a telephone line; a second step of receiving and recognizing the voice of the user whose authenticity was determined in the first step; a third step of composing an e-mail by processing the voice recognized in the second step into text information conforming to the e-mail transmission standard; and a fourth step of automatically transmitting the e-mail composed in the third step to a recognized destination.

3. The method of claim 2, wherein telephone key input is used as processing information so that the voice input conforms to the e-mail transmission standard.

4. The method of claim 2, wherein the destination is recognized from a predetermined key input.

5. The method of claim 2, wherein the voice recognition e-mail service system is a user system.

6. The method of claim 2, wherein the voice recognition e-mail service system is an e-mail server.

7. The method of claim 2, wherein the first step of checking the transmission request comprises a process of checking the transmission request and a process of determining the authenticity of the user.

8. The method of claim 2, wherein the second step of recognizing the voice comprises: a process of receiving voice through a microphone or a telephone; a process of comparing the input signal with pre-generated patterns; and a process of recognizing a specific pattern based on the comparison result and storing the result.

9. The method of claim 2, wherein the fourth step of automatic transmission comprises: a recipient recognition process of reading the recipient data stored in the storage means and identifying the recipient by referring to the address book; and a process of automatically transmitting the mail to the identified recipient.

10. The method of claim 2, further comprising creating a space pattern by recording silence for a predetermined time and taking, at the center of the stored voice data, an interval of about 3/4 of that data.

11. The method of claim 8, wherein the pattern generation process comprises: a process of inputting the user's voice for all Hangul codes; an end point detection process of detecting end points from changes in amplitude; and a pattern storing process of storing the frequency components of the detected signals in association with the Hangul codes.