KR20000036801A

KR20000036801A - Method of emboding sound recognize on the web browser

Info

Publication number: KR20000036801A
Application number: KR1020000016249A
Authority: KR
Inventors: 정현태
Original assignee: 정현태; 주식회사 아이엠티이십일
Priority date: 2000-03-29
Filing date: 2000-03-29
Publication date: 2000-07-05

Abstract

PURPOSE: A method for recognizing voice on web browsers is provided to recognize voice using VOIP(voice over internet protocol) and to adopt two method for storing recognized voice. CONSTITUTION: An apparatus adopting the method for recognizing voice on web browsers include a web browser(300), a file analyzer(410), a vocabulary recognizer(420), an isolated vocabulary generator(430), a result analyzer(440), an isolated vocabulary storage and result inspector. The method includes a first through a forth steps. At the step, a web browser for recognizing voice is executed. At the second step, a user is authenticated in the file analyzer(410). At the third step, a voice text of isolated vocabulary is recognized. At the forth step, the isolated vocabulary is analyzed in the result analyzer(440).

Description

How to Implement Speech Recognition on Web Browsers {Method of emboding sound recognize on the web browser}

본 발명은 음성인식의 구현방법에 관한 것으로, 특히 인터넷에 접속하여 웹브라우저를 이용할 때 음성 텍스트 방식을 이용하여 웹브라우저를 조작하고 이를 통해 용이하게 정보를 취득할 수 있도록 하는 웹브라우저 상에서의 음성인식 구현방법에 관한 것이다.The present invention relates to a method of implementing voice recognition, and in particular, when using a web browser when accessing the Internet, a voice recognition on a web browser to operate a web browser using a voice text method and to easily obtain information therethrough. It relates to an implementation method.

종래의 인터넷 사용은 여러 방면에서 보편화되어 있으며, 네트웍을 통한 정보의 전송 및 공유에 있어 표준적인 방법으로 자리잡고 있다. 이러한 인터넷망에 접속되어 사용하기 위해서는 웹브라우저의 프로그램을 이용하였다.Conventional use of the Internet has become commonplace in many ways and has become a standard method for the transmission and sharing of information over a network. In order to access and use such an internet network, a program of a web browser was used.

현재 운용되고 있는 월드와이드웹은 MS사의 Expiorer 또는 Netscape사의 Netscape Navigator라는 인터넷 브라우저를 사용하여 그래픽 또는 텍스트로 구현된 서버에 접속하여 내용을 볼 수 있도록 구성되어 있다. 이러한 웹브라우저의 접속에는 보통 키보드 내지 마우스를 사용하여야 하며, 조금 더 발전된 방법으로 터치스크린식의 방법이 사용되었다.The World Wide Web, which is currently in operation, is configured to connect to a graphic or text server and view its contents using an Internet browser called Microsoft's Expiorer or Netscape's Netscape Navigator. In general, a keyboard or a mouse must be used to access the web browser, and a touch screen type is used as a more advanced method.

이러한 문제점을 보완하기 위해 음성을 통한 명령 또는 음성을 텍스트로 변환하여 검색하는 방법을 사용하는 것이 편리성을 증대시킬 수 있다. 이와 같은 음성인식 방법은 음성에 대한 텍스트를 구현하는 방식을 사용하게 된다.In order to compensate for this problem, it is possible to increase the convenience of using a command through a voice or a method of converting a voice into a text and searching. This speech recognition method uses a method of implementing text for speech.

즉, 서버에 음성인식 엔진과 텍스트 변환기를 두어 인터넷상에 들어온 음성을 받아들어 같은 음성을 찾아 텍스트로 표현하게 된다.In other words, a voice recognition engine and a text converter are installed on the server to receive voices received from the Internet, and the same voices are found and expressed as text.

그러나, 음성을 이용한 웹브라우저의 사용은 다른 사이트에 연결할 경우 매번 다른 명령어 체계를 가질 수 있다. 또한, 우리말의 특성상 같은 뜻을 가진 여러 단어가 존재하게 된다.However, the use of voice-enabled web browsers can have different command systems each time you connect to different sites. In addition, there are several words with the same meaning because of the characteristics of Korean.

이러한 문제점으로 인해 업데이트되는 단어는 무한적인 데이터로 저장 공간과 처리속도 등의 한계에 부딪히게 된다. 또한, 기존 마이크를 사용 아날로그 음성신호를 컴퓨터가 인지할 수 있는 디지털신호는 전달하여 주는 방식을 이용하기 때문에 사용자측에서 디지털 신호로 가공하여 서버로 보내주는 속도는 1초 발음당 256Kb 정도의 양이 보내지게 되어 저속 인터넷 사용자에게는 효율성이 떨어지는 문제점이 있었다.Due to this problem, the updated word is bound to limit the storage space and processing speed as infinite data. In addition, because the digital microphone that can recognize the analog voice signal using the existing microphone is transmitted, the speed of processing the digital signal from the user side to the server is about 256Kb per second pronunciation. There is a problem that the low-speed Internet users are less efficient.

따라서, 본 발명은 상기한 문제점을 해결하기 위한 것으로서, 본 발명의 목적은 VoIP(Voice Over Internet Protocol 이하, VoIP)라는 인터넷음성전달 방식을 이용함과 더불어 음성에 대한 저장 방법을 이원화함으로서 웹브라우저 상에서 편리하게 음성인식 기능을 구현토록 하는 웹브라우저 상에서의 음성인식 구현방법을 제공하는데 그 목적이 있다.Accordingly, an object of the present invention is to solve the above problems, and an object of the present invention is to use an Internet voice transmission method called Voice Over Internet Protocol (VoIP) and to dualize a storage method for voice, thereby making it convenient on a web browser. The purpose of the present invention is to provide a voice recognition method on a web browser to implement a voice recognition function.

상기 본 발명의 목적을 달성하기 위한 기술적 사상으로서 본 발명은The present invention as a technical idea for achieving the object of the present invention

인터넷을 통하여 접속되는 웹브라우저와, 파일분석부, 어휘인식부, 고립단어생성부, 인식결과분석부, 고립단어저장부 및 결과물검색부로 이루어진 음성인식구동수단을 구비하여 웹브라우저 상에서의 음성인식을 구현방법에 있어서,Voice recognition means comprising a web browser connected through the Internet, a file analysis unit, a lexical recognition unit, an isolated word generation unit, a recognition result analysis unit, an isolated word storage unit, and a result search unit. In the implementation method,

인터넷에 접속하여 음성인식전용 웹브라우저를 실행하는 제 1과정과;A first step of accessing the Internet and executing a web browser for speech recognition;

상기 파일분석부에서 사용자의 인증을 검증하는 제 2과정;A second step of verifying a user's authentication in the file analyzer;

상기 어휘인식부에서 사용자의 고립 단어의 음성 텍스트를 인식하는 제 3과정;A third step of recognizing a speech text of a user's isolated word in the lexical recognition unit;

사용자로부터 입력되는 고립단어의 음성 텍스트가 웹브라우저용인지, 웹서버용인지의 여부를 상기 인식결과분석부에서 분석하여 상기 고립단어저장부에 해당하는 고립단어의 음성텍스트를 저장함과 더불어 명령어를 실행하는 제 4과정; 및The recognition result analyzing unit analyzes whether the voice text of the isolated word input from the user is for a web browser or a web server, and stores the voice text of the isolated word corresponding to the isolated word storage unit, and executes a command. Fourth process; And

상기 결과물검색부에서 검색하는 사용자의 음성 텍스트에 대한 고립단어가 상기 고립단어저장부에 저장되어 있지 않은 새로운 URL인 경우 사용자의 음성 텍스트에 대한 고립단어를 인식, 분석, 저장하는 일련의 과정을 반복적으로 수행하는 제 5과정으로 구성되는 웹브라우저 상에서의 음성인식 구현방법을 제공한다.If the isolated word of the user's voice text searched by the result search unit is a new URL that is not stored in the isolated word storage unit, a series of processes for recognizing, analyzing, and storing the isolated word of the user's voice text is repeated. It provides a speech recognition implementation method on a web browser consisting of a fifth process performed by.

도 1은 본 발명에 따라 웹브라우저 상에서의 음성인식 시스템을 구현하기 위한 전체적인 장치구성도1 is an overall device configuration for implementing a voice recognition system on a web browser according to the present invention

도 2는 도 1의 음성인식 시스템을 구체적으로 구현하기 위한 부분적인 장치구성도2 is a partial block diagram of a device for implementing the voice recognition system of FIG.

도 3은 본 발명에 따라 전체적인 음성인식 시스템이 구현되는 동작 흐름을 나타내는 플로우차트3 is a flowchart illustrating an operation flow in which an overall speech recognition system is implemented according to the present invention.

〈도면의 주요부분에 대한 부호의 설명〉<Explanation of symbols for main parts of drawing>

100: 사용자단말기 110: 음성입력수단100: user terminal 110: voice input means

120: 음성전달수단 130: 웹서버120: voice transmission means 130: web server

140: 음성인식변환부 150: 음성데이터저장부140: voice recognition conversion unit 150: voice data storage unit

200: 인터넷망 300: 웹브라우저200: Internet network 300: Web browser

410: 파일분석부 420: 어휘인식부410: File analysis unit 420: Vocabulary recognition unit

430: 고립단어생성부 440: 인식결과분석부430: isolated word generation unit 440: recognition result analysis unit

450: 고립단어저장부 460: 결과물검색부450: isolated word storage unit 460: results search unit

이하, 본 발명의 실시예에 대한 구성 및 그 작용을 첨부한 도면을 참조하면서 상세히 설명하기로 한다.Hereinafter, with reference to the accompanying drawings, the configuration and operation of the embodiment of the present invention will be described in detail.

도 1은 본 발명에 따라 웹브라우저 상에서의 음성인식 시스템을 구현하기 위한 전체적인 장치구성도이다.1 is an overall device configuration for implementing a voice recognition system on a web browser according to the present invention.

도 1에 도시된 바와 같이 시스템의 구성 요소를 살펴보면, 사용자단말기(100)와, 음성입력수단(110), 음성전달수단(120), 웹서버(130), 음성인식변환부(140) 및 음성데이터저장부(150)으로 구성되어 이루어져 있다.Looking at the components of the system as shown in Figure 1, the user terminal 100, the voice input means 110, voice transmission means 120, web server 130, voice recognition conversion unit 140 and voice It consists of a data storage unit 150.

사용자단말기(100)는 일반적인으로 사용되는 PC를 말하며, PC를 사용수단으로 인터넷에 접속하여 다양한 정보를 취득할 수 있다.The user terminal 100 refers to a PC generally used, and may access various types of information by accessing the Internet using a PC.

음성입력수단(110)은 사용자들로부터 발생되는 음성을 이용하여 텍스트 입력방식을 적용시키기 위한 입력장치로서 웹폰(wap phone), 마이크 등을 이용하여 인터넷망을 통해 음성전달수단에 접속하기 위한 입력수단을 나타낸다.Voice input means 110 is an input device for applying a text input method using the voice generated from the user input means for connecting to the voice transmission means through the Internet using a web phone (wap phone), microphone, etc. Indicates.

음성전달수단(120)은 음성입력수단(110)을 통한 음성데이터를 인터넷으로 전달하는 수단으로서 VoIP방식을 이용하며, 네트웍에서 표준 프로토골로 하는 TCP/IP를 이용해 데이터 뿐만아니라 음성까지 함께 실어보낼 수 있도록 한다.The voice transmission means 120 uses a VoIP method as a means for delivering voice data through the voice input means 110 to the Internet, and can carry not only data but also voice using TCP / IP as a standard protocol in a network. Make sure

웹서버(130)는 사용자의 ID를 입력하여 라우터(Router)와 허브(hub)를 거쳐 ID인증이 검증된 사용자에 한해서 웹브라우저에 접속하도록 해주는 서버를 나타낸다.The web server 130 represents a server for inputting a user's ID to access a web browser only for a user whose ID authentication is verified through a router and a hub.

웹서버(130)에는 입력된 음성데이터를 변환하여 텍스트로 출력하여 주는 음성인식변환부(140)와 입력된 음성에 대한 음성데이터를 저장하여 주는 음성데이터저장부(150)가 연계되어 있다.The web server 130 has a voice recognition converter 140 for converting the inputted voice data and outputting the text as a text, and a voice data storage unit 150 for storing voice data for the input voice.

도 2는 본 발명의 음성인식 시스템을 구현하기 위한 부분적인 장치구성도이다.Figure 2 is a partial block diagram for implementing a voice recognition system of the present invention.

도 2에 도시된 바와 같이 본 발명에 적용되는 부분적인 장치구성도를 살펴보면, 인터넷망(200)과, 웹브라우저(300), 파일분석부(410), 어휘인식부(420), 고립단어생성부(430), 인식결과분석부(430), 고립단어저장부(450), 결과물검색부(460)로 구성되어 이루어져 있다.Referring to the partial configuration of the device applied to the present invention as shown in Figure 2, the Internet network 200, the web browser 300, file analysis unit 410, lexical recognition unit 420, isolated word generation It consists of a unit 430, recognition result analysis unit 430, isolated word storage unit 450, the result search unit 460.

웹브라우저(300)는 인터넷망(200)과의 접속을 통해 사용자가 원하는 인터넷 사이트에 접속할 수 있도록 하는 역할을 한다.The web browser 300 plays a role of allowing a user to access a desired Internet site through a connection with the Internet network 200.

파일분석부(410)는 인터넷망(200)을 통해 전달되는 예컨대 HTML(Hypertext Makeup Language)파일에서 음성인식 대상이 되는 링크를 가지는 하이퍼텍스트를 찾아내는 역할을 하게 된다. 또한, 음성전달수단(120)의 VoIP를 통하여 들어오는 압축된 파일을 분석하게 된다.The file analysis unit 410 serves to find a hypertext having a link that is a voice recognition target in, for example, an HTML (Hypertext Makeup Language) file transmitted through the Internet network 200. In addition, the compressed file coming through the VoIP of the voice transmission means 120 is analyzed.

어휘인식부(420)는 별도의 훈련 없이 수시로 변화하는 어휘를 인식할 수 있는 인식기로서 하기의 고립단어생성부에서 전달되는 고립 단어을 사용하여 사용자가 새로운 사이트로 접속을 위해 발성한 음성이나 웹브라우저(300)를 제어하기 위해 발성한 음성을 인식하여 그 결과를 인식결과분석부에 넘겨주게 된다.The vocabulary recognition unit 420 is a recognizer capable of recognizing a vocabulary that changes at any time without additional training, and uses the isolated word transmitted from the following isolated word generator to make a voice or a web browser spoken for the user to access a new site. The control unit 300 recognizes the voice spoken to control 300 and passes the result to the recognition result analyzing unit.

고립단어생성부(430)는 파일분석부(410)로부터 찾아낸 하이퍼텍스트에서 실질적으로 인식대상 어휘를 생성하게 된다. 즉, 고립단어를 자동으로 생성할 수 있으며, 사용자의 검색란에 사용된 검색단어를 따로 저장하여 둔다.The isolated word generator 430 substantially generates the recognition target vocabulary from the hypertext found from the file analyzer 410. That is, isolated words can be automatically generated, and the search words used in the user's search box are stored separately.

이 때, 고립단어란 미리 생성하여 훈련시킨 단어를 지칭하며, 일반적으로 음성인식에 대한 기술은 서술식 문장의 음성은 음성을 분석하는 데 문제가 있어 미리 정한 단어를 훈련시켜 완벽하게 찾을 수 있는 방법을 택하고 있다.In this case, the isolated word refers to a word that has been generated and trained in advance, and in general, the technique of speech recognition has a problem in analyzing the speech of the speech of the descriptive sentence, so that a method of finding a perfect word by training a predetermined word can be found. Is choosing.

인식결과분석부(440)는 어휘인식부(420)에서 넘겨준 결과를 고립단어생성부(430)에서 받은 인식 후보들과 매칭시켜 웹브라우저(300)에 적절한 명령을 전달하게 된다.The recognition result analyzing unit 440 matches the result passed from the lexical recognition unit 420 with the recognition candidates received from the isolated word generation unit 430 to transmit an appropriate command to the web browser 300.

이 때, 웹브라우저(300)에서 파일을 가져오고, 그 인식 결과를 가지고 웹브라우저(300) 상에서 원하는 명령을 내리는 등 웹브라우저(300)와 상호 연계되어 있다.At this time, the file is retrieved from the web browser 300 and the web browser 300 is connected to the web browser 300 by giving a desired command on the web browser 300 with the recognition result.

고립단어저장부(450)는 고립단어생성부(430)로부터 생성된 고립단어들을 데이터베이스하여 저장하여 두는 저장부를 나타내고, 결과물검색부(460)는 음성전달수단(120)을 통해 인식되는 음성을 전달하는 메뉴, 음성인식기, 음성을 편집하는 툴과 텍스트에 대한 음성생성기, 고립단어 생성, 삭제 및 동일한 언어를 검색하는데 이용된다.The isolated word storage unit 450 indicates a storage unit storing and storing isolated words generated from the isolated word generation unit 430, and the result search unit 460 delivers the voice recognized through the voice transmission unit 120. Menus, voice recognizers, tools to edit voices, voice generators for text, isolated word generation, deletion, and searching for the same language.

도 3은 본 발명의 전체적인 흐름도로서, 본 발명이 어떠한 과정을 거쳐 음성을 사용하여 웹 브라우저를 제어할 수 있는지를 보여준다.3 is an overall flowchart of the present invention, which shows how the present invention can be used to control a web browser using voice.

먼저, 사용자가 인터넷에 접속한 후(S500) 사용자가 사용할 음성전용 웹브라우저(300)를 실행시키게 된다.(S510)First, the user accesses the Internet (S500) and then executes the voice-only web browser 300 to be used by the user (S510).

음성전용 웹브라우저(300)를 실행시키게 되면 웹서버(130)에 연결되어(S520) 사용자의 음성 텍스트를 인식하기 위한 전단계로서 사용자의 인증을 검증하게 된다.(S530)When the voice-only web browser 300 is executed, it is connected to the web server 130 (S520) and verifies the user's authentication as a previous step for recognizing the user's voice text (S530).

즉, VoIP라는 음성전달수단(120)를 통해 들어온 압축된 파일을 파일분석기(410)에서 분석하게 된다.That is, the file analyzer 410 analyzes the compressed file received through the voice transmission means 120 called VoIP.

사용자의 인증이 검증되었다고 판단되는 경우 파일분석기(410)로부터 분석된 파일을 변환하여 주는 웹브라우저(300)의 음성 전달기를 작동시키게 되며(S540), 음성인식기의 어휘인식부(420)가 작동된다.(S550)When it is determined that the user's authentication is verified, the voice transmitter of the web browser 300 for converting the analyzed file from the file analyzer 410 is operated (S540), and the lexical recognition unit 420 of the voice recognizer is operated. (S550)

이어서, 마이크 또는 웹폰 등의 음성입력수단(110)를 통해 음성의 텍스트를 입력하게 된다.(S560)Subsequently, the text of the voice is input through the voice input means 110 such as a microphone or a web phone.

이 때, 사용자는 VoIP방식을 통해 음성을 전달하는 메뉴, 음성인식기, 음성을 편집하는 툴과 텍스트에 대한 음성생성기, 고립단어생성, 삭제, 동일한 언어를 찾는 검색기를 이용할 수 있다.In this case, a user may use a menu for delivering voice through a VoIP method, a voice recognizer, a tool for editing a voice, a voice generator for text, an isolated word generation, deletion, and a searcher for searching the same language.

사용자가 검색란 등의 텍스트 입력란에 커서를 놓으면 자동적으로 고립단어생성부(430)가 작동하여 입력을 받는 형식을 취하게 된다.When the user places the cursor in a text input field such as a search box, the isolated word generation unit 430 operates to take an input form.

사용자의 음성 텍스트가 입력되면 어휘인식부(420)에서는 사용자의 고립단어 음성 텍스트를 인식하게 된다.(S570)When the user's voice text is input, the lexical recognition unit 420 recognizes the user's isolated word voice text (S570).

이 때, 고립단어란 미리 생성하여 훈련시킨 단어를 지칭하며, 웹서버(130)에는 기본적인 한글의 자음, 모음의 발음(예컨대 ㄱ - 기역, ㅏ - 아....)과 문자(예컨대 가, 갸, 거, 겨....), 특수문자(% - 퍼센트, * - 아스트리크(별)....) 그리고 알파벳 , 숫자의 발음을 두어 임의적인 글자를 만들어 사용할 수 있는 고립단어생성부(430)를 두게 된다.In this case, the isolated word refers to a word that is generated and trained in advance, and the web server 130 includes basic Korean consonants and pronunciations of vowels (eg, a-base, ㅏ-a ...) and letters (eg, Gya, giant, bran ....), special characters (%-percent, *-astrik (star) ....) and alphabets, numbers, pronunciation word generator that can be used to create random letters (430).

또한, 음성인식전용 웹브라우저에 다수의 사용자가 동시 접속하여 발생되는 시스템의 부화시에는 음성데이터저장부를 각 채널별로 두어 접속하는 순서에 의해 배분하는 방식을 이용하게 된다.In addition, when incubating a system generated by a plurality of users simultaneously connected to a voice recognition web browser, a method of distributing voice data storage units by channel is used.

사용자로부터 입력되는 음성 텍스트가 고립단어 형태인지의 여부를 판단하여(S580) 고립단어라고 판단되는 경우 인식결과분석부(440)에서는 웹브라우저(130)의 고립단어저장부(450)에 고립단어를 저장하게 된다.(S610)If it is determined that the voice text input from the user is in the form of an isolated word (S580), if it is determined that it is an isolated word, the recognition result analyzing unit 440 stores the isolated word in the isolated word storage unit 450 of the web browser 130. It is stored (S610).

이 때, 고립단어 생성은 웹서버(130)와 연결하여 얻어진 음성에 대한 단어를 선택적으로 지정하여 저장할 수 있도록 하며, 재 명령시에는 서술적인 방법으로 사용자 자체에서 음성인식을 처리하여 음성 텍스트만이 웹브라우저에 기입되도록 하여 네트웨크 상의 속도를 유지시킨다.At this time, the isolated word generation allows to selectively designate and store a word for the voice obtained by connecting with the web server 130, and when re-command, the voice recognition is processed by the user itself in a descriptive manner. It keeps speed on the network by allowing it to be entered in a web browser.

다만, 고립단어의 갯수를 제한하여 무한적인 생성을 막고 삭제 및 추가 할 수 있는 툴을 마련하여 사용자 스스로가 관리할 수 있도록 한다.However, by limiting the number of isolated words to prevent infinite creation, to provide a tool that can be deleted and added so that users can manage themselves.

상기 판단결과, 인식결과분석부(440)에서 고립단어가 아니라고 판단되는 경우에는 음성 인식결과를 출력하게 된다.(S590)If it is determined that the recognition result analysis unit 440 is not an isolated word, the voice recognition result is output.

이어서, 사용자의 음성이 고립단어로 저장되어 있는지의 여부를 판별하여(S600) 고립단어로 저장되어 있다고 판단되는 경우 웹브라우저(130)의 고립단어저장부(450)에 고립단어를 저장하게 된다.Subsequently, it is determined whether the user's voice is stored in the isolated word (S600), and if it is determined that the user's voice is stored in the isolated word, the isolated word is stored in the isolated word storage unit 450 of the web browser 130.

이 때, 음성의 텍스트가 영어인 경우 사람마다 단어의 발음 차이가 있어 고립단어의 역할을 할 수 없는 문제가 발생됨으로 사용자 스스로 자신의 발음을 훈련시켜 음성을 인식할 수 있는 고립단어 생성툴을 이용하게 된다.In this case, if the text of the voice is English, there is a problem that cannot be played as an isolated word because there is a difference in the pronunciation of the word for each person, so that the user can train his pronunciation and use the isolated word generation tool that can recognize the voice. Done.

상기 판단결과, 고립단어로 저장되어 있지 않다고 판단되는 경우 사용자의 음성 텍스트의 명령을 실행하게 된다.(S620)As a result of the determination, if it is determined that the word is not stored as an isolated word, a command of the user's voice text is executed.

이어서, 결과물을 출력하는지의 여부를 판단하게 되는 데(S630) 결과물을 출력하는 경우 결과물을 웹서버(130)의 고립단어저장부(450)에 저장함과 더불어 결과물을 검색하게 된다.(S640)Subsequently, it is determined whether to output the result (S630). When outputting the result, the result is stored in the isolated word storage unit 450 of the web server 130 and the result is searched.

이 때, 사용자의 입력된 음성의 영어가 예컨대, S(에스), T(티), A(에이), R(알), T(티)인 경우 스타트, 스타트-START로 결과물을 받을 수 있도록 한다.At this time, if the English of the input voice of the user is S (S), T (T), A (A), R (R), T (T), so that the result can be received by Start and Start-START. do.

또한, 자동적인 고립단어의 생성은 틀린 음성의 결과물을 무한적인 생성에 대하여 이를 막기 위해 선택적으로 저장하게 된다.In addition, automatic generation of isolated words may optionally store the result of wrong speech to prevent it from being infinitely generated.

이어서, 사용자의 새로운 음성 입력이 들어왔다고 판단되는 경우, 음성 인식을 수행하고(S560), 그에 따른 인식 결과를 분석한 후(S570) 인식 결과가 새로운 유.알.엘(Uniform Resource Locator 이하, URL)을 가지는지의 여부를 판별하여(S650) 새로운 URL을 가지는 것이면 상기와 같은 작업 과정이 다시 이루어진다.Subsequently, when it is determined that the user's new voice input is input, after performing voice recognition (S560), analyzing the recognition result (S570), the recognition result is a new Uniform Resource Locator (URL) or less. If it is determined whether or not to have a (S650) having a new URL, the above-described process is performed again.

상기 판단 결과, 새로운 URL을 가지지 않았다고 판단되는 경우 인식 결과가 웹브라우저(300)의 프로그램을 종료시키게 된다.As a result of the determination, if it is determined that the user does not have a new URL, the recognition result terminates the program of the web browser 300.

이상에서와 같이 본 발명에 따르면, 인터넷에 접속하는데 이용되는 웹브라우저를 인간이 사용하는 가장 보편적이고 편리한 의사소통수단인 음성을 통한 텍스트 입력방식을 사용하여 웹브라우저 상에서의 음성인식 기능을 구현함으로서 다음과 같은 효과가 있다.As described above, according to the present invention, by implementing a speech recognition function on the web browser by using a text input method through the voice, which is the most common and convenient communication means that humans use the web browser used to access the Internet It has the same effect.

첫째, 웹브라우저 상에서 키보드 조작을 통한 텍스트 입력방식이 아닌 음성을 통한 텍스트 입력 방식을 사용자의 인터페이스로 제공함으로서 보다 편리하게 웹브라우저를 사용할 수 있게 해준다.First, by providing a text input method using a voice as a user interface rather than a text input method through a keyboard operation on the web browser, it is possible to use the web browser more conveniently.

둘째, 음성을 통한 텍스트 입력방식을 사용함으로서 컴퓨터의 사용에 익숙하지 않은 사용자와 손을 사용하기가 불편한 장애인들도 보다 쉽게 웹브라우저를 사용할 수 있게 해준다.Second, by using text input method through voice, users who are not used to using computer and handicapped people who are uncomfortable with hand can use web browser more easily.

셋째, 음성을 통한 텍스트 입력방식을 사용함으로서 인터넷에 접속하여 사용될 수 있는 모든 제품, 예컨대 인터넷 TV, 웹폰 등에도 폭넓게 적용할 수 있다.Third, by using the text input method through the voice can be widely applied to all products that can be used to access the Internet, such as Internet TV, web phones, and the like.

넷째, 텍스트 입력방식으로 명사와 고유명사로 조합된 단어들을 음성데이터저장부에 저장함으로서 이를 통하여 기존의 검색엔진 예컨대 야후, 심마니, 라이코스 등에 적용하여 정보 검색 등에도 이용될 수 있다.Fourth, by storing words combined with nouns and proper nouns as a text input method in the voice data storage unit, the words can be applied to existing search engines such as Yahoo, Simmani, Lycos, etc., for information retrieval.

다섯째, 텍스트 입력방식을 이용하여 사용자가 원하는 검색어가 없을 경우 단어의 조합으로 자기만의 검색어를 만들어 이용할 수 있다.Fifth, if there is no search word desired by the user using a text input method, a user can make and use his or her own search word using a combination of words.

Claims

Voice recognition means comprising a web browser connected through the Internet, a file analysis unit, a lexical recognition unit, an isolated word generation unit, a recognition result analysis unit, an isolated word storage unit, and a result search unit. In the implementation method,

A first step of accessing the Internet and executing a web browser for speech recognition;

A second step of verifying a user's authentication in the file analyzer;

A third step of recognizing a speech text of a user's isolated word in the lexical recognition unit;

A fourth step of analyzing the voice text of the isolated word input from the user in the recognition result analyzing unit to store the voice text of the isolated word corresponding to the isolated word storage unit and to execute a command; And

If the isolated word of the user's voice text searched by the result search unit is a new URL that is not stored in the isolated word storage unit, a series of processes for recognizing, analyzing, and storing the isolated word of the user's voice text is repeated. Speech recognition implementation on a web browser consisting of a fifth process performed by.

The method of claim 1, wherein the voice transmission method in the Internet in the first process uses a VoIP method.

The method of claim 1, wherein the voice text transmission method of the user in the third process uses a TCP / IP method.

The method of claim 1, wherein in the third step, the isolated word of the voice text is formed by combining arbitrary letters by putting each pronunciation corresponding to a consonant, a vowel, a character, a special character, an alphabet, and a number of Korean characters. How to Implement Speech Recognition on Web Browsers.

The method of claim 1, wherein in the first step of incubation of the system generated by a plurality of users connected to the voice recognition web browser at the same time, the voice data storage unit for each channel by using the method of distribution in the order of access. A voice recognition method on a web browser.

The method of claim 1, wherein in the first process, the web browser comprises a menu window for selecting Korean, English, special characters, and numbers.

The method of claim 1, wherein in the fourth process, the isolated word storage unit may selectively designate and store an isolated word for a voice obtained by connecting to a web server.

The method of claim 1, wherein when re-entering the voice text of the isolated word input from the isolated word generator, the voice recognition is processed in a descriptive manner so that only the text is written in the web browser. Voice recognition implementation method.

The speech recognition method of claim 1, wherein the web browser is provided with a tool for limiting the number of isolated words generated from the isolated word generator to prevent infinite generation and to delete and add an infinite word. Implementation method.

The method of claim 1, wherein the web browser is provided with an isolated word generation tool capable of recognizing a voice by training a user's own pronunciation when the voice text to be spoken is English. .