KR20020024045A - An unseen interpretation & practice speaking foreign language make use of cellular phone - Google Patents

An unseen interpretation & practice speaking foreign language make use of cellular phone Download PDF

Info

Publication number
KR20020024045A
KR20020024045A KR1020020003743A KR20020003743A KR20020024045A KR 20020024045 A KR20020024045 A KR 20020024045A KR 1020020003743 A KR1020020003743 A KR 1020020003743A KR 20020003743 A KR20020003743 A KR 20020003743A KR 20020024045 A KR20020024045 A KR 20020024045A
Authority
KR
South Korea
Prior art keywords
data
speaker
interpretation
cellular phone
foreign language
Prior art date
Application number
KR1020020003743A
Other languages
Korean (ko)
Inventor
배성윤
Original Assignee
배성윤
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 배성윤 filed Critical 배성윤
Priority to KR1020020003743A priority Critical patent/KR20020024045A/en
Publication of KR20020024045A publication Critical patent/KR20020024045A/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/72Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/725Cordless telephones
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/20Education
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M2250/00Details of telephonic subscriber devices
    • H04M2250/74Details of telephonic subscriber devices with voice recognition means

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Machine Translation (AREA)

Abstract

PURPOSE: A system for supporting foreign language conversation study and interpretation using a cellular phone is provided to allow a user to be able to study conversation and interpret through a voice recognition system without inputting character data. CONSTITUTION: The voice of a speaker is inputted and converted into a digital signal, to be spooled in a host. A standardization recognition module corresponding to personal data is executed on the basis of a personal interface. The speaker's database and an arbitrary speaker's standardization data are divided from each other. A composite voice recognition process for converting phoneme into an English word or other language and a continuous vocabulary synthesizing process for synthesizing continuous vocabularies are continuously performed to collect similar phoneme of candidate words. By doing so, at least one data is extracted, and the data and candidate vocabularies are output to a mobile terminal.

Description

셀룰러폰을 이용한 외국어 회화학습과 통역지원 시스템{An unseen interpretation & practice speaking foreign language make use of cellular phone}An unseen interpretation & practice speaking foreign language make use of cellular phone}

본 발명은 이동 단말기 사용의 음성인식 시스템에 복합음성 인식 프로세스와 연속어휘 합성 인식과정을 거친 실시간 번역 및 회화학습을 제공하는 전화정보 서비스에 관한 시스템이다.The present invention is a system for a telephone information service that provides a real-time translation and conversation learning through a complex speech recognition process and a continuous vocabulary synthesis recognition process in a speech recognition system using a mobile terminal.

일반적으로 이동단말기(2)는 무선 RF전파를 송수신하고 변복조를 거치며 무선 LAN전파를 송수신하여 신호를 처리하는 통신모듈과 음성 입력수단 및 음성 출력수단을 가지며 통화상태 및 화상 데이터 표시수단을 가지고 있다. 화자에 의해 입력되는 임의의 자료를 A/D 어뎁터(4)를 거쳐 개인의 데이터를 표준화 데이터로 바꾸어 주는 단계(6)와 언어프로세싱 방법으로 음운을 영어단어나 다른 언어로 바꾸어주는 단계(8)와 비터비 검색을 거쳐 연속어휘를 인식하고 합성하는 단계로 합성된 데이터를 끝까지 반복하는 단계(10)를 거쳐 후보인식단어의 알고리즘 검출과 특징추출을 거친 출력기를 통하여 최종 인식어휘를 적용하여 적어도 하나이상의 데이터를 이동 단말기에 전송하는 단계를 포함하는 방법이다.In general, the mobile terminal 2 has a communication module, a voice input means, a voice output means, and a call state and image data display means for transmitting and receiving wireless RF waves, undergoing modulation and demodulation, and transmitting and receiving wireless LAN waves. (6) converting any data input by the speaker through the A / D adapter (4) into personalized data (6) and changing the phonology into English words or other languages by the language processing method (8) And recognizing and synthesizing the continuous vocabulary through Viterbi search and repeating the synthesized data to the end (10) and applying the final recognized vocabulary through the output of the algorithm detected and extracted through the candidate recognition word. The method includes transmitting the above data to the mobile terminal.

본 고안에 따르면 이동단말기에 지원되는 컨텐츠의 제공에 관한 서비스의 일환으로서 새로운 기술에 대한 제품의 혁신에 따르는 R&D에 대하여 제품 내에 추가되는 부품에 대한 지출이나 추가되는 부품에 의한 새로운 기종의 등장을 줄이는 기존제품의 최대한의 활용에 대한 것으로서 현재 사용되고 있는 음성인식 시스템의 효용을 증가시켜 제품의 서비스질의 개선과 고객의 확보에 관한 문제를 해결하고자 하는 것이다. 다른 새로운 제품의 4PS를 줄이고 이동통신기기를 이용한 영어 또는 다른 언어의 학습과 외국인과의 의사소통에 필요한 통역 또는 번역을 언제 어디서든지 전문가의 지원 없이도 일상생활에서 사용할 수 있으며 개인의 제품구매의 추가부담에 대한 문제를 풀어내고자 하는 것이다. 기존의 통신기기에 메모리칩과 시스템 모듈을 보완하거나 새롭게 출시되는 기종에 대해서만 상용화를 시킴으로써 생산원가의 절감과 개인 데이터베이스 구축의 대량화를 막는 것에 특징이 있다.According to the present invention, as part of a service for providing contents supported by a mobile terminal, the R & D following a product innovation for a new technology reduces spending on parts added in a product or appearance of a new model by the added parts. It is to maximize the utilization of existing products and to increase the utility of the voice recognition system currently used to solve the problems of improving the service quality of the product and securing the customer. Reduce the 4PS of other new products, and use the translation or translation necessary for learning English or other languages using mobile communication devices and communicating with foreigners in everyday life anytime, anywhere without the support of an individual, and the additional burden of purchasing a product I'm trying to solve the problem of. Complementing memory chips and system modules in existing communication devices or commercializing only newly released models prevents the reduction of production costs and the mass build of personal databases.

도 1은 종래의 셀룰러폰을 도시한 사시도1 is a perspective view showing a conventional cellular phone

도 2는 본 고안을 이용한 시스템의 구조도2 is a structural diagram of a system using the present invention

〈도면의 주요부분에 대한 부호의 설명〉<Explanation of symbols for main parts of drawing>

2. 셀룰러폰 4. A/D, D/A adapter2. Cellphone 4. A / D, D / A adapter

6. 음성 인식 시스템 8. 연속음성 인식 프로세스6. Speech Recognition System 8. Continuous Speech Recognition Process

10. 연속어휘 합성 프로세스 12. 데이터 출력모듈10. Continuous Vocabulary Synthesis Process 12. Data Output Module

14. 표준화 데이터 베이스 16. 복합음성 인식 프로세스14. Standardized Database 16. Complex Speech Recognition Process

본 발명에 따르면 도2와 같이 화자의 음성이 입력되어 아날로그가 디지털로 변환되는 어뎁터를 거쳐 호스트에 스풀링되는 단계 : 정규화 과정을 거쳐 변화폭을 최소화시킨 후 저역통과 필터를 거쳐 끝점 검출 특징 추출을 하는 단계에서 화자와 임의의 화자를 구분하는 단계 : 접속된 개인의 인터페이스를 기초로 하여 개인의 데이터와 상응하는 표준화 인식모듈을 시행하는 디코딩 단계 : 화자의 데이터베이스와 임의 화자의 표준화 데이터로의 구분적 실행 단계 : 음성인식 시스템에서 표준화된 데이터를 적용하여 언어프로세싱 방법으로 음운을 영어단어나 다른 언어로 변환시켜주는 복합음성인식 프로세스와 집단화된 코드북으로 부호화된 데이터를 이산 HMM과 Viterbi 알고리즘을 수행하며 연속되는 어휘의 합성인식을 거치는 연속어휘합성 프로세스를 연속적으로 거치는 단계로 인식된 결과에 따라 후보단어 유사음소를 집합하는 프로세스를 거쳐 최소한 한 개 이상의 데이터를 도출하고 후보어휘를 포함하여 디지털코더로 이동 단말기에 출력하는 것이다.According to the present invention, as shown in Fig. 2, the speaker's voice is input and the analog is converted to digital through the adapter spooled to the host: a step of minimizing the variation through the normalization process and extracting the end point detection feature through a low pass filter. Distinguishing the speaker from the random speaker in the decoding step of implementing a standardized recognition module corresponding to the individual's data based on the connected individual's interface: the step of separately executing the speaker's database and the arbitrary speaker into the standardized data : Speech Recognition System applies standardized data to process multiple vocabularies by using a speech processing method that converts phonology into English words or other languages by using language processing method and discrete HMM and Viterbi algorithms. Continuous Vocabulary Synthesis Process through Synthesis Recognition According to the recognition result to the continuous phase after going through the process of set of the candidate words similar phoneme deriving at least one or more of the data and outputs it to the mobile terminal to a digital coder including candidate words.

새로운 기술에 대한 제품의 혁신에 따르는 R&D에 대하여 현재 상용되고 있는 이동통신기기의 제품 내에 추가되는 부품에 대한 지출이나 추가되는 부품에 의한 새로운 기종의 등장이 필요하지 않고 현재 사용되고 있는 음성인식 시스템의 효용을 증가시켜 제품의 서비스질의 개선과 고객의 확보에 관한 문제를 해결하고자 하는 것으로서 영어 또는 다른 언어의 학습과 외국인과의 의사소통에 필요한 통역 또는 번역을 일상 생활에서 사용할 수 있으며 개인의 제품구매의 추가부담에 대한문제를 풀어내고자 하는 것이다. 마케팅 분야에서의 비교우위로서의 고객유지와 서비스 특화로서의 효용의 극대화를 실현시키며 새로운 컨셉으로서의 글로벌시장 진출을 모색할 수 있다는 것이다.The R & D following the innovation of the product for new technology does not require the expenditure of parts added in the products of the mobile communication devices which are currently commercially available, or the appearance of a new model by the added parts. In order to solve problems related to improving the service quality of the product and securing the customer by increasing the quality of the product, an interpreter or translation necessary for learning English or other languages and communicating with foreigners can be used in daily life. It is trying to solve the problem of burden. In this regard, the company will be able to seek to enter the global market as a new concept while realizing customer retention as a comparative advantage in the field of marketing and maximizing its utility as a specialized service.

Claims (1)

셀룰러폰을 이용하여 직접적으로 문자데이터를 입력하지 않아도 음성인식 시스템을 통해 임의화자에 대해서도 실시간 회화학습과 통역이 가능한 시스템을 구현하는 것이다.It is to implement a system that enables real-time conversational learning and interpretation for random talkers through voice recognition system without directly inputting text data using cellular phone.
KR1020020003743A 2002-01-22 2002-01-22 An unseen interpretation & practice speaking foreign language make use of cellular phone KR20020024045A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
KR1020020003743A KR20020024045A (en) 2002-01-22 2002-01-22 An unseen interpretation & practice speaking foreign language make use of cellular phone

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
KR1020020003743A KR20020024045A (en) 2002-01-22 2002-01-22 An unseen interpretation & practice speaking foreign language make use of cellular phone

Publications (1)

Publication Number Publication Date
KR20020024045A true KR20020024045A (en) 2002-03-29

Family

ID=19718743

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1020020003743A KR20020024045A (en) 2002-01-22 2002-01-22 An unseen interpretation & practice speaking foreign language make use of cellular phone

Country Status (1)

Country Link
KR (1) KR20020024045A (en)

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH08265445A (en) * 1995-03-07 1996-10-11 Siemens Ag Communication device
KR20000036769A (en) * 2000-03-28 2000-07-05 이헌 Foreign language learning system using mobile station(or PC & telephone)
KR20010010772A (en) * 1999-07-22 2001-02-15 윤종용 Method for studying language using voice recognition function in wireless communication terminal
JP2001127846A (en) * 1999-10-29 2001-05-11 Nec Telecom Syst Ltd Radio telephone set
JP2001222294A (en) * 1999-11-24 2001-08-17 Phone.Com Japan Kk Voice recognition based on user interface for radio communication equipment
KR20020033414A (en) * 2001-10-16 2002-05-06 지창진 Apparatus for interpreting and method thereof
KR20020071054A (en) * 2001-03-02 2002-09-12 주식회사 머큐리 System for language translation by using mobile communication network
KR20020094188A (en) * 2000-04-15 2002-12-18 손영기 Wireless communication service system or phone voice translation system and use way
KR20030008336A (en) * 2001-07-20 2003-01-25 최석천 An interpreter using mobile phone

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH08265445A (en) * 1995-03-07 1996-10-11 Siemens Ag Communication device
KR20010010772A (en) * 1999-07-22 2001-02-15 윤종용 Method for studying language using voice recognition function in wireless communication terminal
JP2001127846A (en) * 1999-10-29 2001-05-11 Nec Telecom Syst Ltd Radio telephone set
JP2001222294A (en) * 1999-11-24 2001-08-17 Phone.Com Japan Kk Voice recognition based on user interface for radio communication equipment
KR20000036769A (en) * 2000-03-28 2000-07-05 이헌 Foreign language learning system using mobile station(or PC & telephone)
KR20020094188A (en) * 2000-04-15 2002-12-18 손영기 Wireless communication service system or phone voice translation system and use way
KR20020071054A (en) * 2001-03-02 2002-09-12 주식회사 머큐리 System for language translation by using mobile communication network
KR20030008336A (en) * 2001-07-20 2003-01-25 최석천 An interpreter using mobile phone
KR20020033414A (en) * 2001-10-16 2002-05-06 지창진 Apparatus for interpreting and method thereof

Similar Documents

Publication Publication Date Title
US7089184B2 (en) Speech recognition for recognizing speaker-independent, continuous speech
US9570066B2 (en) Sender-responsive text-to-speech processing
Rudnicky et al. Survey of current speech technology
US20120150538A1 (en) Voice message converter
KR20140121580A (en) Apparatus and method for automatic translation and interpretation
KR20070007882A (en) Voice over short message service
JPH09507105A (en) Distributed speech recognition system
GB2423403A (en) Distributed language processing system and method of outputting an intermediary signal
US8077835B2 (en) Method and system of providing interactive speech recognition based on call routing
US20120221335A1 (en) Method and apparatus for creating voice tag
CN111785258A (en) Personalized voice translation method and device based on speaker characteristics
CN101825953A (en) Chinese character input product with combined voice input and Chinese phonetic alphabet input functions
CN113380222A (en) Speech synthesis method, speech synthesis device, electronic equipment and storage medium
CN101320561A (en) Method and module for improving individual speech recognition rate
CN114187914A (en) Voice recognition method and system
CN114120979A (en) Optimization method, training method, device and medium of voice recognition model
CN111341300A (en) Method, device and equipment for acquiring voice comparison phonemes
KR20020024045A (en) An unseen interpretation &amp; practice speaking foreign language make use of cellular phone
KR101233655B1 (en) Apparatus and method of interpreting an international conference based speech recognition
CN112259093A (en) Intelligent customer service interaction system based on voice recognition
JP2011039468A (en) Word searching device using speech recognition in electronic dictionary, and method of the same
KR100369732B1 (en) Method and Apparatus for intelligent dialog based on voice recognition using expert system
Prukkanon et al. F0 contour approximation model for a one-stream tonal word recognition system
Ananthakrishna et al. Effect of time-domain windowing on isolated speech recognition system performance
JP4445371B2 (en) Recognition vocabulary registration apparatus, speech recognition apparatus and method

Legal Events

Date Code Title Description
A201 Request for examination
E902 Notification of reason for refusal
E601 Decision to refuse application