KR20140065214A

KR20140065214A - Mobile terminal and operation method thereof

Info

Publication number: KR20140065214A
Application number: KR1020120132465A
Authority: KR
Inventors: 송호성; 정우수; 조재민; 김준엽
Original assignee: 엘지전자 주식회사
Priority date: 2012-11-21
Filing date: 2012-11-21
Publication date: 2014-05-29
Also published as: KR101989093B1

Abstract

An operation method of a mobile terminal according to the present invention includes the steps of: driving a camera in response to a first user input; displaying a preview image on a screen of a display unit; activating a voice recognition mode to recognize voice input through a microphone; and, in response to voice input through the microphone, capturing the preview image, tagging the captured image data with the input voice, and storing it in a memory.

Description

MOBILE TERMINAL AND OPERATION METHOD THEREOF

The present invention relates to a mobile terminal and an operation method thereof, and more particularly, to a mobile terminal that performs voice tagging on image or audio data based on voice recognition technology and uses the voice-tagged image or audio data for various purposes, and an operation method thereof.

A mobile terminal is a portable device that has one or more of the following functions: voice and video calling, information input and output, and data storage. As their functions have diversified, mobile terminals have acquired complex functions such as capturing photos and videos, playing music and video files, gaming, receiving broadcasts, wireless Internet access, and sending and receiving messages, and they are now implemented as comprehensive multimedia devices.

In mobile terminals implemented as such multimedia devices, various new approaches are being applied in both hardware and software to implement complex functions. One example is a user interface environment that lets a user search for or select functions easily and conveniently. Another is a user interface environment for performing various operations based on voice recognition technology.

Voice recognition technology lets a person control the terminals used in everyday life, or receive information services from them, without a mouse or keyboard, by using the voice, which is the most natural and convenient human communication tool.

Recently, in the rapidly developing information and communication technology environment, information devices have become smaller and more complex and mobility has become important, so voice recognition technology is in ever greater demand. Accordingly, in step with these changes, there is a need to develop user interfaces that allow various operations to be performed using voice recognition technology.

The present invention proposes a mobile terminal that takes a picture through voice recognition and, at the same time, tags the corresponding image data with a voice name and stores it, and an operation method thereof.

The present invention also proposes a mobile terminal that searches for voice-tagged image data through voice recognition technology, and an operation method thereof.

The present invention also proposes a mobile terminal that determines similarity through image matching between voice-tagged reference image data and other image data, and searches for image data based on the determined similarity, and an operation method thereof.

The present invention also proposes a mobile terminal that, during voice recording, detects predetermined words through a voice recognition engine, tags the audio data being recorded with the detected words, and stores it, and an operation method thereof.

The present invention also proposes a mobile terminal that, when playing an audio file, detects a plurality of words tagged to the audio data and displays them on the playback screen, and changes the playback position of the audio file in response to selection of any one of the displayed words, and an operation method thereof.

The present invention provides an operation method of a mobile terminal, comprising: driving a camera in response to a first user input; displaying a preview image on a screen of a display unit; activating a voice recognition mode to recognize voice input through a microphone; and, in response to voice input through the microphone, capturing the preview image, tagging the captured image data with the input voice, and storing it in a memory.
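The claimed sequence of steps can be illustrated with a minimal Python sketch. Every name below (`VoiceTaggingCamera`, `TaggedImage`, `grab_preview`, `recognize`) is hypothetical and chosen only for illustration; the camera, voice recognizer, and memory are replaced by simple stand-ins, so this is a sketch of the control flow under those assumptions, not the terminal's actual implementation.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional


@dataclass
class TaggedImage:
    pixels: bytes    # captured image data
    voice_tag: str   # recognized voice name tagged to the image


@dataclass
class VoiceTaggingCamera:
    # Hypothetical stand-ins for the camera and the voice recognizer.
    grab_preview: Callable[[], bytes]
    recognize: Callable[[bytes], Optional[str]]
    memory: List[TaggedImage] = field(default_factory=list)
    camera_on: bool = False
    voice_mode: bool = False

    def on_first_user_input(self) -> bytes:
        """Drive the camera, produce a preview frame, activate voice recognition."""
        self.camera_on = True
        preview = self.grab_preview()
        self.voice_mode = True
        return preview

    def on_microphone_audio(self, audio: bytes) -> Optional[TaggedImage]:
        """Capture the preview and tag it when a voice input is recognized."""
        if not (self.camera_on and self.voice_mode):
            return None  # voice capture is only active after the first user input
        name = self.recognize(audio)
        if name is None:
            return None  # nothing recognized, so nothing is captured
        shot = TaggedImage(pixels=self.grab_preview(), voice_tag=name)
        self.memory.append(shot)  # store the voice-tagged image data
        return shot
```

In this sketch the tag is the recognized text; a real terminal might instead, or additionally, store the raw audio with the image.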

According to an embodiment of the present invention, the mobile terminal captures an image through voice recognition and, at the same time, tags the captured image data with the recognized name and stores it in a memory, so that the voice-tagged image data stored in the memory can be searched through voice recognition technology.

According to another embodiment of the present invention, the mobile terminal determines similarity through image matching between voice-tagged reference image data and other image data, so that image data related to the currently input voice name can be searched based on the determined similarity information.
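One way to picture similarity-based search is the Python sketch below. The text here does not specify the image matching algorithm, so the sketch assumes each image has already been reduced to a numeric feature vector and uses cosine similarity with a hypothetical threshold of 0.9; the function names and data layout are likewise illustrative assumptions.

```python
import math
from typing import Dict, List, Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two feature vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def build_similarity_index(
    reference_tags: Dict[str, str],        # voice name -> id of the voice-tagged reference image
    features: Dict[str, Sequence[float]],  # image id -> feature vector (assumed precomputed)
    threshold: float = 0.9,                # hypothetical similarity cut-off
) -> Dict[str, List[str]]:
    """Store, per voice name, the images judged similar to its reference image."""
    index: Dict[str, List[str]] = {}
    for name, ref_id in reference_tags.items():
        scored = [
            (cosine_similarity(features[ref_id], vec), image_id)
            for image_id, vec in features.items()
        ]
        matches = [(s, i) for s, i in scored if s >= threshold]
        # Most similar first; ties broken by image id for a stable order.
        index[name] = [i for _, i in sorted(matches, key=lambda p: (-p[0], p[1]))]
    return index


def search_by_voice_name(index: Dict[str, List[str]], spoken_name: str) -> List[str]:
    """Return the image ids associated with the recognized voice name."""
    return index.get(spoken_name, [])
```

Precomputing the index means a voice query only needs a dictionary lookup at search time, which matches the idea of storing similarity information in advance.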

According to still another embodiment of the present invention, when playing an audio file, the mobile terminal detects words tagged to the audio data and displays them on the playback screen, so that the playback position of the audio file can be moved in response to selection of any one of the displayed words.
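The word-based seek behavior can be sketched as follows. The sketch assumes that each tagged word is stored together with the time offset at which it was detected; that pairing, and the names `WordTag` and `TaggedAudioPlayer`, are assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass(frozen=True)
class WordTag:
    word: str          # word detected during recording
    position_s: float  # assumed offset of the word in the audio, in seconds


class TaggedAudioPlayer:
    def __init__(self, duration_s: float, tags: Iterable[WordTag]) -> None:
        self.duration_s = duration_s
        self.tags = sorted(tags, key=lambda t: t.position_s)
        self.position_s = 0.0

    def displayed_words(self) -> List[str]:
        """Words shown on the playback screen, in playback order."""
        return [t.word for t in self.tags]

    def select_word(self, index: int) -> float:
        """Move the playback position to the selected word's offset."""
        target = self.tags[index].position_s
        # Clamp so a bad tag can never seek outside the file.
        self.position_s = min(max(target, 0.0), self.duration_s)
        return self.position_s
```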

FIG. 1 is a block diagram of a mobile terminal according to an embodiment of the present invention;
FIG. 2 is a front perspective view of a mobile terminal according to an embodiment of the present invention;
FIG. 3 is a rear perspective view of the mobile terminal shown in FIG. 2;
FIGS. 4 and 5 are diagrams illustrating an operation of capturing an image through voice recognition and simultaneously tagging the image data with a voice name in a mobile terminal according to a first embodiment of the present invention;
FIG. 6 is a diagram showing an example of general image data and voice-tagged image data stored in an image gallery according to the first embodiment of the present invention;
FIGS. 7 and 8 are diagrams illustrating an operation of searching for voice-tagged image data through voice recognition technology in the mobile terminal according to the first embodiment of the present invention;
FIG. 9 is a diagram illustrating an operation of detecting voice-tagged image data through voice recognition technology and additionally providing other content related to the voice name in a mobile terminal according to an embodiment of the present invention;
FIGS. 10 to 12 are diagrams illustrating an operation of determining similarity through image matching between voice-tagged reference image data and other image data and storing the determined similarity information in a mobile terminal according to a second embodiment of the present invention;
FIGS. 13 and 14 are diagrams illustrating an operation of searching for image data related to an input voice through voice recognition technology in the mobile terminal according to the second embodiment of the present invention;
FIG. 15 is a diagram illustrating an operation of performing image matching on video data based on voice-tagged reference image data and indicating at which positions in the video data image data identical and/or similar to the reference image data is present, in a mobile terminal according to another embodiment of the present invention;
FIG. 16 is a diagram illustrating an operation of automatically recommending voice-tagged image data as the image data of the call counterpart to be stored in the address book list when a call is requested through voice recognition technology, in a mobile terminal according to still another embodiment of the present invention;
FIGS. 17 and 18 are diagrams illustrating an operation of detecting key words through a voice recognition engine during voice recording, tagging the audio data with the detected key words, and storing it, in a mobile terminal according to a third embodiment of the present invention;
FIGS. 19 and 20 are diagrams illustrating an operation of displaying words tagged to audio data on the playback screen when playing an audio file and changing the playback position of the audio file in response to selection of any one of the displayed words, in the mobile terminal according to the third embodiment of the present invention.

Hereinafter, the present invention will be described in more detail with reference to the drawings.

The mobile terminals described in this specification include mobile phones, smart phones, notebook computers, digital broadcasting terminals, PDAs (Personal Digital Assistants), PMPs (Portable Multimedia Players), cameras, navigation devices, tablet computers, e-book terminals, and the like. The suffixes "module" and "unit" used for components in the following description are given merely for ease of drafting this specification and do not in themselves carry any particularly important meaning or role. Accordingly, "module" and "unit" may be used interchangeably.

FIG. 1 is a block diagram of a mobile terminal according to an embodiment of the present invention. With reference to FIG. 1, the mobile terminal according to an embodiment of the present invention is described below in terms of its functional components.

Referring to FIG. 1, the mobile terminal 100 may include a wireless communication unit 110, an A/V (Audio/Video) input unit 120, a user input unit 130, a sensing unit 140, an output unit 150, a memory 160, an interface unit 170, a control unit 180, and a power supply unit 190. When implemented in an actual application, two or more of these components may be combined into one component, or one component may be subdivided into two or more components, as needed.

The wireless communication unit 110 may include a broadcast receiving module 111, a mobile communication module 113, a wireless Internet module 115, a short-range communication module 117, a GPS module 119, and the like.

The broadcast receiving module 111 receives at least one of a broadcast signal and broadcast-related information from an external broadcast management server through a broadcast channel. The broadcast channel may include a satellite channel, a terrestrial channel, and the like. The broadcast management server may be a server that generates and transmits at least one of a broadcast signal and broadcast-related information, or a server that receives at least one of a previously generated broadcast signal and broadcast-related information and transmits it to the terminal.

The broadcast signal may include a TV broadcast signal, a radio broadcast signal, and a data broadcast signal, as well as a broadcast signal in which a data broadcast signal is combined with a TV broadcast signal or a radio broadcast signal. The broadcast-related information may be information about a broadcast channel, a broadcast program, or a broadcast service provider. The broadcast-related information may also be provided through a mobile communication network, in which case it may be received by the mobile communication module 113. The broadcast-related information may exist in various forms.

The broadcast receiving module 111 receives broadcast signals using various broadcasting systems. In particular, it can receive digital broadcast signals using digital broadcasting systems such as DMB-T (Digital Multimedia Broadcasting-Terrestrial), DMB-S (Digital Multimedia Broadcasting-Satellite), MediaFLO (Media Forward Link Only), DVB-H (Digital Video Broadcasting-Handheld), and ISDB-T (Integrated Services Digital Broadcasting-Terrestrial). The broadcast receiving module 111 may also be configured to suit any broadcasting system that provides broadcast signals, not only these digital broadcasting systems. Broadcast signals and/or broadcast-related information received through the broadcast receiving module 111 may be stored in the memory 160.

The mobile communication module 113 transmits and receives radio signals to and from at least one of a base station, an external terminal, and a server on a mobile communication network. Here, the radio signals may include a voice call signal, a video call signal, or various forms of data associated with sending and receiving text/multimedia messages.

The wireless Internet module 115 is a module for wireless Internet access and may be built into the mobile terminal 100 or attached externally. Wireless Internet technologies that may be used include WLAN (Wireless LAN) (Wi-Fi), WiBro (Wireless Broadband), WiMAX (Worldwide Interoperability for Microwave Access), and HSDPA (High Speed Downlink Packet Access).

The short-range communication module 117 is a module for short-range communication. Short-range communication technologies that may be used include Bluetooth, RFID (Radio Frequency Identification), infrared communication (IrDA, Infrared Data Association), UWB (Ultra Wideband), ZigBee, and NFC (Near Field Communication).

The GPS (Global Positioning System) module 119 receives position information from a plurality of GPS satellites.

The A/V (Audio/Video) input unit 120 is for inputting an audio signal or a video signal and may include a camera 121, a microphone 123, and the like. The camera 121 processes image frames, such as still images or video, obtained by an image sensor in video call mode or photographing mode. The processed image frames may be displayed on the display unit 151.

Image frames processed by the camera 121 may be stored in the memory 160 or transmitted externally through the wireless communication unit 110. Two or more cameras 121 may be provided depending on the configuration of the terminal.

The microphone 123 receives an external sound signal in call mode, recording mode, voice recognition mode, and the like, and processes it into electrical voice data. In call mode, the processed voice data may be converted into a form transmittable to a mobile communication base station through the mobile communication module 113 and output. Various noise reduction algorithms may be used in the microphone 123 to remove noise generated while receiving the external sound signal.

The user input unit 130 generates key input data that the user enters to control the operation of the terminal. The user input unit 130 may consist of a key pad, a dome switch, a touch pad (resistive/capacitive), and the like, which can receive commands or information through a user's press or touch. It may also consist of a jog wheel or jog dial that rotates a key, a joystick-style control, a finger mouse, and the like. In particular, when the touch pad forms a mutual layer structure with the display unit 151 described later, it may be called a touch screen.

The sensing unit 140 senses the current state of the mobile terminal 100, such as its open/closed state, its position, and whether the user is touching it, and generates a sensing signal for controlling the operation of the mobile terminal 100. For example, when the mobile terminal 100 is a slide phone, the sensing unit can sense whether the slide phone is open or closed. It may also handle sensing functions related to whether the power supply unit 190 is supplying power, whether the interface unit 170 is connected to an external device, and the like.

The sensing unit 140 may include a proximity sensor 141, a pressure sensor 143, a motion sensor 145, and the like. The proximity sensor 141 detects, without mechanical contact, an object approaching the mobile terminal 100 or the presence of an object near the mobile terminal 100. The proximity sensor 141 can detect a nearby object using a change in an alternating magnetic field or a static magnetic field, or using the rate of change of capacitance. Two or more proximity sensors 141 may be provided depending on the configuration.

The pressure sensor 143 can detect whether pressure is applied to the mobile terminal 100 and the magnitude of that pressure. The pressure sensor 143 may be installed at whatever part of the mobile terminal 100 requires pressure detection, depending on the usage environment. If the pressure sensor 143 is installed in the display unit 151, the signal output from the pressure sensor 143 makes it possible to distinguish a touch input through the display unit 151 from a pressure touch input, in which greater pressure is applied than in a touch input. The signal output from the pressure sensor 143 also indicates the magnitude of the pressure applied to the display unit 151 during a pressure touch input.
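The distinction between a touch input and a pressure touch input can be sketched as a simple threshold test on the sensor reading. The threshold values, units, and names below are hypothetical; the text only states that a pressure touch involves greater pressure than an ordinary touch.

```python
from enum import Enum


class TouchKind(Enum):
    NONE = "none"
    TOUCH = "touch"
    PRESSURE_TOUCH = "pressure_touch"


def classify_touch(
    pressure: float,
    touch_threshold: float = 0.1,     # hypothetical minimum reading for any touch
    pressure_threshold: float = 1.0,  # hypothetical reading separating the two input types
) -> TouchKind:
    """Classify a pressure sensor reading (arbitrary units) as an input type."""
    if pressure < touch_threshold:
        return TouchKind.NONE
    if pressure < pressure_threshold:
        return TouchKind.TOUCH
    return TouchKind.PRESSURE_TOUCH
```

A practical implementation would probably add hysteresis or debouncing so that a reading hovering near a threshold does not flip between input types.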

The motion sensor 145 senses the position or movement of the mobile terminal 100 using an acceleration sensor, a gyro sensor, and the like. An acceleration sensor that may be used in the motion sensor 145 is an element that converts a change in acceleration in one direction into an electrical signal, and it has come into wide use with the development of MEMS (micro-electromechanical systems) technology.

Acceleration sensors range from those that measure large accelerations, such as the sensors built into automotive airbag systems to detect collisions, to those that measure minute accelerations, such as the sensors that recognize fine movements of the human hand for use as an input means in games and the like. An acceleration sensor is usually built by mounting two or three axes in a single package, although depending on the usage environment only the Z axis may be needed. Therefore, if for some reason an X-axis or Y-axis acceleration sensor must be used instead of a Z-axis one, the acceleration sensor may be mounted upright on the main board using a separate sub-board.

The gyro sensor measures angular velocity and can detect the direction rotated relative to a reference direction.
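Since a gyro sensor reports angular velocity, the rotated direction relative to a reference is obtained by integrating the readings over time. The sketch below assumes single-axis readings in degrees per second sampled at a fixed interval; the function name and parameters are illustrative, and real devices also have to correct for sensor drift, which is omitted here.

```python
from typing import Iterable


def integrate_heading(
    angular_velocities_dps: Iterable[float],  # gyro samples, degrees per second
    dt_s: float,                              # fixed sampling interval in seconds
    initial_heading_deg: float = 0.0,         # reference direction
) -> float:
    """Accumulate angular velocity into a heading in the range [0, 360)."""
    heading = initial_heading_deg
    for omega in angular_velocities_dps:
        heading = (heading + omega * dt_s) % 360.0
    return heading
```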

The output unit 150 is for outputting an audio signal, a video signal, or an alarm signal. The output unit 150 may include a display unit 151, a sound output module 153, an alarm unit 155, a haptic module 157, and the like.

The display unit 151 displays information processed by the mobile terminal 100. For example, when the mobile terminal 100 is in call mode, it displays a UI (User Interface) or GUI (Graphical User Interface) related to the call. When the mobile terminal 100 is in video call mode or photographing mode, it can display captured or received images separately or simultaneously, and it displays the UI or GUI.

Meanwhile, as described above, when the display unit 151 and the touch pad form a mutual layer structure to constitute a touch screen, the display unit 151 can be used not only as an output device but also as an input device that accepts information through the user's touch.

If the display unit 151 is configured as a touch screen, it may include a touch screen panel, a touch screen panel controller, and the like. In this case, the touch screen panel is a transparent panel attached to the exterior and may be connected to the internal bus of the mobile terminal 100. The touch screen panel monitors for contact and, when a touch input occurs, sends the corresponding signals to the touch screen panel controller. The touch screen panel controller processes those signals and then transmits the corresponding data to the control unit 180, so that the control unit 180 can determine whether a touch input occurred and which area of the touch screen was touched.

The display unit 151 may also be composed of electronic paper (e-paper). E-paper is a kind of reflective display that, like conventional paper and ink, has excellent visual characteristics: high resolution, a wide viewing angle, and a bright white background. E-paper can be implemented on any substrate, such as plastic, metal, or paper. It retains the image even after power is cut off, and because it needs no backlight power, the battery life of the mobile terminal 100 can be extended. E-paper may use hemispherical twist balls charged with electrostatic charge, electrophoresis, microcapsules, and the like.

In addition, the display unit 151 may include at least one of a liquid crystal display, a thin film transistor-liquid crystal display, an organic light-emitting diode display, a flexible display, and a 3D display. Two or more display units 151 may be present depending on the implementation of the mobile terminal 100. For example, the mobile terminal 100 may be provided with both an external display unit (not shown) and an internal display unit (not shown).

The sound output module 153 outputs audio data received from the wireless communication unit 110 or stored in the memory 160 during call signal reception, in call mode or recording mode, in voice recognition mode, in broadcast reception mode, and the like. The sound output module 153 also outputs sound signals related to functions performed by the mobile terminal 100, such as a call signal reception tone or a message reception tone. The sound output module 153 may include a speaker, a buzzer, and the like.

The alarm unit 155 outputs a signal to announce that an event has occurred in the mobile terminal 100. Examples of events occurring in the mobile terminal 100 include call signal reception, message reception, and key signal input. The alarm unit 155 outputs the event notification signal in a form other than an audio or video signal, for example as vibration. The alarm unit 155 can output a signal to announce that a call signal or a message has been received. When a key signal is input, the alarm unit 155 can output a signal as feedback for the key signal input. The user can recognize that an event has occurred through the signal output by the alarm unit 155. The event notification signal of the mobile terminal 100 may also be output through the display unit 151 or the sound output module 153.

The haptic module 157 generates various tactile effects that the user can feel. A representative example of a tactile effect generated by the haptic module 157 is vibration. When the haptic module 157 generates vibration as a tactile effect, the intensity and pattern of the vibration can be varied, and different vibrations may be combined and output together or output sequentially.

In addition to vibration, the haptic module 157 can generate various tactile effects, such as an effect of stimulation by an array of pins moving vertically against the contacted skin surface, an effect of stimulation by the jetting or suction force of air through a jet orifice or a suction port, an effect of stimulation by brushing against the skin surface, an effect of stimulation through contact with an electrode, an effect of stimulation using electrostatic force, and an effect of reproducing a sense of cold or warmth using an element capable of absorbing or generating heat. The haptic module 157 can not only deliver a tactile effect through direct contact but can also be implemented so that the user feels the tactile effect through the kinesthetic sense of a finger or an arm. Two or more haptic modules 157 may be provided depending on the configuration of the mobile terminal 100.

The memory 160 may store programs for the processing and control performed by the controller 180, and may also temporarily store input or output data (for example, a phone book, messages, still images, and moving images).

The memory 160 may include at least one type of storage medium among a flash memory type, a hard disk type, a multimedia card micro type, a card type memory (for example, SD or XD memory), RAM, and ROM. The mobile terminal 100 may also operate web storage that performs the storage function of the memory 160 over the Internet.

The interface unit 170 serves as an interface with all external devices connected to the mobile terminal 100. Examples of external devices connected to the mobile terminal 100 include a wired/wireless headset, an external charger, a wired/wireless data port, card sockets for a memory card, a SIM (Subscriber Identification Module) card, a UIM (User Identity Module) card, and the like, an audio I/O (input/output) terminal, a video I/O terminal, and an earphone. The interface unit 170 may receive data or power from such an external device and deliver it to each component inside the mobile terminal 100, and may allow data inside the mobile terminal 100 to be transmitted to the external device.

When the mobile terminal 100 is connected to an external cradle, the interface unit 170 may serve as a path through which power from the cradle is supplied to the mobile terminal 100, or as a path through which various command signals input by the user at the cradle are delivered to the mobile terminal 100.

The controller 180 typically controls the operation of each of the above units and thereby the overall operation of the mobile terminal 100. For example, it performs the control and processing related to voice calls, data communication, video calls, and the like. The controller 180 may also include a multimedia playback module 181 for multimedia playback. The multimedia playback module 181 may be implemented as hardware within the controller 180 or as software separate from the controller 180.

The voice recognition unit 182 runs a voice recognition engine, to which a voice recognition algorithm is applied, to recognize the user's voice input through the microphone 123.

That is, the voice recognition unit 182 converts the user's voice input through the microphone 123 into digital data, applies pre-emphasis to the digital data, and then detects the start point and the end point of the digitized voice.
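As a rough illustration of the front-end steps just described, the following Python sketch applies a first-order pre-emphasis filter and then locates the start and end points of speech with a short-time energy threshold. The description names neither the filter nor the endpoint detector, so the coefficient `alpha`, the frame length, the threshold, and both function names are assumptions made only for this example.

```python
from typing import List, Optional, Tuple


def pre_emphasis(samples: List[float], alpha: float = 0.97) -> List[float]:
    """First-order pre-emphasis: y[n] = x[n] - alpha * x[n-1] (alpha is assumed)."""
    if not samples:
        return []
    out = [samples[0]]
    for n in range(1, len(samples)):
        out.append(samples[n] - alpha * samples[n - 1])
    return out


def detect_endpoints(
    samples: List[float], frame_len: int = 4, threshold: float = 0.1
) -> Optional[Tuple[int, int]]:
    """Return (start, end) sample indices spanning every frame whose mean
    energy exceeds the threshold, or None if no frame does."""
    active_starts = []
    for i in range(0, len(samples), frame_len):
        frame = samples[i:i + frame_len]
        energy = sum(s * s for s in frame) / len(frame)
        if energy > threshold:
            active_starts.append(i)
    if not active_starts:
        return None
    end = min(active_starts[-1] + frame_len, len(samples))
    return active_starts[0], end
```

In this sketch the detected span would then be handed to the feature extraction stage; a production recognizer would normally use longer frames and an adaptive threshold.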

Next, the voice recognition unit 182 extracts voice feature values from the voice between the detected start point and end point, and notifies the controller 180 of the operation of the mobile terminal 100 that matches the extracted voice feature values in the voice recognition database provided in the memory 160. The voice feature values may include at least one of the waveform, the format, and the pitch of the voice input from the microphone 123.

For example, if the operation matched in the voice recognition database to the voice feature values extracted by the voice recognition unit 182 is "phone book", the controller 180 executes the "phone book" menu function and displays the "phone book" menu on the screen of the display unit 151. Likewise, if the matched operation is "text", the controller 180 displays "text" on the screen of the display unit 151.
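The matching and dispatch behavior described above can be sketched as follows. This is only an illustration: the description does not say how feature values are compared, so the nearest-neighbor match on Euclidean distance, the `max_distance` cutoff, the two-element feature vectors, and the names `match_operation` and `dispatch` are all assumptions.

```python
import math
from typing import Callable, Dict, List, Optional, Sequence, Tuple

# Hypothetical voice recognition database: (reference features, operation name).
RecognitionDB = List[Tuple[Sequence[float], str]]


def match_operation(
    features: Sequence[float], database: RecognitionDB, max_distance: float = 1.0
) -> Optional[str]:
    """Return the operation whose reference features are nearest to the
    extracted features, or None if nothing is within max_distance."""
    best_name, best_dist = None, float("inf")
    for reference, name in database:
        dist = math.dist(features, reference)
        if dist < best_dist:
            best_name, best_dist = name, dist
    return best_name if best_dist <= max_distance else None


def dispatch(operation: Optional[str], handlers: Dict[str, Callable[[], str]]) -> Optional[str]:
    """Run the handler registered for the matched operation, if any."""
    handler = handlers.get(operation) if operation else None
    return handler() if handler else None
```

A caller would register one handler per operation, for example a handler that opens the "phone book" menu, and pass the result of `match_operation` to `dispatch`.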

The voice recognition unit 182 described above may be provided in the mobile terminal 100 as a single "module" or "unit", or may be provided in the memory 160 as software. The voice recognition unit 182 may also be included in the controller 180, in which case the controller 180 can itself perform the operations of the voice recognition unit 182.

Meanwhile, the power supply unit 190 receives external power and internal power under the control of the controller 180 and supplies the power required for the operation of each component.

The mobile terminal 100 configured in this way may be configured to operate in communication systems capable of transmitting data through frames or packets, including wired and wireless communication systems and satellite-based communication systems.

FIG. 2 is a front perspective view of a mobile terminal according to an embodiment of the present invention, and FIG. 3 is a rear perspective view of the mobile terminal shown in FIG. 2. The mobile terminal related to the present invention is described below with reference to FIGS. 2 and 3 in terms of its external components. For convenience of description, a bar type mobile terminal with a front touch screen is taken as an example from among various types of mobile terminals such as the folder type, bar type, swing type, and slider type. However, the present invention is not limited to bar type mobile terminals and can be applied to all types of mobile terminals, including those mentioned above.

Referring to FIG. 2, the case forming the exterior of the mobile terminal 100 is made up of a front case 100-1 and a rear case 100-2. Various electronic components are housed in the space formed by the front case 100-1 and the rear case 100-2.

The display unit 151, a first sound output module 153a, a first camera 121a, and first to third user input units 130a, 130b, and 130c may be disposed on the main body, specifically on the front case 100-1. A fourth user input unit 130d, a fifth user input unit 130e, and the microphone 123 may be disposed on a side surface of the rear case 100-2.

The display unit 151 may be configured with a touch pad overlaid on it in a layered structure, so that the display unit 151 operates as a touch screen and information can be input by the user's touch.

The first sound output module 153a may be implemented as a receiver or a speaker. The first camera 121a may be implemented in a form suitable for capturing still images or moving images of the user and the like. The microphone 123 may be implemented in a form suitable for receiving the user's voice and other sounds.

The first to fifth user input units 130a, 130b, 130c, 130d, and 130e, together with the sixth and seventh user input units 130f and 130g described later, may be collectively referred to as the user input unit 130. Any method may be adopted for them as long as it is a tactile manner, that is, one in which the user operates the unit while receiving tactile feedback.

For example, the user input unit 130 may be implemented as a dome switch or a touch pad that receives commands or information through the user's push or touch operation, or as a wheel or jog dial that rotates a key, a joystick-like control, or the like.

In functional terms, the first to third user input units 130a, 130b, and 130c are used to input commands such as start, end, scroll, menu key, home key, and back key, and the fourth user input unit 130d is used to input a selection of the operation mode and the like. The fifth user input unit 130e may operate as a hot key for activating a special function in the mobile terminal 100.

Referring to FIG. 3, a second camera 121b may additionally be mounted on the rear surface of the rear case 100-2, and the sixth and seventh user input units 130f and 130g and the interface unit 170 may be disposed on a side surface of the rear case 100-2.

The second camera 121b has a photographing direction substantially opposite to that of the first camera 121a and may have a different number of pixels from the first camera 121a. A flash (not shown) and a mirror (not shown) may additionally be disposed adjacent to the second camera 121b. Another camera may also be installed adjacent to the second camera 121b and used to capture three-dimensional stereoscopic images.

The flash shines light toward a subject when the subject is photographed with the second camera 121b. The mirror lets the user see his or her own face when taking a self-portrait with the second camera 121b.

A second sound output module (not shown) may additionally be disposed on the rear case 100-2. The second sound output module can implement a stereo function together with the first sound output module 153a and may also be used for calls in speakerphone mode.

The interface unit 170 may be used as a path for exchanging data with external devices. In addition to an antenna for calls and the like, an antenna for receiving broadcast signals (not shown) may be disposed in one area of the front case 100-1 and the rear case 100-2. The antenna may be installed so that it can be pulled out of the rear case 100-2.

The power supply unit 190 for supplying power to the mobile terminal 100 may be mounted on the rear case 100-2 side. The power supply unit 190 is, for example, a rechargeable battery and may be detachably coupled to the rear case 100-2 for charging and the like.

Although the second camera 121b and other components are described in this embodiment as being disposed on the rear case 100-2, the arrangement is not limited to this. Even if the second camera 121b is not separately provided, the first camera 121a may be formed to be rotatable so that it can also shoot in the photographing direction of the second camera 121b.

The configuration of the mobile terminal 100 according to the present invention has been described above with reference to FIGS. 1 to 3. A mobile terminal that stores and searches voice-tagged image data using voice recognition technology according to a first embodiment of the present invention, and its operation method, are described in detail below.

FIGS. 4 and 5 are diagrams for explaining an operation, in the mobile terminal according to the first embodiment of the present invention, of taking a picture through voice recognition and at the same time tagging the resulting image data with a voice name.

Referring to FIGS. 4 and 5, when the user selects the camera menu, the controller 180 activates the camera 121 (S410) and displays a preview image 510 input through the lens of the camera 121 on the display unit 151 (S420).

When the preview image 510 is displayed, the controller 180 activates the voice recognition mode (S430). That is, the controller 180 activates the microphone 123 and the voice recognition unit 182 in order to receive from the user the voice name of the image data being captured.

If voice recognition were always enabled in the photographing mode of the mobile terminal 100, the power consumption of the mobile terminal 100 would increase, and ambient noise during shooting would lower the voice recognition rate. Therefore, in another embodiment of the present invention, the voice recognition mode is activated only when the user selects a voice recognition icon 530 such as the one shown in FIG. 5(a), which prevents unnecessary power consumption and raises the voice recognition rate.

The controller 180 checks whether a voice name 520 for the image data being captured is input through the microphone 123 (S440).

If the check shows that no voice name 520 has been input through the microphone 123, the controller 180 keeps displaying the preview image 510 on the display unit 151. If the voice name 520 is input through the microphone 123, the controller 180 controls the voice recognition unit 182 to recognize the voice name 520 (S450).

Once the voice name 520 is recognized, the controller 180 captures the subject image formed on the lens of the camera 121 (S460). Here, the voice data (that is, the voice name) input through the microphone 123 acts as the key signal for taking the picture. The photographing operation therefore starts only when the sound input through the microphone 123 is recognized as a human voice, and does not start for mechanical sounds or other noise.

At the same time, the controller 180 tags the captured image data with the voice name 520 (voice tagging) and then stores it in the memory 160 (S470, S480).

For example, as shown in FIG. 5(a), when the voice name 520 "Dabotap" (다보탑) is input through the microphone 123 while the preview image 510 is displayed on the display unit 151, the controller 180 captures the subject image 510 currently displayed on the screen and at the same time tags the captured image data with the voice name 520 and stores it in the memory 160.

In another embodiment of the present invention, before storing the voice-tagged image data in the memory, the controller 180 may provide a pop-up window asking the user to confirm whether to store it. For example, as shown in FIG. 5(b), the controller 180 displays a pop-up window 540 asking the user whether to store the voice-tagged image data. When the user selects the "OK" button displayed in the pop-up window 540, the controller 180 stores the voice-tagged image data in the memory 160.
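The capture flow of steps S440 to S480, including the optional confirmation pop-up, can be sketched in Python as follows. The classes `AudioInput` and `TaggedImage`, the function `capture_with_voice_tag`, the list standing in for the memory 160, and the `confirm` callback standing in for the pop-up window 540 are hypothetical names chosen for this illustration and do not appear in the description.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class AudioInput:
    is_speech: bool   # True only if the sound was recognized as a human voice
    voice_name: str   # recognized voice name, e.g. "Dabotap"


@dataclass
class TaggedImage:
    image: bytes      # captured image data
    voice_name: str   # voice name tagged onto the image


def capture_with_voice_tag(
    preview_frame: bytes,
    audio: Optional[AudioInput],
    storage: List[TaggedImage],
    confirm: Callable[[TaggedImage], bool] = lambda _record: True,
) -> Optional[TaggedImage]:
    """Capture and tag the current preview frame when a voice name is heard."""
    # S440: no input, or a non-voice sound, leaves the preview running.
    if audio is None or not audio.is_speech:
        return None
    # S460 and S470: the voice input acts as the shutter key and as the tag.
    record = TaggedImage(image=preview_frame, voice_name=audio.voice_name)
    # Optional confirmation step (the pop-up embodiment); skipped by default.
    if not confirm(record):
        return None
    # S480: store the voice-tagged image data.
    storage.append(record)
    return record
```

Passing `confirm=lambda record: False` models the user declining the pop-up, in which case nothing is stored.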

FIG. 6 shows an example of general image data and voice-tagged image data stored in an image gallery according to the first embodiment of the present invention.

When the user selects the image list view menu, the controller 180 displays a list screen 610 such as the one shown in FIG. 6(a) on the display unit 151. The list screen 610 includes general image data items 620 and voice-tagged image data items 630.

A general image data item 620 consists only of an image file corresponding to the image data, whereas a voice-tagged image data item 630 consists of an image file corresponding to the image data and an audio file corresponding to the voice name of that image data. The audio file 635 may be displayed as a predetermined identifier to indicate that the image data is tagged with a voice name.

When the displayed identifier 635 is selected, the controller 180 may, as shown in FIG. 6(b), control the sound output module 153 to output "Dabotap", the voice name of the corresponding image data.

FIGS. 7 and 8 are diagrams for explaining an operation of searching for voice-tagged image data through voice recognition technology in the mobile terminal according to the first embodiment of the present invention.

Referring to FIGS. 7 and 8, when a user input for searching image data is received through the user input unit 130 or the touch screen 151, the controller 180 enters the image data search mode (S710). That is, the controller 180 displays an image data search screen 810 such as the one shown in FIG. 8(a) on the display unit 151.

When a voice recognition icon 820 is selected while the search screen 810 is displayed, the controller 180 activates the voice recognition mode (S720). That is, the controller 180 activates the microphone 123 and the voice recognition unit 182 in order to receive from the user the voice name of the image data to be searched for. At the same time, the controller 180 displays on the display unit 151 a pop-up window 840 containing the guidance message "Say the image data to search for!".

The controller 180 checks whether a voice name 850 of the image data to be searched for is input through the microphone 123 (S730).

If the check shows that no voice name 850 has been input through the microphone 123, the controller 180 keeps displaying the pop-up window 840 on the display unit 151. If the voice name 850 is input through the microphone 123, the controller 180 controls the voice recognition unit 182 to recognize the voice name 850 (S740).

Once the voice name 850 is recognized, the controller 180 detects in the memory 160 the image data tagged with the input voice, or image data tagged with a voice that contains the input voice (S750), and displays the detected voice-tagged image data on the display unit 151 (S760).

For example, as shown in FIG. 8(b) and (c), when the user, with the voice mode activated, says "Dabotap" (750), the voice name of the image data to be found, into the microphone 123, the controller 180 detects the image data 850 tagged with the voice name "Dabotap" and displays it on the screen.
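The detection step S750, which returns image data whose tag equals the spoken name or contains it, can be sketched as a simple lookup. This Python example assumes that the recognized voice names are available as text and compares them case-insensitively; the description does not state how tags are compared, and the function name `search_by_voice_name` and the dictionary layout are hypothetical.

```python
from typing import Dict, List


def search_by_voice_name(tags: Dict[str, str], query: str) -> List[str]:
    """Return image file names whose voice tag equals or contains the query.

    `tags` maps an image file name to its recognized voice name.
    Exact matches are listed before partial matches.
    """
    q = query.strip().lower()
    if not q:
        return []
    exact = [name for name, tag in tags.items() if tag.lower() == q]
    partial = [
        name for name, tag in tags.items()
        if q in tag.lower() and tag.lower() != q
    ]
    return exact + partial
```

Listing exact matches first is a design choice of this sketch, not something the description requires.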

In another embodiment of the present invention, the controller 180 may additionally provide other content related to the voice name together with the voice-tagged image data.

For example, as shown in FIG. 9, when the user, with the voice mode activated, says "Dabotap" 910, the voice name of the image data to be found, into the microphone 123, the controller 180 detects the image data 920 tagged with the voice name "Dabotap" together with a notepad 930 and a voice memo 940 whose contents relate to "Dabotap", and displays them on the screen. The notepad 930 and the voice memo 940 may be displayed in the form of a widget, an icon, a thumbnail image, or the like.

The controller 180 may also connect to a predetermined website, enter the word "Dabotap" as a search term, and additionally display the initial result screen 950 of that search on the display unit 151. It will be apparent to those skilled in the art that, besides the notepad, voice memo, and web page described above, various other content may additionally be displayed according to user settings.

As described above, the first embodiment of the present invention captures an image through voice recognition, tags the captured image data with the recognized voice name and stores it in the memory, and allows the voice-tagged image data stored in the memory to be searched through voice recognition technology.

FIGS. 10 to 12 are diagrams for explaining an operation, in a mobile terminal according to a second embodiment of the present invention, of determining similarity through image matching between voice-tagged reference image data and other image data and storing the determined similarity information.

Referring to FIG. 10, when the user selects the camera menu, the controller 180 activates the camera 121 and displays a preview image input through the lens of the camera 121 on the display unit 151 (S1010).

When the preview image is displayed, the controller 180 activates the voice recognition mode and checks whether a voice name for the image data being captured is input through the microphone 123 (S1020).

If the check shows that no voice name has been input through the microphone 123, the controller 180 keeps displaying the preview image on the display unit 151.

If the voice name is input through the microphone 123, the controller 180 controls the voice recognition unit 182 to recognize the voice name and then captures the subject image formed on the lens of the camera 121 (S1030). At the same time, the controller 180 tags the captured image data with the voice name (voice tagging) and stores it in the memory 160 (S1040).

The controller 180 also sets the voice-tagged image data as reference image data in order to determine the similarity between the voice-tagged image data and the other image data stored in the memory 160 (S1050).

Once the reference image data is determined, the controller 180 determines the similarity between the reference image data and all the other image data stored in the memory 160 (S1060). The similarity may be determined using at least one of the known image matching algorithms.

Such image matching algorithms are mainly used in the field of content-based image retrieval (CBIR). Various methods exist, ranging from low-level image analysis, which computes the color values or distribution of an image through histograms, texture analysis, wavelets, and the like, to high-level analysis, which computes the shapes or patterns of objects present in the image.
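As one concrete instance of the low-level, histogram-based analysis mentioned above, the following Python sketch scores two grayscale images by histogram intersection and scales the result to the range 0 to 100. The description does not specify which matching algorithm is used, so the choice of histogram intersection, the bin count, the 0 to 100 scale, and the function names are assumptions for illustration only.

```python
from typing import List, Sequence


def histogram(pixels: Sequence[int], bins: int = 8) -> List[float]:
    """Normalized histogram of 8-bit grayscale pixel values (0..255)."""
    counts = [0] * bins
    for p in pixels:
        counts[min(p * bins // 256, bins - 1)] += 1
    total = len(pixels)
    return [c / total for c in counts] if total else [0.0] * bins


def matching_score(
    pixels_a: Sequence[int], pixels_b: Sequence[int], bins: int = 8
) -> float:
    """Histogram intersection of two images, scaled to 0..100."""
    hist_a = histogram(pixels_a, bins)
    hist_b = histogram(pixels_b, bins)
    return 100.0 * sum(min(a, b) for a, b in zip(hist_a, hist_b))
```

Under this measure an image compared with itself scores 100, and two images with no overlapping intensity bins score 0. A histogram ignores spatial layout, so a practical system would likely combine it with the higher-level shape or face analysis the description mentions.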

For example, as shown in FIG. 11(a), when the voice name 1120 "Lee Tae-hee" (이태희) is received, the controller 180 captures the preview image 1110, tags the captured image data with the voice name 1120, and stores it.

At the same time, the controller 180 sets the image data 1130 tagged with the voice name 1120 as the reference image data. The controller 180 then sets its matching score to 100 to identify that image data as the reference image data.

The controller 180 then performs image matching on all the image data stored in the memory 160 to determine the similarity (that is, the matching score). In particular, when the reference image data shows a person's face, the controller 180 may determine the similarity using a face recognition algorithm.

When the similarity determination is complete, the controller 180 stores, for each item of image data stored in the memory, the reference voice tag information (i.e., "Lee Tae-hee") and the similarity information (i.e., the matching score) (S1070). The voice tag information and the similarity information may be stored as a text file linked to the corresponding image file. They may be stored in a specific area of the memory 160 or in a separate voice tag DB.

For example, as shown in FIG. 11(b), when image data 1140 and 1150 similar to the reference image data 1130 exist in the memory, the controller 180 may store, for each of the similar image data 1140 and 1150, the reference voice tag information and the matching score information in the voice tag DB as a text file.

Likewise, as shown in FIG. 12(a), when the voice name 1220 "Kim Dong-gun" is received, the controller 180 captures the preview image 1210, tags the captured image data with the voice name 1220, and stores it. The controller 180 then sets the image data 1230 tagged with the voice name 1220 as the reference image data and performs image matching on all image data stored in the memory 160 to determine the similarity.

As shown in FIG. 12(b), when image data 1240 and 1250 similar to the reference image data 1230 exist in the memory, the controller 180 may store, for each of the similar image data 1240 and 1250, the reference voice tag information (i.e., "Kim Dong-gun") and the matching score information in the voice tag DB as a text file.

If, however, text information containing other voice tag information (i.e., "Lee Tae-hee") and matching score information has already been stored in the memory 160 for particular image data 1250, the new voice tag information (i.e., "Kim Dong-gun") and matching score information may be added to that existing text information rather than generating new text information.
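The storage behavior described above, one record per image that grows as further voice tags are matched against it, can be sketched as follows. The in-memory dictionary stands in for the voice tag DB, and the file name and score values are hypothetical; the embodiment itself only specifies that tag and score are kept as text information linked to the image file.

```python
# Stand-in for the voice tag DB: image file name -> list of (voice tag, matching score).
voice_tag_db = {}


def store_tag(db, image_file, voice_tag, score):
    """Append a (tag, score) record, creating the entry only if none exists yet."""
    db.setdefault(image_file, []).append((voice_tag, score))


# Image 1250 is first matched against the "Lee Tae-hee" reference,
# then against the "Kim Dong-gun" reference (scores are made up).
store_tag(voice_tag_db, "img_1250.jpg", "Lee Tae-hee", 62)
store_tag(voice_tag_db, "img_1250.jpg", "Kim Dong-gun", 71)
print(voice_tag_db)
```

Because the second call appends to the existing entry, the image ends up with a single record carrying both tags, which matches the "add rather than regenerate" behavior of the paragraph above.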

FIGS. 13 and 14 illustrate an operation of searching, using voice recognition, for image data related to an input voice in the mobile terminal according to the second embodiment of the present invention.

Referring to FIGS. 13 and 14, when a user input for searching image data is received through the user input unit 130 or the touch screen 151, the controller 180 enters an image data search mode (S1310).

Upon entering the image data search mode, the controller 180 activates the voice recognition mode (S1320). That is, the controller 180 drives the microphone 123 and the voice recognition unit 182 to receive from the user the voice name of the image data to be searched. At the same time, the controller 180 displays a pop-up window 1420, as shown in FIGS. 14(a) and 14(c), on the display unit 151. The pop-up window 1420 contains the prompt "Say the image data to search for!".

The controller 180 checks whether a voice name 1430, 1440 of the image data to be searched is input through the microphone 123 (S1330).

If no voice name 1430, 1440 is input through the microphone 123, the controller 180 continues to display the pop-up window 1420 on the display unit 151. If a voice name 1430, 1440 is input through the microphone 123, the controller 180 controls the voice recognition unit 182 to recognize the voice name 1430, 1440 (S1340).

When the voice name 1430, 1440 is recognized, the controller 180 detects image data related to the input voice from the memory 160 (S1350) and displays the detected image data in order of similarity (i.e., matching score) (S1360). That is, the detected image data may be arranged according to similarity from the left of the screen to the right, or from front to back. The display size of each detected item may also be varied according to its similarity. In addition, only image data whose similarity is equal to or greater than a threshold may be detected and displayed.

For example, as shown in FIG. 14(b), when, with the voice mode activated, the user speaks "Lee Tae-hee" 1430, the voice name of the image data to be searched, into the microphone 123, the controller 180 detects the image data 1450 tagged with the voice name "Lee Tae-hee" together with similar image data 1460 and 1470, and displays the detected image data on the screen at different sizes in order of matching score.

Likewise, as shown in FIG. 14(d), when, with the voice mode activated, the user speaks "Kim Dong-gun and Lee Tae-hee" 1440, the voice name of the image data to be searched, into the microphone 123, the controller 180 detects the image data 1480 tagged with the voice name "Kim Dong-gun and Lee Tae-hee" together with similar image data 1490, and displays the detected image data on the screen at different sizes in order of matching score.
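The retrieval and ordering in steps S1350 and S1360 can be sketched as a filter followed by a sort. This is a minimal sketch under the assumption that tag records are held as image name mapped to a list of (voice tag, matching score) pairs; the function name `search_by_voice_name`, the file names, and the scores are hypothetical.

```python
def search_by_voice_name(db, voice_name, threshold=0):
    """Return (image, score) pairs tagged with voice_name, best match first."""
    hits = [
        (image, score)
        for image, records in db.items()
        for tag, score in records
        if tag == voice_name and score >= threshold
    ]
    return sorted(hits, key=lambda hit: hit[1], reverse=True)


# Hypothetical contents of the voice tag DB.
db = {
    "a.jpg": [("Lee Tae-hee", 85)],
    "ref.jpg": [("Lee Tae-hee", 100)],  # reference image, score fixed at 100
    "b.jpg": [("Lee Tae-hee", 40), ("Kim Dong-gun", 90)],
}
results = search_by_voice_name(db, "Lee Tae-hee", threshold=50)
print(results)
```

The returned order can then drive the left-to-right arrangement, and the score itself can be mapped to a display size; the optional threshold corresponds to showing only sufficiently similar images.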

As described above, the second embodiment of the present invention determines similarity through image matching between the voice-tagged reference image data and other image data, and allows image data related to a currently input voice name to be retrieved based on the determined similarity information.

In another embodiment of the present invention, the controller 180 performs image matching over entire moving image data based on voice-tagged reference image data, and provides a function that indicates where in the moving image data image data identical and/or similar to the reference image data appears.

For example, referring to FIG. 15, when a user input for finding the position of a specific image within moving image data is received, the controller 180 enters the image data search mode and activates the voice recognition mode. That is, the controller 180 drives the microphone 123 and the voice recognition unit 182 to receive from the user the voice name of the image data to be searched.

At the same time, the controller 180 displays a pop-up window 1510, as shown in FIG. 15(a), on the display unit 151. The pop-up window 1510 contains the prompt "Say the image data to search for".

The controller 180 checks whether the voice name 1520 of the image data to be searched is input through the microphone 123. If the voice name 1520 is input through the microphone 123, the controller 180 controls the voice recognition unit 182 to recognize the voice name 1520.

When the voice name 1520 is recognized, the controller 180 detects the image data tagged with the input voice name 1520 from the memory 160 and performs image matching over the entire moving image data using the detected image data as the reference. Through this image matching, the controller 180 determines where in the moving image data image data identical and/or similar to the image data tagged with the voice name 1520 appears.

When the positions of the identical and/or similar image data have been determined, the controller 180 displays thumbnail images 1530, 1540, and 1550 of that image data at the corresponding positions on the progress bar shown at the bottom of the moving image playback screen. When one of the displayed thumbnail images 1530, 1540, and 1550 is selected, the controller 180 moves to the position corresponding to the selected thumbnail image and plays the moving image data from there.
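The mapping from per-frame matching results to thumbnail positions on the progress bar can be sketched as below. The sketch assumes that a matching score has already been computed for every frame against the reference image; the threshold, the minimum gap used to merge consecutive matching frames into one marker, and the function names are assumptions made for illustration.

```python
def find_matching_positions(frame_scores, fps, threshold=80, min_gap_s=5.0):
    """Return timestamps (seconds) of frames whose score meets the threshold.

    A match closer than min_gap_s to the previous marker is skipped, so a run
    of consecutive matching frames yields a single marker.
    """
    positions = []
    for index, score in enumerate(frame_scores):
        t = index / fps
        if score >= threshold and (not positions or t - positions[-1] >= min_gap_s):
            positions.append(t)
    return positions


def progress_bar_fraction(position_s, duration_s):
    """Horizontal position of a thumbnail as a fraction of the progress bar."""
    return position_s / duration_s


# Hypothetical scores for an 8-second clip sampled at 1 frame per second.
frame_scores = [10, 90, 95, 20, 20, 20, 20, 85]
fps = 1.0
positions = find_matching_positions(frame_scores, fps)
fractions = [progress_bar_fraction(p, len(frame_scores) / fps) for p in positions]
print(positions, fractions)
```

Selecting a thumbnail then amounts to seeking the player to the corresponding entry of `positions`.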

In yet another embodiment of the present invention, when a call is requested using voice recognition, the controller 180 may provide a function that automatically recommends image data of the other party to be stored in the address book.

For example, referring to FIG. 16, when a user input for placing a call is received through the user input unit 130 or the touch screen 151, the controller 180 enters a call origination mode and activates the voice recognition mode. That is, the controller 180 drives the microphone 123 and the voice recognition unit 182 to receive from the user the voice name of the other party.

At the same time, the controller 180 displays a pop-up window 1610, as shown in FIG. 16(a), on the display unit 151. The pop-up window 1610 contains the prompt "Say who you want to call".

Thereafter, the controller 180 checks whether a voice command 1620 containing the voice name of the other party is input through the microphone 123.

If the voice command 1620 is input through the microphone 123, the controller 180 controls the voice recognition unit 182 to recognize the voice name of the other party (i.e., "Kim Dong-gun").

When the voice name is recognized, the controller 180 requests a call to the telephone number of the other party corresponding to the recognized voice name. The controller 180 also checks whether a photo of the other party is registered in the address book.

If no photo of the other party is registered in the address book, the controller 180 detects the image data tagged with the voice name "Kim Dong-gun" from the memory 160 and displays the detected image data 1630 on the screen.

At the same time, the controller 180 may display a pop-up window 1640 asking the user to confirm whether the detected image data relates to the other party. When the user touches the "YES" button while the pop-up window 1640 is displayed, the controller 180 stores the displayed image data 1630 in the corresponding entry of the address book.
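The recommendation flow of FIG. 16 can be summarized in a short sketch. The dictionary layouts for the address book and for the voice-tagged images, the phone number, and both function names are hypothetical; only the decision logic (recommend only when no photo is registered, store only after the user confirms) follows the description.

```python
def recommend_contact_photo(address_book, tagged_images, name):
    """Return an image to suggest for `name`, or None if a photo is already
    registered or no image carries that voice tag."""
    if address_book[name].get("photo") is not None:
        return None
    return tagged_images.get(name)


def confirm_photo(address_book, name, image, accepted):
    """Store the suggested image only if the user chose "YES"."""
    if accepted:
        address_book[name]["photo"] = image


# Hypothetical data: a contact without a photo and one voice-tagged image.
address_book = {"Kim Dong-gun": {"phone": "010-0000-0000", "photo": None}}
tagged_images = {"Kim Dong-gun": "img_1630.jpg"}

suggestion = recommend_contact_photo(address_book, tagged_images, "Kim Dong-gun")
confirm_photo(address_book, "Kim Dong-gun", suggestion, accepted=True)
print(address_book)
```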

FIGS. 17 and 18 illustrate an operation, in a mobile terminal according to a third embodiment of the present invention, of detecting predetermined words through a voice recognition engine during voice recording and tagging the audio data with the detected words before storing it.

Referring to FIGS. 17 and 18, when the voice recording menu 1810 is selected by the user, the controller 180 displays a voice recording execution screen 1820 on the display unit 151 (S1710).

When a start button (not shown) is selected while the voice recording execution screen 1820 is displayed, the controller 180 activates the voice recognition mode and starts recording (S1720). That is, the controller 180 drives the microphone 123 and the voice recognition unit 182 to receive external sound and voice.

The controller 180 also drives the voice recognition engine (S1730) and detects predetermined words (or key words) through it (S1740). The voice recognition engine may sample the audio data being recorded at every predetermined time period and set the first noun recognized in that period as a key word. Although this embodiment describes the key word as the first noun recognized in each predetermined time period, this is not limiting, and it will be apparent to those skilled in the art that the key word may vary according to user settings and the like.

When such a key word is detected, the controller 180 tags the audio data being recorded with the key word and information on its playback position (S1750). The information on the key word may be tagged as either an audio file or a text file.
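The key-word selection rule of steps S1740 and S1750, the first noun recognized in each time period together with its playback position, can be sketched as follows. The sketch assumes the voice recognition engine already delivers time-stamped words with a part-of-speech flag; the token format, the 10-second period, and the function name `tag_key_words` are assumptions, not part of the embodiment.

```python
def tag_key_words(tokens, period_s=10.0):
    """Pick the first noun recognized in each period.

    tokens: iterable of (time_s, word, is_noun) in chronological order.
    Returns a list of (word, time_s) tags, one per period that contains a noun.
    """
    tags = []
    tagged_periods = set()
    for time_s, word, is_noun in tokens:
        period = int(time_s // period_s)
        if is_noun and period not in tagged_periods:
            tagged_periods.add(period)
            tags.append((word, time_s))
    return tags


# Hypothetical recognizer output for a short recording.
tokens = [
    (1.2, "quickly", False),
    (2.0, "meeting", True),
    (4.5, "budget", True),      # same period as "meeting", so not tagged
    (12.3, "Mugunghwa", True),
]
tags = tag_key_words(tokens)
print(tags)
```

Storing the playback position next to each word is what later allows playback to jump directly to the selected word.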

Thereafter, the controller 180 checks whether a user input for ending the voice recording is received (S1760).

If no user input for ending the recording is received, the controller 180 continues the voice recording in progress. If a user input for ending the recording is received, the controller 180 stores the audio file tagged with at least one key word in the memory 160 (S1770).

FIGS. 19 and 20 illustrate an operation, in the mobile terminal according to the third embodiment of the present invention, of displaying the words tagged to the audio data on the playback screen during playback of an audio file and changing the playback position of the audio file when one of the displayed words is selected.

Referring to FIGS. 19 and 20, when a user input for searching voice-recorded audio files is received through the user input unit 130 or the touch screen 151, the controller 180 displays an audio file list screen (or recording file list screen) 2010 on the display unit 151.

When a specific audio file 2020 is selected while the audio file list screen 2010 is displayed (S1920), the controller 180 starts playing the selected audio file (S1930). That is, the controller 180 displays a playback screen 2030, as shown in FIG. 20(b), on the display unit 151 and outputs the recorded audio through the audio output module 153.

The controller 180 also detects the words tagged to the audio file and displays the detected words around the progress bar of the playback screen 2030 (S1940). When one of the displayed words is selected (S1950), the controller 180 moves to the position of the selected word and plays the audio file from there (S1960).

For example, as shown in FIGS. 20(b) and 20(c), when the word "Mugunghwa" 2040 is selected from among the words displayed on the screen 2030 during playback of an audio file, the controller 180 moves to the position containing the word "Mugunghwa" 2040 and plays the audio file from there.
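Moving playback to a selected word (S1950 and S1960) reduces to a lookup in the tags stored with the audio file. A minimal sketch, assuming the tags are available as (word, position in seconds) pairs; the tag values and the function name `seek_position` are hypothetical.

```python
def seek_position(tags, selected_word):
    """Return the stored playback position (seconds) of the selected word,
    or None if the audio file carries no such tag."""
    for word, time_s in tags:
        if word == selected_word:
            return time_s
    return None


# Hypothetical tags read from a recorded audio file.
tags = [("meeting", 2.0), ("Mugunghwa", 12.3)]
position = seek_position(tags, "Mugunghwa")
print(position)
```

If the same word is tagged more than once, this sketch returns the first occurrence; how repeated words are presented is not specified in the description.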

As described above, the third embodiment of the present invention displays the predetermined words tagged to the audio data on the playback screen during playback of an audio file, and allows the playback position of the audio file to be moved when one of the displayed words is selected.

As described above, the invention according to yet another embodiment of the present invention provides, when a call is received, a ringtone and/or call reception screen corresponding to received weather information, so that the user can intuitively grasp the current weather.

Meanwhile, the present invention can be implemented as processor-readable code on a processor-readable recording medium provided in the mobile terminal. The processor-readable recording medium includes all kinds of recording devices that store data readable by a processor. Examples include ROM, RAM, CD-ROM, magnetic tape, floppy disks, and optical data storage devices, and also include implementation in the form of a carrier wave, such as transmission over the Internet. The processor-readable recording medium may also be distributed over computer systems connected by a network, so that the processor-readable code is stored and executed in a distributed manner.

Although preferred embodiments of the present invention have been shown and described above, the present invention is not limited to the specific embodiments described. Those of ordinary skill in the art to which the invention pertains may make various modifications without departing from the gist of the invention as claimed in the claims, and such modifications should not be understood separately from the technical idea or outlook of the present invention.

110: wireless communication unit 120: A/V input unit
130: user input unit 140: sensing unit
150: output unit 151: display unit
160: memory 170: interface unit
180: controller 182: voice recognition unit

Claims

1. A method of operating a mobile terminal, the method comprising:
driving a camera in response to a first user input;
displaying a preview image on a screen of a display unit;
activating a voice recognition mode to recognize a voice input through a microphone; and
capturing the preview image in response to the voice input through the microphone, tagging the captured image data with the input voice, and storing the tagged image data in a memory.

2. The method of claim 1, further comprising:
detecting the voice-tagged image data from the memory when a voice for searching for specific image data is input; and
displaying the detected voice-tagged image data on the screen.

3. The method of claim 2, further comprising:
displaying content related to the input voice together with the detected voice-tagged image data.

4. The method of claim 3,
wherein the content includes at least one of a voice memo, a notepad, and a web page.

5. The method of claim 1, further comprising:
setting the voice-tagged image data as reference image data; and
determining, through image matching with the reference image data, the similarity of other image data stored in the memory, and storing the determined similarity information.

6. The method of claim 5, wherein the storing of the similarity information
further comprises storing voice tag information together with the similarity information in the form of a text file.

7. The method of claim 5, further comprising:
detecting image data related to the input voice from the memory when a voice for searching for specific image data is input; and
displaying the detected image data on the screen according to the similarity information.

8. The method of claim 1, further comprising:
playing moving image data in response to a second user input; and
detecting the voice-tagged image data from the memory when a voice for searching for specific image data is input during playback of the moving image data.

9. The method of claim 8, further comprising:
setting the detected voice-tagged image data as reference image data;
performing image matching on the moving image data based on the reference image data to detect playback positions of the moving image data at which image data identical or similar to the reference image data appears; and
displaying a thumbnail image corresponding to the identical or similar image data at each detected position on the playback screen.

10. The method of claim 1, further comprising:
requesting a call to the telephone number of a specific party through voice recognition; and
detecting and automatically recommending the voice-tagged image data when no photo of the other party is registered in the address book.