KR102158210B1

KR102158210B1 - Speech recognition apparatus and method thereof

Info

Publication number: KR102158210B1
Application number: KR1020130106218A
Authority: KR
Inventors: 정해근; 박기원
Original assignee: 엘지전자 주식회사
Priority date: 2013-09-04
Filing date: 2013-09-04
Publication date: 2020-09-22
Also published as: KR20150027592A

Abstract

본 명세서는 사용자 사용 패턴에 따라 임계치를 변경함으로써 사용자 음성을 정확히 인식할 수 있는 음성 인식 장치 및 그 방법에 관한 것으로서, 본 명세서에 개시된 실시예에 따른 음성 인식 장치는, 신뢰도 점수를 제공하는 미리결정된 음성 모델들을 근거로 사용자 음성을 인식하는 음성 인식부와; 상기 인식된 사용자 음성의 신뢰도 점수와 임계치를 근거로 상기 인식된 사용자 음성을 허용 또는 거절하는 제어부를 포함하며, 상기 제어부는 사용자 사용 음성 패턴에 따라 상기 임계치를 변경할 수 있다.The present specification relates to a speech recognition apparatus and method capable of accurately recognizing a user's voice by changing a threshold value according to a user usage pattern. The speech recognition apparatus according to an embodiment disclosed in the present specification includes a predetermined reliability score A voice recognition unit for recognizing a user's voice based on voice models; And a control unit for allowing or rejecting the recognized user voice based on a reliability score and a threshold value of the recognized user voice, and the control unit may change the threshold value according to a user voice pattern.

Description

Speech recognition apparatus and its method TECHNICAL FIELD [SPEECH RECOGNITION APPARATUS AND METHOD THEREOF}

본 명세서는 음성 인식 장치 및 그 방법에 관한 것이다. The present specification relates to a speech recognition apparatus and method thereof.

일반적으로, 종래 기술에 따른 음성 인식 장치는 사용자가 발성(utterance)했을 때 그 발성된 음성 신호를 수신하고, 그 수신된 음성 신호와 미리결정된 모델을 비교하고, 그 비교 결과에 따라 사용자 음성을 인식한다. 즉, 종래 기술에 따른 음성 인식 시스템은 신뢰도(confidence) 측정 방법을 통해 사용자 음성을 인식하였다. 종래 기술에 따른 음성 인식 장치는 한국 특허 출원 번호 10-2009-0057093에 개시되어 있다. In general, a speech recognition apparatus according to the prior art receives the uttered speech signal when a user utters, compares the received speech signal with a predetermined model, and recognizes the user speech according to the comparison result. do. That is, the speech recognition system according to the prior art recognizes the user's voice through a method of measuring confidence. A speech recognition apparatus according to the prior art is disclosed in Korean Patent Application No. 10-2009-0057093.

본 명세서는 사용자 사용 패턴에 따라 임계치를 변경함으로써 사용자 음성을 정확히 인식할 수 있는 음성 인식 장치 및 그 방법을 제공하는 데 그 목적이 있다.An object of the present specification is to provide a speech recognition apparatus and method capable of accurately recognizing a user's voice by changing a threshold value according to a user usage pattern.

본 명세서에 개시된 실시예에 따른 음성 인식 장치는, 신뢰도 점수를 제공하는 미리결정된 음성 모델들을 근거로 사용자 음성을 인식하는 음성 인식부와; 상기 인식된 사용자 음성의 신뢰도 점수와 임계치를 근거로 상기 인식된 사용자 음성을 허용 또는 거절하는 제어부를 포함하며, 상기 제어부는 사용자 사용 음성 패턴에 따라 상기 임계치를 변경할 수 있다.A speech recognition apparatus according to an embodiment disclosed in the present specification includes: a speech recognition unit for recognizing a user speech based on predetermined speech models providing a reliability score; And a control unit for allowing or rejecting the recognized user voice based on a reliability score and a threshold value of the recognized user voice, and the control unit may change the threshold value according to a user voice pattern.

본 명세서와 관련된 일 예로서, 상기 제어부는 상기 사용자 사용 음성 패턴에 따라 상기 임계치를 증가시키거나 감소시킬 수 있다.As an example related to the present specification, the controller may increase or decrease the threshold value according to the voice pattern used by the user.

본 명세서와 관련된 일 예로서, 상기 제어부는 상기 인식된 사용자 음성의 신뢰도 점수가 상기 증가 또는 감소한 임계치보다 높으면 상기 인식된 사용자 음성을 허용할 수 있다.As an example related to the present specification, the controller may allow the recognized user voice if the reliability score of the recognized user voice is higher than the increased or decreased threshold.

본 명세서와 관련된 일 예로서, 상기 제어부는 상기 인식된 사용자 음성의 신뢰도 점수가 상기 증가 또는 감소한 임계치보다 낮으면 상기 인식된 사용자 음성을 거절할 수 있다.As an example related to the present specification, the control unit may reject the recognized user voice when the reliability score of the recognized user voice is lower than the increased or decreased threshold.

본 명세서와 관련된 일 예로서, 상기 제어부는 영상 표시 장치의 기능에 따라 미리 분류된 상기 사용자 사용 음성 패턴을 근거로 상기 임계치를 변경할 수 있다. As an example related to the present specification, the controller may change the threshold value based on the voice pattern used by the user classified in advance according to the function of the video display device.

본 명세서와 관련된 일 예로서, 상기 미리 분류된 사용자 사용 음성 패턴은, 상기 영상 표시 장치의 채널 및 볼륨을 제어하기 위한 제1 사용자 음성의 사용률, 상기 영상 표시 장치를 통해 웹 브라우저를 실행시키기 위한 제2 사용자 음성의 사용률, 상기 영상 표시 장치를 통해 응용 프로그램을 실행시키기 위한 제3 사용자 음성의 사용률, 상기 영상 표시 장치의 방송 프로그램 검색을 위한 제4 사용자 음성의 사용률 중에서 적어도 어느 하나 이상을 포함하는 것을 특징으로 하는 음성 처리 장치.As an example related to the present specification, the pre-categorized user use voice pattern includes a first user voice usage rate for controlling a channel and volume of the video display device, and a first user voice usage rate for controlling a channel and volume of the video display device, 2 Including at least one or more of a usage rate of a user's voice, a usage rate of a third user's voice for executing an application program through the video display device, and a usage rate of a fourth user voice for searching a broadcast program by the video display device. An audio processing device characterized in that.

본 명세서와 관련된 일 예로서, 상기 제어부는 상기 사용자 사용 음성 패턴을 미리 설정할 수 있다. As an example related to the present specification, the control unit may preset the user use voice pattern.

본 명세서와 관련된 일 예로서, 상기 제어부는 서버로부터 상기 사용자 사용 음성 패턴을 수신할 수 있다. As an example related to the present specification, the controller may receive the user use voice pattern from a server.

본 명세서와 관련된 일 예로서, 상기 사용자 음성의 사용률은 사용자 평균 사용 음성의 사용률일 수 있다. As an example related to the present specification, the user voice usage rate may be an average user voice usage rate.

본 명세서와 관련된 일 예로서, 상기 제어부는 유사 발음을 갖는 다수의 사용자 음성이 인식되면 상기 임계치를 증가시킴으로써 상기 유사 발음을 갖는 다수의 사용자 음성의 인식 확률을 증가시킬 수 있다.As an example related to the present specification, when multiple user voices having similar pronunciation are recognized, the controller may increase the probability of recognizing the plurality of user voices having similar pronunciation by increasing the threshold.

본 명세서에 개시된 실시예에 따른 음성 인식 방법은, 신뢰도 점수를 제공하는 미리결정된 음성 모델들을 근거로 사용자 음성을 인식하는 단계와; 사용자 사용 음성 패턴에 따라 임계치를 변경하는 단계와; 상기 인식된 사용자 음성의 신뢰도 점수와 상기 임계치를 근거로 상기 인식된 사용자 음성을 허용 또는 거절하는 단계를 포함할 수 있다. A speech recognition method according to an embodiment disclosed herein includes the steps of recognizing a user's speech based on predetermined speech models providing a confidence score; Changing a threshold according to a user's voice pattern; And allowing or rejecting the recognized user voice based on the reliability score of the recognized user voice and the threshold.

본 발명의 실시예들에 따른 음성 인식 장치 및 그 방법은, 사용자 사용 패턴(영상 표시기기, 이동 통신 단말기 등의 전자 장치의 기능들에 따라 미리 분류된 각 사용자 음성의 사용률)에 따라 임계치를 변경함으로써 사용자 음성을 정확히 인식할 수 있다. The voice recognition apparatus and method thereof according to embodiments of the present invention change a threshold according to a user usage pattern (a usage rate of each user's voice classified in advance according to functions of an electronic device such as a video display device and a mobile communication terminal). Thus, the user's voice can be accurately recognized.

본 발명의 실시예들에 따른 음성 인식 장치 및 그 방법은, 사용자 사용 기능 패턴(영상 표시기기, 이동 통신 단말기 등의 전자 장치의 각 기능들의 사용률)에 따라 임계치를 변경함으로써 사용자 음성을 정확히 인식할 수 있다. The voice recognition apparatus and method according to embodiments of the present invention can accurately recognize a user voice by changing a threshold value according to a user use function pattern (a usage rate of each function of an electronic device such as a video display device and a mobile communication terminal). I can.

도 1은 본 발명과 관련된 영상 표시 장치 및 외부 입력 장치를 보여주는 블록도이다.
도 2는 본 발명의 실시예들에 따른 영상 처리 장치가 적용된 3D 영상 표시 장치의 구성을 나타낸 구성도이다.
도 3은 본 발명의 실시예에 따른 음성 인식 방법을 나타낸 흐름도이다.
도 4는 본 발명의 실시예에 따른 사용자 사용 음성 패턴을 나타낸 예시도이다.
도 5는 본 발명의 다른 실시예에 따른 음성 인식 방법을 나타낸 흐름도이다.
도 6은 본 발명의 실시예에 따른 사용자 사용 기능 패턴을 나타낸 예시도이다.1 is a block diagram showing an image display device and an external input device related to the present invention.
2 is a block diagram illustrating a configuration of a 3D image display device to which an image processing device according to embodiments of the present invention is applied.
3 is a flow chart showing a speech recognition method according to an embodiment of the present invention.
4 is an exemplary diagram showing a user use voice pattern according to an embodiment of the present invention.
5 is a flowchart illustrating a speech recognition method according to another embodiment of the present invention.
6 is an exemplary diagram showing a user use function pattern according to an embodiment of the present invention.

본 명세서에서 사용되는 기술적 용어는 단지 특정한 실시 예를 설명하기 위해 사용된 것으로, 본 발명을 한정하려는 의도가 아님을 유의해야 한다. 또한, 본 명세서에서 사용되는 기술적 용어는 본 명세서에서 특별히 다른 의미로 정의되지 않는 한, 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자에 의해 일반적으로 이해되는 의미로 해석되어야 하며, 과도하게 포괄적인 의미로 해석되거나, 과도하게 축소된 의미로 해석되지 않아야 한다. 또한, 본 명세서에서 사용되는 기술적인 용어가 본 발명의 사상을 정확하게 표현하지 못하는 잘못된 기술적 용어일 때에는, 당업자가 올바르게 이해할 수 있는 기술적 용어로 대체되어 이해되어야 할 것이다. 또한, 본 발명에서 사용되는 일반적인 용어는 사전에 정의되어 있는 바에 따라, 또는 전후 문맥상에 따라 해석되어야 하며, 과도하게 축소된 의미로 해석되지 않아야 한다. It should be noted that the technical terms used in the present specification are only used to describe specific embodiments, and are not intended to limit the present invention. In addition, the technical terms used in the present specification should be interpreted as generally understood by those of ordinary skill in the technical field to which the present invention belongs, unless otherwise defined in the present specification, and excessively comprehensive It should not be construed as a human meaning or an excessively reduced meaning. In addition, when a technical term used in the present specification is an incorrect technical term that does not accurately express the spirit of the present invention, it will be replaced with a technical term that can be correctly understood by those skilled in the art to be understood. In addition, general terms used in the present invention should be interpreted as defined in the dictionary or according to the context before and after, and should not be interpreted as an excessively reduced meaning.

또한, 본 명세서에서 사용되는 단수의 표현은 문맥상 명백하게 다르게 뜻하지 않는 한, 복수의 표현을 포함한다. 본 출원에서, "구성된다" 또는 "포함한다" 등의 용어는 명세서 상에 기재된 여러 구성 요소들, 또는 여러 단계들을 반드시 모두 포함하는 것으로 해석되지 않아야 하며, 그 중 일부 구성 요소들 또는 일부 단계들은 포함되지 않을 수도 있고, 또는 추가적인 구성 요소 또는 단계들을 더 포함할 수 있는 것으로 해석되어야 한다. In addition, the singular expression used in the present specification includes a plurality of expressions unless the context clearly indicates otherwise. In the present application, terms such as "consist of" or "include" should not be construed as necessarily including all of the various elements or various steps described in the specification, and some of the elements or some steps It may not be included, or it should be interpreted that it may further include additional elements or steps.

또한, 본 명세서에서 사용되는 제1, 제2 등과 같이 서수를 포함하는 용어는 다양한 구성 요소들을 설명하는데 사용될 수 있지만, 상기 구성 요소들은 상기 용어들에 의해 한정되어서는 안 된다. 상기 용어들은 하나의 구성요소를 다른 구성요소로부터 구별하는 목적으로만 사용된다. 예를 들어, 본 발명의 권리 범위를 벗어나지 않으면서 제1 구성요소는 제2 구성 요소로 명명될 수 있고, 유사하게 제2 구성 요소도 제1 구성 요소로 명명될 수 있다. In addition, terms including ordinal numbers such as first and second used herein may be used to describe various elements, but the elements should not be limited by the terms. These terms are used only for the purpose of distinguishing one component from another component. For example, without departing from the scope of the present invention, a first component may be referred to as a second component, and similarly, a second component may be referred to as a first component.

이하, 첨부된 도면을 참조하여 본 발명에 따른 바람직한 실시 예를 상세히 설명하되, 도면 부호에 관계없이 동일하거나 유사한 구성 요소는 동일한 참조 번호를 부여하고 이에 대한 중복되는 설명은 생략하기로 한다. Hereinafter, exemplary embodiments of the present invention will be described in detail with reference to the accompanying drawings, but the same or similar components are assigned the same reference numerals regardless of the reference numerals, and redundant descriptions thereof will be omitted.

또한, 본 발명을 설명함에 있어서 관련된 공지 기술에 대한 구체적인 설명이 본 발명의 요지를 흐릴 수 있다고 판단되는 경우 그 상세한 설명을 생략한다. 또한, 첨부된 도면은 본 발명의 사상을 쉽게 이해할 수 있도록 하기 위한 것일 뿐, 첨부된 도면에 의해 본 발명의 사상이 제한되는 것으로 해석되어서는 아니 됨을 유의해야 한다. In addition, in describing the present invention, when it is determined that a detailed description of a related known technology may obscure the subject matter of the present invention, a detailed description thereof will be omitted. In addition, it should be noted that the accompanying drawings are only for easily understanding the spirit of the present invention and should not be construed as limiting the spirit of the present invention by the accompanying drawings.

본 명세서에서, 영상 표시 장치는 방송을 수신하여 표시하거나, 동영상을 기록 및 재생하는 장치와 오디오를 기록 및 재생하는 장치를 모두 포함한다. 이하, 이러한 예로서, 텔레비전을 예를 들어 설명한다.In this specification, the video display device includes both a device for receiving and displaying a broadcast, recording and reproducing a video, and a device for recording and reproducing audio. Hereinafter, as such an example, a television will be described as an example.

도 1은 본 발명과 관련된 영상 표시 장치(100) 및 외부 입력 장치(190)를 보여주는 블록도이다. 영상 표시 장치(100)는, 튜너(110), 복조부(120), 신호 입출력부(130), 인터페이스부(140), 제어부(150), 저장부(160), 디스플레이부(170) 및 오디오 출력부(180)를 포함한다. 다만, 외부 입력 장치(190)는 영상 표시 장치(100)와 별도의 장치이나, 영상 표시 장치(100)의 일 구성요소로 포함될 수도 있다.1 is a block diagram showing an image display device 100 and an external input device 190 related to the present invention. The video display device 100 includes a tuner 110, a demodulation unit 120, a signal input/output unit 130, an interface unit 140, a control unit 150, a storage unit 160, a display unit 170, and an audio system. It includes an output unit 180. However, the external input device 190 may be a separate device from the image display device 100 or may be included as a component of the image display device 100.

도 1을 참조하면, 튜너(110)는 안테나를 통해 수신되는 RF(Radio Frequency) 방송 신호 중 사용자에 의해 선택된 채널에 대응하는 RF 방송 신호를 선택하고, RF 방송 신호를 중간 주파수 신호 또는 베이스 밴드 영상/음성 신호로 변환한다. 예를 들어, RF 방송 신호가 디지털 방송 신호이면, 튜너(110)는 RF 방송 신호를 디지털 IF 신호(DIF)로 변환한다. 반면, RF 방송 신호가 아날로그 방송 신호이면, 튜너(110)는 RF 방송 신호를 아날로그 베이스 밴드 영상/음성신호(CVBS/SIF)로 변환된다. 이와 같이, 튜너(110)는 디지털 방송 신호와 아날로그 방송 신호를 처리할 수 있는 하이브리드 튜너일 수 있다.Referring to FIG. 1, the tuner 110 selects an RF broadcast signal corresponding to a channel selected by a user among radio frequency (RF) broadcast signals received through an antenna, and converts the RF broadcast signal to an intermediate frequency signal or a baseband image. /Convert to audio signal. For example, if the RF broadcast signal is a digital broadcast signal, the tuner 110 converts the RF broadcast signal into a digital IF signal (DIF). On the other hand, if the RF broadcast signal is an analog broadcast signal, the tuner 110 converts the RF broadcast signal into an analog baseband video/audio signal (CVBS/SIF). As such, the tuner 110 may be a hybrid tuner capable of processing a digital broadcast signal and an analog broadcast signal.

튜너(110)에서 출력되는 디지털 IF 신호(DIF)는 복조부(120)로 입력되고, 튜너(110)에서 출력되는 아날로그 베이스 밴드 영상/음성신호(CVBS/SIF)는 제어부(160)로 입력될 수 있다. 튜너(120)는 ATSC(Advanced Television Systems Committee) 방식에 따른 단일 캐리어의 RF 방송 신호 또는 DVB(Digital Video Broadcasting) 방식에 따른 복수 캐리어의 RF 방송 신호를 수신할 수 있다.The digital IF signal (DIF) output from the tuner 110 is input to the demodulator 120, and the analog baseband video/audio signal (CVBS/SIF) output from the tuner 110 is input to the controller 160. I can. The tuner 120 may receive an RF broadcast signal of a single carrier according to an Advanced Television Systems Committee (ATSC) method or an RF broadcast signal of multiple carriers according to a Digital Video Broadcasting (DVB) method.

비록 도면에는 하나의 튜너(110)가 도시되나, 이에 한정되지 않고, 영상 표시 장치(100)는 다수의 튜너, 예를 들어, 제 1 및 제 2 튜너를 구비할 수 있다. 이런 경우, 제 1 튜너는 사용자가 선택한 방송 채널에 대응하는 제 1 RF 방송 신호를 수신하고, 제 2 튜너는 기저장된 방송 채널에 대응하는 제 2 RF 방송 신호를 순차적으로 또는 주기적으로 수신할 수 있다. 제 2 튜너는 제 1 튜너와 마찬가지 방식으로 RF 방송 신호를 디지털 IF 신호(DIF) 또는 아날로그 베이스 밴드 영상/음성신호(CVBS/SIF)로 변환할 수 있다. 본 발명의 실시예들에서는 하나의 튜너(110)만을 사용함으로써 이에 따른 제조 비용을 절감할 수 있다.Although one tuner 110 is shown in the drawing, the present invention is not limited thereto, and the video display device 100 may include a plurality of tuners, for example, first and second tuners. In this case, the first tuner may receive the first RF broadcast signal corresponding to the broadcast channel selected by the user, and the second tuner may sequentially or periodically receive the second RF broadcast signal corresponding to the previously stored broadcast channel. . The second tuner may convert an RF broadcast signal into a digital IF signal (DIF) or an analog baseband video/audio signal (CVBS/SIF) in the same manner as the first tuner. In embodiments of the present invention, only one tuner 110 is used, thereby reducing manufacturing cost.

복조부(120)는 튜너(110)에서 변환되는 디지털 IF 신호(DIF)를 수신하여 복조 동작을 수행한다. 예를 들어, 튜너(110)에서 출력되는 디지털 IF 신호(DIF)가 ATSC 방식이면, 복조부(120)는 8-VSB(8-Vestigal Side Band) 복조를 수행한다. 이때, 복조부(120)는 트렐리스 복호화, 디인터리빙(de-interleaving), 리드 솔로몬 복호화 등의 채널 복호화를 수행할 수도 있다. 이를 위해, 복조부(120)는 트렐리스 디코더(Trellis decoder), 디인터리버(de-interleaver) 및 리드 솔로몬 디코더(Reed Solomon decoder) 등을 구비할 수 있다.The demodulator 120 receives the digital IF signal DIF converted by the tuner 110 and performs a demodulation operation. For example, if the digital IF signal DIF output from the tuner 110 is an ATSC method, the demodulator 120 performs 8-VSB (8-Vestigal Side Band) demodulation. In this case, the demodulator 120 may perform channel decoding such as trellis decoding, de-interleaving, and Reed-Solomon decoding. To this end, the demodulator 120 may include a Trellis decoder, a de-interleaver, a Reed Solomon decoder, and the like.

다른 예를 들어, 튜너(110)에서 출력되는 디지털 IF 신호(DIF)가 DVB 방식이면, 복조부(120)는 COFDMA(Coded Orthogonal Frequency Division Modulation) 복조를 수행한다. 이때, 복조부(120)는 컨벌루션 복호화, 디인터리빙, 리드 솔로몬 복호화 등의 채널 복호화를 수행할 수도 있다. 이를 위해, 복조부(120)는 컨벌루션 디코더(convolution decoder), 디인터리버 및 리드-솔로몬 디코더 등을 구비할 수 있다.For another example, if the digital IF signal DIF output from the tuner 110 is a DVB type, the demodulator 120 performs Coded Orthogonal Frequency Division Modulation (COFDMA) demodulation. In this case, the demodulator 120 may perform channel decoding such as convolutional decoding, deinterleaving, and Reed-Solomon decoding. To this end, the demodulator 120 may include a convolution decoder, a deinterleaver, and a Reed-Solomon decoder.

신호 입출력부(130)는 외부 기기와 연결되어 신호 입력 및 출력 동작을 수행하고, 이를 위해, A/V 입출력부 및 무선 통신부를 포함할 수 있다.The signal input/output unit 130 is connected to an external device to perform signal input and output operations, and to this end, may include an A/V input/output unit and a wireless communication unit.

A/V 입출력부는 이더넷(Ethernet) 단자, USB 단자, CVBS(Composite Video Banking Sync) 단자, 컴포넌트 단자, S-비디오 단자(아날로그), DVI(Digital Visual Interface) 단자, HDMI(High Definition Multimedia Interface) 단자, MHL (Mobile High-definition Link) 단자, RGB 단자, D-SUB 단자, IEEE 1394 단자, SPDIF 단자, 리퀴드(Liquid) HD 단자 등을 포함할 수 있다. 이러한 단자들을 통해 입력되는 디지털 신호는 제어부(150)에 전달될 수 있다. 이때, CVBS 단자 및 S-비디오 단자를 통해 입력되는 아날로그 신호는 아날로그-디지털 변환부(미도시)를 통해 디지털 신호로 변환되어 제어부(150)로 전달될 수 있다.A/V input/output is an Ethernet terminal, USB terminal, CVBS (Composite Video Banking Sync) terminal, component terminal, S-video terminal (analog), DVI (Digital Visual Interface) terminal, HDMI (High Definition Multimedia Interface) terminal , Mobile High-definition Link (MHL) terminal, RGB terminal, D-SUB terminal, IEEE 1394 terminal, SPDIF terminal, Liquid HD terminal, and the like. Digital signals input through these terminals may be transmitted to the controller 150. In this case, the analog signal input through the CVBS terminal and the S-video terminal may be converted into a digital signal through an analog-digital converter (not shown) and transmitted to the controller 150.

무선 통신부는 무선 인터넷 접속을 수행할 수 있다. 예를 들어, 무선 통신부는 WLAN(Wireless LAN)(Wi-Fi), Wibro(Wireless broadband), Wimax(World Interoperability for Microwave Access), HSDPA(High Speed Downlink Packet Access) 등을 이용하여 무선 인터넷 접속을 수행할 수 있다. 또한, 무선 통신부는 다른 전자기기와 근거리 무선 통신을 수행할 수 있다. 예를 들어, 무선 통신부는 블루투스(Bluetooth), RFID(Radio Frequency Identification), 적외선 통신(IrDA, infrared Data Association), UWB(Ultra Wideband), 지그비(ZigBee) 등을 이용하여 근거리 무선 통신을 수행할 수 있다.The wireless communication unit may perform wireless Internet access. For example, the wireless communication unit performs wireless Internet access using WLAN (Wireless LAN) (Wi-Fi), Wibro (Wireless broadband), Wimax (World Interoperability for Microwave Access), HSDPA (High Speed Downlink Packet Access), etc. can do. In addition, the wireless communication unit may perform short-range wireless communication with other electronic devices. For example, the wireless communication unit can perform short-range wireless communication using Bluetooth, Radio Frequency Identification (RFID), infrared data association (IrDA), Ultra Wideband (UWB), and ZigBee. have.

신호 입출력부(130)는 DVD(Digital Versatile Disk) 플레이어, 블루레이(Blu-ray) 플레이어, 게임기기, 캠코더, 컴퓨터(노트북), 휴대기기, 스마트 폰 등과 같은 외부 기기로부터 제공되는 영상 신호, 음성 신호 및 데이터 신호를 제어부(150)로 전달할 수 있다. 또한, 메모리장치, 하드디스크 등과 같은 외부 저장 장치에 저장된 다양한 미디어 파일의 영상 신호, 음성 신호 및 데이터 신호를 제어부(150)로 전달할 수 있다. 또한, 제어부(150)에 의해 처리된 영상 신호, 음성 신호 및 데이터 신호를 다른 외부 기기로 출력할 수 있다.The signal input/output unit 130 includes video signals and audio provided from external devices such as a digital versatile disk (DVD) player, a Blu-ray player, a game device, a camcorder, a computer (laptop), a mobile device, and a smart phone. Signals and data signals may be transmitted to the controller 150. In addition, video signals, audio signals, and data signals of various media files stored in an external storage device such as a memory device or a hard disk may be transmitted to the controller 150. Also, an image signal, an audio signal, and a data signal processed by the control unit 150 may be output to another external device.

신호 입출력부(130)는 상술한 각종 단자 중 적어도 하나를 통해 셋톱 박스, 예를 들어, IPTV(Internet Protocol TV)용 셋톱 박스와 연결되어 신호 입력 및 출력 동작을 수행할 수 있다. 예를 들어, 신호 입출력부(130)는 양방향 통신이 가능하도록 IPTV용 셋톱 박스에 의해 처리된 영상 신호, 음성 신호 및 데이터 신호를 제어부(150)로 전달할 수 있고, 제어부(150)에 의해 처리된 신호들을 IPTV용 셋톱 박스로 전달할 수도 있다. 여기서, IPTV는 전송 네트워크에 따라 구분되는 ADSL-TV, VDSL-TV, FTTH-TV 등을 포함할 수 있다.The signal input/output unit 130 may be connected to a set-top box, for example, a set-top box for Internet Protocol TV (IPTV) through at least one of the aforementioned various terminals to perform signal input and output operations. For example, the signal input/output unit 130 may transmit a video signal, an audio signal, and a data signal processed by the IPTV set-top box to enable two-way communication, and the processed by the control unit 150 The signals can also be delivered to a set-top box for IPTV. Here, IPTV may include ADSL-TV, VDSL-TV, FTTH-TV, etc. classified according to transmission networks.

복조부(120) 및 신호 출력부(130)에서 출력되는 디지털 신호는 스트림 신호(TS)를 포함할 수 있다. 스트림 신호(TS)는 영상 신호, 음성 신호 및 데이터 신호가 다중화된 신호일 수 있다. 예를 들어, 스트림 신호(TS)는 MPEG-2 규격의 영상 신호, 돌비(Dolby) AC-3 규격의 음성 신호 등이 다중화된 MPEG-2 TS(Transprt Stream)일 수 있다. 여기서, MPEG-2 TS는 4 바이트(byte)의 헤더와 184 바이트의 페이로드(payload)를 포함할 수 있다.The digital signals output from the demodulation unit 120 and the signal output unit 130 may include a stream signal TS. The stream signal TS may be a signal in which a video signal, an audio signal, and a data signal are multiplexed. For example, the stream signal TS may be an MPEG-2 Transprt Stream (TS) in which an MPEG-2 standard video signal and a Dolby AC-3 standard audio signal are multiplexed. Here, the MPEG-2 TS may include a header of 4 bytes and a payload of 184 bytes.

인터페이스부(140)는 외부 입력 장치(190)로부터 전원 제어, 채널 선택, 화면 설정 등을 위한 입력 신호를 수신하거나, 제어부(160)에 의해 처리된 신호를 외부 입력 장치(190)로 전송할 수 있다. 인터페이스부(140)와 외부 입력 장치(190)는 유선 또는 무선으로 연결될 수 있다.The interface unit 140 may receive an input signal for power control, channel selection, screen setting, etc. from the external input device 190 or may transmit a signal processed by the controller 160 to the external input device 190. . The interface unit 140 and the external input device 190 may be connected by wire or wirelessly.

상기 인터페이스부(140)의 일 예로서, 센서부가 구비될 수 있으며, 센서부는 원격조정기, 예를 들어 리모컨으로부터 상기 입력 신호를 감지하도록 이루어진다. As an example of the interface unit 140, a sensor unit may be provided, and the sensor unit is configured to detect the input signal from a remote controller, for example, a remote control.

네트워크 인터페이스부(미도시)는, 영상 표시 장치(100)를 인터넷망을 포함하는 유/무선 네트워크와 연결하기 위한 인터페이스를 제공한다. 네트워크 인터페이스부는, 유선 네트워크와의 접속을 위해, 이더넷(Ethernet) 단자 등을 구비할 수 있으며, 무선 네트워크와의 접속을 위해, WLAN(Wireless LAN)(Wi-Fi), Wibro(Wireless broadband), Wimax(World Interoperability for Microwave Access), HSDPA(High Speed Downlink Packet Access) 통신 규격 등이 이용될 수 있다. The network interface unit (not shown) provides an interface for connecting the video display device 100 to a wired/wireless network including an Internet network. The network interface unit may include an Ethernet terminal for connection to a wired network, and for connection to a wireless network, WLAN (Wireless LAN) (Wi-Fi), Wibro (Wireless broadband), Wimax (World Interoperability for Microwave Access), HSDPA (High Speed Downlink Packet Access) communication standards, etc. may be used.

네트워크 인터페이스부(미도시)는, 네트워크를 통해, 소정 웹 페이지에 접속할 수 있다. 즉, 네트워크를 통해 소정 웹 페이지에 접속하여, 해당 서버와 데이터를 송신 또는 수신할 수 있다. 그 외, 콘텐츠 제공자 또는 네트워크 운영자가 제공하는 컨텐츠 또는 데이터들을 수신할 수 있다. 즉, 네트워크를 통하여 컨텐츠 제공자 또는 네트워크 제공자로부터 제공되는 영화, 광고, 게임, VOD, 방송 신호 등의 컨텐츠 및 그와 관련된 정보를 수신할 수 있다. 또한, 네트워크 운영자가 제공하는 펌웨어의 업데이트 정보 및 업데이트 파일을 수신할 수 있다. 또한, 인터넷 또는 컨텐츠 제공자 또는 네트워크 운영자에게 데이터들을 송신할 수 있다.The network interface unit (not shown) may access a predetermined web page through a network. That is, by accessing a predetermined web page through a network, the server and data can be transmitted or received. In addition, content or data provided by a content provider or a network operator may be received. That is, content such as movies, advertisements, games, VODs, broadcast signals, and related information provided from a content provider or a network provider may be received through a network. In addition, it is possible to receive update information and an update file of the firmware provided by the network operator. It can also transmit data to the Internet or content provider or network operator.

또한, 네트워크 인터페이스부(미도시)는, 네트워크를 통해, 공중에 공개(open)된 애플리케이션들 중 원하는 애플리케이션을 선택하여 수신할 수 있다. In addition, the network interface unit (not shown) may select and receive a desired application from among applications open to the public through a network.

제어부(150)는 영상 표시 장치(100)의 전반적인 동작을 제어할 수 있다. 보다 구체적으로, 제어부(150)는 영상의 생성 및 출력을 제어하도록 형성된다. 예를 들어, 제어부(150)는 사용자가 선택한 채널 또는 기저장된 채널에 대응하는 RF 방송 신호를 튜닝(tuning)하도록 튜너(110)를 제어할 수 있다. 비록 도면에는 도시되지 않았으나, 제어부(150)는 역다중화부, 영상 처리부, 음성 처리부, 데이터 처리부, OSD(On Screen Display) 생성부 등을 포함할 수 있다. 또한, 제어부(150)는 하드웨어적으로 CPU 나 주변기기 등을 포함할 수 있다.The controller 150 may control the overall operation of the image display device 100. More specifically, the controller 150 is formed to control the generation and output of an image. For example, the controller 150 may control the tuner 110 to tune an RF broadcast signal corresponding to a channel selected by the user or a pre-stored channel. Although not shown in the drawings, the control unit 150 may include a demultiplexer, an image processing unit, an audio processing unit, a data processing unit, an On Screen Display (OSD) generation unit, and the like. In addition, the control unit 150 may include a CPU or a peripheral device in hardware.

제어부(150)는 스트림 신호(TS), 예를 들어, MPEG-2 TS를 역다중화하여 영상 신호, 음성 신호 및 데이터 신호로 분리할 수 있다.The controller 150 may demultiplex the stream signal TS, for example, MPEG-2 TS, and divide it into a video signal, an audio signal, and a data signal.

제어부(150)는 역다중화된 영상 신호에 대한 영상 처리, 예를 들어, 복호화를 수행할 수 있다. 좀더 상세하게, 제어부(150)는 MPEG-2 디코더를 이용하여 MPEG-2 규격의 부호화된 영상 신호를 복호화하고, H.264 디코더를 이용하여 DMB(Digital Multimedia Broadcasting) 방식 또는 DVB-H에 따른 H.264 규격의 부호화된 영상 신호를 복호화할 수 있다. 또한, 제어부(150)는 영상 신호의 밝기(brightness), 틴트(tint) 및 색조(color) 등이 조절되도록 영상 처리할 수 있다. 제어부(150)에 의해 영상 처리된 영상 신호는 디스플레이부(170)로 전달되거나, 외부 출력 단자를 통해 외부 출력 장치(미도시)로 전달될 수 있다.The controller 150 may perform image processing, for example, decoding on the demultiplexed image signal. In more detail, the control unit 150 decodes an MPEG-2 standard coded video signal using an MPEG-2 decoder, and uses an H.264 decoder to convert a digital multimedia broadcasting (DMB) method or DVB-H. It is possible to decode an encoded video signal of the .264 standard. In addition, the controller 150 may process an image so that brightness, tint, and color of an image signal are adjusted. The image signal processed by the control unit 150 may be transmitted to the display unit 170 or may be transmitted to an external output device (not shown) through an external output terminal.

제어부(150)는 역다중화된 음성 신호에 대한 음성 처리, 예를 들어, 복호화를 수행할 수 있다. 좀더 상세하게, 제어부(150)는 MPEG-2 디코더를 이용하여 MPEG-2 규격의 부호화된 음성 신호를 복호화하고, MPEG 4 디코더를 이용하여 DMB 방식에 따른 MPEG 4 BSAC(Bit Sliced Arithmetic Coding) 규격의 부호화된 음성 신호를 복호화하며, AAC 디코더를 이용하여 위성 DMB 방식 또는 DVB-H에 따른 MPEG 2의 AAC(Advanced Audio Codec) 규격의 부호화된 음성 신호를 복호화할 수 있다. 또한, 제어부(150)는 베이스(Base), 트레블(Treble), 음량 조절 등을 처리할 수 있다. 제어부(150)에서 처리된 음성 신호는 오디오 출력부(180), 예를 들어, 스피커로 전달되거나, 외부 출력 장치로 전달될 수 있다.The controller 150 may perform voice processing, for example, decoding on the demultiplexed voice signal. In more detail, the control unit 150 decodes the encoded audio signal of the MPEG-2 standard using an MPEG-2 decoder, and uses the MPEG 4 decoder to comply with the MPEG 4 Bit Sliced Arithmetic Coding (BSAC) standard according to the DMB method. The encoded audio signal may be decoded, and an encoded audio signal of the AAC (Advanced Audio Codec) standard of MPEG 2 according to the satellite DMB method or DVB-H may be decoded using the AAC decoder. In addition, the control unit 150 may process a base, a treble, and a volume control. The audio signal processed by the controller 150 may be transmitted to the audio output unit 180, for example, a speaker, or may be transmitted to an external output device.

제어부(150)는 아날로그 베이스 밴드 영상/음성신호(CVBS/SIF)에 대한 신호 처리를 수행할 수 있다. 여기서, 제어부(150)에 입력되는 아날로그 베이스 밴드 영상/음성신호(CVBS/SIF)는 튜너(110) 또는 신호 입출력부(130)에서 출력된 아날로그 베이스 밴드 영상/음성신호일 수 있다. 신호 처리된 영상 신호는 디스플레이부(170)를 통해 표시되고, 신호 처리된 음성 신호는 오디오 출력부(180)를 통해 출력된다.The controller 150 may perform signal processing on an analog baseband video/audio signal (CVBS/SIF). Here, the analog baseband video/audio signal CVBS/SIF input to the controller 150 may be an analog baseband video/audio signal output from the tuner 110 or the signal input/output unit 130. The signal-processed video signal is displayed through the display unit 170, and the signal-processed audio signal is output through the audio output unit 180.

제어부(150)는 역다중화된 데이터 신호에 대한 데이터 처리, 예를 들어, 복호화를 수행할 수 있다. 여기서, 데이터 신호는 각각의 채널에서 방영되는 방송프로그램의 시작시간, 종료시간 등의 방송정보를 포함하는 EPG(Electronic Program Guide) 정보를 포함할 수 있다. EPG 정보는, 예를 들어, ATSC 방식에서는 TSC-PSIP(ATSC-Program and System Information Protocol) 정보를 포함하고, DVB 방식에서는 DVB-SI(DVB-Service Information) 정보를 포함할 수 있다. ATSC-PSIP 정보 또는 DVB-SI 정보는 MPEG-2 TS의 헤더(4 byte)에 포함될 수 있다.The controller 150 may perform data processing, for example, decoding on the demultiplexed data signal. Here, the data signal may include EPG (Electronic Program Guide) information including broadcast information such as a start time and an end time of a broadcast program aired on each channel. The EPG information may include, for example, ATSC-Program and System Information Protocol (TSC-PSIP) information in the ATSC scheme, and DVB-Service Information (DVB-SI) information in the DVB scheme. ATSC-PSIP information or DVB-SI information may be included in the header (4 bytes) of the MPEG-2 TS.

제어부(150)는 OSD 처리를 위한 제어 동작을 수행할 수 있다. 좀더 상세하게, 제어부(150)는 영상 신호 및 데이터 신호 중 적어도 하나 또는 외부 입력 장치(190)로부터 수신되는 입력 신호에 근거하여 각종 정보를 그래픽(Graphic)이나 텍스트(Text) 형태로 표시하기 위한 OSD 신호를 생성할 수 있다. OSD 신호는 영상 표시 장치(100)의 사용자 인터페이스 화면, 메뉴 화면, 위젯, 아이콘 등의 다양한 데이터를 포함할 수 있다.The controller 150 may perform a control operation for OSD processing. In more detail, the controller 150 is an OSD for displaying various types of information in a graphic or text format based on at least one of an image signal and a data signal or an input signal received from the external input device 190. Can generate signals. The OSD signal may include various data such as a user interface screen, a menu screen, a widget, and an icon of the video display device 100.

저장부(160)는 제어부(150)의 신호 처리 및 제어를 위한 프로그램이 저장될 수도 있고, 신호 처리된 영상 신호, 음성 신호 및 데이터 신호를 저장할 수도 있다. 저장부(160)는 플래시 메모리(flash memory), 하드디스크(hard disk), 멀티미디어 카드 마이크로 타입(multimedia card micro type), 카드 타입의 메모리(예를 들어 SD 또는 XD 메모리 등), 램(random access memory; RAM), SRAM(static random access memory), 롬(read-only memory; ROM), EEPROM(electrically erasable programmable read-only memory), PROM(programmable read-only memory), 자기 메모리, 자기 디스크, 광디스크 중 적어도 하나의 저장매체를 포함할 수 있다.The storage unit 160 may store a program for signal processing and control of the controller 150, or may store a signal-processed video signal, an audio signal, and a data signal. The storage unit 160 includes a flash memory, a hard disk, a multimedia card micro type, a card type memory (for example, SD or XD memory), and a random access memory. memory; RAM), static random access memory (SRAM), read-only memory (ROM), electrically erasable programmable read-only memory (EEPROM), programmable read-only memory (PROM), magnetic memory, magnetic disk, optical disk It may include at least one of the storage media.

디스플레이부(170)는 제어부(150)에 의해 처리된 영상 신호, 데이터 신호, OSD 신호 등을 RGB 신호로 변환하여 구동 신호를 생성할 수 있다. 이를 통하여, 디스플레이부(170)는 영상을 출력하게 된다. 디스플레이부(170)는 플라즈마 디스플레이 패널(Plasma Display Panel: PDP), 액정 디스플레이(Liquid Crystal Display: LCD), 박막 트랜지스터 액정 디스플레이(Thin Film Transistor-Liquid Crystal Display: TFT- LCD), 유기 발광 다이오드(Organic Light Emitting Diode: OLED), 플렉시블 디스플레이(flexible display), 3차원 디스플레이(3D display), 전자잉크 디스플레이(e-ink display) 등의 다양한 형태로 구현될 수 있다. 또한, 디스플레이(180)는 터치 스크린으로 구현되어 입력 장치의 기능도 수행할 수 있다.The display unit 170 may generate a driving signal by converting an image signal, a data signal, an OSD signal, and the like processed by the controller 150 into an RGB signal. Through this, the display unit 170 outputs an image. The display unit 170 includes a plasma display panel (PDP), a liquid crystal display (LCD), a thin film transistor liquid crystal display (TFT-LCD), and an organic light emitting diode (Organic). Light Emitting Diode: OLED), flexible display (flexible display), 3D display (3D display), electronic ink display (e-ink display) can be implemented in various forms. In addition, the display 180 may be implemented as a touch screen to perform a function of an input device.

오디오 출력부(180)는 제어부(150)에 의해 처리된 음성 신호, 예를 들어, 스테레오 신호 또는 5.1 채 신호를 출력한다. 오디오 출력부(180)는 다양한 형태의 스피커로 구현될 수 있다.The audio output unit 180 outputs an audio signal processed by the controller 150, for example, a stereo signal or a 5.1 signal. The audio output unit 180 may be implemented with various types of speakers.

한편, 사용자를 촬영하는 촬영부(미도시)를 더 구비할 수 있다. 촬영부(미도시)는 1 개의 카메라로 구현되는 것이 가능하나, 이에 한정되지 않으며, 복수 개의 카메라로 구현되는 것도 가능하다. 촬영부(미도시)에서 촬영된 영상 정보는 제어부(150)에 입력된다.Meanwhile, a photographing unit (not shown) for photographing a user may be further provided. The photographing unit (not shown) may be implemented with one camera, but is not limited thereto, and may be implemented with a plurality of cameras. Image information captured by the photographing unit (not shown) is input to the control unit 150.

한편, 사용자의 제스처를 감지하기 위해, 상술한 바와 같이, 터치 센서, 음성 센서, 위치 센서, 동작 센서 중 적어도 하나를 구비하는 센싱부(미도시)가 영상 표시 장치(100)에 더 구비될 수 있다. 센싱부(미도시)에서 감지된 신호는 사용자입력 인터페이스부(140)를 통해 제어부(150)로 전달될 수 있다. Meanwhile, in order to detect a user's gesture, as described above, a sensing unit (not shown) including at least one of a touch sensor, a voice sensor, a position sensor, and a motion sensor may be further provided in the image display device 100. have. The signal detected by the sensing unit (not shown) may be transmitted to the control unit 150 through the user input interface unit 140.

제어부(150)는, 촬영부(미도시)로부터 촬영된 영상, 또는 센싱부(미도시)로부터의 감지된 신호를 각각 또는 조합하여 사용자의 제스처를 감지할 수도 있다. The controller 150 may detect a user's gesture by combining or combining an image captured from a photographing unit (not shown) or a signal detected from a sensing unit (not shown).

전원 공급부(미도시)는, 영상 표시 장치(100) 전반에 걸쳐 해당 전원을 공급한다. 특히, 시스템 온 칩(System On Chip,SOC)의 형태로 구현될 수 있는 제어부(150)와, 영상 표시를 위한 디스플레이부(170), 및 오디오 출력을 위한 오디오 출력부(180)에 전원을 공급할 수 있다. A power supply unit (not shown) supplies corresponding power to the entire video display device 100. In particular, power is supplied to the control unit 150 that can be implemented in the form of a System On Chip (SOC), the display unit 170 for displaying an image, and the audio output unit 180 for outputting audio. I can.

이를 위해, 전원 공급부(미도시)는, 교류 전원을 직류 전원으로 변환하는 컨버터(미도시)를 구비할 수 있다. 한편, 예를 들어, 디스플레이부(170)가 다수의 백라이트 램프를 구비하는 액정패널로서 구현되는 경우, 휘도 가변 또는 디밍(dimming) 구동을 위해, PWM 동작이 가능한 인버터(미도시)를 더 구비할 수도 있다.To this end, the power supply unit (not shown) may include a converter (not shown) for converting AC power into DC power. Meanwhile, for example, when the display unit 170 is implemented as a liquid crystal panel having a plurality of backlight lamps, an inverter (not shown) capable of PWM operation may be further provided for luminance variable or dimming driving. May be.

외부 입력 장치(190)는 유선 또는 무선으로 인터페이스부(140)와 연결되며,사용자 입력에 따라 생성되는 입력 신호를 인터페이스부(140)로 전송한다. 외부 입력 장치(190)는 원격조정기, 마우스, 키보드 등을 포함할 수 있다. 원격조정기는 블루투스(Bluetooth), RF 통신, 적외선 통신, UWB(Ultra Wideband), 지그비(ZigBee) 방식 등을 통해 입력 신호를 인터페이스부(140)로 전송할 수 있다. 원격조정기는 공간 원격 제어 장치로서 구현될 수 있다. 공간 원격 제어 장치는 공간에서 본체의 동작을 감지하여 입력 신호를 생성할 수 있다.The external input device 190 is connected to the interface unit 140 by wire or wirelessly, and transmits an input signal generated according to a user input to the interface unit 140. The external input device 190 may include a remote controller, a mouse, and a keyboard. The remote controller may transmit an input signal to the interface unit 140 through Bluetooth, RF communication, infrared communication, UWB (Ultra Wideband), ZigBee method, or the like. The remote controller can be implemented as a space remote control device. The space remote control device may generate an input signal by detecting the motion of the body in space.

영상 표시 장치(100)는 ATSC 방식(8-VSB 방식)의 디지털 방송, DVB-T 방식(COFDM 방식)의 디지털 방송, DVB-C 방식(QAM 방식)의 디지털 방송, DVB-S 방식(QPSK 방식)의 디지털 방송, ISDB-T 방식(BST-OFDM방식)의 디지털 방송 등 중 적어도 하나를 수신 가능한 고정형 디지털 방송 수신기로 구현될 수 있다. 또한, 영상 표시 장치(100)는 지상파 DMB 방식의 디지털 방송, 위성 DMB 방식의 디지털 방송, ATSC-M/H 방식의 디지털 방송, DVB-H 방식(COFDM 방식)의 디지털 방송, 미디어플로(Media Foward Link Only) 방식의 디지털 방송 등 중 적어도 하나를 수신 가능한 이동형 디지털 방송 수신기로 구현될 수 있다. 또한, 영상 표시 장치(100)는 케이블, 위성통신, IPTV용 디지털 방송 수신기로 구현될 수 있다.The video display device 100 includes ATSC system (8-VSB system) digital broadcasting, DVB-T system (COFDM system) digital broadcasting, DVB-C system (QAM system) digital broadcasting, DVB-S system (QPSK system) ) Digital broadcasting, ISDB-T method (BST-OFDM method) digital broadcasting, etc. may be implemented as a fixed digital broadcasting receiver capable of receiving at least one of. In addition, the video display device 100 includes terrestrial DMB type digital broadcasting, satellite DMB type digital broadcasting, ATSC-M/H type digital broadcasting, DVB-H type (COFDM type) digital broadcasting, and media forward. It may be implemented as a mobile digital broadcasting receiver capable of receiving at least one of digital broadcasting of the Link Only method. In addition, the video display device 100 may be implemented as a digital broadcasting receiver for cable, satellite communication, and IPTV.

한편, 본 발명의 영상 표시 장치는 입체영상을 제공하도록 이루어진다. 3-D 또는 3D 라는 용어는 깊이의 착시 효과를 갖는 입체영상(이하, '3D 영상'이라 한다)을 재생하려고 하는 시각적 표현 또는 표시 기술을 설명하는데 사용된다. 좌안 영상과 우안 영상에 대해, 관찰자의 시각 피질(visual cortex)은 두 영상을 하나의 3D 영상으로 해석한다.Meanwhile, the video display device of the present invention is configured to provide a stereoscopic image. The term 3-D or 3D is used to describe a visual expression or display technology that attempts to reproduce a stereoscopic image (hereinafter, referred to as '3D image') having an optical illusion of depth. For the left eye image and the right eye image, the viewer's visual cortex interprets the two images as one 3D image.

3차원(3D) 표시기술은 3D 영상 표시가 가능한 장치에 대해 3D 영상 처리 및 표현의 기술을 채용한다. 선택적으로는, 3D 영상 표시가 가능한 장치는 관찰자에게 3차원 영상을 효과적으로 제공하기 위해 특수한 관찰장치를 사용해야 할 수 있다.Three-dimensional (3D) display technology employs a technology of 3D image processing and expression for a device capable of displaying a 3D image. Optionally, a device capable of displaying a 3D image may need to use a special observation device to effectively provide a 3D image to an observer.

3D 영상 처리 및 표현의 예로는 스테레오스코픽 영상/비디오 캡처, 다수의 카메라를 이용한 다시점 영상/비디오 캡처, 이차원 영상과 깊이 정보의 처리 등이 있다. 3D 영상 표시가 가능한 표시 장치의 예로는, 3D 영상 표시기술을 지원하는 적절한 하드웨어 및/또는 소프트웨어를 구비한 LCD(Liquid Crystal Display), 디지털 TV 화면, 컴퓨터 모니터 등이 있다. 특수한 관찰장치의 예로는, 특수화 안경, 고글, 헤드기어, 안경류(eyewear) 등이 있다.Examples of 3D image processing and representation include stereoscopic image/video capture, multi-view image/video capture using multiple cameras, and processing of two-dimensional images and depth information. Examples of display devices capable of displaying 3D images include liquid crystal displays (LCDs), digital TV screens, and computer monitors provided with appropriate hardware and/or software supporting 3D image display technology. Examples of special observation devices include specialized glasses, goggles, headgear, and eyewear.

구체적으로, 3D 영상 표시기술은, 애너글리프(anaglyph) 입체영상(통상적으로 수동형 적청 안경을 함께 사용), 편광 입체영상(통상적으로 수동형 편광 안경과 함께 사용), 프레임-교대 시퀀싱(alternate-frame sequencing)(통상적으로 능동형 셔터 안경/헤드기어와 함께 사용), 렌티큘러(lenticular) 또는 배리어(barrier) 스크린을 사용한 오토스테레오스코픽 디스플레이(autostereoscopic display) 등이 있다. Specifically, 3D image display technology includes anaglyph stereoscopic images (usually using passive red and red glasses), polarized stereoscopic images (usually used with passive polarized glasses), and alternate-frame sequencing. ) (Usually used with active shutter glasses/headgear), autostereoscopic displays with lenticular or barrier screens.

3D 영상 처리를 위하여, 스테레오 영상 또는 다시점 영상은 MPEG(Moving Picture Experts Group)을 포함하는 여러가지 방법으로 압축 부호화되어 전송될 수 있다. 예를 들어, 스테레오 영상 또는 다시점 영상은 H.264/AVC(Advanced Video Coding) 방식으로 압축 부호화되어 전송될 수 있다. 이때 수신 시스템은 H.264/AVC 코딩 방식의 역으로 수신 영상을 복호하여 3D 영상을 얻을 수 있다. 이 경우에, 상기 수신 시스템은 3D 입체 영상 표시 장치의 일 구성으로서 구비될 수 있다. For 3D image processing, a stereo image or a multi-view image may be compressed and encoded and transmitted by various methods including MPEG (Moving Picture Experts Group). For example, a stereo image or a multiview image may be compressed and encoded using H.264/AVC (Advanced Video Coding) and transmitted. In this case, the receiving system may obtain a 3D image by decoding the received image in the reverse of the H.264/AVC coding scheme. In this case, the receiving system may be provided as a component of a 3D stereoscopic image display device.

이하에서는, 3D 입체 영상 표시 장치(200)의 구성을 도 2를 참조하여 설명한다. Hereinafter, the configuration of the 3D stereoscopic image display apparatus 200 will be described with reference to FIG. 2.

도 2는 본 발명의 실시예들에 따른 영상 처리 장치가 적용된 3D 영상 표시 장치의 구성을 나타낸 구성도이다.2 is a block diagram illustrating a configuration of a 3D image display device to which an image processing device according to embodiments of the present invention is applied.

도 2에 도시한 바와 같이, 본 발명의 실시예들에 의한 3D 영상 표시 장치(200)는 튜너(210), 복조부(220), 외부장치 인터페이스부(230), 네트워크 인터페이스부(235), 저장부(240), 사용자입력 인터페이스부(250), 제어부(270), 디스플레이부(280), 오디오 출력부(285), 및 3D 시청장치(295)를 포함할 수 있다. 이하, 도 1과 동일한 구성에 대하여는 3D 영상의 출력과 관련된 부분을 중점으로 설명하며, 전술한 부분과 중복되는 부분은 생략한다.As shown in FIG. 2, the 3D image display apparatus 200 according to the embodiments of the present invention includes a tuner 210, a demodulation unit 220, an external device interface unit 230, a network interface unit 235, and A storage unit 240, a user input interface unit 250, a control unit 270, a display unit 280, an audio output unit 285, and a 3D viewing device 295 may be included. Hereinafter, for the same configuration as in FIG. 1, a portion related to the output of a 3D image will be mainly described, and a portion overlapping with the above-described portion will be omitted.

튜너(튜너부)(210)는, 방송 신호를 수신하여 해당 신호를 검파하고 오류를 정정하여 좌안 및 우안 영상에 대한, 트랜스포트 스트림(Trasport Stream)을 생성한다. The tuner (tuner unit) 210 receives a broadcast signal, detects a corresponding signal, corrects an error, and generates a transport stream for left-eye and right-eye images.

복조부(220)는 기준시점 비디오를 디코딩하는 제1 디코더이고, 확장시점 비디오를 디코딩하는 제2 디코더로 이루어질 수 있다. 이 경우에, 역다중화부에 의하여, 비디오 스트림은 기준시점 비디오에 해당하면 제1 디코더로 출력되고, 확장시점 비디오에 해당하면 제2 디코더로 출력된다.The demodulator 220 is a first decoder that decodes a reference view video, and may include a second decoder that decodes an extended view video. In this case, by the demultiplexer, the video stream is output to the first decoder if it corresponds to the reference view video, and to the second decoder if it corresponds to the extended view video.

외부장치 인터페이스부(230)는, 접속된 외부 장치와 데이터를 송신 또는 수신할 수 있다. 이를 위해, 외부장치 인터페이스부(230)는, A/V 입출력부(도시하지 않음) 또는 무선 통신부(도시하지 않음)를 포함할 수 있다. The external device interface unit 230 may transmit or receive data with a connected external device. To this end, the external device interface unit 230 may include an A/V input/output unit (not shown) or a wireless communication unit (not shown).

외부장치 인터페이스부(230)는, DVD(Digital Versatile Disk), 블루레이(Blu ray), 게임기기, 카메라, 캠코더, 컴퓨터(노트북) 등과 같은 외부 장치(도시하지 않음)와 유/무선으로 접속될 수 있다. 외부장치 인터페이스부(230)는 접속된 외부 장치를 통하여 외부에서 입력되는 영상, 음성 또는 데이터 신호를 영상표시장치(200)의 제어부(270)로 전달한다. 또한, 제어부(270)에서 처리된 영상, 음성 또는 데이터 신호를 연결된 외부 장치로 출력할 수 있다. 이를 위해, 외부장치 인터페이스부(230)는, A/V 입출력부(도시하지 않음) 또는 무선 통신부(도시하지 않음)를 포함할 수 있다. The external device interface unit 230 may be connected to an external device (not shown) such as a digital versatile disk (DVD), a Blu ray, a game device, a camera, a camcorder, a computer (laptop), etc. by wire/wireless. I can. The external device interface unit 230 transmits a video, audio or data signal input from the outside through a connected external device to the control unit 270 of the image display device 200. In addition, the image, audio, or data signal processed by the controller 270 may be output to a connected external device. To this end, the external device interface unit 230 may include an A/V input/output unit (not shown) or a wireless communication unit (not shown).

A/V 입출력부는, 외부 장치의 영상 및 음성 신호를 영상표시장치(200)로 입력할 수 있도록, USB 단자, CVBS(Composite Video Banking Sync) 단자, 컴포넌트 단자, S-비디오 단자(아날로그), DVI(Digital Visual Interface) 단자, HDMI(High Definition Multimedia Interface) 단자, RGB 단자, D-SUB 단자 등을 포함할 수 있다. The A/V input/output unit is a USB terminal, CVBS (Composite Video Banking Sync) terminal, component terminal, S-video terminal (analog), DVI so that video and audio signals from external devices can be input to the video display device 200. It may include a (Digital Visual Interface) terminal, a High Definition Multimedia Interface (HDMI) terminal, an RGB terminal, and a D-SUB terminal.

무선 통신부는, 다른 전자기기와 근거리 무선 통신을 수행할 수 있다. 영상표시장치(200)는 블루투스(Bluetooth), RFID(Radio Frequency Identification), 적외선 통신(IrDA, infrared Data Association), UWB(Ultra Wideband), 지그비(ZigBee), DLNA(Digital Living Network Alliance) 등의 통신 규격에 따라 다른 전자기기와 네트워크 연결될 수 있다. The wireless communication unit may perform short-range wireless communication with another electronic device. The image display device 200 communicates with Bluetooth, Radio Frequency Identification (RFID), infrared data association (IrDA), Ultra Wideband (UWB), ZigBee, and Digital Living Network Alliance (DLNA). Depending on the standard, other electronic devices can be connected to the network.

또한, 외부장치 인터페이스부(230)는, 다양한 셋탑 박스와 상술한 각종 단자 중 적어도 하나를 통해 접속되어, 셋탑 박스와 입력/출력 동작을 수행할 수도 있다. In addition, the external device interface unit 230 may be connected through various set-top boxes and at least one of the aforementioned various terminals to perform input/output operations with the set-top box.

한편, 외부장치 인터페이스부(230)는, 3D 시청장치(295)와 데이터를 송수신할 수 있다. Meanwhile, the external device interface unit 230 may transmit and receive data to and from the 3D viewing device 295.

네트워크 인터페이스부(235)는, 영상표시장치(200)를 인터넷망을 포함하는 유/무선 네트워크와 연결하기 위한 인터페이스를 제공한다. 네트워크 인터페이스부(235)는, 유선 네트워크와의 접속을 위해, 이더넷(Ethernet) 단자 등을 구비할 수 있으며, 무선 네트워크와의 접속을 위해, WLAN(Wireless LAN)(Wi-Fi), Wibro(Wireless broadband), Wimax(World Interoperability for Microwave Access), HSDPA(High Speed Downlink Packet Access) 통신 규격 등이 이용될 수 있다. The network interface unit 235 provides an interface for connecting the image display device 200 to a wired/wireless network including an Internet network. The network interface unit 235 may include an Ethernet terminal or the like for connection to a wired network, and for connection to a wireless network, WLAN (Wireless LAN) (Wi-Fi), Wibro (Wireless broadband), World Interoperability for Microwave Access (Wimax), High Speed Downlink Packet Access (HSDPA) communication standards, etc. may be used.

네트워크 인터페이스부(235)는, 네트워크를 통해, 인터넷 또는 콘텐츠 제공자 또는 네트워크 운영자가 제공하는 콘텐츠 또는 데이터들을 수신할 수 있다. 즉, 네트워크를 통하여 인터넷, 콘텐츠 제공자 등으로부터 제공되는 영화, 광고, 게임, VOD, 방송 신호 등의 콘텐츠 및 그와 관련된 정보를 수신할 수 있다. 또한, 네트워크 운영자가 제공하는 펌웨어의 업데이트 정보 및 업데이트 파일을 수신할 수 있다. 또한, 인터넷 또는 콘텐츠 제공자 또는 네트워크 운영자에게 데이터들을 송신할 수 있다. The network interface unit 235 may receive content or data provided by the Internet or a content provider or a network operator through a network. That is, content such as movies, advertisements, games, VODs, broadcast signals, etc. provided from the Internet and content providers, and related information may be received through the network. In addition, it is possible to receive update information and an update file of the firmware provided by the network operator. It can also transmit data to the Internet or content provider or network operator.

또한, 네트워크 인터페이스부(235)는, 예를 들어, IP(internet Protocol) TV와 접속되어, 양방향 통신이 가능하도록, IPTV용 셋탑 박스에서 처리된 영상, 음성 또는 데이터 신호를 수신하여 제어부(270)로 전달할 수 있으며, 제어부(270)에서 처리된 신호들을 IPTV용 셋탑 박스로 전달할 수 있다.In addition, the network interface unit 235 is connected to an Internet Protocol (IP) TV, for example, to enable two-way communication, by receiving a video, audio, or data signal processed by an IPTV set-top box, and the controller 270 It can be transferred to, and the signals processed by the control unit 270 can be transferred to the IPTV set-top box.

한편, 상술한 IPTV는, 전송네트워크의 종류에 따라 ADSL-TV, VDSL-TV, FTTH-TV 등을 포함하는 의미일 수 있으며, TV over DSL, Video over DSL, TV overIP(TVIP), Broadband TV(BTV) 등을 포함하는 의미일 수 있다. 또한, IPTV는 인터넷 접속이 가능한 인터넷 TV, 풀브라우징 TV를 포함하는 의미일 수도 있다.Meanwhile, the above-described IPTV may mean including ADSL-TV, VDSL-TV, FTTH-TV, etc., depending on the type of transmission network, and TV over DSL, Video over DSL, TV overIP (TVIP), and Broadband TV ( BTV) or the like. In addition, IPTV may also mean Internet TV and full browsing TV with Internet access.

저장부(240)는, 제어부(270) 내의 각 신호 처리 및 제어를 위한 프로그램이 저장될 수도 있고, 신호 처리된 영상, 음성 또는 데이터 신호를 저장할 수도 있다. The storage unit 240 may store a program for processing and controlling each signal in the control unit 270 or may store a signal-processed image, audio, or data signal.

또한, 저장부(240)는 외부장치 인터페이스부(230)로 입력되는 영상, 음성 또는 데이터 신호의 임시 저장을 위한 기능을 수행할 수도 있다. 또한, 저장부(240)는, 채널 맵 등의 채널 기억 기능을 통하여 소정 방송 채널에 관한 정보를 저장할 수 있다. In addition, the storage unit 240 may perform a function for temporary storage of an image, audio, or data signal input to the external device interface unit 230. In addition, the storage unit 240 may store information on a predetermined broadcast channel through a channel storage function such as a channel map.

저장부(240)는 플래시 메모리 타입(flash memory type), 하드디스크 타입(hard disk type), 멀티미디어 카드 마이크로 타입(multimedia card micro type), 카드 타입의 메모리(예를 들어 SD 또는 XD 메모리 등), 램, 롬(EEPROM 등) 중 적어도 하나의 타입의 저장매체를 포함할 수 있다. 영상표시장치(200)는, 저장부(240) 내에 저장되어 있는 파일(동영상 파일, 정지영상 파일, 음악 파일, 문서 파일 등)을 재생하여 사용자에게 제공할 수 있다. 도 2는 저장부(240)가 제어부(270)와 별도로 구비된 실시예를 도시하고 있으나, 본 발명의 범위는 이에 한정되지 않는다. 저장부(240)는 제어부(270) 내에 포함될 수 있다. The storage unit 240 is a flash memory type, a hard disk type, a multimedia card micro type, a card type memory (eg, SD or XD memory, etc.), It may include at least one type of storage medium among RAM and ROM (EEPROM, etc.). The image display device 200 may reproduce and provide a file (movie file, still image file, music file, document file, etc.) stored in the storage unit 240 to a user. 2 shows an embodiment in which the storage unit 240 is provided separately from the control unit 270, but the scope of the present invention is not limited thereto. The storage unit 240 may be included in the control unit 270.

사용자입력 인터페이스부(250)에 대한 설명은 도 1을 참조하여, 전술한 인터페이스부(140)의 설명으로 대체한다.The description of the user input interface unit 250 is replaced with the description of the interface unit 140 described above with reference to FIG. 1.

제어부(270)는, 튜너(210) 또는 복조부(220) 또는 외부장치 인터페이스부(230)를 통하여, 입력되는 스트림을 역다중화하거나, 역다중화된 신호들을 처리하여, 영상 또는 음성 출력을 위한 신호를 생성 및 출력할 수 있다. The control unit 270 demultiplexes the input stream through the tuner 210 or the demodulation unit 220 or the external device interface unit 230 or processes the demultiplexed signals to output a video or audio signal. Can be created and printed.

제어부(270)에서 영상 처리된 영상 신호는 디스플레이부(280)로 입력되어, 해당 영상 신호에 대응하는 영상으로 표시될 수 있다. 또한, 제어부(270)에서 영상 처리된 영상 신호는 외부장치 인터페이스부(230)를 통하여 외부 출력장치로 입력될 수 있다. The image signal processed by the control unit 270 may be input to the display unit 280 and displayed as an image corresponding to the corresponding image signal. In addition, the image signal processed by the controller 270 may be input to an external output device through the external device interface unit 230.

제어부(270)에서 처리된 음성 신호는 오디오 출력부(285)로 음향 출력될 수 있다. 또한, 제어부(270)에서 처리된 음성 신호는 외부장치 인터페이스부(230)를 통하여 외부 출력장치로 입력될 수 있다. The audio signal processed by the controller 270 may be sound output to the audio output unit 285. In addition, the voice signal processed by the controller 270 may be input to an external output device through the external device interface unit 230.

제어부(270)는 역다중화부, 영상처리부 등을 포함할 수 있다. 제어부(270)는, 영상표시장치(200) 내의 전반적인 동작을 제어할 수 있다. 예를 들어, 제어부(270)는 튜너(210)를 제어하여, 사용자가 선택한 채널 또는 기저장된 채널에 해당하는 RF 방송을 선택(Tuning)하도록 제어할 수 있다. The control unit 270 may include a demultiplexer, an image processing unit, and the like. The controller 270 may control the overall operation of the image display device 200. For example, the controller 270 may control the tuner 210 to select (Tuning) an RF broadcast corresponding to a channel selected by the user or a pre-stored channel.

또한, 제어부(270)는 사용자입력 인터페이스부(250)를 통하여 입력된 사용자 명령 또는 내부 프로그램에 의하여 영상표시장치(200)를 제어할 수 있다. In addition, the controller 270 may control the image display apparatus 200 according to a user command or an internal program input through the user input interface unit 250.

예를 들어, 제어부(270)는, 사용자입력 인터페이스부(250)를 통하여 수신한 소정 채널 선택 명령에 따라 선택한 채널의 신호가 입력되도록 튜너(210)를 제어한다. 그리고, 선택한 채널의 영상, 음성 또는 데이터 신호를 처리한다. 제어부(270)는, 사용자가 선택한 채널 정보 등이 처리한 영상 또는 음성신호와 함께 디스플레이부(280) 또는 오디오 출력부(285)를 통하여 출력될 수 있도록 한다. For example, the controller 270 controls the tuner 210 to input a signal of a selected channel according to a predetermined channel selection command received through the user input interface unit 250. Then, the video, audio, or data signal of the selected channel is processed. The control unit 270 enables channel information selected by the user to be output through the display unit 280 or the audio output unit 285 together with the processed image or audio signal.

다른 예로, 제어부(270)는, 사용자입력 인터페이스부(250)를 통하여 수신한 외부장치 영상 재생 명령에 따라, 외부장치 인터페이스부(230)를 통하여 입력되는 외부 장치, 예를 들어, 카메라 또는 캠코더로부터의 영상 신호 또는 음성 신호가 디스플레이부(280) 또는 오디오 출력부(285)를 통해 출력될 수 있도록 한다. As another example, the control unit 270 may be configured from an external device input through the external device interface unit 230, for example, a camera or camcorder, according to an external device image playback command received through the user input interface unit 250. The video signal or audio signal of can be output through the display unit 280 or the audio output unit 285.

제어부(270)는, 영상을 표시하도록 디스플레이부(280)를 제어할 수 있다. 예를 들어, 튜너(210)를 통해 입력되는 방송 영상, 외부장치 인터페이스부(230)를 통해 입력되는 외부 입력 영상 또는 네트워크 인터페이스부(235)를 통해 입력되는 영상 또는 저장부(240)에 저장된 영상을 디스플레이부(280)에 표시하도록 제어할 수 있다. 이때, 디스플레이부(280)에 표시되는 영상은, 정지 영상 또는 동영상일 수 있으며, 2D 영상 또는 3D 영상일 수 있다.The controller 270 may control the display unit 280 to display an image. For example, a broadcast image input through the tuner 210, an external input image input through the external device interface unit 230, an image input through the network interface unit 235, or an image stored in the storage unit 240 May be controlled to be displayed on the display unit 280. In this case, the image displayed on the display unit 280 may be a still image or a moving image, and may be a 2D image or a 3D image.

제어부(270)는 디스플레이부(280)에 표시되는 영상 중에, 소정 오브젝트에 대해 3D 오브젝트로 생성하여 표시되도록 한다. 예를 들어, 오브젝트는, 접속된 웹 화면(신문, 잡지 등), EPG(Electronic Program Guide), 다양한 메뉴, 위젯, 아이콘, 정지 영상, 동영상, 텍스트 중 적어도 하나일 수 있다. 이러한 3D 오브젝트는, 디스플레이부(280)에 표시되는 영상과 다른 깊이를 가지도록 처리될 수 있다. 제어부(270)는 3D 오브젝트가 디스플레이부(280)에 표시되는 영상에 비해 돌출되어 보이도록 처리할 수 있다. The control unit 270 generates and displays a predetermined object as a 3D object among the images displayed on the display unit 280. For example, the object may be at least one of a connected web screen (newspaper, magazine, etc.), EPG (Electronic Program Guide), various menus, widgets, icons, still images, moving pictures, and text. Such a 3D object may be processed to have a depth different from that of an image displayed on the display unit 280. The controller 270 may process the 3D object to protrude compared to the image displayed on the display unit 280.

제어부(270)는, 촬영부(도시하지 않음)로부터 촬영된 영상에 기초하여, 사용자의 위치를 인식한다. 예를 들어, 사용자와 영상표시장치(200)간의 거리(z축 좌표)를 파악할 수 있다. 그 외, 사용자 위치에 대응하는 디스플레이부(280) 내의 x축 좌표, 및 y축 좌표를 파악할 수 있다.The control unit 270 recognizes a user's location based on an image captured by a photographing unit (not shown). For example, a distance (z-axis coordinate) between the user and the image display device 200 may be determined. In addition, the x-axis coordinates and the y-axis coordinates in the display unit 280 corresponding to the user's location may be identified.

한편, 도 2에 도시하지 않았지만, 채널 신호 또는 외부 입력 신호에 대응하는 썸네일 영상을 생성하는 채널 브라우징 처리부가 더 구비되는 것도 가능하다. 채널 브라우징 처리부는, 복조부(220)에서 출력한 스트림 신호(TS) 또는 외부장치 인터페이스부(230)에서 출력한 스트림 신호 등을 입력받아, 입력되는 스트림 신호로부터 영상을 추출하여 썸네일 영상을 생성할 수 있다. 생성된 썸네일 영상은 그대로 또는 부호화되어 제어부(270)로 입력될 수 있다. 또한, 생성된 썸네일 영상은 스트림 형태로 부호화되어 제어부(270)로 입력되는 것도 가능하다. Meanwhile, although not shown in FIG. 2, a channel browsing processing unit for generating a thumbnail image corresponding to a channel signal or an external input signal may be further provided. The channel browsing processing unit receives the stream signal TS output from the demodulator 220 or the stream signal output from the external device interface unit 230, and extracts an image from the input stream signal to generate a thumbnail image. I can. The generated thumbnail image may be input to the controller 270 as it is or after being encoded. In addition, the generated thumbnail image may be encoded in a stream format and input to the controller 270.

제어부(270)는 입력된 썸네일 영상을 이용하여 복수의 썸네일 영상을 구비하는 썸네일 리스트를 디스플레이부(280)에 표시할 수 있다. 이때의 썸네일 리스트는, 디스플레이부(280)에 소정 영상을 표시한 상태에서 일부 영역에 표시되는 간편 보기 방식으로 표시되거나, 디스플레이부(280)의 대부분 영역에 표시되는 전체 보기 방식으로 표시될 수 있다. 이러한 썸네일 리스트 내의 썸네일 영상은 순차적으로 업데이트 될 수 있다. The controller 270 may display a thumbnail list including a plurality of thumbnail images on the display unit 280 by using the input thumbnail images. In this case, the thumbnail list may be displayed in a simple view method displayed in a partial area while a predetermined image is displayed on the display unit 280, or may be displayed in a full view method displayed in most areas of the display unit 280. . The thumbnail images in the thumbnail list may be sequentially updated.

디스플레이부(280)는, 제어부(270)에서 처리된 영상 신호, 데이터 신호, OSD 신호, 제어 신호 또는 외부장치 인터페이스부(230)에서 수신되는 영상 신호, 데이터 신호, 제어 신호 등을 변환하여 구동 신호를 생성한다. The display unit 280 converts an image signal, a data signal, an OSD signal, a control signal or an image signal, a data signal, and a control signal received from the external device interface unit 230 processed by the control unit 270 to convert a driving signal Create

디스플레이부(280)는 PDP, LCD, OLED, 플렉시블 디스플레이(flexible display)등이 가능하며, 특히, 본 발명의 실시예에 따라, 3차원 디스플레이(3D display)가 가능할 수 있다. The display unit 280 may be a PDP, an LCD, an OLED, a flexible display, or the like, and in particular, according to an embodiment of the present invention, a 3D display may be possible.

3차원 영상 시청을 위해 디스플레이부(280)는, 추가 디스플레이 방식과 단독 디스플레이 방식으로 나뉠 수 있다. 단독 디스플레이 방식은, 별도의 추가 디스플레이, 예를 들어 안경(glass) 등이 없이, 디스플레이부(280)(무안경 3D 디스플레이) 단독으로 3D 영상을 구현할 수 있는 것으로서, 그 예로, 렌티큘라 방식, 파라랙스 베리어(parallax barrier) 등 다양한 방식이 적용될 수 있다. 상기 추가 디스플레이 방식은, 디스플레이부(280) 외에 추가 디스플레이를 사용하여 3D 영상을 구현할 수 있는 것으로서, 그 예로, 헤드 마운트 디스플레이(HMD) 타입, 안경 타입 등 다양한 방식이 적용될 수 있다. For viewing a 3D image, the display unit 280 may be divided into an additional display method and an independent display method. The single display method is a display unit 280 (no glasses 3D display) alone, without a separate additional display, for example, without glasses, can implement a 3D image, for example, lenticular method, para Various methods, such as a parallax barrier, can be applied. The additional display method may implement a 3D image by using an additional display in addition to the display unit 280, and various methods such as a head mounted display (HMD) type and a glasses type may be applied.

상기 안경 타입은, 편광 안경 타입 등의 패시브(passive) 방식과, 셔터 글래스(ShutterGlass) 타입 등의 액티브(active) 방식으로 다시 나뉠 수 있다. 한편, 헤드 마운트 디스플레이 타입에서도 패시브 방식과 액티브 방식으로 나뉠 수 있다.The glasses type may be further divided into a passive method such as a polarizing glasses type and an active method such as a shutter glass type. Meanwhile, the head mounted display type can also be divided into a passive type and an active type.

입체 영상을 시청하기 위한 3D 시청 장치(3D용 글래스)(295)는, 패시브 방식의 편광 글래스 또는 액티브 방식의 셔트 글래스를 포함할 수 있으며, 상술한 헤드 마운트 타입도 포함하는 개념으로 기술된다. The 3D viewing apparatus (glass for 3D) 295 for viewing a stereoscopic image may include a passive polarizing glass or an active shirt glass, and is described as a concept including the aforementioned head mount type.

한편, 디스플레이부(280)는, 터치 스크린으로 구성되어 출력 장치 이외에 입력 장치로 사용되는 것도 가능하다.Meanwhile, the display unit 280 may be configured as a touch screen and used as an input device other than an output device.

오디오 출력부(285)는, 제어부(270)에서 음성 처리된 신호, 예를 들어, 스테레오 신호, 3.1 채널 신호 또는 5.1 채널 신호를 입력 받아 음성으로 출력한다. 음성 출력부(185)는 다양한 형태의 스피커로 구현될 수 있다.The audio output unit 285 receives a signal processed by the control unit 270, for example, a stereo signal, a 3.1 channel signal, or a 5.1 channel signal, and outputs it as audio. The audio output unit 185 may be implemented with various types of speakers.

한편, 사용자의 제스처를 감지하기 위해, 상술한 바와 같이, 터치 센서, 음성 센서, 위치 센서, 동작 센서 중 적어도 하나를 구비하는 센싱부(도시하지 않음)가 영상표시장치(200)에 더 구비될 수 있다. 센싱부(도시하지 않음)에서 감지된 신호는 사용자입력 인터페이스부(150)를 통해 제어부(170)로 전달된다. Meanwhile, in order to detect the user's gesture, as described above, a sensing unit (not shown) including at least one of a touch sensor, a voice sensor, a position sensor, and a motion sensor is further provided in the image display device 200. I can. The signal detected by the sensing unit (not shown) is transmitted to the control unit 170 through the user input interface unit 150.

제어부(270)는, 촬영부(도시하지 않음)로부터 촬영된 영상, 또는 센싱부(도시하지 않음)로부터의 감지된 신호를 각각 또는 조합하여 사용자의 제스처를 감지할 수 있다. The controller 270 may detect a user's gesture by combining or combining an image captured from a photographing unit (not shown) or a signal detected from a sensing unit (not shown).

원격제어장치(260)는, 사용자 입력을 사용자입력 인터페이스부(250)로 송신한다. 이를 위해, 원격제어장치(260)는, 블루투스(Bluetooth), RF(Radio Frequency) 통신, 적외선(IR) 통신, UWB(Ultra Wideband), 지그비(ZigBee) 방식 등을 사용할 수 있다. 또한, 원격제어장치(260)는, 사용자입력 인터페이스부(250)에서 출력한 영상, 음성 또는 데이터 신호 등을 수신하여, 이를 원격제어장치(260)에서 표시하거나 음성 출력할 수 있다.The remote control device 260 transmits a user input to the user input interface unit 250. To this end, the remote control device 260 may use Bluetooth, Radio Frequency (RF) communication, infrared (IR) communication, Ultra Wideband (UWB), ZigBee, or the like. In addition, the remote control device 260 may receive an image, audio, or data signal output from the user input interface unit 250 and display or output an audio signal on the remote control device 260.

상술한 영상표시장치(200)는, 고정형으로서 ATSC 방식(7-VSB 방식)의 디지털 방송, DVB-T 방식(COFDM 방식)의 디지털 방송, ISDB-T 방식(BST-OFDM방식)의 디지털 방송 등 중 적어도 하나를 수신 가능한 디지털 방송 수신기일 수 있다. 또한, 이동형으로서 지상파 DMB 방식의 디지털 방송, 위성 DMB 방식의 디지털 방송, ATSC-M/H 방식의 디지털 방송, DVB-H 방식(COFDM 방식)의 디지털 방송, 미디어플로(Media Foward Link Only) 방식의 디지털 방송 등 중 적어도 하나를 수신 가능한 디지털 방송 수신기일 수 있다. 또한, 케이블, 위성통신, IPTV 용 디지털 방송 수신기일 수도 있다.The above-described video display device 200 is a fixed type, such as ATSC system (7-VSB system) digital broadcasting, DVB-T system (COFDM system) digital broadcasting, ISDB-T system (BST-OFDM system) digital broadcasting, etc. It may be a digital broadcast receiver capable of receiving at least one of. In addition, as a mobile type, terrestrial DMB method digital broadcasting, satellite DMB method digital broadcasting, ATSC-M/H method digital broadcasting, DVB-H method (COFDM method) digital broadcasting, media flow (Media Forward Link Only) method It may be a digital broadcasting receiver capable of receiving at least one of digital broadcasting and the like. In addition, it may be a digital broadcasting receiver for cable, satellite communication, and IPTV.

본 명세서에서 기술되는 영상표시장치는, TV 수상기, 휴대폰, 스마트 폰(smart phone), 노트북 컴퓨터(notebook computer), 디지털 방송용 단말기, PDA(Personal Digital Assistants), PMP(Portable Multimedia Player) 등이 포함될 수 있다.The image display device described in this specification may include a TV receiver, a mobile phone, a smart phone, a notebook computer, a digital broadcasting terminal, a personal digital assistant (PDA), a portable multimedia player (PMP), and the like. have.

도 2에 도시된 영상표시장치(200)의 구성도는 본 발명의 실시예들을 위한 구성도이다. 구성도의 각 구성요소는 실제 구현되는 영상표시장치(200)의 사양에 따라 통합, 추가, 또는 생략될 수 있다. 즉, 필요에 따라 2 이상의 구성요소가 하나의 구성요소로 합쳐지거나, 혹은 하나의 구성요소가 2 이상의 구성요소로 세분되어 구성될 수 있다. 또한, 각 블록에서 수행하는 기능은 본 발명의 실시예를 설명하기 위한 것이며, 그 구체적인 동작이나 장치는 본 발명의 권리범위를 제한하지 아니한다.The configuration diagram of the image display device 200 shown in FIG. 2 is a configuration diagram for embodiments of the present invention. Each component of the configuration diagram may be integrated, added, or omitted according to the specifications of the image display device 200 that is actually implemented. That is, if necessary, two or more components may be combined into one component, or one component may be subdivided into two or more components. In addition, the functions performed by each block are for explaining the embodiments of the present invention, and specific operations or devices thereof do not limit the scope of the present invention.

상기 영상 표시 장치(200)에서 복호화된 영상 신호는, 다양한 포맷의 3D 영상 신호일 수 있다. 예를 들면, 색차 영상(color image) 및 깊이 영상(depth image)으로 이루어진 3D 영상 신호일 수 있으며, 또는 복수 시점 영상 신호로 이루어진 3D 영상 신호 등일 수 있다. 복수 시점 영상 신호는, 예를 들어, 좌안 영상 신호와 우안 영상 신호를 포함할 수 있다. 여기서, 3D 영상 신호의 포맷은, 좌안 영상 신호(L)와 우안 영상 신호(R)를 좌,우로 배치하는 사이드 바이 사이드(Side by Side) 포맷, 상,하로 배치하는 탑 다운(Top / Down) 포맷, 시분할로 배치하는 프레임 시퀀셜(Frame Sequential) 포맷, 좌안 영상 신호와 우안 영상 신호를 라인 별로 혼합하는 인터레이스 (Interlaced) 포맷, 좌안 영상 신호와 우안 영상 신호를 박스 별로 혼합하는 체커 박스(Checker Box) 포맷 등일 수 있다. The video signal decoded by the video display device 200 may be a 3D video signal of various formats. For example, it may be a 3D image signal composed of a color image and a depth image, or may be a 3D image signal composed of a multi-view image signal. The multi-view image signal may include, for example, a left-eye image signal and a right-eye image signal. Here, the format of the 3D video signal is a side-by-side format in which the left-eye video signal (L) and the right-eye video signal (R) are arranged left and right, and the top-down (Top / Down) format is arranged up and down. Format, Frame Sequential format arranged by time division, Interlaced format that mixes left-eye video signal and right-eye video signal by line, Checker Box that mixes left-eye video signal and right-eye video signal by box It may be a format, etc.

또한, 상기에서 설명된 영상 표시 장치는 이동 단말기에도 적용될 수 있다. 상기 이동 단말기에는 휴대폰, 스마트 폰(smart phone), 노트북 컴퓨터(laptop computer), 디지털방송용 단말기, PDA(personal digital assistants), PMP(portable multimedia player), 네비게이션, 슬레이트 PC(slate PC), 태블릿 PC(tablet PC), 울트라북(ultrabook) 등이 포함될 수 있다. In addition, the video display device described above can also be applied to a mobile terminal. The mobile terminal includes a mobile phone, a smart phone, a laptop computer, a digital broadcasting terminal, a personal digital assistant (PDA), a portable multimedia player (PMP), a navigation system, a slate PC, and a tablet PC. Tablet PC), ultrabook, etc.

영상 표시 장치가 이동 단말기로서 사용되는 경우에는 무선 통신부가 추가될 수 있다.When the video display device is used as a mobile terminal, a wireless communication unit may be added.

무선 통신부는 영상 표시 장치(100)와 무선 통신 시스템 사이 또는 이동 단말기와 이동 단말기가 위치한 네트워크 사이의 무선 통신을 가능하게 하는 하나 이상의 모듈을 포함할 수 있다. 예를 들어, 무선 통신부는 방송 수신 모듈, 이동통신 모듈 무선 인터넷 모듈, 근거리 통신 모듈 및 위치정보 모듈 중 적어도 하나를 포함할 수 있다.The wireless communication unit may include one or more modules that enable wireless communication between the image display device 100 and a wireless communication system or between a mobile terminal and a network in which the mobile terminal is located. For example, the wireless communication unit may include at least one of a broadcast reception module, a mobile communication module, a wireless Internet module, a short-range communication module, and a location information module.

방송 수신 모듈은 방송 채널을 통하여 외부의 방송 관리 서버로부터 방송 신호 및/또는 방송 관련된 정보를 수신한다. The broadcast receiving module receives a broadcast signal and/or broadcast-related information from an external broadcast management server through a broadcast channel.

상기 방송 채널은 위성 채널, 지상파 채널을 포함할 수 있다. 상기 방송 관리 서버는, 방송 신호 및/또는 방송 관련 정보를 생성하여 송신하는 서버 또는 기 생성된 방송 신호 및/또는 방송 관련 정보를 제공받아 단말기에 송신하는 서버를 의미할 수 있다. 상기 방송 신호는, TV 방송 신호, 라디오 방송 신호, 데이터 방송 신호를 포함할 뿐만 아니라, TV 방송 신호 또는 라디오 방송 신호에 데이터 방송 신호가 결합한 형태의 방송 신호도 포함할 수 있다. The broadcast channel may include a satellite channel and a terrestrial channel. The broadcast management server may mean a server that generates and transmits a broadcast signal and/or broadcast-related information, or a server that receives and transmits a previously-generated broadcast signal and/or broadcast-related information to a terminal. The broadcast signal may include not only a TV broadcast signal, a radio broadcast signal, and a data broadcast signal, but also a broadcast signal in a form in which a data broadcast signal is combined with a TV broadcast signal or a radio broadcast signal.

상기 방송 관련 정보는, 방송 채널, 방송 프로그램 또는 방송 서비스 제공자에 관련한 정보를 의미할 수 있다. 상기 방송 관련 정보는, 이동통신망을 통하여도 제공될 수 있다. 이러한 경우에는 상기 이동통신 모듈(112)에 의해 수신될 수 있다. The broadcast related information may mean information related to a broadcast channel, a broadcast program, or a broadcast service provider. The broadcast-related information may also be provided through a mobile communication network. In this case, it may be received by the mobile communication module 112.

상기 방송 관련 정보는 다양한 형태로 존재할 수 있다. 예를 들어, DMB(Digital Multimedia Broadcasting)의 EPG(Electronic Program Guide) 또는 DVB-H(Digital Video Broadcast-Handheld)의 ESG(Electronic Service Guide) 등의 형태로 존재할 수 있다.The broadcast-related information may exist in various forms. For example, it may exist in the form of an Electronic Program Guide (EPG) of Digital Multimedia Broadcasting (DMB) or an Electronic Service Guide (ESG) of Digital Video Broadcast-Handheld (DVB-H).

상기 방송 수신 모듈은, 예를 들어, DMB-T(Digital Multimedia Broadcasting-Terrestrial), DMB-S(Digital Multimedia Broadcasting-Satellite), MediaFLO(Media Forward Link Only), DVB-H(Digital Video Broadcast-Handheld), ISDB-T(Integrated Services Digital Broadcast-Terrestrial) 등의 디지털 방송 시스템을 이용하여 디지털 방송 신호를 수신할 수 있다. 물론, 상기 방송 수신 모듈(111)은, 상술한 디지털 방송 시스템뿐만 아니라 다른 방송 시스템에 적합하도록 구성될 수도 있다.The broadcast receiving module includes, for example, Digital Multimedia Broadcasting-Terrestrial (DMB-T), Digital Multimedia Broadcasting-Satellite (DMB-S), Media Forward Link Only (MediaFLO), and Digital Video Broadcast-Handheld (DVB-H). , ISDB-T (Integrated Services Digital Broadcast-Terrestrial), and other digital broadcasting systems may be used to receive digital broadcasting signals. Of course, the broadcast reception module 111 may be configured to be suitable for not only the digital broadcasting system described above, but also other broadcasting systems.

방송 수신 모듈을 통해 수신된 방송 신호 및/또는 방송 관련 정보는 메모리에 저장될 수 있다.Broadcast signals and/or broadcast related information received through the broadcast reception module may be stored in a memory.

이동통신 모듈은, 이동 통신망 상에서 기지국, 외부의 단말, 서버 중 적어도 하나와 무선 신호를 송수신한다. 상기 무선 신호는, 음성 호 신호, 화상 통화 호 신호 또는 문자/멀티미디어 메시지 송수신에 따른 다양한 형태의 데이터를 포함할 수 있다. The mobile communication module transmits and receives a radio signal with at least one of a base station, an external terminal, and a server on a mobile communication network. The wireless signal may include a voice call signal, a video call signal, or various types of data according to transmission/reception of text/multimedia messages.

상기 이동통신 모듈은 화상통화모드 및 음성통화모드를 구현하도록 이루어진다. 화상통화모드는 상대방의 영상을 보면서 통화하는 상태를 지칭하고, 음성통화모드는 상대방의 영상을 보지 않으면서 통화를 하는 상태를 지칭한다. 화상통화모드 및 음성통화모드를 구현하기 위하여 이동통신 모듈(112)은 음성 및 영상 중 적어도 하나를 송수신하도록 형성된다.The mobile communication module is configured to implement a video call mode and a voice call mode. The video call mode refers to a state in which a call is made while viewing the video of the other party, and the voice call mode refers to a state in which a call is made without viewing the image of the other party. In order to implement the video call mode and the voice call mode, the mobile communication module 112 is formed to transmit and receive at least one of audio and video.

무선 인터넷 모듈은 무선 인터넷 접속을 위한 모듈을 말하는 것으로, 이동 단말기(100)에 내장되거나 외장될 수 있다. 무선 인터넷 기술로는 WLAN(Wireless LAN), WiFi(Wireless Fidelity) Direct, DLNA(Digital Living Network Alliance), Wibro(Wireless broadband), Wimax(World Interoperability for Microwave Access), HSDPA(High Speed Downlink Packet Access) 등이 이용될 수 있다. The wireless Internet module refers to a module for wireless Internet access, and may be built-in or external to the mobile terminal 100. Wireless Internet technologies include WLAN (Wireless LAN), WiFi (Wireless Fidelity) Direct, DLNA (Digital Living Network Alliance), Wibro (Wireless broadband), Wimax (World Interoperability for Microwave Access), HSDPA (High Speed Downlink Packet Access), etc. Can be used.

근거리 통신 모듈은 근거리 통신을 위한 모듈을 말한다. 근거리 통신(short range communication) 기술로 블루투스(Bluetooth™), RFID(Radio Frequency Identification), 적외선 통신(Infrared Data Association; IrDA), UWB(Ultra Wideband), ZigBee, NFC(Near Field Communication), 와이-파이 다이렉트 등이 이용될 수 있다.The short-range communication module refers to a module for short-range communication. Short-range communication technologies include Bluetooth™, RFID (Radio Frequency Identification), Infrared Data Association (IrDA), UWB (Ultra Wideband), ZigBee, NFC (Near Field Communication), Wi-Fi. Direct or the like can be used.

위치정보 모듈은 이동 단말기의 위치를 획득하기 위한 모듈로서, 그의 대표적인 예로는 GPS(Global Position System) 모듈 또는 WiFi(Wireless Fidelity) 모듈이 있다.The location information module is a module for acquiring a location of a mobile terminal, and representative examples thereof include a GPS (Global Position System) module or a WiFi (Wireless Fidelity) module.

한편, 디스플레이부와 터치 동작을 감지하는 센서(이하, '터치 센서'라 함)가 상호 레이어 구조를 이루는 경우(이하, '터치 스크린'이라 함)에, 디스플레이부(151)는 출력 장치 이외에 입력 장치로도 사용될 수 있다. 터치 센서는, 예를 들어, 터치 필름, 터치 시트, 터치 패드 등의 형태를 가질 수 있다.On the other hand, when the display unit and the sensor for detecting a touch motion (hereinafter referred to as'touch sensor') form a mutual layer structure (hereinafter, referred to as'touch screen'), the display unit 151 inputs input other than the output device. It can also be used as a device. The touch sensor may have, for example, a touch film, a touch sheet, a touch pad, or the like.

터치 센서는 디스플레이부의 특정 부위에 가해진 압력 또는 디스플레이부의 특정 부위에 발생하는 정전 용량 등의 변화를 전기적인 입력신호로 변환하도록 구성될 수 있다. 터치 센서는 터치 대상체가 터치 센서 상에 터치 되는 위치 및 면적뿐만 아니라, 터치 시의 압력까지도 검출할 수 있도록 구성될 수 있다. 여기에서, 터치 대상체는 상기 터치 센서에 터치를 인가하는 물체로서, 예를 들어, 손가락, 터치펜 또는 스타일러스 펜(Stylus pen), 포인터 등이 될 수 있다.The touch sensor may be configured to convert a change in pressure applied to a specific portion of the display unit or a capacitance generated in a specific portion of the display unit into an electrical input signal. The touch sensor may be configured to detect not only a location and an area at which a touch object is touched on the touch sensor, but also a pressure when a touch object is touched. Here, the touch object is an object that applies a touch to the touch sensor, and may be, for example, a finger, a touch pen, a stylus pen, or a pointer.

터치 센서에 대한 터치 입력이 있는 경우, 그에 대응하는 신호(들)는 터치 제어기로 보내진다. 터치 제어기는 그 신호(들)를 처리한 다음 대응하는 데이터를 제어부로 전송한다. 이로써, 제어부는 디스플레이부(151)의 어느 영역이 터치 되었는지 여부 등을 알 수 있게 된다.When there is a touch input to the touch sensor, a signal(s) corresponding thereto is transmitted to the touch controller. The touch controller processes the signal(s) and then transmits the corresponding data to the controller. As a result, the control unit can know whether an area of the display unit 151 has been touched.

위치 검출부(291)는 헤드 트래킹(Head tracking) 기법을 통해 사용자의 위치를 검출하고, 그 검출한 사용자 위치를 제어부(270)에 출력한다. 예를 들면, 상기 위치 검출부(291)는 카메라(293)를 통해 촬영된 영상으로부터 사용자의 헤드를 검출 및 추적함으로써 사용자의 위치를 검출할 수 있다.The position detection unit 291 detects the user's position through a head tracking technique, and outputs the detected user position to the control unit 270. For example, the position detection unit 291 may detect the user's position by detecting and tracking the user's head from an image captured by the camera 293.

음성 인식부(292)는 사용자 음성을 인식하고, 그 인식된 사용자 음성을 제어부(270)에 출력한다. 상기 음성 인식부(292)는 마이크로 폰을 포함할 수 있다.The voice recognition unit 292 recognizes a user voice and outputs the recognized user voice to the control unit 270. The voice recognition unit 292 may include a microphone.

이하에서는, 사용자 음성 패턴에 따라 임계치를 변경함으로써 사용자 음성을 정확히 인식할 수 있는 음성 인식 장치 및 그 방법을 설명한다.Hereinafter, a voice recognition apparatus capable of accurately recognizing a user voice by changing a threshold value according to a user voice pattern and a method thereof will be described.

도 3은 본 발명의 실시예에 따른 음성 인식 방법을 나타낸 흐름도이다.3 is a flow chart showing a speech recognition method according to an embodiment of the present invention.

먼저, 상기 음성 인식부(292)는 사용자 발화에 따른 사용자 음성을 수신한다(S11). First, the voice recognition unit 292 receives a user voice according to a user's speech (S11).

상기 음성 인식부(292)는 입력 음성과 다수의 음성 모델들 간의 유사도를 나타내는 신뢰도 점수들(confidence scores)을 제공하는 미리결정된 모델들과 상기 사용자의 발성에 의해 입력되는 사용자 음성을 비교함으로써, 상기 사용자 음성을 인식하고, 그 인식된 결과를 상기 제어부(270)에 출력한다(S12). 상기 인식된 사용자 음성은 신뢰도 점수를 포함한다. 예를 들면, 사용자 발화에 따른 사용자 음성이 채널 올려, 채널 내려, 볼륨 올려, 등이라고 가정할 때, 상기 채널 올려는 6252 신뢰도 점수를 가질 수 있으며, 채널 내려는 8242 신뢰도 점수를 가질 수 있으며, 볼륨 올려는 2024 신뢰도 점수를 가질 수 있다. 여기서, 상기 신뢰도 점수를 이용하여 사용자 음성을 인식하는 방법은 미국 특허 번호 6,735,562에도 개시되어 있다.The speech recognition unit 292 compares the user speech input by the user's utterance with predetermined models that provide confidence scores representing the similarity between the input speech and a plurality of speech models. The user's voice is recognized, and the recognized result is output to the control unit 270 (S12). The recognized user voice includes a confidence score. For example, assuming that the user voice according to the user's utterance is channel up, channel down, volume up, etc., the channel up may have a 6252 reliability score, the channel down may have a 8242 reliability score, and the volume up May have a 2024 confidence score. Here, a method of recognizing a user's voice using the reliability score is also disclosed in US Patent No. 6,735,562.

상기 제어부(270)는 상기 인식된 사용자 음성이 임계치(threshold value)보다 높은 신뢰도 점수에 해당되는지를 결정하고, 상기 인식된 사용자 음성이 임계치(threshold value)보다 높은 신뢰도 점수에 해당되면 상기 사용자 음성을 허용(accpept)한다. The controller 270 determines whether the recognized user voice corresponds to a reliability score higher than a threshold value, and if the recognized user voice corresponds to a reliability score higher than a threshold value, the user voice Accpept.

반면, 상기 제어부(270)는, 상기 인식된 사용자 음성이 상기 임계치보다 낮은 신뢰도 점수에 해당되면 상기 인식된 사용자 음성 신호를 거절(reject)한 후, 미리설정된 안내 메시지(예를 들면, 조금 천천히 이야기 해 주십시오)를 제공하여 음성 인식을 재유도한다. 예를 들면, 상기 제어부(270)는, 상기 임계치(threshold value)가 6000 이라고 가정할 때, 상기 6252 신뢰도 점수를 갖는 "채널 올려"와 상기 8242 신뢰도 점수를 갖는 "채널 내려"를 상기 사용자 음성으로 인식하고, 상기 2024 신뢰도 점수를 갖는 "볼륨 올려"를 사용자 음성으로 인식하지 않고 거절한다.On the other hand, the controller 270 rejects the recognized user voice signal when the recognized user voice corresponds to a reliability score lower than the threshold, and then a preset guide message (for example, talk a little slowly. Please) to re-induce speech recognition. For example, assuming that the threshold value is 6000, the control unit 270 sets "channel up" with the 6252 reliability score and "channel down" with the 8242 reliability score as the user's voice. Recognizes and rejects the "volume up" having the 2024 confidence score without recognizing the user's voice.

상기 제어부(270)는 상기 임계치(threshold value) 이상에 해당하는 사용자 음성을 신뢰도 점수 순서대로 나열한다. 예를 들면, 상기 제어부(270)는 상기 8242 신뢰도 점수를 갖는 "채널 내려", 상기 6252 신뢰도 점수를 갖는 "채널 올려" 순서대로 나열할 수도 있다.The control unit 270 lists user voices corresponding to the threshold value or higher in order of reliability scores. For example, the control unit 270 may order "channel down" having the 8242 reliability score and "channel up" having the 6252 reliability score.

상기 제어부(270)는 상기 임계치(threshold value)를 사용자 음성 사용률(사용자 음성 사용 패턴)에 따라 변경할 수도 있다(S13). 상기 사용자 사용 음성 패턴은, 상기 영상 표시 장치의 채널 및 볼륨을 제어하기 위한 제1 사용자 음성의 사용률, 상기 영상 표시 장치를 통해 웹 브라우저를 실행시키기 위한 제2 사용자 음성의 사용률, 상기 영상 표시 장치를 통해 응용 프로그램을 실행시키기 위한 제3 사용자 음성의 사용률, 상기 영상 표시 장치의 방송 프로그램 검색을 위한 제4 사용자 음성의 사용률 중에서 적어도 어느 하나 이상을 포함할 수 있다.The control unit 270 may change the threshold value according to a user voice usage rate (user voice usage pattern) (S13). The user use voice pattern includes a first user voice usage rate for controlling a channel and volume of the video display device, a second user voice usage rate for executing a web browser through the video display device, and the video display device. At least one or more of a usage rate of a third user's voice for executing an application program and a usage rate of a fourth user's voice for searching a broadcast program of the video display device may be included.

상기 제어부(270)는 상기 영상 표시 장치(200)의 기능에 따라 미리 분류된(Categorized) 사용자 음성 사용률(사용자 사용 음성 패턴)을 근거로 상기 임계치를 변경함으로써 사용자 음성 인식 확률을 증가시킬 수 있다. 예를 들면, 상기 제어부(270)는, 상기 영상 표시 장치(200)의 채널 및 볼륨을 제어하기 위한 제1 사용자 음성(예를 들면, 채널 올려, 채널 내려, 볼륨 내려, 볼륨 올려 등), 상기 영상 표시 장치(200)를 통해 웹 브라우저를 실행시키기 위한 제2 사용자 음성(예를 들면, 인터넷, 웹 사이트, 등), 상기 영상 표시 장치(200)를 통해 다양한 응용 프로그램(예를 들면, 메신저, 메일 응용 프로그램 등)을 실행시키기 위한 제3 사용자 음성(예를 들면, 메신저, 메일 등), 상기 영상 표시 장치(200)의 방송 프로그램 검색을 위한 제4 사용자 음성(예를 들면, 드라마 이름, 예능 프로그램 이름 등)으로 분류할 수 있다. 상기 제1 내지 제4 사용자 음성은 미리 정의되거나, 서버로부터 수신될 수 있다. The control unit 270 may increase the probability of user voice recognition by changing the threshold value based on a user voice usage rate (user use voice pattern) classified in advance according to the function of the video display device 200. For example, the controller 270 may include a first user's voice (eg, channel up, channel down, volume down, volume up, etc.) for controlling the channel and volume of the video display device 200, A second user's voice (eg, Internet, web site, etc.) for executing a web browser through the video display device 200, and various application programs (eg, messenger, etc.) through the video display device 200 A third user voice (e.g., messenger, mail, etc.) for executing a mail application program, and a fourth user voice (e.g., drama name, entertainment) for searching a broadcast program of the video display device 200 Program name, etc.). The first to fourth user voices may be predefined or may be received from a server.

도 4는 본 발명의 실시예에 따른 사용자 사용 음성 패턴을 나타낸 예시도이다.4 is an exemplary diagram showing a user use voice pattern according to an embodiment of the present invention.

도 4에 도시한 바와 같이, 상기 제1 사용자 음성의 사용률이 전체(100%) 중에서 60%(4-1)이고, 상기 제2 사용자 음성의 사용률이 전체(100%) 중에서 10%(4-2)이고, 상기 제3 사용자 음성의 사용률이 전체(100%) 중에서 20%(4-3)이고, 상기 제4 사용자 음성의 사용률이 전체(100%) 중에서 10%(4-4)라고 가정할 때, 상기 제어부(270)는, 상기 제1 사용자 음성(예를 들면, 채널 올려, 채널 내려, 볼륨 내려, 볼륨 올려 등)이 상기 음성 인식부(292)에 의해 인식되면 상기 임계치를 6000에서 6500으로 증가시킴으로써, 발음상 비슷한 발화(예를 들면, 채널 올려, 채널 내려, 볼륨 내려, 볼륨 올려 등)에 대한 사용자 음성 인식률을 높인다. 상기 제어부(270)는, 상기 제4 사용자 음성(예를 들면, 드라마 이름, 예능 프로그램 이름 등)이 상기 음성 인식부(292)에 의해 인식되면 상기 임계치를 6000에서 5000으로 감소시킴으로써, 자주 사용하지 않는 사용자 음성에 대한 사용자 음성 인식률을 높인다. 상기 제1 내지 제4 사용자 음성의 사용률은 사용자가 평균적으로 사용하는 음성의 사용률일 수도 있다.As shown in FIG. 4, the first user's voice usage rate is 60% (4-1) of the total (100%), and the second user voice usage rate is 10% (4-1) of the total (100%). 2), and the third user voice usage rate is 20% (4-3) out of the total (100%), and the fourth user voice usage rate is 10% (4-4) out of the total (100%). When the first user voice (eg, channel up, channel down, volume down, volume up, etc.) is recognized by the voice recognition unit 292, the control unit 270 sets the threshold at 6000. By increasing it to 6500, the user's voice recognition rate for speech similar in pronunciation (eg, channel up, channel down, volume down, volume up, etc.) is increased. When the fourth user voice (e.g., drama name, entertainment program name, etc.) is recognized by the voice recognition unit 292, the control unit 270 reduces the threshold value from 6000 to 5000, which is not used frequently. Increases the user's voice recognition rate for the non-user voice The first to fourth user voice usage rates may be the average voice usage rates used by users.

따라서, 상기 제어부(270)는, 상기 제1 사용자 음성(예를 들면, 채널 올려, 채널 내려, 볼륨 내려, 볼륨 올려 등)과 같이 사용률이 높은 경우 유사한 발음을 갖는 음성(예를 들면, 채널 올려, 채널 내려, 볼륨 내려, 볼륨 올려 등)이 다수 존재할 가능성이 높으므로, 상기 임계치를 증가시킴으로써 발음상 비슷한 발화(예를 들면, 채널 올려, 채널 내려, 볼륨 내려, 볼륨 올려 등)에 대한 사용자 음성 인식률을 높인다. 상기 제어부(270)는, 상기 제4 사용자 음성(예를 들면, 드라마 이름, 예능 프로그램 이름 등)과 같이 사용률이 낮은 경우 유사한 발음을 갖는 음성(예를 들면, 드라마 이름, 예능 프로그램 이름 등)이 다수 존재할 가능성이 낮으므로, 상기 임계치를 감소시킴으로써 상기 제4 사용자 음성에 대한 사용자 음성 인식률을 높인다. 상기 제어부(270)는 유사한 발음을 갖는 다수의 사용자 음성이 인식되면 상기 임계치를 증가시킴으로써 상기 유사한 발음을 갖는 다수의 사용자 음성의 인식 확률을 높일 수 있다.Therefore, the control unit 270, when the use rate is high, such as the first user voice (for example, channel up, channel down, volume down, volume up, etc.), a voice having a similar pronunciation (for example, channel up , Channel down, volume down, volume up, etc.), the user's voice for similar pronunciation (e.g., channel up, channel down, volume down, volume up, etc.) by increasing the threshold. Increase recognition rate. When the usage rate is low, such as the fourth user voice (for example, a drama name, an entertainment program name, etc.), the control unit 270 may have a voice (for example, a drama name, an entertainment program name, etc.) Since there is a low possibility that there are many, the user voice recognition rate for the fourth user voice is increased by reducing the threshold. When a plurality of user voices having similar pronunciation are recognized, the controller 270 may increase the probability of recognizing the plurality of user voices having similar pronunciation by increasing the threshold.

상기 제어부(270)는 다양한 사용자 음성이 인식될 때마다 상기 다양한 사용자 음성의 사용 횟수(발화 횟수)를 카운트함으로써 상기 다양한 사용자 음성 각각의 사용률을 계산하고, 상기 계산된 각 사용자 음성 사용률에 따라 상기 임계치를 변경할 수도 있다.The controller 270 calculates the usage rate of each of the various user voices by counting the number of times of use of the various user voices (the number of utterances) whenever various user voices are recognized, and the threshold value according to the calculated user voice usage rates. You can also change

상기 제어부(270)는 상기 인식된 사용자 음성이 상기 변경된 임계치(threshold value)보다 높은 신뢰도 점수에 해당되는지를 결정하고(S14), 상기 인식된 사용자 음성이 임계치(threshold value)보다 높은 신뢰도 점수에 해당되면 상기 사용자 음성을 허용(accpept)한다(S15). The controller 270 determines whether the recognized user voice corresponds to a reliability score higher than the changed threshold value (S14), and the recognized user voice corresponds to a reliability score higher than a threshold value. If so, the user's voice is allowed (accpept) (S15).

반면, 상기 제어부(270)는, 상기 인식된 사용자 음성이 상기 변경된 임계치보다 낮은 신뢰도 점수에 해당되면 상기 인식된 사용자 음성 신호를 거절(reject)한 후(S16), 미리설정된 안내 메시지(예를 들면, 조금 천천히 이야기 해 주십시오)를 제공하여 음성 인식을 재유도한다. On the other hand, the controller 270 rejects the recognized user voice signal (S16) when the recognized user voice corresponds to a reliability score lower than the changed threshold (S16), and then a preset guidance message (for example, , Please talk a little slowly) to reinduce speech recognition.

따라서, 본 발명의 실시예에 따른 음성 인식 장치 및 그 방법은, 사용자 음성 패턴(영상 표시기기, 이동 통신 단말기 등의 전자 장치의 기능들에 따라 미리 분류된 각 사용자 음성의 사용률)에 따라 임계치를 변경함으로써 사용자 음성을 정확히 인식할 수 있다. Accordingly, the voice recognition apparatus and method thereof according to an embodiment of the present invention have a threshold value according to a user voice pattern (a usage rate of each user voice classified in advance according to functions of an electronic device such as a video display device and a mobile communication terminal). By changing, the user's voice can be accurately recognized.

도 5는 본 발명의 다른 실시예에 따른 음성 인식 방법을 나타낸 흐름도이다.5 is a flowchart illustrating a speech recognition method according to another embodiment of the present invention.

먼저, 상기 음성 인식부(292)는 사용자 발화에 따른 사용자 음성을 수신한다(S21). First, the voice recognition unit 292 receives a user voice according to a user's speech (S21).

상기 음성 인식부(292)는 입력 음성과 다수의 음성 모델들 간의 유사도를 나타내는 신뢰도 점수들(confidence scores)을 제공하는 미리결정된 모델들과 상기 사용자의 발성에 의해 입력되는 사용자 음성을 비교함으로써, 상기 사용자 음성을 인식하고, 그 인식된 결과를 상기 제어부(270)에 출력한다(S22). 상기 인식된 사용자 음성은 신뢰도 점수를 포함한다. 예를 들면, 사용자 발화에 따른 사용자 음성이 채널 올려, 채널 내려, 볼륨 올려, 등이라고 가정할 때, 상기 채널 올려는 6252 신뢰도 점수를 가질 수 있으며, 채널 내려는 8242 신뢰도 점수를 가질 수 있으며, 볼륨 올려는 2024 신뢰도 점수를 가질 수 있다. The speech recognition unit 292 compares the user speech input by the user's utterance with predetermined models that provide confidence scores representing the similarity between the input speech and a plurality of speech models. The user's voice is recognized, and the recognized result is output to the control unit 270 (S22). The recognized user voice includes a confidence score. For example, assuming that the user voice according to the user's utterance is channel up, channel down, volume up, etc., the channel up may have a 6252 reliability score, the channel down may have a 8242 reliability score, and the volume up May have a 2024 confidence score.

상기 제어부(270)는 상기 임계치(threshold value)를 사용자의 사용 기능 패턴(예를 들면, 영상 표시 기기, 이동 통신 단말기 등과 같은 전자 기기의 각 기능의 사용률)에 따라 변경할 수도 있다(S23). 상기 사용 기능 패턴은, 상기 영상 표시 장치의 채널 및 볼륨을 제어하기 위한 제1 사용자 사용 기능의 사용률, 상기 영상 표시 장치를 통해 웹 브라우저를 실행시키기 위한 제2 사용자 사용 기능의 사용률, 상기 영상 표시 장치를 통해 응용 프로그램을 실행시키기 위한 제3 사용자 사용 기능의 사용률, 상기 영상 표시 장치의 방송 프로그램 검색을 위한 제4 사용자 사용 기능의 사용률 중에서 적어도 어느 하나 이상을 포함할 수 있다.The controller 270 may change the threshold value according to a user's usage function pattern (eg, a usage rate of each function of an electronic device such as a video display device and a mobile communication terminal) (S23). The usage function pattern may include a usage rate of a first user usage function for controlling a channel and volume of the video display device, a usage rate of a second user usage function for executing a web browser through the video display device, and the video display device. At least one or more of a use rate of a third user use function for executing an application program through and a use rate of a fourth user use function for searching a broadcast program of the video display device may be included.

상기 제어부(270)는 상기 영상 표시 장치(200)의 기능에 따라 미리 분류된(Categorized) 사용자 사용 기능 패턴을 근거로 상기 임계치를 변경함으로써 사용자 음성 인식 확률을 증가시킬 수 있다. 예를 들면, 상기 제어부(270)는, 상기 영상 표시 장치(200)의 채널 및 볼륨을 제어하기 위한 제1 사용자 사용 기능(예를 들면, 채널 올림 버튼, 채널 내림 버튼, 볼륨 내림 버튼, 볼륨 올림 버튼 등), 상기 영상 표시 장치(200)를 통해 웹 브라우저를 실행시키기 위한 제2 사용자 사용 기능(예를 들면, 인터넷, 웹 사이트, 등), 상기 영상 표시 장치(200)를 통해 다양한 응용 프로그램(예를 들면, 메신저, 메일 응용 프로그램 등)을 실행시키기 위한 제3 사용자 사용 기능(예를 들면, 메신저, 메일 등), 상기 영상 표시 장치(200)의 방송 프로그램 검색을 위한 제4 사용자 사용 기능(예를 들면, 드라마 이름 검색, 예능 프로그램 이름 검색 등)으로 분류할 수 있다. 상기 제1 내지 제4 사용자 사용 기능은, 각 사용자에 따라 수집되어 정의될 수 있다. The control unit 270 may increase a user voice recognition probability by changing the threshold value based on a user use function pattern categorized according to the function of the video display device 200. For example, the control unit 270 includes a first user use function for controlling the channel and volume of the video display device 200 (e.g., a channel up button, a channel down button, a volume down button, a volume up Buttons, etc.), a second user use function (for example, the Internet, a web site, etc.) for executing a web browser through the video display device 200, and various application programs through the video display device 200 ( For example, a third user use function (for example, a messenger, mail, etc.) for executing a messenger, mail application program, etc., and a fourth user use function for searching a broadcast program of the video display device 200 ( For example, it can be classified as a drama name search, an entertainment program name search, etc.). The first to fourth user use functions may be collected and defined according to each user.

도 6은 본 발명의 실시예에 따른 사용자 사용 기능 패턴을 나타낸 예시도이다.6 is an exemplary diagram showing a user use function pattern according to an embodiment of the present invention.

도 6에 도시한 바와 같이, 상기 제1 사용자 사용 기능의 사용률이 전체(100%) 중에서 60%(6-1)이고, 상기 제2 사용자 사용 기능의 사용률이 전체(100%) 중에서 10%(6-2)이고, 상기 제3 사용자 사용 기능의 사용률이 전체(100%) 중에서 20%(6-3)이고, 상기 제4 사용자 사용 기능의 사용률이 전체(100%) 중에서 10%(6-4)라고 가정할 때, 상기 제어부(270)는, 상기 제1 사용자 사용 기능(예를 들면, 채널 올림 버튼, 채널 내림 버튼, 볼륨 내림 버튼, 볼륨 올림 버튼 등)이 검출되면 상기 임계치를 6000에서 6500으로 증가시킴으로써, 발음상 비슷한 발화(예를 들면, 채널 올려, 채널 내려, 볼륨 내려, 볼륨 올려 등)에 대한 사용자 음성 인식률을 높인다. 상기 제어부(270)는, 상기 제4 사용자 사용 기능(예를 들면, 드라마 이름 검색, 예능 프로그램 이름 검색 등)이 검출되면 상기 임계치를 6000에서 5000으로 감소시킴으로써, 자주 사용하지 않는 사용자 음성에 대한 사용자 음성 인식률을 높인다. 상기 제1 내지 제4 사용자 사용 기능의 사용률은 사용자가 평균적으로 사용하는 기능의 사용률이거나, 평균 사용자 사용 기능의 사용률일 수도 있다. 상기 제어부(270)는 상기 사용자 사용 기능 패턴(제1 내지 제4 사용자 사용 기능의 사용률)을 서버에 전송한 후 서버로부터 상기 사용자 사용 기능 패턴에 대응하는 임계치를 수신하고, 그 수신된 임계치를 수신할 수도 있다. As shown in Fig. 6, the use rate of the first user function is 60% (6-1) of the total (100%), and the use rate of the second user function is 10% ( 6-2), the use rate of the third user function is 20% (6-3) out of the total (100%), and the use rate of the fourth user function is 10% (6- 4), when the first user use function (for example, a channel up button, a channel down button, a volume down button, a volume up button, etc.) is detected, the threshold is at 6000. By increasing it to 6500, the user's voice recognition rate for speech similar in pronunciation (eg, channel up, channel down, volume down, volume up, etc.) is increased. When the fourth user use function (eg, drama name search, entertainment program name search, etc.) is detected, the control unit 270 reduces the threshold value from 6000 to 5000, Increase speech recognition rate. The usage rate of the first to fourth user-use functions may be a usage rate of an average user function or a usage rate of an average user-use function. The control unit 270 transmits the user use function pattern (the use rate of the first to fourth user use functions) to the server, then receives a threshold value corresponding to the user use function pattern from the server, and receives the received threshold value. You may.

따라서, 상기 제어부(270)는, 상기 제1 사용자 사용 기능(예를 들면, 채널 올림 버튼, 채널 내림 버튼, 볼륨 내림 버튼, 볼륨 올림 버튼 등)과 같이 사용률이 높은 경우 유사한 발음을 갖는 음성(예를 들면, 채널 올려, 채널 내려, 볼륨 내려, 볼륨 올려 등)이 다수 존재할 가능성이 높으므로, 상기 임계치를 증가시킴으로써 발음상 비슷한 발화(예를 들면, 채널 올려, 채널 내려, 볼륨 내려, 볼륨 올려 등)에 대한 사용자 음성 인식률을 높인다. 상기 제어부(270)는, 상기 제4 사용자 사용 기능(예를 들면, 드라마 이름 검색, 예능 프로그램 이름 검색 등)과 같이 사용률이 낮은 경우 유사한 발음을 갖는 음성(예를 들면, 드라마 이름, 예능 프로그램 이름 등)이 다수 존재할 가능성이 낮으므로, 상기 임계치를 감소시킴으로써 상기 제4 사용자 사용 기능에 대한 사용자 음성 인식률을 높인다. 상기 제어부(270)는 유사한 발음을 갖는 다수의 사용자 음성이 인식되면 상기 임계치를 증가시킴으로써 상기 유사한 발음을 갖는 다수의 사용자 음성의 인식 확률을 높일 수 있다.Therefore, the control unit 270, the first user function (e.g., channel up button, channel down button, volume down button, volume up button, etc.) For example, since there is a high possibility that there are a large number of channels up, channel down, volume down, volume up, etc.), by increasing the threshold, speech similar in pronunciation (e.g., channel up, channel down, volume down, volume up, etc.) ) To increase the user's voice recognition rate. When the usage rate is low, such as the fourth user use function (for example, a drama name search, an entertainment program name search, etc.), the control unit 270 may have a voice having a similar pronunciation (for example, a drama name, an entertainment program name, etc.). And the like) is unlikely to exist, thereby increasing the user's voice recognition rate for the fourth user use function by reducing the threshold. When a plurality of user voices having similar pronunciation are recognized, the controller 270 may increase the probability of recognizing the plurality of user voices having similar pronunciation by increasing the threshold.

상기 제어부(270)는 상기 인식된 사용자 음성이 상기 변경된 임계치(threshold value)보다 높은 신뢰도 점수에 해당되는지를 결정하고(S14), 상기 인식된 사용자 음성이 임계치(threshold value)보다 높은 신뢰도 점수에 해당되면 상기 사용자 음성을 허용(accpept)한다(S15). 상기 제어부(270)는 상기 인식된 사용자 음성이 상기 서버로부터 수신된 임계치(threshold value)보다 높은 신뢰도 점수에 해당되는지를 결정하고, 상기 인식된 사용자 음성이 상기 서버로부터 수신된임계치보다 높은 신뢰도 점수에 해당되면 상기 사용자 음성을 허용(accpept)할 수도 있다. The controller 270 determines whether the recognized user voice corresponds to a reliability score higher than the changed threshold value (S14), and the recognized user voice corresponds to a reliability score higher than a threshold value. If so, the user's voice is allowed (accpept) (S15). The controller 270 determines whether the recognized user voice corresponds to a reliability score higher than a threshold value received from the server, and the recognized user voice corresponds to a reliability score higher than the threshold value received from the server. If applicable, the user's voice may be allowed (accpept).

반면, 상기 제어부(270)는, 상기 인식된 사용자 음성이 상기 변경된 임계치보다 낮은 신뢰도 점수에 해당되면 상기 인식된 사용자 음성 신호를 거절(reject)한 후(S16), 미리설정된 안내 메시지(예를 들면, 조금 천천히 이야기 해 주십시오)를 제공하여 음성 인식을 재유도한다. 상기 제어부(270)는, 상기 인식된 사용자 음성이 상기 서버로부터 수신된 임계치보다 낮은 신뢰도 점수에 해당되면 상기 인식된 사용자 음성 신호를 거절(reject)한 후, 미리설정된 안내 메시지(예를 들면, 조금 천천히 이야기 해 주십시오)를 제공하여 음성 인식을 재유도할 수도 있다. On the other hand, the controller 270 rejects the recognized user voice signal (S16) when the recognized user voice corresponds to a reliability score lower than the changed threshold (S16), and then a preset guidance message (for example, , Please talk a little slowly) to reinduce speech recognition. When the recognized user voice corresponds to a reliability score lower than the threshold value received from the server, the controller 270 rejects the recognized user voice signal, and then a preset guide message (for example, a little Please speak slowly) to re-induce speech recognition.

따라서, 본 발명의 다른 실시예에 따른 음성 인식 장치 및 그 방법은, 사용자 사용 기능 패턴(영상 표시기기, 이동 통신 단말기 등의 전자 장치의 각 기능들의 사용률)에 따라 임계치를 변경함으로써 사용자 음성을 정확히 인식할 수 있다. Accordingly, the voice recognition apparatus and method thereof according to another embodiment of the present invention accurately change the user voice by changing the threshold value according to the user use function pattern (the use rate of each function of an electronic device such as a video display device and a mobile communication terminal). I can recognize it.

이상에서 설명한 바와 같이, 본 발명의 실시예들에 따른 음성 인식 장치 및 그 방법은, 사용자 음성 패턴(영상 표시기기, 이동 통신 단말기 등의 전자 장치의 기능들에 따라 미리 분류된 각 사용자 음성의 사용률)에 따라 임계치를 변경함으로써 사용자 음성을 정확히 인식할 수 있다. As described above, the voice recognition apparatus and method thereof according to embodiments of the present invention include a user voice pattern (a usage rate of each user voice classified in advance according to functions of an electronic device such as a video display device and a mobile communication terminal). ), the user's voice can be accurately recognized by changing the threshold.

본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자라면 본 발명의 본질적인 특성에서 벗어나지 않는 범위에서 다양한 수정 및 변형이 가능할 것이다. 따라서, 본 발명에 개시된 실시예들은 본 발명의 기술 사상을 한정하기 위한 것이 아니라 설명하기 위한 것이고, 이러한 실시예에 의하여 본 발명의 기술 사상의 범위가 한정되는 것은 아니다. 본 발명의 보호 범위는 아래의 청구범위에 의하여 해석되어야 하며, 그와 동등한 범위 내에 있는 모든 기술 사상은 본 발명의 권리범위에 포함되는 것으로 해석되어야 할 것이다. Those of ordinary skill in the art to which the present invention pertains will be able to make various modifications and variations without departing from the essential characteristics of the present invention. Accordingly, the embodiments disclosed in the present invention are not intended to limit the technical idea of the present invention, but to explain the technical idea, and the scope of the technical idea of the present invention is not limited by these embodiments. The scope of protection of the present invention should be interpreted by the following claims, and all technical ideas within the scope equivalent thereto should be interpreted as being included in the scope of the present invention.

270: 제어부 292: 음성 인식부270: control unit 292: voice recognition unit

Claims

A speech recognition unit for recognizing a user's speech based on predetermined speech models providing a reliability score;
And a control unit for allowing or rejecting the recognized user voice based on a reliability score and a threshold value of the recognized user voice,
The control unit changes the threshold value according to a user voice usage rate of the recognized user voice,
The user voice usage rate is,
According to the function of the video display device, a first user voice usage rate for controlling the channel and volume of the video display device, a second user voice usage rate for executing a web browser through the video display device, through the video display device It may be classified as at least one of a third user voice usage rate for executing an application program and a fourth user voice usage rate for searching a broadcast program of the video display device,
Whenever the user voice is recognized, it is calculated by counting the number of utterances of the user voice,
Wherein the control unit changes the threshold value according to the first user voice usage rate to the fourth user voice usage rate.

The method of claim 1, wherein the control unit,
And increasing or decreasing the threshold according to the user's voice usage rate.

The method of claim 2, wherein the control unit,
And allowing the recognized user voice if the reliability score of the recognized user voice is higher than the increased or decreased threshold.

The method of claim 2, wherein the control unit,
And rejecting the recognized user voice when the reliability score of the recognized user voice is lower than the increased or decreased threshold.

delete

The method of claim 1, wherein the control unit,
A voice processing apparatus, characterized in that the user voice usage rate is preset.

The method of claim 1, wherein the control unit,
A voice processing device, characterized in that receiving the user voice usage rate from a server.

The method of claim 1, wherein the user voice usage rate is
A voice processing device, characterized in that it is a usage rate of an average user voice.

The method of claim 1, wherein the control unit,
When a plurality of user voices having similar pronunciation are recognized, the threshold value is increased to increase a recognition probability of the plurality of user voices having similar pronunciation.

delete