KR20060050173A

KR20060050173A - Apparatus for processing text data according to a script characteristic, and method thereof

Info

Publication number: KR20060050173A
Application number: KR1020050063765A
Authority: KR
Inventors: 정길수; 유성열
Original assignee: 삼성전자주식회사
Priority date: 2004-07-30
Filing date: 2005-07-14
Publication date: 2006-05-19
Also published as: CN1728146A

Abstract

정보저장매체에 기록된 텍스트 데이터를 그 텍스트 데이터의 속성에 맞게 처리하는 방법 및 장치가 개시된다. 본 발명에 따라 텍스트 데이터의 처리방법은, (a) 상기 텍스트의 언어 속성에 따라 분류된 스크립트 카테고리 정보를 추출하는 단계; 및 (b) 상기 추출된 카테고리에 포함되어 있는 스크립트 정보들에 따라 상기 텍스트 데이터를 렌더링하는 단계를 포함하는 것을 특징으로 한다. 본 발명에 따르면, 재생장치의 텍스트 생성기가 처리할 수 있는 언어 정보로써, 스크립트 단위로 분류된 스크립트 카테고리 정보를 저장하고 이 정보를 이용하여 텍스트 데이터를 처리함으로써, 재생장치의 리소스 낭비를 방지할 수 있다.A method and apparatus for processing text data recorded on an information storage medium in accordance with attributes of the text data is disclosed. According to the present invention, a method of processing text data includes: (a) extracting script category information classified according to a language attribute of the text; And (b) rendering the text data according to the script information included in the extracted category. According to the present invention, as the language information that can be processed by the text generator of the playback apparatus, by storing the script category information classified in units of scripts and processing the text data using this information, it is possible to prevent the waste of the playback apparatus. have.

텍스트 처리기, 텍스트 생성기, 스크립트, 렌더링, script, rendering Text handler, text generator, script, rendering

Description

Apparatus for processing text data according to a script characteristic, and method

도 1a는 텍스트 생성기가 텍스트 데이터를 처리하여 출력하는 것을 설명하기 위한 도면,1A is a diagram for explaining that a text generator processes and outputs text data;

도 1b는 bidirectional 속성값이 "right to left"인 경우의 텍스트 데이터 출력화면을 도시한 도면,1B is a diagram illustrating a text data output screen when the bidirectional attribute value is "right to left";

도 1c는 텍스트 생성기가 숫자 및 기호들의 묶음을 올바르게 표시할 수 있도록 "Arabic Script"를 처리할 수 있다는 정보를 포함하고 있는 경우에 텍스트 데이터를 렌더링하는 것을 도시한 도면,1C is a diagram illustrating rendering text data when the text generator contains information that the "Arabic Script" can be processed to correctly display a bundle of numbers and symbols;

도 1d는 "Hebrew Script"를 처리할 수 있다는 정보가 추가된 경우의 텍스트 데이터 렌더링 결과를 도시한 도면,FIG. 1D is a view illustrating a text data rendering result when information indicating that "Hebrew Script" can be processed is added. FIG.

도 2a 내지 도 2b는 본 발명에 따라 재생장치 내의 텍스트 생성기가 처리할 수 있는 언어 코드를 나타내는 정보로서 스크립트를 사용한 일 예를 도시한 도면,2A to 2B are diagrams showing an example of using a script as information representing a language code that a text generator in a playback apparatus can process according to the present invention;

도 3은 본 발명에 따른 재생장치의 블록도,3 is a block diagram of a playback apparatus according to the present invention;

도 4는 본 발명의 텍스트 데이터 처리방법의 플로우차트이다.4 is a flowchart of the text data processing method of the present invention.

본 발명은 텍스트 데이터의 처리에 관한 것으로, 보다 상세하게는 정보저장매체에 기록된 텍스트 데이터를 그 텍스트 데이터의 속성에 맞게 처리하는 방법 및 장치에 관한 것이다.The present invention relates to the processing of text data, and more particularly, to a method and apparatus for processing text data recorded on an information storage medium in accordance with attributes of the text data.

텍스트 데이터를 저장하는 정보저장매체에서, 하나의 텍스트는 여러 가지 언어로 인코딩된 텍스트 데이터로 만들어져 정보저장매체에 저장된다. 그리고, 이 텍스트 데이터를 재생하는 재생장치는, 사용자의 선택에 따라, 여러 가지 언어로 인코딩된 텍스트 데이터 중에서 하나를 읽어와 텍스트 생성기를 통해 렌더링하여 화면에 디스플레이한다. 따라서, 텍스트 데이터를 저장하는 정보저장매체에는 여러 가지 언어로 인코딩된 텍스트 데이터가 저장되어 있으므로, 만일 재생장치가 이들 언어로 만들어진 텍스트 데이터를 처리하여 디스플레이할 수 있기 위해 재생장치의 리소스가 많이 필요할 뿐만 아니라, 재생장치가 처리할 수 있는 언어에 대한 정보를 별도로 저장하고 있어야 한다.In an information storage medium storing text data, one text is made of text data encoded in various languages and stored in the information storage medium. The playback apparatus for reproducing the text data reads one of the text data encoded in various languages, renders it through a text generator, and displays the text data on the screen according to a user's selection. Therefore, the information storage medium storing the text data stores text data encoded in various languages, so that if the playback apparatus can process and display the text data created in these languages, it requires a lot of resources of the playback apparatus. Rather, it must separately store information about the languages that the playback device can process.

그러나, 재생장치가 리소스가 제한되는 가전기기(Consumer Electronic)인 경우에 해당 가전기기에서 지원하는 언어 전용 텍스트 생성기가 필요하다.However, when the playback device is a consumer electronic device with limited resources, a language-only text generator supported by the home appliance is required.

따라서, 본 발명이 이루고자 하는 기술적 과제는 여러 가지 언어로 작성된 텍스트 데이터들을 어떻게 처리하는가를 나타내는 속성정보에 따라 정의된 스크립트를 카테고리 별로 분류하고, 그 분류한 카테고리에 따라 재생장치가 텍스트 데이 터를 처리하는 텍스트 데이터 처리방법 및 장치를 제공하는 것이다.Accordingly, a technical problem of the present invention is to classify a script defined by attribute information indicating how to process text data written in various languages by category, and the playback apparatus processes the text data according to the classified category. To provide a text data processing method and apparatus.

또한, 본 발명이 이루고자 하는 또 다른 기술적 과제는, 보다 효율적으로 텍스트 데이터를 처리하는 특정 언어 전용의 재생장치를 제공하는 것이다.In addition, another technical problem to be solved by the present invention is to provide a playback apparatus for a specific language that processes text data more efficiently.

상기 기술적 과제는 본 발명에 따라, 텍스트 데이터의 처리방법에 있어서, (a) 텍스트의 언어 속성에 따라 분류된 스크립트 카테고리 정보를 추출하는 단계; 및 (b) 추출된 카테고리에 포함되어 있는 스크립트 정보들에 따라 텍스트 데이터를 렌더링하는 단계를 포함하는 것을 특징으로 하는 텍스트 데이터 처리방법에 의해 달성된다.According to an aspect of the present invention, there is provided a method of processing text data, comprising: (a) extracting script category information classified according to a language attribute of text; And (b) rendering the text data according to the script information included in the extracted category.

스크립트 카테고리 정보는 복수개의 스크립트 정보를 포함하며, 스크립트는 복수의 유니코드 심벌들이 묶여서 하나의 단위로 처리되는 것이 바람직하며, 스크립트는 유니코드에서 문자셋을 표현하기 위해 사용되는 스크립트인 것이 바람직하다.The script category information includes a plurality of script information, and a script is preferably a plurality of Unicode symbols are bundled and processed as a unit, and the script is preferably a script used for representing a character set in Unicode.

또한, 스크립트 카테고리 정보는 재생장치가 지원하는 언어에 대한 정보를 나타내는 것이 바람직하며, 스크립트 카테고리 정보는 재생장치의 시스템 파라미터로 저장되어 있는 것이 특히 바람직하다.In addition, the script category information preferably represents information about a language supported by the playback apparatus, and the script category information is particularly preferably stored as a system parameter of the playback apparatus.

한편, 본 발명의 다른 분야에 따르면 전술한 기술적 과제는, 복수개의 언어로 인코딩되어 있는 텍스트 데이터; 및 텍스트 데이터의 언어 속성에 따라 분류된 스크립트 카테고리 정보를 포함하고 있는 것을 특징으로 하는 정보저장매체에 의해 달성된다.On the other hand, according to another field of the present invention, the above technical problem, the text data is encoded in a plurality of languages; And script category information classified according to language attributes of the text data.

한편, 본 발명의 또 다른 분야에 따르면 전술한 기술적 과제는, 텍스트 데이터의 처리장치에 있어서, 텍스트의 언어 속성에 따라 분류된 스크립트 카테고리 정보를 추출하는 추출부; 및 추출된 카테고리에 포함되어 있는 스크립트 정보들에 따라 텍스트 데이터를 렌더링하는 텍스트 생성부를 포함하는 것을 특징으로 하는 텍스트 데이터 처리장치에 의해 달성된다.On the other hand, according to another field of the present invention, the above technical problem, an apparatus for processing text data, extracting unit for extracting the script category information classified according to the language attribute of the text; And a text generation unit that renders the text data according to the script information included in the extracted category.

한편, 본 발명의 다른 분야에 따르면 전술한 기술적 과제는, 복수개의 언어로 인코딩되어 있는 텍스트 데이터와 그 언어의 속성에 따라 분류된 스크립트 카테고리 정보를 저장하는 텍스트 데이터 저장부; 및 텍스트 데이터를 읽어 스크립트 카테고리 정보에 포함된 스크립트 정보들에 따라 텍스트 데이터를 렌더링하는 텍스트 데이터 처리부를 포함하는 것을 특징으로 하는 재생장치에 의해서도 달성된다.On the other hand, according to another field of the present invention, the above technical problem, the text data storage unit for storing the text data encoded in a plurality of languages and the script category information classified according to the attribute of the language; And a text data processing unit which reads the text data and renders the text data in accordance with the script information included in the script category information.

한편, 본 발명의 또 다른 분야에 따르면 전술한 기술적 과제는, (a) 텍스트의 언어 속성에 따라 분류된 스크립트 카테고리 정보를 추출하는 단계; 및 (b) 추출된 카테고리에 포함되어 있는 스크립트 정보들에 따라 텍스트 데이터를 렌더링하는 단계를 포함하는 것을 특징으로 하는 텍스트 데이터 처리방법을 컴퓨터에서 실행시키기 위한 프로그램을 기록한 컴퓨터로 읽을 수 있는 기록매체에 의해서 달성된다.On the other hand, according to another field of the present invention, the above technical problem, (a) extracting the script category information classified according to the language attribute of the text; And (b) rendering the text data according to the script information included in the extracted category on a computer readable recording medium having recorded thereon a program for executing the text data processing method on a computer. Is achieved by

이하 첨부된 도면을 참조하여 본 발명의 바람직한 실시예에 대해 상세히 설명한다.Hereinafter, exemplary embodiments of the present invention will be described in detail with reference to the accompanying drawings.

도 1a는 텍스트 생성기가 텍스트 데이터를 처리하여 출력하는 것을 설명하기 위한 도면이다.FIG. 1A is a diagram for describing that a text generator processes and outputs text data.

도 1a를 참조하면, 텍스트 생성기가 텍스트 데이터를 렌더링하여 출력하기 위해서 텍스트 데이터와 폰트 데이터를 입력받음을 알 수 있다. 예를 들어, 텍스트 데이터의 언어가 영어이고 폰트 데이터는 Arial 폰트를 사용하였을 때, "Text Data (10-12)"라는 텍스트 데이터를 Arial 폰트를 사용하여 텍스트 생성기가 처리하면, "Text Data (10-12)"(110)가 화면에 디스플레이된다. 여기서, 전체 텍스트 데이터를 이루고 있는 각각의 구성요소, 예를 들어 '(', 'T', '1', '-' 등을 심벌(symbol)이라고 하며, 텍스트 데이터를 어떻게 처리하는가에 따라 여러 가지 스크립트가 존재한다. 예를 들어, "left to right" 스크립트는 텍스트 데이터를 왼쪽에서 오른쪽으로 디스플레이하는 스크립트이고, "Arabic" 스트립트는 숫자나 기호 등으로 이루어진 단위를 한번에 처리하는 것을 의미하는 스트립트이다. 즉, 스크립트는 동일한 속성을 가지는 복수의 심벌들을 처리하는 방법을 기술한 프로그램 형태로 재생장치의 텍스트 생성기에 포함될 수 있다. 따라서 스크립트 정보에 따라서, 텍스트 데이터의 처리단위가 달라지게 된다. 폰트가 개개의 심벌들에 대하여 적용되는 데 비해, 스크립트는 동일한 속성을 가지는 복수의 심벌들에 대하여 일괄적으로 적용된다.Referring to FIG. 1A, it can be seen that the text generator receives text data and font data in order to render and output text data. For example, when the language of the text data is English and the font data uses Arial font, if the text generator processes the text data "Text Data (10-12)" using Arial font, "Text Data (10"). 110) is displayed on the screen. In this case, each component constituting the full text data, for example, '(', 'T', '1', '-', etc. is called a symbol, and depending on how the text data is processed, Scripts exist, for example, a "left to right" script is a script that displays text data from left to right, and an "Arabic" script is a script that means processing units of numbers or symbols at once. That is, the script may be included in the text generator of the playback apparatus in the form of a program describing a method of processing a plurality of symbols having the same attribute, so that the processing unit of the text data varies depending on the script information. Compared to the symbols of the symbol, the script does not apply to multiple symbols with the same property. It is.

도 1a의 예에서, 텍스트 데이터는 심벌(symbol) 단위로 렌더링되며, 텍스트 데이터 저장매체의 제작시에 특정한 속성값이 할당되지 않은 경우에는, 영어 텍스트 데이터는 bidirectional 속성값으로 "left to right" 값을 가지므로 "Text Data (10-12)"(110)가 출력된다.In the example of FIG. 1A, text data is rendered in symbol units, and when a specific attribute value is not assigned at the time of manufacture of the text data storage medium, the English text data is a bidirectional attribute value as a "left to right" value. Since "Text Data (10-12)" 110 is output.

도 1b는 bidirectional 속성값이 "right to left"인 경우의 텍스트 데이터 출력화면을 도시한 도면이다.FIG. 1B is a diagram illustrating a text data output screen when the bidirectional attribute value is "right to left".

텍스트 생성기는 심벌(symbol) 단위로 렌더링을 하기 때문에, 각각의 심벌들이 오른쪽에서 왼쪽으로 하나하나씩 출력되어 도 1b에서 도시한 바와 같이 ")21-01( ataD txeT"(120)가 출력된다. 따라서, 알파벳 데이터는 심벌단위로 처리되어도 문제가 없지만, 숫자와 기호 데이터는 올바르게 출력되지 않는다. 따라서, 텍스트 생성기는 동일한 속성을 가진 심벌들을 올바르게 표시할 수 있도록 속성정보, 즉 스크립트를 포함한다. 이를 도 1c를 참조하여 설명한다.Since the text generator renders in symbols, each symbol is output one by one from right to left, and as shown in FIG. 1B, ") 21-01 (ataD txeT" 120 is output. However, alphabetic data can be processed symbolically, but numeric and symbolic data are not output correctly, so the text generator includes attribute information, or scripts, to correctly display symbols with the same attributes. It demonstrates with reference to 1c.

도 1c는 텍스트 생성기가 숫자 및 기호들의 묶음을 올바르게 표시할 수 있도록 "Arabic Script"를 처리할 수 있다는 정보를 포함하고 있는 경우에 텍스트 데이터를 렌더링하는 것을 도시한 도면이다.FIG. 1C is a diagram illustrating rendering text data when the text generator includes information that the "Arabic Script" can be processed to correctly display a bundle of numbers and symbols.

도 1c를 참조하면, 텍스트 생성기는 심벌 단위로 텍스트 데이터를 렌더링 하는 것이 아니라 스크립트 단위로 처리하며, 이때 "Arabic Script"라는 정보를 이용하여 숫자 및 기호들의 묶음을 묶음 단위로 처리하여 렌더링한다. 따라서, 도 1c와 같이 숫자를 포함하는 단어인 "(10-12)"는 하나의 심벌로 간주되어 렌더링된 것과 같이 올바르게 디스플레이 되어 "(10-12) ataD txeT" (130)이 출력된다.Referring to FIG. 1C, the text generator does not render text data in symbol units, but in script units. In this case, the text generator processes a bundle of numbers and symbols in units of bundles using information called "Arabic Script" and renders them. Accordingly, the word "(10-12)" including a number as shown in FIG. 1C is regarded as a symbol and correctly displayed as rendered, and "(10-12) ataD txeT" 130 is output.

도 1d는 "Hebrew Script"를 처리할 수 있다는 정보가 추가된 경우의 텍스트 데이터 렌더링 결과를 도시한 도면이다.FIG. 1D is a diagram illustrating a text data rendering result when information indicating that "Hebrew Script" can be processed is added.

즉, 텍스트 생성기가 "Hebrew Script"에 관한 정보까지 처리할 수 있는 경우에는 숫자 10, 12가 각각 처리되므로 "(10-12)"와 같이 디스플레이되는 것이 아니라 "(12-10)"으로 디스플레이되어 결과적으로 "(12-10) ataD txeT"(140)이 출력된 다.That is, when the text generator can process information about "Hebrew Script", the numbers 10 and 12 are processed, respectively, so that they are displayed as "(12-10)" rather than as "(10-12)". As a result, "(12-10) ataD txeT" 140 is output.

상술한 바와 같이 텍스트 생성기가 텍스트 데이터를 렌더링 할 때, 심벌단위가 아니라 스크립트 단위로 렌더링을 한다. 따라서 본 발명에서는 텍스트 생성기가 처리할 수 있는 특정 언어 정보를 나타낼 때, 많은 리소스가 소모되는 언어정보를 모두 사용하지 않고, 스크립트를 기준으로 분류하여 만든 카테고리 정보를 사용하여 텍스트 생성기의 텍스트 처리여부를 알아낸다. 특히 본 발명에 따른 스크립트 카테고리 정보를 사용하는 텍스트 생성기의 경우, 모든 언어에 관련된 스크립트 정보를 모두 포함할 필요가 없이 해당 재생장치가 지원하고자 하는 특정 지역언어에 필요한 스크립트 정보만을 포함하도록 함으로써, 재생장치의 제한된 리소스를 효율적으로 이용할 수 있다. 즉, 보다 효율적으로 특정 지역 언어만을 지원하는 특정 지역 전용 재생장치를 제공할 수 있다.As described above, when the text generator renders the text data, the text generator renders the data instead of the symbol unit. Therefore, in the present invention, when the text generator indicates specific language information that the text generator can process, text processing of the text generator is performed using category information generated by classifying the script based on the script information without using all the language information that consumes a lot of resources. Find out. In particular, in the case of the text generator using the script category information according to the present invention, the playback apparatus does not need to include all the script information related to all languages, so that the playback apparatus includes only the script information necessary for the specific local language to be supported by the playback apparatus. You can efficiently use limited resources. In other words, it is possible to provide a specific region-specific playback apparatus that supports only a specific region language more efficiently.

도 2a 내지 도 2b는 본 발명에 따라 재생장치 내의 텍스트 생성기가 처리할 수 있는 언어 코드를 나타내는 정보로서 스크립트를 사용한 일 예를 도시한 도면이다.2A to 2B are diagrams showing an example of using a script as information representing a language code that can be processed by a text generator in a playback apparatus according to the present invention.

도 2a를 참조하면, 종래의 재생장치는 재생장치가 처리할 수 있는 언어 정보(200)를 각 언어별로 가지고 있었다. 우리나라 언어를 예로 들면, 우리나라 언어로 만들어진 텍스트 데이터라고 하더라도 중간중간에 영어, 숫자, 기호, 기타 그리스 문자 등이 있을 수 있다. 따라서 재생장치의 시스템 파라미터는 이러한 여러 가지 언어를 처리하기 위해서 "Arabic", "Hangul", "Greek" 등의 속성정보, 즉 스크립트 정보를 가지고 있어야 한다. 이와 같이 일반적으로 하나의 언어로 만들어진 텍스트 데이터라고 하더라도, 상술한 바와 같이 약 100여개가 넘는 복수의 스크립트 정보를 포함하고 있으므로 재생장치의 리소스가 많이 필요했다. 이러한 문제점을 해결하고자 본 발명에서는 도 2a와 같이 동일한 스크립트 정보를 갖는 언어 코드를 동일 카테고리(202)로 구분하였다.Referring to FIG. 2A, the conventional playback apparatus has language information 200 that can be processed by the playback apparatus for each language. For example, in Korean language, even text data made in Korean language may include English, numbers, symbols, and other Greek characters in the middle. Therefore, the system parameters of the playback apparatus must have attribute information, that is, script information, such as "Arabic", "Hangul", and "Greek" in order to process these various languages. As described above, even text data generally made in one language includes a plurality of script information of about 100 as described above, and thus requires a lot of resources of a playback device. In order to solve this problem, in the present invention, as shown in FIG. 2A, language codes having the same script information are divided into the same category 202.

이때 스크립트로서, 유니코드(unicode)에서 문자셋을 표현하기 위해 사용되는 스크립트가 사용된다. 유니코드에서 문자셋을 사용하는 스크립트는 도 2a에 도시한 바와 같은 스크립트가 존재한다. 이와 같이 동일한 스크립트를 사용하는 카테고리별로 언어를 나누었을 경우, 도 2b에 도시한 바와 같이 약 8개의 카테고리로 나눌 수 있고, 재생장치의 텍스트 생성기가 최소 1개 이상의 카테고리를 처리할 수 있음을 나타내는 정보를 시스템 파라미터의 형태로 저장하고 있다. 이렇게 하여 카테고리 내에 포함되는 모든 스크립트들을 처리할 수 있다. 따라서 복수의 언어로 만들어진 텍스트 데이터가 저장된 정보저장매체가 재생장치에 의해 재생될 때, 사용자에 의해 특정 언어가 선택되면 재생장치는 텍스트 데이터가 만들어진 유니코드 값을 근거로, 사용해야 할 스크립트를 알아낸 후, 재생장치 내의 텍스트 생성기가 렌더링 할 수 있는 스크립트인가를 시스템 파라미터 등에 저장되어 있는 카테고리 정보를 참조하여 결정한다.As a script, a script used to represent a character set in Unicode is used. In the script using the character set in Unicode, there is a script as shown in Figure 2a. When the languages are divided by categories using the same script as described above, the information can be divided into about eight categories as shown in FIG. 2B, and the information indicating that the text generator of the playback apparatus can process at least one category. Is stored in the form of a system parameter. This way you can process all the scripts that fall into the category. Therefore, when an information storage medium storing text data made in a plurality of languages is played by the playback device, if a specific language is selected by the user, the playback device finds a script to use based on the Unicode value in which the text data is created. Then, it is determined by referring to category information stored in system parameters or the like whether the text generator in the playback apparatus can render the script.

또한, 재생장치의 시스템 파라미터로 해당 재생장치가 지원하는 언어에 대응하는 스크립트 카테고리 정보를 지정하고, 재생장치에 포함된 텍스트 생성기는 지정된 스크립트 카테고리 정보(202)에 해당하는 스크립트 정보들만을 포함하면 되므로, 적은 리소스를 이용하여 특정 지역 언어 전용의 재생장치를 제공할 수 있다.In addition, since the script category information corresponding to the language supported by the playback apparatus is specified as a system parameter of the playback apparatus, the text generator included in the playback apparatus may include only the script information corresponding to the specified script category information 202. In addition, a playback device dedicated to a specific local language can be provided using less resources.

도 3은 본 발명에 따른 재생장치의 블록도이다.3 is a block diagram of a playback apparatus according to the present invention.

텍스트 데이터 처리부(320)는 텍스트 데이터를 렌더링한다. 텍스트 데이터는 정보저장매체에 기록되어 있거나 재생 장치에 구비된 메모리에 기록되어 있을 수 있다. 도 3에서는 텍스트 데이터가 기록되어 있는 정보저장매체 또는 메모리를 텍스트 데이터 저장부(300)라 하였다.The text data processor 320 renders the text data. The text data may be recorded in an information storage medium or in a memory provided in the reproduction device. In FIG. 3, an information storage medium or memory in which text data is recorded is referred to as a text data storage unit 300.

재생중인 동영상에 대응하여 제작된 텍스트 데이터 파일과 텍스트 데이터의 렌더링에 사용될 폰트 데이터가 텍스트 데이터 저장부(300)로부터 읽혀져 버퍼(310)에 저장된다. 버퍼에 저장된 텍스트 데이터는 텍스트 데이터 처리부(320)로 전달되어 텍스트의 렌더링에 필요한 정보들을 파싱한다. 그리고, 텍스트 렌더링에 필요한 자막 텍스트, 폰트 정보, 렌더링 스타일 정보 등도 텍스트 데이터 처리부(320)에 전달되어 텍스트 데이터를 렌더링하여 비트맵 이미지를 생성하고 텍스트의 각 항목의 출력 시작 시간과 출력 완료 시간을 지정하여 출력 데이터를 만들어 프리젠테이션 엔진(330)으로 전송한다. 따라서, 텍스트 데이터 처리부(320), 텍스트의 언어 속성에 따라 분류된 스크립트 카테고리 정보를 추출하는 추출부(322)와 추출한 카테고리에 포함되어 있는 스크립트 정보들에 따라 텍스트 데이터를 렌더링하는 텍스트 생성기(324)를 포함한다.The text data file produced in correspondence with the video being played and the font data to be used for rendering the text data are read from the text data storage unit 300 and stored in the buffer 310. The text data stored in the buffer is transferred to the text data processor 320 to parse information necessary for rendering the text. Subtitle text, font information, and rendering style information necessary for text rendering are also passed to the text data processing unit 320 to render the text data to generate a bitmap image, and to specify an output start time and an output completion time of each item of text. To generate the output data and transmit it to the presentation engine 330. Accordingly, the text data processor 320, the extractor 322 for extracting the script category information classified according to the language property of the text, and the text generator 324 for rendering the text data according to the script information included in the extracted category. It includes.

프리젠테이션 엔진(330)은 텍스트 데이터 저장부(300)에 저장된 비트맵 텍스트 데이터와 텍스트 데이터 처리부(320)에 의해 렌더링된 텍스트 데이터를 합쳐 디스플레이 장치로 출력한다.The presentation engine 330 combines the bitmap text data stored in the text data storage unit 300 and the text data rendered by the text data processing unit 320 to output to the display device.

텍스트의 언어 속성에 따라 분류된 스크립트 카테고리 정보를 추출한다(S410). 추출된 스크립트 카테고리가 재생장치의 시스템 파라미터에 저장되어 있는 처리 가능한 스크립트인가를 판단하여(S420), 처리 가능한 스크립트 카테고리에 해당되면, 추출한 카테고리에 포함되어 있는 스크립트 정보들에 따라 상기 텍스트 데이터를 렌더링한다(S430). 재생장치가 처리할 수 없는 스크립트 카테고리이면 텍스트 데이터를 처리할 수 없으므로 종료한다.Script category information classified according to the language attribute of the text is extracted (S410). It is determined whether the extracted script category is a processable script stored in a system parameter of the playback apparatus (S420), and if it corresponds to a processable script category, the text data is rendered according to the script information included in the extracted category. (S430). If the script category cannot be processed by the playback device, the text data cannot be processed, thus ending.

전술한 바와 같이 본 발명에 따르면, 재생장치의 텍스트 생성기가 처리할 수 있는 언어 정보로써, 스크립트 단위로 분류된 스크립트 카테고리 정보를 저장하고 이 정보를 이용하여 텍스트 데이터를 처리함으로써, 재생장치의 리소스 낭비를 방지할 수 있다.As described above, according to the present invention, as the language information that the text generator of the playback apparatus can process, the script category information classified by the script is stored and the text data is processed using this information, thereby wasting resources of the playback apparatus. Can be prevented.

또한, 재생장치가 지원해야 하는 특정 지역언어에만 해당되는 스크립트 카테고리 정보를 재생장치의 시스템 파라미터로 지정하고, 지정된 카테고리에 포함되는 스크립트 정보들만을 재생장치의 텍스트 생성기가 포함하도록 함으로써, 제한된 리소스를 갖는 재생장치에서 특정 지역 언어 전용의 텍스트 생성기를 제공할 수 있다.한편, 전술한 텍스트 데이터 처리방법은 컴퓨터 프로그램으로 작성 가능하다. 상기 프로그램을 구성하는 코드들 및 코드 세그먼트들은 당해 분야의 컴퓨터 프로그래머에 의하여 용이하게 추론될 수 있다. 또한, 상기 프로그램은 컴퓨터가 읽을 수 있는 정보저장매체(computer readable media)에 저장되고, 컴퓨터에 의하여 읽혀지고 실행됨으로써 텍스트 데이터 처리방법을 구현한다. 상기 정보저장매체는 자기 기록매체, 광 기록매체, 및 캐리어 웨이브 매체를 포함한다.In addition, by specifying the script category information corresponding to a specific local language that the playback apparatus should support as a system parameter of the playback apparatus, and including only the script information included in the designated category by the text generator of the playback apparatus, The playback apparatus can provide a text generator for a specific local language. [0026] Meanwhile, the above-described text data processing method can be created by a computer program. Codes and code segments constituting the program can be easily inferred by a computer programmer in the art. In addition, the program is stored in a computer readable media, and read and executed by a computer to implement a text data processing method. The information storage medium includes a magnetic recording medium, an optical recording medium, and a carrier wave medium.

이제까지 본 발명에 대하여 그 바람직한 실시예들을 중심으로 살펴보았다. 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자는 본 발명이 본 발명의 본질적인 특성에서 벗어나지 않는 범위에서 변형된 형태로 구현될 수 있음을 이해할 수 있을 것이다. 그러므로 개시된 실시예들은 한정적인 관점이 아니라 설명적인 관점에서 고려되어야 한다. 본 발명의 범위는 전술한 설명이 아니라 특허청구범위에 나타나 있으며, 그와 동등한 범위 내에 있는 모든 차이점은 본 발명에 포함된 것으로 해석되어야 할 것이다.So far I looked at the center of the preferred embodiment for the present invention. Those skilled in the art will appreciate that the present invention can be implemented in a modified form without departing from the essential features of the present invention. Therefore, the disclosed embodiments should be considered in descriptive sense only and not for purposes of limitation. The scope of the present invention is shown in the claims rather than the foregoing description, and all differences within the scope will be construed as being included in the present invention.

또한, 제한된 리소스를 가지는 재생장치에서 지원되는 언어에 대응하는 스크립트 카테고리 정보를 시스템 파라미터로 지정하고, 지정된 카테고리에 포함되는 스크립트 정보만을 포함함으로써 보다 효율적으로 특정 지역언어 전용의 재생장치를 구현할 수 있다.In addition, by specifying script category information corresponding to a language supported by a playback apparatus having limited resources as a system parameter, and including only script information included in the designated category, a playback apparatus for a specific local language can be implemented more efficiently.

Claims

In the text data processing method,

extracting script category information classified according to a language attribute of the text; And

(b) rendering the text data according to script information included in the extracted category.

The method of claim 1,

The script category information includes a plurality of script information, wherein the script is a plurality of Unicode symbols are bundled and processed as a unit.

The method of claim 2,

The script is a text data processing method, characterized in that the script used to represent the character set in Unicode.

The method of claim 1,

And the script category information indicates information about a language supported by the playback device.

The method of claim 4, wherein

The script category information is stored as a system parameter of the playback apparatus.

Text data encoded in a plurality of languages; And

And information on a script category classified according to a language attribute of the text data.

The method of claim 6,

The method of claim 7, wherein

The script is an information storage medium, characterized in that the script used to represent the character set in Unicode.

The method of claim 6,

The script category information indicates information on a language supported by a playback device.

The method of claim 9,

The script category information is stored as a system parameter of the playback device.

In the text data processing apparatus,

An extraction unit for extracting script category information classified according to the language attribute of the text; And

And a text generator configured to render the text data according to the script information included in the extracted category.

The method of claim 11,

The method of claim 12,

The script is a text data processing apparatus, characterized in that the script used to represent the character set in Unicode.

The method of claim 11,

And the script category information indicates information about a language supported by a playback device.

The method of claim 14,

And the script category information is stored as a system parameter of the playback apparatus.

A text data storage unit for storing text data encoded in a plurality of languages and script category information classified according to attributes of the languages; And

And a text data processor configured to read the text data and render the text data according to script information included in the script category information.

The method of claim 16,

And a system parameter storage unit for storing script information that can be processed by the playback device as a system parameter.

(b) rendering the text data according to the script information included in the extracted category; a computer-readable recording medium having recorded thereon a program for executing the text data processing method on a computer. .