CN114428658B - Method for displaying Burmese by set top box - Google Patents
Method for displaying Burmese by set top box
- Publication number
- CN114428658B (application CN202210088977.8A)
- Authority
- CN
- China
- Prior art keywords
- innwa
- win
- top box
- set top
- unicode
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/44—Arrangements for executing specific programs
- G06F9/451—Execution arrangements for user interfaces
- G06F9/454—Multi-language systems; Localisation; Internationalisation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/12—Use of codes for handling textual entities
- G06F40/126—Character encoding
- G06F40/129—Handling non-Latin characters, e.g. kana-to-kanji conversion
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/47—End-user applications
- H04N21/485—End-user interface for client configuration
- H04N21/4856—End-user interface for client configuration for language selection, e.g. for the menu or subtitles
Abstract
The invention discloses a method for displaying Burmese on a set top box. Existing methods cannot use a Win Innwa font library directly to display Burmese and therefore cannot meet the multi-language requirements of the product. In the method, a Unicode section that is not used in the set top box's UTF-8 font library is first selected, and the Win Innwa characters are placed one by one at the corresponding offset addresses; the set top box then adds the offset to each Win Innwa code to convert it into a Unicode code, and uses that Unicode code to draw and display the Burmese character located in the designated Unicode section of the extended font library. The method supports display of Win Innwa encoded service information sent by an operator as well as display of locally stored Burmese strings, so that Burmese and UTF-8 encoded text are supported simultaneously with only small software changes, meeting the requirements of Burmese operators.
Description
Technical Field
The invention belongs to the field of set top boxes, and particularly relates to a method for displaying Burmese by a set top box.
Background
ASCII (American Standard Code for Information Interchange) is a character encoding system based on the Latin alphabet. Its code range is 0x00 to 0x7F, and it is mainly used to display modern English. Its extended version, EASCII (Extended ASCII), partially supports other Western European languages; it is formed by extending ASCII from 7 bits to 8 bits. EASCII codes range from 0x00 to 0xFF, and the added symbols include table symbols, calculation symbols, Greek letters and special Latin symbols.
Unicode is an industry standard in computer science that comprises a character set, encoding schemes and more. It was created to overcome the limitations of traditional character encoding schemes: it assigns a uniform and unique binary code to every character of every language, so as to meet the needs of cross-language and cross-platform text conversion and processing. In Unicode, the code of a character and the way that code is implemented are distinct. The Unicode code of a character is fixed, but in actual transmission the code is implemented in different ways, because system platforms differ in design and in order to save space.
An implementation of Unicode is called a Unicode Transformation Format (UTF). UTF-8 is one of the most common of these formats. It is a variable-length encoding whose purpose is to save storage space: every valid code point in the Unicode character set is encoded in one to four bytes.
For backward compatibility with ASCII, UTF-8 encodes the first 128 Unicode characters in a single byte with the same binary value as in ASCII, in one-to-one correspondence with the ASCII characters, so that existing software for processing ASCII text can continue to be used with little or no modification. UTF-8 has been the dominant encoding on the World Wide Web since 2009. In Unicode, a block (section) is defined as a range of consecutive code points. Each section has a unique name, and sections do not overlap. For example:
the Basic Latin section ranges from U+0000 to U+007F;
the Latin-1 Supplement section ranges from U+0080 to U+00FF;
the Braille Patterns section ranges from U+2800 to U+28FF;
the CJK Unified Ideographs section ranges from U+4E00 to U+9FFF.
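The section ranges and the one-to-four-byte property of UTF-8 can be checked with a short Python sketch. The `BLOCKS` table holds only the four sections listed above, and the helper names `block_of` and `utf8_length` are illustrative assumptions, not part of the patent.

```python
# Section ranges copied from the list above; this is not a full Unicode block table.
BLOCKS = [
    ("Basic Latin", 0x0000, 0x007F),
    ("Latin-1 Supplement", 0x0080, 0x00FF),
    ("Braille Patterns", 0x2800, 0x28FF),
    ("CJK Unified Ideographs", 0x4E00, 0x9FFF),
]


def block_of(code_point):
    """Return the name of the listed section containing code_point, or None."""
    for name, first, last in BLOCKS:
        if first <= code_point <= last:
            return name
    return None


def utf8_length(code_point):
    """Return how many bytes UTF-8 needs for a single code point."""
    return len(chr(code_point).encode("utf-8"))


for cp in (0x62, 0xE9, 0x2862, 0x4E2D):
    print(f"U+{cp:04X}: {block_of(cp)}, {utf8_length(cp)} byte(s)")
```

Because the sections do not overlap, a code point belongs to at most one entry, which is why a simple linear search is enough here.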
Burmese is written in a script of the Brahmic family, characterized by rounded letter shapes. The script has a large number of characters, most of which are formed by combining several hundred basic glyphs. For historical reasons, Burmese long had no official unified encoding standard, and many font vendors designed and released their own Burmese encodings, including Win Innwa, Myazedi and Zawgyi. Win Innwa was the earliest Burmese font to circulate nationwide in Myanmar; it was developed within the ASCII encoding system and uses single-byte codes. In essence it replaces the shapes of the English glyphs and is therefore incompatible with English. The correspondence between UTF-8 encoded Latin characters and Win Innwa encoded Burmese characters is, for example, as follows:
character code 0x21: English '!', Burmese character (glyph not reproduced);
character code 0x62: English 'b', Burmese character (glyph not reproduced);
character code 0x63: English 'c', Burmese character (glyph not reproduced).
Although the Myanmar government announced in 2019 that the Unicode standard would be adopted nationwide, Burmese program operators continue to offer their services in Win Innwa encoding, both to support products already sold and because of the cost of upgrading their systems. Program names, electronic program guides and similar data are all Win Innwa encoded. The operator transmits Win Innwa strings using a private protocol that defines a start identifier '((' and an end identifier '))'; the data between the two identifiers is a Win Innwa encoded string. Set top boxes typically use a standard UTF-8 font library to support multi-language display, and because Win Innwa encoding is incompatible with UTF-8 encoding, the two conflict. If the Win Innwa font library is used directly to display Burmese, Latin characters are displayed abnormally; the multi-language requirement of the product cannot be met, Win Innwa encoded Burmese and UTF-8 encoded strings cannot be supported at the same time, and the requirements of Burmese operators cannot be satisfied.
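The conflict can be illustrated with the three example codes 0x21, 0x62 and 0x63. The sketch below only shows what a standard UTF-8 decoder makes of those bytes; the function name `decode_as_utf8` is an assumption for illustration.

```python
# The three single-byte Win Innwa codes used as examples in the text.
win_innwa_bytes = bytes([0x21, 0x62, 0x63])


def decode_as_utf8(data):
    """Decode bytes the way a standard UTF-8 text path would."""
    return data.decode("utf-8")


# Every byte is below 0x80, so UTF-8 treats each one as an ASCII character.
latin_view = decode_as_utf8(win_innwa_bytes)
print(latin_view)  # prints "!bc"
```

A UTF-8 font therefore draws Latin glyphs for these bytes, while a Win Innwa font would draw Burmese glyphs for the same bytes; one font library cannot serve both readings unless the Win Innwa glyphs are moved to other code points.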
Invention patent No. 201010577378.X discloses a method for displaying Burmese on a set top box. It converts the character strings of a Burmese font library on a PC into dot-matrix data and records the width and height of each character string in the dot-matrix font library; character strings that must be combined with other character strings are registered, together with the starting coordinates and positions of the character strings taking part in the combination. When the set top box displays Burmese, it looks up the required character string in the dot-matrix font library by character code, retrieves its data to obtain its width and height, reads the next character code to be combined, retrieves that character string, and combines it according to the starting coordinates and position of the previous one, completing the display of a full Burmese character. Although this method solves the problem of displaying Burmese, the conversion to a dot-matrix font library means that standard font processing functions, such as font size adjustment, cannot be used.
Disclosure of Invention
The invention aims to solve the problem that a set top box cannot display UTF-8 encoded strings and Win Innwa encoded Burmese at the same time. It provides a method for displaying Burmese on a set top box that supports UTF-8 encoded multi-language display and Win Innwa encoded Burmese display simultaneously.
The invention is realized by the following technical scheme:
first, the Win Innwa characters are mapped one by one into the UTF-8 font library to obtain an extended font library; the set top box then converts each acquired Win Innwa character code into a Unicode code and displays Burmese using the extended font library. Specifically:
a Unicode section that is not used in the set top box's UTF-8 font library is selected, and a font editing tool is used to place the Win Innwa characters one by one at the corresponding offset addresses; after mapping, the extended font library supports both UTF-8 encoding and Win Innwa encoding;
the set top box adds the offset to each Win Innwa code, converting it into a Unicode code, and uses the Unicode code to draw and display the Burmese character in the designated Unicode section of the extended font library.
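The conversion step amounts to adding one constant to every single-byte code. The following is a minimal sketch, assuming the offset 0x2800 that the embodiment later in this document uses as its example; the function name `win_innwa_to_unicode` is hypothetical.

```python
OFFSET = 0x2800  # assumption: start of the section chosen in the embodiment


def win_innwa_to_unicode(codes, offset=OFFSET):
    """Add the section offset to each single-byte Win Innwa code.

    `codes` may be a bytes object or any iterable of integers.
    """
    result = []
    for code in codes:
        if not 0x00 <= code <= 0xFF:
            raise ValueError(f"not a single-byte Win Innwa code: {code:#x}")
        result.append(chr(offset + code))
    return "".join(result)


text = win_innwa_to_unicode(bytes([0x21, 0x62, 0x63]))
print([f"U+{ord(ch):04X}" for ch in text])
```

The resulting string contains ordinary Unicode code points, so it can be passed to the same drawing path as any other UTF-8 text once the extended font library is installed.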
The set top box uses the extended font library to display the operator's Win Innwa encoded Burmese, specifically as follows:
step (1), the set top box receives service information from the operator; the operator transmits Win Innwa strings using a private protocol that defines a start identifier and an end identifier, and the data between the two identifiers is a Win Innwa encoded string;
step (2), the set top box detects the start identifier and the end identifier and takes out the Win Innwa codes;
step (3), the set top box adds the offset to each Win Innwa code, converting it into a Unicode code;
step (4), the set top box uses the Unicode codes to draw and display the Burmese characters in the designated Unicode section of the extended font library.
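Steps (1) to (3) can be sketched as a small parser. The identifiers `((` and `))` and the offset 0x2800 are taken from the example in the embodiment; the function name `decode_service_text` and the assumption that text outside the identifiers is UTF-8 are illustrative, since the private protocol is not specified in detail.

```python
START, END = b"((", b"))"  # identifiers as used in the embodiment's example
OFFSET = 0x2800            # assumption: section offset from the embodiment


def decode_service_text(raw):
    """Return display text for service bytes that may embed Win Innwa runs."""
    parts = []
    pos = 0
    while True:
        start = raw.find(START, pos)
        if start < 0:
            break
        end = raw.find(END, start + len(START))
        if end < 0:
            break  # unterminated run: leave the remainder untouched
        # Text before the start identifier is assumed to be plain UTF-8.
        parts.append(raw[pos:start].decode("utf-8"))
        payload = raw[start + len(START):end]
        # Step (3): add the offset to every Win Innwa byte.
        parts.append("".join(chr(OFFSET + b) for b in payload))
        pos = end + len(END)
    parts.append(raw[pos:].decode("utf-8"))
    return "".join(parts)


print(decode_service_text(b"News ((tm&S)) HD").encode("unicode_escape"))
```

This sketch does not handle a Win Innwa payload that itself contains the end identifier bytes; how the real protocol resolves that case is not stated in the source.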
The set top box uses the extended font library to display its built-in Burmese as follows:
step (I), the local string to be displayed by the set top box is encoded in Win Innwa;
step (II), the offset is added to each Win Innwa code, converting it into a Unicode code;
step (III), the set top box uses the Unicode codes to draw and display the Burmese characters in the designated Unicode section of the extended font library.
In this method the Win Innwa characters are mapped one by one into the UTF-8 font library, and the resulting extended font library supports UTF-8 encoding and Win Innwa encoding at the same time. It supports both the display of Win Innwa encoded service information sent by an operator and the display of locally stored Burmese strings. The method thus supports Burmese and UTF-8 encoded display simultaneously with only small software changes, and meets the requirements of Burmese operators.
Drawings
FIG. 1 is a flow chart of mapping Win Innwa characters into the extended font library;
FIG. 2 is a flow chart of displaying the operator's Win Innwa encoded Burmese using the extended font library;
FIG. 3 is a flow chart of displaying the set top box's built-in Burmese using the extended font library;
FIG. 4 is a schematic diagram of extended font library processing and character display.
Detailed Description
Embodiments of the present invention will be described in detail below with reference to the drawings.
In a method for displaying Burmese on a set top box, the Win Innwa characters are first mapped one by one into the UTF-8 font library to obtain an extended font library; the set top box then converts each acquired Win Innwa character code into a Unicode code and displays Burmese using the extended font library.
FIG. 1 is a flow chart of mapping the Win Innwa characters one by one into the UTF-8 font library. The Win Innwa code range is 0x00 to 0xFF. A Unicode section that is not used in the set top box's UTF-8 font library is selected, and a font editing tool is used to place the Win Innwa characters one by one at the corresponding offset addresses; the extended font library obtained in this way supports UTF-8 encoding and Win Innwa encoding at the same time. The specific flow is as follows:
step (1), select a Unicode section that is not used by the set top box's UTF-8 font; this embodiment takes the Braille Patterns section as an example, whose code range is U+2800 to U+28FF, giving an offset of 0x2800;
step (2), the Win Innwa code range is 0x00 to 0xFF; using a font editing tool, place the Win Innwa characters one by one at the offset addresses of the selected Unicode section, for example:
the character with Win Innwa code 0x21 is placed at Unicode code point U+2821;
the character with Win Innwa code 0x62 is placed at Unicode code point U+2862;
the character with Win Innwa code 0x63 is placed at Unicode code point U+2863.
The extended font library obtained by this mapping supports Win Innwa Burmese characters without affecting the original Unicode sections. The set top box displays Win Innwa encoded Burmese sent by the operator, and its own built-in Burmese, in the manner described below.
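The mapping step can be modelled on a character map alone. The sketch below represents a font's character map as a plain dictionary from code point to glyph name; it is an assumption-laden model, not the font editing tool the text refers to, and `extend_cmap` and all glyph names are hypothetical.

```python
def extend_cmap(base_cmap, win_innwa_glyphs, offset):
    """Return a new character map with Win Innwa glyphs placed at offset + code.

    base_cmap:        dict of Unicode code point -> glyph name (original font)
    win_innwa_glyphs: dict of single-byte Win Innwa code -> glyph name
    offset:           first code point of the unused 256-entry section
    """
    section = range(offset, offset + 0x100)
    # The method requires a section the original font does not use.
    used = sorted(cp for cp in base_cmap if cp in section)
    if used:
        raise ValueError(f"section already used at U+{used[0]:04X}")
    extended = dict(base_cmap)  # original entries are left untouched
    for code, glyph in win_innwa_glyphs.items():
        if not 0x00 <= code <= 0xFF:
            raise ValueError(f"not a single-byte code: {code:#x}")
        extended[offset + code] = glyph
    return extended


base = {0x21: "exclam", 0x62: "b", 0x63: "c"}
burmese = {0x21: "wi_21", 0x62: "wi_62", 0x63: "wi_63"}
extended = extend_cmap(base, burmese, 0x2800)
print(sorted(f"U+{cp:04X}" for cp in extended))
```

Checking that the section is unused before merging mirrors the requirement that the original Unicode sections are not affected: after the merge, U+0021 still resolves to the Latin glyph and U+2821 to the Win Innwa glyph.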
FIG. 2 shows the flow by which the set top box uses the extended font library to display the operator's Win Innwa encoded Burmese, specifically as follows:
step (1), the set top box receives service information from the operator; the operator transmits Win Innwa strings using a private protocol that defines a start identifier (such as '((') and an end identifier (such as '))'), and the data between the two identifiers is a Win Innwa encoded string;
step (2), the set top box detects the start identifier and the end identifier and takes out the Win Innwa codes;
step (3), the set top box adds the offset to each Win Innwa code, converting it into a Unicode code; the offset added is that of the Unicode section selected when the font library was extended, which in this embodiment is 0x2800;
step (4), the set top box uses the Unicode codes to draw and display the Burmese characters in the designated Unicode section of the extended font library.
Taking the received operator Win Innwa string "((tm&S))" as an example:
the start identifier and the end identifier are detected and the Win Innwa code 'tm&S' is taken out; its code sequence is 0x74 0x6D 0x26 0x53, corresponding to four Burmese characters that together form one Burmese string (glyphs not reproduced). Adding the offset converts the sequence into the Unicode codes "0x2874 0x286D 0x2826 0x2853". Using these Unicode codes, the set top box draws the Burmese characters from the designated Unicode section of the extended font library, so that the Burmese string is displayed.
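The arithmetic of this worked example can be reproduced in a few lines of Python. The variable names are illustrative; the byte values and the offset 0x2800 come from the example itself.

```python
OFFSET = 0x2800   # section offset used in this embodiment
payload = b"tm&S"  # Win Innwa bytes taken out between the identifiers

# Hexadecimal view of the single-byte Win Innwa codes.
win_innwa_codes = [f"0x{b:02X}" for b in payload]
# Each Unicode code is the Win Innwa code plus the section offset.
unicode_codes = [f"0x{OFFSET + b:04X}" for b in payload]

for src, dst in zip(win_innwa_codes, unicode_codes):
    print(f"{src} + 0x{OFFSET:04X} = {dst}")
```

Each line of output pairs one Win Innwa code with its converted Unicode code, matching the sequences quoted in the example.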
FIG. 3 is a flow chart of the set top box using the extended font library to display its built-in Burmese, specifically as follows:
step (1), the local string to be displayed by the set top box is encoded in Win Innwa;
step (2), the offset is added to each Win Innwa code, converting it into a Unicode code;
step (3), the Unicode codes are used to draw and display the Burmese characters in the designated Unicode section of the extended font library.
Taking as an example a Burmese string to be displayed whose meaning is "Asia" (glyphs not reproduced): it consists of four Burmese characters, which convert into the Win Innwa code sequence "0x74 0x6D 0x26 0x53"; adding the offset converts this into the Unicode section codes "0x2874 0x286D 0x2826 0x2853"; using these Unicode codes, the set top box draws the Burmese characters from the designated Unicode section of the extended font library, so that the Burmese string is displayed.
FIG. 4 is a schematic diagram of extended font library processing and character display. It shows how the extended font library is processed and which character is displayed for the code 0x21 under Win Innwa encoding and under UTF-8 encoding.
The main technical points of the invention are:
1. First, the Win Innwa characters are mapped one by one into the UTF-8 font library. The Win Innwa code range is 0x00 to 0xFF; a Unicode section that is not used in the set top box's UTF-8 font library is selected, and a font editing tool is used to place the Win Innwa characters one by one at the corresponding offset addresses. The resulting extended font library supports UTF-8 encoding and Win Innwa encoding at the same time;
2. The extended font library supports Win Innwa Burmese characters without affecting the original Unicode sections. The set top box receives service information from the operator, who transmits Win Innwa strings using a private protocol in which the data between a start identifier and an end identifier is a Win Innwa encoded string. The set top box detects the two identifiers, takes out the Win Innwa codes, adds the offset to each code to convert it into a Unicode code, and uses the Unicode codes to draw and display the Burmese characters in the designated Unicode section of the extended font library;
3. Local strings are also supported: the string to be displayed is encoded in Win Innwa, the offset is added to each Win Innwa code to convert it into a Unicode code, and the Unicode codes are used to draw and display the Burmese characters in the designated Unicode section of the extended font library;
4. By mapping the Win Innwa characters one by one into the UTF-8 font library, the invention obtains an extended font library that supports UTF-8 encoding and Win Innwa encoding at the same time. It supports both the display of Win Innwa encoded service information sent by an operator and the display of locally stored Burmese strings. Burmese and UTF-8 encoded display are supported simultaneously with only small software changes, meeting the requirements of Burmese operators.
Claims (1)
1. A method for displaying Burmese on a set top box, characterized in that: Win Innwa characters are first mapped one by one into a UTF-8 font library to obtain an extended font library; the set top box then converts each acquired Win Innwa character code into a Unicode code and displays Burmese using the extended font library; the method specifically comprises the following:
a Unicode section that is not used in the UTF-8 font library of the set top box is selected, and a font editing tool is used to place the Win Innwa characters one by one at the corresponding offset addresses, the extended font library obtained after mapping supporting UTF-8 encoding and Win Innwa encoding at the same time;
the set top box adds the offset to each Win Innwa code, converting it into a Unicode code; the set top box uses the Unicode code to draw and display the Burmese character in the designated Unicode section of the extended font library;
the set top box uses the extended font library to display the operator's Win Innwa encoded Burmese, specifically as follows:
step (1), the set top box receives service information from the operator; the operator transmits Win Innwa strings using a private protocol that defines a start identifier and an end identifier, and the data between the start identifier and the end identifier is a Win Innwa encoded string;
step (2), the set top box detects the start identifier and the end identifier and takes out the Win Innwa codes;
step (3), the set top box adds the offset to each Win Innwa code, converting it into a Unicode code, the offset added being that of the Unicode section selected when the font library was extended;
step (4), the set top box uses the Unicode codes to draw and display the Burmese characters in the designated Unicode section of the extended font library;
the set top box uses the extended font library to display its built-in Burmese as follows:
step (I), the local string to be displayed by the set top box is encoded in Win Innwa;
step (II), the offset is added to each Win Innwa code, converting it into a Unicode code;
step (III), the set top box uses the Unicode codes to draw and display the Burmese characters in the designated Unicode section of the extended font library.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210088977.8A CN114428658B (en) | 2022-01-25 | 2022-01-25 | Method for displaying Burmese by set top box |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114428658A (en) | 2022-05-03 |
CN114428658B (en) | 2024-03-08 |
Family
ID=81312736
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210088977.8A Active CN114428658B (en) | 2022-01-25 | 2022-01-25 | Method for displaying Burmese by set top box |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114428658B (en) |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1592372A (en) * | 2003-08-26 | 2005-03-09 | 扬智科技股份有限公司 | Method for displaying dibit word set and image sound encoding / replaying /copying system |
CN101888505A (en) * | 2010-06-13 | 2010-11-17 | 杭州海康威视数字技术股份有限公司 | Method and device for displaying characters on screen |
CN102141916A (en) * | 2011-01-19 | 2011-08-03 | 深圳创维数字技术股份有限公司 | Embedded equipment and method for displaying language word on OSD interface |
CN109086256A (en) * | 2018-07-12 | 2018-12-25 | 郑州众智科技股份有限公司 | A kind of generation and application method of the character library dot matrix that length is variable |
CN111178061A (en) * | 2019-12-20 | 2020-05-19 | 沈阳雅译网络技术有限公司 | Multi-lingual word segmentation method based on code conversion |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
TW591515B (en) * | 2003-08-04 | 2004-06-11 | Ali Corp | Method to display double byte character set by OSD and the decoding/playing/duplication system to update the display frame using the method |
Also Published As
Publication number | Publication date |
---|---|
CN114428658A (en) | 2022-05-03 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||