CN114428658B - Method for displaying Burmese by set top box - Google Patents

Method for displaying Burmese by set top box Download PDF

Info

Publication number
CN114428658B
CN114428658B CN202210088977.8A CN202210088977A CN114428658B CN 114428658 B CN114428658 B CN 114428658B CN 202210088977 A CN202210088977 A CN 202210088977A CN 114428658 B CN114428658 B CN 114428658B
Authority
CN
China
Prior art keywords
innwa
win
top box
set top
unicode
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202210088977.8A
Other languages
Chinese (zh)
Other versions
CN114428658A (en
Inventor
闫文志
刘建华
周真荣
沈丛则
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hangzhou Nationalchip Science & Technology Co ltd
Original Assignee
Hangzhou Nationalchip Science & Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hangzhou Nationalchip Science & Technology Co ltd filed Critical Hangzhou Nationalchip Science & Technology Co ltd
Priority to CN202210088977.8A priority Critical patent/CN114428658B/en
Publication of CN114428658A publication Critical patent/CN114428658A/en
Application granted granted Critical
Publication of CN114428658B publication Critical patent/CN114428658B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44Arrangements for executing specific programs
    • G06F9/451Execution arrangements for user interfaces
    • G06F9/454Multi-language systems; Localisation; Internationalisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/12Use of codes for handling textual entities
    • G06F40/126Character encoding
    • G06F40/129Handling non-Latin characters, e.g. kana-to-kanji conversion
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/485End-user interface for client configuration
    • H04N21/4856End-user interface for client configuration for language selection, e.g. for the menu or subtitles

Abstract

The invention discloses a method for displaying Burmese by a set top box. The existing method can not directly use the Win Innwa word stock to display the Myanmar, and can not meet the multi-language requirements of the product. Firstly, selecting a Unicode section which is not used in a UTF-8 code word library of a set top box, and putting Win Innwa characters into corresponding offset addresses one by one; the set top box increases the offset of Win Innwa codes one by one and converts the Win Innwa codes into Unicode codes; and drawing and displaying the Burmese characters in the designated Unicode coding section in the extended word library by using Unicode coding. The method of the invention supports the display of Win Innwa coding service information sent by an operator, supports the local character string coding display of Burmese, realizes the simultaneous support of Burmese and UTF-8 coding display on the basis of small software modification, and meets the requirements of Burmese operators.

Description

Method for displaying Burmese by set top box
Technical Field
The invention belongs to the field of set top boxes, and particularly relates to a method for displaying Burmese by a set top box.
Background
ASCII (American Standard Code for Information Interchange ) is a set of coding systems based on latin letters. The coding range is 0x00 to 0x7F, and is mainly used for displaying modern English. The Extended version of EASCII (Extended ASCII) can partially support other western European languages, and the EASCII is formed by extending an ASCII code from 7 bits to 8 bits. The EASCII codes range from 0x00 to 0xFF, and the extended symbols include table symbols, calculation symbols, greek letters and special pull Ding Fuhao.
Unicode (Unicode) is an industry standard in the field of computer science, including character sets, coding schemes, and the like. Unicode is generated to solve the limitations of traditional character coding schemes, and sets a uniform and unique binary code for each character in each language so as to meet the requirements of cross-language and cross-platform text conversion and processing. Unicode is implemented differently than coding. Unicode encoding of a character is deterministic. However, in the actual transmission process, the implementation of Unicode coding is different due to the inconsistent design of different system platforms and for the purpose of space saving.
The implementation of Unicode is called Unicode conversion format (Unicode Transformation Format, abbreviated UTF). UTF-8 is one of the most commonly seen coding formats. This is a variable length coding format, the purpose of which is to save memory space. All valid encoding points in the Unicode character set are encoded with one to four bytes.
UTF-8 for backward compatibility with ASCII code, the first 128 characters in Unicode are encoded with a single byte of the same binary value as the ASCII code, and the characters are in one-to-one correspondence with the characters of the ASCII code, which makes the original software for processing ASCII characters continue to be used without or with little modification. UTF-8 has been the most dominant form of encoding on the world Wide Web since 2009. In Unicode, a field is defined as a range of a set of consecutive code bits. The segments are given unique names and there is no overlap between segments. For example:
basic Latin letters (Basic Latin) range u+0000 to u+007F;
latin letters Supplement-1 (Latin-1 Supplement) segment ranges from U+0080 to U+00FF;
braille Patterns (Braille Patterns) range from U+2800 to U+28FF;
the region of the unified ideogram (CJK Unified Ideographs) of the middle Japanese and Korean ranges from U+4E00 to U+9FFF.
Burmese is a character belonging to the salomide family, and is characterized by being round in writing. The number of characters is large, and most of the characters are formed by combining hundreds of graphic characters. For a long time, there is no official unified code standard for Burmese for historical reasons, and many font manufacturers have devised to release different Burmese codes, including Win Innwa, myazedi, zawgyi, etc. Wherein Win Innwa is the Burmese font which circulates in the Burmese nationwide at the earliest, and Win Innwa is developed in an ASCII coding system for single-byte coding. Essentially, the English font is changed into a shape, and is incompatible with English, as follows, the corresponding relation between UTF-8 coding Latin characters and Win Innwa coding Burmese characters is as follows:
character encoding 0x21: english' ≡! 'Burmese' characterCharacter encoding 0x62: english 'b', burmese->Character encoding 0x63: english 'c', burmese->Although the burmese government has announced in 2019 that Unicode standards are uniformly adopted nationally, the burmese program operators still keep using Win Innwa to code the offered services in order to support the cost pressures of the products sold and the upgrade system. The program name, the electronic program list and the like of the operation program are all Win Innwa codes, an operator uses a private protocol to transmit Win Innwa character string codes, a start identifier is defined as '(', an end identifier is '))', and data between the start identifier and the end identifier is Win Innwa code character string. Typically, set top boxes use standard UTF-8 word libraries to support multi-language displays, and there are conflicts due to the incompatibility of Win Innwa coding with UTF-8 coding. Therefore, the Win Innwa word library cannot be directly used for displaying the Myanmar, the display of Latin characters is abnormal, the multi-language requirement of products cannot be met, the Win Innwa coded Myanmar and the UTF-8 coded character strings cannot be supported at the same time, and the product requirement of an Myanmar operator cannot be met.
The invention patent with the patent number 201010577378.X discloses a method for displaying Burmese by a set top box, which converts character strings of a Burmese character library on a PC into dot matrix data, simultaneously records the width and height data of each character string into the dot matrix character library, registers character strings which are required to be combined with other character strings in the Burmese character library, and registers the initial coordinates and positions of the character strings which participate in the combination in the dot matrix character library; when the set top box is to perform the Burmese display, the required character string in the dot matrix character library is searched according to the character codes, the character data of the character string is taken out to obtain the width and height of the character string, the next character code to be combined is read, the required character string is taken out, and the combination is performed according to the initial coordinates and the position of the previous character string, so that the display of a complete Burmese character is completed. Although this method solves the problem of the display of the Burmese, by converting to a dot matrix word stock, a standard word stock processing program, such as a word size adjustment, cannot be used.
Disclosure of Invention
The invention aims to solve the problem that a set top box cannot support to display a UTF-8 coding character string and Win Innwa coding Myanmar at the same time, and provides a method for displaying Myanmar by the set top box, so that the set top box can support to display the UTF-8 coding character string and the Myanmar coded Win Innwa at the same time, and can support to display a UTF-8 coding multilingual display and a Win Innwa coding Myanmar display at the same time.
The invention is realized by the following technical scheme:
firstly, win Innwa characters are mapped into a UTF-8 code word library one by one to obtain an extended word library; then the set top box converts the acquired Win Innwa character code into a Unicode code, and the Burmese is displayed by using an extended word stock; the method specifically comprises the following steps:
selecting a Unicode section which is not used in a UTF-8 code word library of the set top box, putting Win Innwa characters into corresponding offset addresses one by using a word library editing tool, and simultaneously supporting UTF-8 coding and Win Innwa coding by using an extended word library after mapping;
the set top box increases the offset of Win Innwa codes one by one and converts the Win Innwa codes into Unicode codes; and the set top box uses Unicode coding to draw and display the Burmese characters in the designated Unicode coding section in the extended word library.
The set top box uses an extended word stock to display Win Innwa code Burmese of an operator, and the method is specifically as follows:
step (1), the set top box receives service information of an operator, the operator uses a private protocol to transmit Win Innwa character string codes, a start identifier and an end identifier are defined, and data between the start identifier and the end identifier is Win Innwa code character strings;
step (2), the set top box detects a start identifier and an end identifier, and takes out Win Innwa codes;
step (3), the set top box increases the offset of Win Innwa codes one by one and converts the Win Innwa codes into Unicode codes;
and (4) the set top box uses Unicode coding to draw and display Burmese characters in a designated Unicode coding section in the extended word library.
The set top box uses the extended word stock to display the built-in Burmese of the set top box, and the method comprises the following steps:
step (I), the local character string to be displayed by the set top box is encoded by Win Innwa;
step (II) increasing offset of Win Innwa codes one by one, and converting the Win Innwa codes into Unicode codes;
and (III) the set top box uses Unicode coding to draw and display Burmese characters in a designated Unicode coding section in the extended word library.
According to the method, win Innwa characters are mapped to the UTF-8 coding word library one by one, and the mapped extended word library can support UTF-8 coding and Win Innwa coding simultaneously. On one hand, the display of Win Innwa coding service information sent by an operator is supported, and on the other hand, the display of Burmese by local character string coding is supported. The method of the invention realizes the simultaneous support of the display of the Burmese and the UTF-8 codes on the basis of small change of software, and meets the requirements of Burmese operators.
Drawings
FIG. 1 is a flow chart of the Win Innwa character mapping of the extended word stock of the present invention;
FIG. 2 is a flow chart showing the Win Innwa code Burmese of the operator using the extended word stock according to the present invention;
FIG. 3 is a flowchart showing the built-in Burmese of a set-top box using an extended word stock in accordance with the present invention;
FIG. 4 is a schematic diagram of the expanded word stock processing and character display of the present invention.
Detailed Description
Embodiments of the present invention will be described in detail below with reference to the drawings.
A method for displaying Burmese by set top box includes mapping Win Innwa characters to UTF-8 code word library one by one to obtain extended word library; and then the set top box converts the acquired Win Innwa character code into a Unicode code, and the Burmese is displayed by using the extended word stock.
FIG. 1 is a flow chart of mapping Win Innwa characters, wherein Win Innwa characters are mapped into a UTF-8 codeword library one by one. The Win Innwa coding range is 0x00 to 0xFF, a Unicode section which is not used in a UTF-8 coding word stock of the set top box is selected, win Innwa characters are placed into corresponding offset addresses one by using a word stock editing tool, and the mapped extended word stock supports UTF-8 coding and Win Innwa coding simultaneously. The specific flow is as follows:
selecting an unused Unicode coding section from UTF-8 coding fonts of the set top box, wherein a Braille pattern model (Braille Patterns) section is selected as an example in the embodiment, the coding range is U+2800 to U+28FF, and the offset is 0x2800;
step (2) Win Innwa codes range from 0x00 to 0xFF, and the Win Innwa code characters are put into offset addresses of the selected Unicode code coding sections one by using a word stock editing tool, for example:
win Innwa character encodes a character of 0x21Put into UTF-8 coding U+2821;
win Innwa character encodes a character of 0x62Put into UTF-8 coding U+2862;
win Innwa character encodes a character of 0x63Put into UTF-8 coding U+2863;
the mapped extended word library supports Win Innwa Burmese characters under the condition of not affecting the original Unicode coding section, and the set top box supports Win Innwa code Burmese display sent by an operator in the following manner and displays the built-in Burmese of the set top box.
Fig. 2 shows a procedure of displaying a remote control program in Win Innwa by the set top box using an extended word stock, specifically as follows:
step (1), the set top box receives service information of an operator, the operator uses a private protocol to transmit Win Innwa character string codes, a start identifier (such as ' (') and an end identifier (such as ')) are defined, and data between the start identifier and the end identifier is Win Innwa code character string;
step (2), the set top box detects a start identifier and an end identifier, and takes out Win Innwa codes;
step (3), the set top box increases the offset of Win Innwa codes one by one and converts the Win Innwa codes into Unicode codes; the added offset is the offset of the Unicode section selected when expanding the word stock; the offset in this embodiment is 0x2800;
and (4) the set top box uses Unicode coding to draw and display the Burmese characters in the designated Unicode coding section in the extended word library.
Taking the receiving operator Win Innwa code "((tm & S))" as an example:
detecting the start identifier and the end identifier, and taking out the Win Innwa code, namely' tm&S'; the Win Innwa coding sequence is 0x74 0x6D 0x26 0x53 and corresponds to four charactersAnd->I.e. Burmese character string +.>Increasing the offset to convert Unicode code into "0x2874 0x286D 0x28260x2853", and using Unicode code "0x2874 0x286D 0x28260x2853" the set top box uses the Burmese characters in the specified Unicode section in the extended word stock to draw and display, i.e. display as->
Fig. 3 is a flowchart of a set-top box using an extended word stock to display the built-in burmese of the set-top box, which is specifically as follows:
step (1) the local character string required to be displayed by the set top box is encoded by Win Innwa;
step (2) converting Win Innwa codes with one-to-one increasing offset into Unicode codes;
and (3) using Unicode coding to draw and display the Burmese characters in the designated Unicode coding section in the extended word library.
Burmese character string with required display semantic of AsiaFor example, it is composed of four Burmese charactersAnd->The conversion into Win Innwa code sequence is '0x74 0x6D 0x26 0x53'; adding an offset to convert to Unicode segment code "0x2874 0x286D 0x2826 0x2853"; the set top box uses Unicode code 0x2874 0x286D 0x28260x2853 to draw and display the Burmese characters in the designated Unicode section in the extended character library, namely, the characters are shown as +.>
FIG. 4 is a schematic diagram of the processing of an extended word stock and displaying characters according to the present invention, showing the procedure of the processing of the extended word stock and displaying the corresponding characters of the characters encoded as 0x21 under Win Innwa encoding and UTF-8 encoding.
The technical key points of the invention mainly comprise:
1. first, win Innwa characters need to be mapped into a UTF-8 coding word library one by one. The Win Innwa coding range is 0x00 to 0xFF, a Unicode section which is not used in a UTF-8 coding word library of the set top box is selected, win Innwa characters are placed into corresponding offset addresses one by using a word library editing tool, and the mapped extended word library can simultaneously support UTF-8 coding and Win Innwa coding;
2. the mapped extended word library supports Win Innwa Burmese characters under the condition of not affecting the original Unicode coding section. The method comprises the steps that a set top box receives business information of an operator, the operator uses a private protocol to transmit Win Innwa character string codes, data between a start identifier and an end identifier are Win Innwa code character strings, the set top box detects the start identifier and the end identifier, offset is added one by one after Win Innwa codes are taken out and converted into Unicode codes, and the Unicode codes are used for drawing and displaying Burma characters in a designated Unicode coding section in an extended character library;
3. on the other hand, supporting to display local character strings, using Win Innwa codes to display the character strings, then converting the Win Innwa codes with one-to-one increased offset into Unicode codes, and using the Unicode codes to draw and display the Burma characters in the designated Unicode code sections in the extended character library;
4. according to the invention, the Win Innwa characters are mapped to the UTF-8 code word library one by one, and the mapped extended word library can support UTF-8 coding and Win Innwa coding simultaneously. On one hand, the display of Win Innwa coding service information sent by an operator is supported, and on the other hand, the display of Burmese by local character string coding is supported. The display of the Burmese and UTF-8 codes is supported simultaneously on the basis of small software modification, and the requirements of Burmese operators are met.

Claims (1)

1. A method for displaying Burmese by a set top box, which is characterized in that: the method comprises the steps of firstly mapping Win Innwa characters into a UTF-8 coding word stock one by one to obtain an extended word stock; then the set top box converts the acquired Win Innwa character code into a Unicode code, and the Burmese is displayed by using an extended word stock; the method specifically comprises the following steps:
selecting a Unicode section which is not used in a UTF-8 code word library of the set top box, putting WinInnwa characters into corresponding offset addresses one by using a word library editing tool, and simultaneously supporting UTF-8 coding and Win Innwa coding by using an extended word library after mapping;
the set top box increases offset one by one for WinInnwa codes and converts the WinInnwa codes into Unicode codes; the set top box uses Unicode coding to draw and display the Burmese characters in the designated Unicode coding section in the extended character library;
the set top box uses an extended word stock to display Win Innwa code Burmese of an operator, and the method is specifically as follows:
step (1), the set top box receives service information of an operator, the operator uses a private protocol to transmit Win Innwa character string codes, a start identifier and an end identifier are defined, and data between the start identifier and the end identifier is Win Innwa code character strings;
step (2), the set top box detects a start identifier and an end identifier, and takes out Win Innwa codes;
step (3), the set top box increases the offset of Win Innwa codes one by one, converts the Win Innwa codes into Unicode codes, and the increased offset is the offset of a Unicode section selected when expanding a word stock;
the set top box uses Unicode coding to draw and display Burmese characters in a designated Unicode coding section in the extended word library;
the set top box uses the extended word stock to display the built-in Burmese of the set top box, and the method comprises the following steps:
step (I), the local character string to be displayed by the set top box is encoded by Win Innwa;
step (II) increasing offset of Win Innwa codes one by one, and converting the Win Innwa codes into Unicode codes;
and (III) the set top box uses Unicode coding to draw and display Burmese characters in a designated Unicode coding section in the extended word library.
CN202210088977.8A 2022-01-25 2022-01-25 Method for displaying Burmese by set top box Active CN114428658B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210088977.8A CN114428658B (en) 2022-01-25 2022-01-25 Method for displaying Burmese by set top box

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210088977.8A CN114428658B (en) 2022-01-25 2022-01-25 Method for displaying Burmese by set top box

Publications (2)

Publication Number Publication Date
CN114428658A CN114428658A (en) 2022-05-03
CN114428658B true CN114428658B (en) 2024-03-08

Family

ID=81312736

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210088977.8A Active CN114428658B (en) 2022-01-25 2022-01-25 Method for displaying Burmese by set top box

Country Status (1)

Country Link
CN (1) CN114428658B (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1592372A (en) * 2003-08-26 2005-03-09 扬智科技股份有限公司 Method for displaying dibit word set and image sound encoding / replaying /copying system
CN101888505A (en) * 2010-06-13 2010-11-17 杭州海康威视数字技术股份有限公司 Method and device for displaying characters on screen
CN102141916A (en) * 2011-01-19 2011-08-03 深圳创维数字技术股份有限公司 Embedded equipment and method for displaying language word on OSD interface
CN109086256A (en) * 2018-07-12 2018-12-25 郑州众智科技股份有限公司 A kind of generation and application method of the character library dot matrix that length is variable
CN111178061A (en) * 2019-12-20 2020-05-19 沈阳雅译网络技术有限公司 Multi-lingual word segmentation method based on code conversion

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TW591515B (en) * 2003-08-04 2004-06-11 Ali Corp Method to display double byte character set by OSD and the decoding/playing/duplication system to update the display frame using the method

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1592372A (en) * 2003-08-26 2005-03-09 扬智科技股份有限公司 Method for displaying dibit word set and image sound encoding / replaying /copying system
CN101888505A (en) * 2010-06-13 2010-11-17 杭州海康威视数字技术股份有限公司 Method and device for displaying characters on screen
CN102141916A (en) * 2011-01-19 2011-08-03 深圳创维数字技术股份有限公司 Embedded equipment and method for displaying language word on OSD interface
CN109086256A (en) * 2018-07-12 2018-12-25 郑州众智科技股份有限公司 A kind of generation and application method of the character library dot matrix that length is variable
CN111178061A (en) * 2019-12-20 2020-05-19 沈阳雅译网络技术有限公司 Multi-lingual word segmentation method based on code conversion

Also Published As

Publication number Publication date
CN114428658A (en) 2022-05-03

Similar Documents

Publication Publication Date Title
US5682158A (en) Code converter with truncation processing
US5793381A (en) Unicode converter
JP2502021B2 (en) Multibyte data conversion method and system
US5784069A (en) Bidirectional code converter
US10649704B2 (en) Global printing system and method for rendering encoding scheme independent labels having serialization data
US7251667B2 (en) Unicode input method editor
US5784071A (en) Context-based code convertor
US20040215647A1 (en) Processing fixed-format data in a unicode environment
EP1679614B1 (en) Method and apparatus for providing foreign language text display when encoding is not available
CN102567293A (en) Coded format detection method and coded format detection device for text files
KR20030072574A (en) Large character set browser
JPS6077222A (en) Terminal equipment for processing many kinds of characters
CN114428658B (en) Method for displaying Burmese by set top box
JP4451908B2 (en) Unicode converter
CN106406560B (en) Mechanical engineering character vector fonts output method and system in desktop operating system
CN100390783C (en) Method and system for case conversion
CN111428441B (en) Information system cross-platform application oriented Chinese character code conversion method and equipment
CN112015425A (en) Data storage method, data reading device, electronic equipment and storage medium
KR100755533B1 (en) Method and apparatus of generating character set
Korpela A tutorial on character code issues
KR19980070531A (en) Method and system for converting multi-byte character strings between exchange codes in a computer system
EP1152347B1 (en) Method to convert UNICODE text to mixed codepages
CN101739389A (en) Method and system for realizing Tibetan expansion sets conveniently
KR20040069880A (en) Unicode converting method for mobile terminal source file
Liu et al. Distributed multilingual applications of the OSI architecture

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant