WO2010069171A1 - Mobile terminal and method for implementing television caption dynamic scaling - Google Patents

Mobile terminal and method for implementing television caption dynamic scaling Download PDF

Info

Publication number
WO2010069171A1
WO2010069171A1 PCT/CN2009/072563 CN2009072563W WO2010069171A1 WO 2010069171 A1 WO2010069171 A1 WO 2010069171A1 CN 2009072563 W CN2009072563 W CN 2009072563W WO 2010069171 A1 WO2010069171 A1 WO 2010069171A1
Authority
WO
WIPO (PCT)
Prior art keywords
mobile terminal
subtitle
module
television program
terminal television
Prior art date
Application number
PCT/CN2009/072563
Other languages
French (fr)
Chinese (zh)
Inventor
张晓勇
Original Assignee
中兴通讯股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中兴通讯股份有限公司 filed Critical 中兴通讯股份有限公司
Publication of WO2010069171A1 publication Critical patent/WO2010069171A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41Structure of client; Structure of client peripherals
    • H04N21/414Specialised client platforms, e.g. receiver in car or embedded in a mobile appliance
    • H04N21/41407Specialised client platforms, e.g. receiver in car or embedded in a mobile appliance embedded in a portable device, e.g. video client on a mobile phone, PDA, laptop
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41Structure of client; Structure of client peripherals
    • H04N21/426Internal components of the client ; Characteristics thereof
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/435Processing of additional data, e.g. decrypting of additional data, reconstructing software from modules extracted from the transport stream
    • H04N21/4355Processing of additional data, e.g. decrypting of additional data, reconstructing software from modules extracted from the transport stream involving reformatting operations of additional data, e.g. HTML pages on a television screen
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/485End-user interface for client configuration
    • H04N21/4858End-user interface for client configuration for modifying screen layout parameters, e.g. fonts, size of the windows
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/488Data services, e.g. news ticker
    • H04N21/4884Data services, e.g. news ticker for displaying subtitles

Definitions

  • the present invention relates to the field of television based on mobile terminals, and more particularly to a mobile terminal and method for realizing dynamic zooming of television subtitles.
  • the manner of dynamically displaying subtitles on a mobile terminal television program can be divided into two types.
  • One method is that subtitles and video images are synthesized on the terminal side, which is referred to as a terminal synthesis subtitle.
  • the network side and the terminal side need to cooperate together.
  • the network side separately transmits the subtitle to be displayed and the video image of the mobile terminal television program to the terminal side, and is synthesized by the terminal side, wherein the network side transmits the UTF8 encoding of the subtitle to the terminal side in real time according to the progress of the television video image playing of the mobile terminal;
  • the subtitle to be displayed and the video image of the mobile terminal have been synthesized in advance on the network side, which is simply referred to as source synthesized subtitle, and the synthesized video image is transmitted as complete image data to the terminal side for display.
  • the subtitles of the mobile terminal television programs are relatively small, especially in foreign language programs, dialect programs, cultural programs, teaching programs, and the small subtitles cause the visibility of the mobile terminal television programs to be extremely poor. .
  • the technical problem to be solved by the present invention is to provide a mobile terminal and method for dynamically zooming TV subtitles, thereby dynamically scaling subtitles displayed on the mobile terminal television to achieve excellent visibility.
  • the present invention discloses a mobile terminal for realizing dynamic zooming of a television subtitle, wherein the mobile terminal comprises: a word map synthesis module, and a mobile terminal television function module and a subtitle connected to the word map synthesis module. a processing module and a user operation parsing module, wherein the user operation parsing module is configured to parse a subtitle size command currently initiated by the user, and send the user instruction to the word graph synthesizing module and the mobile terminal television function respectively Module
  • the mobile terminal television function module is configured to receive a mobile terminal television program, decode and obtain a mobile terminal television program image frame, and send the mobile terminal television program image frame to the word map when receiving the user instruction Synthesis module
  • the subtitle processing module is configured to send a subtitle in a mobile terminal television program to the word map synthesis module;
  • the word map synthesizing module is configured to change a subtitle size in the mobile terminal television program according to the user instruction, and superimpose the changed subtitle on the mobile terminal television program image frame.
  • the mobile terminal television function module is connected to the subtitle processing module, wherein the mobile terminal television function module is further configured to send a message to query whether the network side separately transmits the subtitle of the mobile terminal television program, if And the mobile terminal television function module notifies the subtitle processing module to receive the subtitle;
  • the subtitle processing module is further configured to separately receive a subtitle of the mobile terminal television program upon receiving the notification of the mobile terminal television function module, and send the subtitle to the word map synthesis module.
  • the subtitle processing module is further configured to separately receive subtitles of the mobile terminal television program, and send the subtitles to the word map synthesis module.
  • the mobile terminal television function module sends the message to the network side, if the network side confirms that the subtitle of the mobile terminal television program is not separately transmitted, the mobile terminal television function module is further used to Transmitting, by the mobile terminal, a television program image frame to the subtitle processing module;
  • the subtitle processing module is configured to identify a subtitle from a television program image frame of the mobile terminal, and send the subtitle to the word map synthesis module. Further, the subtitle processing module identifies the subtitle from the mobile terminal television program image frame by using neural network pattern recognition, genetic algorithm, wavelet analysis, or database-based feature comparison.
  • the mobile terminal further includes:
  • a font module configured to store a font to be displayed, when the font synthesis module changes the size of the subtitle in the mobile terminal television program
  • a display module configured to display, to the user, an image frame sent by the word map synthesis module or a mobile terminal television program image frame sent by the mobile terminal television function module to the sub-picture synthesis module.
  • the invention also discloses a method for realizing dynamic zooming of a mobile terminal television subtitle, wherein after the mobile terminal starts the mobile terminal television application service, if the user initiates a change of the subtitle size instruction, the mobile terminal changes according to the user's instruction.
  • the subtitle size of the mobile terminal television program received by itself, and the changed subtitle is superimposed on the image frame of the mobile terminal television program received by itself.
  • the method further includes: the mobile terminal sending a message asking whether the network side separately transmits the subtitle of the mobile terminal television program, and if yes, the mobile terminal separately receives the mobile Subtitles of the terminal television program, and changing the size of the received subtitles according to the instructions of the user.
  • the method further includes: the mobile terminal determining whether the caption of the mobile terminal television program is separately received, and if yes, the mobile terminal changes according to the user instruction The size of the subtitle received separately.
  • the mobile terminal After the mobile terminal sends the message, if the network side confirms that the subtitle of the mobile terminal television program is not separately transmitted, the mobile terminal receives the mobile terminal television program from itself. After the subtitle is recognized in the image frame, the subtitle size is changed according to the user's instruction.
  • the mobile terminal uses a neural network pattern recognition, a genetic algorithm, a wavelet analysis, or a database-based feature comparison to identify a subtitle from an image frame of the mobile terminal television program.
  • the mobile terminal changes the subtitle size of the mobile terminal television program received by itself, the mobile terminal also invokes the font to be displayed; After the changed subtitle is superimposed on the image frame of the mobile terminal television program received by itself, the image frame is displayed to the user.
  • the technical solution of the invention optimizes the visibility of the mobile terminal television, and solves the problem that the mobile terminal television subtitle is too small to be scaled.
  • FIG. 1 is a schematic structural diagram of a mobile terminal proposed in this embodiment
  • FIG. 2 is a schematic flow chart of a method for implementing dynamic zooming of a television subtitle by the mobile terminal shown in FIG. 1.
  • a mobile terminal for realizing dynamic zooming of television subtitles includes a word map synthesizing module 500; and the synthesizing module
  • the mobile terminal television function module 300 is also respectively associated with the subtitle processing module 200, the user operation parsing module 400, and the display module. 600 connected, where:
  • the font module 100 is configured to store a font to be displayed during the scaling process for the font map synthesis module 500 to call. For example, when the font module 100 uses the dot matrix font, it needs to store the bitmap image information of each character, wherein the size of the font is indicated by the number of the dot matrix, so that multiple sets of dot matrix fonts of different sizes need to be placed; When the module 100 uses the vector font, it needs to store the vector information of the character and the vector font engine API, wherein the vector outline of the character is outputted as a dot matrix after passing through the vector engine, and finally outputted on the screen through the dot matrix, the font module 100 When a plurality of fonts are supported, such as a black body, a Song, a Wei, a baby, etc., a plurality of outline description files are included.
  • a plurality of fonts are supported, such as a black body, a Song, a Wei, a baby, etc., a plurality of outline description files are included.
  • the subtitle processing module 200 is configured to process the subtitles of the mobile terminal television program and send them to the word map synthesis module 500.
  • the subtitle processing module 200 further includes a font encoding conversion unit 202, and an image recognizing unit 201 and a subtitle receiving unit 203 connected to the font encoding conversion unit.
  • the subtitle processing module 200 is connected to the mobile terminal television function module 300 through the image recognizing unit 201.
  • the over font encoding conversion unit 202 is connected to the word map synthesis module 500, wherein
  • the image recognition unit 201 is configured to read an image frame of the mobile terminal television program in the mobile terminal television function module 300, identify the subtitle from the subtitle, convert the recognized subtitle code into UTF8 code and send it to the font encoding unit 202, and perform image recognition.
  • Unit 201 can be implemented by existing mature technologies, such as neural network based pattern recognition, genetic algorithms, wavelet analysis, or traditional database-based feature alignment, etc., to effectively identify specific information, such as characters, from images. ;
  • the image recognition unit 201 implements image recognition by using a neural network algorithm, and the implementation steps are: (1) normalizing the input image frame data; (2) inputting the normalized input layer neurons to the neural network. (3) performing matrix operations using a neural network weight matrix; (4) outputting the output layer neurons as a set of character codes;
  • the font encoding conversion unit 202 converts the subtitle encoded by the image recognizing unit 201 or the subtitle receiving unit 203 into an encoding used by the font module 100, and transmits the converted character encoding to the font synthesizing module 500, for example, a subtitle receiving unit.
  • the character encoding sent by 203 is UTF8 format
  • the character encoding recognized by image recognition unit 201 is also UTF8 format, and when font library module 100 stores font information in GBK encoding order, font encoding conversion unit 202 needs to display characters in UTF8 format.
  • the encoding is converted to the GBK encoding format.
  • the font encoding conversion unit 202 also supports conversion between other encodings, such as UTF8 to UNICODE. Of course, in some embodiments, the unit may be omitted and recognized by the image.
  • the unit 201 and the subtitle receiving unit 203 convert the character encoding;
  • the subtitle receiving unit 203 is configured to receive and parse the subtitle data packet sent by the network side, and send the subtitle encoding to the font encoding conversion unit 202, where the subtitle data sent by the network side includes the following field, the encoding format of the character (UNICODE, UTF8 or GBK), a set of character encoding, character encoding terminal special display effect (italic, bold, etc.), the start and end image frame number of the group of codes, in other embodiments, the caption receiving unit 203 may also receive according to whether a subtitle data packet, to determine whether the current mobile terminal television program supports the terminal to synthesize the subtitle, that is, when the subtitle receiving unit 203 receives the subtitle data packet, it is considered that the current mobile terminal television program supports the terminal to synthesize the subtitle, when the subtitle receiving unit 203 does not When receiving the subtitle data packet, it is considered that the current mobile terminal television program supports the source end to synthesize the subtitle, and notifies the image recognition unit in the subtitle processing module to read the image frame of the mobile terminal television
  • the mobile terminal television function module 300 is configured to receive and decode the mobile terminal television program sent by the network side, and send the decoded mobile terminal television program to the subtitle processing module 200 or the display module according to the user operation sent by the user operation parsing module 400. 600, wherein, when the user operates to enlarge or reduce the font, the mobile terminal television function module 300 transmits the decoded mobile terminal television program to the subtitle processing module 200, and when the user operates for other operations, the mobile terminal television function module 300 The decoded mobile terminal television program is directly sent to the display module 600;
  • the mobile terminal television function module 300 may further send a message to the network side, and query whether the current mobile terminal television program on the network side separately transmits the subtitle to determine whether the current mobile terminal television program supports the terminal synthesized subtitle or the source side synthesized subtitle.
  • the current mobile terminal television program support terminal synthesizes the subtitle, otherwise it indicates that the current mobile terminal television program supports the source end synthesis subtitle.
  • the mobile terminal television function module 300 further transmits the decoded mobile terminal television program image frame to the subtitle processing module 200 according to the call of the user operation parsing module 400 (the user performs subtitle enlargement or reduction).
  • the mobile terminal television function module will decode the data regardless of any operation of the user.
  • the mobile terminal television program image frame is sent to the word map synthesis module 500;
  • the mobile terminal television function module 300 may also not send a message to the network side, but the subtitle processing module 200 determines whether the current mobile terminal television program separately transmits the subtitle, that is, the subtitle processing module determines whether the network side sends the message. Subtitle packet.
  • the user operation parsing module 400 is configured to parse the keyboard of the underlying driver, the GUI/GDI report, or touch the message, thereby obtaining whether the current user operation is to enlarge the font, reduce the font, or other operations, when the operation selected by the user is to enlarge or reduce the font.
  • the user operation is simultaneously sent to the word map synthesizing module 500 and the mobile terminal television function module 300.
  • the mobile terminal television function module 300 is directly decoded.
  • the image frame of the obtained mobile terminal television program is sent to the display module 600;
  • the user-selected operation may be converted into a character size value and stored in a global variable.
  • the subtitle is displayed according to the saved character size.
  • the parsing module may also not save the character size value.
  • the subtitle is displayed according to the default character size.
  • the word map synthesis module 500 is configured to extract a subtitle of a corresponding size from the font module 100 according to an instruction of the user operation parsing module 400, and superimpose the subtitle on the image frame of the mobile terminal television program, and send the subtitle to the display module 600, where
  • the word map synthesis module 500 calls the GUI to provide a correlation function, and directly outputs the font to the display buffer;
  • the word map synthesis module 500 directly outputs the font to a specific location of the HDC-SCREEN;
  • the display module 600 is configured to display an image sent by the word map synthesis module 500 or a mobile terminal television program image frame sent by the mobile terminal television function module 300.
  • the method for implementing the dynamic zooming of the subtitle of the mobile terminal by the mobile terminal includes the following steps: Step 201: The user opens the mobile terminal to start the mobile application service of the mobile terminal;
  • Step 202 The mobile terminal television function is initialized to receive the mobile terminal television program.
  • Step 203 The mobile terminal determines the mobile terminal television program that is received by the mobile terminal, that is, whether the current mobile terminal television program supports the terminal to synthesize the subtitle. If yes, the step is performed. 211, otherwise, performing step 204;
  • the mobile terminal may send a message to the network side to query whether the network separately sends the subtitle of the current mobile terminal television program. If the network side returns the confirmation message, it considers that the current mobile terminal television program supports the terminal to synthesize the subtitle; When the side returns "no support" message or does not return a message, it is determined that the current mobile terminal television program does not support the terminal synthesized subtitle.
  • the mobile terminal may also determine whether the current mobile terminal television program supports the terminal to synthesize the subtitle based on whether a separate subtitle file has been received. That is to say, when the mobile terminal has received a separate subtitle file, it is considered that the current mobile terminal television program supports the terminal to synthesize the subtitle, otherwise the current mobile terminal television program supports the source end to synthesize the subtitle.
  • Step 204 Parse the current operation of the user.
  • Step 205 Determine whether the current operation of the user changes the initial value of the subtitle size. If yes, execute step 206. Otherwise, the image synthesized by the source end, that is, the received mobile terminal television program is decoded. Show to the user, and return to step 204;
  • Step 206 The mobile terminal decodes the image frame of the current mobile terminal television program, and saves the size of the subtitle in the current mobile terminal television program image frame, that is, the size of the enlarged or reduced subtitle selected by the user;
  • Step 207 The mobile terminal analyzes the decoded image frame, identifies subtitles in the image frame, that is, each character displayed, and encodes the characters according to UTF8;
  • the mobile terminal can identify subtitles in the image frame through existing mature technologies, such as neural network based pattern recognition, genetic algorithm, wavelet analysis, or traditional database-based feature comparison, etc., which can effectively Identify specific information, such as characters, from the image;
  • existing mature technologies such as neural network based pattern recognition, genetic algorithm, wavelet analysis, or traditional database-based feature comparison, etc., which can effectively Identify specific information, such as characters, from the image;
  • Step 208 The mobile terminal converts the character code into an encoding used by the font module.
  • the mobile terminal uses a plurality of font modules, for example, a dot matrix font, a vector font, or the like.
  • Step 209 The mobile terminal extracts the corresponding font from the font module by using the converted character encoding as an index according to the font size set by the user operation, where the font mainly refers to the character size.
  • the font also includes fonts, thicknesses, and other font display effects;
  • Step 210 the mobile terminal superimposes the extracted font on the decoded image frame, and displays it to the user, and returns to step 204;
  • Step 211 parsing a current operation of the user
  • Step 212 determining whether the current operation of the user changes the initial value of the subtitle size, and if yes, executing step 213, otherwise returning to step 211;
  • Step 213 After receiving the subtitle data packet from the network side, the mobile terminal converts the character encoding into an encoding used by the font module.
  • the subtitle data contains: the encoding format of the character, such as UNICODE, UTF8 or
  • Step 214 The mobile terminal extracts the corresponding font from the font module according to the font size set by the user operation, and uses the converted character encoding as an index to extract the corresponding font from the font module.
  • Step 215 The mobile terminal superimposes the extracted font onto the decoded font. On the image frame, it is displayed to the user, and returns to step 211.
  • the technical solution of the present invention mainly uses the dynamic zooming technology of the subtitles according to user requirements on the terminal side, thereby optimizing the visibility of the mobile terminal television, and solving the problem that the mobile terminal television subtitles are too small to be scaled. The problem.
  • the present invention utilizes a technique of dynamically scaling subtitles according to user requirements on the terminal side, optimizes the visibility of the mobile terminal television, and solves the problem that the mobile terminal television subtitles are too small to be scaled, and thus has an industrial Practicality.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • General Engineering & Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

A mobile terminal for implementing television caption dynamic scaling is provided, the mobile terminal includes: a caption-image combining module, and the modules which connect to said caption-image combining module: a mobile terminal television function module, a caption processing module and a user operation analyzing module. The user operation analyzing module is used for analyzing the current caption size changing instruction launched by a user and sending this instruction to the caption-image combining module and the mobile terminal television function module respectively; the mobile terminal television function module is used for receiving mobile terminal television programs, decoding and gaining the mobile terminal television program image frames, once the mobile terminal television function module receives the instruction, then sends the mobile terminal television program image frames to the caption-image combining module; the caption processing module is used for sending the captions in the mobile terminal television programs to the caption-image combining module; the caption-image combining module is used for changing the size of the captions in the mobile terminal television programs according to the instruction and superposing the changed captions on the mobile terminal television program image frames.

Description

一种实现电视字幕动态缩放的移动终端及方法  Mobile terminal and method for realizing dynamic subtitle of television subtitle
技术领域 Technical field
本发明涉及基于移动终端的电视领域, 特别是指一种实现电视字幕动态 缩放的移动终端及方法。  The present invention relates to the field of television based on mobile terminals, and more particularly to a mobile terminal and method for realizing dynamic zooming of television subtitles.
背景技术 Background technique
随着第三代移动通讯技术的不断普及, 基于移动终端的多媒体应用越来 越成为发展的趋势, 尤其移动终端电视的节目信息越来越丰富和多样化。 其 中, 大多数的移动终端电视节目都需要显示字幕。  With the continuous popularization of the third generation of mobile communication technologies, multimedia applications based on mobile terminals are becoming more and more popular, especially the program information of mobile terminal televisions is becoming more and more abundant and diversified. Among them, most mobile TV programs need to display subtitles.
目前, 在移动终端电视节目上动态显示字幕的方式可分为两种, 一种方 式是, 字幕与视频图像在终端侧合成, 简称终端合成字幕, 此时, 需要网络 侧和终端侧共同配合完成, 网络侧将所要显示的字幕与移动终端电视节目视 频图像分别发送到终端侧, 由终端侧合成, 其中, 网络侧将字幕的 UTF8编 码按照移动终端电视视频图像播放进度实时传送给终端侧;  At present, the manner of dynamically displaying subtitles on a mobile terminal television program can be divided into two types. One method is that subtitles and video images are synthesized on the terminal side, which is referred to as a terminal synthesis subtitle. In this case, the network side and the terminal side need to cooperate together. The network side separately transmits the subtitle to be displayed and the video image of the mobile terminal television program to the terminal side, and is synthesized by the terminal side, wherein the network side transmits the UTF8 encoding of the subtitle to the terminal side in real time according to the progress of the television video image playing of the mobile terminal;
另一种方式是, 所要显示的字幕与移动终端电视视频图像事先已经在网 络侧合成完毕, 简称为源端合成字幕, 合成后的视频图像作为完整的图像数 据发送到终端侧进行显示。  Alternatively, the subtitle to be displayed and the video image of the mobile terminal have been synthesized in advance on the network side, which is simply referred to as source synthesized subtitle, and the synthesized video image is transmitted as complete image data to the terminal side for display.
但是, 受到移动终端屏幕大小的限制, 移动终端电视节目的字幕都比较 小, 尤其在外语节目、 方言节目、 文艺节目、 教学节目中, 较小的字幕造成 移动终端电视节目的可视性极差。  However, due to the limitation of the screen size of the mobile terminal, the subtitles of the mobile terminal television programs are relatively small, especially in foreign language programs, dialect programs, cultural programs, teaching programs, and the small subtitles cause the visibility of the mobile terminal television programs to be extremely poor. .
因此,急需提出一种实现移动终端电视字幕动态缩放的移动终端及方法。  Therefore, there is an urgent need to propose a mobile terminal and method for realizing dynamic zooming of mobile terminal television subtitles.
发明内容 Summary of the invention
本发明所要解决的技术问题是, 提供一种电视字幕动态缩放的移动终端 及方法, 从而将移动终端电视上显示的字幕进行动态缩放, 以达到优秀的可 视性。 为了解决上述问题, 本发明公开了一种实现电视字幕动态缩放的移动终 端, 其中, 该移动终端包括: 字图合成模块, 以及均与所述字图合成模块相 连的移动终端电视功能模块、 字幕处理模块和用户操作解析模块, 其中, 所述用户操作解析模块, 用于解析用户当前发起的更改字幕大小指令, 并将该用户指令分别发送到所述字图合成模块和所述移动终端电视功能模 块; The technical problem to be solved by the present invention is to provide a mobile terminal and method for dynamically zooming TV subtitles, thereby dynamically scaling subtitles displayed on the mobile terminal television to achieve excellent visibility. In order to solve the above problem, the present invention discloses a mobile terminal for realizing dynamic zooming of a television subtitle, wherein the mobile terminal comprises: a word map synthesis module, and a mobile terminal television function module and a subtitle connected to the word map synthesis module. a processing module and a user operation parsing module, wherein the user operation parsing module is configured to parse a subtitle size command currently initiated by the user, and send the user instruction to the word graph synthesizing module and the mobile terminal television function respectively Module
所述移动终端电视功能模块, 用于接收移动终端电视节目, 解码得到移 动终端电视节目图像帧, 并在收到所述用户指令时, 将所述移动终端电视节 目图像帧发送到所述字图合成模块;  The mobile terminal television function module is configured to receive a mobile terminal television program, decode and obtain a mobile terminal television program image frame, and send the mobile terminal television program image frame to the word map when receiving the user instruction Synthesis module
所述字幕处理模块, 用于将移动终端电视节目中的字幕发送到所述字图 合成模块;  The subtitle processing module is configured to send a subtitle in a mobile terminal television program to the word map synthesis module;
所述字图合成模块, 用于按照所述用户指令更改所述移动终端电视节目 中的字幕大小, 并将更改后的字幕叠加到所述移动终端电视节目图像帧上。  The word map synthesizing module is configured to change a subtitle size in the mobile terminal television program according to the user instruction, and superimpose the changed subtitle on the mobile terminal television program image frame.
进一步地, 所述移动终端电视功能模块与所述字幕处理模块相连, 其中, 所述移动终端电视功能模块, 还用于发送消息询问网络侧是否单独发送 所述移动终端电视节目的字幕, 如果是, 则所述移动终端电视功能模块通知 所述字幕处理模块接收字幕;  Further, the mobile terminal television function module is connected to the subtitle processing module, wherein the mobile terminal television function module is further configured to send a message to query whether the network side separately transmits the subtitle of the mobile terminal television program, if And the mobile terminal television function module notifies the subtitle processing module to receive the subtitle;
所述字幕处理模块, 还用于在收到移动终端电视功能模块的通知时, 单 独接收所述移动终端电视节目的字幕, 并将所述字幕发送到所述字图合成模 块。  The subtitle processing module is further configured to separately receive a subtitle of the mobile terminal television program upon receiving the notification of the mobile terminal television function module, and send the subtitle to the word map synthesis module.
较佳地, 所述字幕处理模块, 还用于单独接收所述移动终端电视节目的 字幕, 并将所述字幕发送到所述字图合成模块。  Preferably, the subtitle processing module is further configured to separately receive subtitles of the mobile terminal television program, and send the subtitles to the word map synthesis module.
进一步地, 所述移动终端电视功能模块向网络侧发送所述消息后, 若所 述网络侧确认不会单独发送所述移动终端电视节目的字幕时, 所述移动终端 电视功能模块还用于将所述移动终端电视节目图像帧发送到所述字幕处理模 块;  Further, after the mobile terminal television function module sends the message to the network side, if the network side confirms that the subtitle of the mobile terminal television program is not separately transmitted, the mobile terminal television function module is further used to Transmitting, by the mobile terminal, a television program image frame to the subtitle processing module;
所述字幕处理模块,用于从所述移动终端电视节目图像帧中识别出字幕, 并将所述字幕发送到所述字图合成模块。 进一步地, 所述字幕处理模块, 釆用神经网络模式识别、 遗传算法、 小 波分析、 或基于数据库的特征比对, 从所述移动终端电视节目图像帧中识别 出所述字幕。 The subtitle processing module is configured to identify a subtitle from a television program image frame of the mobile terminal, and send the subtitle to the word map synthesis module. Further, the subtitle processing module identifies the subtitle from the mobile terminal television program image frame by using neural network pattern recognition, genetic algorithm, wavelet analysis, or database-based feature comparison.
较佳地, 该移动终端还包括:  Preferably, the mobile terminal further includes:
字库模块, 用于存储所要显示的字型, 以供字图合成模块更改所述移动 终端电视节目中的字幕大小时调用; 及  a font module, configured to store a font to be displayed, when the font synthesis module changes the size of the subtitle in the mobile terminal television program; and
显示模块, 用于向用户显示字图合成模块发来的图像帧或者移动终端电 视功能模块发送给所述子图合成模块的移动终端电视节目图像帧。  And a display module, configured to display, to the user, an image frame sent by the word map synthesis module or a mobile terminal television program image frame sent by the mobile terminal television function module to the sub-picture synthesis module.
本发明还公开了一种实现移动终端电视字幕动态缩放的方法, 其中, 移动终端启动移动终端电视应用业务后,若用户发起更改字幕大小指令, 则所述移动终端按照所述用户的指令, 更改自身所接收到的移动终端电视节 目的字幕大小, 并将更改后的字幕叠加到自身所接收到的移动终端电视节目 图像帧上。  The invention also discloses a method for realizing dynamic zooming of a mobile terminal television subtitle, wherein after the mobile terminal starts the mobile terminal television application service, if the user initiates a change of the subtitle size instruction, the mobile terminal changes according to the user's instruction. The subtitle size of the mobile terminal television program received by itself, and the changed subtitle is superimposed on the image frame of the mobile terminal television program received by itself.
进一步地, 用户发起更改字幕大小指令之后, 该方法进一步包括: 所述移动终端发送消息询问网络侧是否单独发送所述移动终端电视节目 的字幕, 如果是, 则所述移动终端单独接收所述移动终端电视节目的字幕, 并按照所述用户的指令更改所接收的字幕大小。  Further, after the user initiates the change of the subtitle size instruction, the method further includes: the mobile terminal sending a message asking whether the network side separately transmits the subtitle of the mobile terminal television program, and if yes, the mobile terminal separately receives the mobile Subtitles of the terminal television program, and changing the size of the received subtitles according to the instructions of the user.
进一步地, 用户发起更改字幕大小指令之后, 该方法进一步包括: 所述移动终端判断是否单独接收到了所述移动终端电视节目的字幕, 如 果是, 则所述移动终端按照所述用户的指令, 更改所述单独接收的字幕大小。  Further, after the user initiates the change of the caption size instruction, the method further includes: the mobile terminal determining whether the caption of the mobile terminal television program is separately received, and if yes, the mobile terminal changes according to the user instruction The size of the subtitle received separately.
较佳地, 所述移动终端发送所述消息后, 若所述网络侧确认不会单独发 送所述移动终端电视节目的字幕时, 所述移动终端则从自身所接收到的移动 终端电视节目的图像帧中识别出字幕后, 再按照用户的指令更改所述字幕大 小。  Preferably, after the mobile terminal sends the message, if the network side confirms that the subtitle of the mobile terminal television program is not separately transmitted, the mobile terminal receives the mobile terminal television program from itself. After the subtitle is recognized in the image frame, the subtitle size is changed according to the user's instruction.
较佳地, 所述移动终端釆用神经网络模式识别、 遗传算法、 小波分析、 或基于数据库的特征比对,从所述移动终端电视节目的图像帧中识别出字幕。  Preferably, the mobile terminal uses a neural network pattern recognition, a genetic algorithm, a wavelet analysis, or a database-based feature comparison to identify a subtitle from an image frame of the mobile terminal television program.
进一步地, 该移动终端更改自身所接收到的移动终端电视节目的字幕大 小时, 还调用所要显示的字型; 且 将更改后的字幕叠加到自身所接收到的移动终端电视节目图像帧上后, 将该图像帧显示给用户。 Further, when the mobile terminal changes the subtitle size of the mobile terminal television program received by itself, the mobile terminal also invokes the font to be displayed; After the changed subtitle is superimposed on the image frame of the mobile terminal television program received by itself, the image frame is displayed to the user.
本发明技术方案, 优化了移动终端电视的可视性, 解决了移动终端电视 字幕过小, 无法缩放的问题。 The technical solution of the invention optimizes the visibility of the mobile terminal television, and solves the problem that the mobile terminal television subtitle is too small to be scaled.
附图概述 BRIEF abstract
图 1是本实施例中所提出的移动终端结构示意图;  1 is a schematic structural diagram of a mobile terminal proposed in this embodiment;
图 2是图 1所示移动终端实现电视字幕动态缩放的方法流程示意图。  2 is a schematic flow chart of a method for implementing dynamic zooming of a television subtitle by the mobile terminal shown in FIG. 1.
本发明的较佳实施方式 Preferred embodiment of the invention
下面结合附图及实施例对本发明的具体实施作进一步的详细描述: 一种实现电视字幕动态缩放的移动终端, 如图 1所示, 包括字图合成模 块 500; 以及均与该字图合成模块相连的字库模块 100、 字幕处理模块 200、 移动终端电视功能模块 300、 用户操作解析模块 400及显示模块 600; 移动终 端电视功能模块 300还分别与字幕处理模块 200、 用户操作解析模块 400及 显示模块 600相连, 其中:  The specific implementation of the present invention is further described in detail below with reference to the accompanying drawings and embodiments. A mobile terminal for realizing dynamic zooming of television subtitles, as shown in FIG. 1, includes a word map synthesizing module 500; and the synthesizing module The connected font module 100, the subtitle processing module 200, the mobile terminal television function module 300, the user operation parsing module 400, and the display module 600; the mobile terminal television function module 300 is also respectively associated with the subtitle processing module 200, the user operation parsing module 400, and the display module. 600 connected, where:
字库模块 100 , 用于存储缩放过程中所要显示的字型, 以供字图合成模 块 500调用。 例如, 字库模块 100釆用点阵字库时, 需要存放每个字符的点 阵图像信息, 其中, 字型的大小通过点阵的多少标明, 这样就需要放置多套 大小不同的点阵字库; 字库模块 100釆用矢量字库时, 需要存放字符的矢量 信息及矢量字库引擎 API, 其中, 字符的矢量轮廓通过矢量引擎后, 输出为 字符的点阵, 最终通过点阵输出在屏幕上, 字库模块 100支持多种字型, 如 黑体、 宋体、 魏碑、 幼圓等时, 需包括多种轮廓描述文件; 字幕处理模块 200 , 用于将移动终端电视节目的字幕经过处理后发送到 字图合成模块 500; 字幕处理模块 200进一步包括字体编码转换单元 202, 以 及与字体编码转换单元相连的图像识别单元 201和字幕接收单元 203 , 字幕 处理模块 200通过图像识别单元 201与移动终端电视功能模块 300相连, 通 过字体编码转换单元 202与字图合成模块 500相连, 其中, The font module 100 is configured to store a font to be displayed during the scaling process for the font map synthesis module 500 to call. For example, when the font module 100 uses the dot matrix font, it needs to store the bitmap image information of each character, wherein the size of the font is indicated by the number of the dot matrix, so that multiple sets of dot matrix fonts of different sizes need to be placed; When the module 100 uses the vector font, it needs to store the vector information of the character and the vector font engine API, wherein the vector outline of the character is outputted as a dot matrix after passing through the vector engine, and finally outputted on the screen through the dot matrix, the font module 100 When a plurality of fonts are supported, such as a black body, a Song, a Wei, a baby, etc., a plurality of outline description files are included. The subtitle processing module 200 is configured to process the subtitles of the mobile terminal television program and send them to the word map synthesis module 500. The subtitle processing module 200 further includes a font encoding conversion unit 202, and an image recognizing unit 201 and a subtitle receiving unit 203 connected to the font encoding conversion unit. The subtitle processing module 200 is connected to the mobile terminal television function module 300 through the image recognizing unit 201. The over font encoding conversion unit 202 is connected to the word map synthesis module 500, wherein
图像识别单元 201 , 用于读取移动终端电视功能模块 300 中的移动终端 电视节目的图像帧, 从中识别出字幕, 将识别出的字幕编码转换为 UTF8编 码发送到字体编码单元 202中, 图像识别单元 201可通过现有的成熟技术实 现, 如基于神经网络的模式识别、 遗传算法、 小波分析、 或传统的基于数据 库的特征比对等, 都能有效地从图像中识别出特定信息, 如字符;  The image recognition unit 201 is configured to read an image frame of the mobile terminal television program in the mobile terminal television function module 300, identify the subtitle from the subtitle, convert the recognized subtitle code into UTF8 code and send it to the font encoding unit 202, and perform image recognition. Unit 201 can be implemented by existing mature technologies, such as neural network based pattern recognition, genetic algorithms, wavelet analysis, or traditional database-based feature alignment, etc., to effectively identify specific information, such as characters, from images. ;
本实施例中, 图像识别单元 201釆用神经网络算法实现图像识别, 其实 现步骤为 (1 ) 归一化输入的图像帧数据; ( 2 )将归一化输入到神经网络的 输入层神经元; ( 3 )使用神经网络权阵进行矩阵运算; ( 4 )输出层神经元 的输出结果为一组字符编码;  In this embodiment, the image recognition unit 201 implements image recognition by using a neural network algorithm, and the implementation steps are: (1) normalizing the input image frame data; (2) inputting the normalized input layer neurons to the neural network. (3) performing matrix operations using a neural network weight matrix; (4) outputting the output layer neurons as a set of character codes;
字体编码转换单元 202, 将图像识别单元 201或者字幕接收单元 203发 送的字幕编码转换为字库模块 100所使用的编码, 并将转换后的字符编码发 送到字图合成模块 500, 例如, 字幕接收单元 203发送的字符编码为 UTF8 格式, 图像识别单元 201识别出来的字符编码也为 UTF8格式, 而字库模块 100釆用 GBK编码顺序存放字型信息时,则字体编码转换单元 202要将 UTF8 格式的字符编码转换为 GBK编码格式,在其它实施例中, 字体编码转换单元 202也支持其它的编码之间的转换, 如 UTF8转为 UNICODE, 当然在有些实 施例中, 也可以省略该单元, 由图像识别单元 201和字幕接收单元 203转换 字符编码;  The font encoding conversion unit 202 converts the subtitle encoded by the image recognizing unit 201 or the subtitle receiving unit 203 into an encoding used by the font module 100, and transmits the converted character encoding to the font synthesizing module 500, for example, a subtitle receiving unit. The character encoding sent by 203 is UTF8 format, and the character encoding recognized by image recognition unit 201 is also UTF8 format, and when font library module 100 stores font information in GBK encoding order, font encoding conversion unit 202 needs to display characters in UTF8 format. The encoding is converted to the GBK encoding format. In other embodiments, the font encoding conversion unit 202 also supports conversion between other encodings, such as UTF8 to UNICODE. Of course, in some embodiments, the unit may be omitted and recognized by the image. The unit 201 and the subtitle receiving unit 203 convert the character encoding;
字幕接收单元 203 , 用于接收并解析网络侧发送的字幕数据包, 并将字 幕编码发送到字体编码转换单元 202, 其中, 网络侧发送的字幕数据包含有 以下字段, 字符的编码格式(UNICODE, UTF8或 GBK ) 、 一组字符编码、 字符编码的终端特殊显示效果(斜体、 加粗等)、 该组编码的起止图像帧号, 在其它实施例中, 字幕接收单元 203也可以根据是否接收到字幕数据包, 来 判断当前移动终端电视节目是否支持终端合成字幕, 也就是说, 当字幕接收 单元 203收到字幕数据包时,认为当前移动终端电视节目支持终端合成字幕, 当字幕接收单元 203未收到字幕数据包时, 认为当前移动终端电视节目支持 源端合成字幕, 并通知字幕处理模块中的图像识别单元从移动终端电视功能 模块 300读取移动终端电视节目的图像帧, 并从中识别出字幕。 移动终端电视功能模块 300 , 用于接收和解码网络侧发送的移动终端电 视节目, 并根据用户操作解析模块 400发送的用户操作, 将解码后的移动终 端电视节目发送到字幕处理模块 200或显示模块 600, 其中, 当用户操作为 放大或缩小字体时, 移动终端电视功能模块 300将解码后的移动终端电视节 目发送到字幕处理模块 200 , 当用户操作为其它操作时, 移动终端电视功能 模块 300将解码后的移动终端电视节目直接发送到显示模块 600; The subtitle receiving unit 203 is configured to receive and parse the subtitle data packet sent by the network side, and send the subtitle encoding to the font encoding conversion unit 202, where the subtitle data sent by the network side includes the following field, the encoding format of the character (UNICODE, UTF8 or GBK), a set of character encoding, character encoding terminal special display effect (italic, bold, etc.), the start and end image frame number of the group of codes, in other embodiments, the caption receiving unit 203 may also receive according to whether a subtitle data packet, to determine whether the current mobile terminal television program supports the terminal to synthesize the subtitle, that is, when the subtitle receiving unit 203 receives the subtitle data packet, it is considered that the current mobile terminal television program supports the terminal to synthesize the subtitle, when the subtitle receiving unit 203 does not When receiving the subtitle data packet, it is considered that the current mobile terminal television program supports the source end to synthesize the subtitle, and notifies the image recognition unit in the subtitle processing module to read the image frame of the mobile terminal television program from the mobile terminal television function module 300, and recognizes from it subtitle. The mobile terminal television function module 300 is configured to receive and decode the mobile terminal television program sent by the network side, and send the decoded mobile terminal television program to the subtitle processing module 200 or the display module according to the user operation sent by the user operation parsing module 400. 600, wherein, when the user operates to enlarge or reduce the font, the mobile terminal television function module 300 transmits the decoded mobile terminal television program to the subtitle processing module 200, and when the user operates for other operations, the mobile terminal television function module 300 The decoded mobile terminal television program is directly sent to the display module 600;
在本实施例中, 移动终端电视功能模块 300还可以向网络侧发送消息, 询问网络侧当前移动终端电视节目是否单独发送字幕, 以判断当前移动终端 电视节目是支持终端合成字幕还是源端合成字幕。 这里, 单独发送字幕时表 示当前移动终端电视节目支持终端合成字幕, 否则表示当前移动终端电视节 目支持源端合成字幕。 当前移动终端电视节目支持源端合成字幕时, 移动终 端电视功能模块 300还根据用户操作解析模块 400的调用将解码得到的移动 终端电视节目图像帧发送到字幕处理模块 200 (用户进行字幕放大或者缩小 操作)或者显示模块 600 (用户选择直接显示移动终端电视节目或者未进行 字幕缩放操作) ; 当前移动终端电视节目支持终端合成字幕时, 无论用户进 行任何操作, 移动终端电视功能模块均将解码得到的移动终端电视节目图像 帧发送到字图合成模块 500;  In this embodiment, the mobile terminal television function module 300 may further send a message to the network side, and query whether the current mobile terminal television program on the network side separately transmits the subtitle to determine whether the current mobile terminal television program supports the terminal synthesized subtitle or the source side synthesized subtitle. . Here, when the subtitle is sent separately, the current mobile terminal television program support terminal synthesizes the subtitle, otherwise it indicates that the current mobile terminal television program supports the source end synthesis subtitle. When the current mobile terminal television program supports the source side to synthesize the subtitle, the mobile terminal television function module 300 further transmits the decoded mobile terminal television program image frame to the subtitle processing module 200 according to the call of the user operation parsing module 400 (the user performs subtitle enlargement or reduction). Operation) or display module 600 (the user selects to directly display the mobile terminal television program or does not perform the subtitle zoom operation); when the current mobile terminal television program supports the terminal to synthesize the subtitle, the mobile terminal television function module will decode the data regardless of any operation of the user. The mobile terminal television program image frame is sent to the word map synthesis module 500;
在其它实施例中, 移动终端电视功能模块 300也可以不用向网络侧发送 消息, 而由字幕处理模块 200来判断当前移动终端电视节目是否单独发送字 幕, 即字幕处理模块判断是否收到网络侧发送的字幕数据包。  In other embodiments, the mobile terminal television function module 300 may also not send a message to the network side, but the subtitle processing module 200 determines whether the current mobile terminal television program separately transmits the subtitle, that is, the subtitle processing module determines whether the network side sends the message. Subtitle packet.
用户操作解析模块 400, 用于解析底层驱动、 GUI/GDI上报的键盘或者 触摸消息, 从而得出当前用户的操作是放大字体、 缩小字体还是其它操作, 当用户选择的操作为放大或者缩小字体时, 将用户操作同时发送到字图合成 模块 500和移动终端电视功能模块 300, 当前用户选择的操作为直接显示移 动终端电视节目或者未选择放大缩小字体时,调用移动终端电视功能模块 300 直接将解码后得到的移动终端电视节目的图像帧发送到显示模块 600;  The user operation parsing module 400 is configured to parse the keyboard of the underlying driver, the GUI/GDI report, or touch the message, thereby obtaining whether the current user operation is to enlarge the font, reduce the font, or other operations, when the operation selected by the user is to enlarge or reduce the font. The user operation is simultaneously sent to the word map synthesizing module 500 and the mobile terminal television function module 300. When the current user selects an operation to directly display the mobile terminal television program or does not select to enlarge or reduce the font, the mobile terminal television function module 300 is directly decoded. The image frame of the obtained mobile terminal television program is sent to the display module 600;
用户操作解析模块 400解析用户操作后, 还可以将用户选择的操作转换 为字符的大小值保存在全局变量中, 此时, 用户再次进入移动终端电视应用 业务时, 按照所保存的字符大小显示字幕, 当然在其它实施例中, 用户操作 解析模块也可以不保存字符大小值, 此时, 用户再次进入移动终端电视应用 业务时, 按照默认的字符大小显示字幕。 After the user operation parsing module 400 parses the user operation, the user-selected operation may be converted into a character size value and stored in a global variable. At this time, when the user enters the mobile terminal television application service again, the subtitle is displayed according to the saved character size. , of course, in other embodiments, user operations The parsing module may also not save the character size value. At this time, when the user enters the mobile terminal television application service again, the subtitle is displayed according to the default character size.
字图合成模块 500, 用于根据用户操作解析模块 400的指令, 从字库模 块 100中提取相应大小的字幕, 并将该字幕叠加到移动终端电视节目图像帧 上, 发送到显示模块 600 , 其中, 当前移动终端电视节目支持终端合成字幕 时, 字图合成模块 500调用 GUI提供相关函数, 直接把字型输出到显示緩冲 区中; 当前移动终端电视节目支持源端合成字幕时, 字图合成模块 500直接 将字型输出到 HDC— SCREEN的特定位置上;  The word map synthesis module 500 is configured to extract a subtitle of a corresponding size from the font module 100 according to an instruction of the user operation parsing module 400, and superimpose the subtitle on the image frame of the mobile terminal television program, and send the subtitle to the display module 600, where When the current mobile terminal television program supports the terminal to synthesize the subtitle, the word map synthesis module 500 calls the GUI to provide a correlation function, and directly outputs the font to the display buffer; when the current mobile terminal television program supports the source side to synthesize the subtitle, the word map synthesis module 500 directly outputs the font to a specific location of the HDC-SCREEN;
显示模块 600, 用于显示字图合成模块 500发送的图像或者移动终端电 视功能模块 300发送的移动终端电视节目图像帧。  The display module 600 is configured to display an image sent by the word map synthesis module 500 or a mobile terminal television program image frame sent by the mobile terminal television function module 300.
上述移动终端实现移动终端电视的字幕动态缩放的方法,包括以下步骤: 步骤 201 , 用户打开移动终端, 启动移动终端电视应用业务; The method for implementing the dynamic zooming of the subtitle of the mobile terminal by the mobile terminal includes the following steps: Step 201: The user opens the mobile terminal to start the mobile application service of the mobile terminal;
步骤 202, 移动终端电视功能进行初始化, 接收移动终端电视节目; 步骤 203 , 移动终端判断自身所接收到的移动终端电视节目, 即当前移 动终端电视节目是否支持终端合成字幕, 如果是, 则执行步骤 211 , 否则, 执行步骤 204;  Step 202: The mobile terminal television function is initialized to receive the mobile terminal television program. Step 203: The mobile terminal determines the mobile terminal television program that is received by the mobile terminal, that is, whether the current mobile terminal television program supports the terminal to synthesize the subtitle. If yes, the step is performed. 211, otherwise, performing step 204;
该步骤中, 移动终端可以向网络侧发送消息, 用于询问网络是否单独发 送当前移动终端电视节目的字幕, 如果网络侧返回确认消息时, 则认为当前 移动终端电视节目支持终端合成字幕; 如果网络侧返回 "不支持" 消息或未 返回消息时, 则判定当前移动终端电视节目不支持终端合成字幕。  In this step, the mobile terminal may send a message to the network side to query whether the network separately sends the subtitle of the current mobile terminal television program. If the network side returns the confirmation message, it considers that the current mobile terminal television program supports the terminal to synthesize the subtitle; When the side returns "no support" message or does not return a message, it is determined that the current mobile terminal television program does not support the terminal synthesized subtitle.
在其它实施例中, 移动终端也可以根据是否已接收到单独的字幕文件来 判断当前移动终端电视节目是否支持终端合成字幕。 也就是说, 当移动终端 已接收到单独的字幕文件,则认为当前移动终端电视节目支持终端合成字幕, 否则认为当前移动终端电视节目支持源端合成字幕。  In other embodiments, the mobile terminal may also determine whether the current mobile terminal television program supports the terminal to synthesize the subtitle based on whether a separate subtitle file has been received. That is to say, when the mobile terminal has received a separate subtitle file, it is considered that the current mobile terminal television program supports the terminal to synthesize the subtitle, otherwise the current mobile terminal television program supports the source end to synthesize the subtitle.
步骤 204, 解析用户当前操作;  Step 204: Parse the current operation of the user.
步骤 205: 判断用户当前操作是否更改字幕大小初始值, 如果是, 则执 行步骤 206 , 否则将源端合成的图像, 即接收的移动终端电视节目解码后显 示给用户, 并返回步骤 204; Step 205: Determine whether the current operation of the user changes the initial value of the subtitle size. If yes, execute step 206. Otherwise, the image synthesized by the source end, that is, the received mobile terminal television program is decoded. Show to the user, and return to step 204;
该步骤中, 若用户当前操作是退出移动终端电视业务, 则结束本流程。 步骤 206 , 移动终端将当前移动终端电视节目的图像帧进行解码, 并保 存当前移动终端电视节目图像帧中字幕大小, 即用户所选择的放大或缩小的 字幕的大小;  In this step, if the current operation of the user is to exit the mobile terminal television service, the process ends. Step 206: The mobile terminal decodes the image frame of the current mobile terminal television program, and saves the size of the subtitle in the current mobile terminal television program image frame, that is, the size of the enlarged or reduced subtitle selected by the user;
步骤 207 , 移动终端分析解码后的图像帧, 识别出图像帧中的字幕, 即 显示的各个字符, 并将这些字符按 UTF8进行编码;  Step 207: The mobile terminal analyzes the decoded image frame, identifies subtitles in the image frame, that is, each character displayed, and encodes the characters according to UTF8;
该步骤中, 移动终端可通过现有的成熟技术识别出图像帧中的字幕, 如 基于神经网络的模式识别、 遗传算法、 小波分析、 或传统的基于数据库的特 征比对等, 都能有效地从图像中识别出特定信息, 如字符;  In this step, the mobile terminal can identify subtitles in the image frame through existing mature technologies, such as neural network based pattern recognition, genetic algorithm, wavelet analysis, or traditional database-based feature comparison, etc., which can effectively Identify specific information, such as characters, from the image;
步骤 208, 移动终端将上述字符编码转换为字库模块所使用的编码; 该步骤中, 移动终端所釆用的字库模块有多种, 例如, 可以釆用点阵字 库, 或者矢量字库等。  Step 208: The mobile terminal converts the character code into an encoding used by the font module. In this step, the mobile terminal uses a plurality of font modules, for example, a dot matrix font, a vector font, or the like.
步骤 209 , 移动终端按照用户操作所设定的字体大小, 将转换后的字符 编码作为索引从字库模块中提取出对应的字型, 其中, 字型主要指字符大小, 在优选的实施例中, 字型还包括字体、 粗细等字体显示效果;  Step 209: The mobile terminal extracts the corresponding font from the font module by using the converted character encoding as an index according to the font size set by the user operation, where the font mainly refers to the character size. In a preferred embodiment, The font also includes fonts, thicknesses, and other font display effects;
步骤 210 , 移动终端将提取到的字型叠加到解码后的图像帧上, 显示给 用户, 返回步骤 204;  Step 210, the mobile terminal superimposes the extracted font on the decoded image frame, and displays it to the user, and returns to step 204;
步骤 211 , 解析用户当前操作;  Step 211, parsing a current operation of the user;
步骤 212 , 判断用户当前操作是否更改字幕大小初始值, 如果是, 则执 行步骤 213 , 否则返回步骤 211 ;  Step 212, determining whether the current operation of the user changes the initial value of the subtitle size, and if yes, executing step 213, otherwise returning to step 211;
该步骤中, 若用户当前操作是退出移动终端电视业务, 则结束本流程。 步骤 213 , 移动终端从网络侧接收字幕数据包后, 将字符编码转换为字 库模块所使用的编码;  In this step, if the current operation of the user is to exit the mobile terminal television service, the process ends. Step 213: After receiving the subtitle data packet from the network side, the mobile terminal converts the character encoding into an encoding used by the font module.
该步骤中, 字幕数据包含有: 字符的编码格式, 如 UNICODE, UTF8或 In this step, the subtitle data contains: the encoding format of the character, such as UNICODE, UTF8 or
GBK等; 一组字符编码; 字符编码的终端特殊显示效果, 如斜体、 加粗等; 及该组编码的起止图像帧号等。 步骤 214 , 移动终端按照用户操作所设定的字体大小, 将转换后的字符 编码作为索引从字库模块中提取出对应的字型; 步骤 215 , 移动终端将提取到的字型叠加到解码后的图像帧上, 显示给 用户, 返回步骤 211。 GBK, etc.; a set of character encoding; character encoding terminal special display effects, such as italic, bold, etc.; and the start and stop image frame number of the group of codes. Step 214: The mobile terminal extracts the corresponding font from the font module according to the font size set by the user operation, and uses the converted character encoding as an index to extract the corresponding font from the font module. Step 215: The mobile terminal superimposes the extracted font onto the decoded font. On the image frame, it is displayed to the user, and returns to step 211.
从上述实施例可以看出, 本发明技术方案主要釆用了在终端侧按照用户 需求对字幕动态缩放技术, 从而优化了移动终端电视的可视性, 解决了移动 终端电视字幕过小, 无法缩放的问题。 It can be seen from the above embodiments that the technical solution of the present invention mainly uses the dynamic zooming technology of the subtitles according to user requirements on the terminal side, thereby optimizing the visibility of the mobile terminal television, and solving the problem that the mobile terminal television subtitles are too small to be scaled. The problem.
当然, 本发明还可以有其它多种实施例, 在不背离本发明精神及其实质 但这些相应的改变和变形都应属于本发明所附的权利要求的保护范围。  It is a matter of course that the invention can be embodied in various other forms without departing from the spirit and scope of the invention.
工业实用性 本发明釆用了在终端侧按照用户需求对字幕动态缩放的技术, 优化了移 动终端电视的可视性, 解决了移动终端电视字幕过小, 无法缩放的问题, 因 此具有 虽的工业实用性。 Industrial Applicability The present invention utilizes a technique of dynamically scaling subtitles according to user requirements on the terminal side, optimizes the visibility of the mobile terminal television, and solves the problem that the mobile terminal television subtitles are too small to be scaled, and thus has an industrial Practicality.

Claims

权 利 要 求 书 Claim
1、 一种实现电视字幕动态缩放的移动终端, 其中, 该移动终端包括: 字 图合成模块, 以及均与所述字图合成模块相连的移动终端电视功能模块、 字 幕处理模块和用户操作解析模块, 其中, A mobile terminal for realizing dynamic zooming of a television subtitle, wherein the mobile terminal comprises: a word map synthesis module, and a mobile terminal television function module, a subtitle processing module, and a user operation analysis module both connected to the word map synthesis module , among them,
所述用户操作解析模块, 用于解析用户当前发起的更改字幕大小指令, 并将该用户指令分别发送到所述字图合成模块和所述移动终端电视功能模 块;  The user operation parsing module is configured to parse a change caption size instruction currently initiated by the user, and send the user instruction to the word graph synthesizing module and the mobile terminal television function module respectively;
所述移动终端电视功能模块, 用于接收移动终端电视节目, 解码得到移 动终端电视节目图像帧, 并在收到所述用户指令时, 将所述移动终端电视节 目图像帧发送到所述字图合成模块;  The mobile terminal television function module is configured to receive a mobile terminal television program, decode and obtain a mobile terminal television program image frame, and send the mobile terminal television program image frame to the word map when receiving the user instruction Synthesis module
所述字幕处理模块, 用于将移动终端电视节目中的字幕发送到所述字图 合成模块;  The subtitle processing module is configured to send a subtitle in a mobile terminal television program to the word map synthesis module;
所述字图合成模块, 用于按照所述用户指令更改所述移动终端电视节目 中的字幕大小, 并将更改后的字幕叠加到所述移动终端电视节目图像帧上。  The word map synthesizing module is configured to change a subtitle size in the mobile terminal television program according to the user instruction, and superimpose the changed subtitle on the mobile terminal television program image frame.
2、 如权利要求 1所述的移动终端, 其中, 所述移动终端电视功能模块与 所述字幕处理模块相连, 其中,  2. The mobile terminal according to claim 1, wherein the mobile terminal television function module is connected to the subtitle processing module, wherein
所述移动终端电视功能模块, 还用于发送消息询问网络侧是否单独发送 所述移动终端电视节目的字幕, 如果是, 则所述移动终端电视功能模块通知 所述字幕处理模块接收字幕;  The mobile terminal television function module is further configured to send a message to query whether the network side separately transmits the subtitle of the mobile terminal television program, and if yes, the mobile terminal television function module notifies the subtitle processing module to receive the subtitle;
所述字幕处理模块, 还用于在收到移动终端电视功能模块的通知时, 单 独接收所述移动终端电视节目的字幕, 并将所述字幕发送到所述字图合成模 块。  The subtitle processing module is further configured to separately receive a subtitle of the mobile terminal television program upon receiving the notification of the mobile terminal television function module, and send the subtitle to the word map synthesis module.
3、 如权利要求 1所述的移动终端, 其中,  3. The mobile terminal of claim 1, wherein
所述字幕处理模块, 还用于单独接收所述移动终端电视节目的字幕, 并 将所述字幕发送到所述字图合成模块。  The subtitle processing module is further configured to separately receive subtitles of the mobile terminal television program, and send the subtitles to the word map synthesis module.
4、 如权利要求 2所述的移动终端, 其中,  4. The mobile terminal of claim 2, wherein
所述移动终端电视功能模块向网络侧发送所述消息后, 若所述网络侧确 认不会单独发送所述移动终端电视节目的字幕时, 所述移动终端电视功能模 块还用于将所述移动终端电视节目图像帧发送到所述字幕处理模块; After the mobile terminal television function module sends the message to the network side, if the network side is When the subtitle of the mobile terminal television program is not separately transmitted, the mobile terminal television function module is further configured to send the mobile terminal television program image frame to the subtitle processing module;
所述字幕处理模块,用于从所述移动终端电视节目图像帧中识别出字幕, 并将所述字幕发送到所述字图合成模块。  The subtitle processing module is configured to identify a subtitle from a television program image frame of the mobile terminal, and send the subtitle to the word map synthesis module.
5、 如权利要求 4所述的移动终端, 其中,  5. The mobile terminal of claim 4, wherein
所述字幕处理模块, 釆用神经网络模式识别、 遗传算法、 小波分析、 或 基于数据库的特征比对,从所述移动终端电视节目图像帧中识别出所述字幕。  The caption processing module identifies the caption from the mobile terminal television program image frame by using neural network pattern recognition, genetic algorithm, wavelet analysis, or database-based feature comparison.
6、 如权利要求 1所述的移动终端, 其中, 该移动终端还包括:  The mobile terminal of claim 1, wherein the mobile terminal further comprises:
字库模块, 用于存储所要显示的字型, 以供字图合成模块更改所述移动 终端电视节目中的字幕大小时调用; 及  a font module, configured to store a font to be displayed, when the font synthesis module changes the size of the subtitle in the mobile terminal television program; and
显示模块, 用于向用户显示字图合成模块发来的图像帧或者移动终端电 视功能模块发送给所述子图合成模块的移动终端电视节目图像帧。  And a display module, configured to display, to the user, an image frame sent by the word map synthesis module or a mobile terminal television program image frame sent by the mobile terminal television function module to the sub-picture synthesis module.
7、 一种实现移动终端电视字幕动态缩放的方法, 其中,  7. A method for realizing dynamic zooming of a mobile terminal television subtitle, wherein
移动终端启动移动终端电视应用业务后,若用户发起更改字幕大小指令, 则所述移动终端按照所述用户的指令, 更改自身所接收到的移动终端电视节 目的字幕大小, 并将更改后的字幕叠加到自身所接收到的移动终端电视节目 图像帧上。  After the mobile terminal starts the mobile terminal television application service, if the user initiates the change of the subtitle size instruction, the mobile terminal changes the subtitle size of the mobile terminal television program received by the mobile terminal according to the instruction of the user, and changes the subtitle size. Superimposed on the image frame of the mobile terminal television program received by itself.
8、 如权利要求 7所述的方法, 其中, 用户发起更改字幕大小指令之后, 该方法进一步包括:  8. The method of claim 7, wherein after the user initiates the change of the caption size instruction, the method further comprises:
所述移动终端发送消息询问网络侧是否单独发送所述移动终端电视节目 的字幕, 如果是, 则所述移动终端单独接收所述移动终端电视节目的字幕, 并按照所述用户的指令更改所接收的字幕大小。  The mobile terminal sends a message to inquire whether the network side separately transmits the subtitle of the mobile terminal television program, and if so, the mobile terminal separately receives the subtitle of the mobile terminal television program, and changes the received according to the user's instruction. Subtitle size.
9、 如权利要求 7所述的方法, 其中, 用户发起更改字幕大小指令之后, 该方法进一步包括:  9. The method of claim 7, wherein after the user initiates the change of the caption size instruction, the method further comprises:
所述移动终端判断是否单独接收到了所述移动终端电视节目的字幕, 如 果是, 则所述移动终端按照所述用户的指令, 更改所述单独接收的字幕大小。  The mobile terminal determines whether the subtitle of the mobile terminal television program is separately received, and if so, the mobile terminal changes the separately received subtitle size according to an instruction of the user.
10、 如权利要求 8所述的方法, 其中, 所述移动终端发送所述消息后, 若所述网络侧确认不会单独发送所述移 动终端电视节目的字幕时, 所述移动终端则从自身所接收到的移动终端电视 节目的图像帧中识别出字幕后, 再按照用户的指令更改所述字幕大小。 10. The method of claim 8 wherein After the mobile terminal sends the message, if the network side confirms that the subtitle of the mobile terminal television program is not separately transmitted, the mobile terminal identifies from the image frame of the mobile terminal television program received by itself. After the subtitles are output, the subtitle size is changed according to the user's instruction.
11、 如权利要求 10所述的方法, 其中,  11. The method of claim 10, wherein
所述移动终端釆用神经网络模式识别、 遗传算法、 小波分析、 或基于数 据库的特征比对, 从所述移动终端电视节目的图像帧中识别出字幕。  The mobile terminal identifies a subtitle from an image frame of the mobile terminal television program using neural network pattern recognition, genetic algorithm, wavelet analysis, or database based feature alignment.
12、 如权利要求 7所述的方法, 其中, 该移动终端更改自身所接收到的 移动终端电视节目的字幕大小时, 还调用所要显示的字型; 且  The method of claim 7, wherein when the mobile terminal changes the subtitle size of the mobile terminal television program received by itself, the mobile terminal also invokes the font to be displayed;
将更改后的字幕叠加到自身所接收到的移动终端电视节目图像帧上后, 将该图像帧显示给用户。  After the changed subtitle is superimposed on the image frame of the mobile terminal television program received by itself, the image frame is displayed to the user.
PCT/CN2009/072563 2008-12-19 2009-06-30 Mobile terminal and method for implementing television caption dynamic scaling WO2010069171A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CNA2008101864693A CN101437121A (en) 2008-12-19 2008-12-19 Mobile terminal and method for implementing dynamic zoom of mobile phone television subtitling
CN200810186469.3 2008-12-19

Publications (1)

Publication Number Publication Date
WO2010069171A1 true WO2010069171A1 (en) 2010-06-24

Family

ID=40711316

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2009/072563 WO2010069171A1 (en) 2008-12-19 2009-06-30 Mobile terminal and method for implementing television caption dynamic scaling

Country Status (2)

Country Link
CN (1) CN101437121A (en)
WO (1) WO2010069171A1 (en)

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101437121A (en) * 2008-12-19 2009-05-20 中兴通讯股份有限公司 Mobile terminal and method for implementing dynamic zoom of mobile phone television subtitling
CN102088631B (en) * 2011-01-30 2013-04-24 深圳市同洲电子股份有限公司 Live and demand broadcast method of digital television (TV) programs as well as related device and system
CN102724232A (en) * 2011-05-06 2012-10-10 新奥特(北京)视频技术有限公司 UDP-based network subtitle generator method and network subtitle generator system
CN103312863A (en) 2012-03-08 2013-09-18 中兴通讯股份有限公司 present method and device of mobile terminal video
CN103679208A (en) * 2013-11-27 2014-03-26 北京中科模识科技有限公司 Broadcast and television caption recognition based automatic training data generation and deep learning method
CN105872845A (en) * 2016-04-18 2016-08-17 深圳Tcl数字技术有限公司 Method for intelligently regulating size of subtitles and television
CN110363832B (en) * 2019-07-24 2021-05-25 广州方硅信息技术有限公司 Subtitle generating method and device
CN112055261A (en) * 2020-07-14 2020-12-08 北京百度网讯科技有限公司 Subtitle display method and device, electronic equipment and storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1139338A (en) * 1995-06-28 1997-01-01 大宇电子株式会社 Apparatus for controling caption display on wide aspect ratio screen
US20010009445A1 (en) * 2000-01-24 2001-07-26 Chung Jung Oh Caption display method of digital television
CN1374803A (en) * 2001-03-02 2002-10-16 通用仪器公司 Method and apparatus for providing users with selective, improved closed captions
CN101071562A (en) * 2006-05-12 2007-11-14 上海乐金广电电子有限公司 Caption size regulating method for karaoke audio device
CN101179669A (en) * 2006-11-08 2008-05-14 中兴通讯股份有限公司 Session television terminal subtitling generating and stacking method
CN101437121A (en) * 2008-12-19 2009-05-20 中兴通讯股份有限公司 Mobile terminal and method for implementing dynamic zoom of mobile phone television subtitling

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1139338A (en) * 1995-06-28 1997-01-01 大宇电子株式会社 Apparatus for controling caption display on wide aspect ratio screen
US20010009445A1 (en) * 2000-01-24 2001-07-26 Chung Jung Oh Caption display method of digital television
CN1374803A (en) * 2001-03-02 2002-10-16 通用仪器公司 Method and apparatus for providing users with selective, improved closed captions
CN101071562A (en) * 2006-05-12 2007-11-14 上海乐金广电电子有限公司 Caption size regulating method for karaoke audio device
CN101179669A (en) * 2006-11-08 2008-05-14 中兴通讯股份有限公司 Session television terminal subtitling generating and stacking method
CN101437121A (en) * 2008-12-19 2009-05-20 中兴通讯股份有限公司 Mobile terminal and method for implementing dynamic zoom of mobile phone television subtitling

Also Published As

Publication number Publication date
CN101437121A (en) 2009-05-20

Similar Documents

Publication Publication Date Title
WO2010069171A1 (en) Mobile terminal and method for implementing television caption dynamic scaling
US20030023424A1 (en) Multimedia dictionary
CN1333385C (en) Voice browser dialog enabler for a communication system
EP2315201B1 (en) Transmitting and receiving apparatus and method, computer program, and broadcasting system with speech to sign language conversion
US20090157407A1 (en) Methods, Apparatuses, and Computer Program Products for Semantic Media Conversion From Source Files to Audio/Video Files
JP2006333460A (en) Device and method for providing additional information using expanded superimposition file
US20130212514A1 (en) Method and Device for Displaying Start-Up Interface of Multimedia Terminal
CN102074257A (en) Software and hardware-decoding general multi-media playing equipment and playing method thereof
US20070038781A1 (en) Apparatus and method for converting contents
CN101478661A (en) System and method for providing high quality subtitle adding in video stream
CN116229977A (en) System for realizing intelligent real-time interactive question and answer based on virtual digital person and processing method thereof
CN101513070B (en) Method and apparatus for displaying lightweight applying scene contents
WO2016136468A1 (en) Transmitting device, transmitting method, receiving device, receiving method, information processing device and information processing method
CN201018599Y (en) Video on-demand system based computer television integrated machine
US8269815B2 (en) Dynamic image distribution device and method thereof
WO2011000269A1 (en) Method and device for high definition display of subtitles in video conference system
WO2010111861A1 (en) Voice interactive method for mobile terminal based on vocie xml and apparatus thereof
WO2021097892A1 (en) Translation system, translation method, translation machine, and storage medium
CN101964850A (en) Method for pushing data in video customer service system, and video customer service system
CN111757187A (en) Multi-language subtitle display method, device, terminal equipment and storage medium
US20080297657A1 (en) Method and system for processing text in a video stream
WO2013174337A2 (en) Subtitle extraction method and apparatus
JP2003061098A (en) Image processor, image processing method, recording medium and program
TWI260531B (en) Communications terminal apparatus, reception apparatus, and method therefor
CN104166654A (en) System and method for inquiring audio message relevant information

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 09832853

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 09832853

Country of ref document: EP

Kind code of ref document: A1