CN113225615B - Television program playing method, terminal equipment, server and storage medium - Google Patents

Television program playing method, terminal equipment, server and storage medium

Info

Publication number
CN113225615B
Authority
CN
China
Prior art keywords
television program
description information
server
text description
playing
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202110427756.4A
Other languages
Chinese (zh)
Other versions
CN113225615A (en)
Inventor
朱星龙
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Jiuzhou Electric Appliance Co Ltd
Original Assignee
Shenzhen Jiuzhou Electric Appliance Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Jiuzhou Electric Appliance Co Ltd
Priority to CN202110427756.4A
Publication of CN113225615A
Application granted
Publication of CN113225615B
Legal status: Active
Anticipated expiration

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44 Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/439 Processing of audio elementary streams
    • H04N21/4394 Processing of audio elementary streams involving operations for analysing the audio stream, e.g. detecting features or characteristics in audio streams
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44 Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs
    • H04N21/4402 Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display
    • H04N21/440236 Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display by media transcoding, e.g. video is transformed into a slideshow of still pictures, audio is converted into text
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47 End-user applications
    • H04N21/472 End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content
    • H04N21/47202 End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content for requesting content on demand, e.g. video on demand
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47 End-user applications
    • H04N21/482 End-user interface for program selection
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47 End-user applications
    • H04N21/488 Data services, e.g. news ticker
    • H04N21/4888 Data services, e.g. news ticker for displaying teletext characters

Abstract

The invention discloses a television program playing method for a terminal device, comprising the following steps: upon receiving a target television program, acquiring a key frame of the television program; sending the key frame to a server, so that the server performs data collection on the key frame to obtain text description information; upon receiving the text description information sent by the server, obtaining read-aloud audio based on the text description information; and playing the read-aloud audio. The invention also discloses a terminal device, a server and a computer-readable storage medium. With this television program playing method, no technician needs to record description audio manually, which saves labor cost and provides a good user experience.

Description

Television program playing method, terminal equipment, server and storage medium
Technical Field
The present invention relates to the field of television program processing, and in particular, to a television program playing method, a terminal device, a server, and a computer readable storage medium.
Background
At present, two television program playing methods are available to enable visually impaired people to watch television programs. In the first, voice-overs describing the current scene are added to the original audio of the television program. In the second, the normal audio is kept separate from the voice-over audio, and the user can choose whether the voice-over is played.
In both methods, a technician records a spoken description of the pictures in the television program, and the description is stored in the television program as a voice-over so that visually impaired people can follow the program.
However, the existing television program playing methods incur high labor costs.
Disclosure of Invention
The main purpose of the invention is to provide a television program playing method, a terminal device, a server and a computer-readable storage medium, aiming to solve the technical problem that the existing television program playing methods incur high labor costs.
To achieve the above purpose, the present invention provides a television program playing method for a terminal device, the method comprising the following steps:
when receiving a target television program, acquiring a key frame of the television program;
sending the key frame to a server, so that the server performs data collection on the key frame to obtain text description information;
when receiving the text description information sent by the server, obtaining read-aloud audio based on the text description information;
and playing the read-aloud audio.
Optionally, before the step of sending the key frame to the server, the method further includes:
acquiring the current time corresponding to the key frame;
the step of sending the key frame to the server includes:
sending the key frame and the current time to the server, so that the server performs data collection on the key frame to obtain picture description information, and marks the picture description information with the current time to obtain the text description information.
Optionally, the target television program is a television program composed of GOP sequences; the step of acquiring a key frame of the television program when receiving the target television program includes:
when receiving the target television program, acquiring a GOP sequence in the target television program;
and determining the I frame in the GOP sequence as the key frame.
Optionally, the step of obtaining read-aloud audio based on the text description information when receiving the text description information sent by the server includes:
when receiving the text description information, decoding the text description information to obtain natural language information;
and when the current time in the text description information arrives, reading the natural language information aloud to obtain the read-aloud audio.
In addition, to achieve the above purpose, the present invention further provides a television program playing method for a server, the method comprising the following steps:
receiving a key frame sent by a terminal device, wherein the key frame is acquired from a television program by the terminal device when it receives the target television program;
performing data collection on the key frame to obtain text description information;
and sending the text description information to the terminal device, so that the terminal device obtains read-aloud audio based on the text description information and plays the read-aloud audio.
Optionally, before the step of performing data collection on the key frame to obtain text description information, the method further includes:
receiving the current time sent by the terminal device, wherein the current time corresponds to the key frame;
the step of performing data collection on the key frame to obtain text description information includes:
performing data collection on the key frame to obtain picture description information;
and marking the picture description information with the current time to obtain the text description information.
Optionally, the text description information includes the current time, the names of the things in the key frame, the position information of the things, and the features of the things.
In addition, to achieve the above purpose, the present invention also provides a terminal device including: a memory, a processor, and a television program playing program stored in the memory and executable on the processor, wherein the television program playing program, when executed by the processor, implements the steps of the television program playing method described above.
In addition, to achieve the above purpose, the present invention also provides a server including: a memory, a processor, and a television program playing program stored in the memory and executable on the processor, wherein the television program playing program, when executed by the processor, implements the steps of the television program playing method described above.
In addition, to achieve the above purpose, the present invention also provides a computer-readable storage medium storing a television program playing program which, when executed by a processor, implements the steps of the television program playing method according to any one of the above.
The technical solution of the invention provides a television program playing method for a terminal device, comprising the following steps: when receiving a target television program, acquiring a key frame of the television program; sending the key frame to a server, so that the server performs data collection on the key frame to obtain text description information; when receiving the text description information sent by the server, obtaining read-aloud audio based on the text description information; and playing the read-aloud audio.
In the existing television program playing methods, a technician must manually record audio describing the pictures to obtain a voice-over, which is then added to the television program; this incurs considerable labor cost. In the present invention, the server performs data collection on the key frame to obtain text description information, and the terminal device obtains read-aloud audio based on the text description information, so no technician needs to record description audio manually, which saves labor cost and provides a good user experience.
Drawings
To illustrate the embodiments of the present invention or the technical solutions in the prior art more clearly, the drawings needed for describing the embodiments or the prior art are briefly introduced below. The drawings described below show only some embodiments of the present invention; a person skilled in the art can obtain other drawings from the structures shown in these drawings without inventive effort.
Fig. 1 is a schematic structural diagram of a terminal device in a hardware running environment according to an embodiment of the present invention;
Fig. 2 is a flowchart of a first embodiment of the television program playing method of the present invention;
Fig. 3 is a schematic diagram of an exemplary key frame of the present invention;
Fig. 4 is a flowchart of a second embodiment of the television program playing method of the present invention;
Fig. 5 is a block diagram of a first embodiment of the television program playing apparatus of the present invention;
Fig. 6 is a block diagram of a second embodiment of the television program playing apparatus of the present invention.
The achievement of the objects, functional features and advantages of the present invention will be further described with reference to the accompanying drawings in conjunction with the embodiments.
Detailed Description
The following description of the embodiments of the present invention will be made clearly and fully with reference to the accompanying drawings, in which it is evident that the embodiments described are only some, but not all embodiments of the invention. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.
Referring to Fig. 1, Fig. 1 is a schematic structural diagram of a terminal device in a hardware running environment according to an embodiment of the present invention.
The terminal device may be a mobile phone, a smartphone, a notebook computer, a digital broadcast receiver, a personal digital assistant (PDA), a tablet computer (PAD) or other user equipment (UE), a handheld device, a vehicle-mounted device, a wearable device, a computing device or another processing device connected to a wireless modem, a mobile station (MS), or the like. The terminal device may also be referred to as a user terminal, a portable terminal, a desktop terminal, and so on.
In general, a terminal device includes: at least one processor 301, a memory 302 and a television program playing program stored on said memory and executable on said processor, said television program playing program being configured to implement the steps of the television program playing method as described above.
Processor 301 may include one or more processing cores, such as a 4-core processor or an 8-core processor. The processor 301 may be implemented in at least one hardware form among DSP (Digital Signal Processing), FPGA (Field-Programmable Gate Array) and PLA (Programmable Logic Array). The processor 301 may also include a main processor and a coprocessor: the main processor, also called a CPU (Central Processing Unit), processes data in the awake state, while the coprocessor is a low-power processor that processes data in the standby state. In some embodiments, the processor 301 may integrate a GPU (Graphics Processing Unit), which renders and draws the content to be displayed on the display screen. The processor 301 may also include an AI (Artificial Intelligence) processor for handling operations related to the television program playing method, so that a model for the television program playing method can be trained and can learn autonomously, improving efficiency and accuracy.
Memory 302 may include one or more computer-readable storage media, which may be non-transitory. Memory 302 may also include high-speed random access memory, as well as non-volatile memory, such as one or more magnetic disk storage devices, flash memory storage devices. In some embodiments, a non-transitory computer readable storage medium in memory 302 is used to store at least one instruction for execution by processor 301 to implement the television program playback method provided by the method embodiments herein.
In some embodiments, the terminal may further optionally include: a communication interface 303, and at least one peripheral device. The processor 301, the memory 302 and the communication interface 303 may be connected by a bus or signal lines. The respective peripheral devices may be connected to the communication interface 303 through a bus, signal line, or circuit board. Specifically, the peripheral device includes: at least one of radio frequency circuitry 304, a display screen 305, and a power supply 306.
The communication interface 303 may be used to connect at least one I/O (Input/Output) related peripheral device to the processor 301 and the memory 302. In some embodiments, the processor 301, the memory 302 and the communication interface 303 are integrated on the same chip or circuit board; in some other embodiments, any one or two of the processor 301, the memory 302 and the communication interface 303 may be implemented on a separate chip or circuit board, which is not limited in this embodiment.
The radio frequency circuit 304 is configured to receive and transmit RF (Radio Frequency) signals, also known as electromagnetic signals. The radio frequency circuit 304 communicates with communication networks and other communication devices via electromagnetic signals: it converts an electrical signal into an electromagnetic signal for transmission, or converts a received electromagnetic signal into an electrical signal. Optionally, the radio frequency circuit 304 includes an antenna system, an RF transceiver, one or more amplifiers, a tuner, an oscillator, a digital signal processor, a codec chipset, a subscriber identity module card, and so forth. The radio frequency circuit 304 may communicate with other terminals via at least one wireless communication protocol. The wireless communication protocol includes, but is not limited to, protocols for metropolitan area networks, the various generations of mobile communication networks (2G, 3G, 4G and 5G), wireless local area networks and/or WiFi (Wireless Fidelity) networks. In some embodiments, the radio frequency circuit 304 may also include NFC (Near Field Communication) related circuitry, which is not limited in this application.
The display screen 305 is used to display a UI (User Interface). The UI may include graphics, text, icons, video, and any combination thereof. When the display screen 305 is a touch screen, it can also collect touch signals on or above its surface. A touch signal may be input to the processor 301 as a control signal for processing. In this case, the display screen 305 may also be used to provide virtual buttons and/or a virtual keyboard, also referred to as soft buttons and/or a soft keyboard. In some embodiments, there may be one display screen 305, disposed on the front panel of the electronic device; in other embodiments, there may be at least two display screens 305, disposed on different surfaces of the electronic device or in a folded design; in still other embodiments, the display screen 305 may be a flexible display disposed on a curved or folded surface of the electronic device. The display screen 305 may even be arranged in a non-rectangular irregular shape, i.e. a shaped screen. The display screen 305 may be made of LCD (Liquid Crystal Display), OLED (Organic Light-Emitting Diode) or other materials.
The power supply 306 is used to power the various components in the electronic device. The power supply 306 may be an alternating current supply, a direct current supply, a disposable battery or a rechargeable battery. When the power supply 306 includes a rechargeable battery, the rechargeable battery may support wired or wireless charging. The rechargeable battery may also support fast-charging technology.
Those skilled in the art will appreciate that the structure shown in Fig. 1 does not limit the terminal device, which may include more or fewer components than illustrated, combine certain components, or use a different arrangement of components.
In addition, an embodiment of the invention also provides a server. The structure of the server is similar to that of the terminal device described above and is not repeated here.
In addition, an embodiment of the invention also provides a computer-readable storage medium storing a television program playing program which, when executed by a processor, implements the steps of the television program playing method described above; a detailed description is therefore not repeated here, and the description of the beneficial effects of the same method is likewise omitted. For technical details not disclosed in the embodiments of the computer-readable storage medium of the present application, please refer to the description of the method embodiments of the present application. As an example, the program instructions may be deployed to be executed on one terminal device, on multiple terminal devices located at one site, or on multiple terminal devices distributed across multiple sites and interconnected by a communication network.
Those skilled in the art will appreciate that all or part of the above-described methods may be implemented by a computer program stored on a computer-readable storage medium; when executed, the program may carry out the steps of the method embodiments described above. The computer-readable storage medium may be a magnetic disk, an optical disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), or the like.
Based on the above hardware structure, embodiments of the television program playing method of the present invention are proposed.
Referring to Fig. 2, Fig. 2 is a flowchart of a first embodiment of the television program playing method of the present invention. The method is used for a terminal device and includes the following steps:
Step S11: when receiving a target television program, acquiring a key frame of the television program.
It should be noted that the method is executed by a terminal device and a server. A terminal-side television program playing program is installed on the terminal device and a server-side television program playing program is installed on the server; the terminal device and the server each execute their respective program, thereby implementing the television program playing method of the invention.
Further, the target television program is a television program composed of GOP sequences; the step of acquiring a key frame of the television program when receiving the target television program includes: when receiving the target television program, acquiring a GOP sequence in the target television program; and determining the I frame in the GOP sequence as the key frame.
It will be appreciated that the target television program is a television program to be watched by a user (a visually impaired person) and is typically video data. The video data consists of groups of pictures, each being a series of consecutive encoded picture frames, i.e. a GOP sequence. One GOP sequence may take the form IBBPBBPBBPBBPBB (the number of frames in a GOP sequence is not fixed and depends on the specific data of the television program; a GOP sequence contains one I frame and a number of alternating B frames and P frames). In this application, the GOP sequences of the target television program are decoded in the memory of the terminal device before being played, and the terminal device plays them as continuous pictures according to timestamps (the play time carried in the video frames) or a synchronization mechanism.
For example, suppose there are currently two GOP sequences: IBBPBBPBBPBBPBB and IBBPBBPBBPBBPBB. While the first GOP sequence is being played, the key frame (the I frame) of the next GOP sequence is acquired and sent to the server.
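The selection of the next key frame can be sketched as follows. This is a minimal illustration only, assuming each GOP sequence is represented as a string of frame-type letters; the function names and this representation are hypothetical and do not come from the patent, and a real terminal would read frame types from the decoded video stream.

```python
from typing import List, Optional, Tuple


def find_key_frames(gop: str) -> List[int]:
    """Return the indices of the I frames in one GOP given as frame-type letters."""
    return [i for i, frame_type in enumerate(gop) if frame_type == "I"]


def key_frame_of_next_gop(gops: List[str], playing_index: int) -> Optional[Tuple[int, int]]:
    """While GOP `playing_index` is playing, locate the key frame of the next GOP.

    Returns (gop_index, frame_index), or None when there is no next GOP
    or the next GOP contains no I frame.
    """
    next_index = playing_index + 1
    if next_index >= len(gops):
        return None
    i_frames = find_key_frames(gops[next_index])
    if not i_frames:
        return None
    return next_index, i_frames[0]


# The two GOP sequences from the example in the text.
gops = ["IBBPBBPBBPBBPBB", "IBBPBBPBBPBBPBB"]
print(key_frame_of_next_gop(gops, 0))  # (1, 0)
```

With the first GOP playing, the sketch points at frame 0 of the second GOP, which is the frame the terminal would send to the server.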
In the present application, the terminal device plays the group of pictures corresponding to a GOP sequence, and the I frame is usually the first frame in a GOP sequence. Data may be transmitted quickly between the terminal device and the server over a millimeter-wave (5G) protocol. Moreover, while the group of pictures corresponding to one GOP sequence is being played, the key frame of the next GOP sequence is acquired, and the key frame (together with the current time, i.e. the time at which the key frame is played) is sent to the server. For these three reasons, while the terminal device plays the group of pictures corresponding to one GOP sequence, the server and the terminal device have enough time for data transmission and data processing, so that the text description information is obtained before the group of pictures of the GOP sequence to which the key frame belongs is played, and the read-aloud audio corresponding to the text description information can then be played.
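The timing argument can be made concrete with a little arithmetic. The sketch below assumes a frame rate of 25 frames per second and illustrative latency figures; none of these numbers appear in the patent, and the function names are hypothetical.

```python
def gop_duration_seconds(frame_count: int, frames_per_second: float) -> float:
    """Playback time of one GOP: the time budget available for the round trip."""
    return frame_count / frames_per_second


def description_arrives_in_time(
    frame_count: int,
    frames_per_second: float,
    upload_s: float,
    server_processing_s: float,
    download_s: float,
) -> bool:
    """True if the key frame round trip fits within the playback of the current GOP."""
    budget = gop_duration_seconds(frame_count, frames_per_second)
    return upload_s + server_processing_s + download_s <= budget


# A 15-frame GOP (IBBPBBPBBPBBPBB) at an assumed 25 frames per second.
print(gop_duration_seconds(15, 25.0))  # 0.6
print(description_arrives_in_time(15, 25.0, 0.05, 0.3, 0.05))  # True
```

Under these assumed figures the budget is 0.6 seconds per GOP; the time needed to synthesize speech on the terminal is not modeled here and would reduce the margin.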
Step S12: sending the key frame to a server, so that the server performs data collection on the key frame to obtain text description information.
Further, before step S12, the method further includes: acquiring the current time corresponding to the key frame. Correspondingly, step S12 includes: sending the key frame and the current time to the server, so that the server performs data collection on the key frame to obtain picture description information, and marks the picture description information with the current time to obtain the text description information.
The server collects picture description information from the key frame based on a preset algorithm (the preset algorithm may be chosen by the user as required, and the invention does not limit it). The picture description information includes the names of the things in the key frame, the position information of the things and the features of the things (that is, the text description information includes the current time, the names of the things in the key frame, the position information of the things and the features of the things). The features of a thing may include its size, color, shape and the like.
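One way to picture the marking step is as a small data structure. The sketch below is an assumption for illustration: the class names, field names and `mark_with_time` are hypothetical, and the preset algorithm that actually produces the picture description information is not shown because the patent leaves it open.

```python
from dataclasses import dataclass, asdict
from typing import List


@dataclass
class ThingDescription:
    """One thing found in the key frame (hypothetical field names)."""
    name: str
    position: str
    features: List[str]


@dataclass
class TextDescription:
    """Picture description information marked with the key frame's current time."""
    current_time: str
    things: List[ThingDescription]


def mark_with_time(things: List[ThingDescription], current_time: str) -> TextDescription:
    """Attach the current time to the picture description information."""
    return TextDescription(current_time=current_time, things=list(things))


picture_description = [
    ThingDescription("adult man", "left side of the picture", ["blue short-sleeved shirt"]),
    ThingDescription("sky", "upper third of the picture", ["blue", "many white clouds"]),
]
text_description = mark_with_time(picture_description, "08:00:00")
print(asdict(text_description)["current_time"])  # 08:00:00
```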
The server generates a transmission file from the obtained text description information according to a preset script; the transmission file may be in txt or json format, and the text description information is sent to the terminal device in the form of this transmission file. The preset script may be in the following format:
Step S13: when receiving the text description information sent by the server, obtaining read-aloud audio based on the text description information.
Step S14: playing the read-aloud audio.
It can be understood that the text description information is sent as a file in txt or json format, and the terminal device extracts the text description information from the transmission file.
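A json transmission file and its extraction on the terminal might look like the sketch below. The patent's own preset script format is not reproduced in this text, so the key names `current_time` and `things` and both function names are purely hypothetical stand-ins.

```python
import json
from typing import Any, Dict, List


def build_transmission_file(current_time: str, things: List[Dict[str, Any]]) -> str:
    """Server side: serialize text description information as a json string."""
    return json.dumps({"current_time": current_time, "things": things}, ensure_ascii=False)


def extract_text_description(payload: str) -> Dict[str, Any]:
    """Terminal side: parse the transmission file and check the expected keys."""
    data = json.loads(payload)
    missing = {"current_time", "things"} - data.keys()
    if missing:
        raise ValueError(f"transmission file is missing keys: {sorted(missing)}")
    return data


payload = build_transmission_file(
    "08:00:00",
    [{"name": "small lake", "position": "about 100 meters from the adult man", "features": ["clear"]}],
)
description = extract_text_description(payload)
print(description["current_time"])  # 08:00:00
```

Validating the keys on receipt lets the terminal skip a malformed file instead of reading garbage aloud.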
Further, the step of obtaining read-aloud audio based on the text description information when receiving the text description information sent by the server includes: when receiving the text description information, decoding the text description information to obtain natural language information; and when the current time in the text description information arrives, reading the natural language information aloud to obtain the read-aloud audio.
If the text description information were read aloud directly, the resulting speech would correspond to encoded data rather than natural language. The text description information therefore needs to be decoded and translated into natural language information that the user (a visually impaired person) can understand; reading this natural language information aloud yields read-aloud audio that the user can understand.
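The patent does not specify how decoding and translation are performed. As one possible illustration, the sketch below uses a simple sentence template over a structured description; the dictionary keys and the function name `to_natural_language` are assumptions, and a template is a swapped-in technique rather than the patent's method.

```python
from typing import Any, Dict, List


def to_natural_language(description: Dict[str, Any]) -> str:
    """Turn structured description entries into plain sentences using a template."""
    sentences: List[str] = []
    for thing in description["things"]:
        parts = [f"There is {thing['name']} {thing['position']}"]
        parts.extend(thing["features"])
        sentences.append(", ".join(parts) + ".")
    return " ".join(sentences)


description = {
    "current_time": "08:00:00",
    "things": [
        {
            "name": "an adult man",
            "position": "on the left side of the picture",
            "features": ["wearing a blue short-sleeved shirt", "swinging a golf club"],
        }
    ],
}
print(to_natural_language(description))
# There is an adult man on the left side of the picture, wearing a blue short-sleeved shirt, swinging a golf club.
```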
Referring to fig. 3, fig. 3 is a schematic diagram of an exemplary keyframe according to the present invention. The key frame is an I frame in a GOP sequence, and the corresponding current time is 08:00:00 (24 hours, i.e. 8 a.m.), at this time, the content of the picture description information obtained by the server is as follows:
then, the server sends the text description information (including the above picture description information and the target event) to the terminal device in a manner of transmitting text (txt format or json format file), and the terminal device decodes and translates the text description information to obtain natural language information:
the character in the current picture plays golf in the golf course; an adult man wearing a blue short sleeve on the left side of the picture and wearing a white five-part pants on the lower body of the picture has a height of about one meter seven and swings the golf club; the sky with blue color at one third of the upper part of the picture has a lot of white clouds; the third of the middle part of the picture is far away from the trees with green or brown leaves, and the branches of the trees are connected with one another; there is a small lake about 100 meters from an adult man. The lake surface is provided with a plurality of trees, and the lake water is clear in wave light; the grass is green and tidy at one third of the lower part of the picture, sunlight is sprinkled on the grass, and a plurality of shadows of trees exist.
After the text description information is obtained, the natural language information is read aloud when the current time in the text description information arrives, yielding the read-aloud audio, which comprises the audio corresponding to the text.
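One way to hold a decoded description until its current time arrives is a small time-ordered queue. The following is a sketch under assumptions: the class `ReadAloudQueue`, the helper `to_seconds`, and the HH:MM:SS time format are illustrative and are not specified by the source beyond the 08:00:00 example.

```python
import heapq

def to_seconds(hhmmss: str) -> int:
    """Convert an HH:MM:SS play time to seconds since midnight."""
    h, m, s = (int(part) for part in hhmmss.split(":"))
    return h * 3600 + m * 60 + s

class ReadAloudQueue:
    """Keeps decoded descriptions until their marked play time is reached."""

    def __init__(self):
        self._heap = []   # entries: (play time in seconds, insertion order, text)
        self._count = 0

    def add(self, current_time: str, text: str) -> None:
        heapq.heappush(self._heap, (to_seconds(current_time), self._count, text))
        self._count += 1

    def due(self, playback_time: str) -> list:
        """Return, in time order, every text whose play time has arrived."""
        now = to_seconds(playback_time)
        ready = []
        while self._heap and self._heap[0][0] <= now:
            ready.append(heapq.heappop(self._heap)[2])
        return ready

queue = ReadAloudQueue()
queue.add("08:00:02", "A later description.")
queue.add("08:00:00", "The person in the picture is playing golf.")
print(queue.due("07:59:59"))  # nothing is due before 08:00:00
print(queue.due("08:00:00"))  # the 08:00:00 description is now due
```

The texts returned by `due` would then be handed to whatever speech synthesizer the terminal uses; that part is left out because the source does not name one.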
The technical solution of the invention provides a television program playing method for a terminal device, comprising the following steps: when a target television program is received, acquiring a key frame of the target television program; sending the key frame to a server, so that the server performs data acquisition on the key frame to obtain text description information; when the text description information sent by the server is received, obtaining read-aloud audio based on the text description information; and playing the read-aloud audio.
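The key-frame step can be sketched using the rule stated in the claims: while the group of pictures of the current GOP sequence is playing, the I frame of the next GOP sequence is taken as the key frame. The `Frame` type, its fields, and the function `next_key_frame` are hypothetical; a real terminal would read frame types from the decoder rather than from a Python list.

```python
from dataclasses import dataclass
from typing import List, Optional

@dataclass
class Frame:
    kind: str        # "I", "P" or "B"
    play_time: str   # assumed HH:MM:SS play time of the frame

def next_key_frame(gops: List[List[Frame]], playing_index: int) -> Optional[Frame]:
    """While GOP `playing_index` is playing, return the I frame of the next GOP."""
    upcoming = playing_index + 1
    if upcoming >= len(gops):
        return None  # the program has no further GOP
    for frame in gops[upcoming]:
        if frame.kind == "I":
            return frame
    return None  # defensive: the next GOP contained no I frame

# Two toy GOPs; real GOPs are much longer.
program = [
    [Frame("I", "07:59:59"), Frame("P", "07:59:59"), Frame("B", "07:59:59")],
    [Frame("I", "08:00:00"), Frame("B", "08:00:00"), Frame("P", "08:00:00")],
]
key = next_key_frame(program, playing_index=0)
print(key)
```

Fetching the key frame one GOP ahead gives the server round trip time to complete before the corresponding picture is shown, which is presumably why the claims select the next GOP rather than the current one.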
In the existing television program playing method, a technician must manually record audio describing the picture to obtain a voice-over, which is then added to the television program; this incurs considerable labor cost. In the invention, the server performs data acquisition on the key frames to obtain the text description information, and the terminal device obtains the read-aloud audio based on that text description information, so no technician needs to record descriptive audio manually. This saves labor cost and improves the user experience.
In addition, in the existing method, a technician records the voice description (voice-over) manually, so the information in the voice-over cannot be continuously updated or optimized, while the information in the pictures of a television program may be updated at any time. As a result, the voice-over lags behind, is not comprehensive enough, and gives a poor user experience. In the invention, the server can update its algorithm at any time, so the resulting text description information does not lag, is more comprehensive, and gives a better user experience.
Referring to fig. 4, fig. 4 is a flowchart of a second embodiment of the television program playing method according to the present invention. The method is used for a server and comprises the following steps:
step S21: receiving a key frame sent by a terminal device, wherein the key frame is acquired from a target television program when the terminal device receives the target television program;
step S22: performing data acquisition on the key frame to obtain text description information;
step S23: sending the text description information to the terminal device, so that the terminal device obtains read-aloud audio based on the text description information and plays the read-aloud audio.
Further, before the step of performing data acquisition on the key frame to obtain the text description information, the method further includes: receiving the current time sent by the terminal device, wherein the current time corresponds to the key frame. Correspondingly, the step of performing data acquisition on the key frame to obtain the text description information includes: performing data acquisition on the key frame to obtain picture description information; and marking the picture description information with the current time to obtain the text description information.
For the server-side content, refer to the description above; it is not repeated here.
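The server-side steps S21 to S23, including marking the picture description with the current time, can be sketched as follows. This assumes a JSON message; `build_text_description`, the field names, and `fake_describe` are hypothetical. In particular, `fake_describe` is only a stand-in for the server's recognition algorithm, which the source does not specify.

```python
import json
from typing import Callable, Dict, List

def build_text_description(key_frame: bytes, current_time: str,
                           describe: Callable[[bytes], List[Dict]]) -> str:
    """Describe the key frame, then mark the description with the current time."""
    picture_description = describe(key_frame)   # data acquisition on the key frame
    text_description = {
        "time": current_time,                   # play time sent by the terminal
        "objects": picture_description,
    }
    return json.dumps(text_description, ensure_ascii=False)

def fake_describe(key_frame: bytes) -> List[Dict]:
    # Placeholder result; a real server would analyse the image data here.
    return [{"name": "small lake", "position": "middle third",
             "features": ["clear water"]}]

# The returned string is what the server would send back to the terminal.
message = build_text_description(b"\x00\x01", "08:00:00", fake_describe)
print(message)
```

Passing the recognition function in as a parameter reflects the point made in the description that the server can update its algorithm at any time without changing the message it sends to the terminal.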
Referring to fig. 5, fig. 5 is a block diagram of a first embodiment of a television program playing apparatus according to the present invention. The apparatus is used for a terminal device and comprises:
the first receiving module 10, configured to acquire a key frame of a target television program when the target television program is received;
the first sending module 20, configured to send the key frame to a server, so that the server performs data acquisition on the key frame to obtain text description information;
the obtaining module 30, configured to obtain read-aloud audio based on the text description information when the text description information sent by the server is received;
and the playing module 40, configured to play the read-aloud audio.
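The four terminal-side modules can be pictured as one object that chains four injected functions. This is only a structural sketch: the class `TerminalPlayer`, its parameter names, and the lambda stand-ins are hypothetical, and no real network transfer or speech synthesis is performed.

```python
from typing import Any, Callable, List

class TerminalPlayer:
    """Chains the four modules: receive, send, obtain audio, play."""

    def __init__(self,
                 get_key_frame: Callable[[Any], Any],    # first receiving module 10
                 send_to_server: Callable[[Any], str],   # first sending module 20
                 synthesize: Callable[[str], Any],       # obtaining module 30
                 play: Callable[[Any], None]):           # playing module 40
        self.get_key_frame = get_key_frame
        self.send_to_server = send_to_server
        self.synthesize = synthesize
        self.play = play

    def handle_program(self, program: Any) -> None:
        key_frame = self.get_key_frame(program)
        text_description = self.send_to_server(key_frame)
        audio = self.synthesize(text_description)
        self.play(audio)

# Stand-ins that only record what would be played.
played: List[str] = []
player = TerminalPlayer(
    get_key_frame=lambda program: program[0],
    send_to_server=lambda frame: f"description of {frame}",
    synthesize=lambda text: f"audio<{text}>",
    play=played.append,
)
player.handle_program(["frame-1", "frame-2"])
print(played)
```

Keeping each module behind a plain callable makes it straightforward to swap the stand-ins for a real transport or text-to-speech engine without touching the control flow.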
Referring to fig. 6, fig. 6 is a block diagram of a first embodiment of a television program playing apparatus according to the present invention. The apparatus is used for a server and comprises:
a second receiving module 50, configured to receive a key frame sent by a terminal device, where the key frame is acquired from a target television program when the terminal device receives the target television program;
the acquisition module 60, configured to perform data acquisition on the key frame to obtain text description information;
and the second sending module 70, configured to send the text description information to the terminal device, so that the terminal device obtains read-aloud audio based on the text description information and plays the read-aloud audio.
The foregoing describes only optional embodiments of the present invention and is not intended to limit its scope. All equivalent structural changes made using the description and drawings of the present invention, and all direct or indirect applications in other related technical fields, fall within the scope of the invention.

Claims (9)

1. A method for playing a television program, for use with a terminal device, the method comprising the steps of:
when a target television program is received, acquiring a key frame of the target television program;
sending the key frame to a server, so that the server performs data acquisition on the key frame to obtain text description information; the text description information comprises names of things in the key frame, position information of the things in the picture, and characteristics of the things; data is transmitted between the terminal device and the server at high speed via a millimeter-wave 5G protocol;
when the text description information sent by the server is received, obtaining read-aloud audio based on the text description information;
playing the read-aloud audio;
wherein the step of acquiring the key frame of the target television program when the target television program is received comprises:
while the group of pictures corresponding to the current GOP sequence is being played, acquiring the next GOP sequence in the target television program; the target television program is a television program formed of GOP sequences;
and determining the I frame in the next GOP sequence as the key frame.
2. The method of claim 1, wherein prior to the step of sending the key frame to a server, the method further comprises:
acquiring the current time corresponding to the key frame; the current time is the playing time of the key frame;
and the step of sending the key frame to a server comprises:
sending the key frame and the current time to the server, so that the server performs data acquisition on the key frame to obtain picture description information and marks the picture description information with the current time to obtain the text description information.
3. The method of claim 2, wherein the step of obtaining the read-aloud audio based on the text description information when the text description information sent by the server is received comprises:
decoding the text description information when the text description information is received, to obtain natural language information;
and when the current time in the text description information arrives, reading the natural language information aloud to obtain the read-aloud audio.
4. A method of playing a television program for a server, the method comprising the steps of:
receiving a key frame sent by a terminal device, wherein data is transmitted between the terminal device and the server at high speed via a millimeter-wave 5G protocol; the key frame is the I frame in the next GOP sequence, obtained from the target television program while the terminal device plays the group of pictures corresponding to the current GOP sequence of the target television program; the target television program is a television program formed of GOP sequences;
performing data acquisition on the key frame to obtain text description information; the text description information comprises names of things in the key frame, position information of the things in the picture, and characteristics of the things;
and sending the text description information to the terminal device, so that the terminal device obtains read-aloud audio based on the text description information and plays the read-aloud audio.
5. The method of claim 4, wherein prior to the step of performing data acquisition on the key frame to obtain text description information, the method further comprises:
receiving the current time sent by the terminal device, wherein the current time is the playing time corresponding to the key frame;
and the step of performing data acquisition on the key frame to obtain the text description information comprises:
performing data acquisition on the key frame to obtain picture description information;
and marking the picture description information with the current time to obtain the text description information.
6. The method of claim 5, wherein the textual description information further includes the current time.
7. A terminal device, characterized in that the terminal device comprises: a memory, a processor and a television program playing program stored on the memory and running on the processor, which when executed by the processor, implements the steps of the television program playing method of any one of claims 1 to 3.
8. A server, the server comprising: memory, a processor and a television program playing program stored on the memory and running on the processor, which when executed by the processor, implements the steps of the television program playing method according to any of claims 5 to 6.
9. A computer readable storage medium, wherein a television program playing program is stored on the computer readable storage medium, which when executed by a processor, implements the steps of the television program playing method according to any one of claims 1 to 6.
CN202110427756.4A 2021-04-20 2021-04-20 Television program playing method, terminal equipment, server and storage medium Active CN113225615B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110427756.4A CN113225615B (en) 2021-04-20 2021-04-20 Television program playing method, terminal equipment, server and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110427756.4A CN113225615B (en) 2021-04-20 2021-04-20 Television program playing method, terminal equipment, server and storage medium

Publications (2)

Publication Number Publication Date
CN113225615A CN113225615A (en) 2021-08-06
CN113225615B true CN113225615B (en) 2023-08-08

Family

ID=77088080

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110427756.4A Active CN113225615B (en) 2021-04-20 2021-04-20 Television program playing method, terminal equipment, server and storage medium

Country Status (1)

Country Link
CN (1) CN113225615B (en)

Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5677739A (en) * 1995-03-02 1997-10-14 National Captioning Institute System and method for providing described television services
CN101286274A (en) * 2008-05-08 2008-10-15 李卫红 Digital video automatic explaining system for blind men
CN101458951A (en) * 2008-12-30 2009-06-17 胡礼斌 Video and audio program signal processing system having multiple functions
CN104980790A (en) * 2015-06-30 2015-10-14 北京奇艺世纪科技有限公司 Voice subtitle generating method and apparatus, and playing method and apparatus
WO2018121001A1 (en) * 2016-12-30 2018-07-05 深圳市九洲电器有限公司 Method and system for outputting simultaneous interpretation of digital television program, and smart terminal
CN109275027A (en) * 2018-09-26 2019-01-25 Tcl海外电子(惠州)有限公司 Speech output method, electronic playback devices and the storage medium of video
CN109672932A (en) * 2018-12-29 2019-04-23 深圳Tcl新技术有限公司 Assist method, system, equipment and the storage medium of people with visual impairment viewing video
CN110519636A (en) * 2019-09-04 2019-11-29 腾讯科技(深圳)有限公司 Voice messaging playback method, device, computer equipment and storage medium
CN111046223A (en) * 2019-11-14 2020-04-21 李秉伦 Voice assisting method, terminal, server and system for visually impaired
CN111538862A (en) * 2020-05-15 2020-08-14 北京百度网讯科技有限公司 Method and device for explaining video
CN112087672A (en) * 2020-08-13 2020-12-15 浙江大学 Video stream description generation method using intelligent terminal and server

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8229748B2 (en) * 2008-04-14 2012-07-24 At&T Intellectual Property I, L.P. Methods and apparatus to present a video program to a visually impaired person
US9214093B2 (en) * 2011-10-28 2015-12-15 Sony Corporation Audio description availability notifier

Also Published As

Publication number Publication date
CN113225615A (en) 2021-08-06

Similar Documents

Publication Publication Date Title
CN109766066A (en) A kind of method of Message Processing, relevant apparatus and system
CN111448587A (en) Display method, uploading method and device of advertisement pictures
CN112672053A (en) Photographing method, photographing device, terminal equipment and computer-readable storage medium
CN112565863A (en) Video playing method and device, terminal equipment and computer readable storage medium
CN109474833B (en) Network live broadcast method, related device and system
CN112612526B (en) Application program control method, device, terminal equipment and storage medium
CN112269554B (en) Display system and display method
CN112689172B (en) Program playing method and device, set top box and storage medium
CN113038232A (en) Video playing method, device, equipment, server and storage medium
CN113225615B (en) Television program playing method, terminal equipment, server and storage medium
US20240005830A1 (en) Always-on-display method and electronic device
CN113014830A (en) Video blurring method, device, equipment and storage medium
CN113436576A (en) OLED display screen dimming method and device applied to two-dimensional code scanning
CN113766060B (en) Information screen display method, electronic equipment and computer readable storage medium
CN113099300B (en) Program playing method, device, display terminal and storage medium
CN114999535A (en) Voice data processing method and device in online translation process
CN112397064A (en) Light adjusting method and device, terminal equipment and storage medium
CN103856604A (en) Multimedia terminal easy for operation
CN112437333B (en) Program playing method, device, terminal equipment and storage medium
CN112183217A (en) Gesture recognition method, interaction method based on gesture recognition and mixed reality glasses
CN114495859B (en) Picture display method, device, display terminal and storage medium
CN114173172B (en) Data processing method, device, terminal equipment and storage medium
CN112423004B (en) Video data transmission method, device, transmitting end and storage medium
CN112349248B (en) Screen brightness adjusting method and device, multimedia terminal and computer readable storage medium
CN112423062B (en) Video character information display method, device, terminal equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant