CN114900718A - Multi-region perception automatic multi-subtitle realization method, device and system - Google Patents

Multi-region perception automatic multi-subtitle realization method, device and system Download PDF

Info

Publication number
CN114900718A
CN114900718A CN202210814117.8A CN202210814117A CN114900718A CN 114900718 A CN114900718 A CN 114900718A CN 202210814117 A CN202210814117 A CN 202210814117A CN 114900718 A CN114900718 A CN 114900718A
Authority
CN
China
Prior art keywords
directory
content
subtitle
file
terminal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210814117.8A
Other languages
Chinese (zh)
Inventor
韦月飞
张灵晶
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen SDMC Technology Co Ltd
Original Assignee
Shenzhen SDMC Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen SDMC Technology Co Ltd filed Critical Shenzhen SDMC Technology Co Ltd
Priority to CN202210814117.8A priority Critical patent/CN114900718A/en
Publication of CN114900718A publication Critical patent/CN114900718A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/235Processing of additional data, e.g. scrambling of additional data or processing content descriptors
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/40Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
    • G06F16/41Indexing; Data structures therefor; Storage structures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/40Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
    • G06F16/48Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/233Processing of audio elementary streams
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/25Management operations performed by the server for facilitating the content distribution or administrating data related to end-users or client devices, e.g. end-user or client device authentication, learning user preferences for recommending movies
    • H04N21/258Client or end-user data management, e.g. managing client capabilities, user preferences or demographics, processing of multiple end-users preferences to derive collaborative data
    • H04N21/25808Management of client data
    • H04N21/25841Management of client data involving the geographical location of the client
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/4302Content synchronisation processes, e.g. decoder synchronisation
    • H04N21/4307Synchronising the rendering of multiple content streams or additional data on devices, e.g. synchronisation of audio on a mobile phone with the video output on the TV screen
    • H04N21/43072Synchronising the rendering of multiple content streams or additional data on devices, e.g. synchronisation of audio on a mobile phone with the video output on the TV screen of multiple content streams on the same device
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/488Data services, e.g. news ticker
    • H04N21/4884Data services, e.g. news ticker for displaying subtitles
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/83Generation or processing of protective or descriptive data associated with content; Content structuring
    • H04N21/845Structuring of content, e.g. decomposing content into time segments
    • H04N21/8455Structuring of content, e.g. decomposing content into time segments involving pointers to the content, e.g. pointers to the I-frames of the video stream

Abstract

The application discloses and provides a method, a device and a system for realizing multi-region perception automatic multi-subtitle, wherein the method comprises the following steps: receiving a content playing request from a terminal, wherein the content playing request comprises a content identifier and an IP address of the terminal; searching the area information of the area where the terminal is located according to the IP address of the terminal; acquiring an audio file from a pre-generated injection content output directory according to the content identification, creating a regional language subtitle corresponding to the regional information based on the voice in the audio file, and generating a regional language subtitle output directory; and generating an index file according to the regional language subtitle output directory, and returning the index file to the terminal, so that the problems that in the prior art, subtitle files need to be made in advance, and the subtitle display time of artificially made subtitles is not synchronous with audio and video pictures are solved.

Description

Multi-region perception automatic multi-subtitle realization method, device and system
Technical Field
The invention relates to the technical field of multimedia audio and video, in particular to a method, a device and a system for realizing multi-region perception automatic multi-subtitle.
Background
With the development of internet technology, people can watch tv and movies in different countries and different languages through the internet, but in the process of watching tv or movies in non-native languages, it is difficult to understand the meaning expressed by the character dialogue in tv or movies without the subtitle prompting function, so the subtitle prompting function plays a role of a bridge for understanding the meaning expressed by the character dialogue in movie works in non-native languages, and at present, the traditional method for displaying multiple subtitles in movie works is as follows: the content operator makes multilingual subtitle files of film and television works in advance, such as Chinese, English, German and Korean, when the user plays the film and television works, the user can select corresponding subtitles from a multi-subtitle list, and the server sends the corresponding subtitle files according to the selection of the user, although the realization method solves the problem of subtitle presentation of the film and television works in non-native languages to a certain extent, the realization method still has some defects: firstly, the subtitle files need to be made in advance, so that when a user watches movie works, a subtitle list selectable by the user is fixed, has limited selectivity and lacks flexibility, if the subtitle list does not contain subtitles corresponding to the native language of the user, the subtitles lose due effects for the user, and the user experience is unfriendly; secondly, the problem that the subtitle display time is not synchronous with the audio and video picture can occur when subtitles are artificially made, and the subtitles do not play a due role under the condition.
Disclosure of Invention
Therefore, the technical problem to be solved by the present invention is to overcome the problems in the prior art that the subtitle file needs to be made in advance, when the user watches the movie and television works, the subtitle list selectable by the user is fixed, the selectivity is limited, and the flexibility is lacked, if the subtitle list does not contain the subtitle corresponding to the native language of the user, the subtitle loses the due effect of the subtitle, and the user experience is unfriendly; the problem that the subtitle display time is not synchronous with the audio and video picture when the subtitles are manufactured manually is solved, and the subtitles do not play a due role under the condition, so that the multi-region perception automatic multi-subtitle realization method, device and system are provided.
In order to solve the above technical problems, the embodiments of the present disclosure at least provide a method, an apparatus, and a system for implementing multi-region aware automatic multi-subtitle.
In a first aspect, an embodiment of the present disclosure provides a method for implementing multi-region aware automatic multi-subtitle, including:
receiving a content playing request from a terminal, wherein the content playing request comprises a content identifier and an IP address of the terminal;
searching the area information of the area where the terminal is located according to the IP address of the terminal;
acquiring an audio file from a pre-generated injection content output directory according to the content identification, creating a regional language subtitle corresponding to the regional information based on the voice in the audio file, and generating a regional language subtitle output directory;
and generating an index file according to the regional language subtitle output directory, and returning the index file to the terminal.
Optionally, the injection content output catalog is generated according to the following manner:
receiving an injected content request from a content management system, the injected content request including path information of a source file;
generating a content identifier of the injected content and creating a corresponding output directory;
acquiring a source file according to the path information of the source file, separating an audio file from the source file, and writing the audio file into the output directory;
packaging and slicing the source file, outputting a video slice to a first directory, and outputting an audio slice to a second directory;
and writing the first directory information and the second directory information into the output directory to obtain the injection content output directory.
Optionally, after separating the audio file from the source file and writing the audio file into the output directory, generating a complete path of the audio file;
the acquiring the audio file from the pre-generated injection content output directory according to the content identifier comprises:
searching for a complete path of the audio file according to the content identifier;
and acquiring the audio file from the injection content output directory according to the complete path of the audio file.
Optionally, the generating an index file according to the regional language subtitle output directory includes:
acquiring a first directory and a second directory from the injection content output directory;
and generating a primary index file according to the regional language subtitle output directory, the first directory and the second directory.
In a second aspect, an embodiment of the disclosure further provides a multi-region-aware automatic multi-subtitle implementing apparatus, including:
the distribution module is used for receiving a content playing request from a terminal, wherein the content playing request comprises a content identifier and an IP address of the terminal;
the area module is used for searching the area information of the area where the terminal is located according to the IP address of the terminal;
the caption module is used for acquiring an audio file from a pre-generated injection content output directory according to the content identification, creating regional language captions corresponding to the regional information based on the voice in the audio file, and generating a regional language caption output directory;
and the distribution module is used for generating an index file according to the regional language subtitle output directory and returning the index file to the terminal.
Optionally, the injection content output catalog is generated according to the following manner:
receiving an injected content request from a content management system, the injected content request including path information of a source file;
generating a content identifier of the injected content and creating a corresponding output directory;
acquiring a source file according to the path information of the source file, separating an audio file from the source file, and writing the audio file into the output directory;
packaging and slicing the source file, outputting a video slice to a first directory, and outputting an audio slice to a second directory;
and writing the first directory information and the second directory information into the output directory to obtain the injection content output directory.
Optionally, the generating an index file according to the regional language subtitle output directory includes:
acquiring a first directory and a second directory from the injection content output directory;
and generating a primary index file according to the regional language subtitle output directory, the first directory and the second directory.
In a third aspect, an embodiment of the present disclosure further provides a system for implementing multi-region aware automatic multi-subtitle, including:
a content management system, a terminal and the multi-region-aware automatic multi-subtitle implementation apparatus of the second aspect.
In a fourth aspect, an embodiment of the present disclosure further provides a computer device, including: a processor, a memory and a bus, the memory storing machine-readable instructions executable by the processor, the processor and the memory communicating via the bus when the computer device is running, the machine-readable instructions when executed by the processor performing the steps of the first aspect described above, or any possible implementation of the first aspect.
In a fifth aspect, the disclosed embodiments of the present invention further provide a computer-readable storage medium, where a computer program is stored on the computer-readable storage medium, and the computer program is executed by a processor to perform the steps in the first aspect or any possible implementation manner of the first aspect.
The technical scheme provided by the embodiment of the invention has the following beneficial effects:
acquiring an audio file from a pre-generated injection content output directory, creating regional language subtitles corresponding to the regional information based on voice in the audio file, and automatically generating subtitle files of corresponding native languages for users in different regions; the traditional manual subtitle making mode can be avoided, and the subtitle making flexibility is improved; due to the adoption of the technical means of automatically generating the subtitles, the problem of asynchronism with audio possibly occurring in the process of manually making the subtitles is effectively avoided. Furthermore, indexes of video, audio and regional language subtitles of the content are sent to a terminal user by generating a mode of injecting a content output directory and returning an index file, a source file does not need to be changed, and synchronous display of the subtitles and the content at the terminal is also guaranteed.
It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the invention, as claimed.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, and it is obvious that the drawings in the following description are some embodiments of the present invention, and other drawings can be obtained by those skilled in the art without creative efforts.
FIG. 1 is a flow chart of a multi-region aware automatic multi-subtitle implementation method according to an embodiment of the disclosure;
FIG. 2 is a schematic structural diagram of a multi-region-aware automatic multi-subtitle implementation apparatus according to an embodiment of the disclosure;
FIG. 3 is a schematic structural diagram of a multi-region-aware automatic multi-subtitle implementation system according to an embodiment of the disclosure;
FIG. 4 is a flow chart illustrating another multi-region aware automatic multi-subtitle implementation method provided in the disclosed embodiments of the present invention;
FIG. 5 is a diagram illustrating a directory structure under an output directory provided by the disclosed embodiments;
FIG. 6 is a diagram illustrating a directory structure under a video directory provided by a disclosed embodiment of the invention;
FIG. 7 is a diagram illustrating a directory structure under an audio directory provided by an embodiment of the present disclosure;
FIG. 8 is a table illustrating a tile list of outputs from the output directory after processing is complete, as provided by the disclosed embodiments;
FIG. 9 is a diagram of a primary index file generated by a distribution module provided by a disclosed embodiment of the invention;
fig. 10 shows a schematic structural diagram of a computer device according to an embodiment of the present disclosure.
Detailed Description
Reference will now be made in detail to the exemplary embodiments, examples of which are illustrated in the accompanying drawings. When the following description refers to the accompanying drawings, like numbers in different drawings represent the same or similar elements unless otherwise indicated. The implementations described in the following exemplary examples do not represent all implementations consistent with the present invention. Rather, they are merely examples of apparatus and methods consistent with certain aspects of the invention, as detailed in the appended claims.
Example 1
As shown in fig. 1, the method for implementing multi-region aware automatic multi-subtitle provided by the embodiment of the present disclosure includes:
s11: receiving a content playing request from a terminal, wherein the content playing request comprises a content identifier and an IP address of the terminal;
s12: searching the area information of the area where the terminal is located according to the IP address of the terminal;
s13: acquiring an audio file from a pre-generated injection content output directory according to the content identification, creating a regional language subtitle corresponding to the regional information based on the voice in the audio file, and generating a regional language subtitle output directory;
s14: and generating an index file according to the regional language subtitle output directory, and returning the index file to the terminal.
In particular practice, the injection content output catalog is generated according to the following manner:
receiving an injected content request from a content management system, the injected content request including path information of a source file;
generating a content identifier of the injected content and creating a corresponding output directory;
acquiring a source file according to the path information of the source file, separating an audio file from the source file, and writing the audio file into the output directory;
packaging and slicing the source file, outputting a video slice to a first directory, and outputting an audio slice to a second directory;
and writing the first directory information and the second directory information into the output directory to obtain the injection content output directory.
In specific practice, after audio files are separated from source files and written into the output directory, an audio file complete path is generated;
the acquiring the audio file from the pre-generated injection content output directory according to the content identifier comprises:
searching for a complete path of the audio file according to the content identifier;
and acquiring the audio file from the injection content output directory according to the complete path of the audio file.
In a specific practice, the generating an index file according to the regional language subtitle output directory includes:
acquiring a first directory and a second directory from the injection content output directory;
and generating a primary index file according to the regional language subtitle output directory, the first directory and the second directory.
It can be understood that, in the technical solution provided in this embodiment, an audio file is obtained from a pre-generated injection content output directory, a subtitle in a region language corresponding to the region information is created based on a voice in the audio file, and a subtitle file in a corresponding native language is automatically generated for users in different regions; the traditional manual subtitle making mode can be avoided, and the subtitle making flexibility is improved; due to the adoption of the technical means of automatically generating the subtitles, the problem of asynchronism with audio possibly occurring in the process of manually making the subtitles is effectively avoided. Furthermore, indexes of video, audio and regional language subtitles of the content are sent to a terminal user by generating a mode of injecting a content output directory and returning an index file, a source file does not need to be changed, and synchronous display of the subtitles and the content at the terminal is also guaranteed.
Example 2
As shown in fig. 2, an embodiment of the present invention further provides a multi-region aware automatic multi-subtitle implementing apparatus, including:
the distribution module is used for receiving a content playing request from a terminal, wherein the content playing request comprises a content identifier and an IP address of the terminal;
the area module is used for searching the area information of the area where the terminal is located according to the IP address of the terminal;
the caption module is used for acquiring an audio file from a pre-generated injection content output directory according to the content identification, creating regional language captions corresponding to the regional information based on the voice in the audio file, and generating a regional language caption output directory;
and the distribution module is used for generating an index file according to the regional language subtitle output directory and returning the index file to the terminal.
In particular practice, the injection content output catalog is generated according to the following manner:
receiving an injected content request from a content management system, the injected content request including path information of a source file;
generating a content identifier of the injected content and creating a corresponding output directory;
acquiring a source file according to the path information of the source file, separating an audio file from the source file, and writing the audio file into the output directory;
packaging and slicing the source file, outputting a video slice to a first directory, and outputting an audio slice to a second directory;
and writing the first directory information and the second directory information into the output directory to obtain the injection content output directory.
In a specific practice, the generating an index file according to the regional language subtitle output directory includes:
acquiring a first directory and a second directory from the injection content output directory;
and generating a primary index file according to the regional language subtitle output directory, the first directory and the second directory.
It can be understood that, in the technical solution provided in this embodiment, an audio file is obtained from a pre-generated injection content output directory, a subtitle in a region language corresponding to the region information is created based on a voice in the audio file, and a subtitle file in a corresponding native language is automatically generated for users in different regions; the traditional manual subtitle making mode can be avoided, and the subtitle making flexibility is improved; due to the adoption of the technical means of automatically generating the subtitles, the problem of asynchronism with audio possibly occurring in the process of manually making the subtitles is effectively avoided. Furthermore, indexes of video, audio and regional language subtitles of the content are sent to a terminal user by generating a mode of injecting a content output directory and returning an index file, a source file does not need to be changed, and synchronous display of the subtitles and the content at the terminal is also guaranteed.
Example 3
As shown in fig. 3, an embodiment of the present invention further provides a multi-region aware automatic multi-subtitle implementation system, including:
a server, a terminal and a content management system;
the server side comprises: the system comprises a distribution module, a region module, a caption module and a slicing module;
the distribution module is used for receiving a content playing request from a terminal and synchronizing the IP address of the terminal in the content playing request of the terminal to the area module, wherein the content playing request comprises a content identifier and the IP address of the terminal;
the region module searches the region information of the region where the terminal is located according to the IP address of the terminal, synchronizes the searched region information to the caption module, and requests the caption module to output the caption of the language of the corresponding region;
the subtitle module acquires an audio file from a pre-generated injection content output directory according to the content identification, creates regional language subtitles corresponding to the regional information based on voice in the audio file, generates a regional language subtitle output directory, and synchronizes the regional language subtitle output directory to the distribution module;
and the distribution module generates an index file according to the regional language subtitle output directory and returns the index file to the terminal.
In particular practice, the injection content output catalog is generated according to the following manner:
the slicing module receives an injected content request from a content management system, the injected content request including path information of a source file;
generating a content identifier of the injected content and creating a corresponding output directory;
acquiring a source file according to the path information of the source file, separating an audio file from the source file, and writing the audio file into the output directory;
packaging and slicing the source file, outputting a video slice to a first directory, and outputting an audio slice to a second directory;
and writing the first directory information and the second directory information into the output directory to obtain the injection content output directory.
In a specific practice, the terminal is further configured to receive the first-level index file returned by the distribution module, and then sequentially request the segment index file and the segment file of the video, the audio and the subtitle.
In a specific practice, the content management system is further configured to request, by means of HTTP POST, to inject content into the slicing module, where the request includes: local path or remote path information of the source file.
In specific practice, after audio files are separated from source files and written into the output directory, an audio file complete path is generated;
the acquiring the audio file from the pre-generated injection content output directory according to the content identifier comprises:
searching for a complete path of the audio file according to the content identifier;
and acquiring the audio file from the injection content output directory according to the complete path of the audio file.
In a specific practice, the generating an index file according to the regional language subtitle output directory includes:
acquiring a first directory and a second directory from the injection content output directory;
and generating a primary index file according to the regional language subtitle output directory, the first directory and the second directory.
It can be understood that, in the technical solution provided in this embodiment, an audio file is obtained from a pre-generated injection content output directory, a subtitle in a region language corresponding to the region information is created based on a voice in the audio file, and a subtitle file in a corresponding native language is automatically generated for users in different regions; the traditional manual subtitle making mode can be avoided, and the subtitle making flexibility is improved; due to the adoption of the technical means of automatically generating the subtitles, the problem of asynchronism with audio possibly occurring in the process of manually making the subtitles is effectively avoided. Furthermore, indexes of video, audio and regional language subtitles of the content are sent to a terminal user by generating a mode of injecting a content output directory and returning an index file, a source file does not need to be changed, and synchronous display of the subtitles and the content at the terminal is also guaranteed.
Example 4
As shown in fig. 4, another method for implementing multi-region aware automatic multi-subtitle is further provided in the embodiments of the present invention, including:
s41: the content management system requests injection content from the slicing module through an HTTP POST mode, the injection content request comprises path information of a source file, a local path or a remote path can be designated, such as FTP or HTTP, and if the local path is transmitted, the local path is assumed to be: mp 4;
s42: after receiving the injection content request, the slicing module generates a unique number CID for the injection content, such as: 978e1c5a93c356b50c1e03dd1e3120f2, creating an output directory for the injected content using CID, such as: /output/978e1c5a93c356b50c1e03dd1e3120f 2; separating audio files from the source file, such as: mp4, write-out directory/output/978 e1c5a93c356b50c1e03dd1e3120f2, synchronize source file audio file full path/output/978 e1c5a93c356b50c1e03dd1e3120f2/file _ audio. mp4 to subtitle module; the slicing module packs and slices the source file and outputs the video slice to a first directory, such as: output/978e1c5a93c356b50c1e03dd1e3120f2/video, output audio slices to a second directory, such as: the output/978e1c5a93c356b50c1e03dd1e3120f2/audio, when the output directory has the following directory structure as shown in FIG. 5:
under the video directory are the video slice file and the slice index file, as shown in fig. 6:
under the audio directory are audio slice files and corresponding slice index files, as shown in fig. 7:
s43: the slicing module outputs directory information to the distribution module for synchronizing the content, and after the information synchronization is successful, a pull stream address of the injected content is returned, and if the access domain name corresponding to the distribution module is edge. https:// edge.movie.tv/output/978e1c5a93c356b50c1e03dd1e3120f 2/master.m3u8;
s44: after the content injection is successful, the operator puts the content on shelf;
s45: assuming that at time T0, a user in beijing, china requests to play the content on shelf, assuming that the IP of the user accessing the internet is 221.221.151.40, CID =978e1c5a93c356b50c1e03dd1e3120f2, after the distribution module receives the request for injecting the content, the distribution module synchronizes the IP address of the terminal requesting the injection content to the area module, assuming that the terminal IP: 221.221.151.40, the area module searches the area corresponding to the injected content request according to the terminal IP address, the area module synchronizes the searched area information to the caption module, the caption module checks whether the caption of the corresponding area has been created under the output directory, if the caption file of the area has been created, go to step S47; otherwise, go to step S46:
s46: the caption module acquires an audio file from a pre-generated injection content output directory according to the content identification, creates regional language captions corresponding to the regional information based on the voice in the audio file, and generates a regional language caption output directory, wherein the output directory is as follows: /output/978e1c5a93c356b50c1e03dd1e3120f 2/subtitle/zh/;
the list of fragments output under the output directory after the processing is completed is shown in fig. 8;
wherein init.m4s is a fragment containing metadata information;
s47: the distribution module generates a primary index file according to the regional language subtitle output directory and directly returns the primary index file to the client, and the primary index file generated by the distribution module is shown in fig. 9;
in this embodiment, after receiving the first-level index file returned by the distribution module, the terminal sequentially requests the segment index files and the segment files of the video, the audio, and the subtitle.
In this embodiment, the system implementation method in the live scene is similar to the system implementation method in the on-demand scene, and the difference is that the input form of the audio is: the video-on-demand scene is input in the form of an audio file, the live scene is input in the form of a real-time stream, and other processing flows are consistent with the video-on-demand scene flow.
It can be understood that, in the technical solution provided in this embodiment, an audio file is obtained from a pre-generated injection content output directory, a subtitle in a region language corresponding to the region information is created based on a voice in the audio file, and a subtitle file in a corresponding native language is automatically generated for users in different regions; the traditional manual subtitle making mode can be avoided, and the subtitle making flexibility is improved; due to the adoption of the technical means of automatically generating the subtitles, the problem of asynchronism with audio possibly occurring in the process of manually making the subtitles is effectively avoided. Furthermore, indexes of video, audio and regional language subtitles of the content are sent to a terminal user by generating a mode of injecting a content output directory and returning an index file, a source file does not need to be changed, and synchronous display of the subtitles and the content at the terminal is also guaranteed.
Example 5
An embodiment of the present invention further provides a computer device, including a memory 1 and a processor 2, as shown in fig. 10, where the memory 1 stores a computer program, and the processor 2 implements any one of the methods when executing the computer program.
The memory 1 includes at least one type of readable storage medium, which includes a flash memory, a hard disk, a multimedia card, a card type memory (e.g., SD or DX memory, etc.), a magnetic memory, a magnetic disk, an optical disk, and the like. The memory 1 may in some embodiments be an internal storage unit, e.g. a hard disk, of a multi-region aware automatic multi-subtitle implementation system. The memory 1 may also be an external storage device of a multi-domain aware automatic multi-subtitle implementation system in other embodiments, such as a plug-in hard disk, a Smart Media Card (SMC), a Secure Digital (SD) Card, a Flash memory Card (Flash Card), and so on. Further, the memory 1 may also comprise both internal storage units of a multi-region aware automatic multi-subtitle implementation and external storage devices. The memory 1 may be used not only to store application software installed in a multi-region-aware automatic multi-subtitle implementation and various types of data, such as codes of a multi-region-aware automatic multi-subtitle implementation program, but also to temporarily store data that has been output or will be output.
The processor 2 may be a Central Processing Unit (CPU), a controller, a microcontroller, a microprocessor or other data Processing chip in some embodiments, and is used to run program codes or process data stored in the memory 1, such as a multi-region-aware automatic multi-subtitle implementation program.
It can be understood that, in the technical solution provided in this embodiment, an audio file is obtained from a pre-generated injection content output directory, a subtitle in a region language corresponding to the region information is created based on a voice in the audio file, and a subtitle file in a corresponding native language is automatically generated for users in different regions; the traditional manual subtitle making mode can be avoided, and the subtitle making flexibility is improved; due to the adoption of the technical means of automatically generating the subtitles, the problem of asynchronism with audio possibly occurring in the process of manually making the subtitles is effectively avoided. Furthermore, indexes of video, audio and regional language subtitles of the content are sent to a terminal user by generating a mode of injecting a content output directory and returning an index file, a source file does not need to be changed, and synchronous display of the subtitles and the content at the terminal is also guaranteed.
The disclosed embodiments of the present invention also provide a computer-readable storage medium having a computer program stored thereon, where the computer program is executed by a processor to perform the steps of the method described in the above method embodiments. The storage medium may be a volatile or non-volatile computer-readable storage medium.
The computer program product of the multi-region-aware automatic multi-subtitle implementation method provided in the embodiments of the present disclosure includes a computer-readable storage medium storing a program code, where instructions included in the program code may be used to execute steps of the method described in the above method embodiments, which may be referred to in the above method embodiments specifically, and are not described herein again.
The embodiments disclosed herein also provide a computer program, which when executed by a processor implements any one of the methods of the preceding embodiments. The computer program product may be embodied in hardware, software or a combination thereof. In an alternative embodiment, the computer program product is embodied in a computer storage medium, and in another alternative embodiment, the computer program product is embodied in a Software product, such as a Software Development Kit (SDK), or the like.
It is understood that the same or similar parts in the above embodiments may be mutually referred to, and the same or similar parts in other embodiments may be referred to for the content which is not described in detail in some embodiments.
It should be noted that the terms "first," "second," and the like in the description of the present invention are used for descriptive purposes only and are not to be construed as indicating or implying relative importance. Further, in the description of the present invention, the meaning of "a plurality" means at least two unless otherwise specified.
Any process or method descriptions in flow charts or otherwise described herein may be understood as representing modules, segments, or portions of code which include one or more executable instructions for implementing specific logical functions or steps of the process, and alternate implementations are included within the scope of the preferred embodiment of the present invention in which functions may be executed out of order from that shown or discussed, including substantially concurrently or in reverse order, depending on the functionality involved, as would be understood by those reasonably skilled in the art of the present invention.
It should be understood that portions of the present invention may be implemented in hardware, software, firmware, or a combination thereof. In the above embodiments, the various steps or methods may be implemented in software or firmware stored in memory and executed by a suitable instruction execution system. For example, if implemented in hardware, as in another embodiment, any one or combination of the following techniques, which are known in the art, may be used: a discrete logic circuit having a logic gate circuit for implementing a logic function on a data signal, an application specific integrated circuit having an appropriate combinational logic gate circuit, a Programmable Gate Array (PGA), a Field Programmable Gate Array (FPGA), or the like.
It will be understood by those skilled in the art that all or part of the steps carried by the method for implementing the above embodiments may be implemented by hardware related to instructions of a program, which may be stored in a computer readable storage medium, and when the program is executed, the program includes one or a combination of the steps of the method embodiments.
In addition, functional units in the embodiments of the present invention may be integrated into one processing module, or each unit may exist alone physically, or two or more units are integrated into one module. The integrated module can be realized in a hardware mode, and can also be realized in a software functional module mode. The integrated module, if implemented in the form of a software functional module and sold or used as a separate product, may also be stored in a computer-readable storage medium.
The storage medium mentioned above may be a read-only memory, a magnetic or optical disk, etc.
In the description herein, references to the description of the term "one embodiment," "some embodiments," "an example," "a specific example," or "some examples," etc., mean that a particular feature, structure, material, or characteristic described in connection with the embodiment or example is included in at least one embodiment or example of the invention. In this specification, the schematic representations of the terms used above do not necessarily refer to the same embodiment or example. Furthermore, the particular features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples.
Although embodiments of the present invention have been shown and described above, it is understood that the above embodiments are exemplary and should not be construed as limiting the present invention, and that variations, modifications, substitutions and alterations can be made to the above embodiments by those of ordinary skill in the art within the scope of the present invention.

Claims (10)

1. A multi-region perception automatic multi-subtitle realization method is characterized by comprising the following steps:
receiving a content playing request from a terminal, wherein the content playing request comprises a content identifier and an IP address of the terminal;
searching the area information of the area where the terminal is located according to the IP address of the terminal;
acquiring an audio file from a pre-generated injection content output directory according to the content identification, creating a regional language subtitle corresponding to the regional information based on the voice in the audio file, and generating a regional language subtitle output directory;
and generating an index file according to the regional language subtitle output directory, and returning the index file to the terminal.
2. The method of claim 1, wherein the injection content output catalog is generated according to:
receiving an injected content request from a content management system, the injected content request including path information of a source file;
generating a content identifier of the injected content and creating a corresponding output directory;
acquiring a source file according to the path information of the source file, separating an audio file from the source file, and writing the audio file into the output directory;
packaging and slicing the source file, outputting a video slice to a first directory, and outputting an audio slice to a second directory;
and writing the first directory information and the second directory information into the output directory to obtain the injection content output directory.
3. The method of claim 2, wherein after separating an audio file from a source file and writing to the output directory, generating a complete path of audio files;
the acquiring the audio file from the pre-generated injection content output directory according to the content identifier comprises:
searching for a complete path of the audio file according to the content identifier;
and acquiring the audio file from the injection content output directory according to the complete path of the audio file.
4. The method of claim 3, wherein generating an index file according to the regional language subtitle output directory comprises:
acquiring a first directory and a second directory from the injection content output directory;
and generating a primary index file according to the regional language subtitle output directory, the first directory and the second directory.
5. The utility model provides an automatic many captions realization device of multizone perception which characterized in that includes:
the distribution module is used for receiving a content playing request from a terminal, wherein the content playing request comprises a content identifier and an IP address of the terminal;
the area module is used for searching the area information of the area where the terminal is located according to the IP address of the terminal;
the caption module is used for acquiring an audio file from a pre-generated injection content output directory according to the content identification, creating regional language captions corresponding to the regional information based on the voice in the audio file, and generating a regional language caption output directory;
and the distribution module is used for generating an index file according to the regional language subtitle output directory and returning the index file to the terminal.
6. The apparatus of claim 5, wherein the injection content output catalog is generated according to:
receiving an injected content request from a content management system, the injected content request including path information of a source file;
generating a content identifier of the injected content and creating a corresponding output directory;
acquiring a source file according to the path information of the source file, separating an audio file from the source file, and writing the audio file into the output directory;
packaging and slicing the source file, outputting a video slice to a first directory, and outputting an audio slice to a second directory;
and writing the first directory information and the second directory information into the output directory to obtain the injection content output directory.
7. The apparatus of claim 6, wherein the generating an index file according to the regional language subtitle output directory comprises:
acquiring a first directory and a second directory from the injection content output directory;
and generating a primary index file according to the regional language subtitle output directory, the first directory and the second directory.
8. A multi-region-aware automatic multi-subtitle implementation system, comprising a content management system, a terminal, and the multi-region-aware automatic multi-subtitle implementation apparatus according to any one of claims 5-7.
9. A computer device, comprising: a processor, a memory and a bus, the memory storing machine-readable instructions executable by the processor, the processor and the memory communicating over the bus when a computer device is running, the machine-readable instructions when executed by the processor performing the method of any of claims 1 to 4.
10. A computer-readable storage medium, having stored thereon a computer program which, when executed by a processor, performs the method of any one of claims 1 to 4.
CN202210814117.8A 2022-07-12 2022-07-12 Multi-region perception automatic multi-subtitle realization method, device and system Pending CN114900718A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210814117.8A CN114900718A (en) 2022-07-12 2022-07-12 Multi-region perception automatic multi-subtitle realization method, device and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210814117.8A CN114900718A (en) 2022-07-12 2022-07-12 Multi-region perception automatic multi-subtitle realization method, device and system

Publications (1)

Publication Number Publication Date
CN114900718A true CN114900718A (en) 2022-08-12

Family

ID=82729271

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210814117.8A Pending CN114900718A (en) 2022-07-12 2022-07-12 Multi-region perception automatic multi-subtitle realization method, device and system

Country Status (1)

Country Link
CN (1) CN114900718A (en)

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103605710A (en) * 2013-11-12 2014-02-26 天脉聚源(北京)传媒科技有限公司 Distributed audio and video processing device and distributed audio and video processing method
CN105025319A (en) * 2015-07-09 2015-11-04 无锡天脉聚源传媒科技有限公司 Video pushing method and device
CN108055574A (en) * 2017-11-29 2018-05-18 上海网达软件股份有限公司 Media file transcoding generates the method and system of multitone rail multi-subtitle on-demand content
CN109275046A (en) * 2018-08-21 2019-01-25 华中师范大学 A kind of teaching data mask method based on double video acquisitions
CN111246314A (en) * 2020-01-14 2020-06-05 深圳市华曦达科技股份有限公司 Time-shifting live broadcast method, server device, client device and live broadcast system
CN113194356A (en) * 2021-03-23 2021-07-30 武永鑫 Video subtitle adding method and system
CN113286176A (en) * 2021-05-19 2021-08-20 中山亿联智能科技有限公司 STB IPTV AI intelligent translation system
CN113709579A (en) * 2021-08-05 2021-11-26 中移(杭州)信息技术有限公司 Audio and video data transmission method and device and storage medium
CN114040255A (en) * 2021-10-28 2022-02-11 上海网达软件股份有限公司 Live caption generating method, system, equipment and storage medium

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103605710A (en) * 2013-11-12 2014-02-26 天脉聚源(北京)传媒科技有限公司 Distributed audio and video processing device and distributed audio and video processing method
CN105025319A (en) * 2015-07-09 2015-11-04 无锡天脉聚源传媒科技有限公司 Video pushing method and device
CN108055574A (en) * 2017-11-29 2018-05-18 上海网达软件股份有限公司 Media file transcoding generates the method and system of multitone rail multi-subtitle on-demand content
CN109275046A (en) * 2018-08-21 2019-01-25 华中师范大学 A kind of teaching data mask method based on double video acquisitions
CN111246314A (en) * 2020-01-14 2020-06-05 深圳市华曦达科技股份有限公司 Time-shifting live broadcast method, server device, client device and live broadcast system
CN113194356A (en) * 2021-03-23 2021-07-30 武永鑫 Video subtitle adding method and system
CN113286176A (en) * 2021-05-19 2021-08-20 中山亿联智能科技有限公司 STB IPTV AI intelligent translation system
CN113709579A (en) * 2021-08-05 2021-11-26 中移(杭州)信息技术有限公司 Audio and video data transmission method and device and storage medium
CN114040255A (en) * 2021-10-28 2022-02-11 上海网达软件股份有限公司 Live caption generating method, system, equipment and storage medium

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
冯亚涛: "CDN 技术在广电 VOD 平台的设计与应用", 《中国优秀硕士学位论文全文数据库 (信息科技辑)》 *
赵成: "MPEG-DASH 流媒体优化技术研究", 《中国优秀硕士学位论文全文数据库 (信息科技辑)》 *

Similar Documents

Publication Publication Date Title
US9875318B2 (en) Concepts for providing an enhanced media presentation
KR101299639B1 (en) Method and system for content delivery
US8931024B2 (en) Receiving apparatus and subtitle processing method
US9100701B2 (en) Enhanced video systems and methods
US8719869B2 (en) Method for sharing data and synchronizing broadcast data with additional information
KR101138396B1 (en) Method and apparatus for playing contents in IPTV terminal
JP6043089B2 (en) Broadcast communication cooperative receiver
US20080085099A1 (en) Media player apparatus and method thereof
BRPI0821388A2 (en) CONTROL OF REPRODUCTION OF CONTINUOUS MEDIA TRANSMISSION
EP2822288A1 (en) Method and apparatus for frame accurate advertisement insertion
KR101293301B1 (en) System and method for serching images using caption of moving picture in keyword
CN104581399B (en) The method and system that hot word is searched in a kind of TV box
US20190215580A1 (en) Modifying subtitles to reflect changes to audiovisual programs
US10972809B1 (en) Video transformation service
JP2018510552A (en) Method and associated apparatus for providing a media presentation guide in a media streaming over hypertext transfer protocol
CN114900718A (en) Multi-region perception automatic multi-subtitle realization method, device and system
WO2017096883A1 (en) Video recommendation method and system
CN107396168B (en) Television live broadcasting system capable of configuring launchers
KR101869053B1 (en) System of providing speech bubble or score, method of receiving augmented broadcasting contents and apparatus for performing the same, method of providing augmented contents and apparatus for performing the same
EP4027649A1 (en) Video playing method and apparatus, terminal and computer-readable storage medium
KR101749420B1 (en) Apparatus and method for extracting representation image of video contents using closed caption
KR102664295B1 (en) Method and apparatus for providing a platform for sign language subtitles video
CN115474072A (en) Content collaborative distribution processing method, device and equipment for multiple terminal equipment
Cosmas et al. Multimedia broadcast and internet satellite system design and user trial results
KR20160036658A (en) Method, apparatus and system for covert advertising

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20220812