CN114900718A

CN114900718A - Multi-region perception automatic multi-subtitle realization method, device and system

Info

Publication number: CN114900718A
Application number: CN202210814117.8A
Authority: CN
Inventors: 韦月飞; 张灵晶
Original assignee: Shenzhen SDMC Technology Co Ltd
Current assignee: Shenzhen SDMC Technology Co Ltd
Priority date: 2022-07-12
Filing date: 2022-07-12
Publication date: 2022-08-12

Abstract

The application discloses and provides a method, a device and a system for realizing multi-region perception automatic multi-subtitle, wherein the method comprises the following steps: receiving a content playing request from a terminal, wherein the content playing request comprises a content identifier and an IP address of the terminal; searching the area information of the area where the terminal is located according to the IP address of the terminal; acquiring an audio file from a pre-generated injection content output directory according to the content identification, creating a regional language subtitle corresponding to the regional information based on the voice in the audio file, and generating a regional language subtitle output directory; and generating an index file according to the regional language subtitle output directory, and returning the index file to the terminal, so that the problems that in the prior art, subtitle files need to be made in advance, and the subtitle display time of artificially made subtitles is not synchronous with audio and video pictures are solved.

Description

Multi-region perception automatic multi-subtitle realization method, device and system

Technical Field

The invention relates to the technical field of multimedia audio and video, in particular to a method, a device and a system for realizing multi-region perception automatic multi-subtitle.

Background

With the development of internet technology, people can watch tv and movies in different countries and different languages through the internet, but in the process of watching tv or movies in non-native languages, it is difficult to understand the meaning expressed by the character dialogue in tv or movies without the subtitle prompting function, so the subtitle prompting function plays a role of a bridge for understanding the meaning expressed by the character dialogue in movie works in non-native languages, and at present, the traditional method for displaying multiple subtitles in movie works is as follows: the content operator makes multilingual subtitle files of film and television works in advance, such as Chinese, English, German and Korean, when the user plays the film and television works, the user can select corresponding subtitles from a multi-subtitle list, and the server sends the corresponding subtitle files according to the selection of the user, although the realization method solves the problem of subtitle presentation of the film and television works in non-native languages to a certain extent, the realization method still has some defects: firstly, the subtitle files need to be made in advance, so that when a user watches movie works, a subtitle list selectable by the user is fixed, has limited selectivity and lacks flexibility, if the subtitle list does not contain subtitles corresponding to the native language of the user, the subtitles lose due effects for the user, and the user experience is unfriendly; secondly, the problem that the subtitle display time is not synchronous with the audio and video picture can occur when subtitles are artificially made, and the subtitles do not play a due role under the condition.

Disclosure of Invention

Therefore, the technical problem to be solved by the present invention is to overcome the problems in the prior art that the subtitle file needs to be made in advance, when the user watches the movie and television works, the subtitle list selectable by the user is fixed, the selectivity is limited, and the flexibility is lacked, if the subtitle list does not contain the subtitle corresponding to the native language of the user, the subtitle loses the due effect of the subtitle, and the user experience is unfriendly; the problem that the subtitle display time is not synchronous with the audio and video picture when the subtitles are manufactured manually is solved, and the subtitles do not play a due role under the condition, so that the multi-region perception automatic multi-subtitle realization method, device and system are provided.

In order to solve the above technical problems, the embodiments of the present disclosure at least provide a method, an apparatus, and a system for implementing multi-region aware automatic multi-subtitle.

In a first aspect, an embodiment of the present disclosure provides a method for implementing multi-region aware automatic multi-subtitle, including:

receiving a content playing request from a terminal, wherein the content playing request comprises a content identifier and an IP address of the terminal;

searching the area information of the area where the terminal is located according to the IP address of the terminal;

acquiring an audio file from a pre-generated injection content output directory according to the content identification, creating a regional language subtitle corresponding to the regional information based on the voice in the audio file, and generating a regional language subtitle output directory;

and generating an index file according to the regional language subtitle output directory, and returning the index file to the terminal.

Optionally, the injection content output catalog is generated according to the following manner:

receiving an injected content request from a content management system, the injected content request including path information of a source file;

generating a content identifier of the injected content and creating a corresponding output directory;

acquiring a source file according to the path information of the source file, separating an audio file from the source file, and writing the audio file into the output directory;

packaging and slicing the source file, outputting a video slice to a first directory, and outputting an audio slice to a second directory;

and writing the first directory information and the second directory information into the output directory to obtain the injection content output directory.

Optionally, after separating the audio file from the source file and writing the audio file into the output directory, generating a complete path of the audio file;

the acquiring the audio file from the pre-generated injection content output directory according to the content identifier comprises:

searching for a complete path of the audio file according to the content identifier;

and acquiring the audio file from the injection content output directory according to the complete path of the audio file.

Optionally, the generating an index file according to the regional language subtitle output directory includes:

acquiring a first directory and a second directory from the injection content output directory;

and generating a primary index file according to the regional language subtitle output directory, the first directory and the second directory.

In a second aspect, an embodiment of the disclosure further provides a multi-region-aware automatic multi-subtitle implementing apparatus, including:

the distribution module is used for receiving a content playing request from a terminal, wherein the content playing request comprises a content identifier and an IP address of the terminal;

the area module is used for searching the area information of the area where the terminal is located according to the IP address of the terminal;

the caption module is used for acquiring an audio file from a pre-generated injection content output directory according to the content identification, creating regional language captions corresponding to the regional information based on the voice in the audio file, and generating a regional language caption output directory;

and the distribution module is used for generating an index file according to the regional language subtitle output directory and returning the index file to the terminal.

In a third aspect, an embodiment of the present disclosure further provides a system for implementing multi-region aware automatic multi-subtitle, including:

a content management system, a terminal and the multi-region-aware automatic multi-subtitle implementation apparatus of the second aspect.

In a fourth aspect, an embodiment of the present disclosure further provides a computer device, including: a processor, a memory and a bus, the memory storing machine-readable instructions executable by the processor, the processor and the memory communicating via the bus when the computer device is running, the machine-readable instructions when executed by the processor performing the steps of the first aspect described above, or any possible implementation of the first aspect.

In a fifth aspect, the disclosed embodiments of the present invention further provide a computer-readable storage medium, where a computer program is stored on the computer-readable storage medium, and the computer program is executed by a processor to perform the steps in the first aspect or any possible implementation manner of the first aspect.

The technical scheme provided by the embodiment of the invention has the following beneficial effects:

acquiring an audio file from a pre-generated injection content output directory, creating regional language subtitles corresponding to the regional information based on voice in the audio file, and automatically generating subtitle files of corresponding native languages for users in different regions; the traditional manual subtitle making mode can be avoided, and the subtitle making flexibility is improved; due to the adoption of the technical means of automatically generating the subtitles, the problem of asynchronism with audio possibly occurring in the process of manually making the subtitles is effectively avoided. Furthermore, indexes of video, audio and regional language subtitles of the content are sent to a terminal user by generating a mode of injecting a content output directory and returning an index file, a source file does not need to be changed, and synchronous display of the subtitles and the content at the terminal is also guaranteed.

It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the invention, as claimed.

Drawings

In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, and it is obvious that the drawings in the following description are some embodiments of the present invention, and other drawings can be obtained by those skilled in the art without creative efforts.

FIG. 1 is a flow chart of a multi-region aware automatic multi-subtitle implementation method according to an embodiment of the disclosure;

FIG. 2 is a schematic structural diagram of a multi-region-aware automatic multi-subtitle implementation apparatus according to an embodiment of the disclosure;

FIG. 3 is a schematic structural diagram of a multi-region-aware automatic multi-subtitle implementation system according to an embodiment of the disclosure;

FIG. 4 is a flow chart illustrating another multi-region aware automatic multi-subtitle implementation method provided in the disclosed embodiments of the present invention;

FIG. 5 is a diagram illustrating a directory structure under an output directory provided by the disclosed embodiments;

FIG. 6 is a diagram illustrating a directory structure under a video directory provided by a disclosed embodiment of the invention;

FIG. 7 is a diagram illustrating a directory structure under an audio directory provided by an embodiment of the present disclosure;

FIG. 8 is a table illustrating a tile list of outputs from the output directory after processing is complete, as provided by the disclosed embodiments;

FIG. 9 is a diagram of a primary index file generated by a distribution module provided by a disclosed embodiment of the invention;

fig. 10 shows a schematic structural diagram of a computer device according to an embodiment of the present disclosure.

Detailed Description

Reference will now be made in detail to the exemplary embodiments, examples of which are illustrated in the accompanying drawings. When the following description refers to the accompanying drawings, like numbers in different drawings represent the same or similar elements unless otherwise indicated. The implementations described in the following exemplary examples do not represent all implementations consistent with the present invention. Rather, they are merely examples of apparatus and methods consistent with certain aspects of the invention, as detailed in the appended claims.

Example 1

As shown in fig. 1, the method for implementing multi-region aware automatic multi-subtitle provided by the embodiment of the present disclosure includes:

s11: receiving a content playing request from a terminal, wherein the content playing request comprises a content identifier and an IP address of the terminal;

s12: searching the area information of the area where the terminal is located according to the IP address of the terminal;

s13: acquiring an audio file from a pre-generated injection content output directory according to the content identification, creating a regional language subtitle corresponding to the regional information based on the voice in the audio file, and generating a regional language subtitle output directory;

s14: and generating an index file according to the regional language subtitle output directory, and returning the index file to the terminal.

In particular practice, the injection content output catalog is generated according to the following manner:

In specific practice, after audio files are separated from source files and written into the output directory, an audio file complete path is generated;

In a specific practice, the generating an index file according to the regional language subtitle output directory includes:

It can be understood that, in the technical solution provided in this embodiment, an audio file is obtained from a pre-generated injection content output directory, a subtitle in a region language corresponding to the region information is created based on a voice in the audio file, and a subtitle file in a corresponding native language is automatically generated for users in different regions; the traditional manual subtitle making mode can be avoided, and the subtitle making flexibility is improved; due to the adoption of the technical means of automatically generating the subtitles, the problem of asynchronism with audio possibly occurring in the process of manually making the subtitles is effectively avoided. Furthermore, indexes of video, audio and regional language subtitles of the content are sent to a terminal user by generating a mode of injecting a content output directory and returning an index file, a source file does not need to be changed, and synchronous display of the subtitles and the content at the terminal is also guaranteed.

Example 2

As shown in fig. 2, an embodiment of the present invention further provides a multi-region aware automatic multi-subtitle implementing apparatus, including:

Example 3

As shown in fig. 3, an embodiment of the present invention further provides a multi-region aware automatic multi-subtitle implementation system, including:

a server, a terminal and a content management system;

the server side comprises: the system comprises a distribution module, a region module, a caption module and a slicing module;

the distribution module is used for receiving a content playing request from a terminal and synchronizing the IP address of the terminal in the content playing request of the terminal to the area module, wherein the content playing request comprises a content identifier and the IP address of the terminal;

the region module searches the region information of the region where the terminal is located according to the IP address of the terminal, synchronizes the searched region information to the caption module, and requests the caption module to output the caption of the language of the corresponding region;

the subtitle module acquires an audio file from a pre-generated injection content output directory according to the content identification, creates regional language subtitles corresponding to the regional information based on voice in the audio file, generates a regional language subtitle output directory, and synchronizes the regional language subtitle output directory to the distribution module;

and the distribution module generates an index file according to the regional language subtitle output directory and returns the index file to the terminal.

the slicing module receives an injected content request from a content management system, the injected content request including path information of a source file;

In a specific practice, the terminal is further configured to receive the first-level index file returned by the distribution module, and then sequentially request the segment index file and the segment file of the video, the audio and the subtitle.

In a specific practice, the content management system is further configured to request, by means of HTTP POST, to inject content into the slicing module, where the request includes: local path or remote path information of the source file.

Example 4

As shown in fig. 4, another method for implementing multi-region aware automatic multi-subtitle is further provided in the embodiments of the present invention, including:

s41: the content management system requests injection content from the slicing module through an HTTP POST mode, the injection content request comprises path information of a source file, a local path or a remote path can be designated, such as FTP or HTTP, and if the local path is transmitted, the local path is assumed to be: mp 4;

s42: after receiving the injection content request, the slicing module generates a unique number CID for the injection content, such as: 978e1c5a93c356b50c1e03dd1e3120f2, creating an output directory for the injected content using CID, such as: /output/978e1c5a93c356b50c1e03dd1e3120f 2; separating audio files from the source file, such as: mp4, write-out directory/output/978 e1c5a93c356b50c1e03dd1e3120f2, synchronize source file audio file full path/output/978 e1c5a93c356b50c1e03dd1e3120f2/file _ audio. mp4 to subtitle module; the slicing module packs and slices the source file and outputs the video slice to a first directory, such as: output/978e1c5a93c356b50c1e03dd1e3120f2/video, output audio slices to a second directory, such as: the output/978e1c5a93c356b50c1e03dd1e3120f2/audio, when the output directory has the following directory structure as shown in FIG. 5:

under the video directory are the video slice file and the slice index file, as shown in fig. 6:

under the audio directory are audio slice files and corresponding slice index files, as shown in fig. 7:

s43: the slicing module outputs directory information to the distribution module for synchronizing the content, and after the information synchronization is successful, a pull stream address of the injected content is returned, and if the access domain name corresponding to the distribution module is edge. https:// edge.movie.tv/output/978e1c5a93c356b50c1e03dd1e3120f 2/master.m3u8;

s44: after the content injection is successful, the operator puts the content on shelf;

s45: assuming that at time T0, a user in beijing, china requests to play the content on shelf, assuming that the IP of the user accessing the internet is 221.221.151.40, CID =978e1c5a93c356b50c1e03dd1e3120f2, after the distribution module receives the request for injecting the content, the distribution module synchronizes the IP address of the terminal requesting the injection content to the area module, assuming that the terminal IP: 221.221.151.40, the area module searches the area corresponding to the injected content request according to the terminal IP address, the area module synchronizes the searched area information to the caption module, the caption module checks whether the caption of the corresponding area has been created under the output directory, if the caption file of the area has been created, go to step S47; otherwise, go to step S46:

s46: the caption module acquires an audio file from a pre-generated injection content output directory according to the content identification, creates regional language captions corresponding to the regional information based on the voice in the audio file, and generates a regional language caption output directory, wherein the output directory is as follows: /output/978e1c5a93c356b50c1e03dd1e3120f 2/subtitle/zh/;

the list of fragments output under the output directory after the processing is completed is shown in fig. 8;

wherein init.m4s is a fragment containing metadata information;

s47: the distribution module generates a primary index file according to the regional language subtitle output directory and directly returns the primary index file to the client, and the primary index file generated by the distribution module is shown in fig. 9;

in this embodiment, after receiving the first-level index file returned by the distribution module, the terminal sequentially requests the segment index files and the segment files of the video, the audio, and the subtitle.

In this embodiment, the system implementation method in the live scene is similar to the system implementation method in the on-demand scene, and the difference is that the input form of the audio is: the video-on-demand scene is input in the form of an audio file, the live scene is input in the form of a real-time stream, and other processing flows are consistent with the video-on-demand scene flow.

Example 5

An embodiment of the present invention further provides a computer device, including a memory 1 and a processor 2, as shown in fig. 10, where the memory 1 stores a computer program, and the processor 2 implements any one of the methods when executing the computer program.

The memory 1 includes at least one type of readable storage medium, which includes a flash memory, a hard disk, a multimedia card, a card type memory (e.g., SD or DX memory, etc.), a magnetic memory, a magnetic disk, an optical disk, and the like. The memory 1 may in some embodiments be an internal storage unit, e.g. a hard disk, of a multi-region aware automatic multi-subtitle implementation system. The memory 1 may also be an external storage device of a multi-domain aware automatic multi-subtitle implementation system in other embodiments, such as a plug-in hard disk, a Smart Media Card (SMC), a Secure Digital (SD) Card, a Flash memory Card (Flash Card), and so on. Further, the memory 1 may also comprise both internal storage units of a multi-region aware automatic multi-subtitle implementation and external storage devices. The memory 1 may be used not only to store application software installed in a multi-region-aware automatic multi-subtitle implementation and various types of data, such as codes of a multi-region-aware automatic multi-subtitle implementation program, but also to temporarily store data that has been output or will be output.

The processor 2 may be a Central Processing Unit (CPU), a controller, a microcontroller, a microprocessor or other data Processing chip in some embodiments, and is used to run program codes or process data stored in the memory 1, such as a multi-region-aware automatic multi-subtitle implementation program.

The disclosed embodiments of the present invention also provide a computer-readable storage medium having a computer program stored thereon, where the computer program is executed by a processor to perform the steps of the method described in the above method embodiments. The storage medium may be a volatile or non-volatile computer-readable storage medium.

The computer program product of the multi-region-aware automatic multi-subtitle implementation method provided in the embodiments of the present disclosure includes a computer-readable storage medium storing a program code, where instructions included in the program code may be used to execute steps of the method described in the above method embodiments, which may be referred to in the above method embodiments specifically, and are not described herein again.

The embodiments disclosed herein also provide a computer program, which when executed by a processor implements any one of the methods of the preceding embodiments. The computer program product may be embodied in hardware, software or a combination thereof. In an alternative embodiment, the computer program product is embodied in a computer storage medium, and in another alternative embodiment, the computer program product is embodied in a Software product, such as a Software Development Kit (SDK), or the like.

It is understood that the same or similar parts in the above embodiments may be mutually referred to, and the same or similar parts in other embodiments may be referred to for the content which is not described in detail in some embodiments.

It should be noted that the terms "first," "second," and the like in the description of the present invention are used for descriptive purposes only and are not to be construed as indicating or implying relative importance. Further, in the description of the present invention, the meaning of "a plurality" means at least two unless otherwise specified.

Any process or method descriptions in flow charts or otherwise described herein may be understood as representing modules, segments, or portions of code which include one or more executable instructions for implementing specific logical functions or steps of the process, and alternate implementations are included within the scope of the preferred embodiment of the present invention in which functions may be executed out of order from that shown or discussed, including substantially concurrently or in reverse order, depending on the functionality involved, as would be understood by those reasonably skilled in the art of the present invention.

It should be understood that portions of the present invention may be implemented in hardware, software, firmware, or a combination thereof. In the above embodiments, the various steps or methods may be implemented in software or firmware stored in memory and executed by a suitable instruction execution system. For example, if implemented in hardware, as in another embodiment, any one or combination of the following techniques, which are known in the art, may be used: a discrete logic circuit having a logic gate circuit for implementing a logic function on a data signal, an application specific integrated circuit having an appropriate combinational logic gate circuit, a Programmable Gate Array (PGA), a Field Programmable Gate Array (FPGA), or the like.

It will be understood by those skilled in the art that all or part of the steps carried by the method for implementing the above embodiments may be implemented by hardware related to instructions of a program, which may be stored in a computer readable storage medium, and when the program is executed, the program includes one or a combination of the steps of the method embodiments.

In addition, functional units in the embodiments of the present invention may be integrated into one processing module, or each unit may exist alone physically, or two or more units are integrated into one module. The integrated module can be realized in a hardware mode, and can also be realized in a software functional module mode. The integrated module, if implemented in the form of a software functional module and sold or used as a separate product, may also be stored in a computer-readable storage medium.

The storage medium mentioned above may be a read-only memory, a magnetic or optical disk, etc.

In the description herein, references to the description of the term "one embodiment," "some embodiments," "an example," "a specific example," or "some examples," etc., mean that a particular feature, structure, material, or characteristic described in connection with the embodiment or example is included in at least one embodiment or example of the invention. In this specification, the schematic representations of the terms used above do not necessarily refer to the same embodiment or example. Furthermore, the particular features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples.

Although embodiments of the present invention have been shown and described above, it is understood that the above embodiments are exemplary and should not be construed as limiting the present invention, and that variations, modifications, substitutions and alterations can be made to the above embodiments by those of ordinary skill in the art within the scope of the present invention.

Claims

1. A multi-region perception automatic multi-subtitle realization method is characterized by comprising the following steps:

2. The method of claim 1, wherein the injection content output catalog is generated according to:

3. The method of claim 2, wherein after separating an audio file from a source file and writing to the output directory, generating a complete path of audio files;

4. The method of claim 3, wherein generating an index file according to the regional language subtitle output directory comprises:

5. The utility model provides an automatic many captions realization device of multizone perception which characterized in that includes:

6. The apparatus of claim 5, wherein the injection content output catalog is generated according to:

7. The apparatus of claim 6, wherein the generating an index file according to the regional language subtitle output directory comprises:

8. A multi-region-aware automatic multi-subtitle implementation system, comprising a content management system, a terminal, and the multi-region-aware automatic multi-subtitle implementation apparatus according to any one of claims 5-7.

9. A computer device, comprising: a processor, a memory and a bus, the memory storing machine-readable instructions executable by the processor, the processor and the memory communicating over the bus when a computer device is running, the machine-readable instructions when executed by the processor performing the method of any of claims 1 to 4.

10. A computer-readable storage medium, having stored thereon a computer program which, when executed by a processor, performs the method of any one of claims 1 to 4.