CN107733876A - A kind of stream media caption display methods, mobile terminal and storage device - Google Patents

A kind of stream media caption display methods, mobile terminal and storage device Download PDF

Info

Publication number
CN107733876A
CN107733876A CN201710880018.9A CN201710880018A CN107733876A CN 107733876 A CN107733876 A CN 107733876A CN 201710880018 A CN201710880018 A CN 201710880018A CN 107733876 A CN107733876 A CN 107733876A
Authority
CN
China
Prior art keywords
video
stream media
caption display
display methods
sound
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201710880018.9A
Other languages
Chinese (zh)
Inventor
王凯迪
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huizhou TCL Mobile Communication Co Ltd
Original Assignee
Huizhou TCL Mobile Communication Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huizhou TCL Mobile Communication Co Ltd filed Critical Huizhou TCL Mobile Communication Co Ltd
Priority to CN201710880018.9A priority Critical patent/CN107733876A/en
Publication of CN107733876A publication Critical patent/CN107733876A/en
Pending legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/60Network streaming of media packets
    • H04L65/75Media network packet handling
    • H04L65/762Media network packet handling at the source 
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/60Network streaming of media packets
    • H04L65/65Network streaming protocols, e.g. real-time transport protocol [RTP] or real-time control protocol [RTCP]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/435Processing of additional data, e.g. decrypting of additional data, reconstructing software from modules extracted from the transport stream
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/439Processing of audio elementary streams
    • H04N21/4394Processing of audio elementary streams involving operations for analysing the audio stream, e.g. detecting features or characteristics in audio streams
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/488Data services, e.g. news ticker
    • H04N21/4884Data services, e.g. news ticker for displaying subtitles

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
  • Telephone Function (AREA)

Abstract

The invention discloses a kind of stream media caption display methods, mobile terminal and storage device, wherein methods described includes step:Simultaneously playing stream media video is received first, is secondly obtained synthetic video when streaming media video plays in real time, is again identified that the synthetic video and generate corresponding word, finally by generated text importing on the video image played.Method provided by the present invention, so that by streaming media video when playing, captions need not be added in video for every kind of agreement, cause playout-delay without video flowing is recompiled, it need to only record and identify sound when streaming media video plays, be then converted into text importing on video image.No matter this caption presentation method is all suitable for the stream media technology of which kind of agreement, and cost is relatively low, effectively simply solves streaming media video in the prior art and plays when not having captions, it is inconvenient to watch the problem of.

Description

A kind of stream media caption display methods, mobile terminal and storage device
Technical field
The present invention relates to the processing technology field of video content and additional data, and in particular to a kind of stream media caption is shown Method, mobile terminal and storage device.
Background technology
Streaming Media (Streaming) refers to the media formats played on the internet by the way of stream transmission, due to The memory capacity that video file needs is larger, all downloads and has certain delay;And when using streaming media, server will be more Media file is divided into packet one by one by special compression mode, to user equipment in real time, continuously transmit;User need not wait until Video file is all downloaded and finished, it is possible to starts to watch.
Stream media technology is widely used, in addition to more traditional video website, also emerging network direct broadcasting Platform.Network direct broadcasting and in general video website difference, it is the real-time of content, so typically no captions, so that In inconvenient to watch.And Streaming Media has a variety of different protocol forms, such as traditional Online Video application generally use HTTP, RTSP Or HLS protocol, and real-time live broadcast is then usually using RTMP agreements.And much application is all to use non-public agreement, pin The mode for adding captions in video to every kind of agreement is unrealistic;Recompile video flowing and then have very big expense, simultaneously Also streaming media playing is caused to postpone.How simply and effectively to solve streaming media video does not have inconvenient to watch become during captions to flow matchmaker Body technique progress technical problem urgently to be resolved hurrily.
Therefore, prior art has yet to be improved and developed.
The content of the invention
The technical problem to be solved in the present invention is, for the drawbacks described above of prior art, there is provided a kind of stream media caption Display methods, mobile terminal and storage device, it is intended to solve in the prior art that streaming media video is played when not having captions, viewing is not Just the problem of.
The technical proposal for solving the technical problem of the invention is as follows:
A kind of stream media caption display methods, wherein, the stream media caption display methods comprises the following steps:
Receive simultaneously playing stream media video;
Synthetic video when streaming media video plays is obtained in real time;
Identify the synthetic video and generate corresponding word;
By generated text importing on the video image played.
Further in preferred version, described stream media caption display methods, wherein, it is described to receive simultaneously playing stream media Video is specially:It is installed on the Stream Media Application of mobile terminal playing stream media video while receiving.
Further in preferred version, described stream media caption display methods, wherein, the video of the Stream Media Application Broadcast interface is provided with captions and opens option.
Further in preferred version, described stream media caption display methods, wherein, the Streaming Media that obtains in real time regards Frequently synthetic video during broadcasting is specially:Stream Media Application operates according to user and opens caption display function, voice recording module Start to enroll the sound that mobile terminal is synthesized and exported.
Further in preferred version, described stream media caption display methods, wherein, the identification synthetic video And generate corresponding word and specifically include:
Sound identification module detects that voice recording module starts to enroll sound, and triggering starts speech events, and starts voice Recording module enrolls sound and is converted to word;
Sound identification module persistently carries out speech recognition and text conversion;
Sound identification module detects that voice recording block termination or intermittent sound admission, triggering terminate speech events.
Further in preferred version, described stream media caption display methods, wherein, the identification synthetic video And generate corresponding word, with it is described by generated text importing on the video image played between also include:Word is sent out Send module to obtain sound identification module and change word, and send it to Subtitle Demonstration module.
Further in preferred version, described stream media caption display methods, wherein, it is described by generated text importing It is specially on the video image played:Subtitle Demonstration module shows text box, and root in the mobile terminal system the superiors Captions are updated according to the word come transmitted by word sending module.
Further in preferred version, described stream media caption display methods, wherein, the Stream Media Application is video Play APP or network direct broadcasting platform.
A kind of mobile terminal, wherein, including:Processor, the storage device being connected with processor communication, the storage device Suitable for storing a plurality of instruction;The processor is suitable to call the instruction in the storage device, is realized as described above with performing Stream media caption display methods.
A kind of storage device, wherein, the storage device is stored with computer program, and the computer program can be held Go for realizing stream media caption display methods as described above.
The invention discloses a kind of stream media caption display methods, mobile terminal and storage device, receive and play first Streaming media video, synthetic video when streaming media video plays secondly is obtained in real time, the synthetic video is again identified that and generates Corresponding word, finally by generated text importing on the video image played.So that played by streaming media video When, it is not necessary to captions are added in video for every kind of agreement, are caused playout-delay without video flowing is recompiled, are only needed to record And sound when streaming media video plays is identified, text importing is then converted on video image.This captions show Show no matter method is all suitable for the stream media technology of which kind of agreement, and cost is relatively low, effectively simply solves prior art Middle streaming media video is played when not having captions, it is inconvenient to watch the problem of.
Brief description of the drawings
Fig. 1 is the flow chart of stream media caption display methods preferred embodiment in the present invention.
Fig. 2 is the functional schematic block diagram of mobile terminal in the present invention.
Embodiment
To make the objects, technical solutions and advantages of the present invention clearer, clear and definite, develop simultaneously embodiment pair referring to the drawings The present invention is further described.It should be appreciated that the specific embodiments described herein are merely illustrative of the present invention, and do not have to It is of the invention in limiting.
As shown in figure 1, the stream media caption display methods that present pre-ferred embodiments are provided, comprises the following steps:
Step S100, simultaneously playing stream media video is received.
Streaming Media, streaming video is called, is the media broadcast when passing, is multimedia one kind.Broadcast when passing and refer to media " simultaneously " of the provider in transmission over networks media(Certain delay is inevitable), user on one side constantly receive and watch or Listen to the media being transmitted." stream " of Streaming Media refers to the transmission means (mode of stream) of this media, and does not imply that matchmaker Body is in itself.
The step is specially:It is installed on the Stream Media Application of mobile terminal playing stream media video while receiving.This Locating described Stream Media Application includes traditional video playback APP, and network direct broadcasting platform.By video playback APP is played Video has captions a bit, and some are without captions, therefore the present invention is provided with captions at the video playback interface of the Stream Media Application Option is opened, it is display captions voluntarily to be selected for user, still only plays video.
Mobile terminal in the present invention refers to network-connectable and plays the terminal device of video, including smart mobile phone, flat Plate and notebook computer etc..
Synthetic video when S200, acquisition streaming media video broadcasting in real time.
In the specific implementation, the synthetic video during streaming media video broadcasting of acquisition in real time is specially:Stream Media Application Operated according to user and open caption display function, voice recording module starts to enroll the sound that mobile terminal is synthesized and exported.
The voice recording module can be the system function module of mobile terminal, the functional module in Stream Media Application, Or be mounted to mobile terminal and possess the independent utility of voice recording function, the present invention is not specifically limited to this, can Realize after user operates and opens caption display function, enroll the mobile terminal sound that merges and export in time, preferably its It is have for the system function module of mobile terminal, that is, the voice recording program that system just carries originally, program advantage Effect utilizes mobile terminal own resource, reduces cost and the operation resource of mobile terminal;Or preferably in Stream Media Application Functional module, to realize the seamless connection between caption display function startup and recorded voice, improve captions and video image Broadcasting synchronism.
The sound that mobile terminal is synthesized and exported described in the step specifically refers to system(Such as Android)Synthesize and export Sound.
S300, the identification synthetic video simultaneously generate corresponding word.
When it is implemented, the identification synthetic video and generating corresponding word and specifically including:
S310, sound identification module detect voice recording module start enroll sound, triggering start speech events, and start by Voice recording module enrolls sound and is converted to word;
S320, sound identification module persistently carry out speech recognition and text conversion;
S330, sound identification module detect that voice recording block termination or intermittent sound admission, triggering terminate speech events.
The specific setting of sound identification module similarly, will not be repeated here, it should be noted that language with voice recording module Sound identification module can also add word sending function, and the word that will be identified, which is sent to corresponding module, to be shown.
Sound identification module detects that voice recording module starts to enroll sound in step S310, refers to voice recording module Most starting to enroll sound, or starting the admission of next sound after terminating at upper one.Voice recording module is only used for recording Sound processed, and sound identification module then has two functions, first, identification sound, second, identified sound is converted into word. Speech identifying function is not rare in the prior art, and the phonetic entry belonged in prior art, such as input method is exactly to open Word is converted to after voice recording identification, but does not have be applied to the precedent that streaming media video plays field before this, so It should be noted that this is not the common technology means of those skilled in the art.
Sound identification module persistently carries out speech recognition and text conversion in step S320, refers to sound identification module at certain One identifies and is carried out in real time before converting, and centre is without interval and interrupts.
When end speech events are triggered, also just represent the voice recognition of a certain sentence or whole video and convert , when sound identification module carry out again speech recognition and conversion when, performed object by be same video next voice, Or first voice of another video.
It is understood that in step S200 and S300 other steps, such as the sound to being enrolled can also be set to enter Row processing, the processing include noise filtering or amplification etc..
S400, by generated text importing on the video image played.
When it is implemented, described be specially on the video image played by generated text importing:Subtitle Demonstration Module the mobile terminal system the superiors show text box, and according to transmitted by word sending module come word renewal captions.
The text box may be disposed at alphabetical display layer, and alphabetical display layer is covered on video playback influence, preferably Select to cover comprehensively, and text box is located at the lower end of Subtitle Demonstration layer.It is understood that according to changed word length Difference, the size of text box will change, to be adapted to word length.
It is described to identify the synthetic video and generate corresponding word in the present invention further preferred embodiment, with institute State by generated text importing on the video image played between also include:Word sending module obtains speech recognition mould Block changes word, and sends it to Subtitle Demonstration module.
As shown in Fig. 2 present invention also offers a kind of mobile terminal, it includes:Processor 10, it is connected with processor communication Storage device 20, the storage device 20 is suitable to store a plurality of instruction;The processor 10 is suitable to call the storage device Instruction in 20, stream media caption display methods as described above is realized to perform.
A kind of storage device, wherein, the storage device is stored with computer program, and the computer program can be held Go for realizing stream media caption display methods as described above.
It should be appreciated that the application of the present invention is not limited to above-mentioned citing, for those of ordinary skills, can To be improved or converted according to the above description, all these modifications and variations should all belong to the guarantor of appended claims of the present invention Protect scope.

Claims (10)

1. a kind of stream media caption display methods, it is characterised in that the stream media caption display methods comprises the following steps:
Receive simultaneously playing stream media video;
Synthetic video when streaming media video plays is obtained in real time;
Identify the synthetic video and generate corresponding word;
By generated text importing on the video image played.
2. stream media caption display methods according to claim 1, it is characterised in that the simultaneously playing stream media that receives regards Frequently it is specially:It is installed on the Stream Media Application of mobile terminal playing stream media video while receiving.
3. stream media caption display methods according to claim 2, it is characterised in that the video of the Stream Media Application is broadcast Put interface and be provided with captions unlatching option.
4. stream media caption display methods according to claim 3, it is characterised in that described to obtain streaming media video in real time Synthetic video during broadcasting is specially:Stream Media Application is operated according to user and opens caption display function, and voice recording module is opened Begin the sound that admission mobile terminal is synthesized and exported.
5. stream media caption display methods according to claim 4, it is characterised in that the identification synthetic video is simultaneously Corresponding word is generated to specifically include:
Sound identification module detects that voice recording module starts to enroll sound, and triggering starts speech events, and starts voice Recording module enrolls sound and is converted to word;
Sound identification module persistently carries out speech recognition and text conversion;
Sound identification module detects that voice recording block termination or intermittent sound admission, triggering terminate speech events.
6. stream media caption display methods according to claim 5, it is characterised in that the identification synthetic video is simultaneously Generate corresponding word, with it is described by generated text importing on the video image played between also include:Word is sent Module obtains sound identification module and changes word, and sends it to Subtitle Demonstration module.
7. stream media caption display methods according to claim 6, it is characterised in that it is described by generated text importing in It is specially on the video image played:Subtitle Demonstration module the mobile terminal system the superiors show text box, and according to The word renewal captions come transmitted by word sending module.
8. stream media caption display methods according to claim 2, it is characterised in that the Stream Media Application is broadcast for video Put APP or network direct broadcasting platform.
A kind of 9. mobile terminal, it is characterised in that including:Processor, the storage device being connected with processor communication, the storage Device is suitable to store a plurality of instruction;The processor is suitable to call the instruction in the storage device, and above-mentioned power is realized to perform Profit requires the stream media caption display methods described in any one of 1-8.
10. a kind of storage device, it is characterised in that the storage device is stored with computer program, the computer program energy Enough it is performed to realize the stream media caption display methods as described in claim any one of 1-8.
CN201710880018.9A 2017-09-26 2017-09-26 A kind of stream media caption display methods, mobile terminal and storage device Pending CN107733876A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710880018.9A CN107733876A (en) 2017-09-26 2017-09-26 A kind of stream media caption display methods, mobile terminal and storage device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710880018.9A CN107733876A (en) 2017-09-26 2017-09-26 A kind of stream media caption display methods, mobile terminal and storage device

Publications (1)

Publication Number Publication Date
CN107733876A true CN107733876A (en) 2018-02-23

Family

ID=61207364

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710880018.9A Pending CN107733876A (en) 2017-09-26 2017-09-26 A kind of stream media caption display methods, mobile terminal and storage device

Country Status (1)

Country Link
CN (1) CN107733876A (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108427546A (en) * 2018-05-03 2018-08-21 深圳Tcl新技术有限公司 Full screen adaptation method, display device and the storage medium of display device
CN109495792A (en) * 2018-11-30 2019-03-19 北京字节跳动网络技术有限公司 A kind of subtitle adding method, device, electronic equipment and the readable medium of video
CN110475146A (en) * 2019-09-05 2019-11-19 珠海市杰理科技股份有限公司 Subtitle antidote, device and intelligent sound box
CN111107284A (en) * 2019-12-31 2020-05-05 洛阳乐往网络科技有限公司 Real-time generation system and generation method for video subtitles
CN111556372A (en) * 2020-04-20 2020-08-18 北京甲骨今声科技有限公司 Method and device for adding subtitles to video and audio programs in real time

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103327397A (en) * 2012-03-22 2013-09-25 联想(北京)有限公司 Subtitle synchronous display method and system of media file
CN106851401A (en) * 2017-03-20 2017-06-13 惠州Tcl移动通信有限公司 A kind of method and system of automatic addition captions

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103327397A (en) * 2012-03-22 2013-09-25 联想(北京)有限公司 Subtitle synchronous display method and system of media file
CN106851401A (en) * 2017-03-20 2017-06-13 惠州Tcl移动通信有限公司 A kind of method and system of automatic addition captions

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108427546A (en) * 2018-05-03 2018-08-21 深圳Tcl新技术有限公司 Full screen adaptation method, display device and the storage medium of display device
CN109495792A (en) * 2018-11-30 2019-03-19 北京字节跳动网络技术有限公司 A kind of subtitle adding method, device, electronic equipment and the readable medium of video
CN110475146A (en) * 2019-09-05 2019-11-19 珠海市杰理科技股份有限公司 Subtitle antidote, device and intelligent sound box
CN110475146B (en) * 2019-09-05 2022-01-14 珠海市杰理科技股份有限公司 Subtitle correction method and device and intelligent sound box
CN111107284A (en) * 2019-12-31 2020-05-05 洛阳乐往网络科技有限公司 Real-time generation system and generation method for video subtitles
CN111556372A (en) * 2020-04-20 2020-08-18 北京甲骨今声科技有限公司 Method and device for adding subtitles to video and audio programs in real time

Similar Documents

Publication Publication Date Title
CN107733876A (en) A kind of stream media caption display methods, mobile terminal and storage device
EP2901372B1 (en) Using digital fingerprints to associate data with a work
CN103974143B (en) A kind of method and apparatus for generating media data
CN104575550B (en) Multimedia file title skipping method and electronic device
CN102196313A (en) Method and device for continuous playing of cross-platform breakpoint as well as method and device for continuous playing of breakpoint
CN106331733A (en) Desktop cloud terminal's audio and video data real-time processing method and system
TW201517572A (en) A method, device, and system thereof for data processing
US10360913B2 (en) Speech recognition method, device and system based on artificial intelligence
CN110032355B (en) Voice playing method and device, terminal equipment and computer storage medium
CN106161627B (en) Method and device for pushed information
JP2020174339A (en) Method, device, server, computer-readable storage media, and computer program for aligning paragraph and image
CN109862100B (en) Method and device for pushing information
CN111107442A (en) Method and device for acquiring audio and video files, server and storage medium
WO2021227308A1 (en) Video resource generation method and apparatus
CN108260005A (en) A kind of video broadcasting method and device
RU2696767C1 (en) Method and system for broadcasting multimedia information in real time, information collection device and information verification server
CN111541906B (en) Data transmission method, data transmission device, computer equipment and storage medium
CN102904891A (en) Multimedia data sharing method and device and multimedia playing equipment
CN102883188A (en) Method and system of downloading and playing MP4 files in real time
CN112954602A (en) Voice control method, transmission method, device, electronic equipment and storage medium
CN108024140A (en) A kind of live broadcasting method and system
CN105791964A (en) Cross-platform media file playing method and system
CN104754400B (en) A kind of big envelope information sharing method and device based on mobile terminal
US9084011B2 (en) Method for advertising based on audio/video content and method for creating an audio/video playback application
CN112562688A (en) Voice transcription method, device, recording pen and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20180223

RJ01 Rejection of invention patent application after publication