CN105763949A

CN105763949A - Audio video file playing method and device

Info

Publication number: CN105763949A
Application number: CN201410796349.0A
Authority: CN
Inventors: 曹虹; 杜鲲
Original assignee: LeTV Mobile Intelligent Information Technology Beijing Co Ltd
Current assignee: Lemobile Information Technology (Beijing) Co Ltd; LeTV Mobile Intelligent Information Technology Beijing Co Ltd
Priority date: 2014-12-18
Filing date: 2014-12-18
Publication date: 2016-07-13

Abstract

The invention provides an audio video file playing method and device. The method includes receiving a fast forward or a fast reverse command for an audio video file being played currently, wherein the fast forward or the fast reverse command comprises information of a command time point; searching and obtaining the initial time point of a last subtitle piece or a next subtitle piece corresponding to the command time point of the fast forward or the fast reverse command; playing the audio video file from the subtitle and the audio video frequency frame corresponding to the initial time point. By adopting the technical scheme provided by the invention, fast forward or fast reverse operation can take one subtitle piece as an adjusting interval, so that practical requirements of users are met and especially special requirements of users having special actual need, such as for learning foreign languages through video and audio, are met.

Description

A kind of audio/video file playing method and device

Technical field

The present invention relates to audio and video playing technical field, particularly relate to a kind of audio/video file playing method and device.

Background technology

The audio/video file play carries out F.F. or fast reverse operation, and current technology scheme is to use progress bar to pull, or utilizes shortcut to carry out F.F. or rewind at set time intervals.

But when watching some program, such as TV play, it is likely to need to return a word repeating to watch certain role just now soon, or want to skip a word that role is current, and use progress bar to pull or click shortcut and can not carry out F.F. or rewind according to interval in short accurately, it is impossible to meet user's request.

Summary of the invention

The present invention provides a kind of audio/video file playing method and device, with the problem solving existing F.F. or rewind mode cannot meet user's request.

In order to solve the problems referred to above, the invention discloses a kind of audio/video file player method, including:

Receive the F.F. to currently playing audio/video file or rewind instruction；Wherein, F.F. or rewind instruction include the information of instruction time point；

Search obtains the initial time of previous sentence captions or the rear captions put corresponding to the instruction time of F.F. or rewind instruction；

The captions corresponding from initial time and audio/video frames, play audio/video file.

Preferably, search obtains the initial time of previous sentence captions or the rear captions put corresponding to the instruction time of F.F. or rewind instruction, including:

In the plug-in caption information or embedded caption information of audio/video file, coupling obtains the previous sentence captions of instruction time point or the initial time of rear captions；Or,

According to the recognition result of captions in the frame of video to audio/video file, obtain the previous sentence captions of instruction time point or the initial time of rear captions.

Preferably, the subtitle file that plug-in caption information is text file format of audio/video file；The embedded caption information of audio/video file is the caption data being present in audio/video file with independent orbital fashion；

Plug-in caption information all includes timeline information and the captions corresponding with timeline information with embedded caption information；Timeline information includes initial time and the termination time of each sentence captions.

Preferably, in the plug-in caption information or embedded caption information of audio/video file, coupling obtains the previous sentence captions of instruction time point or the initial time of rear captions, including:

In the subtitle file of the text file format of the plug-in caption information of audio/video file or in the independent track at embedded caption information place, it is judged that whether instruction time point is positioned at initial time and the time period of the time of termination of the timeline information of a certain sentence captions；

If so, then these captions is defined as current subtitle, obtains the previous sentence captions of current subtitle or the initial time of rear captions；Wherein, the previous sentence captions of current subtitle are the termination time and terminate the nearest captions of initial time of time gap current subtitle before the initial time of current subtitle；Rear captions of current subtitle are that initial time is after the termination time of current subtitle and the termination time nearest captions of initial time distance current subtitle；

If not, then corresponding to F.F. or rewind instruction, obtain the initial time of the captions that initial time is after instruction time point and initial time distance instruction time point is nearest, or obtain the initial time of the captions that the termination time is before the instruction time puts and termination time gap instruction time point is nearest.

Preferably, according to the recognition result of captions in the frame of video to audio/video file, obtain the previous sentence captions of instruction time point or the initial time of rear captions, including:

Detection obtains the previous frame of video that there are captions or rear of frame of video corresponding to instruction time point and there is the frame of video of captions, obtains the previous frame of video that there are captions or rear and there is the initial time of captions in the frame of video of captions；

Wherein, the previous frame of video that there are captions or rear exists and there is at least one frame frame of video without captions between the frame of video that the frame of video of captions is corresponding with instruction time point.

Preferably, detect the previous frame of video that there are captions or rear obtaining frame of video corresponding to instruction time point and there is the frame of video of captions, including:

The frame of video that instruction time point is corresponding is carried out picture recognition, judges whether frame of video exists word according to the result of picture recognition；

If existing, then according to F.F. or rewind instruction, continuing the frame of video after or before the instruction time is put and carrying out picture recognition, until identifying the frame of video being absent from word；

Continuing according to F.F. or rewind instruction, the frame of video being absent from after or before the frame of video of word being carried out picture recognition, until again identifying that out the frame of video that there is word；

The frame of video of captions is there is in the frame of video of the existence word that would again identify that out as the previous frame of video that there are captions or rear of frame of video corresponding to instruction time point.

Preferably, obtain the previous frame of video that there are captions or rear and there is the initial time of captions in the frame of video of captions, including:

According to F.F. instruction, would again identify that out time point corresponding to the frame of video that the there is word initial time as rear one frame of video that there are captions；

Or,

According to rewind instruction, continue the frame of video before the time point that the frame of video of the existence word identified is corresponding is carried out picture recognition, until again identifying that out the frame of video being absent from word, using the time point corresponding for the nearest frame frame of video after the time point corresponding in the frame of video being absent from word again identified that out the initial time as the previous frame of video that there are captions.

Preferably, method also includes:

If instruction time point is less than or equal to the termination time of first captions in plug-in caption information or embedded caption information, or all without captions in all videos frame before instruction time point, then generates and show the information forbidding rewind；

If instruction time point is be more than or equal to the initial time of last captions in plug-in caption information or embedded caption information, or all without captions in all videos frame after instruction time point, then generates and show the information forbidding F.F..

Preferably, the captions corresponding from initial time and audio frequency and video, before playing audio/video file, method also includes:

Judge whether audio/video frames corresponding to initial time is key frame；

If so, then perform, from captions corresponding to initial time and audio/video frames, to play the step of audio/video file；

If not, then searched for before initial time, apart from the key frame that the audio/video frames that initial time is corresponding is nearest, and start to decode the audio/video frames of audio/video file from nearest key frame, until when decoding is to the audio/video frames that initial time is corresponding, perform, from captions corresponding to initial time and audio/video frames, to play the step of audio/video file.

Preferably, receive the F.F. to currently playing audio/video file or rewind instruction, including:

Receive the F.F. or rewind instruction that are generated by clicking operation, gesture operation or voice operating.

Correspondingly, the invention also discloses a kind of audio/video file playing device, including:

Command reception module, for receiving the F.F. to currently playing audio/video file or rewind instruction；Wherein, F.F. or rewind instruction include the information of instruction time point；

Initial time search module, for searching for the initial time of previous sentence captions or the rear captions obtaining the instruction time point corresponding to F.F. or rewind instruction；

Playing module, for the captions corresponding from initial time and audio/video frames, plays audio/video file.

Preferably, initial time search module mates the initial time of previous sentence captions or the rear captions obtaining instruction time point in the plug-in caption information or embedded caption information of audio/video file；Or,

Initial time search module, according to the recognition result of captions in the frame of video to audio/video file, obtains the previous sentence captions of instruction time point or the initial time of rear captions.

Preferably, initial time search module, including:

Time judges submodule, for in the subtitle file of the text file format of the plug-in caption information of audio/video file or in the independent track at embedded caption information place, it is judged that whether instruction time point is positioned at initial time and the time period of the time of termination of the timeline information of a certain sentence captions；

The very first time obtains submodule, if be positioned at initial time and the time period of the time of termination of the timeline information of a certain sentence captions for instruction time point, then these captions is defined as current subtitle, obtains the previous sentence captions of current subtitle or the initial time of rear captions；Wherein, the previous sentence captions of current subtitle are the termination time and terminate the nearest captions of initial time of time gap current subtitle before the initial time of current subtitle；Rear captions of current subtitle are that initial time is after the termination time of current subtitle and the termination time nearest captions of initial time distance current subtitle；

Second time obtained submodule, if being not at the initial time of the timeline information of a certain sentence captions for instruction time point and in the time period of the time of termination, then corresponding to F.F. or rewind instruction, obtain the initial time of the captions that initial time is after instruction time point and initial time distance instruction time point is nearest, or obtain the initial time of the captions that the termination time is before the instruction time puts and termination time gap instruction time point is nearest.

Preferably, initial time search module, including:

3rd time obtained submodule, there is the frame of video of captions for detecting the previous frame of video that there are captions or rear obtaining frame of video corresponding to instruction time point, obtain the previous frame of video that there are captions or rear and there is the initial time of captions in the frame of video of captions；

Preferably, the 3rd time obtained submodule, including:

Identify judgment sub-unit, for the frame of video that instruction time point is corresponding is carried out picture recognition, judge whether frame of video exists word according to the result of picture recognition；

Continue to identify subelement, if there is word in frame of video, then according to F.F. or rewind instruction, continuing the frame of video after or before the instruction time is put and carrying out picture recognition, until identifying the frame of video being absent from word；Continuing according to F.F. or rewind instruction, the frame of video being absent from after or before the frame of video of word being carried out picture recognition, until again identifying that out the frame of video that there is word；

Frame of video determines subelement, there is the frame of video of captions as the previous frame of video that there are captions or rear of frame of video corresponding to instruction time point for the frame of video of existence word that would again identify that out.

Preferably, the 3rd time obtained submodule, also included:

Initial time determines subelement, for according to F.F. instruction, would again identify that out time point corresponding to the frame of video that the there is word initial time as rear one frame of video that there are captions；Or, according to rewind instruction, continue the frame of video before the time point that the frame of video of the existence word identified is corresponding is carried out picture recognition, until again identifying that out the frame of video being absent from word, using the time point corresponding for the nearest frame frame of video after the time point corresponding in the frame of video being absent from word again identified that out the initial time as the previous frame of video that there are captions.

Preferably, device also includes:

Forbid rewind module, if for instruction time point less than or equal to the termination time of first captions in plug-in caption information or embedded caption information, or all without captions in all videos frame before instruction time point, then generate and show the information forbidding rewind；

Forbid F.F. module, if for instruction time point be more than or equal to the initial time of last captions in plug-in caption information or embedded caption information, or all without captions in all videos frame after instruction time point, then generate and show the information forbidding F.F..

Preferably, device also includes:

Key frame judge module, for playing module captions corresponding from initial time and audio frequency and video, before playing audio/video file, it is judged that whether audio/video frames corresponding to initial time is key frame；If so, then playing module performs, from captions corresponding to initial time and audio/video frames, to play the step of audio/video file；

Key frame search and decoder module, if not being key frame for the audio/video frames that initial time is corresponding, then searched for before initial time, apart from the key frame that the audio/video frames that initial time is corresponding is nearest, and start to decode the audio/video frames of audio/video file from nearest key frame, until when decoding is to the audio/video frames that initial time is corresponding, playing module performs, from captions corresponding to initial time and audio/video frames, to play the step of audio/video file.

Preferably, command reception module receives the F.F. or rewind instruction that are generated by clicking operation, gesture operation or voice operating.

Compared with background technology, the present invention includes advantages below:

Receive the information that the F.F. of currently playing audio/video file or rewind instruction, F.F. or rewind instruction are included instruction time point, time when this instruction time point occurs for F.F. or rewind instruction.Search obtains the initial time of previous sentence captions or the rear captions put corresponding to the instruction time of F.F. or rewind instruction, the captions corresponding from initial time and audio/video frames, plays audio/video file.F.F. or fast reverse operation are to put as benchmark with instruction time of F.F. or rewind instruction, by currently playing audio/video file F.F. or fall back on soon the instruction time point previous sentence captions or rear captions, audio/video file is commenced play out from previous sentence captions and audio/video frames or rear captions and audio/video frames, achieve F.F. or fast reverse operation can with the unit of captions for adjusting interval, meet the actual demand of user, especially some user with specific demand is met, such as the actual demand of the user by audio-visual foreign language studying.

Accompanying drawing explanation

Fig. 1 is the flow chart of a kind of audio/video file player method in the embodiment of the present invention one；

Fig. 2 is the flow chart of a kind of audio/video file player method in the embodiment of the present invention two；

Fig. 3 is the flow chart of a kind of audio/video file player method in the embodiment of the present invention three；

Fig. 4 is the structure chart of a kind of audio/video file playing device in the embodiment of the present invention four；

Fig. 5 is the structure chart of a kind of audio/video file playing device in the embodiment of the present invention five.

Detailed description of the invention

Understandable for enabling the above-mentioned purpose of the present invention, feature and advantage to become apparent from, below in conjunction with the drawings and specific embodiments, the present invention is further detailed explanation.

Provided by the invention a kind of audio/video file playing method and device is discussed in detail below by enumerating several specific embodiment.

Embodiment one

A kind of audio/video file player method that the embodiment of the present invention provide is discussed in detail.

With reference to Fig. 1, it is shown that the flow chart of a kind of audio/video file player method in the embodiment of the present invention.

Step 100, receives the F.F. to currently playing audio/video file or rewind instruction.

Wherein, F.F. or rewind instruction include the information of instruction time point.

F.F. instruction includes the information of F.F. instruction time point, and rewind instruction includes the information of rewind instruction time point.The information of instruction time point can be temporal information when F.F. instruction or rewind instruction generation, and this time information is the temporal information that audio/video file is play.Such as, the playing duration of certain audio/video file is 90 minutes, when this audio/video file is played to 10 minutes, receives the F.F. instruction to this audio/video file, then the information of the point of the instruction time in F.F. instruction can be played to 10 minutes for this audio/video file.

Step 102, search obtains the initial time of previous sentence captions or the rear captions put corresponding to the instruction time of F.F. or rewind instruction.

The purpose of F.F. instruction is that currently playing audio/video file is fast-forward to rear captions in currently playing moment, commences play out current audio/video file from rear captions.The purpose of rewind instruction is that currently playing audio/video file falls back on the previous sentence captions in currently playing moment soon, commences play out current audio/video file from previous sentence captions.

One captions is in the playing process of audio/video file, and the time of display is a time period, it is possible to the time period being made up of initial time and termination time.

In the playing process of audio/video file, for instance in the playing process of a film, corresponding to the instruction time point of F.F. or rewind instruction, it is possible to have Subtitle Demonstration, it is also possible to there is no Subtitle Demonstration.Regardless of whether there are captions, all may determine that the previous sentence captions of the instruction time of F.F. or rewind instruction point or rear captions.Such as, certain audio/video file has 3 captions, respectively is captions 1, captions 2 and captions 3 according to time-sequencing, and there is certain time interval between two between captions, namely there is certain time interval between captions 1 and captions 2, between captions 2 and captions 3, there is also certain time interval.If the instruction time point of F.F. or rewind instruction is in the middle of the display time period of captions 2, then previous sentence captions are captions 1, and rear captions are captions 3；If the instruction time point of F.F. or rewind instruction is in the middle of the interval between captions 1 and captions 2, then previous sentence captions are captions 1, and rear captions are captions 2.

Step 104, the captions corresponding from initial time and audio/video frames, plays audio/video file.

If what receive is F.F. instruction, then captions and audio/video frames that the initial time of rear captions put in the instruction time of F.F. instruction is corresponding start, and play audio/video file.

If what receive is rewind instruction, then captions and audio/video frames that the initial time of previous sentence captions put in the instruction time of rewind instruction is corresponding start, and play audio/video file.

In sum, the technical scheme in the embodiment of the present invention, receive the information that the F.F. of currently playing audio/video file or rewind instruction, F.F. or rewind instruction are included instruction time point, time when this instruction time point occurs for F.F. or rewind instruction.Search obtains the initial time of previous sentence captions or the rear captions put corresponding to the instruction time of F.F. or rewind instruction, the captions corresponding from initial time and audio/video frames, plays audio/video file.F.F. or fast reverse operation are to put as benchmark with instruction time of F.F. or rewind instruction, by currently playing audio/video file F.F. or fall back on soon the instruction time point previous sentence captions or rear captions, audio/video file is commenced play out from previous sentence captions and audio/video frames or rear captions and audio/video frames, achieve F.F. or fast reverse operation can with the unit of captions for adjusting interval, meet the actual demand of user, especially some user with specific demand is met, such as the actual demand of the user by audio-visual foreign language studying.

Embodiment two

With reference to Fig. 2, it is shown that the flow chart of a kind of audio/video file player method in the embodiment of the present invention.

Step 200, receives the F.F. to currently playing audio/video file or rewind instruction.

Wherein, F.F. or rewind instruction can include the information of instruction time point.

Preferably, step 200 can be: receives the F.F. or rewind instruction that are generated by clicking operation, gesture operation or voice operating.

Generally, if the audio/video file that player is being play assigns F.F. or rewind instruction, the button of click F.F. or rewind can be utilized, or the shortcut of F.F. or rewind on some beating keyboard, again or can pass through to touch the gesture operation of screen, or F.F. or rewind instruction can also be assigned by voice operating.

Step 202, search obtains the initial time of previous sentence captions or the rear captions put corresponding to the instruction time of F.F. or rewind instruction.

According to the difference in the source of captions in audio/video file, step 202 can perform according to following three kinds of modes:

(1) in the plug-in caption information of audio/video file, coupling obtains the previous sentence captions of instruction time point or the initial time of rear captions.

Wherein, the subtitle file that plug-in caption information is text file format of audio/video file.As extended the subtitle file of ass, srt, smi, ssa or sub by name.

Preferably, in the plug-in caption information of audio/video file, coupling obtains the previous sentence captions of instruction time point or the initial time of rear captions, it is possible to including:

Step 11, in the subtitle file of the text file format of the plug-in caption information of audio/video file, it is judged that whether instruction time point is positioned at initial time and the time period of the time of termination of the timeline information of a certain sentence captions；If so, step 12 is then performed；If it is not, then perform step 13.

These captions is defined as current subtitle by step 12, obtains the previous sentence captions of current subtitle or the initial time of rear captions.

Wherein, the previous sentence captions of current subtitle are the termination time and terminate the nearest captions of initial time of time gap current subtitle before the initial time of current subtitle.

Rear captions of current subtitle are that initial time is after the termination time of current subtitle and the termination time nearest captions of initial time distance current subtitle.

Such as, the instruction time point of F.F. instruction is positioned at initial time and the time period of the time of termination of the 10th captions of audio/video file, then the 10th captions are current subtitle, and the 11st captions are rear captions, and the 9th captions are previous sentence captions.

Step 13, corresponding to F.F. or rewind instruction, obtain the initial time of the captions that initial time is after instruction time point and initial time distance instruction time point is nearest, or obtain the initial time of the captions that the termination time is before the instruction time puts and termination time gap instruction time point is nearest.

Such as, the instruction time point of F.F. instruction is within the time period terminated between time and the initial time of the 11st captions of the 10th captions of audio/video file, then the 11st captions are rear captions, and the 10th captions are previous sentence captions.

Or,

(2) in the embedded caption information of audio/video file, coupling obtains the previous sentence captions of instruction time point or the initial time of rear captions.

Wherein, the embedded caption information of audio/video file is the caption data being present in audio/video file with independent orbital fashion.As audio/video file can comprise multiple independent track, wherein certain independent track identities the caption data of this audio/video file.

Plug-in caption information may each comprise timeline information and the captions corresponding with timeline information with embedded caption information.Timeline information can include initial time and the termination time of each sentence captions.

Preferably, in the embedded caption information of audio/video file, coupling obtains the previous sentence captions of instruction time point or the initial time of rear captions, it is possible to including:

Step 21, in the independent track at the embedded caption information place of audio/video file, it is judged that whether instruction time point is positioned at initial time and the time period of the time of termination of the timeline information of a certain sentence captions；If so, step 22 is then performed；If it is not, then perform step 23.

These captions is defined as current subtitle by step 22, obtains the previous sentence captions of current subtitle or the initial time of rear captions.

Step 23, corresponding to F.F. or rewind instruction, obtain the initial time of the captions that initial time is after instruction time point and initial time distance instruction time point is nearest, or obtain the initial time of the captions that the termination time is before the instruction time puts and termination time gap instruction time point is nearest.

It should be noted that above-mentioned steps 12 can be identical with step 22, above-mentioned steps 13 can be identical with step 23.

Or,

(3) basis is to the recognition result of captions in the frame of video of audio/video file, obtains the previous sentence captions of instruction time point or the initial time of rear captions.

Captions in such cases can be understood as " printing " in the frame of video of audio/video file, is an entirety with frame of video, it is possible to obtained by character recognition technology identification.

Preferably, according to the recognition result of captions in the frame of video to audio/video file, obtain the previous sentence captions of instruction time point or the initial time of rear captions, it is possible to including:

Step 31, detects the previous frame of video that there are captions or rear obtaining frame of video corresponding to instruction time point and there is the frame of video of captions.

Such as, the frame of video that instruction time point is corresponding is the 100th frame, and the previous frame of video that there are captions should be the frame of video (having at least a frame in the 98th frame or the 99th frame is the frame of video without captions) before the 98th frame (the 99th frame is the frame of video without captions) or the 98th frame.

Preferably, step 31 may include that

Step 311, carries out picture recognition to the frame of video that instruction time point is corresponding, judges whether there is word in frame of video according to the result of picture recognition；If existing, then perform step 312；If being absent from, then perform step 313.

Wherein, the technological means that frame of video carries out picture recognition can adopt the technological means of existing picture recognition or Text region, and the concrete execution process of picture recognition or Text region is not limited as by the embodiment of the present invention.

Step 312, according to F.F. or rewind instruction, continues the frame of video after or before the instruction time is put and carries out picture recognition, until identifying the frame of video being absent from word；Continuing according to F.F. or rewind instruction, the frame of video being absent from after or before the frame of video of word being carried out picture recognition, until again identifying that out the frame of video that there is word；The frame of video of captions is there is in the frame of video of the existence word that would again identify that out as the previous frame of video that there are captions or rear of frame of video corresponding to instruction time point.

Such as, for F.F. instruction, if step 311 is judged there is word in the frame of video (the 100th frame) that instruction time point is corresponding, then continue frame of video (the 101st frame) afterwards is carried out picture recognition, if the 101st frame there is also word, then continue the 102nd frame is carried out picture recognition, if the 102nd frame is without word, then the frame of video (the 103rd frame) after the 102nd frame is carried out picture recognition, if the 103rd frame has word, then the 103rd frame is the frame of video that rear the one of the frame of video (the 100th frame) that instruction time point is corresponding exists captions.

Step 313, according to F.F. or rewind instruction, carries out picture recognition to the frame of video being absent from after or before the frame of video of word, until again identifying that out the frame of video that there is word；The frame of video of captions is there is in the frame of video of the existence word that would again identify that out as the previous frame of video that there are captions or rear of frame of video corresponding to instruction time point.

Above-mentioned steps 313 can be understood as the part in above-mentioned steps 312 and performs process, it is possible to referring to above-mentioned about the introduction in step 312, do not repeat them here.

Step 32, obtains the previous frame of video that there are captions or rear and there is the initial time of captions in the frame of video of captions.

Preferably, according to the difference of F.F. instruction and rewind instruction, step 32 can be divided into following two to perform process:

1) according to F.F. instruction, would again identify that out time point corresponding to the frame of video that the there is word initial time as rear one frame of video that there are captions.

Or,

2) according to rewind instruction, continue the frame of video before the time point that the frame of video of the existence word identified is corresponding is carried out picture recognition, until again identifying that out the frame of video being absent from word, using the time point corresponding for the nearest frame frame of video after the time point corresponding in the frame of video being absent from word again identified that out the initial time as the previous frame of video that there are captions.

Such as, if the frame of video of instruction time point correspondence is the 51st frame, in this frame of picture recognition, it is absent from word；The 50th frame before 51st frame is carried out picture recognition, identifies in this frame and there is word；49th frame time point before corresponding to the 50th frame of the existence word identified carries out picture recognition, if there is word in the 49th frame, then the 48th frame is carried out picture recognition, if the 48th frame is absent from word, then the nearest frame frame of video time point that is the 49th frame is corresponding after time point corresponding for the 48th frame was put as the instruction time initial time of the previous frame of video that there are captions of the 51st corresponding frame.

Step 204, it is judged that whether audio/video frames corresponding to initial time is key frame；If so, step 206 is then performed；If it is not, then perform step 208.

Wherein, arranging of audio/video frames can adopt conventional setting.For frame of video, key frame can be that frame residing for key operations in role or object of which movement or change, can also be that those skilled in the art set key frame every setting interval according to demand, as set key frames etc., the invention is not limited in this regard at interval of 10 frames.

Step 206, the captions corresponding from initial time and audio/video frames, plays audio/video file.

Step 208, search is before initial time, apart from the key frame that the audio/video frames that initial time is corresponding is nearest, and start to decode the audio/video frames of audio/video file from nearest key frame, until when decoding is to the audio/video frames that initial time is corresponding, the captions corresponding from initial time and audio/video frames, play audio/video file.

Such as, if the audio/video frames that initial time is corresponding (the 50th frame) is not key frame, the nearest key frame of distance the 50th frame is the 40th frame, then start to decode the audio/video frames of audio/video file from the 40th frame, but do not show audio/video file, until decoding is to the 50th frame, commence play out audio/video file from the 50th frame.

In audio/video file, captions are generally together play with audio frequency and video, and the audio frequency and video in audio/video file are generally encoded digital signals format, it is necessary to the audio frequency and video in the audio/video file after coding are decoded, and play after decoding together with captions.

It should be noted that the technical scheme in the embodiment of the present invention, when receiving F.F. or rewind instruction, it is also possible to generate and display reminding information according to practical situation.

If instruction time point is less than or equal to the termination time of first captions in plug-in caption information or embedded caption information, or all without captions in all videos frame before instruction time point, then generates and show the information forbidding rewind.

Such as, the instruction time, point was for 00:05:00,100, plug-in captions or in embedded captions the termination time of first captions be 00:08:00,100, then generate and show the information forbidding rewind.

Such as, the instruction time, point was for 01:29:00,100, plug-in captions or in embedded captions the initial time of last captions be 01:27:00,100, then generate and show the information forbidding F.F..

Embodiment three

With reference to Fig. 3, it is shown that the flow chart of a kind of audio/video file player method in the embodiment of the present invention.

Step 300, receives skip forward/back instruction.

User's operation interface request one/rewind of F.F. one by player, as by button click, or the operation that uses gesture generates F.F. or rewind instruction, and audio-visual broadcasting instrument such as audio and video player receives this F.F. or rewind instruction.

Step 302, reads caption information.

Player, when playing, can read caption information, can record the caption information of current time when the current point in time asking F.F. or rewind exists captions.

Step 304, finds nearest captions.

After player receives the instruction of skip forward/back, with one captions of rewind for example, according to currently playing time point, in captions, look for forward the initial time of nearest captions.

If not had captions before current point in time, then what prompting user was currently playing is first captions, it is impossible to rewind.

If not had captions after current point in time, then what prompting user was currently playing is last captions, it is impossible to F.F..

Step 306, completes skip forward/back.

If the audio/video frames of current point in time is not key frame, then search again for forward first key frame of the audio/video frames of distance current point in time, the namely nearest key frame of the audio/video frames of distance current point in time, the key frame that this is nearest is decoded but does not show, until decoding the audio/video frames of current point in time, starting display and playing.

Embodiment four

A kind of audio/video file playing device that the embodiment of the present invention provide is discussed in detail.

With reference to Fig. 4, it is shown that the structure chart of a kind of audio/video file playing device in the embodiment of the present invention.

Device in the embodiment of the present invention may include that command reception module 400, initial time search module 402, playing module 404.

Relation function and each module of each module between is discussed in detail separately below.

Command reception module 400, for receiving the F.F. to currently playing audio/video file or rewind instruction.Wherein, F.F. or rewind instruction include the information of instruction time point.

Initial time search module 402, for searching for the initial time of previous sentence captions or the rear captions obtaining the instruction time point corresponding to F.F. or rewind instruction.

Playing module 404, for the captions corresponding from initial time and audio/video frames, plays audio/video file.

Embodiment five

With reference to Fig. 5, it is shown that the structure chart of a kind of audio/video file playing device in the embodiment of the present invention.

Device in the embodiment of the present invention may include that command reception module 500, initial time search module 502, key frame judge module 504, key frame search and decoder module 506, playing module 508, forbids rewind module 510, forbids F.F. module 512.

Wherein, initial time search module 502 may include that the time judges submodule 5021, and the very first time obtains submodule 5022, and the second time obtained submodule 5023, and the 3rd time obtained submodule 5024.

Wherein, the 3rd time obtained submodule 5024 and may include that identification judgment sub-unit 50241, continued to identify subelement 50242, and frame of video determines subelement 50243, and initial time determines subelement 50244.

Relation function and each module, respective module, the respective unit of each module, respective module, respective unit between is discussed in detail separately below.

Command reception module 500, for receiving the F.F. to currently playing audio/video file or rewind instruction.Wherein, F.F. or rewind instruction include the information of instruction time point.

Preferably, command reception module 500 can receive the F.F. or rewind instruction that are generated by clicking operation, gesture operation or voice operating.

Initial time search module 502, for searching for the initial time of previous sentence captions or the rear captions obtaining the instruction time point corresponding to F.F. or rewind instruction.

Preferably, initial time search module 502 mates the initial time of previous sentence captions or the rear captions obtaining instruction time point in the plug-in caption information or embedded caption information of audio/video file；Or,

Initial time search module 502, according to the recognition result of captions in the frame of video to audio/video file, obtains the previous sentence captions of instruction time point or the initial time of rear captions.

Wherein, the subtitle file that plug-in caption information is text file format of audio/video file；The embedded caption information of audio/video file is the caption data being present in audio/video file with independent orbital fashion；Plug-in caption information all includes timeline information and the captions corresponding with timeline information with embedded caption information；Timeline information includes initial time and the termination time of each sentence captions.

Preferably, initial time search module 502, including:

Time judges submodule 5021, for in the subtitle file of the text file format of the plug-in caption information of audio/video file or in the independent track at embedded caption information place, it is judged that whether instruction time point is positioned at initial time and the time period of the time of termination of the timeline information of a certain sentence captions.

The very first time obtains submodule 5022, if be positioned at initial time and the time period of the time of termination of the timeline information of a certain sentence captions for instruction time point, then these captions is defined as current subtitle, obtains the previous sentence captions of current subtitle or the initial time of rear captions；Wherein, the previous sentence captions of current subtitle are the termination time and terminate the nearest captions of initial time of time gap current subtitle before the initial time of current subtitle；Rear captions of current subtitle are that initial time is after the termination time of current subtitle and the termination time nearest captions of initial time distance current subtitle.

Second time obtained submodule 5023, if being not at the initial time of the timeline information of a certain sentence captions for instruction time point and in the time period of the time of termination, then corresponding to F.F. or rewind instruction, obtain the initial time of the captions that initial time is after instruction time point and initial time distance instruction time point is nearest, or obtain the initial time of the captions that the termination time is before the instruction time puts and termination time gap instruction time point is nearest.

3rd time obtained submodule 5024, there is the frame of video of captions for detecting the previous frame of video that there are captions or rear obtaining frame of video corresponding to instruction time point, obtain the previous frame of video that there are captions or rear and there is the initial time of captions in the frame of video of captions.

Preferably, the 3rd time obtained submodule 5024, including:

Identify judgment sub-unit 50241, for the frame of video that instruction time point is corresponding is carried out picture recognition, judge whether frame of video exists word according to the result of picture recognition.

Continue to identify subelement 50242, if there is word in frame of video, then according to F.F. or rewind instruction, continuing the frame of video after or before the instruction time is put and carrying out picture recognition, until identifying the frame of video being absent from word；Continuing according to F.F. or rewind instruction, the frame of video being absent from after or before the frame of video of word being carried out picture recognition, until again identifying that out the frame of video that there is word.

Frame of video determines subelement 50243, there is the frame of video of captions as the previous frame of video that there are captions or rear of frame of video corresponding to instruction time point for the frame of video of existence word that would again identify that out.

Initial time determines subelement 50244, for according to F.F. instruction, would again identify that out time point corresponding to the frame of video that the there is word initial time as rear one frame of video that there are captions；Or, according to rewind instruction, continue the frame of video before the time point that the frame of video of the existence word identified is corresponding is carried out picture recognition, until again identifying that out the frame of video being absent from word, using the time point corresponding for the nearest frame frame of video after the time point corresponding in the frame of video being absent from word again identified that out the initial time as the previous frame of video that there are captions.

Key frame judge module 504, for playing module 508 captions corresponding from initial time and audio frequency and video, before playing audio/video file, it is judged that whether audio/video frames corresponding to initial time is key frame；If so, then playing module performs, from captions corresponding to initial time and audio/video frames, to play the step of audio/video file.

Key frame search and decoder module 506, if not being key frame for the audio/video frames that initial time is corresponding, then searched for before initial time, apart from the key frame that the audio/video frames that initial time is corresponding is nearest, and start to decode the audio/video frames of audio/video file from nearest key frame, until when decoding is to the audio/video frames that initial time is corresponding, playing module performs, from captions corresponding to initial time and audio/video frames, to play the step of audio/video file.

Playing module 508, for the captions corresponding from initial time and audio/video frames, plays audio/video file.

Forbid rewind module 510, if for instruction time point less than or equal to the termination time of first captions in plug-in caption information or embedded caption information, or all without captions in all videos frame before instruction time point, then generate and show the information forbidding rewind.

Forbid F.F. module 512, if putting be more than or equal to the initial time of last captions in plug-in caption information or embedded caption information for the instruction time, or all without captions in all videos frame after instruction time point, then generate and show the information forbidding F.F..

For device embodiment, due to itself and embodiment of the method basic simlarity, so what describe is fairly simple, relevant part illustrates referring to the part of embodiment of the method.

Each embodiment in this specification all adopts the mode gone forward one by one to describe, and what each embodiment stressed is the difference with other embodiments, between each embodiment identical similar part mutually referring to.

A kind of audio/video file the playing method and device above embodiment of the present invention provided, it is described in detail, principles of the invention and embodiment are set forth by specific case used herein, and the explanation of above example is only intended to help to understand method and the core concept thereof of the present invention；Simultaneously for one of ordinary skill in the art, according to the thought of the present invention, all will change in specific embodiments and applications, in sum, this specification content should not be construed as limitation of the present invention.

Claims

1. an audio/video file player method, it is characterised in that including:

Receive the F.F. to currently playing audio/video file or rewind instruction；Wherein, described F.F. or rewind instruction include the information of instruction time point；

Search obtains the initial time of previous sentence captions or the rear captions put corresponding to the described instruction time of described F.F. or rewind instruction；

The captions corresponding from described initial time and audio/video frames, play described audio/video file.

2. method according to claim 1, it is characterised in that described search obtains the initial time of previous sentence captions or the rear captions put corresponding to the described instruction time of described F.F. or rewind instruction, including:

In the plug-in caption information or embedded caption information of described audio/video file, coupling obtains the previous sentence captions of point of described instruction time or the initial time of rear captions；Or,

According to the recognition result of captions in the frame of video to described audio/video file, obtain the previous sentence captions of point of described instruction time or the initial time of rear captions.

3. method according to claim 2, it is characterised in that the subtitle file that plug-in caption information is text file format of described audio/video file；The embedded caption information of described audio/video file is be present in the caption data in described audio/video file with independent orbital fashion；

Described plug-in caption information all includes timeline information and the captions corresponding with timeline information with embedded caption information；Described timeline information includes initial time and the termination time of each sentence captions.

4. method according to claim 3, it is characterised in that in the described plug-in caption information at described audio/video file or embedded caption information, coupling obtains the previous sentence captions of point of described instruction time or the initial time of rear captions, including:

In the subtitle file of the text file format of the described plug-in caption information of described audio/video file or in the independent track at described embedded caption information place, it is judged that whether point of described instruction time is positioned at initial time and the time period of the time of termination of the timeline information of a certain sentence captions；

If so, then these captions is defined as current subtitle, obtains the previous sentence captions of described current subtitle or the initial time of rear captions；Wherein, the previous sentence captions of described current subtitle are the termination time and terminate the captions that the initial time of current subtitle described in time gap is nearest before the initial time of described current subtitle；Rear captions of described current subtitle be initial time after the termination time of described current subtitle and initial time apart from the termination time of described current subtitle nearest captions；

If not, then corresponding to described F.F. or rewind instruction, obtain initial time after the described instruction time puts and initial time apart from the initial time of the nearest captions of point of described instruction time, or obtain the termination time and and terminate the initial time of the nearest captions of instruction time point described in time gap before the described instruction time puts.

5. method according to claim 2, it is characterised in that described basis, to the recognition result of captions in the frame of video of described audio/video file, obtains the previous sentence captions of point of described instruction time or the initial time of rear captions, including:

Detection obtains the previous frame of video that there are captions or rear of frame of video corresponding to point of described instruction time and there is the frame of video of captions, obtains the described previous frame of video that there are captions or rear and there is the initial time of captions in the frame of video of captions；

Wherein, the described previous frame of video that there are captions or rear exists and there is at least one frame frame of video without captions between the frame of video that the frame of video of captions is corresponding with point of described instruction time.

6. method according to claim 5, it is characterised in that detect the previous frame of video that there are captions or rear obtaining frame of video corresponding to point of described instruction time and there is the frame of video of captions, including:

The frame of video that point of described instruction time is corresponding is carried out picture recognition, judges whether described frame of video exists word according to the result of described picture recognition；

If existing, then according to described F.F. or rewind instruction, continuing the frame of video after or before the described instruction time is put and carrying out picture recognition, until identifying the frame of video being absent from word；

Continuing according to described F.F. or rewind instruction, the frame of video being absent from after or before the frame of video of word being carried out picture recognition, until again identifying that out the frame of video that there is word；

The frame of video of captions is there is in the frame of video of the existence word that would again identify that out as the previous frame of video that there are captions or rear of frame of video corresponding to point of described instruction time.

7. method according to claim 6, it is characterised in that the described previous frame of video that there are captions of described acquisition or rear exists the initial time of captions in the frame of video of captions, including:

According to described F.F. instruction, would again identify that out time point corresponding to the frame of video that the there is word initial time as after described one frame of video that there are captions；

Or,

According to described rewind instruction, continue the frame of video before the time point that the frame of video of the existence word identified is corresponding is carried out picture recognition, until again identifying that out the frame of video being absent from word, using the time point corresponding for the nearest frame frame of video after the time point corresponding in the frame of video being absent from word again identified that out the initial time as the described previous frame of video that there are captions.

8. method according to claim 2, it is characterised in that described method also includes:

If point of described instruction time is less than or equal to the termination time of first captions in described plug-in caption information or embedded caption information, or all without captions in all videos frame before point of described instruction time, then generates and show the information forbidding rewind；

If point of described instruction time is be more than or equal to the initial time of last captions in described plug-in caption information or embedded caption information, or all without captions in all videos frame after point of described instruction time, then generates and show the information forbidding F.F..

9. method according to claim 1, it is characterised in that the described captions corresponding from described initial time and audio frequency and video, before playing described audio/video file, described method also includes:

Judge whether audio/video frames corresponding to described initial time is key frame；

If so, then perform, from captions corresponding to described initial time and audio/video frames, to play the step of described audio/video file；

If not, then search for before described initial time, apart from the key frame that the audio/video frames that described initial time is corresponding is nearest, and start to decode the audio/video frames of described audio/video file from described nearest key frame, until when decoding is to the audio/video frames that described initial time is corresponding, perform, from captions corresponding to described initial time and audio/video frames, to play the step of described audio/video file.

10. method according to claim 1, it is characterised in that described reception to the F.F. of currently playing audio/video file or rewind instruction, including:

11. an audio/video file playing device, it is characterised in that including:

Command reception module, for receiving the F.F. to currently playing audio/video file or rewind instruction；Wherein, described F.F. or rewind instruction include the information of instruction time point；

Initial time search module, for searching for the initial time of previous sentence captions or the rear captions obtaining the point of described instruction time corresponding to described F.F. or rewind instruction；

Playing module, for the captions corresponding from described initial time and audio/video frames, plays described audio/video file.

12. device according to claim 11, it is characterised in that described initial time search module mates the initial time of previous sentence captions or the rear captions obtaining point of described instruction time in the plug-in caption information or embedded caption information of described audio/video file；Or,

Described initial time search module, according to the recognition result of captions in the frame of video to described audio/video file, obtains the previous sentence captions of point of described instruction time or the initial time of rear captions.

13. device according to claim 12, it is characterised in that the subtitle file that plug-in caption information is text file format of described audio/video file；The embedded caption information of described audio/video file is be present in the caption data in described audio/video file with independent orbital fashion；

14. device according to claim 13, it is characterised in that described initial time search module, including:

Time judges submodule, for in the subtitle file of the text file format of the described plug-in caption information of described audio/video file or in the independent track at described embedded caption information place, it is judged that whether point of described instruction time is positioned at initial time and the time period of the time of termination of the timeline information of a certain sentence captions；

The very first time obtains submodule, if be positioned at initial time and the time period of the time of termination of the timeline information of a certain sentence captions for point of described instruction time, then these captions is defined as current subtitle, obtains the previous sentence captions of described current subtitle or the initial time of rear captions；Wherein, the previous sentence captions of described current subtitle are the termination time and terminate the captions that the initial time of current subtitle described in time gap is nearest before the initial time of described current subtitle；Rear captions of described current subtitle be initial time after the termination time of described current subtitle and initial time apart from the termination time of described current subtitle nearest captions；

Second time obtained submodule, if being not at the initial time of the timeline information of a certain sentence captions for point of described instruction time and in the time period of the time of termination, then corresponding to described F.F. or rewind instruction, obtain initial time after the described instruction time puts and initial time apart from the initial time of the nearest captions of point of described instruction time, or obtain the termination time and and terminate the initial time of the nearest captions of instruction time point described in time gap before the described instruction time puts.

15. device according to claim 12, it is characterised in that described initial time search module, including:

3rd time obtained submodule, there is the frame of video of captions for detecting the previous frame of video that there are captions or rear obtaining frame of video corresponding to point of described instruction time, obtain the described previous frame of video that there are captions or rear and there is the initial time of captions in the frame of video of captions；

16. device according to claim 15, it is characterised in that described 3rd time obtains submodule, including:

Identify judgment sub-unit, for the frame of video that point of described instruction time is corresponding is carried out picture recognition, judge whether described frame of video exists word according to the result of described picture recognition；

Continue to identify subelement, if there is word in described frame of video, then according to described F.F. or rewind instruction, continuing the frame of video after or before the described instruction time is put and carrying out picture recognition, until identifying the frame of video being absent from word；Continuing according to described F.F. or rewind instruction, the frame of video being absent from after or before the frame of video of word being carried out picture recognition, until again identifying that out the frame of video that there is word；

Frame of video determines subelement, there is the frame of video of captions as the previous frame of video that there are captions or rear of frame of video corresponding to point of described instruction time for the frame of video of existence word that would again identify that out.

17. device according to claim 16, it is characterised in that described 3rd time obtains submodule, also includes:

Initial time determines subelement, for according to described F.F. instruction, would again identify that out time point corresponding to the frame of video that the there is word initial time as after described one frame of video that there are captions；Or, according to described rewind instruction, continue the frame of video before the time point that the frame of video of the existence word identified is corresponding is carried out picture recognition, until again identifying that out the frame of video being absent from word, using the time point corresponding for the nearest frame frame of video after the time point corresponding in the frame of video being absent from word again identified that out the initial time as the described previous frame of video that there are captions.

18. device according to claim 12, it is characterised in that described device also includes:

Forbid rewind module, if putting less than or equal to the termination time of first captions in described plug-in caption information or embedded caption information for the described instruction time, or all without captions in all videos frame before described instruction time point, then the information of rewind is forbidden in generation display；

Forbid F.F. module, if putting be more than or equal to the initial time of last captions in described plug-in caption information or embedded caption information for the described instruction time, or all without captions in all videos frame after described instruction time point, then the information of F.F. is forbidden in generation display.

19. device according to claim 11, it is characterised in that described device also includes:

Key frame judge module, for described playing module captions corresponding from described initial time and audio frequency and video, before playing described audio/video file, it is judged that whether audio/video frames corresponding to described initial time is key frame；If so, then described playing module performs, from captions corresponding to described initial time and audio/video frames, to play the step of described audio/video file；

Key frame search and decoder module, if not being key frame for the audio/video frames that described initial time is corresponding, then search for before described initial time, apart from the key frame that the audio/video frames that described initial time is corresponding is nearest, and start to decode the audio/video frames of described audio/video file from described nearest key frame, until when decoding is to audio/video frames corresponding to described initial time, described playing module performs, from captions corresponding to described initial time and audio/video frames, to play the step of described audio/video file.

20. device according to claim 11, it is characterised in that described command reception module receives the F.F. or rewind instruction that are generated by clicking operation, gesture operation or voice operating.