CN110019936A

CN110019936A - A kind of annotation method and apparatus during playback of media files

Info

Publication number: CN110019936A
Application number: CN201711065075.8A
Authority: CN
Inventors: 涂畅; 张扬; 王砚峰
Original assignee: Beijing Sogou Technology Development Co Ltd
Current assignee: Beijing Sogou Technology Development Co Ltd
Priority date: 2017-11-02
Filing date: 2017-11-02
Publication date: 2019-07-16

Abstract

The embodiment of the present application discloses the annotation method and apparatus during a kind of playback of media files, it identifies that target text corresponds to the first moment of the media file time shaft with the target text in provided text information when playing from media file, and determines the annotation information for illustrating the target text.Monitor the play time of the media file being played on, when the playback of media files to the second moment and first moment meet predetermined condition when, it can then determine that the user for playing the media file is larger a possibility that will or already have the demand for solving the target text at second moment, therefore start to show the annotation information at second moment, so that the user for the demand of knowing about can have gained some understanding to the meaning of the target text by the annotation information of displaying, this mode does not need user and suspends broadcasting media file and make additional search operation, it can complete to understand target text during smooth rating is listened to, improve the audiovisual experience of user.

Description

A kind of annotation method and apparatus during playback of media files

Technical field

This application involves data processing fields, more particularly to the annotation method and dress during a kind of playback of media files It sets.

Background technique

During playing media file such as video/audio, text information, example can be provided by modes such as audios The lyrics in lines, audio in such as video.

These text informations may include some not intelligible vocabulary, such as more profession or uncommon vocabulary, lead The user for listening to or watching media file is caused not can be appreciated that the meaning of this part vocabulary.

User generally can only first suspend broadcasting media file, then internet searching to help in order to understand this part vocabulary Assistant's solution.But this mode not only needs user to make additional search operation, also results in broadcasting and interrupts, reduces audiovisual body It tests.

Summary of the invention

In order to solve the above-mentioned technical problem, this application provides the annotation methods and dress during a kind of playback of media files It sets, can complete to understand target text during smooth rating is listened to, improve the audiovisual experience of user.

The embodiment of the present application discloses following technical solution:

In a first aspect, the embodiment of the present application provides a kind of annotation method during playback of media files, the method Include:

Identify that target text and the target text are corresponding in provided text information when playing from media file First moment of the media file time shaft；

Determine annotation information corresponding to the target text；

When the playback of media files to the second moment, the annotation information, second moment and described first are shown Time difference between moment meets preset condition.

Optionally, the media file provided text information when playing is obtained according to following manner:

In the playing process of the media file, the media data for being directed to media file institute pre-download is obtained；

By obtaining the media data provided text information when playing to the media data analyzing.

It is optionally, described to identify target text in provided text information when playing from media file, comprising:

Judge whether the media file has default text when playing in provided text information；

If so, being described by the default Text region of the media file when playing in provided text information Target text.

Optionally, the default text is to determine that the historical search data includes that user searches according to historical search data The text of rope and the number of search text, the default text are the text searched for the frequency and be higher than threshold value；Alternatively,

The default text is to be determined according to history played data, and the history played data includes that user is playing media The text searched for during file, the default text are the text for including in the history played data.

Optionally, the display annotation information, comprising:

The annotation information is embedded into broadcast interface and is shown；Alternatively,

The annotation information is shown in pop-up.

Optionally, after the display annotation information, further includes:

The annotation information is cancelled after predetermined time and being shown.

It optionally, include the target text in the annotation information.

Second aspect, the embodiment of the present application provide the annotation mechanism during a kind of playback of media files, described device Including recognition unit, determination unit and display unit:

The recognition unit, for identified in provided text information from media file when playing target text and The target text corresponds to the first moment of the media file time shaft；

The determination unit, for determining annotation information corresponding to the target text；

The display unit, for when the playback of media files is to the second moment, showing the annotation information, described the Time difference between two moment and first moment meets preset condition.

Optionally, described device further includes acquiring unit, the media file provided text information root when playing Under type obtains accordingly:

The acquiring unit, it is pre- for the media file institute for obtaining in the playing process of the media file The media data of downloading；Believed by obtaining the media data provided text when playing to the media data analyzing Breath.

Optionally, the recognition unit is also used to judge that the media file is in provided text information when playing It is no that there is default text；If so, the default text by the media file when playing in provided text information is known It Wei not the target text.

Optionally, the display unit is also used to for the annotation information being embedded into broadcast interface and show；Alternatively, by institute Annotation information is stated to be shown in pop-up.

Optionally, the display unit is also used to cancel display to the annotation information after the predetermined time.

It optionally, include the target text in the annotation information.

The third aspect, the embodiment of the present application provide a kind of device for annotating during playback of media files, including Having memory and one, perhaps more than one program one of them or more than one program is stored in memory, and It is configured to execute the one or more programs by one or more than one processor to include following for carrying out The instruction of operation:

Determine annotation information corresponding to the target text；

Fourth aspect, the embodiment of the present application provide a kind of machine readable media, are stored thereon with instruction, when by one or When multiple processors execute, so that device is executed such as the prompt canceling method in first aspect.

Target is identified in provided text information when playing from media file it can be seen from above-mentioned technical proposal Text corresponds to the first moment of the media file time shaft with the target text, and determines for illustrating the target text Annotation information.The play time for monitoring the media file being played on, the second moment arrived when the playback of media files and institute When stating for the first moment and meeting predetermined condition, then can determine play the user of the media file second moment will or A possibility that having the demand for understanding the target text, is larger, therefore starts to show the annotation information at second moment, to have The user of understanding demand can have gained some understanding to the meaning of the target text by the annotation information of displaying, and this mode does not need User, which suspends, plays media file, does not also need user and makes additional search operation, energy during smooth rating is listened to It completes to understand target text, improves the audiovisual experience of user.

Detailed description of the invention

In order to illustrate the technical solutions in the embodiments of the present application or in the prior art more clearly, to embodiment or will show below There is attached drawing needed in technical description to be briefly described, it should be apparent that, the accompanying drawings in the following description is only this Some embodiments of application without any creative labor, may be used also for those of ordinary skill in the art To obtain other drawings based on these drawings.

Fig. 1 is the schematic diagram of a scenario of the annotation scene during a kind of playback of media files provided by the embodiments of the present application；

Fig. 2 is the method flow diagram of the annotation method during a kind of playback of media files provided by the embodiments of the present application；

Fig. 3 is a kind of method flow that target text is identified during playback of media files provided by the embodiments of the present application Figure；

Fig. 4 is provided the acquisition modes of text information by a kind of media file provided by the embodiments of the present application when playing Method flow diagram；

Fig. 5 is the structure drawing of device of the annotation mechanism during a kind of playback of media files provided by the embodiments of the present application；

Fig. 6 is a kind of a kind of frame of device for annotating during playback of media files provided by the embodiments of the present application Figure；

Fig. 7 is a kind of a kind of frame of server for annotating during playback of media files provided by the embodiments of the present application Figure.

Specific embodiment

With reference to the accompanying drawing, embodiments herein is described.

During user plays media file, media file can provide text information in playing process, these Text information is generally related to broadcasting content.If in these text informations including not intelligible vocabulary, cause to play the matchmaker The user of body file cannot understand wherein meaning for the moment, to understand that information expressed by the media file affects to user.

It can only voluntarily be searched for by user in traditional approach, in order not to miss the subsequent content of media file, user can only be temporary Break and puts media file and scan for manually again.This mode not only needs user to make additional search operation, also results in It plays and interrupts, reduce audiovisual experience.

For this purpose, the embodiment of the present application provides a kind of annotation method during playback of media files, in media file In playing process, the annotation information for illustrating the vocabulary can be shown at the time of not readily understood vocabulary occurs or so, it is this Mode does not need user and suspends broadcasting media file, does not need user yet and makes additional search operation.User is in smooth rating It can complete to understand target text during listening to, improve the audiovisual experience of user.

In playback of media files, provided text information can be what media file provided by various modes.Such as Dialogue, explanation, the lyrics, illustrative words that media file is carried by audio data etc. are also possible to be embedded in video pictures In dialogue, explanation, illustrative words etc..

The executing subject for playing media file can be all kinds of processing equipments with media file playing function, such as move Dynamic terminal, PAD, personal computer etc..A such as concrete application shown in FIG. 1 to play media file in the terminal Scene, the media file are a video file, and playing provided text information when the broadcasting of the video file is the video In file under played video scene personage lines, which includes not intelligible vocabulary, such as target text " fierce look wolf Care for ", it is somebody's turn to do and occurs that the first moment of time shaft, such as the 26th second should be appeared in when " fierce look is looked back from time to time as wolf does " plays in the video file. It can first determine the annotation information 200 of " fierce look is looked back from time to time as wolf does " are as follows: " describe sharp-eyed, to be that people is atrocious " etc., when the video file The second moment for being played to and when meeting predetermined condition on the 26th second, such as when being just played to the 26th second, can show that the annotation is believed Breath 200, such as be shown at the position of video playing display area 300.As a result, when user plays the video file to At 26 seconds, see from video pictures or audio or heard " fierce look is looked back from time to time as wolf does ", is not understanding what this vocabulary was intended by When meaning, so that it may the annotation information 200 to " fierce look is looked back from time to time as wolf does " is seen at position 300, to understand this vocabulary Meaning, when user being avoided to watch this video file, since " fierce look is looked back from time to time as wolf does " this uncommon vocabulary is to understanding this video file It is intended by the influence of meaning.

Next annotation method provided herein will be further illustrated in conjunction with Fig. 2, Fig. 2 provides for the embodiment of the present application A kind of playback of media files during annotation method method flow diagram, which comprises

S201: identify that target text and target text are corresponding in provided text information when playing from media file First moment of media file time shaft.

That is, when obtaining media file provided text information when playing, it can be according to scheduled sieve Condition is selected to determine which the target text for including in text information has, which can be to determine that uncommon words is Purpose and be arranged, be also possible to for the purpose of determining professional words and be arranged, be also possible to determine historical events For the purpose of and be arranged.

The embodiment of the present application does not limit how target text is identified from text information provided by media file, only If can recognize that uncommon words.But to improving the accuracy for identifying target text, the mesh identified is played Mark text belongs to the purpose of the not intelligible text of most of user, and the embodiment of the present application provides a kind of recognition methods.Such as Fig. 3 It is shown:

S301: judge whether media file has default text when playing in provided text information；If so, executing S302。

S302: being target text by the default Text region of the media file when playing in provided text information Word.

The default text can be predetermined rarely used word word, and the default text is also that user is allowed to have further Understand the text of its meaning demand.

When including thus the default text in provided text information when playing when media file, with the default text Word can play the effect that the target text identified belongs to the not intelligible text of most of user as target text, improve Identify the accuracy of target text.

The embodiment of the present application provides the mode that text is preset in several determinations, next will carry out for two of them detailed Illustrate:

First way:

Default text is to be determined according to historical search data.

Wherein, which can be the data extracted by big data analysis, which includes The text of user's search and the number of search text, and the behavior that user searches for text may include user passes through all kinds of search The behavior of engine search text.

This is embodied if a text, which searches for the frequency by user, is higher than threshold value by the analysis to historical search data The meaning of text do not allow for a part or most of user it is readily understood, and this certain customers also have by search for come The demand of this text is solved, therefore can be using this text as default text.

Alternatively, the second way:

Default text is to be determined according to history played data.

Wherein, which includes the text that user searches for during playing media file, is playing matchmaker Searched in body file processes text can be understood as the media file be opened broadcasting and during being not turned off (such as still The case where playing or being suspended broadcasting), the text search behavior that user carries out, such case can embody user and be searched for Text there is a strong possibility with the media file when playing provided text information in relation to and the text searched for be to use Family does not know about meaning but knows about the text of demand.Therefore the text that user searches for during playing media file can be made To preset text.

In text information provided by one media file, one or more target texts can be identified.Moreover, not only Can recognize that target text, can also determine the media file in playing process, shown by video or audible or At the time of playing the target text, i.e., the target text is provided on the time shaft for measuring the playback of media files progress The first moment.

S202: annotation information corresponding to target text is determined.

The embodiment of the present application does not limit the annotation information for how determining target text, such as can be and pass through network retrieval Etc. modes obtain.Annotation information corresponding to one target text simple and clear by way of being more easily understood can be said The meaning of the bright target text, for example, for more uncommon " fierce look is looked back from time to time as wolf does " this vocabulary for, corresponding annotation information It may include that this vocabulary meaning is explained using relatively conventional term, such as " describe sharp-eyed, to be that people is atrocious " this note Release information.

S203: when playback of media files to the second moment, annotation information is shown.

Wherein, the time difference between the second moment and the first moment meets preset condition, and predetermined condition described here is It is arranged to allow user to understand the target text, it is shown when meeting the predetermined condition with the first moment at the second moment Annotation information can allow the user for listening to or watching the target text that can understand the mesh by the annotation information of display Mark the meaning of text.Here the second moment and the first moment is obtained according to the determination of the time shaft of the media file.

Therefore it is based on this purpose, which should not differ too much with the first moment, if mutually far short of what is expected, lead to the annotation Information is too early when being shown to user too late, user may not know the annotation information be to provided by the media file which The annotation of a text information to cause to perplex to user, or even interferes with the experience that user listened to or watched the media file. So in general, which can be a lesser numerical value, such as zero, positive negative one second etc..Such as media file is Video file, preset condition are positive one second, and when exceeding one second the first moment at the second moment, i.e., user has just seen target text one When the second, annotation information is shown, to explain to the target text that user has just seen；Such as preset condition can be zero, when When second moment is identical as the first moment, i.e., while being just played to displaying target text, show the annotation letter of the target text Breath, can quickly establish the relationship between target text and annotation information convenient for user.

When showing annotation information, the different display modes for showing user can be used, such as annotation can be believed Breath, which is embedded into broadcast interface, to be shown；Alternatively, showing annotation information in pop-up.

In the case where media file is video file and video file shows text information by video pictures, display The position of annotation information can be what the position according to target text determined.Such as it is shown in the periphery of target text, So that user can quickly determine that currently displayed annotation information is related with which text.

Alternatively, in addition to may include that can also include for illustrating, other than the content of objective of interpretation text in annotation information Target text is assorted so that user can specify the target text that this annotation information to be annotated when seeing the annotation information ?.Such as the corresponding annotation information of target text " fierce look is looked back from time to time as wolf does " may include " fierce look look back from time to time as wolf does it is sharp-eyed for describing, be people It is atrocious ".

In order to improve the experience that user watches and listen media file, the duration of display annotation information can control.It can allow After showing annotation information, the annotation information is cancelled after the predetermined time and being shown.Operating in this way is advantageous in that, it is possible, firstly, to keep away Exempt from annotation information and occupy display area for a long time, user's viewing is influenced, secondly as text provided by a media file It may include multiple target texts in information, and be possible to multiple target texts and be spaced during playing media file It is shorter, if the corresponding annotation information of long-time displaying target text a, it is possible to playing process, when target text b occurs When, the annotation information of target text b may mutually be covered with the annotation information of target text b, influence to annotate effect.For taking Disappear and show that the predetermined time of annotation information can be arranged according to scene or demand, such as can be three seconds.

As it can be seen that identifying target text and the target text pair in provided text information when playing from media file The first moment of the media file time shaft is answered, and determines the annotation information for illustrating the target text.Monitoring is The play time of the media file played makes a reservation for when the second moment and first moment that the playback of media files arrives meet When condition, then it can determine that the user for playing the media file will or already have at second moment and solve the target text Demand a possibility that it is larger, therefore start to show the annotation information at second moment, so that the user for the demand of knowing about can be with It is had gained some understanding by the annotation information of displaying to the meaning of the target text, this mode does not need user and suspends broadcasting media text Part does not need user yet and makes additional search operation, can complete to understand target text during smooth rating is listened to, mention The high audiovisual experience of user.

Next for media file in S201, when playing, the acquisition modes of provided text information are illustrated.

In the embodiment of the present application, provided text information can therefrom be obtained by the pre-processing to media file, Resettle the corresponding relationship between target text in text information, the annotation information of target text and media file.When there is media literary When part is played, the annotation information of the media file corresponding target text and target text can be found, and taken Carve display annotation information.

In the embodiment of the present application, text provided by this media file can also be obtained in a playback of media files Information.It is as shown in Figure 4:

S401: in the playing process of media file, the media data for being directed to media file institute pre-download is obtained.

S402: by obtaining media data provided text information when playing to media data analyzing.

Due in order to guarantee play quality, preparatory downloads of media file when playing media file by processing equipment In after the currently playing moment media data corresponding to content.Therefore this part pre-download can be used in the embodiment of the present application Media data, for therefrom analyzing this corresponding text information of part of media data.Such as it can be pre- to these on backstage The media data of downloading plays out, and therefrom identifies text information provided by these media datas.It optionally, can basis Speech recognition technology identifies to obtain media data provided text information when playing.So as in no subtitle, the lyrics When identify text information.

After identifying the text information in media data, the scheme in S201 can be implemented according to these text informations.

Fig. 5 is the structure drawing of device of the annotation mechanism during a kind of playback of media files provided by the embodiments of the present application, The device includes recognition unit 501, determination unit 502 and display unit 503:

The recognition unit 501, for identifying target text in provided text information when playing from media file Word and the target text correspond to the first moment of the media file time shaft；

The determination unit 502, for determining annotation information corresponding to the target text；

The display unit 503 shows the annotation information, institute for working as the playback of media files to the second moment The time difference stated between the second moment and first moment meets preset condition.

It optionally, include the target text in the annotation information.

Feature in the present embodiment can be with reference to the associated description in embodiment corresponding to Fig. 1-4, and which is not described herein again.

Fig. 6 is a kind of block diagram of device 600 for speech synthesis shown according to an exemplary embodiment.For example, dress Setting 600 can be robot, mobile phone, computer, digital broadcasting terminal, messaging device, game console, and plate is set It is standby, Medical Devices, body-building equipment, personal digital assistant etc..

Referring to Fig. 6, device 600 may include following one or more components: processing component 602, memory 604, power supply Component 606, multimedia component 608, audio component 610, the interface 612 of input/output (I/O), sensor module 614, and Communication component 616.

The integrated operation of the usual control device 600 of processing component 602, such as with display, telephone call, data communication, phase Machine operation and record operate associated operation.Processing element 602 may include that one or more processors 620 refer to execute It enables, to perform all or part of the steps of the methods described above.In addition, processing component 602 may include one or more modules, just Interaction between processing component 602 and other assemblies.For example, processing component 602 may include multi-media module, it is more to facilitate Interaction between media component 603 and processing component 602.

Memory 604 is configured as storing various types of data to support the operation in device 600.These data are shown Example includes the instruction of any application or method for operating on device 600, contact data, and telephone book data disappears Breath, picture, video etc..Memory 604 can be by any kind of volatibility or non-volatile memory device or their group It closes and realizes, such as static random access memory (SRAM), electrically erasable programmable read-only memory (EEPROM) is erasable to compile Journey read-only memory (EPROM), programmable read only memory (PROM), read-only memory (ROM), magnetic memory, flash Device, disk or CD.

Power supply module 606 provides electric power for the various assemblies of device 600.Power supply module 606 may include power management system System, one or more power supplys and other with for device 600 generate, manage, and distribute the associated component of electric power.

Multimedia component 608 includes the screen of one output interface of offer between described device 600 and user.One In a little embodiments, screen may include liquid crystal display (LCD) and touch panel (TP).If screen includes touch panel, screen Curtain may be implemented as touch screen, to receive input signal from the user.Touch panel includes one or more touch sensings Device is to sense the gesture on touch, slide, and touch panel.The touch sensor can not only sense touch or sliding action Boundary, but also detect duration and pressure associated with the touch or slide operation.In some embodiments, more matchmakers Body component 608 includes a front camera and/or rear camera.When device 600 is in operation mode, such as screening-mode or When video mode, front camera and/or rear camera can receive external multi-medium data.Each front camera and Rear camera can be a fixed optical lens system or have focusing and optical zoom capabilities.

Audio component 610 is configured as output and/or input audio signal.For example, audio component 610 includes a Mike Wind (MIC), when device 600 is in operation mode, when such as call mode, recording mode, and voice recognition mode, microphone is matched It is set to reception external audio signal.The received audio signal can be further stored in memory 604 or via communication set Part 616 is sent.In some embodiments, audio component 610 further includes a loudspeaker, is used for output audio signal.

I/O interface 612 provides interface between processing component 602 and peripheral interface module, and above-mentioned peripheral interface module can To be keyboard, click wheel, button etc..These buttons may include, but are not limited to: home button, volume button, start button and lock Determine button.

Sensor module 614 includes one or more sensors, and the state for providing various aspects for device 600 is commented Estimate.For example, sensor module 614 can detecte the state that opens/closes of device 600, and the relative positioning of component, for example, it is described Component is the display and keypad of device 600, and sensor module 614 can be with 600 1 components of detection device 600 or device Position change, the existence or non-existence that user contacts with device 600,600 orientation of device or acceleration/deceleration and device 600 Temperature change.Sensor module 614 may include proximity sensor, be configured to detect without any physical contact Presence of nearby objects.Sensor module 614 can also include optical sensor, such as CMOS or ccd image sensor, at As being used in application.In some embodiments, which can also include acceleration transducer, gyro sensors Device, Magnetic Sensor, pressure sensor or temperature sensor.

Communication component 616 is configured to facilitate the communication of wired or wireless way between device 600 and other equipment.Device 600 can access the wireless network based on communication standard, such as WiFi, 2G or 3G or their combination.In an exemplary implementation In example, communication component 616 receives broadcast singal or broadcast related information from external broadcasting management system via broadcast channel. In one exemplary embodiment, the communication component 616 further includes near-field communication (NFC) module, to promote short range communication.Example Such as, NFC module can be based on radio frequency identification (RFID) technology, Infrared Data Association (IrDA) technology, ultra wide band (UWB) technology, Bluetooth (BT) technology and other technologies are realized.

In the exemplary embodiment, device 500 can be believed by one or more application specific integrated circuit (ASIC), number Number processor (DSP), digital signal processing appts (DSPD), programmable logic device (PLD), field programmable gate array (FPGA), controller, microcontroller, microprocessor or other electronic components are realized, for executing the above method.

In the exemplary embodiment, a kind of non-transitorycomputer readable storage medium including instruction, example are additionally provided It such as include the memory 604 of instruction, above-metioned instruction can be executed by the processor 620 of device 600 to complete the above method.For example, The non-transitorycomputer readable storage medium can be ROM, random access memory (RAM), CD-ROM, tape, floppy disk With optical data storage devices etc..

A kind of non-transitorycomputer readable storage medium, when the instruction in the storage medium is by the processing of mobile terminal When device executes, so that mobile terminal is able to carry out a kind of prompt canceling method, which comprises

Determine annotation information corresponding to the target text；

Fig. 7 is the structural schematic diagram of server in the embodiment of the present invention.The server 700 can be due to configuration or performance be different Generate bigger difference, may include one or more central processing units (central processing units, CPU) 722 (for example, one or more processors) and memory 732, one or more storage application programs 742 or The storage medium 730 (such as one or more mass memory units) of data 744.Wherein, memory 732 and storage medium 730 can be of short duration storage or persistent storage.The program for being stored in storage medium 730 may include one or more modules (diagram does not mark), each module may include to the series of instructions operation in server.Further, central processing unit 722 can be set to communicate with storage medium 730, and the series of instructions behaviour in storage medium 730 is executed on server 700 Make.

Server 700 can also include one or more power supplys 724, one or more wired or wireless networks Interface 750, one or more input/output interfaces 758, one or more keyboards 754, and/or, one or one The above operating system 741, such as Windows ServerTM, Mac OS XTM, UnixTM, LinuxTM, FreeBSDTM etc..

Those of ordinary skill in the art will appreciate that: realize that all or part of the steps of above method embodiment can pass through The relevant hardware of program instruction is completed, and foregoing routine can be stored in a computer readable storage medium, which exists When execution, step including the steps of the foregoing method embodiments is executed；And storage medium above-mentioned can be at least one in following media Kind: read-only memory (English: read-only memory, abbreviation: ROM), RAM, magnetic or disk etc. are various to be can store The medium of program code.

It should be noted that all the embodiments in this specification are described in a progressive manner, each embodiment it Between same and similar part may refer to each other, each embodiment focuses on the differences from other embodiments. For equipment and system embodiment, since it is substantially similar to the method embodiment, so describe fairly simple, The relevent part can refer to the partial explaination of embodiments of method.Equipment and system embodiment described above is only schematic , wherein unit may or may not be physically separated as illustrated by the separation member, it is shown as a unit Component may or may not be physical unit, it can and it is in one place, or may be distributed over multiple networks On unit.Some or all of the modules therein can be selected to achieve the purpose of the solution of this embodiment according to the actual needs. Those of ordinary skill in the art can understand and implement without creative efforts.

The above, only a kind of specific embodiment of the application, but the protection scope of the application is not limited thereto, Within the technical scope of the present application, any changes or substitutions that can be easily thought of by anyone skilled in the art, Should all it cover within the scope of protection of this application.Therefore, the protection scope of the application should be with scope of protection of the claims Subject to.

Claims

1. a kind of annotation method during playback of media files, which is characterized in that the described method includes:

It is identified in provided text information when playing described in target text and target text correspondence from media file First moment of media file time shaft；

Determine annotation information corresponding to the target text；

When the playback of media files to the second moment, the annotation information, second moment and first moment are shown Between time difference meet preset condition.

2. the method according to claim 1, wherein the media file provided text information when playing It is obtained according to following manner:

3. the method according to claim 1, wherein described, from media file, when playing, provided text is believed Target text is identified in breath, comprising:

If so, being the target by the default Text region of the media file when playing in provided text information Text.

4. according to the method described in claim 3, it is characterized in that, the default text be according to historical search data determine, The historical search data includes the text of user's search and the number of search text, and the default text is that the search frequency is higher than The text of threshold value；Alternatively,

The default text is to be determined according to history played data, and the history played data includes that user is playing media file During the text searched for, the default text is the text for including in the history played data.

5. the method according to claim 1, wherein the display annotation information, comprising:

The annotation information is shown in pop-up.

6. the method according to claim 1, wherein after the display annotation information, further includes:

7. according to claim 1 to method described in 6 any one, which is characterized in that include the mesh in the annotation information Mark text.

8. the annotation mechanism during a kind of playback of media files, which is characterized in that described device includes recognition unit, determines list Member and display unit:

The recognition unit, for identifying target text and described in provided text information when playing from media file Target text corresponds to the first moment of the media file time shaft；

The display unit, for when the playback of media files is to the second moment, showing the annotation information, when described second The time difference carved between first moment meets preset condition.

9. a kind of device for annotating during playback of media files, which is characterized in that include memory and one or The more than one program of person, one of them perhaps more than one program be stored in memory and be configured to by one or It includes the instruction for performing the following operation that more than one processor, which executes the one or more programs:

Determine annotation information corresponding to the target text；

10. a kind of machine readable media is stored thereon with instruction, when executed by one or more processors, so that device is held Prompt canceling method of the row as described in one or more in claim 1 to 7.