CN107636645A - Techniques for automatically generating bookmarks for media files - Google Patents

Techniques for automatically generating bookmarks for media files

Info

Publication number
CN107636645A
Authority
CN
China
Prior art keywords
bookmark
media
media file
component
file
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201680026385.6A
Other languages
Chinese (zh)
Inventor
O·钱德拉
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Microsoft Technology Licensing LLC
Original Assignee
Microsoft Technology Licensing LLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Technology Licensing LLC
Publication of CN107636645A

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01 Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048 Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0481 Interaction techniques based on graphical user interfaces [GUI] based on specific properties of the displayed interaction object or a metaphor-based environment, e.g. interaction with desktop elements like windows or icons, or assisted by a cursor's changing behaviour or appearance
    • G06F3/0483 Interaction with page-structured environments, e.g. book metaphor
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60 Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/68 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/683 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • G06F16/685 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using automatically derived transcript of audio data, e.g. lyrics
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/40 Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
    • G06F16/44 Browsing; Visualisation therefor
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70 Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/78 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/783 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • G06F16/7834 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using audio features
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/26 Speech to text systems
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/221 Announcement of recognition results

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Multimedia (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Library & Information Science (AREA)
  • General Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • General Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

Techniques for automatically generating bookmarks for media files are described. An apparatus may include a logic device arranged to execute a bookmark application. The logic device may include, for example, a processing system having a processor and memory. The bookmark application may include a media file component operative to manage media files. A media file may store various types of multimedia content. The bookmark application may also include a media bookmark component operative to identify a media file of media information, scan the media file for bookmark indicators, automatically generate bookmarks for the media file based on the bookmark indicators, and present the bookmarks on a user interface. Other embodiments are described and claimed.

Description

Techniques for automatically generating bookmarks for media files
Background
Content recordings, such as audio or video recordings, are used to capture information for later review. In some cases, however, it can be difficult to locate relevant information, particularly in meetings, classes, interviews, and other similarly long content recordings. Often only certain portions of a content recording are of particular interest to a user. If a user could quickly locate those portions, the content recording could be put to better use.
Summary
This Summary is provided to introduce, in simplified form, a selection of concepts that are further described below in the Detailed Description. This Summary is not intended to identify key features or essential features of the claimed subject matter, nor is it intended as an aid in determining the scope of the claimed subject matter.
Various embodiments are generally directed to electronic media systems. Some embodiments are particularly directed to an electronic media system arranged to automatically generate electronic bookmarks for one or more media files. The electronic media system may automatically generate one or more bookmarks for a media file without user intervention (for example, without manual bookmark generation). A bookmark allows a user to quickly locate and reproduce media content of interest for later review. The electronic media system may store bookmarks together with the media file as metadata for later use by various users.
In one embodiment, for example, an apparatus may include a logic device arranged to execute a bookmark application. The logic device may include, for example, a processing system having a processor and memory. The bookmark application may include a media file component operative to manage media files. A media file may store various types of multimedia content. The bookmark application may also include a media bookmark component operative to identify a media file of media information, scan the media file for bookmark indicators, automatically generate bookmarks for the media file based on the bookmark indicators, and present the bookmarks on a user interface. Other embodiments are described and claimed.
These and other features and advantages will be apparent from a reading of the following detailed description and a review of the associated drawings. It is to be understood that both the foregoing general description and the following detailed description are explanatory only and do not restrict the aspects as claimed.
Brief Description of the Drawings
Fig. 1A shows an embodiment of a media system.
Fig. 1B shows a different embodiment of the media system.
Fig. 2 shows an embodiment of a user interface view for recording.
Fig. 3 shows an embodiment of a user interface view for playback.
Fig. 4 shows an embodiment of an alternative user interface view.
Fig. 5A shows an embodiment of a first user interface view for bookmarks.
Fig. 5B shows an embodiment of a second user interface view for bookmarks.
Fig. 5C shows an embodiment of a third user interface view for bookmarks.
Fig. 6 shows an embodiment of the third user interface view for bookmarks.
Fig. 7A shows an embodiment of a fourth user interface view for bookmarks.
Fig. 7B shows an embodiment of a fifth user interface view for bookmarks.
Fig. 7C shows an embodiment of a sixth user interface view for bookmarks.
Fig. 8A shows an embodiment of a first logic flow for generating bookmarks for a media file.
Fig. 8B shows an embodiment of a second logic flow for scanning a media file for bookmark indicators.
Fig. 8C shows an embodiment of a third logic flow for scanning a media file for bookmark indicators.
Fig. 8D shows an embodiment of a fourth logic flow for generating bookmarks for a media file.
Fig. 9 shows an embodiment of a fifth logic flow for generating bookmarks for a media file.
Fig. 10 shows an embodiment of a sixth logic flow for reproducing bookmarked media content from a media file.
Fig. 11 shows an embodiment of a first suitable computing architecture.
Fig. 12 shows an embodiment of a second suitable computing architecture.
Detailed Description
Users frequently need to record media content with an electronic device. For example, a user may use a mobile device such as a smart watch, smart phone, tablet computer, or laptop computer to record audio or video information from a lecture, meeting, interview, and so forth. The electronic device may store the recorded media content as a media file in some form of computer-readable memory. A user (the same one or a different one) may want to review the recorded media content at a later time. However, it can be difficult to locate relevant information in the recorded media content, particularly if it is lengthy or contains complex information. Embodiments are designed to allow a user to locate portions of a content recording quickly and easily. This makes use of the content recording more efficient and effective, and so provides a better user experience. In addition, these embodiments save battery power, memory resources, and/or compute cycles of the electronic device (for example, a mobile device), which yields significant technical advantages and technical effects.
Various embodiments are generally directed to an electronic media system arranged to automatically generate electronic bookmarks for media files. The electronic media system may allow a user to start an automatic bookmark generation operation that automatically generates electronic bookmarks for various media files produced by different media sources (for example, an audio source, a video source, an audio/video source, and so forth). For example, electronic bookmarks for a media file may be generated automatically by a software application designed to scan the media file for different types of bookmark indicators, such as a selected set of keywords that may indicate media content of particular importance or relevance to a user. Keyword scanning allows electronic bookmarks to be generated automatically and presented in, for example, a user's note-taking application, so that the user can easily view and activate selected portions of the media file to listen to the bookmarked content. In addition, electronic bookmarks may be used with speech-to-text (STT) techniques to automatically transcribe selected portions of the media file, with particular attention to the time slices of the media file in which keywords are detected.
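The keyword scan described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: it presumes a speech-to-text step has already produced words with start times, and the names `TimedWord`, `find_keyword_bookmarks`, and the `min_gap` merging rule are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TimedWord:
    """One transcribed word and the time (in seconds) at which it is spoken."""
    text: str
    start: float


def find_keyword_bookmarks(words, keywords, min_gap=5.0):
    """Return bookmark times (seconds) at which a keyword is spoken.

    A hit closer than `min_gap` seconds to the previous bookmark is skipped,
    so a burst of keywords yields a single bookmark (an assumed policy).
    """
    wanted = {k.lower() for k in keywords}
    times = []
    for word in words:
        token = word.text.lower().strip(".,!?;:")
        if token in wanted and (not times or word.start - times[-1] >= min_gap):
            times.append(word.start)
    return times


# Hypothetical transcript of a short recording.
transcript = [
    TimedWord("Welcome", 0.0),
    TimedWord("everyone.", 0.6),
    TimedWord("The", 42.0),
    TimedWord("deadline", 42.3),
    TimedWord("is", 42.9),
    TimedWord("important,", 43.1),
    TimedWord("remember", 95.0),
    TimedWord("it.", 95.5),
]
bookmark_times = find_keyword_bookmarks(
    transcript, ["deadline", "important", "remember"]
)
```

Here "deadline" and "important" fall within the same five-second window, so they produce one bookmark, and "remember" produces a second.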
In one embodiment, for example, a user interface element such as a bookmark icon may be presented as part of the user interface of a media application used to record and play back media content. Before, during, and/or after recording media content to a media file or playing media content back from a media file, the user can manually select the bookmark icon to start a bookmark generation operation. The bookmark application may scan the media file for various bookmark indicators (for example, keywords, speaker identity, speaker location) and automatically generate multiple electronic bookmarks for the media file based on the bookmark indicators. Among other information, an electronic bookmark may hold a start time at which a portion of the media file begins, an end time at which that portion ends, a user message, metadata for the media file (for example, a file name or file identifier), and/or other types of information. Electronic bookmarks may be presented in the user interface with various user interface elements, such as a bookmark annotation or representation in a document of a text-based application, a selectable icon or link, a visual indicator on a waveform of the media file, and so forth. Besides presenting an electronic bookmark, the selected portion of the media file associated with a given bookmark may be converted to text and presented together with the corresponding bookmark (for example, to assist note-taking operations). The user can then select an electronic bookmark to play media content back from the media file, beginning at the start time associated with that bookmark. This allows a user to quickly and efficiently mark and locate media content of particular interest. As a result, embodiments can improve affordability, scalability, modularity, extendibility, or interoperability for an operator, device, or network.
In one usage scenario, for example, while recording or playing audio in an application, a user can press a button to start a bookmark generation operation that bookmarks any desired moment in the audio recording for later reference. A bookmark may be represented as text in a notes section, and also as a visual marker on the audio seek bar. For example, when MICROSOFT ONENOTE is used on a mobile device (for example, a smart phone), an audio bookmark button may be presented in the upper left corner of the user interface while operating in a record or playback mode. Pressing this button may cause one or more bookmarks to be generated automatically for the media file, then place a colored marker (for example, blue) on the audio recording timeline and/or add bookmark text to the user's notes section below the audio recording timeline. During audio playback, the user interface may show small blue markers on the audio seek bar to represent the bookmarks placed for the recording. If the user clicks a bookmark in the notes, a "play from this time" button appears, allowing the user to jump directly to the point in the audio recording where the bookmark was placed.
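As a rough illustration of the seek-bar marker and the jump-to-bookmark behavior in this scenario, the Python sketch below computes where a marker would sit along the bar and moves a playback position to a bookmark. The `Player` class and both function names are hypothetical stand-ins, not part of any real media API or of the patent.

```python
def marker_fraction(bookmark_time, duration):
    """Position of a bookmark marker along the seek bar, from 0.0 to 1.0."""
    if duration <= 0:
        raise ValueError("duration must be positive")
    return min(max(bookmark_time / duration, 0.0), 1.0)


class Player:
    """Hypothetical stand-in for an audio player with a seekable position."""

    def __init__(self, duration):
        self.duration = duration
        self.position = 0.0
        self.playing = False

    def play_from(self, seconds):
        """Jump to `seconds` (clamped to the recording) and start playback."""
        self.position = min(max(seconds, 0.0), self.duration)
        self.playing = True


# A ten-minute recording with a bookmark at 2 min 30 s.
player = Player(duration=600.0)
fraction = marker_fraction(150.0, player.duration)
player.play_from(150.0)
```

A user interface could multiply `fraction` by the bar width in pixels to place the marker; that mapping is a design choice left open here.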
Previous solutions are deficient in several respects. For example, longer media recordings such as meetings, classes, interviews, and similar scenarios can be difficult to use efficiently. Often only certain portions of these recordings are of particular interest to a given audience, and there is no simple, elegant way to identify the most interesting portions of a recording for later reference. To retrieve important data stored in an audio recording, for example, a user typically has to listen to the entire recording, skip around in the recording repeatedly in an attempt to locate the key information, or manually write down the timestamps at which key information is stored. All of these solutions are time-consuming and/or labor-intensive. Audio transcription techniques may also be used, but in their current state they are generally inaccurate.
Electronic bookmarks provide several advantages over previous solutions. For example, electronic bookmarks enhance the user experience by allowing the bookmark application to quickly and efficiently mark key moments in a media recording, so that the user can jump directly to the relevant portions of the recording during later review. This makes audio recordings of various live events (for example, meetings, classes, interviews) more useful, because the user does not need to listen to the entire recording from beginning to end to find the most important portions. Instead, the user knows exactly which portions were marked for review and can jump directly to them using the button that appears when a bookmark is activated. Besides enhancing the user experience, electronic bookmarks may allow a user to find relevant information more quickly, which may lead to lower power consumption and longer battery life for various mobile devices (for example, smart phones, smart watches, tablet computers, and portable computers).
With general reference to the notations and nomenclature used herein, the detailed description that follows may be presented in terms of program procedures executed on a computer or network of computers. These procedural descriptions and representations are used by those skilled in the art to most effectively convey the substance of their work to others skilled in the art.
A procedure is here, and generally, conceived to be a self-consistent sequence of operations leading to a desired result. These operations are those requiring physical manipulations of physical quantities. Usually, though not necessarily, these quantities take the form of electrical, magnetic, or optical signals capable of being stored, transferred, combined, compared, and otherwise manipulated. It proves convenient at times, principally for reasons of common usage, to refer to these signals as bits, values, elements, symbols, characters, terms, numbers, or the like. It should be noted, however, that all of these and similar terms are to be associated with the appropriate physical quantities and are merely convenient labels applied to those quantities.
Further, the operations performed are often referred to in terms, such as adding or comparing, that are commonly associated with mental operations performed by a human operator. No such capability of a human operator is necessary, or in most cases desirable, in any of the operations described herein that form part of one or more embodiments. Rather, the operations are machine operations. Useful machines for performing the operations of various embodiments include general purpose digital computers or similar devices.
Various embodiments also relate to apparatus or systems for performing these operations. The apparatus may be specially constructed for the required purpose, or it may comprise a general purpose computer selectively activated or reconfigured by a computer program stored in the computer. The procedures presented herein are not inherently related to a particular computer or other apparatus. Various general purpose machines may be used with programs written in accordance with the teachings herein, or it may prove convenient to construct more specialized apparatus to perform the required method operations. The required structure for a variety of these machines will appear from the description given.
Reference is now made to the drawings, wherein like reference numerals are used to refer to like elements throughout. In the following description, for purposes of explanation, numerous specific details are set forth in order to provide a thorough understanding. It may be evident, however, that the novel embodiments can be practiced without these specific details. In other instances, well-known structures and devices are shown in block diagram form in order to facilitate their description. The intention is to cover all modifications, equivalents, and alternatives consistent with the claimed subject matter.
Fig. 1A shows a block diagram of a media system 100 with a bookmark application 140. In one embodiment, for example, the media system 100 and the bookmark application 140 may include various components, such as components 110, 130. As used herein, the terms "system," "application," and "component" are intended to refer to a computer-related entity comprising hardware, a combination of hardware and software, software, or software in execution. For example, a component can be implemented as a process running on a processor, a processor, a hard disk drive, multiple storage drives (of optical and/or magnetic storage medium), an object, an executable, a thread of execution, a program, and/or a computer. By way of illustration, both an application running on a server and the server can be a component. One or more components can reside within a process and/or thread of execution, and a component can be localized on one computer and/or distributed between two or more computers, as desired for a given implementation. The embodiments are not limited in this context.
In the illustrated embodiment shown in Fig. 1A, the media system 100 and the bookmark application 140 may be implemented by an electronic device. Examples of an electronic device may include, without limitation, an ultra-mobile device, mobile device, personal digital assistant (PDA), mobile computing device, smart phone, telephone, digital telephone, cellular telephone, e-book reader, mobile phone, one-way pager, two-way pager, messaging device, computer, personal computer (PC), desktop computer, laptop computer, notebook computer, netbook computer, handheld computer, tablet computer, server, server array or server farm, web server, network server, Internet server, workstation, minicomputer, mainframe computer, supercomputer, network appliance, web appliance, distributed computing system, multiprocessor system, processor-based system, consumer electronics, programmable consumer electronics, game device, television, digital television, set-top box, wearable electronics such as a smart watch, wireless access point, base station, subscriber station, mobile subscriber center, radio network controller, router, hub, gateway, bridge, switch, machine, or a combination thereof. Although the bookmark application 140 as shown in Fig. 1A has a limited number of elements in a certain topology, it will be appreciated that the bookmark application 140 may include more or fewer elements in alternative topologies, as required for a given implementation.
The components 110, 130 may be communicatively coupled via various types of communications media. The components 110, 130 may coordinate operations between each other. The coordination may involve the unidirectional or bidirectional exchange of information. For instance, the components 110, 130 may communicate information in the form of signals communicated over the communications media. The information can be implemented as signals allocated to various signal lines. In such allocations, each message is a signal. Further embodiments, however, may alternatively employ data messages. Such data messages may be sent across various connections. Exemplary connections include parallel interfaces, serial interfaces, and bus interfaces.
In the illustrated embodiment shown in Fig. 1A, the media system may include one or more media files 104-c and the bookmark application 140. It is worth noting that "a," "b," "c," and similar designators as used herein are intended to be variables representing any positive integer. Thus, for example, if an implementation sets a value of c = 5, then a complete set of media files 104-c may include media files 104-1, 104-2, 104-3, 104-4, and 104-5. The embodiments are not limited in this context.
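The designator convention can be made concrete with a small Python sketch. The helper name `expand_designator` is hypothetical and serves only to show how a label such as 104-c expands once a value is chosen for c.

```python
def expand_designator(base, count):
    """Expand a designator such as 104-c into 104-1 ... 104-count."""
    if count < 1:
        raise ValueError("count must be a positive integer")
    return [f"{base}-{i}" for i in range(1, count + 1)]


# With c = 5, the complete set of media files 104-c:
media_files = expand_designator("104", 5)
```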
The media files 104-c may include media content recorded by an analog or digital media sensor, such as a digital video recorder, digital audio recorder, digital audio/video (A/V) recorder, application program, system program, web application, web service, and so forth. The bookmark application 140 may use one or more selected media files 104-c to generate one or more electronic bookmarks 126-e and/or text segments 128-h presented by a user interface 120. In one embodiment, the user may cause the bookmark application 140 to automatically generate one or more electronic bookmarks 126-e for a media file 104-c. In an alternative embodiment, the user may manually select when one or more electronic bookmarks 126-e are generated for a media file 104-c.
The bookmark application 140 may be a stand-alone application or may be integrated with other software programs. In one embodiment, for example, the bookmark application 140 may be integrated with an operating system, such as one made by Microsoft Corporation, Redmond, Washington. In one embodiment, for example, the bookmark application 140 may be integrated with a productivity suite of interrelated client applications, server applications, and web services designed for a particular operating system, such as the MICROSOFT OFFICE productivity suite made by Microsoft Corporation, Redmond, Washington. Examples of client applications may include, without limitation, MICROSOFT WORD, MICROSOFTMICROSOFT MICROSOFT MICROSOFT MICROSOFTMICROSOFT MICROSOFT MICROSOFT PROJECT, MICROSOFT PUBLISHER, MICROSOFTWORKSPACE, MICROSOFTMICROSOFT OFFICE INTERCONNECT, MICROSOFT OFFICE PICTURE MANAGER, MICROSOFT SHAREPOINT DESIGNER, MICROSOFT LYNC, and MICROSOFT FOR BUSINESS. Examples of server applications may include, without limitation, MICROSOFT SHAREPOINT SERVER, MICROSOFT LYNC SERVER, MICROSOFT SKYPE FOR BUSINESS SERVER, MICROSOFT OFFICE FORMS SERVER, MICROSOFT OFFICESERVER, MICROSOFT OFFICE PROJECT SERVER, MICROSOFT OFFICE PROJECT PORTFOLIO SERVER, and MICROSOFT OFFICESERVER. Examples of web services may include, without limitation, MICROSOFT WINDOWS MICROSOFT OFFICE WEB APPLICATIONS, MICROSOFT OFFICE LIVE, MICROSOFT LIVE MEETING, MICROSOFT OFFICE PRODUCT WEB SITE, MICROSOFT UPDATE SERVER, and MICROSOFT OFFICE 365. The embodiments are not limited to these examples.
Among other components, the bookmark application 140 may include a media file component 110 and a media bookmark component 130. The media file component 110 may generally be used to manage media files 104, for example by recording, playing back, modifying, storing, or identifying a media file 104. The media bookmark component 130 may generally be used to manage electronic bookmarks 126 for media files 104, for example by generating and/or detecting bookmark indicators for electronic bookmarks 126, generating electronic bookmarks 126, generating text segments 128 associated with electronic bookmarks 126, presenting electronic bookmarks 126 and/or the associated text segments 128, activating electronic bookmarks 126, modifying electronic bookmarks 126, and so forth. An electronic bookmark 126 may include various types of information for identifying a particular location within a media file 104. This information may include temporal information, such as time information 106-d associated with each of the media files 104-c, spatial information (for example, a visual marker on an audio waveform), or other types of marker information. In one embodiment, for example, the media bookmark component 130 may use the time information 106 to generate a bookmark 126 that includes a first timestamp representing the start time of a media file segment of the media file 104, a second timestamp representing the end time of that media file segment, and/or an identifier of the media file 104. The embodiments are not limited to this example.
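One possible shape for such a bookmark record is sketched below in Python. The `Bookmark` fields mirror the start timestamp, end timestamp, and media file identifier named in the text; how the segment bounds are chosen is not specified in the source, so the `lead` and `length` parameters of the hypothetical `make_bookmark` helper are assumptions made purely for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Bookmark:
    """A bookmark identifying one segment of a media file."""
    media_file_id: str  # identifier of the media file
    start: float        # first timestamp: segment start, in seconds
    end: float          # second timestamp: segment end, in seconds


def make_bookmark(media_file_id, hit_time, duration, lead=2.0, length=30.0):
    """Build a bookmark around `hit_time`, clamped to the recording.

    `lead` (seconds of context before the hit) and `length` (segment
    length) are assumed defaults, not values taken from the patent.
    """
    start = max(hit_time - lead, 0.0)
    end = min(start + length, duration)
    return Bookmark(media_file_id, start, end)


# A hit 42.5 s into a 60 s recording: the segment is cut off at the end.
bm = make_bookmark("104-1", hit_time=42.5, duration=60.0)
```

Making the record immutable (`frozen=True`) is a design choice that keeps a stored bookmark from being changed accidentally; a modifiable bookmark would simply drop that flag.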
In one embodiment, for example, the media file component 110 may be arranged to provide a presentation surface 122 for the user interface 120. Among other elements, the presentation surface 122 may include a bookmark icon 124, one or more media file icons 125-a representing corresponding media files 104-c, and various bookmarks 126-1, 126-2 ... 126-e for the media files 104-c.
The bookmark application 140 is generally operative to create media bookmarks (for example, audio bookmarks, video bookmarks) and to start playback based on those bookmarks. This can be particularly useful for audio recording or playback performed alongside a note-taking scenario. In one embodiment, when the user presses a particular user interface element such as an "audio bookmark" button, the bookmark application 140 may start an automatic bookmark generation operation to generate multiple electronic bookmarks 126 for a media file 104, and insert an entry for each electronic bookmark 126 (for example, as a bookmark annotation) in a notes section, indicating the timestamp and the associated media file 104. In another embodiment, when the user presses a particular user interface element such as an "audio bookmark" button, the bookmark application 140 may generate a single electronic bookmark 126 for the media file 104 and insert a bookmark annotation in the notes section indicating the timestamp and the associated media file 104. In addition, a marker may be displayed on the audio seek bar to represent the bookmark visually. By selecting an audio bookmark, the user can start audio playback from the point in the recording where the bookmark was placed, which makes it easy to go back and review key moments in the audio recording. In this way, the user does not need to type extensive notes, write down timestamps by hand, listen to the entire recording, or skip around in the recording in search of key moments.
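The single-bookmark case, in which one button press adds an annotation to the notes, might look like the following Python sketch. The annotation format, the file name `kickoff.m4a`, and the function names are all hypothetical; the source says only that the annotation indicates a timestamp and the associated media file.

```python
def format_timestamp(seconds):
    """Render seconds as M:SS, or H:MM:SS for an hour or more."""
    total = int(seconds)
    hours, rem = divmod(total, 3600)
    minutes, secs = divmod(rem, 60)
    if hours:
        return f"{hours}:{minutes:02d}:{secs:02d}"
    return f"{minutes}:{secs:02d}"


def add_bookmark_note(notes, media_file_name, position_seconds):
    """Append a bookmark annotation for the current position to the notes."""
    line = f"[Bookmark {format_timestamp(position_seconds)}] {media_file_name}"
    notes.append(line)
    return line


# The user presses the bookmark button 754.2 s into the recording.
notes = ["Project kickoff meeting"]
added = add_bookmark_note(notes, "kickoff.m4a", 754.2)
```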
As shown in Fig. 1A, the bookmark application 140 may include the media file component 110 to manage media files 104. The bookmark application 140 may also include the media bookmark component 130, operatively coupled to the media file component 110, to present the bookmark icon 124 for a media file 104 on the user interface 120. The media bookmark component 130 may detect activation of the bookmark icon 124 (for example, via an input device such as a pointer, touch screen, or voice command) and, in response to the activation, generate multiple electronic bookmarks 126 for the media file 104 based on the time information 106 of the media file 104. The media bookmark component 130 may detect activation of the bookmark icon 124 before, after, or during a record operation or playback operation for the media content of the media file 104. Example user interface views demonstrating this feature are shown in Figs. 2-4.
The bookmark icon 124 is one way for a user to start the creation of one or more electronic bookmarks 126. However, other user interface elements may also be used to start the creation of bookmarks 126. For example, other graphical or visual representations may be used in place of the bookmark icon 124, including images, animations, radio buttons, and so on. Traditional menu items and keyboard shortcuts may also be used to create bookmarks 126. Furthermore, bookmarks 126 may be created based on tactile engagement with the touch screen interface of a touch screen display, such as certain swipe patterns (e.g., left to right), tap patterns (e.g., a double tap), and so on. The particular trigger for creating a bookmark 126 may vary according to the implementation, and embodiments are not limited in this context.
Fig. 1B shows a block diagram of the media system 100 with the bookmark application 140, where the bookmark application 140 has additional components for automatically generating multiple electronic bookmarks 126 for the media file 104 in response to a single activation of the bookmark icon 124.
Fig. 1B shows logic, at least a portion of which is implemented in hardware, arranged to control the bookmark application 140 to manage bookmarks 126 for a media file 104 that stores media content. In one embodiment, the bookmark application 140 may include the media file component 110 operatively coupled to the media bookmark component 130. The media file component 110 can manage the media file 104. The media bookmark component 130 can identify a media file 104 of media information, scan the media file 104 for bookmark indicators 132, automatically generate bookmarks 126 for the media file 104 based on the bookmark indicators 132, and present the bookmarks 126 on the user interface 120.
As previously described, the bookmark icon 124 may be arranged to start different sets of bookmark operations depending on the operating mode of the bookmark application 140. The bookmark application 140 can be configured for a manual mode, an automatic mode, or in some cases both. In manual mode, the user can selectively activate the bookmark icon 124 while recording or playing the media file 104, generating a single bookmark 126 for each activation of the bookmark icon 124. This is desirable, for example, when the user wants to create each bookmark 126 for the media file 104 manually. In automatic mode, the user can activate the bookmark icon 124 once to start an automatic bookmark generation operation that generates multiple bookmarks 126 for the entire media file 104. The user can activate the bookmark icon 124 before, after, or during recording or playback of the media file 104. The bookmark application 140 can scan the media file 104 for bookmark indicators 132-r and automatically generate one or more bookmarks 126 based on the bookmark indicators 132-r. When the bookmark application 140 is arranged to operate in both manual and automatic modes, it may generate multiple bookmarks 126 for the media file 104 in response to a single user control directive, and it may generate a single bookmark 126 for the corresponding (or a different) media file 104 in response to each user control directive. Thus, the operations associated with the bookmark icon 124 may vary depending on whether the bookmark application 140 is arranged for manual mode or automatic mode.
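The difference between the two modes can be sketched in a few lines. This is only an illustrative sketch, not the claimed implementation: the names `Mode` and `on_bookmark_icon_activated` are hypothetical, and the scan for bookmark indicators is assumed to have already produced a list of times.

```python
from enum import Enum


class Mode(Enum):
    """Hypothetical operating modes of the bookmark application."""
    MANUAL = "manual"
    AUTOMATIC = "automatic"


def on_bookmark_icon_activated(mode, current_time, indicator_times):
    """Return the bookmark times produced by one activation of the icon.

    current_time: position in the media file (seconds) at activation.
    indicator_times: times (seconds) at which a scan of the media file
    found bookmark indicators; assumed to be computed elsewhere.
    """
    if mode is Mode.MANUAL:
        # Manual mode: one activation yields exactly one bookmark.
        return [current_time]
    # Automatic mode: one activation yields a bookmark per indicator.
    return sorted(indicator_times)
```

In this sketch a single activation at 12 seconds yields one bookmark in manual mode, while in automatic mode the same activation yields one bookmark per detected indicator, regardless of where in the file the user was.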
When in automatic mode, the bookmark application 140 can use additional components to scan the media file 104 and automatically generate one or more bookmarks 126 for the media file 104 based on the scan results. In one embodiment, the bookmark application 140 can scan the media file 104 for one or more bookmark indicators 132-r.
A bookmark indicator 132 may include any information suitable for indicating when the bookmark application 140 should generate a bookmark 126. The information may be contained in the media content stored by the media file 104, such as a portion of audio information, a portion of audio information converted to text information, a portion of video information, a portion of combined audio/video information, object information, and so on. The information may also include metadata associated with the media file 104, such as time information, date information, speaker identity information, location information, room information, calendar information, application information, system information, device information, network information, wireless information, module information, peripheral device information, connected devices, teleconference information, bridge information, device information for devices in the room used to record the media content, and so on. In one embodiment, a bookmark indicator 132-1 may include or be implemented as one or more keywords 134. In one embodiment, a bookmark indicator 132-2 may include or be implemented as one or more identities 136. These are two examples of bookmark indicators 132; other bookmark indicators 132 (and associated components) may be used for a given implementation. Embodiments are not limited in this context.
As shown in Fig. 1B, for example, the bookmark application 140 may also include a speech-to-text (STT) component 150 and a speech recognition component 160. The bookmark application 140 can use the STT component 150 and the speech recognition component 160 to detect bookmark indicators 132-1, 132-2, respectively. Other components may be implemented to detect other types of bookmark indicators 132. Embodiments are not limited in this context.
The STT component 150 can be used to detect bookmark indicators 132-1 in the form of keywords 134. The STT component 150 can be operatively coupled to the media file component 110 and/or the media bookmark component 130. The STT component 150 can be arranged to receive audio information from the media file 104, convert the audio information to text information, and output the text information for use by the media bookmark component 130.
The STT component 150 can implement any standard STT technique to convert human speech from audio form to text form. The STT component 150 can receive audio information from the media file 104, detect human speech in the form of words or sentences in the audio information, and convert the human speech from audio to text. The STT component 150 can perform STT conversion operations on a periodic, on-demand, or continuous basis. Different STT components 150 can be implemented to handle various types of human speech, including different languages, dialects, accents, vocabularies, geographic locations, and so on. The converted text can be stored temporarily or persistently in a data structure for access by the media bookmark component 130. The converted text can be stored together with various types of metadata, such as time information marking when the audio information corresponding to the converted text was spoken in the media file 104, a media file identifier, and so on. Additionally or alternatively, the converted text can be streamed to the media bookmark component 130 to allow real-time or near-real-time automatic bookmark generation operations.
The media bookmark component 130 can access the converted text from the data structure in which it is stored, or access converted text streamed in real time from the STT component 150. The media bookmark component 130 can then attempt to detect one or more keywords 134 in the text information as bookmark indicators 132-1. For example, the media bookmark component 130 can compare the text information with a list of keywords 134 to detect a match. When there is a match, the media bookmark component 130 can generate a bookmark 126 with associated metadata, such as a media file identifier and/or time information.
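The keyword comparison described above can be illustrated with a short sketch. It assumes the converted text is available as a list of `(start_seconds, text)` segments; that layout, the `Bookmark` record, and the function name are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass
class Bookmark:
    """Hypothetical bookmark record with the metadata named in the text."""
    media_file_id: str
    start_seconds: float
    indicator: str


def bookmarks_from_keywords(transcript, keywords, media_file_id):
    """Generate one bookmark per transcript segment that contains a keyword.

    transcript: list of (start_seconds, text) pairs, an assumed layout
    for the converted text and its time information.
    """
    wanted = [k.lower() for k in keywords]
    found = []
    for start, text in transcript:
        lowered = text.lower()
        for keyword in wanted:
            # Plain substring test; a real matcher would likely respect
            # word boundaries so that "exam" does not match "example".
            if keyword in lowered:
                found.append(Bookmark(media_file_id, start, keyword))
                break  # at most one bookmark per segment
    return found
```

A streaming variant would apply the same test to each segment as it arrives from the STT component rather than iterating over a stored list.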
A keyword 134 can include any particular word or phrase that indicates an important or relevant moment in the media file 104. Examples of keywords 134 include, but are not limited to, first names, last names, topics, sentences, questions, times, dates, descriptors, or any other keyword suitable for a given media file 104. Different sets of keywords 134 can be used for different types of media files 104.
In one embodiment, keywords 134 can be selected by subject or specific topic. For example, if the media file 104 is a recording of a class lecture on a particular subject or topic, a set of keywords 134 suited to that subject or topic can be used for that media file 104. If the subject or topic is calculus, for example, the set of keywords 134 may include terms such as "equation", "derivative", or "differential". If the subject or topic is biology, the set of keywords 134 may include terms such as "genus", "species", or "genetics". If the media file 104 is a recording of a business meeting, a set of keywords 134 suited to a business setting can be used for that media file 104, such as "profit", "expense", or "accrual".
In one embodiment, keywords 134 can be selected by type of syntax or vocabulary. For example, words that imply importance, such as "important", "strategy", or "urgent", can be used as bookmark indicators 132-1. Phrases such as "this will be on the test" or "this is a high-priority item" can also be used.
In one embodiment, the media bookmark component 130 may search for explicit matches between the converted text and the keywords 134. In another embodiment, the media bookmark component 130 may search for implicit matches between the converted text and the keywords 134. For example, the media bookmark component 130 can implement fuzzy logic similar to search engine keyword logic, examining groupings of words to infer meaning. The media bookmark component 130 may then generate a bookmark 126 based on the inferred meaning.
In addition to using bookmark indicators 132-1, the media bookmark component 130 can also be used to create one or more text segments 128 around or near a bookmark 126, to be inserted into the presentation surface 122 together with the bookmark 126. For example, assume the STT component 150 stored previously converted text for the media file 104 in a data structure during a transcription operation performed for keyword detection. Once a keyword 134 is detected, the media bookmark component 130 can generate a bookmark 126. In addition, the media bookmark component 130 can retrieve, based on the bookmark 126, a text segment 128 corresponding to a defined portion of the audio information of the media file. For example, the media bookmark component 130 can receive an audio length parameter representing a defined length of audio occurring before, after, or during the time associated with the bookmark 126. The media bookmark component 130 can use the time information stored with the converted text to retrieve from the data structure the converted text corresponding to the audio length parameter. The media bookmark component 130 can then present the bookmark 126 and the associated text segment 128 on the presentation surface 122 of the application.
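Retrieving a text segment from stored converted text can be sketched as a time-window filter. The sketch assumes the same `(start_seconds, text)` layout for the stored transcript, and splits the audio length parameter into hypothetical `before_seconds` and `after_seconds` values; neither choice is specified by the description.

```python
def text_segment(transcript, bookmark_seconds, before_seconds, after_seconds):
    """Return converted text spoken within a window around a bookmark.

    transcript: list of (start_seconds, text) pairs (assumed layout).
    before_seconds / after_seconds: hypothetical split of the audio
    length parameter into time before and after the bookmark.
    """
    window_start = bookmark_seconds - before_seconds
    window_end = bookmark_seconds + after_seconds
    # Keep segments whose start time falls inside the window, in order.
    parts = [text for start, text in transcript
             if window_start <= start <= window_end]
    return " ".join(parts)
```

Setting `before_seconds` to zero gives a segment that starts at the bookmark, while a non-zero value also captures the lead-up to the detected keyword.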
In cases where the STT component 150 has not previously converted text for the media file 104, or has not persistently stored previously converted text for the media file 104 in a data structure (e.g., in streaming mode), the media bookmark component 130 can send a bookmark identifier and an audio length parameter associated with the bookmark identifier to the STT component 150. Based on the bookmark identifier and the defined length specified by the audio length parameter, the STT component 150 can convert a defined portion of the audio information from the media file 104 into a defined set of text information (e.g., a text segment 128) and output the defined set of text information for use by the media bookmark component 130. The media bookmark component 130 can then present the bookmark 126 and the defined set of text information on the presentation surface 122 of the application.
The speech recognition component 160 can be used to detect bookmark indicators 132-2 in the form of one or more identities 136. The speech recognition component 160 can be operatively coupled to the media file component 110 and/or the media bookmark component 130. The speech recognition component 160 can be arranged to receive audio information from the media file 104, perform speech recognition to determine identity information for the source of the audio information, and output the identity information for use by the media bookmark component 130.
In some cases, it may be useful to generate bookmarks 126 based on the particular individual who is speaking. For example, assume the media file 104 is a recording of a business meeting with multiple individuals working on a business project. It may be desirable to generate a bookmark 126 whenever the project leader is speaking. The speech recognition component 160 can analyze the media file 104 to detect the identity of each speaker in the meeting. For example, the speech recognition component 160 can compare speech samples from a speaker with a library of voice clips, each voice clip containing audio information for a particular individual. Alternatively, the speech recognition component 160 can use contextual information, such as particular keywords 134 or key phrases 134, to infer the identities of the various speakers in the audio recording. The speech recognition component 160 can store the identities of all speakers detected in the media file 104, and output the identity information for each speaker, together with time information about when each identity speaks, to a data structure. Alternatively, the speech recognition component 160 can stream such information directly to the media bookmark component 130 for real-time or near-real-time operation.
The media bookmark component 130 can detect an identity in the identity information as a bookmark indicator 132-2. The media bookmark component 130 can compare the identity information with a set of identities 136. When there is a match, the media bookmark component 130 can automatically generate a bookmark 126. In addition, the media bookmark component 130 can automatically generate a text segment 128 for the bookmark 126. The media bookmark component 130 can present the bookmark 126 and the text segment 128 on the presentation surface 122 of the application.
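Identity-based bookmark generation can be sketched in the same way as keyword matching. The sketch below assumes speaker recognition has already produced time-ordered `(start_seconds, speaker)` pairs, and it places a bookmark each time a tracked speaker begins a new turn; both the input layout and the "new turn" rule are illustrative assumptions rather than details from the description.

```python
def bookmarks_from_identities(segments, tracked_identities):
    """Return (start_seconds, speaker) pairs where a tracked speaker starts talking.

    segments: time-ordered (start_seconds, speaker) pairs, assumed to be
    the output of a separate speaker recognition step.
    tracked_identities: set of identities that should trigger bookmarks.
    """
    bookmarks = []
    previous_speaker = None
    for start, speaker in segments:
        # Bookmark only the start of a turn, not every consecutive
        # segment spoken by the same tracked speaker.
        if speaker in tracked_identities and speaker != previous_speaker:
            bookmarks.append((start, speaker))
        previous_speaker = speaker
    return bookmarks
```

For the business meeting example, passing the project leader's identity as the only tracked identity would yield one bookmark per turn taken by the project leader.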
It will be appreciated that bookmark indicators 132-1, 132-2 are presented by way of example and not limitation. Other bookmark indicators 132 can also be implemented. For example, other bookmark indicators 132 may be based on speaker stress, speaker language, speaker gender, vocabulary, syntax, semantics, devices, meeting rooms, bridge information, device information, and so on. Embodiments are not limited in this context.
Fig. 2 shows a user interface view 200. The user interface view 200 shows a user interface view of an exemplary application, such as MICROSOFT ONENOTE. MICROSOFT ONENOTE provides a set of features that allow a user to record and play audio while taking notes, for example during a lecture, interview, or conversation. Although various embodiments may be described using MICROSOFT ONENOTE, it will be appreciated that the same or similar concepts can be implemented using other software products.
As shown in Fig. 2, the user interface view 200 includes a contextual function bar 202 with a set of user interface controls 204-f for the media file component 110 of the bookmark application 140, which is integrated with another application (in this case MICROSOFT ONENOTE). The user interface controls 204 may include various controls for managing a media file, such as an icon to record media content such as audio to a media file, an icon to stop recording media content to the media file, an icon to play back media content from the media file, an icon to pause recording media content to the media file, an icon to rewind the media content of the media file by some time period (e.g., 15 seconds), and an icon to advance the media content of the media file by some time period (e.g., 15 seconds). The contextual function bar 202 may include other user interface elements related to recording audio, such as status indicators, level indicators, and slider graphics. The particular user interface elements of the contextual function bar 202 and the user interface controls 204 may change according to various states of the application, such as whether audio is currently playing, paused, or recording; whether an audio clip is currently selected; and/or the page the user is currently viewing, among other factors. In user interface view 200, the media file component 110 operates in a recording mode 206, indicating that the media file component 110 is recording media content to the media file 104.
The function bar 202 can also include the bookmark icon 124 and the presentation surface 122. The presentation surface 122 can be used to record, store, and present electronic notes in an electronic notebook. For example, a user can enter notes in the presentation surface 122 while simultaneously using the user interface controls 204 to record audio from a lecture. Before, after, or during the recording mode 206, the user can activate the bookmark icon 124 to start an automatic bookmark generation operation that generates multiple electronic bookmarks 126 for the media file 104. For example, when the bookmark application 140 executes on a portable device with a touch screen display, such as a smartphone or tablet computer, the media bookmark component 130 can detect activation of the bookmark icon 124 based on tactile engagement with the touch screen interface of the touch screen display. Alternatively, the user can select and activate the bookmark icon 124 using an input device such as a mouse pointer, touch pad, or stylus button.
Fig. 3 shows a user interface view 300. The user interface view 300 is similar to user interface view 200 in that it shows a user interface view for an exemplary application such as MICROSOFT ONENOTE. In user interface view 300, the media file component 110 operates in a play mode 208, indicating that the media file component 110 is playing back media content from the media file 104. Before, after, or during the play mode 208, the user can activate the bookmark icon 124 to start an automatic bookmark generation operation that generates electronic bookmarks 126 for the media file 104.
Fig. 4 shows a user interface view 400. The user interface view 400 is similar to user interface views 200, 300 in that it shows a user interface view for an exemplary application such as MICROSOFT ONENOTE. In user interface view 400, the media file component 110 operates in a standby mode 210, indicating that the media file component 110 has paused recording media content to, or playing back media content from, the media file 104. During the standby mode 210, the bookmark icon 124 can be rendered inactive by graying it out, which prevents the user from selecting it.
Fig. 5A shows a user interface view 500. As shown in user interface view 500, the media file component 110 can be arranged to provide the presentation surface 122 for the user interface 120. Among other elements, the presentation surface 122 can include a media file icon 125-1 representing a media file 104-1 (not shown) containing media content in the form of audio content, for example audio content from a computer science lecture given on Monday, May 4, 2015. The audio recording of the lecture was made on Monday, May 4, 2015, at 1:53 PM. The title of media file 104-1 is "Lecture 1".
In addition to the media file icon 125-1, the presentation surface 122 also includes various notes 502-s associated with the lecture, presented as text in various portions of the presentation surface 122. The user can generate the notes 502, for example, during the recording mode 206 or play mode 208 for the media file 104-1. At some point before, after, or during the recording mode 206 or play mode 208, the user can select and activate the bookmark icon 124 (not shown) to automatically generate multiple electronic bookmarks 126 for the media file 104-1, such as bookmarks 126-1, 126-2. The media bookmark component 130 can present bookmarks 126-1, 126-2 as part of the presentation surface 122 at various positions within the presentation surface 122. In one embodiment, the media bookmark component 130 can select a position based on particular criteria, such as a position near the notes 502 taken around the times associated with bookmarks 126-1, 126-2, respectively. Alternatively, the media bookmark component 130 can present the bookmarks 126 in a list on the presentation surface 122 or on another presentation surface separate from the presentation surface 122. The particular position at which a bookmark 126 is presented can vary according to a given implementation, and embodiments are not limited in this context.
The media bookmark component 130 can present bookmarks 126-1, 126-2 in a defined format. In one embodiment, for example, the defined format may include the following format:
<bookmark identifier> <"placed for ... at"> <media file name> <start time>
For example, if the user activates the bookmark icon 124 at 28 seconds into the audio track, the media bookmark component 130 can use the defined format given above to generate bookmark 126-1 as "Bookmark 1 placed for Lecture 1 at 0.28". Similarly, if the user activates the bookmark icon 124 at 1 minute 36 seconds into the audio track, the media bookmark component 130 can use the defined format given above to generate bookmark 126-2 as "Bookmark 2 placed for Lecture 1 at 1.36". The particular format used to present a bookmark 126 can vary according to a given implementation, and embodiments are not limited in this context.
In various embodiments, a bookmark 126 can include a replay icon 504-h. A replay icon 504 can be activated to reproduce media content from the media file 104 at the start time stored by the bookmark 126. As shown in user interface view 500, bookmarks 126-1, 126-2 can each have a corresponding replay icon 504-1, 504-2, respectively. Replay icon 504-1 can be activated to reproduce media file 104-1 at a first timestamp represented by the time information 106-1 of media file 104-1, which in this case is time 0.28. Replay icon 504-2 can be activated to reproduce media file 104-1 at a second timestamp represented by the time information 106-1 of media file 104-1, which in the case of bookmark 126-2 is time 1.36.
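The bookmark labels and start times used in these examples (0.28 for 28 seconds, 1.36 for 1 minute 36 seconds) can be produced by a small formatting helper. This is a sketch under assumptions: the function name is hypothetical, and the English wording "placed for ... at" is one reading of the defined format, which an implementation may word differently.

```python
def format_bookmark(index, media_file_name, start_seconds):
    """Render a bookmark label with a minutes.seconds start time.

    The label wording is an assumed reading of the defined format;
    only the identifier, media file name, and start time are required.
    """
    minutes, seconds = divmod(int(start_seconds), 60)
    start_time = f"{minutes}.{seconds:02d}"
    return f"Bookmark {index} placed for {media_file_name} at {start_time}"


# Usage: the two bookmarks from the Lecture 1 example.
print(format_bookmark(1, "Lecture 1", 28))
print(format_bookmark(2, "Lecture 1", 96))
```

Storing the start time as a number of seconds and formatting it only for display keeps the value directly usable when a replay icon seeks to that position.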
In one embodiment, replay icons 504-1, 504-2 can be presented persistently together with bookmarks 126-1, 126-2, respectively. In one embodiment, replay icons 504-1, 504-2 can be presented in response to certain events, such as when the user hovers a pointer over bookmarks 126-1, 126-2. Embodiments are not limited to these examples.
The media file component 110 can control playback operations for the media file 104, and reproduce the media content of the media file 104 based on the bookmarks 126 generated for the media file 104. For example, the media file component 110 can control playback operations for media file 104-1, and reproduce the media content of media file 104-1 based on activation of the replay icons 504-1, 504-2 associated with bookmarks 126-1, 126-2 (through an output device such as a speaker, through text generated by audio transcription, and so on).
Fig. 5B shows a user interface view 550. Like user interface view 500, user interface view 550 shows the media file component 110 arranged to provide the presentation surface 122 of the user interface 120. Among other elements, the presentation surface 122 includes a media file icon 125-1 representing a media file 104-1 (not shown) containing media content in the form of audio content from a computer science lecture given on Monday, May 4, 2015. The audio recording of the lecture was made on Monday, May 4, 2015, starting at 1:53 PM. The title of media file 104-1 is "Lecture 1". In addition, among other elements, the presentation surface 122 can include a media file icon 125-2 representing a media file 104-2 (not shown) containing media content in the form of audio content, for example from a computer science lecture given on Monday, May 4, 2015. The audio recording of that lecture was made on Monday, May 4, 2015, starting at 4:00 PM, after Lecture 1. The title of media file 104-2 is "Lecture 2".
User interface view 550 shows a case in which multiple media files 104-1, 104-2 are associated with a single presentation surface 122, with bookmarks 126 generated and presented for each media file 104-1, 104-2. As previously described with reference to user interface view 500, the media bookmark component 130 can generate and present a pair of bookmarks 126-1, 126-3. Bookmark 126-1 can be a bookmark for media file 104-1, titled "Lecture 1". Activating replay icon 504-1 causes media content from media file 104-1 to be reproduced from start time 0.28. Bookmark 126-3 can be a bookmark for media file 104-2, titled "Lecture 2". Activating replay icon 504-3 causes media content from media file 104-2 to be reproduced from start time 0.15.
Fig. 5C shows a user interface view 580. Like user interface views 500, 550, user interface view 580 shows the media file component 110 arranged to provide the presentation surface 122 of the user interface 120. Among other elements, the presentation surface 122 includes a media file icon 125-1 representing a media file 104-1 (not shown) containing media content in the form of audio content, for example from a computer science lecture given on Monday, May 4, 2015. The audio recording of the lecture was made on Monday, May 4, 2015, starting at 1:53 PM. The title of media file 104-1 is "Lecture 1". In addition, among other elements, the presentation surface 122 can include a media file icon 125-2 representing a media file 104-2 (not shown) containing media content in the form of audio content, for example from a computer science lecture given on Monday, May 4, 2015. The audio recording of that lecture was made on Monday, May 4, 2015, starting at 4:00 PM, after Lecture 1. The title of media file 104-2 is "Lecture 2".
User interface view 580 shows an embodiment in which a text segment 128-1 is presented for bookmark 126-1. Text segment 128-1 represents text information converted from a defined portion of the audio information stored in media file 104-1. As with bookmark 126-1, the media bookmark component 130 can present text segment 128-1 as part of the presentation surface 122 at various positions within the presentation surface 122. In one embodiment, the media bookmark component 130 can select a position based on particular criteria, such as proximity to bookmark 126-1 or to the notes 502 taken near bookmark 126-1. As shown in Fig. 5C, text segment 128-1 can be placed directly below bookmark 126-1. Alternatively, the media bookmark component 130 can present bookmark 126-1 and/or text segment 128-1 in a list on the presentation surface 122 or on another presentation surface separate from the presentation surface 122. The particular position at which text segment 128-1 is presented can vary according to a given implementation, and embodiments are not limited in this context.
Fig. 6 shows a user interface view 600. Like user interface views 500, 550, user interface view 600 shows the media file component 110 arranged to provide the presentation surface 122 of the user interface 120. Among other elements, the presentation surface 122 can include a media file icon 125-3 representing a media file 104-1 (not shown) containing media content in the form of audio content, for example from a computer science lecture given on Monday, May 4, 2015. The audio recording of the lecture was made on Monday, May 4, 2015, starting at 1:53 PM. The title of media file 104-1 is "Lecture 1".
User interface view 600 shows a case in which a different type of media file icon 125 is used to represent media file 104-1 visually. User interface view 600 includes a media file icon 125 presented as an audio waveform or audio seek bar. In addition to, or as an alternative to, the replay icons 504-1, 504-2 associated with bookmarks 126-1, 126-2, respectively, replay icons 504-3, 504-4 can be presented as time period separators overlaid on the audio waveform of the media file icon 125. The user can then select and activate either of replay icons 504-1, 504-3 to start a playback operation for bookmark 126-1. Similarly, the user can select and activate either of replay icons 504-2, 504-4 to start a playback operation for bookmark 126-2. This can enhance the user experience and simplify bookmark activation, because the user does not need to scroll down a lengthy presentation surface 122 to activate the playback operation for a particular bookmark 126.
The media file icon 125 can be generated in a number of different ways. For example, during audio recording, a waveform can be displayed near the top of the screen, scrolling from right to left and representing the audio content recorded by the microphone. The waveform serves several purposes. First, it acts as an input level meter, so the user can determine whether the audio being recorded is too loud or too quiet. Second, when the user adds a bookmark or other audio sync point, a marker is drawn on the waveform to show that the new bookmark 126 has been linked to the audio recording.
One embodiment defines an exemplary visual design of media file icon 125 for a mobile device running a mobile operating system. The visual design may include several basic attributes. When recording first starts, most of the screen space reserved for the waveform will be blank. During the first few seconds of recording, the waveform fills in from right to left until the entire blank space is filled, and then continues to scroll in that direction as recording proceeds. The full width of the waveform can correspond to a defined span of the audio recording. For example, the defined span can be selected or adjusted to match a particular device or application, such as the voice memo recording application of a specific mobile device (e.g., a span of 5 seconds). Because a smartphone has less horizontal screen space available for presentation, the defined span of the waveform may use a somewhat shorter time span (e.g., 4 seconds). The waveform shows only a single audio channel (Y >= 0). When the audio recording is mono, screen real estate can be saved by not showing the portion of the waveform below the X axis. At full width, the waveform can comprise a series of about 80 vertical bars of equal width. Because the full audio waveform spans a period of 4 seconds, each bar corresponds to about 0.05 seconds of audio, so the loudness of the audio is sampled every 0.05 seconds. The height of each bar represents the loudness of the audio and is determined by the value returned by the averagePowerForChannel method of the iOS AVAudioRecorder class at the given time. A maximum-height bar represents an averagePowerForChannel return value >= 0 dB, and a zero-height bar represents a return value <= -160 dB. When a bookmark 126 or other audio sync point is added, the corresponding bar on the audio waveform changes to a different color to indicate that the new bookmark 126 has been linked to the audio recording. This color change can be accompanied by an animation to draw attention to it.
Beside the waveform, a time counter can be presented to show the current length of the audio recording. The counter starts at 0:00 and updates every second to show the new elapsed time. As the recording length grows, more digits are added to the time counter as needed: another digit is added at the 10-minute mark to display 10:00, a digit and a colon are added at the 1-hour mark to display 1:00:00, and a digit is added at the 10-hour mark to display 10:00:00. The maximum recording size determines the total number of digits needed. It should be appreciated that this is only one possible visual design for media file icon 125, and the details of the particular visual design of the waveform can vary based on a given implementation.
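The waveform and counter arithmetic described above can be illustrated with a minimal Python sketch. It is not the patented implementation: the function names, the 40-pixel maximum height, and the linear mapping from decibels to bar height are assumptions made for illustration. Only the 80 bars, the 4-second span, the -160 dB to 0 dB range, and the counter formats come from the description.

```python
from collections import deque

BAR_COUNT = 80          # bars across the full waveform width (from the text)
SPAN_SECONDS = 4.0      # time span covered by the full width (from the text)
SAMPLE_INTERVAL = SPAN_SECONDS / BAR_COUNT  # seconds of audio per bar

def bar_height(power_db: float, max_height: int = 40) -> int:
    """Map a power reading in dB to a bar height in pixels.

    Readings at or above 0 dB give the maximum height and readings at or
    below -160 dB give zero height. The linear mapping in between is an
    assumption; the description only fixes the two end points.
    """
    clamped = min(0.0, max(-160.0, power_db))
    return round((clamped + 160.0) / 160.0 * max_height)

def format_counter(elapsed_seconds: int) -> str:
    """Format elapsed time as M:SS, MM:SS, H:MM:SS or HH:MM:SS."""
    hours, remainder = divmod(elapsed_seconds, 3600)
    minutes, seconds = divmod(remainder, 60)
    if hours:
        return f"{hours}:{minutes:02d}:{seconds:02d}"
    return f"{minutes}:{seconds:02d}"

# A fixed-length buffer drops the oldest bar once the width is filled,
# which gives the scrolling behavior once 80 samples have arrived.
bars = deque(maxlen=BAR_COUNT)
for reading in (-160.0, -80.0, 0.0):
    bars.append(bar_height(reading))
```

Digits are added to the counter only when needed, so `format_counter(600)` yields `10:00` and `format_counter(3600)` yields `1:00:00`.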
Fig. 7A shows user interface view 700. User interface view 700 shows a more detailed user interface view of an exemplary application, such as MICROSOFT ONENOTE, operating on a mobile device such as a smartphone. In addition, user interface view 700 shows a user interface configuration suitable for use during recording mode 206.
As shown in user interface view 700, mobile device 702 can include user interface 120 for presenting various user interface elements of MICROSOFT ONENOTE. User interface 120 can include presentation surface 122 to present bookmark icon 124, media file icons 125-4 and 125-5, notes 502 and bookmark 126-4. Media file icon 125-4 is implemented as an audio seek bar, with bookmarks 126-5, 126-6 overlaid at the time periods corresponding to the particular times at which bookmarks 126-5, 126-6 were created. Bookmarks 126-5, 126-6 can be the same as or different from other bookmarks on presentation surface 122, such as bookmark 126-4. Alternatively, bookmarks 126-5, 126-6 can be playback icons 504 for other bookmarks on presentation surface 122. The embodiments are not limited in this context.
Fig. 7B shows user interface view 750. User interface view 750 shows a more detailed user interface view of an exemplary application (e.g., MICROSOFT ONENOTE) operating on a mobile device such as a smartphone. In addition, user interface view 750 shows a user interface configuration suitable for use during play mode 208.
As shown in user interface view 750, mobile device 702 can include user interface 120 for presenting various user interface elements of MICROSOFT ONENOTE. User interface 120 can include presentation surface 122 to present media file icon 125-6, notes 502 and various bookmarks 126-7, 126-8, 126-9 and 126-10. Media file icon 125-6 is implemented as an audio seek bar, with bookmarks 126-7, 126-8, 126-9 and 126-10 overlaid at the time periods corresponding to the particular times at which those bookmarks were created. Bookmarks 126-7, 126-8, 126-9 and 126-10 can be the same as or different from other bookmarks on presentation surface 122, such as bookmark 126-7, which is shown both as a text representation above notes 502 and as a hash mark on media file icon 125-6. Alternatively, bookmarks 126-7, 126-8, 126-9 and 126-10 can be playback icons 504 for other bookmarks on presentation surface 122. The embodiments are not limited in this context.
Fig. 7C shows user interface view 780. User interface view 780 shows a more detailed user interface view of an exemplary application (e.g., MICROSOFT ONENOTE) operating on a mobile device such as a smartphone. In addition, user interface view 780 shows a user interface configuration suitable for use during play mode 208.
As shown in user interface view 780, mobile device 702 can include user interface 120 for presenting various user interface elements of MICROSOFT ONENOTE. As with user interface view 750, user interface 120 can include presentation surface 122 to present media file icon 125-6, notes 502 and various bookmarks 126-7, 126-8, 126-9 and 126-10. In addition, user interface view 780 includes bookmark icon 124, which can be used to start automatic bookmark generation operations before, after or during play mode 208 of the application, so as to create bookmarks 126 automatically (e.g., without human or manual intervention). Media file icon 125-6 is implemented as an audio seek bar, with bookmarks 126-7, 126-8, 126-9 and 126-10 overlaid at the time periods corresponding to the particular times at which those bookmarks were created. Bookmarks 126-7, 126-8, 126-9 and 126-10 can be the same as or different from other bookmarks on presentation surface 122, such as bookmark 126-7, which is shown both as a text representation above notes 502 and as a hash mark on media file icon 125-6. Alternatively, bookmarks 126-7, 126-8, 126-9 and 126-10 can be playback icons 504 for other bookmarks on presentation surface 122. The embodiments are not limited in this context.
As previously described, the operations associated with bookmark icon 124 can also be implemented using other input techniques, such as touch on certain types of touch-screen displays or voice commands. With respect to the latter technique, speech recognition technology can be used to intelligently add bookmarks automatically based on the content the user records. The application can listen for particular keywords or phrases indicating that important information is being uttered, for example "add bookmark", "remember", "follow up on this" or "this is important", the user's surname or given name, or some other recognized voice command. When a recognized voice command is detected, a bookmark 126 can be added at the corresponding time in the recording.
The operations of the above-described embodiments may be further described with reference to one or more logic flows. It should be appreciated that, unless otherwise indicated, the representative logic flows do not necessarily have to be executed in the order presented or in any particular order. Moreover, various activities described with respect to the logic flows can be executed in serial or parallel fashion. The logic flows may be implemented using one or more hardware elements and/or software elements of the described embodiments, or alternative elements as desired for a given set of design and performance constraints. For example, the logic flows may be implemented as logic (e.g., computer program instructions) for execution by a logic device (e.g., a general-purpose or special-purpose computer).
Fig. 8A shows one embodiment of a logic flow 800 for generating bookmarks for a media file. Logic flow 800 can represent some or all of the operations executed by one or more embodiments described herein, such as media file component 110 and/or media bookmark component 130 of bookmark application 140.
In the illustrated embodiment shown in Fig. 8A, logic flow 800 can identify a media file of media information at block 802. For example, media file component 110 can identify media file 104 of media information. The identified media file 104 can be created by a given application and a given user, which can be the same as or different from the application or user creating bookmarks for media file 104.
Logic flow 800 can optionally receive a control instruction to automatically generate bookmarks for the media file at block 804. For example, media bookmark component 130 can present bookmark icon 124 for media file 104 on user interface 120. Bookmark icon 124 can be presented visually by user interface 120. When the user selects bookmark icon 124, media bookmark component 130 can start automatic bookmark generation operations to generate bookmarks 126. Additionally or alternatively, the user can perform the operation of bookmark icon 124 using a keyboard command, such as a defined shortcut combination.
Media bookmark component 130 can detect activation of bookmark icon 124 by the user. In one embodiment, media bookmark component 130 can detect activation of bookmark icon 124 before, during or after a recording operation (e.g., recording mode 206) for the media content of media file 104. In one embodiment, media bookmark component 130 can detect activation of bookmark icon 124 before, during or after a playback operation (e.g., play mode 208) for the media content of media file 104. In one embodiment, media bookmark component 130 can detect activation of bookmark icon 124 based on tactile engagement with a touch-screen interface of a touch-screen display of an electronic device, such as a smartphone, smart watch, tablet computer or other electronic device. Alternatively, media bookmark component 130 can detect activation of bookmark icon 124 based on a voice command.
Logic flow 800 can scan the media file for bookmark designators at block 806. For example, media bookmark component 130 can receive a control instruction to start bookmark generation operations, and start scanning media file 104 for one or more bookmark designators 132, such as keywords 134 and/or identities 136.
Logic flow 800 can generate a bookmark for the media file based on a bookmark designator at block 808. For example, media bookmark component 130 can detect a bookmark designator 132, retrieve time information 106 of media file 104 corresponding to the bookmark designator 132, and generate a bookmark 126 for media file 104 based on the retrieved time information 106.
Logic flow 800 can optionally generate a text chunk for the bookmark at block 810. For example, media bookmark component 130 can generate text chunk 128 for bookmark 126 using an audio length parameter. The audio length parameter can be used to retrieve the text information converted from the audio information corresponding to the time interval indicated by the audio length parameter.
Logic flow 800 can present the bookmark and/or text chunk on the user interface at block 812. For example, media bookmark component 130 can present bookmark 126 and/or text chunk 128 on presentation surface 122 of user interface 120. Bookmark 126 can be presented using any number of different types of multimedia information, such as a text-based bookmark embedded in notes presented on presentation surface 122, a colored or otherwise recognizable mark on the audio waveform, a different user interface view separate from presentation surface 122, or another user interface element. For visually impaired users, bookmark 126 and/or text chunk 128 can be presented audibly through text-to-speech (TTS) technology when the user activates a defined user interface element or places focus on a particular bookmark or on notes near the bookmark.
Fig. 8B shows one embodiment of a logic flow 820 for scanning a media file for bookmark designators. Logic flow 820 can represent some or all of the operations executed by one or more embodiments described herein, such as media file component 110 and/or media bookmark component 130 of bookmark application 140.
In the illustrated embodiment shown in Fig. 8B, logic flow 820 can scan the media file for bookmark designators at block 806. For example, media bookmark component 130 can receive a control instruction to start bookmark generation operations, and start scanning media file 104 for one or more bookmark designators 132, such as keywords 134 and/or identities 136.
Logic flow 820 can convert audio information from the media file into text information at block 824. For example, STT component 150 can receive audio information from a microphone when in recording mode, or as stored in media file 104 when in play mode, and convert the audio information into text information using various speech-to-text (STT) techniques. The text information and time information can be stored in a data structure for later retrieval by media bookmark component 130, or streamed directly to media bookmark component 130 in real time or near real time.
Logic flow 820 can detect one or more keywords from the text information as bookmark designators at block 826. For example, media bookmark component 130 can retrieve the text information from the data structure, or receive the text information directly from STT component 150, and compare it with bookmark designators 132-1 in the form of a set of keywords 134. When media bookmark component 130 finds a match, media bookmark component 130 can generate a bookmark 126 and/or a text chunk 128.
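The keyword comparison at block 826 can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented method: the `TimedText` record, the `find_keyword_bookmarks` function, and the case-insensitive substring match are all hypothetical, and the transcript is assumed to have already been produced by a speech-to-text step that attaches a start time to each segment.

```python
from dataclasses import dataclass

@dataclass
class TimedText:
    """One transcript segment with its offset into the media file."""
    start: float  # seconds from the start of the media file
    text: str

def find_keyword_bookmarks(transcript, keywords):
    """Return (time, keyword) pairs for segments containing a keyword.

    Matching is a case-insensitive substring test, and at most one
    bookmark is produced per segment (the first keyword that matches).
    """
    lowered = [keyword.lower() for keyword in keywords]
    hits = []
    for segment in transcript:
        text = segment.text.lower()
        for keyword in lowered:
            if keyword in text:
                hits.append((segment.start, keyword))
                break
    return hits

transcript = [
    TimedText(0.0, "Welcome everyone"),
    TimedText(12.5, "Remember to send the report"),
    TimedText(30.0, "This is important for the launch"),
]
bookmarks = find_keyword_bookmarks(transcript, ["remember", "this is important"])
```

A substring test also matches inflected forms such as "remembered"; a stricter design could tokenize the text and compare whole words.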
Fig. 8C shows one embodiment of a logic flow 840 for scanning a media file for bookmark designators. Logic flow 840 can represent some or all of the operations executed by one or more embodiments described herein, such as media file component 110 and/or media bookmark component 130 of bookmark application 140.
In the illustrated embodiment shown in Fig. 8C, logic flow 840 can scan the media file for bookmark designators at block 806. For example, media bookmark component 130 can receive a control instruction to start bookmark generation operations, and start scanning media file 104 for one or more bookmark designators 132, such as keywords 134 and/or identities 136.
Logic flow 840 can perform speech recognition to determine identity information for the source of the audio information at block 842. For example, speech recognition component 160 can receive audio information from a microphone in recording mode, or as stored in media file 104 in play mode, and identify the various speakers in the audio recording. The identity information and time information can be stored in a data structure for later retrieval by media bookmark component 130, or streamed directly to media bookmark component 130 in real time or near real time.
Logic flow 840 can detect an identity from the identity information as a bookmark designator at block 844. For example, media bookmark component 130 can retrieve the identity information from the data structure, or receive the identity information directly from speech recognition component 160, and compare it with bookmark designators 132-2 in the form of a set of identities 136. When media bookmark component 130 finds a match, media bookmark component 130 can generate a bookmark 126 and/or a text chunk 128.
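The identity comparison at block 844 can be illustrated in the same spirit. In this hypothetical sketch the speaker labels are assumed to come from an upstream speaker identification step as (start time, speaker) pairs, and a bookmark time is emitted only when a wanted speaker starts a new turn. That turn-start rule is an assumption chosen to avoid one bookmark per segment; the description only states that a match can generate a bookmark.

```python
def find_identity_bookmarks(segments, identities):
    """Return start times at which a wanted speaker begins a new turn.

    segments: iterable of (start_seconds, speaker_label) pairs in time order.
    identities: speaker labels of interest, compared case-insensitively.
    """
    wanted = {identity.lower() for identity in identities}
    times = []
    previous = None
    for start, speaker in segments:
        key = speaker.lower()
        # Only the first segment of a consecutive run yields a bookmark.
        if key in wanted and key != previous:
            times.append(start)
        previous = key
    return times

segments = [
    (0.0, "Alice"),
    (5.0, "Bob"),
    (9.0, "Bob"),
    (14.0, "Alice"),
    (20.0, "Bob"),
]
bob_turns = find_identity_bookmarks(segments, ["bob"])
```

Here the two consecutive segments by the same speaker at 5.0 and 9.0 seconds produce a single bookmark time.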
Fig. 8D shows one embodiment of a logic flow 860 for generating a bookmark for a media file. Logic flow 860 can represent some or all of the operations executed by one or more embodiments described herein, such as media file component 110 and/or media bookmark component 130 of bookmark application 140.
Logic flow 860 can generate a bookmark for the media file based on a bookmark designator at block 808. For example, media bookmark component 130 can detect a bookmark designator 132, retrieve time information 106 of media file 104 corresponding to the bookmark designator 132, and generate a bookmark 126 for media file 104 based on the retrieved time information 106.
Logic flow 860 can retrieve a text chunk corresponding to a defined portion of the audio information of the media file based on the bookmark at block 862. For example, media bookmark component 130 can convert audio information from media file 104 into text information based on bookmark 126. Media bookmark component 130 can retrieve text chunk 128 from the text information and associated time information stored in the data structure using an audio length parameter associated with bookmark 126. The audio length parameter can indicate time information, for example the start time and end time of a time interval. Media bookmark component 130 can use the audio length parameter to locate the text information corresponding to the time interval by checking the time information stored with the text information, such as timestamps indicating when the text information occurred as audio information in media file 104.
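The timestamp lookup at block 862 amounts to filtering stored text by a time interval. The sketch below assumes, for illustration only, that the data structure is a list of (timestamp, word) pairs and that the audio length parameter is given as a start time and an end time in seconds; `fetch_text_chunk` is a hypothetical name.

```python
def fetch_text_chunk(timed_words, start, end):
    """Join the words whose timestamps fall inside [start, end].

    timed_words: list of (timestamp_seconds, word) pairs in time order.
    start, end: the interval indicated by the audio length parameter.
    """
    if end < start:
        raise ValueError("end time must not precede start time")
    selected = [word for timestamp, word in timed_words if start <= timestamp <= end]
    return " ".join(selected)

timed_words = [
    (9.0, "before"),
    (10.0, "ship"),
    (11.5, "the"),
    (13.0, "release"),
    (16.0, "later"),
]
chunk = fetch_text_chunk(timed_words, 10.0, 15.0)
```

Both interval end points are treated as inclusive here; an implementation could equally use a half-open interval.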
Logic flow 860 can present the bookmark and text chunk on a presentation surface of an application at block 864. For example, media bookmark component 130 can present bookmark 126 and/or text chunk 128 on presentation surface 122 of user interface 120. Bookmark 126 can be presented using any number of different types of multimedia information, such as a text-based bookmark embedded in notes presented on presentation surface 122, a colored or otherwise recognizable mark on the audio waveform, a different user interface view separate from presentation surface 122, or another user interface element. For visually impaired users, bookmark 126 and/or text chunk 128 can be presented audibly through text-to-speech (TTS) technology when the user activates a defined user interface element or places focus on a particular bookmark or on notes near the bookmark.
Fig. 9 shows one embodiment of a logic flow 900 for generating a bookmark for a media file. Logic flow 900 can represent some or all of the operations executed by one or more embodiments described herein, such as media file component 110 and/or media bookmark component 130 of bookmark application 140.
In the illustrated embodiment shown in Fig. 9, logic flow 900 can retrieve a first timestamp representing a time index of the media file at block 902. For example, media bookmark component 130 can retrieve from time information 106 a first timestamp representing a time index of media file 104. Media bookmark component 130 can then generate bookmark 126 to include the first timestamp of time information 106 representing media file 104, the first timestamp corresponding to the start time at which the bookmark icon was activated (e.g., selected and activated by the user).
Logic flow 900 can optionally retrieve a second timestamp representing a time index of the media file at block 904. For example, media bookmark component 130 can retrieve from time information 106 a second timestamp representing a time index of media file 104. Media bookmark component 130 can then generate bookmark 126 to include the second timestamp of time information 106 representing media file 104, the second timestamp corresponding to the end time of a media file segment of media file 104, where the second timestamp is after the first timestamp. The second timestamp can correspond to selection of a user interface element, such as bookmark icon 124 or an entirely different user interface element. For example, bookmark icon 124 can have a toggle mode, in which a first activation corresponds to the first timestamp and a second activation corresponds to the second timestamp. Alternatively, the second timestamp can correspond to a defined time interval (e.g., 5m increments), the length of a pause between speech utterances, a keyword, and so forth.
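The toggle mode of bookmark icon 124 can be modeled as a small state machine: the first activation records a start time and the second activation closes the segment. The class below is a hypothetical sketch; its name, its `activate` method, and the choice to raise an error when the second time precedes the first are assumptions made for illustration.

```python
class BookmarkToggle:
    """Pairs alternating activations into (first, second) timestamp spans."""

    def __init__(self):
        self._pending_start = None  # set between the first and second activation
        self.spans = []             # completed (first, second) timestamp pairs

    def activate(self, now):
        """Record an activation at media time `now` (seconds).

        Returns None after a first activation and the completed span
        after a second activation.
        """
        if self._pending_start is None:
            self._pending_start = now
            return None
        if now < self._pending_start:
            raise ValueError("second timestamp must not precede the first")
        span = (self._pending_start, now)
        self._pending_start = None
        self.spans.append(span)
        return span

toggle = BookmarkToggle()
first = toggle.activate(12.0)   # opens a segment
second = toggle.activate(30.0)  # closes it
third = toggle.activate(45.0)   # opens the next segment
```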
Logic flow 900 can optionally retrieve a file identifier for the media file at block 906. For example, media bookmark component 130 can retrieve the file identifier of media file 104 from the data repository of media file 104. Alternatively, media bookmark component 130 can request the file identifier from media file component 110. The file identifier can include, for example, a file name, a globally unique identifier (GUID), a locally unique identifier, a machine-generated identifier, and so forth.
Logic flow 900 can generate a bookmark for the media file with the first timestamp, the second timestamp and/or the file identifier at block 908. For example, media bookmark component 130 can generate a bookmark 126 for the media file with the first timestamp, the second timestamp and/or the file identifier. Media bookmark component 130 can store bookmark 126 as metadata of media file 104. Bookmark 126 can be stored together with media file 104, or separately from media file 104 in a local or remote data repository.
In one embodiment, bookmark 126 can include only the first timestamp. Once bookmark 126 is activated, media file component 110 can reproduce media content from media file 104 starting at the time indicated by the first timestamp, and continue playback until terminated by the user.
In one embodiment, bookmark 126 can include the first timestamp and the file identifier. Once bookmark 126 is activated, media file component 110 can reproduce media content from the specific media file 104-1, 104-2 identified by the file identifier, starting at the time indicated by the first timestamp, until terminated by the user. Among other use scenarios, this may be particularly useful when multiple media files 104-1, 104-2 are associated with a single presentation surface 122.
In one embodiment, bookmark 126 can include the first timestamp and the second timestamp. Once bookmark 126 is activated, media file component 110 can reproduce media content from media file 104 starting at the time indicated by the first timestamp, and stop playback at the time indicated by the second timestamp. The first timestamp and the second timestamp effectively identify a media segment or media clip of media file 104.
In one embodiment, bookmark 126 can include the first timestamp, the second timestamp and the file identifier. Once bookmark 126 is activated, media file component 110 can reproduce media content from media file 104 starting at the time indicated by the first timestamp, and stop playback at the time indicated by the second timestamp. The first timestamp and the second timestamp effectively identify a media segment or media clip of media file 104, and the file identifier effectively identifies the specific media file 104 from among multiple media files 104-1, 104-2.
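The four bookmark variants above differ only in which optional fields are present, which suggests a single record with optional members. The following sketch is illustrative rather than definitive: `Bookmark`, `resolve_playback`, the `durations` mapping, and the use of the file length as the stop time when no second timestamp exists (standing in for "until terminated by the user") are all assumptions.

```python
from dataclasses import dataclass
from typing import Optional

@dataclass(frozen=True)
class Bookmark:
    start: float                   # first timestamp, in seconds
    end: Optional[float] = None    # second timestamp, if any
    file_id: Optional[str] = None  # file identifier, if any

def resolve_playback(bookmark, durations, default_file):
    """Return (file_id, start, stop) for an activated bookmark.

    durations: mapping of file identifier to media length in seconds.
    default_file: file used when the bookmark carries no file identifier.
    """
    file_id = bookmark.file_id or default_file
    duration = durations[file_id]
    # Without a second timestamp, playback is bounded only by the file end.
    stop = duration if bookmark.end is None else min(bookmark.end, duration)
    return file_id, bookmark.start, stop

durations = {"104-1": 120.0, "104-2": 300.0}
open_ended = resolve_playback(Bookmark(15.0), durations, "104-1")
clip = resolve_playback(Bookmark(15.0, 40.0, "104-2"), durations, "104-1")
```

A bookmark carrying a file identifier overrides the default file, which corresponds to the case of several media files associated with one presentation surface.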
Fig. 10 shows one embodiment of a logic flow 1000 for reproducing bookmarked media content from a media file. Logic flow 1000 can represent some or all of the operations executed by one or more embodiments described herein, such as media file component 110 and/or media bookmark component 130 of bookmark application 140.
In the illustrated embodiment shown in Fig. 10, logic flow 1000 can present a bookmark on the user interface at block 1002. For example, media bookmark component 130 can present bookmark 126 on presentation surface 122 of the application. Media bookmark component 130 can also present bookmark 126 with a playback icon 504, so as to reproduce the media file at the first timestamp of time information 106 representing media file 104. Additionally or alternatively, media bookmark component 130 can present bookmark 126, or playback icon 504 for bookmark 126, on a visual representation of media file 104, such as media file icon 125-3.
Logic flow 1000 can detect a start event at block 1004 to begin reproducing media content from the media file at the first timestamp of the time information representing the media file. For example, media bookmark component 130 can detect a start event that causes reproduction of media content from media file 104 to begin at the first timestamp of time information 106 representing media file 104 (e.g., starting play mode 208). An example of a start event is activation of the playback icon 504 associated with bookmark 126.
Logic flow 1000 can optionally detect a stop event at block 1006 to stop reproducing media content from the media file at the second timestamp of the time information representing the media file. For example, media bookmark component 130 can detect a stop event that causes reproduction of media content from media file 104 to stop at the second timestamp of time information 106 representing media file 104 (e.g., stopping play mode 208 or entering standby mode 210). One example of a stop event is activation of user interface control 204 to stop reproduction of media file 104. Another example of a stop event is reaching the second timestamp during the playback operation.
In various embodiments, bookmark application 140 can be arranged for various single-user scenarios. For example, multiple users can each have their own copy or version of media file 104 and manage its bookmarks 126 accordingly. In addition, a user can manage and select various properties or attributes of a set of bookmarks 126, so as to customize the set of bookmarks 126 for that user. Each user can configure bookmarks 126 to have different colors, user identifiers, bookmark identifiers, text information, audio information, visual information, and so forth. A user can also customize the properties or attributes of a set of bookmarks 126 for certain tasks, such as note taking, follow-up, distribution, publishing, sharing, and so forth.
In various embodiments, bookmark application 140 can be arranged for various collaboration scenarios. As previously described, multiple users can each have their own copy or version of media file 104 and manage its bookmarks 126 accordingly. In some cases, however, multiple users can share a media recording in a single media file 104, such as in a shared notebook. In this case, media bookmark component 130 can generate different bookmarks 126 corresponding to different users. The different bookmarks 126 can be visually distinguished by varying certain properties or attributes of each bookmark 126, such as by using different colors, user identifiers, bookmark identifiers, text information, audio information, visual information, and so forth, to designate each user and the corresponding bookmarks 126.
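One simple way to distinguish the bookmarks of different users on a shared media file is to assign each user a color. The sketch below is a hypothetical illustration: `assign_colors`, the palette values, and the rule of assigning colors in order of first appearance and reusing them cyclically are assumptions, not details from the description.

```python
from itertools import cycle

def assign_colors(user_ids, palette=("red", "blue", "green", "orange")):
    """Map each distinct user to a color, in order of first appearance.

    Colors repeat cyclically when there are more users than palette entries.
    """
    colors = cycle(palette)
    # dict.fromkeys removes duplicates while preserving first-seen order.
    return {user_id: next(colors) for user_id in dict.fromkeys(user_ids)}

# Bookmarks on one shared media file, as (user identifier, time in seconds).
shared_bookmarks = [("ana", 12.0), ("raj", 31.5), ("ana", 47.0), ("li", 60.0)]
color_by_user = assign_colors(user_id for user_id, _ in shared_bookmarks)
rendered = [(time, color_by_user[user_id]) for user_id, time in shared_bookmarks]
```

Other distinguishing attributes named in the description, such as a user identifier or text label, could be attached to each bookmark in the same way.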
Fig. 11 shows an electronic device 1100 suitable for implementing the various embodiments described above. In one embodiment, electronic device 1100 is a wireless mobile device, such as a smartphone, smart watch or tablet computer. Electronic device 1100 can include a processor 1102 in communication with a memory 1116. Processor 1102 can be a central processing unit and/or a graphics processing unit. Memory 1116 is a combination of flash memory and random access memory. Memory 1116 stores bookmark application 140 to implement the operations of the various embodiments described above. Bookmark application 140 includes executable instructions for media file component 110 and media bookmark component 130.
Processor 1102 is also coupled to digital media sensors 1104. Digital media sensors 1104 can include, for example, an image sensor, such as a charge-coupled device. The image sensor captures visual media, which is presented on display 1106 so that the user can observe the captured visual media. Digital media sensors 1104 can also include, for example, an audio sensor, such as a microphone device. The audio sensor captures audible media, which is reproduced through loudspeaker 1108. Other digital media sensors 1104 (e.g., thermal sensors, altitude sensors, biometric sensors, etc.) can also be added based on a given implementation. The embodiments are not limited in this context.
Touch controller 1110 is connected to display 1106 and processor 1102. Touch controller 1110 responds to tactile signals applied to display 1106. In one embodiment, bookmark application 140 presents various user interface views on display 1106. That is, bookmark application 140 includes executable instructions that are executed by processor 1102 to present various user interface views on display 1106.
Bookmark application 140 communicates with processor 1102 regarding the tactile signals applied to display 1106, as recorded by touch controller 1110. In one configuration, bookmark application 140 processes tactile signals applied to bookmark icon 124 and playback icon 504 and, as previously described, determines whether to generate a bookmark 126 or to play back the media file associated with a bookmark 126.
Electronic device 1100 can also include other components generally associated with a smartphone, smart watch or tablet computer, such as global positioning system (GPS) processor 1112, power control circuit 1114 and wireless signal processor 1116. The embodiments are not limited in this context.
Fig. 12 shows an embodiment of an exemplary computing architecture 1200 suitable for implementing the various embodiments described above. Computing architecture 1200 includes various common computing elements, such as one or more processors, coprocessors, memory units, chipsets, controllers, peripherals, interfaces, oscillators, timing devices, video cards, audio cards, multimedia input/output (I/O) components, and so forth. However, the embodiments are not limited to implementation by computing architecture 1200.
As shown in Fig. 12, computing architecture 1200 includes a processing unit 1204, a system memory 1206 and a system bus 1208. Processing unit 1204 can be any of various commercially available processors. Dual microprocessors and other multi-processor architectures can also be employed as processing unit 1204. System bus 1208 provides an interface to processing unit 1204 for system components including, but not limited to, system memory 1206. System bus 1208 can be any of several types of bus structure that may further interconnect to a memory bus (with or without a memory controller), a peripheral bus and a local bus, using any of a variety of commercially available bus architectures.
System memory 1206 can include various types of memory units, such as read-only memory (ROM), random-access memory (RAM), dynamic RAM (DRAM), double data rate DRAM (DDRAM), synchronous DRAM (SDRAM), static RAM (SRAM), programmable ROM (PROM), erasable programmable ROM (EPROM), electrically erasable programmable ROM (EEPROM), flash memory, polymer memory such as ferroelectric polymer memory, ovonic memory, phase change or ferroelectric memory, silicon-oxide-nitride-oxide-silicon (SONOS) memory, magnetic or optical cards, or any other type of media suitable for storing information. In the illustrated embodiment shown in Fig. 12, system memory 1206 can include non-volatile memory 1210 and/or volatile memory 1212. A basic input/output system (BIOS) can be stored in non-volatile memory 1210.
Computer 1202 can include various types of computer-readable storage media, including an internal hard disk drive (HDD) 1214, a magnetic floppy disk drive (FDD) 1216 to read from or write to a removable magnetic disk 1218, and an optical disk drive 1220 to read from or write to a removable optical disk 1222 (e.g., a CD-ROM or DVD). HDD 1214, FDD 1216 and optical disk drive 1220 can be connected to system bus 1208 by an HDD interface 1224, an FDD interface 1226 and an optical drive interface 1228, respectively. HDD interface 1224 for external drive implementations can include at least one or both of Universal Serial Bus (USB) and IEEE 1394 interface technologies.
The drives and associated computer-readable media provide volatile and/or non-volatile storage of data, data structures, computer-executable instructions, and so forth. For example, a number of program modules can be stored in the drives and memory units 1210, 1212, including an operating system 1230, one or more application programs 1232, other program modules 1234 and program data 1236. The one or more application programs 1232, other program modules 1234 and program data 1236 can include, for example, bookmark application 140, media file component 112, media bookmark component 130, security component 536, publishing component 532, message component 534, user interface 538 and messaging application 542.
A user can enter commands and information into the computer 1202 through one or more wired/wireless input devices, for example a keyboard 1238 and a pointing device such as a mouse 1240. Other input devices may include a microphone, an infrared (IR) remote control, a joystick, a game pad, a stylus pen, a touch screen, and the like. These and other input devices are often connected to the processing unit 1204 through an input device interface 1242 that is coupled to the system bus 1208, but can be connected by other interfaces such as a parallel port, an IEEE 1394 serial port, a game port, a USB port, an IR interface, and so forth.
A monitor 1244 or other type of display device is also connected to the system bus 1208 via an interface, such as a video adapter 1246. In addition to the monitor 1244, a computer typically includes other peripheral output devices, such as speakers, printers, and so forth.
The computer 1202 may operate in a networked environment using logical connections, via wired and/or wireless communications, to one or more remote computers such as a remote computer 1248. The remote computer 1248 can be a workstation, a server computer, a router, a personal computer, a portable computer, a microprocessor-based entertainment appliance, a peer device or other common network node, and typically includes many or all of the elements described relative to the computer 1202, although, for brevity, only a memory/storage device 1250 is illustrated. The logical connections depicted include wired/wireless connectivity to a local area network (LAN) 1252 and/or larger networks, for example a wide area network (WAN) 1254. Such LAN and WAN networking environments are commonplace in offices and companies, and facilitate enterprise-wide computer networks, such as intranets, all of which may connect to a global communications network, for example the Internet.
When used in a LAN networking environment, the computer 1202 is connected to the LAN 1252 through a wired and/or wireless communication network interface or adapter 1256. The adapter 1256 can facilitate wired and/or wireless communications to the LAN 1252, which may also include a wireless access point disposed thereon for communicating with the wireless functionality of the adapter 1256.
When used in a WAN networking environment, the computer 1202 can include a modem 1258, or be connected to a communications server on the WAN 1254, or have other means for establishing communications over the WAN 1254, such as by way of the Internet. The modem 1258, which can be internal or external and a wired and/or wireless device, connects to the system bus 1208 via the input device interface 1242. In a networked environment, program modules depicted relative to the computer 1202, or portions thereof, can be stored in the remote memory/storage device 1250. It will be appreciated that the network connections shown are exemplary and that other means of establishing a communications link between the computers can be used.
The computer 1202 is operable to communicate with wired and wireless devices or entities using the IEEE 802 family of standards, such as wireless devices operatively disposed in wireless communication (e.g., IEEE 802.11 over-the-air modulation techniques) with, for example, a printer, a scanner, a desktop and/or portable computer, a personal digital assistant (PDA), a communications satellite, any piece of equipment or location associated with a wirelessly detectable tag (e.g., a kiosk, news stand, restroom), and a telephone. This includes at least Wi-Fi (or Wireless Fidelity), WiMax and Bluetooth™ wireless technologies. Thus, the communication can be a predefined structure as with a conventional network, or simply an ad hoc communication between at least two devices. Wi-Fi networks use radio technologies called IEEE 802.11x (a, b, g, etc.) to provide secure, reliable, fast wireless connectivity. A Wi-Fi network can be used to connect computers to each other, to the Internet, and to wired networks (which use IEEE 802.3-related media and functions).
Various embodiments may be implemented using hardware elements, software elements, or a combination of both. Examples of hardware elements may include devices, components, processors, microprocessors, circuits, circuit elements (e.g., transistors, resistors, capacitors, inductors, and so forth), integrated circuits, application-specific integrated circuits (ASICs), programmable logic devices (PLDs), digital signal processors (DSPs), field-programmable gate arrays (FPGAs), memory units, logic gates, registers, semiconductor devices, chips, microchips, chipsets, and so forth. Examples of software elements may include software components, programs, applications, computer programs, application programs, system programs, machine programs, operating system software, middleware, firmware, software modules, routines, subroutines, functions, methods, procedures, software interfaces, application program interfaces (APIs), instruction sets, computing code, computer code, code segments, computer code segments, words, values, symbols, or any combination thereof. Determining whether an embodiment is implemented using hardware elements and/or software elements may vary in accordance with any number of factors, such as desired computational rate, power levels, heat tolerances, processing cycle budget, input data rates, output data rates, memory resources, data bus speeds and other design or performance constraints, as desired for a given implementation.
Some embodiments may comprise an article of manufacture. An article of manufacture may comprise a storage medium to store logic. Examples of a storage medium may include one or more types of computer-readable storage media capable of storing electronic data, including volatile or non-volatile memory, removable or non-removable memory, erasable or non-erasable memory, writeable or re-writeable memory, and so forth. Examples of the logic may include various software elements, such as software components, programs, applications, computer programs, application programs, system programs, machine programs, operating system software, middleware, firmware, software modules, routines, subroutines, functions, methods, procedures, software interfaces, application program interfaces (APIs), instruction sets, computing code, computer code, code segments, computer code segments, words, values, symbols, or any combination thereof. In one embodiment, for example, an article of manufacture may store executable computer program instructions that, when executed by a computer, cause the computer to perform methods and/or operations in accordance with the described embodiments. The executable computer program instructions may include any suitable type of code, such as source code, compiled code, interpreted code, executable code, static code, dynamic code, and the like. The executable computer program instructions may be implemented according to a predefined computer language, manner or syntax, for instructing a computer to perform a certain function. The instructions may be implemented using any suitable high-level, low-level, object-oriented, visual, compiled and/or interpreted programming language.
Some embodiments may be described using the expression "one embodiment" or "an embodiment" along with their derivatives. These terms mean that a particular feature, structure, or characteristic described in connection with the embodiment is included in at least one embodiment. The appearances of the phrase "in one embodiment" in various places in the specification are not necessarily all referring to the same embodiment.
Some embodiments may be described using the expressions "coupled" and "connected" along with their derivatives. These terms are not necessarily intended as synonyms for each other. For example, some embodiments may be described using the terms "connected" and/or "coupled" to indicate that two or more elements are in direct physical or electrical contact with each other. The term "coupled," however, may also mean that two or more elements are not in direct contact with each other, but still cooperate or interact with each other.
It is emphasized that the Abstract of the Disclosure is provided to comply with 37 C.F.R. Section 1.72(b), which requires an abstract that will allow the reader to quickly ascertain the nature of the technical disclosure. It is submitted with the understanding that it will not be used to interpret or limit the scope or meaning of the claims. In addition, in the foregoing Detailed Description, it can be seen that various features are grouped together in a single embodiment for the purpose of streamlining the disclosure. This method of disclosure is not to be interpreted as reflecting an intention that the claimed embodiments require more features than are expressly recited in each claim. Rather, as the following claims reflect, inventive subject matter lies in less than all features of a single disclosed embodiment. Thus the following claims are hereby incorporated into the Detailed Description, with each claim standing on its own as a separate embodiment. In the appended claims, the terms "including" and "in which" are used as the plain-English equivalents of the respective terms "comprising" and "wherein." Moreover, the terms "first," "second," "third," and so forth are used merely as labels, and are not intended to impose numerical requirements on their objects.
Although the subject matter has been described in language specific to structural features and/or methodological acts, it is to be understood that the subject matter defined in the appended claims is not necessarily limited to the specific features or acts described above. Rather, the specific features and acts described above are disclosed as example forms of implementing the claims.

Claims (15)

1. An apparatus, comprising:
logic, at least a portion of which is implemented in hardware, the logic to control a bookmark application to manage bookmarks for media files storing media content, the bookmark application comprising:
a media file component configured to manage a media file; and
a media bookmark component operatively coupled to the media file component, the media bookmark component configured to identify a media file with media information, scan the media file for a bookmark indicator, automatically generate a bookmark for the media file based on the bookmark indicator, and present an indication of the bookmark on a user interface; and
a component operatively coupled to the media file component and the media bookmark component, the component configured to convert the media information to text information, and output the text information to the media bookmark component.
2. The apparatus of claim 1, further comprising:
a speech-to-text (STT) component operatively coupled to the media file component and the media bookmark component, the STT component configured to receive audio information from the media file, convert the audio information to text information, and output the text information to the media bookmark component; and
wherein the media bookmark component is configured to scan the media file for a bookmark indicator by detecting one or more keywords from the text information.
3. The apparatus of claim 1, further comprising:
a voice recognition component operatively coupled to the media file component and the media bookmark component, the voice recognition component configured to receive audio information from the media file, perform voice recognition to determine identity information for a source of the audio information, and output the identity information to the media bookmark component; and
the media bookmark component configured to scan the media file for a bookmark indicator by detecting an identity from the identity information.
4. The apparatus of claim 1, the media bookmark component further configured to retrieve, based on the bookmark indicator, a text segment corresponding to a defined portion of audio information of the media file, and present the bookmark and the text segment on a presentation surface of an application program.
5. The apparatus of claim 1, the media bookmark component further configured to present a replay icon in association with the indication of the bookmark, wherein the replay icon is selectable to reproduce the media file at a first timestamp representing time information for the bookmark.
6. The apparatus of claim 1, the media file component configured to manage the media file by controlling replay operations for the media file and reproducing media content of the media file based on the bookmark.
7. The apparatus of claim 1, further comprising a digital media sensor operatively coupled to the media file component, the digital media sensor configured to record media content for the media file.
8. A method, comprising:
identifying a media file with media information;
scanning the media file for a bookmark indicator;
generating a bookmark for the media file based on the bookmark indicator;
presenting an indication of the bookmark on a user interface;
converting the media information to text information; and
outputting the text information to the media bookmark component.
9. The method of claim 8, wherein the media file comprises audio information, and wherein scanning the media file for the bookmark indicator comprises:
performing voice recognition to determine identity information for a source of the audio information; and
detecting an identity from the identity information as the bookmark indicator.
10. The method of claim 11, comprising:
retrieving, based on the bookmark indicator, a text segment corresponding to a defined portion of the audio information of the media file; and
presenting the bookmark and the text segment on a presentation surface of an application program.
11. The method of claim 8, wherein generating the bookmark for the media file comprises generating the bookmark to include a first timestamp representing time information for the media file, the first timestamp corresponding to a time at which the bookmark is generated.
12. The method of claim 8, wherein generating the bookmark for the media file comprises generating the bookmark to include a second timestamp representing time information for the media file, the second timestamp corresponding to an end point of a media file segment of the media file, wherein the second timestamp is after the first timestamp.
13. The method of claim 8, wherein presenting the indication of the bookmark comprises presenting the indication of the bookmark on a presentation surface of an application program.
14. The method of claim 8, wherein presenting the indication of the bookmark comprises presenting a visual indicator of the bookmark on a representation of the media file and presenting a replay icon in association with the indication of the bookmark, wherein the replay icon is selectable to reproduce the media file at a first timestamp representing time information for the bookmark.
15. The method of claim 11, further comprising: detecting a start event to start reproducing media content from the media file at a first timestamp representing time information for the bookmark, or detecting a stop event to stop reproducing media content from the media file at a second timestamp representing time information for the bookmark.
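
As an illustration only, the scanning and bookmark-generation steps recited in the method claims might be sketched as follows. This is a minimal sketch under stated assumptions, not the claimed implementation: the `Segment` tuple layout, the `Bookmark` fields, and the function names `scan_for_indicators`, `generate_bookmarks` and `present` are hypothetical. It assumes the media information has already been converted to timestamped text, that a bookmark indicator is a keyword, that the first timestamp is the start of the segment in which the keyword was detected, and that the second timestamp is the end of that segment.

```python
from dataclasses import dataclass
from typing import Iterator, List, Sequence, Tuple

# Hypothetical transcript segment: (start_seconds, end_seconds, text).
# The claims do not define this layout; it stands in for STT output.
Segment = Tuple[float, float, str]


@dataclass(frozen=True)
class Bookmark:
    keyword: str             # bookmark indicator that triggered the bookmark
    first_timestamp: float   # assumed: start of the segment containing the keyword
    second_timestamp: float  # assumed: end point of that media file segment
    text: str                # text segment presented together with the bookmark


def scan_for_indicators(
    segments: Sequence[Segment], keywords: Sequence[str]
) -> Iterator[Tuple[Segment, str]]:
    """Yield (segment, keyword) for every keyword found in a segment's text."""
    lowered = [k.lower() for k in keywords]
    for segment in segments:
        text = segment[2].lower()
        for keyword in lowered:
            if keyword in text:  # simple case-insensitive substring match
                yield segment, keyword


def generate_bookmarks(
    segments: Sequence[Segment], keywords: Sequence[str]
) -> List[Bookmark]:
    """Generate one bookmark per detected indicator, in transcript order."""
    return [
        Bookmark(keyword, start, end, text)
        for (start, end, text), keyword in scan_for_indicators(segments, keywords)
    ]


def present(bookmarks: Sequence[Bookmark]) -> List[str]:
    """Render each bookmark as one line of text for a presentation surface."""
    return [
        f"[{b.first_timestamp:.1f}s-{b.second_timestamp:.1f}s] {b.keyword}: {b.text}"
        for b in bookmarks
    ]
```

A caller would pass the timestamped transcript of a recording together with keywords such as "action item" or "deadline", then hand the resulting first timestamps to a player to begin reproduction at each bookmark. Substring matching is used only to keep the sketch short; a real keyword detector would likely need tokenization and phrase handling.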
CN201680026385.6A 2015-05-06 2016-05-03 Automatically generate the technology of media file bookmark Pending CN107636645A (en)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US201562157577P 2015-05-06 2015-05-06
US62/157,577 2015-05-06
US14/741,580 US10331304B2 (en) 2015-05-06 2015-06-17 Techniques to automatically generate bookmarks for media files
US14/741,580 2015-06-17
PCT/US2016/030488 WO2016179128A1 (en) 2015-05-06 2016-05-03 Techniques to automatically generate bookmarks for media files

Publications (1)

Publication Number Publication Date
CN107636645A true CN107636645A (en) 2018-01-26

Family

ID=55969489

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201680026385.6A Pending CN107636645A (en) 2015-05-06 2016-05-03 Automatically generate the technology of media file bookmark

Country Status (4)

Country Link
US (1) US10331304B2 (en)
EP (1) EP3292480A1 (en)
CN (1) CN107636645A (en)
WO (1) WO2016179128A1 (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10121474B2 (en) * 2016-02-17 2018-11-06 Microsoft Technology Licensing, Llc Contextual note taking
WO2018174397A1 (en) 2017-03-20 2018-09-27 삼성전자 주식회사 Electronic device and control method
US11662895B2 (en) 2020-08-14 2023-05-30 Apple Inc. Audio media playback user interface
US20220261453A1 (en) * 2021-02-13 2022-08-18 Kevin Bilberry Real Estate Search TV Channel

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040223737A1 (en) * 2003-05-07 2004-11-11 Johnson Carolyn Rae User created video bookmarks
US20080155627A1 (en) * 2006-12-04 2008-06-26 O'connor Daniel Systems and methods of searching for and presenting video and audio
US20090306981A1 (en) * 2008-04-23 2009-12-10 Mark Cromack Systems and methods for conversation enhancement
US7823055B2 (en) * 2000-07-24 2010-10-26 Vmark, Inc. System and method for indexing, searching, identifying, and editing multimedia files
US20120245936A1 (en) * 2011-03-25 2012-09-27 Bryan Treglia Device to Capture and Temporally Synchronize Aspects of a Conversation and Method and System Thereof

Family Cites Families (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030093790A1 (en) 2000-03-28 2003-05-15 Logan James D. Audio and video program recording, editing and playback systems using metadata
US20020120925A1 (en) 2000-03-28 2002-08-29 Logan James D. Audio and video program recording, editing and playback systems using metadata
EP1953758B1 (en) 1999-03-30 2014-04-30 TiVo, Inc. Multimedia program bookmarking system
US6876729B1 (en) 1999-11-16 2005-04-05 Avaya Technology Corp. Bookmarking voice messages
US7032177B2 (en) 2001-12-27 2006-04-18 Digeo, Inc. Method and system for distributing personalized editions of media programs using bookmarks
US20040203621A1 (en) 2002-10-23 2004-10-14 International Business Machines Corporation System and method for queuing and bookmarking tekephony conversations
US8233597B2 (en) 2005-02-11 2012-07-31 Cisco Technology, Inc. System and method for the playing of key phrases in voice mail messages
US20090251440A1 (en) 2008-04-03 2009-10-08 Livescribe, Inc. Audio Bookmarking
US20100088726A1 (en) 2008-10-08 2010-04-08 Concert Technology Corporation Automatic one-click bookmarks and bookmark headings for user-generated videos
US8351581B2 (en) 2008-12-19 2013-01-08 At&T Mobility Ii Llc Systems and methods for intelligent call transcription
CA2690174C (en) 2009-01-13 2014-10-14 Crim (Centre De Recherche Informatique De Montreal) Identifying keyword occurrences in audio data
US8731935B2 (en) 2009-09-10 2014-05-20 Nuance Communications, Inc. Issuing alerts on detection of contents of interest introduced during a conference
US20110258216A1 (en) 2010-04-20 2011-10-20 International Business Machines Corporation Usability enhancements for bookmarks of browsers
US8953928B2 (en) 2010-12-21 2015-02-10 Google Technology Holdings LLC Bookmarks in recorded video
US20120290310A1 (en) 2011-05-12 2012-11-15 Onics Inc Dynamic decision tree system for clinical information acquisition
US10672399B2 (en) 2011-06-03 2020-06-02 Apple Inc. Switching between text data and audio data based on a mapping
US9318110B2 (en) 2011-09-09 2016-04-19 Roe Mobile Development Llc Audio transcription generator and editor
US20130145265A1 (en) 2011-12-02 2013-06-06 Nicole Cunningham Bookmark with Audio Playback
US20130266127A1 (en) 2012-04-10 2013-10-10 Raytheon Bbn Technologies Corp System and method for removing sensitive data from a recording
US9672815B2 (en) 2012-07-20 2017-06-06 Interactive Intelligence Group, Inc. Method and system for real-time keyword spotting for speech analytics
US9372616B2 (en) 2013-01-31 2016-06-21 International Business Machines Corporation Smart interactive bookmarks
US20160328105A1 (en) 2015-05-06 2016-11-10 Microsoft Technology Licensing, Llc Techniques to manage bookmarks for media files

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110647710A (en) * 2019-09-18 2020-01-03 上海掌门科技有限公司 Information presentation method and device, electronic equipment and computer readable medium
CN111611505A (en) * 2020-05-19 2020-09-01 掌阅科技股份有限公司 Method for accessing multimedia resources in electronic book, computing equipment and storage medium
CN111611505B (en) * 2020-05-19 2023-08-29 掌阅科技股份有限公司 Method for accessing multimedia resources in electronic book, computing device and storage medium

Also Published As

Publication number Publication date
US20160328104A1 (en) 2016-11-10
EP3292480A1 (en) 2018-03-14
US10331304B2 (en) 2019-06-25
WO2016179128A1 (en) 2016-11-10

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination