US20010025375A1 - Browser for use in navigating a body of information, with particular application to browsing information represented by audiovisual data - Google Patents
- Publication number
- US20010025375A1 (application US09/866,956)
- Authority
- US
- United States
- Prior art keywords
- information
- data
- segment
- display
- segments
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/45—Management operations performed by the client for facilitating the reception of or the interaction with the content or administrating data related to the end-user or to the client device itself, e.g. learning user preferences for recommending movies, resolving scheduling conflicts
- H04N21/462—Content or additional data management, e.g. creating a master electronic program guide from data received from the Internet and a Head-end, controlling the complexity of a video stream by scaling the resolution or bit-rate based on the client capabilities
- H04N21/4622—Retrieving content or additional data from different sources, e.g. from a broadcast channel and the Internet
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/70—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F16/73—Querying
- G06F16/738—Presentation of query results
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/70—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F16/73—Querying
- G06F16/738—Presentation of query results
- G06F16/739—Presentation of query results in form of a video summary, e.g. the video summary being a video sequence, a composite still image or having synthesized frames
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/70—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F16/78—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/783—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
- G06F16/7834—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using audio features
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/70—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F16/78—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/783—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
- G06F16/7844—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using original textual content or text extracted from visual content or transcript of audio data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/70—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F16/78—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/783—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
- G06F16/7847—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using low-level visual features of the video content
- G06F16/785—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using low-level visual features of the video content using colour or luminescence
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/951—Indexing; Web crawling techniques
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/47—End-user applications
- H04N21/478—Supplemental services, e.g. displaying phone caller identification, shopping application
- H04N21/4782—Web browsing, e.g. WebTV
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/80—Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
- H04N21/85—Assembly of content; Generation of multimedia applications
- H04N21/858—Linking data to content, e.g. by linking an URL to a video object, by creating a hotspot
- H04N21/8586—Linking data to content, e.g. by linking an URL to a video object, by creating a hotspot by using a URL
Definitions
- This invention relates to systems and methods that enable observation of a body of information and, in particular, a body of information that can be represented, at least in part, by audiovisual data. Most particularly, the invention relates to systems and methods for accessing and reviewing a body of information represented by one or more sets of audiovisual data that can be used to generate an audiovisual display and one or more related sets of text data that can be used to generate a text display.
- previous systems either require that related segments have previously been determined or, at least, that the segments have been categorized according to subject matter content so that whether two segments are related can readily be determined.
- previous systems have not enabled determination of relatedness between segments of information represented by different types of data, e.g., such systems cannot determine whether a segment represented by audiovisual data is related to a segment represented by text data.
- the display device of these systems (e.g., a conventional computer display monitor) does not provide a high quality display of time-varying audiovisual information (such as produced by a television, for example), while display devices that do display such information well (e.g., televisions) typically do not provide a high quality display of text information such as produced by a computer display monitor.
- a system that can provide a high quality display of both types of information is needed.
- even in those systems where remote operation is possible (e.g., remotely controlled televisions), the remote control device often does not have a user interface that is as readily accessible as desired (as many consumer electronics users can testify, the keypads of many remote control devices are an impenetrable array of cryptic control keys, often requiring non-intuitive key combinations to effect particular control instructions) or the remote control device does not contain a rich set of control features. Moreover, the remote control devices used with previous systems do not have the capability of themselves displaying a part of the body of information.
- previous systems often do not enable real-time acquisition and review of some or all of the body of information.
- many computer-based systems acquire and store data representing a body of information. The stored data can then be accessed to enable display of segments of the body of information.
- these systems generally do not analyze the data to enable the data to be organized, categorized and related so that, for example, segments of the body of information can be related to other segments for which data is acquired in the future or for which data has previously been acquired.
- such systems do not enable the real-time display of some or all of a body of information while also displaying related information in response to the real-time display.
- the invention enables a body of information to be displayed by electronic devices (e.g., a television, a computer display monitor) in a manner that allows the body of information to be reviewed quickly and in a flexible manner.
- the body of information will be represented by a set of audio data, video data, text data or some combination of the three.
- the invention enables generation of an audiovisual display of one or more segments of information, as well as a display (a text display, an audio display, a video display, or an audiovisual display), for each of the segments, of one or more related segments of information.
- the invention enables acquisition, and subsequent review, of news stories obtained over a specified period of time from a specified group of news sources.
- the invention can be used to review news stories acquired during one day from several television news programs (e.g., CNN Headline News, NBC Nightly News), as well as from text news sources (e.g., news wire services, traditional print media such as newspapers and magazines, and online news services such as Clarinet™).
- the invention enables some or all of a body of information to be skimmed quickly, enabling a quick overview of the content of the body of information to be obtained.
- the invention also enables quick identification of information that pertains to a particular subject.
- the invention further enables quick movement from one segment of a body of information to another, so that observation of particular information of interest can be accomplished quickly.
- in a news browser, for example, each of a set of television news programs can be skimmed to quickly ascertain the subject matter content of the news stories contained therein.
- when a particular category (e.g., a subject matter category) is specified, news stories having content that fits within the specified subject matter category can be immediately identified and either displayed or identified as pertinent to the subject matter category and available for display.
- a user of the news browser can move arbitrarily among news stories within the same or different news programs.
- the invention also enables automatic identification of information that is related to information that is being displayed, so that the related information can be observed, thereby enabling information about a particular subject to be examined in depth.
- the invention enables such identification of related segments to be made between segments of different types (e.g., a segment represented by audiovisual data can be compared to a segment represented by text data to enable a determination of whether the segments are related).
- a portion or a representation of the related information can be displayed in response to (e.g., simultaneous with) the original information display.
- one or more text news stories (e.g., news stories that are obtained from traditional print media or from electronic publications) that are related to (i.e., which cover the same or similar subject matter as) a television news story being displayed can be automatically identified and a portion of the related text news story or stories displayed so that the story or stories can be reviewed for additional information regarding the subject matter of the television news story.
- one or more other television news stories that are related to a television news story being displayed can be automatically identified and a single representative video frame displayed for each such news story.
- the invention enables automatic categorization of uncategorized segments of the body of information based upon comparison to other segments of the body of information that have been categorized.
- the subject matter category of a segment of information can be determined by comparing the segment to one or more previously categorized segments and categorizing the segment in accordance with the subject matter categorization of one or more previously categorized segments that are determined to be relevant to the uncategorized segment.
- this can be used to categorize the news stories of a television news program based upon the categorization of text news stories that are found to be relevant to the television news stories.
- the invention can be implemented in a system that is convenient to use, that presents the body of information in a readily accessible way, and that presents the information via one or more display devices that are tailored for use with the particular type of data that is used to generate the display.
- a system according to the invention can include a control device that enables remote, untethered control of a primary display device of the system.
- the remote control device can also be implemented so that some or all of the body of information can also be displayed on the remote control device.
- the system can include, for example, a television for display of audiovisual information and a computer display monitor for display of text information.
- a control device of a system can be implemented with a graphical user interface that facilitates user interaction with the system.
- a graphical user interface can include a region that provides an indication of a user's past progression through, and present location within, the body of information.
- a program map is displayed that facilitates navigation through the news programs that can be selected for display.
- the invention also enables real-time acquisition and review of some or all of the body of information.
- the invention enables on-the-fly analysis of data as the data is acquired, so that the data can be organized, categorized and related to other data.
- the invention also enables the real-time display of some or all of a body of information while also displaying related information in response to the real-time display. For example, in a news browser according to the invention, television news programs can be acquired and displayed as they occur. Related news stories, either from previously acquired television news programs or text news sources can be displayed as each television news story is displayed in real time.
- the invention also enables control of the manner in which the information is displayed (e.g., the apparent display rate of the display can be controlled, the display can be paused, a summary of a portion of the body of information can be displayed).
- the user can cause a summary of one or more television news stories to be displayed (rather than the entire news story or stories), the user can speed up (or slow down) the display of a television news story, and the user can pause and resume the display of a television news story such that the display resumes at an accelerated rate until the display of the news story “catches up” to where the display would have been without the pause (a useful feature when the television news story is being acquired and displayed in real time).
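The pause-and-catch-up behavior reduces to simple arithmetic, sketched below. The function name `catch_up` and its parameters are hypothetical, and the sketch assumes a constant accelerated rate and a live source that keeps advancing at normal speed during the pause. If the display was paused for P seconds and resumes at rate r > 1, the lag shrinks by (r - 1) seconds per second, so the display catches up after P / (r - 1) seconds.

```python
from typing import Tuple


def catch_up(pause_seconds: float, rate: float) -> Tuple[float, float]:
    """Return (wall_clock_seconds, content_seconds) needed to catch up.

    pause_seconds: how long the display was paused (the initial lag).
    rate: accelerated playback rate after resuming (1.0 = normal speed).
    Both names are illustrative assumptions, not taken from the source.
    """
    if pause_seconds < 0:
        raise ValueError("pause_seconds must be non-negative")
    if rate <= 1.0:
        # At normal speed or slower the lag never shrinks.
        raise ValueError("rate must exceed 1.0 for the display to catch up")
    # The live position advances 1 s per second; playback advances `rate`
    # seconds per second, so the lag closes at (rate - 1) seconds per second.
    wall_clock = pause_seconds / (rate - 1.0)
    # Amount of program content shown while catching up.
    content = rate * wall_clock
    return wall_clock, content


if __name__ == "__main__":
    wall, content = catch_up(pause_seconds=30.0, rate=1.5)
    print(f"catch up after {wall:.0f} s, showing {content:.0f} s of content")
```

With a 30 second pause and a 1.5x rate, the accelerated display lasts 60 seconds and covers 90 seconds of the program.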
- a system enables acquisition and review of a body of information that includes a multiplicity of segments that each represent a defined set of information (frequently, a contiguous related set of information) in the body of information.
- the system includes: i) a mechanism for acquiring data representing the body of information; ii) a mechanism for storing the data; iii) a first display mechanism for generating a display of a first segment of the body of information from data that is part of the stored data; iv) a mechanism for comparing the data representing a segment of the body of information to the data representing a different segment of the body of information to determine whether, according to one or more predetermined criteria, the compared segments are related; and v) a second display mechanism for generating a display of a portion of, or a representation of, a second segment of the body of information from data that is part of the stored data.
- the second display mechanism displays a portion or representation of the second segment in response to the display by the first display mechanism of a first segment to which the second segment is related.
- the second display mechanism can display a portion or representation of the second segment substantially coextensive in time with the display of the related first segment by the first display mechanism.
- the system can further include a mechanism for identifying the subject matter content of a segment of the body of information, so that the mechanism for comparing can determine the similarity of the subject matter content of a segment to the subject matter content of a different segment (using, for example, relevance feedback) and use that result to determine the relatedness of the compared segments.
- the system can also include a mechanism for identifying an instruction from a user to begin displaying at least some of the body of information, the first display mechanism beginning display of a segment in response to the user instruction.
- the system can enable such a second segment to be selected for display by the first display mechanism.
- the segments displayed by the first display mechanism are represented by audiovisual data (and, in particular, audiovisual data that can be used to generate an audiovisual display that can vary with time), such as, for example, data produced from television or radio broadcast signals.
- the segments displayed by the second display mechanism can be represented by audiovisual data (e.g., a single representative video image, or “keyframe”) or by text data (e.g., text excerpts), such as, for example, data from computer-readable data files acquired over a computer network from an information providing site that is part of that network.
- the first display mechanism can be an analog display device (such as a television) and the second display mechanism can be a digital display device (such as a computer display monitor).
- the system can advantageously be implemented so that the various devices are interconnected to a conventional computer bus that enables the devices to communicate with each other such that the devices do not require wire communication over network communication lines to communicate with each other (the devices are “untethered”).
- a system for reviewing a body of audiovisual information that can vary with time includes: i) a mechanism for displaying the audiovisual information; and ii) a mechanism for controlling operation of the system, the mechanism for controlling being physically separate from the mechanism for displaying and including a graphical user interface for enabling specification of control instructions.
- the mechanism for controlling can advantageously be made portable.
- the system can advantageously include a mechanism for 2-way wireless communication between the mechanism for displaying and the mechanism for controlling.
- the graphical user interface can include one or more of the following: i) a playback control region for enabling specification of control instructions that control the manner in which the audiovisual information is displayed on the means for displaying; ii) a map region for providing a description of the subject matter content of the audiovisual information and for enabling specification of control instructions that enable navigation within the audiovisual information; iii) a related information region for displaying a portion of, or a representation of, a segment that is related to a segment being displayed by the mechanism for displaying; and iv) a secondary information display region for displaying a secondary information segment that is related to a segment of the audiovisual information that is being displayed by the mechanism for displaying.
- the playback control region can include one or more of the following: i) an interface that enables selection of one of a plurality of subject matter categories, all of the segments of the audiovisual information corresponding to a particular subject matter category being displayed in response to the selection of that subject matter category; ii) an interface that enables variation of the apparent display rate at which the audiovisual information is displayed; iii) an interface that enables specification of the display of a summary of a segment of the audiovisual information; iv) an interface that enables the display to be paused, then resumed at an accelerated rate that continues until the display of the audiovisual information coincides with the display that would have appeared had the display not been paused; v) an interface that enables termination of the current segment display and beginning of a new segment display; and vi) an interface that enables repetition of the current segment display.
- the map region can further identify a segment of the audiovisual information that is currently being displayed and/or identify each segment of the audiovisual information that has previously been displayed.
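The bookkeeping behind such a map region can be illustrated with a small data structure. This is only a sketch: the class name `ProgramMap`, its fields, and the text rendering are hypothetical stand-ins for whatever a graphical user interface would actually draw. It tracks which segment is currently displayed and which segments have been displayed before.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class ProgramMap:
    """Tracks the current segment and previously displayed segments."""

    segment_ids: List[str]                 # all segments, in program order
    current: Optional[str] = None          # segment being displayed now
    visited: List[str] = field(default_factory=list)  # displayed earlier

    def show(self, segment_id: str) -> None:
        """Record that display has moved to `segment_id`."""
        if segment_id not in self.segment_ids:
            raise KeyError(segment_id)
        # The segment we are leaving becomes part of the history.
        if self.current is not None and self.current not in self.visited:
            self.visited.append(self.current)
        self.current = segment_id

    def render(self) -> str:
        """Text stand-in for the map: '>' current, 'x' seen, ' ' unseen."""
        lines = []
        for sid in self.segment_ids:
            if sid == self.current:
                mark = ">"
            elif sid in self.visited:
                mark = "x"
            else:
                mark = " "
            lines.append(f"[{mark}] {sid}")
        return "\n".join(lines)


if __name__ == "__main__":
    program_map = ProgramMap(["story-1", "story-2", "story-3"])
    program_map.show("story-1")
    program_map.show("story-3")
    print(program_map.render())
```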
- a system enables review of a body of information, the body of information including a first portion that is represented by audiovisual data that can vary with time and a second portion that is represented by text data.
- the system includes a first display device for displaying the first portion of information and a second display device for displaying the second portion of information.
- the first display device is particularly adapted for generation of a display from time-varying audiovisual data, and the second display device is particularly adapted for generation of a display from text data.
- the first display device can be, for example, an analog display device such as a television.
- the second display device can be, for example, a digital display device such as a computer display monitor.
- the two devices can interact with each other so that related information can be displayed at the same time on the two devices, in the same manner as that described above.
- a method categorizes according to subject matter a segment of a body of information (that includes a plurality of segments), the segment not previously having been categorized according to subject matter, based upon the subject matter category or categories associated with one or more previously categorized segments of the body of information.
- the uncategorized segment can have been acquired from a first data source (that supplies, for example, television or radio broadcast signals) and the previously categorized segment or segments can have been acquired from a second data source (that supplies, for example, computer-readable data files) that is different than the first data source.
- the method includes the steps of: i) determining the degree of similarity between the subject matter content of the uncategorized segment and the subject matter content of each of the previously categorized segments; ii) identifying one or more of the previously categorized segments as relevant to the uncategorized segment based upon the determined degrees of similarity of subject matter content between the uncategorized segment and the previously categorized segments; and iii) selecting one or more subject matter categories with which to identify the uncategorized segment based upon the subject matter category or categories used to identify the relevant previously categorized segment or segments.
- a computer readable medium encoded with one or more computer programs according to the invention enables similar capability.
- the step of determining the degree of similarity can be accomplished using a relevance feedback method.
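The three steps of the categorization method can be sketched as follows. This is a minimal illustration under stated assumptions, not the method of the invention: cosine similarity over word counts is swapped in for the relevance feedback method, segments are assumed to be available as plain text (for example, a transcript), and the names `term_vector`, `cosine`, `categorize`, `threshold`, and `top_k` are all hypothetical.

```python
import math
import re
from collections import Counter
from typing import Dict, List, Tuple


def term_vector(text: str) -> Counter:
    """Bag-of-words term counts for a segment's text."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two term-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def categorize(
    uncategorized: str,
    categorized: List[Tuple[str, str]],  # (segment text, category) pairs
    threshold: float = 0.2,              # assumed relevance cutoff
    top_k: int = 3,                      # assumed cap on relevant segments
) -> List[str]:
    """Pick categories for a segment from similar categorized segments."""
    query = term_vector(uncategorized)
    # Step i: degree of similarity to each previously categorized segment.
    scored = [(cosine(query, term_vector(text)), cat) for text, cat in categorized]
    # Step ii: keep the most similar segments above the cutoff as relevant.
    relevant = sorted((s for s in scored if s[0] >= threshold), reverse=True)[:top_k]
    # Step iii: rank categories by the summed similarity of their segments.
    weight: Dict[str, float] = {}
    for score, cat in relevant:
        weight[cat] = weight.get(cat, 0.0) + score
    return sorted(weight, key=lambda c: (-weight[c], c))


if __name__ == "__main__":
    text_stories = [
        ("senate passes budget bill after long debate", "politics"),
        ("home team wins championship game in overtime", "sports"),
        ("budget vote splits senate along party lines", "politics"),
    ]
    tv_story = "the senate debate over the budget bill continued tonight"
    print(categorize(tv_story, text_stories))
```

Summing similarity per category is one simple way to choose among several relevant segments; a majority vote or taking only the single best match would be equally plausible readings of step iii.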
- the step of identifying one or more of the previously categorized segments as relevant to the uncategorized segment can include the steps of: i) identifying a multiplicity of the previously categorized segments that are the most similar to the uncategorized segment; ii) determining the degree of similarity between each of the multiplicity of previously categorized segments and each other of the plurality of previously categorized segments; iii) for each pair of previously categorized segments of the multiplicity of previously categorized segments having greater than a predefined degree of similarity, eliminating one of the pair of previously categorized segments from the multiplicity of previously categorized segments, wherein the previously categorized segment or segments remaining after the step of eliminating are similar and distinct previously categorized segments; and iv) identifying one or more of the similar and distinct previously categorized segments as relevant previously categorized segments.
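The elimination of near-duplicate candidates described in these sub-steps can be sketched as a greedy filter. The sketch makes several assumptions that the text does not fix: Jaccard word overlap stands in for the similarity measure, the member of each too-similar pair that is kept is the one more similar to the uncategorized segment, and the names `jaccard`, `similar_and_distinct`, `top_n`, and `dup_threshold` are hypothetical.

```python
from typing import List


def jaccard(a: str, b: str) -> float:
    """Word-set overlap between two texts (a stand-in similarity measure)."""
    set_a, set_b = set(a.lower().split()), set(b.lower().split())
    union = set_a | set_b
    return len(set_a & set_b) / len(union) if union else 0.0


def similar_and_distinct(
    query: str,
    candidates: List[str],
    top_n: int = 5,               # assumed size of the "most similar" pool
    dup_threshold: float = 0.8,   # assumed "predefined degree of similarity"
) -> List[str]:
    """Return candidates that are similar to `query` yet mutually distinct."""
    # Sub-step i: take the candidates most similar to the query segment.
    ranked = sorted(candidates, key=lambda c: jaccard(query, c), reverse=True)[:top_n]
    kept: List[str] = []
    for cand in ranked:
        # Sub-steps ii and iii: compare against every candidate kept so far
        # and drop this one if it is too similar to any of them.
        if all(jaccard(cand, k) <= dup_threshold for k in kept):
            kept.append(cand)
    # Sub-step iv would then pick relevant segments from this list.
    return kept


if __name__ == "__main__":
    pool = [
        "senate passes budget bill today",
        "senate passes budget bill today",   # duplicate wire copy
        "senate budget vote delayed",
        "team wins game",
    ]
    print(similar_and_distinct("senate passes budget bill", pool, top_n=3))
```

Processing candidates in descending order of similarity means that whenever a pair is too alike, the copy closer to the query survives.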
- a method determines whether a first set of information represented by a set of data of a first type (e.g., text data) is relevant to a second set of information (that is different than the first set of information) represented by a set of data of a second type (e.g., audiovisual data).
- the method includes the steps of: i) deriving a set of data of the second type from the set of data of the first type, the derived set of data of the second type also being representative of the first set of information; ii) determining the degree of similarity between the set of data of the second type representing the second set of information and the derived set of data of the second type representing the first set of information; and iii) determining whether the first set of information is relevant to the second set of information based upon the degree of similarity between the set of data of the second type representing the second set of information and the derived set of data of the second type representing the first set of information.
- a computer readable medium encoded with one or more computer programs according to the invention enables similar capability.
- the step of determining the degree of similarity can be accomplished using a relevance feedback method.
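The three steps of the relevance determination can be illustrated with a short sketch. Several things here are assumptions and not taken from the patent: that the derived data is text (for example, a transcript of the audiovisual segment), the hypothetical `derive_text` callback, the `threshold` parameter, and the use of cosine similarity over word counts as a simple stand-in for the similarity measure. It is not an implementation of relevance feedback.

```python
import math
from collections import Counter
from typing import Callable


def term_vector(text: str) -> Counter:
    """Bag-of-words term counts for a piece of text."""
    return Counter(text.lower().split())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two term-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def is_relevant(
    text_story: str,
    av_segment: object,
    derive_text: Callable[[object], str],
    threshold: float,
) -> bool:
    # Step i: derive data of a common type (assumed here to be text).
    derived = derive_text(av_segment)
    # Step ii: degree of similarity between the two same-type data sets.
    score = cosine(term_vector(text_story), term_vector(derived))
    # Step iii: decide relevance from the degree of similarity.
    return score > threshold
```

Bringing both sets of information into one data type first is what makes a direct similarity comparison possible.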
- a method can determine which, if any, of a multiplicity of sets of information represented by an associated set of data of a first type (each of the multiplicity of sets of information being different from other of the multiplicity of sets of information) are relevant to the second set of information represented by the set of data of the second type.
- This method includes the steps of, in addition to those discussed above: i) determining the degree of similarity between each set of data of the first type representing one of the multiplicity of sets of information and the derived set of data of the first type representing the second set of information; ii) identifying which, if any, of the sets of data of the first type representing one of the multiplicity of sets of information have greater than a predefined degree of similarity to the derived set of data of the first type representing the second set of information, the sets of data of the first type so identified being termed similar sets of data of the first type; iii) determining the degree of similarity between each similar set of data of the first type and each other similar set of data of the first type; iv) for each pair of similar sets of data of the first type having greater than a predefined degree of similarity, eliminating one of the pair of similar sets of data of the first type from the set of similar sets of data of the first type, wherein the set or sets of similar data of the first type remaining after the step of eliminating are similar and
- a method enables the identification of the boundaries of segments in a body of information that is represented by a set of text data and at least one of a set of audio data or a set of video data, each segment representing a contiguous related set of information in the body of information.
- a computer readable medium encoded with one or more computer programs according to the invention enables similar capability.
- the segment boundaries are identified by first performing a coarse partitioning method to approximately locate the segment boundaries, then performing a fine partitioning method to more precisely locate the segment boundaries.
- time-stamped markers in the set of text data are identified and used to determine approximate segment boundaries within the body of information.
- a range of time is specified that includes the time of occurrence.
- Subsets of audio data or subsets of video data that occur during the specified ranges of time are extracted from the complete set of audio data or the complete set of video data.
- the fine partitioning method is then performed to identify one or more breaks in each of the subsets of audio data or each of the subsets of video data.
- the best break that occurs in each subset of audio data or each subset of video data is selected, and the time of occurrence of the best break in each subset is designated as a boundary of a segment in the body of information.
- the fine partitioning can be performed using any appropriate method.
- scene break identification can be used to implement the fine partitioning.
- the fine partitioning can be implemented by, for example, pause recognition, voice recognition, word recognition or music recognition.
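The coarse-then-fine procedure can be sketched as below. This is only an illustration under stated assumptions: candidate breaks are taken to be `(time, score)` pairs already produced by some hypothetical detector (scene break, pause, and so on), `half_window` is an invented parameter for the range of time around each marker, and falling back to the marker time when no break lies in the window is a choice made here, not something the text specifies.

```python
from typing import List, Sequence, Tuple

Break = Tuple[float, float]  # (time in seconds, score; higher means a stronger break)


def refine_boundaries(
    marker_times: Sequence[float],
    breaks: Sequence[Break],
    half_window: float,
) -> List[float]:
    boundaries: List[float] = []
    for t in marker_times:
        # Coarse step: a time-stamped text marker gives an approximate boundary,
        # and a range of time is specified around it.
        lo, hi = t - half_window, t + half_window
        # Fine step: consider only the breaks detected inside that range.
        in_window = [b for b in breaks if lo <= b[0] <= hi]
        if in_window:
            # The best break in the window becomes the segment boundary.
            boundaries.append(max(in_window, key=lambda b: b[1])[0])
        else:
            # Assumption: keep the coarse estimate when no break is found.
            boundaries.append(t)
    return boundaries
```

Restricting the fine analysis to short windows means the audio or video detector only has to run on a small fraction of the recording.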
- FIG. 1 is a block diagram illustrating a system according to the invention for acquiring and reviewing a body of information.
- FIG. 2A is a diagrammatic representation of a graphical user interface according to the invention that can be used to enable control of the operation of a system according to the invention, display information regarding operation of the system of the invention and display information acquired by the system of the invention.
- FIG. 2B is a view of an illustrative graphical user interface in accordance with the diagrammatic representation of FIG. 2A.
- FIG. 3 is a flow chart of a method in accordance with the invention for identifying the boundaries of segments in a body of information.
- FIG. 4 is a flow chart of a method in accordance with the invention for determining whether a first set of information represented by data of a first type is relevant to a second set of information represented by data of a second type.
- FIG. 5 is a flow chart of a method in accordance with the invention for categorizing according to subject matter an uncategorized segment of a body of information based on the categorization of other previously categorized segments of the body of information.
- the invention enables the acquisition of a body of information and review of the content of the body of information.
- the invention includes various features that facilitate and enhance review of the body of information.
- the invention enables the body of information to be quickly reviewed to obtain an overview of the content of the body of information or some portion of the body of information.
- the invention also allows flexibility in the manner in which the body of information is reviewed. For example, the invention enables a user to move quickly from one segment of a body of information to another, enabling the user to rapidly begin observing particular information of interest. Further, the invention enables a user to quickly locate information within the body of information that pertains to a particular subject in which the user has an interest.
- the invention also enables a user to, when observing particular information, quickly find and review other information that is related to the information that the user is observing. Additionally, the invention enables the user to control the manner in which the information is displayed (e.g., the apparent display rate of the display can be controlled, the display can be paused, a summary of a portion of the body of information can be displayed). The invention also provides the user with an indication of the user's past progression through, and present location within, the body of information, such indications aiding the user in selecting further segments (described below) of the body of information for review.
- the body of information can be represented by one or more sets of audio data, one or more sets of video data, one or more sets of text data or some combination of the three.
- audio data refers to data used to generate an audio display
- video data refers to data used to generate a video display substantially including images other than text images
- text data refers to data used to generate a video (or audio, though typically video) display of text images
- audiovisual data refers to data that includes audio and/or video data, and may include text data.
- the invention enables the acquisition and review of one or more sets of information represented by audiovisual data, as well as related sets of information represented by text data.
- the content of one or more audiovisual news programs is acquired from a first set of one or more information sources and news stories (or “articles”) from text news sources are acquired from a second set of one or more information sources.
- the first set of information sources could be, for example, CNN Headline News or network (e.g., ABC, NBC, CBS) news programs.
- the second set of information sources could be, for example, on-line news services such as Clarinet™ or news wire services such as AP or UPI. It is contemplated that this application of the invention can be particularly useful as a means of enhancing the viewing of conventional television news programs.
- the invention can enable the user to access the news stories of audiovisual news programs in a random manner so that the user can move quickly from one news program to another, or from one news story in a news program to another news story in the same or another news program.
- the invention can also enable the user to quickly locate news stories pertaining to a particular subject.
- the invention can identify and display a related text news story or stories.
- the invention can also enable the user to control the display of the audiovisual news programs by, for example, speeding up the display, causing a summary of one or more news stories to be displayed, or pausing the display of the news stories, thereby enabling the user to quickly ascertain the content of one or more news stories or entire news programs. Additionally, the invention can indicate to the user which audiovisual news program is currently being viewed (and, further, which news story within the news program is being viewed), as well as which news stories and/or news programs have previously been viewed.
- FIG. 1 is a block diagram illustrating a system 100 according to the invention for acquiring and reviewing a body of information.
- a user 109 interacts with a control device 101 to cause information to be displayed on a primary display device 102 .
- the control device 101 includes an appropriate user interface (e.g., a graphical user interface, as discussed in more detail below) that allows the user 109 to specify control instructions for effecting control of the system 100 .
- Communication between the control device 101 and the primary display device 102 is mediated by a system controller 103 .
- the system controller 103 causes primary information to be acquired from a primary information source 107 via a primary information data acquisition device 105 .
- “primary information” is any information the display of which the user can directly control.
- the system controller 103 also causes secondary information (which is typically related to the primary information) to be acquired from a secondary information source 108 via a secondary information data acquisition device 106 .
- secondary information is any information other than primary information that is acquired by a system according to the invention and that can be displayed by the system and/or used by the system to manipulate or categorize (as described in more detail below) the primary information.
- a data storage device 104 stores the acquired primary and secondary information.
- the primary information is displayed on the primary display device 102 .
- the secondary information can be displayed (e.g., by the control device 101 or by the primary display device 102 in addition to the primary information) or not (i.e., the secondary information may be used only for categorizing and/or manipulation of the primary information).
- the primary information can be videotape (or other audiovisual data representation) of an audiovisual news program or programs and the secondary information can be the text of news stories from text news sources.
- the control device 101 , the primary display device 102 , the system controller 103 and the data storage device 104 can be embodied in one or more devices that can be interconnected to a conventional computer bus that enables the devices to communicate with each other.
- the devices 101 , 102 , 103 and 104 can be integrated into a system in which the devices do not require wire communication over network communication lines to communicate with each other (one or more of the devices 101 , 102 , 103 and 104 is “untethered” with respect to one or more of the other devices 101 , 102 , 103 and 104 ).
- the primary and secondary information can be accessed and displayed at a relatively fast speed, thus providing quick response to control instructions from the user and enabling generation of displays with acceptable fidelity.
- a networked system in which the devices must communicate with each other over a network via wire communication lines—in particular, a system in which the control device and display device or devices must communicate over such wire communication lines with the data storage device on which the information is stored—may not produce acceptable performance.
- the operation of the system is limited by the communications bandwidth and latency of the network communications medium.
- the bandwidth of the network communications medium may not be adequate to enable transfer of data from the data storage device 104 to the primary display device 102 quickly enough to enable a display with acceptable fidelity to be generated by the primary display device 102 .
- the response to a control instruction from the control device 101 may be undesirably slow because of inadequate speed of the network communications medium.
- the primary information data acquisition device 105 and secondary information data acquisition device 106 can be implemented by any appropriate such devices.
- the primary information source 107 is comprised of television news broadcasts
- the primary information data acquisition device 105 can be a conventional television tuner and video capture device that acquires the data representing the primary information via conventional cable connections, satellite dish or television antenna.
- the secondary information is comprised of online text sources (i.e., text sources available over a computer network such as the Internet)
- the secondary information data acquisition device 106 can be a conventional modem or other communications adapter, as known by those skilled in the art of data communications, that enables acquisition of data representing the secondary information via one or more conventional communication lines, such as telephone lines, ISDN lines or Ethernet connections. (It is also possible that the primary information can be acquired from online sources, such as via the Internet or other computer network.)
- the primary information data acquisition device 105 and the secondary information data acquisition device 106 can communicate with the system controller 103 in any appropriate manner.
- the system controller 103 can be implemented as part of a digital computer. Where this is the case, the communication between the system controller 103 and the devices 105 and 106 is preferably implemented to enable computer control of the devices 105 and 106 .
- When the device 105 or 106 is used to acquire information over a computer network, the device 105 or 106 will be a device, such as a computer modem, for which such communication to the system controller 103 can be implemented using well-known methods and apparatus. For other types of devices, such communication must be implemented in another manner. For example, when the device 105 is a television tuner, communication between the system controller 103 and the device 105 can be implemented using a VISCA (Video System Control Architecture) connection.
- the processing of the data representing the primary and secondary information generally requires that the data be in digital form.
- Text data acquired from online text sources, for example, is acquired in digital form and so can be used directly in such processing.
- Analog television signals must be digitized before being used in digital processing. This can be accomplished using conventional A/D conversion methods and apparatus.
- the television data can be compressed according to the MPEG, JPEG or MJPEG video compression standards, as known by those skilled in the art of audio and video data compression.
- the text data can also be compressed, using conventional text file compression programs, such as PKZIP, though, typically, such compression provides a relatively small benefit because the amount of text data is small compared to the amount of audio and video data, and the amount of data required to represent the categorization information (described below).
- it may be desirable or necessary to transform digital data into an analog waveform again (e.g., convert digital video data into analog video data for display by a television). This can be accomplished using conventional D/A conversion methods and apparatus.
- the system 100 makes use of two devices for display and control: a primary display device 102 for displaying the primary information and a control device 101 for controlling the operation of the primary display device 102 .
- the control device 101 is physically separate from the primary display device 102 and portable so that the user has flexibility in selecting a position relative to the primary display device 102 during use of the system 100 .
- such an embodiment could allow a user to use the invention while sitting in a chair or on a couch, reclining in bed, or sitting at a table or desk.
- When the secondary information is textual (e.g., the text of news stories) and the control device 101 is used to display such secondary information, the portability of the control device 101 attendant such an embodiment increases the likelihood that the text is displayed on a device that can be held in close proximity to the user, thereby improving the ability of the user to view the text. Further, as discussed in greater detail below, the control device 101 preferably has sophisticated user interface capabilities.
- a system according to the invention can be implemented so that the primary display device 102 displays the primary information while a separate device (e.g., the control device 101 ) displays the secondary information.
- the invention can advantageously be used in situations in which the primary information is audiovisual information (and, in particular, audiovisual information that can vary with time, such as the content of a television program) and the secondary information is text information (some or all of which is, typically, likely to be related to the audiovisual information).
- the use of two different devices for display allows the optimization of the display devices for the particular type of information to be displayed.
- the primary display device 102 is preferably a device that enables high quality audio and video images (in particular, time-varying audio and video images) to be produced, such as a television.
- a television is good for displaying audiovisual information
- the television doesn't do as good a job with the display of text, particularly at typical viewing distances.
- a computer display monitor does a good job of displaying text.
- a computer display monitor can be used to display the secondary information.
- a “computer display monitor” can display not only video, but also audio.
- a portable computer (e.g., a notebook or subnotebook computer)
- the portable computer can also be used to implement the control device 101 , thus allowing the display of the secondary information to be integrated with the user interface used to specify instructions for controlling operation of the system 100 .
- communication between the control device 101 and the rest of the system 100 is advantageously accomplished using a wireless local area network (LAN), infrared link, or other wireless communications system, so that the user will have more freedom of movement when using the control device 101 .
- the system controller 103 can be implemented by any conventional processing device or devices that can accomplish the functions of a system controller as described herein.
- the system controller 103 can be implemented by a conventional microprocessor chip, as well as peripheral and other computer chips that can be configured to perform the functions of the system controller 103 .
- the data storage device 104 can be implemented by any conventional storage devices.
- the data storage device 104 can be implemented, for example, by a conventional computer hard disk (to enable storage of digital data, including analog data—e.g., television or radio signals—that has been digitized), a conventional videotape (to enable storage of, for example, analog data corresponding to acquired television signals) or a conventional audiotape (to enable storage of, for example, analog data corresponding to acquired radio signals).
- system controller 103 and data storage device 104 can be implemented, for example, in a conventional digital computer.
- the devices with which the system controller 103 and data storage device 104 are implemented should have the capability to compress and decompress the audio, video and text data quickly enough to enable real-time display of that data.
- the system controller 103 can communicate with the control device 101 and the primary display device 102 in any appropriate manner, including wire and wireless communications.
- the control device 101 can be embodied by a portable computer (e.g., a Thinkpad™ computer, made by IBM Corp. of Armonk, N.Y.).
- the portable computer and associated display screen facilitate the presentation of a graphical user interface, as will be apparent from the description below.
- the portable computer has a color display screen.
- a color display screen further facilitates implementation of a graphical user interface by enabling color differentiation to be used to enhance the features provided in the graphical user interface.
- the Thinkpad™ can be configured (as known by those skilled in such art) to act as an X/windows terminal (client) that communicates with an X/windows host (server), using standard X/windows protocols (as also known by those skilled in such art), to enable generation and display of the graphical user interface.
- the primary display device 102 , as well as the system controller (X/windows host) 103 , can be embodied, for example, by an Indigo2 workstation computer made by Silicon Graphics Incorporated (SGI) of Mountain View, Calif.
- the portable computer can communicate with the SGI Indigo2 computer via a wireless Ethernet link.
- both of the primary display device 102 and control device 101 could be implemented in a digital computer with the system controller 103 and data storage device 104 (although such an implementation may not have some of the advantages of the embodiments of the invention described above).
- the above-mentioned SGI Indigo2 computer or an IBM-compatible desktop computer could be used to implement a system of the invention in this manner.
- implementation of a system according to the invention in this manner could advantageously be accomplished on a portable computer such as a notebook computer.
- FIG. 2A is a diagrammatic representation of a graphical user interface (GUI) 200 according to the invention that can be used to enable control of the operation of a system according to the invention, display information regarding operation of the system of the invention and display information acquired by the system of the invention.
- a GUI according to the invention can be displayed using any suitable display device.
- the GUI can be implemented by appropriately tailoring conventional computer display software, as known to those skilled in the art in view of the discussion below.
- the GUI 200 can be displayed on the screen of a portable computer.
- the GUI 200 includes four regions: primary information playback control region 201 , primary information map region 202 , related primary information region 203 , and related secondary information region 204 . It is to be understood that the regions 201 , 202 , 203 and 204 could be arranged in a different manner, have different shapes and/or occupy a greater or lesser portion of the GUI 200 than shown in FIG. 2A. Additionally, it is to be understood that a GUI according to the invention need not include all or any of the regions 201 , 202 , 203 or 204 ; it is only necessary that the GUI include features that allow the system according to the invention to be controlled. Thus, for example, a GUI according to the invention could function adequately without a related primary information region 203 .
- the GUI also need not, for example, include a primary information map region 202 or a primary information playback control region 201 having exactly the characteristics described below; other interfaces enabling similar functionality could also be used.
- the GUI could also be implemented so that user interaction with standard GUI mechanisms such as menus and dialog boxes is necessary to cause display of system controls, system operation information, and/or acquired information.
- a GUI according to the invention could be implemented such that a display of the related secondary information region 204 is produced only upon appropriate interaction with one or more menus and/or dialog boxes.
- FIG. 2B is a view of an illustrative GUI 210 in accordance with the diagrammatic representation of FIG. 2A.
- the GUI 210 is particularly tailored for use with an embodiment of the invention in which the primary information includes videotape of one or more news programs and the secondary information includes the text of news stories from text news sources.
- the regions 201 , 202 , 203 and 204 of the generic GUI 200 are described generally, while the corresponding regions 211 , 212 , 213 and 214 of the particular GUI 210 are described in detail.
- the primary information playback control region 201 of the GUI 200 is used to control the manner in which the primary information is displayed on the primary display device 102 .
- the region 201 can be used, for example, to provide a mechanism to enable the user to begin, stop or pause display of the primary information, as well as rewind or fast forward the display.
- the region 201 can also be used, for example, to control the particular primary information that is displayed, as well as the apparent display rate at which the primary information is displayed.
- the primary information playback control region 211 of the GUI 210 includes topic “buttons” 215 , control “buttons” 216 and a speed control 217 . It is to be understood that the functionality of the topic buttons 215 , control buttons 216 and speed control 217 , described below, could be accomplished in a manner other than that shown in FIG. 2B and described below.
- the topic buttons 215 enable the user to select a subject matter category so that, for example, all news stories in the recorded news programs that pertain to the selected subject matter category are displayed one after the other by the primary display device 102 .
- selection of a topic button 215 could cause a list of news stories pertaining to that subject matter category to appear, from which list the user could select one or more news stories for viewing.
- the GUI 210 includes six topic buttons 215 to enable selection of news stories related to international news (“World”), national news (“National”), regional news (“Local”), business news (“Business”), sports news (“Sports”), and human interest news (“Living”); however, a GUI according to the invention can include any number of topic buttons and each button can correspond to any desired subject matter category designation.
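The behavior of a topic button can be illustrated with a small sketch. The `Story` record, its field names, and the function `playlist_for_topic` are hypothetical names introduced here. The text does not state the order in which the matching stories are shown, so ordering by program and then by start time is an assumption.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass(frozen=True)
class Story:
    program: str   # news program the story belongs to
    start: float   # start time within the program, in seconds
    end: float     # end time within the program, in seconds
    topic: str     # subject matter category, e.g. "World" or "Sports"


def playlist_for_topic(stories: Iterable[Story], topic: str) -> List[Story]:
    """Stories in the selected category, to be displayed one after the other."""
    matching = (s for s in stories if s.topic == topic)
    # Assumed ordering: by program name, then by position within the program.
    return sorted(matching, key=lambda s: (s.program, s.start))
```

The same list could instead be presented to the user for manual selection, as the alternative behavior described above suggests.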
- the control buttons 216 enable the user to control which news story is displayed, as well as the manner in which a news story is displayed. Moving from left to right in FIG. 2B, the control buttons 216 respectively cause the display to activate a dialog box that enables the user to perform a keyword search of the text of news stories acquired by the system of the invention, return to the beginning of the currently displayed story to begin displaying the story again, stop the display, start the display, and skip ahead to the next story in a predetermined sequence of stories.
- a GUI according to the invention can include other control buttons that enable performance of other functions instead of, or in addition to, the functions enabled by the control buttons 216 , such as fast forwarding the display, rewinding the display, pausing the display (a particular method according to the invention is described below), and displaying a summarized version of the primary information (a particular method according to the invention is described in more detail below).
- the speed control 217 can be used to increase or decrease the apparent display rate with which the primary information is displayed.
- the speed control display 217 shows a number that represents the amount by which a normal display rate is multiplied to produce the current apparent display rate, and includes a graphical slider bar that can be used to adjust the apparent display rate. The manner in which the apparent display rate can be changed is described in more detail below.
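As a small worked example of the multiplier shown by the speed control: if the number displayed is the factor applied to the normal display rate, the time needed to view a story shrinks by the same factor. The function name `apparent_duration` is an assumption made for illustration only.

```python
def apparent_duration(normal_seconds: float, rate_multiplier: float) -> float:
    """Viewing time of a story when the normal display rate is multiplied by rate_multiplier."""
    if rate_multiplier <= 0:
        raise ValueError("rate_multiplier must be positive")
    return normal_seconds / rate_multiplier
```

For instance, a 90-second story viewed with a multiplier of 1.5 would take 60 seconds.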
- the primary information map region 202 of the GUI 200 provides the user with a description of the content of the primary information that is available for display, as well as information that facilitates navigation through the primary information, and can also be used to allow the user to select particular primary information for display.
- the description of the primary information can include, for example, an illustration or other description of the subdivision of the primary information into smaller portions (e.g., segments) of information. Such illustration or description can convey the number of portions, the length (i.e., time duration) of each portion and the subject matter of each portion.
- the region 202 can also be used to show the user the location within the primary information of the portion of the primary information that is currently being viewed, as well as which (if any) portions of the primary information have previously been viewed. Additionally, the region 202 can be used to enable the user to move freely among portions of the primary information by, for example, using a conventional mouse to point and click on a portion of the primary information that is illustrated in the region 202 .
- the primary information map region 212 of the GUI 210 includes several subdivided rows, each row representing a particular news program (e.g., CNN Headline News, NBC Nightly News, etc.). Each row is a map that illustrates to some level of detail the content of the corresponding news program. Each of the subdivisions of a row represents a break during the news program, such as a break between news stories. The region between each subdivision represents a news story (a region could also represent, for example, an advertisement). The duration of each news story is depicted graphically by the length of the region corresponding to that news story. Each region in a row can be displayed in a particular color, each color representing a particular predetermined subject matter category (i.e., topic), so that the color of each region denotes the subject matter category of the news story corresponding to that region.
- the map region 212 can be further enhanced in any of a variety of ways.
- the news program (row) that is currently being viewed can be marked, such as by, for example, shading the row of the currently viewed news program a particular color or causing a particular type of symbol to appear adjacent to the row of the currently viewed news program.
- news stories that have already been viewed can be marked in an appropriate manner, such as by, for example, causing the regions of the viewed news stories to be cross-hatched or to be shaded a particular color.
- the current viewing location can also be shown: in FIG. 2B, this is shown by a vertical line.
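The geometry of one map row can be sketched as follows: each story becomes a region whose length is proportional to its duration and whose color encodes its topic, and the current viewing location maps to a vertical line. The color table, the pixel-based layout, and the function names are assumptions for illustration. Segments are assumed to be sorted by start time.

```python
from typing import Dict, List, Sequence, Tuple

Segment = Tuple[float, float, str]  # (start seconds, end seconds, topic)

# Hypothetical color assignment; the text only says each color denotes a topic.
TOPIC_COLORS: Dict[str, str] = {
    "World": "red", "National": "blue", "Local": "orange",
    "Business": "purple", "Sports": "green", "Living": "yellow",
}


def _span(segments: Sequence[Segment]) -> Tuple[float, float]:
    """Start time and total duration covered by a row's segments."""
    if not segments:
        raise ValueError("a row needs at least one segment")
    t0 = segments[0][0]
    total = segments[-1][1] - t0
    if total <= 0:
        raise ValueError("row duration must be positive")
    return t0, total


def layout_row(segments: Sequence[Segment], row_width: int) -> List[Tuple[int, int, str]]:
    """Return (left x, right x, color) for each region; length is proportional to duration."""
    t0, total = _span(segments)
    scale = row_width / total
    return [
        (round((start - t0) * scale), round((end - t0) * scale),
         TOPIC_COLORS.get(topic, "gray"))  # uncategorized regions (e.g. ads) drawn gray
        for start, end, topic in segments
    ]


def marker_x(segments: Sequence[Segment], row_width: int, current_time: float) -> int:
    """Horizontal position of the vertical line marking the current viewing location."""
    t0, total = _span(segments)
    return round((current_time - t0) * row_width / total)
```

The same mapping, run in reverse, would turn a mouse click at a given x position into a time within the program, which is one way the point-and-click navigation described above could be supported.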
- the related primary information region 203 of the GUI 200 displays “thumbnails” which identify segments of the primary information that are related to the primary information that is currently being displayed.
- Although the region 203 includes four thumbnails 203 a , 203 b , 203 c , 203 d , the region 203 can generally be used to display any number of thumbnails.
- the thumbnails can take any form, such as a display of a portion of the segment or a display of a representation of the segment.
- the thumbnails 203 a , 203 b , 203 c , 203 d can be single video images that represent the video data of the segment being identified (“keyframes”). (As seen in FIG.
- the related primary information region 213 of the GUI 210 includes three single video images that each represent a news story from a news program.
- the thumbnails 203 a , 203 b , 203 c , 203 d could be a text summary or other text identifier of the segment being identified.
- the thumbnails 203 a , 203 b , 203 c , 203 d could be pictorial representations that identify the corresponding segment.
- a threshold of relatedness (the expression of the threshold depending upon the method used to determine relatedness) is preferably specified so that only segments that are sufficiently related to the displayed segment are displayed in the related primary information region 203 , even if that means that fewer than the allotted number of segments (including no segments) are displayed. If appropriate, redundant segments can be eliminated from the primary information segments to be displayed in the related primary information region 203 , using techniques similar to those described below for eliminating redundant segments from a set of segments identified as similar to a designated segment (e.g., eliminating redundant secondary information segments that are similar to a displayed primary information segment).
- Identification of the relatedness of primary information segments can be accomplished by determining the degree of similarity between the primary information segment being displayed and each other primary information segment.
- the degree of similarity can be determined using any appropriate method, such as, for example, relevance feedback.
- the use of relevance feedback to determine the similarity between two segments is discussed in more detail below with respect to the determination of the relatedness of primary and secondary information segments (see, in particular, section IV.B.2. below).
- the use of relevance feedback necessitates that sets of text data that represent the primary information segments be created (by, for example, using a conventional speech recognition method to create a transcript of the spoken portion of the audio data set) if such sets of text data do not already exist (e.g., a closed-caption transcript).
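The thresholding described above can be sketched as follows. This is a minimal illustration, not the patent's implementation: the function name `select_related`, the use of a numeric similarity score per segment, and the example values are all assumptions, and the scores themselves would come from whatever relatedness method is used (e.g., relevance feedback).

```python
def select_related(scores, threshold, max_count):
    """Pick the segments to show in the related-information region.

    scores:    dict mapping a (hypothetical) segment id to a similarity score
    threshold: minimum score for a segment to count as sufficiently related
    max_count: the allotted number of display slots (e.g., four thumbnails)

    Returns at most max_count segment ids, most similar first. Fewer than
    max_count (possibly none) are returned if too few segments qualify.
    """
    qualifying = [(score, seg) for seg, score in scores.items() if score >= threshold]
    qualifying.sort(key=lambda pair: pair[0], reverse=True)
    return [seg for _, seg in qualifying[:max_count]]


if __name__ == "__main__":
    example = {"a": 0.9, "b": 0.2, "c": 0.6, "d": 0.75, "e": 0.8, "f": 0.95}
    print(select_related(example, threshold=0.5, max_count=4))
```

Applying the threshold before the slot limit is what allows the region to show fewer thumbnails than it has room for.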
- each keyframe should be representative of the video content of the segment being identified.
- Each keyframe can be, for example, a video frame selected from the video data representing the segment.
- the keyframe can be selected from the video data in any appropriate manner.
- the keyframe can be a video frame that occurs at a specified location within the video data of the segment.
- the primary information comprises television news stories
- a video frame that occurs one tenth of the way through the video data representing the news story is selected.
- One tenth was chosen because it was determined empirically that video frames of particular relevance to the content of a television news story tend to occur at about that point in the television news story.
- the keyframe can be selected based upon an analysis of the content of the video data.
- One method of accomplishing this is described in detail in the commonly owned, co-pending U.S. patent application entitled “A Method of Compressing a Plurality of Video Images for Efficiently Storing, Displaying and Searching the Plurality of Video Images,” by Subutai Ahmad, Ser. No. 08/528,891, filed on Sep. 15, 1995, the disclosure of which is incorporated by reference herein.
- the content of each video frame is represented by a vector.
- the vector can comprise, for example, the discrete cosine transform (DCT) coefficients for the video frame, as known to those skilled in the art of video image analysis.
- the DCT coefficients indicate, for example, how much objects in a video frame have moved since the previous video frame.
- the keyframe is selected as the video frame that is represented by a vector that is closest to the average vector for the video data. This method of selecting a keyframe can be advantageous as compared to the arbitrary selection of a video frame that occurs at a specified location within the video data, since it is likely to result in the selection of a video frame that is more representative of the video content of the segment.
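The two keyframe selection strategies described above (a frame at a fixed fractional position, and the frame whose vector is closest to the average vector) can be sketched as follows. The function names are hypothetical, each frame is assumed to be already reduced to a numeric vector (e.g., DCT coefficients), and Euclidean distance is an assumed choice of "closest".

```python
from math import dist


def positional_keyframe(frame_count, fraction=0.1):
    """Index of the frame that occurs `fraction` of the way through the segment."""
    if frame_count <= 0:
        raise ValueError("segment has no frames")
    return min(frame_count - 1, int(frame_count * fraction))


def average_vector_keyframe(frame_vectors):
    """Index of the frame whose vector is closest to the segment's mean vector.

    frame_vectors: list of equal-length numeric sequences, one per frame.
    Euclidean distance is an assumption; any vector distance could be used.
    """
    if not frame_vectors:
        raise ValueError("segment has no frames")
    count = len(frame_vectors)
    dims = len(frame_vectors[0])
    mean = [sum(vec[d] for vec in frame_vectors) / count for d in range(dims)]
    return min(range(count), key=lambda i: dist(frame_vectors[i], mean))


if __name__ == "__main__":
    vectors = [[0, 0], [1, 1], [2, 2], [10, 10]]
    print(positional_keyframe(300), average_vector_keyframe(vectors))
```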
- multiple keyframes can be identified from the video data and the keyframes “tiled,” i.e., presented together adjacent to each other.
- the video data can be analyzed and a composite video frame synthesized from the video data. Any technique for synthesizing a video frame or frames can be used.
- the keyframe may also be a video frame or frames that are not selected from the video data.
- a representative video image e.g., one or more video frames
- a news story about baseball could be represented by a keyframe showing a batter swinging at a pitch.
- Such selection can be done manually, i.e., at some point, a person reviews or is made aware of the content of the segment and, based upon that knowledge, associates a video image from the library with the segment.
- such selection can be accomplished automatically (meaning, here, without human intervention, except to establish the criteria for the selection process) by analyzing the audiovisual data of the segment (e.g., with an appropriately programmed digital computer) to ascertain the content of the segment and, based upon that analysis, associating a video image from the library with the segment.
- the content of the segment could be determined, for example, using a categorization method as described in more detail below.
- the segment to be categorized could either be compared to previously categorized segments that can be displayed by the system of the invention, or to a library of “control segments”, each of which contains words germane to a particular subject.
- the GUI 200 can be implemented, using conventional interface methods, so that a user of a system of the invention can select (e.g., by pointing and clicking with a mouse) one of the thumbnails 203 a , 203 b , 203 c , 203 d to cause the corresponding primary information segment to be displayed. (The map in the primary information map region 202 is adjusted accordingly.)
- the related secondary information region 204 of the GUI 200 provides the user information from a secondary information source or sources, the secondary information being related to the primary information currently being displayed.
- Although the region 204 includes two secondary information displays 204 a , 204 b , the region 204 can, in general, include any number of secondary information displays.
- the secondary information displays 204 a , 204 b can take any form.
- the secondary information displays 204 a , 204 b could be single video images, moving video images or sets of text. (As shown in FIG.
- the related secondary information region 214 of the GUI 210 includes three sets of text that each are a story from a text news source.
- the secondary information displays 204 a , 204 b typically change as well.
- segments of secondary information that are related to the primary information that is being displayed can be identified in a manner discussed in more detail below.
- the system according to the invention can also be implemented so that the user can cause various parts of the secondary information displays 204 a , 204 b to be displayed, e.g., the user can be enabled to scroll up and down through a set of text or move back and forth through a video clip, using conventional GUI tools such as mouse pointing and clicking.
- GUI “buttons” as illustrated in the primary information playback control region 211 of the GUI 210 of FIG. 2B
- the manner in which the primary information is displayed could be controlled using a rotating knob device. Rotation of the knob in one direction could cause the display of the primary information to move forward (play); rotation of the knob in the other direction could cause the display of the primary information to move backward (rewind).
- the knob could be constructed so that as the knob is rotated the user feels detents at certain points in the rotation. Each detent could correspond to a particular apparent display rate of the display. For example, when the knob is positioned in a home position, the display is stopped.
- When the knob is rotated clockwise from the home position, the display moves forward, the first detent in the clockwise direction causing the display to occur at a normal display rate, the second detent specifying a target apparent display rate of, for example, 1.5 times the normal display rate, the third detent specifying a target apparent display rate of, for example, 2.0 times the normal display rate, and so on.
- When the knob is rotated counterclockwise, the display moves backward (i.e., in a chronological direction opposite that in which the display normally progresses).
- the first detent corresponds to normal display rate
- the second detent specifies a target display rate of, for example, 1.5 times the normal display rate, and so on.
- the maximum rotation of the knob in either direction could be limited, the maximum rotation corresponding to a maximum target apparent display rate.
- the knob could be positioned at any position in between, thus allowing the target apparent display rate to be varied continuously between the maximum forward and backward display rates.
- the knob could also include a centrally located pushbutton to, for example, enable skipping from the display of one segment of the primary information to a next segment of the primary information.
- the knob could be constructed so that the position of the knob (or activation of the pushbutton) is transmitted to the remainder of the system using wireless communications, thus providing the user with relatively large freedom of movement during use of the system.
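The knob behavior described above can be sketched as a mapping from knob angle to a signed target apparent display rate. The detent spacing of 30 degrees, the limit of three detents per direction, and linear interpolation between detents are all assumptions made for illustration; only the example rates (stopped, 1.0, 1.5 and 2.0 times normal) come from the description.

```python
DETENT_DEGREES = 30.0                # assumed angular spacing between detents
DETENT_RATES = [0.0, 1.0, 1.5, 2.0]  # home, first, second, third detent


def target_display_rate(angle_degrees):
    """Map a knob angle to a signed target apparent display rate.

    Positive angles (clockwise) play forward, negative angles
    (counterclockwise) play backward. Rotation beyond the last detent is
    clamped to the maximum rate, and positions between detents are
    linearly interpolated (an assumed way of varying the rate continuously).
    """
    max_angle = DETENT_DEGREES * (len(DETENT_RATES) - 1)
    clamped = max(-max_angle, min(max_angle, angle_degrees))
    position = abs(clamped) / DETENT_DEGREES
    lower = int(position)
    upper = min(lower + 1, len(DETENT_RATES) - 1)
    fraction = position - lower
    rate = DETENT_RATES[lower] + fraction * (DETENT_RATES[upper] - DETENT_RATES[lower])
    return rate if clamped >= 0 else -rate


if __name__ == "__main__":
    for angle in (0, 30, 45, 60, -60, 500):
        print(angle, target_display_rate(angle))
```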
- the system controller 103 causes data to be acquired from the primary information source 107 and the secondary information source 108 , as described above.
- the data is acquired using methods and apparatus that are appropriate to the type of data being acquired.
- the system controller 103 can acquire data representing television broadcasts using conventional equipment for receiving (e.g., a television set and antenna) and recording (e.g., a conventional videocassette recorder) television signals.
- the system controller 103 can acquire data representing radio broadcasts using conventional equipment for receiving (e.g., a radio and antenna) and recording (e.g., a conventional audiotape recorder) radio signals.
- the system controller 103 can acquire computer-readable data files (that can include text data, audio data, video data or some combination of two or more of those types of data), using conventional communications hardware and techniques, over a computer network (e.g., a public network such as the Internet or a proprietary network such as America Online™, CompuServe™ or Prodigy™) from an information providing site that is part of that network.
- the system controller 103 acquires primary information including the television signals representing the content of designated television news broadcasts, and secondary information including computer-readable data files that represent the content of designated news stories from text news sources.
- the data can be acquired according to a pre-established schedule (that can be stored, for example, by the data storage device 104 ).
- Data can be acquired at any desired frequency and the scheduled acquisition times specified in any desired manner (e.g., hourly, daily at a specified time, weekly on a specified day at a specified time, or after the occurrence of a specified event).
- the schedule can be used, for example, to program a videocassette recorder to record particular television programs at particular times.
- the schedule can be used, for example, to appropriately program a computer to retrieve desired data files from particular network sites (e.g., by specifying an appropriate network address, such as a URL) of a computer network at specified times.
- connection over the network to the site or sites from which data is to be obtained can be accomplished by, for example, inserting a communications daemon into a startup file that is executed at the beginning of operation of the operating system of a computer used to implement the system controller 103 .
- the daemon can initiate a WinSock TCP/IP connection to enable connection to be made to the network site.
- the acquired data must be stored.
- analog data, such as television or radio signals, can be stored on an appropriate medium, such as videotape or audiotape.
- some or all of the data acquired by a system according to the invention is, if not already in that form, converted to digital data.
- the digital data can be stored on a conventional hard disk having adequate capacity, as described above.
- the digital data can be compressed using conventional techniques and equipment.
- a half hour television news program requires approximately 250 MB of hard disk storage capacity when the video is recorded using Adobe Premiere with Radius Studio compression at 15 fps and “high” quality capture at 240 × 180 resolution, and the audio is recorded at approximately 22 kHz.
- Appropriate rules can be established to handle situations in which the data storage device 104 (whether single or multiple devices) has insufficient data storage capacity to store new data. For example, the oldest data can be deleted, as necessary, to make room for new data.
- Where the primary information is the content of designated television news programs and the secondary information is the content of designated text news stories, the oldest stored programs can be deleted as necessary to make space to store the new programs, and text stories that are older than a specified length of time (e.g., several days) are automatically deleted.
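The storage rules described above (delete the oldest programs to make room, and expire text stories past a given age) can be sketched as follows. The tuple layout, the function names, and the use of plain numbers for acquisition times and sizes are assumptions made for illustration.

```python
def make_room(stored, needed_mb, capacity_mb):
    """Delete the oldest items until `needed_mb` of new data fits.

    stored: list of (acquired_at, name, size_mb) tuples; acquired_at can be
            any sortable time value (a number is assumed here).
    Returns (kept, deleted), both ordered oldest first.
    """
    kept = sorted(stored)
    deleted = []
    used = sum(size for _, _, size in kept)
    while kept and used + needed_mb > capacity_mb:
        oldest = kept.pop(0)
        used -= oldest[2]
        deleted.append(oldest)
    return kept, deleted


def expire_old_stories(stories, now, max_age):
    """Keep only the (acquired_at, name) stories no older than max_age."""
    return [story for story in stories if now - story[0] <= max_age]


if __name__ == "__main__":
    programs = [(3, "c", 250), (1, "a", 250), (2, "b", 250)]
    print(make_room(programs, needed_mb=250, capacity_mb=800))
    print(expire_old_stories([(1, "old"), (9, "new")], now=10, max_age=3))
```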
- the GUI 200 can also include a mechanism for enabling the user to specify the particular information desired, i.e., specify particular information providers (e.g., news networks, such as CNN, NBC, ABC or CBS, or information services, such as Clarinet™) and data acquisition schedules for both the primary information source 107 and the secondary information source 108 .
- a system according to the invention may be instructed to acquire new information at the same time that the system is instructed to display other information.
- limitations of the devices or configuration of the system of the invention can impede or prevent such simultaneous acquisition and display.
- the operating speed of a hard disk used to store the data describing the acquired information can limit the capacity of the system for such simultaneous operation: for typical amounts of audiovisual data, current conventional hard disks may not operate at a speed that is adequate to enable the simultaneous storing of data to, and accessing of stored data from, the hard disk.
- a conventional graphical user interface mechanism (e.g., a dialog box) is used to alert the user of the system to the conflict and offer a choice between continuing with the display (thus delaying or eliminating the data acquisition) or ending the display and allowing the data acquisition to occur.
- the user can be alerted of an impending data acquisition at some predetermined time before the data acquisition is scheduled to begin. Similar to the choice described above, the user can be presented with a choice to continue with the display at that time or allow the data acquisition to occur.
- the system of the invention can default to one or the other mode of operation (i.e., data acquisition or display) if the user does not make a selection.
- the hard disk operating speed limitation described above can be alleviated or overcome by using multiple hard disks so that if data acquisition begins at a time when data is being accessed for use in generating a display, the newly acquired data is stored to a hard disk that does not contain any previously stored data (or that, based upon evaluation of one or more predetermined rules, does not contain data that is expected to be accessed during the time that the new data is being acquired), thus ensuring that data access and data storage will not occur simultaneously for a single hard disk.
- the hard disk operating speed limitation can be addressed by using only some portion of the available data to generate the information display, thus freeing more time for use in storing data to the hard disk. However, this latter approach may decrease the fidelity of the display unacceptably.
- the data being acquired can be stored on a data storage device of one type, while the data to be used for generating a display is accessed from a data storage device of another type.
- incoming television signals could be stored on a videocassette tape by a VCR, while digital data from previous television transmissions is retrieved from a hard disk for use in generating a television display of the previously acquired data.
- the data recorded by the VCR could be digitized at a later time and stored on the hard disk for subsequent use (which use may also occur at a time at which incoming television signals are being acquired by the VCR).
- the data representing the primary and secondary information are not provided from the primary and secondary information sources in a form that enables the various aspects of the invention described herein to be realized.
- It is therefore necessary to structure the data, i.e., to organize and categorize the data, and relate particular data to other data.
- the primary and secondary information can be, and typically are, divided (“partitioned”) into smaller related sets of information.
- of particular utility for the invention is the identification within the primary and secondary information of contiguous related sets of information that typically concern a single theme or subject and that can be delineated in some manner from adjacent information.
- each such contiguous related set of information can be referred to as a “segment” of the primary or secondary information.
- Segments within the primary information are “primary information segments” while segments within the secondary information are “secondary information segments.” For example, if the primary information includes the content of several news programs, the primary information can be divided into particular news programs and each news program can further be broken down into particular news stories within the news program, each news story being denoted as a segment.
- the secondary information can be divided into particular text sources and each text source can be further divided into separate text stories, each text story being denoted as a segment.
- a “segment” may sometimes, strictly speaking, not be contiguous in time (though it is contiguous in content).
- a news story that is interrupted by a commercial break and then continues after the commercial break may be defined as a single segment, particularly if the body of information is modified so that commercial breaks, and other extraneous portions of the body of information, are eliminated (an approach that, generally, is preferred, though such portions could also be treated as segments).
- each segment of the primary information can be identified within the data storage device which stores the data representing the primary information, in a manner known by those skilled in the art (e.g., by maintaining a table of segment identifiers and associated locations of the beginning of the identified segment), thus enabling the primary information segments to be accessed randomly so that the user can change the displayed segment freely among the primary information segments.
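A table of segment identifiers and start locations, as mentioned above, can be sketched as follows. The class name `SegmentTable`, the use of seconds as the location unit, and the example identifiers are assumptions; the lookup of the segment containing a given playback position is an added convenience of the kind a map display would need.

```python
from bisect import bisect_right


class SegmentTable:
    """Maps segment identifiers to the location where each segment begins."""

    def __init__(self, entries):
        # entries: iterable of (segment_id, start_seconds) pairs, in any order
        self._entries = sorted(entries, key=lambda entry: entry[1])
        self._start_by_id = dict(self._entries)
        self._starts = [start for _, start in self._entries]

    def start_of(self, segment_id):
        """Location to seek to in order to display the given segment."""
        return self._start_by_id[segment_id]

    def segment_at(self, position_seconds):
        """Identifier of the segment that contains a playback position."""
        index = bisect_right(self._starts, position_seconds) - 1
        if index < 0:
            raise ValueError("position precedes the first segment")
        return self._entries[index][0]


if __name__ == "__main__":
    table = SegmentTable([("story-2", 95.0), ("story-1", 0.0), ("story-3", 240.5)])
    print(table.start_of("story-2"), table.segment_at(100.0))
```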
- identification of primary information segments also enables the creation of the map region 202 of the GUI 200 (FIG. 2).
- each segment of the primary information can be correlated, as described in more detail below, with segments of the secondary information, thereby enabling one or more secondary information segments that are sufficiently related to a primary information segment to be displayed at the same time that the primary information segment is displayed.
- the correlation of primary information segments with secondary information segments can also be used to categorize the primary information segments according to subject matter, thus enabling the user to sort or to cause display of segments of the primary information that pertain to a particular subject matter category (see the discussion of the topic buttons 215 in the playback control region 211 of the GUI 210 shown in FIG. 2B).
- partitioning of a set of data requires some analysis of the data to identify “breaks” within the data, i.e., differences between adjacent data that are of sufficient magnitude to indicate a significant change in the content of the information represented by the data.
- a break may signify a demarcation of one segment from another, but need not necessarily do so: a break may also signify, for example, a change in the video image within a segment or a change of speakers within a segment.
- Partitioning of text data is often straightforward.
- bodies of information that are collections of segments (e.g., stories) from text sources that are represented as computer-readable data typically include markers that identify the breaks between segments.
- text transcripts of bodies of information represented as a set of audiovisual information also frequently include markers that identify breaks between segments of the information.
- closed caption text data that can accompany the audio and video data of a set of audiovisual data often includes characters that indicate breaks in the text data (most news broadcasts, for example, include closed caption text data containing markers that designate story and paragraph boundaries, the beginning and end of advertisements, and changes in speaker) and, in particular, characters that explicitly designate breaks between segments (e.g., markers that identify story boundaries).
- Partitioning of such text data, then, requires only the identification of the location (e.g., if the text transcript of a set of audiovisual data is time-stamped, the time of occurrence) of the markers within the text data.
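Marker-based partitioning of a time-stamped transcript can be sketched as follows. The marker string `>>>` and the `(time, text)` layout of a transcript line are assumptions made for illustration; the actual marker characters depend on the text source.

```python
STORY_MARKER = ">>>"  # hypothetical marker; the real characters depend on the source


def story_break_times(caption_lines, marker=STORY_MARKER):
    """Return the time-stamps of transcript lines that begin with the marker.

    caption_lines: list of (time_seconds, text) pairs in chronological order.
    """
    return [time for time, text in caption_lines if text.lstrip().startswith(marker)]


if __name__ == "__main__":
    lines = [
        (0.0, ">>> Good evening."),
        (4.2, ">> Thanks, Jane."),
        (61.0, ">>> In sports tonight"),
    ]
    print(story_break_times(lines))
```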
- the text data can be partitioned based upon analysis of the content of the text data.
- breaks between segments can be determined, for example, based upon identification of the occurrence of a particular word, sequence of words, or pattern of words (particularly words that typically indicate a transition), and identification of changes in speaker.
- phrases of the form, “Jane Doe, WXYZ news, reporting live from Anytown, USA,” can indicate a break between segments.
- Partitioning of audio and video data typically requires some non-trivial analysis of the data.
- the partitioning of audio and video data in accordance with the invention can be accomplished in any suitable manner. Some examples of methods that can be used to accomplish partitioning of audio or video data are described below. (These methods are applicable to digital data; thus, if the primary information is initially analog, it must be digitized before partitioning.)
- the audio and video data are synchronized as a result of having been recorded together.
- partitioning of either the audio or the video data will result in a corresponding partitioning of the other of the audio and video data.
- If the audio and video data are not synchronized, then such synchronization must be accomplished, in addition to partitioning one of the audio or video data, so that the other of the audio and video data can be partitioned in like manner.
- Partitioning of audio data can be accomplished in any of a number of ways.
- the audio data can be partitioned using a known voice recognition method.
- a voice recognition method that could be used with the invention is described in “A Gaussian Mixture Modeling Approach to Text-Independent Speaker Identification,” by Douglas Reynolds, PhD thesis, Dept. of Electrical Engineering, Georgia Institute of Technology, 1992, the disclosure of which is incorporated by reference herein.
- Voice recognition methods can be tailored to, for example, identify a break in the audio data when a particular voice speaks, when a particular sequence of voices speak, or when a more complicated occurrence of voices is identified (e.g., the occurrence of two voices within a specified time of each other, or the occurrence of a voice followed by a silence of specified duration).
- a break between news stories could be identified when a particular newscaster's voice is followed or preceded by a silence of specified duration.
- the audio data can be partitioned using a known word recognition method.
- a conventional speech recognition method (a large variety of which are known to those skilled in that art) can be used to enable identification of words.
- the identified words can then be analyzed in the same manner as that described above for analysis of text data, e.g., transition words or speaker changes can be used to indicate breaks.
- a break between news stories could be identified when one of a set of particular word patterns occurs (e.g., “we go now to”, “update from”, “more on that”).
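Detection of breaks from transition word patterns can be sketched as follows, assuming the recognized words are available as time-stamped utterances. The three phrases come from the example above; the function name, the `(time, text)` layout and case-insensitive matching are assumptions.

```python
import re

# Example transition phrases from the description; a real system would
# likely use a larger, source-specific set.
TRANSITION_PATTERNS = [r"\bwe go now to\b", r"\bupdate from\b", r"\bmore on that\b"]
_TRANSITION_RE = re.compile("|".join(TRANSITION_PATTERNS), re.IGNORECASE)


def find_transition_breaks(utterances):
    """Return the times of utterances that contain a transition phrase.

    utterances: list of (time_seconds, text) pairs, e.g. produced by a
    speech recognizer or taken from a time-stamped transcript.
    """
    return [time for time, text in utterances if _TRANSITION_RE.search(text)]


if __name__ == "__main__":
    sample = [
        (10.0, "We go now to the White House."),
        (30.0, "The weather is mild."),
        (55.5, "Here is an update from London."),
    ]
    print(find_transition_breaks(sample))
```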
- Audio data can also be partitioned using music recognition, i.e., a break is identified when specified music occurs.
- a method for partitioning audio data in this way is described in detail in the commonly owned, co-pending U.S. patent application entitled “System and Method for Selective Recording of Information,” by Michelle Covell and Meg Withgott, Ser. No. 08/399,482, filed on Mar. 7, 1995, the disclosure of which is incorporated by reference herein. Partitioning of audio data using music recognition can be particularly useful when transitions between segments of the body of information are sometimes made using standard musical phrases.
- music recognition can be used to partition certain news programs (e.g., The MacNeil/Lehrer news hour) which use one or more standard musical phrases to transition between news stories.
- Pause recognition is based on the assumption that a pause occurs at the time of a significant change in the content of the primary information. For many types of information, such as news programs, this is a workable assumption. A break is identified each time a pause occurs. A pause can be defined as any period of silence longer than a specified duration.
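Pause recognition can be sketched as a scan for runs of quiet samples. The amplitude threshold used to decide what counts as silence, the representation of audio as a list of sample amplitudes, and the treatment of a run of exactly the minimum length as a pause are all assumptions made for illustration.

```python
def find_pauses(samples, sample_rate, silence_level, min_pause_seconds):
    """Return (start_seconds, end_seconds) for each sufficiently long pause.

    samples:           sequence of audio sample amplitudes
    sample_rate:       samples per second
    silence_level:     amplitudes at or below this magnitude count as silence
    min_pause_seconds: minimum length of a silent run to count as a pause
    """
    min_length = int(min_pause_seconds * sample_rate)
    pauses = []
    run_start = None
    # A trailing None acts as a sentinel that closes a run at the end of the data.
    for index, sample in enumerate(list(samples) + [None]):
        quiet = sample is not None and abs(sample) <= silence_level
        if quiet and run_start is None:
            run_start = index
        elif not quiet and run_start is not None:
            if index - run_start >= min_length:
                pauses.append((run_start / sample_rate, index / sample_rate))
            run_start = None
    return pauses


if __name__ == "__main__":
    audio = [5] * 10 + [0] * 8 + [5] * 5 + [0] * 3 + [5] * 4
    print(find_pauses(audio, sample_rate=10, silence_level=1, min_pause_seconds=0.5))
```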
- Video data can be partitioned, for example, by searching for scene breaks, a method similar to the pause recognition method for partitioning audio data discussed immediately above.
- One method of accomplishing this is described in detail in the above-mentioned U.S. patent application entitled “A Method of Compressing a Plurality of Video Images for Efficiently Storing, Displaying and Searching the Plurality of Video Images,” by Subutai Ahmad.
- the content of each video frame is represented by a vector, as described above.
- the vector for each video frame is compared to the vector of the immediately previous video frame and the immediately subsequent video frame, i.e., vectors of adjacent video frames are compared.
- a break is identified each time the difference between the vectors of adjacent video frames is greater than a predetermined threshold.
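Threshold-based scene break detection on per-frame vectors can be sketched as follows. Euclidean distance is an assumed measure of the "difference" between vectors, and the function name and example values are illustrative.

```python
from math import dist


def scene_breaks(frame_vectors, threshold):
    """Return each index i where a break falls between frame i-1 and frame i.

    frame_vectors: list of equal-length numeric sequences, one per frame.
    threshold:     a break is reported when the distance between adjacent
                   frame vectors exceeds this value.
    """
    return [
        i
        for i in range(1, len(frame_vectors))
        if dist(frame_vectors[i - 1], frame_vectors[i]) > threshold
    ]


if __name__ == "__main__":
    frames = [[0, 0], [0, 1], [10, 10], [10, 11], [0, 0]]
    print(scene_breaks(frames, threshold=5))
```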
- a predetermined number of partitions is specified and the video frames are partitioned to produce that number of partitions (the partitioning can be accomplished by considering each video frame to be initially partitioned from all other video frames and recursively eliminating the partition between partitioned video frames having the least difference, or considering none of the video frames to be partitioned and recursively establishing partitions between unpartitioned video frames having the greatest difference).
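Partitioning into a predetermined number of partitions can be sketched as follows. This illustrates the second variant described above (starting with no partitions and establishing breaks at the greatest differences): because the difference between each pair of adjacent frames is fixed, repeatedly choosing the greatest remaining difference is the same as taking the largest differences outright. Euclidean distance and the function name are assumptions.

```python
from math import dist


def partition_into(frame_vectors, partition_count):
    """Return the sorted frame indices at which partitions begin.

    The result has partition_count - 1 entries; an index i means a break
    falls between frame i-1 and frame i.
    """
    if not 1 <= partition_count <= max(1, len(frame_vectors)):
        raise ValueError("partition_count must be between 1 and the frame count")
    differences = [
        (dist(frame_vectors[i - 1], frame_vectors[i]), i)
        for i in range(1, len(frame_vectors))
    ]
    strongest = sorted(differences, key=lambda d: d[0], reverse=True)
    return sorted(index for _, index in strongest[: partition_count - 1])


if __name__ == "__main__":
    frames = [[0, 0], [0, 1], [10, 10], [10, 11], [0, 0]]
    print(partition_into(frames, 3))
```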
- scene breaks could be identified based upon the magnitude of the overall changes in color of the pixels of adjacent video frames (a color change having a magnitude above a specified threshold is identified as a scene break). Or, scene breaks could be identified based upon the magnitude of the compression ratio for a particular set of adjacent video frames (a relatively small amount of compression indicates a relatively large change between video frames and, likely, a change in scenes, i.e., a scene break).
- the above-described methods for partitioning audio or video data directly may not, by themselves, enable identification of segment breaks to be accomplished easily or at all.
- pause recognition or scene break identification typically are not implemented in a manner that enables distinguishing between segment breaks and other breaks.
- Voice recognition may not, alone, be a reliable indicator of segment breaks, since switches in speaker often occur for reasons unrelated to a segment break.
- Word recognition, too, may be erratic in determining segment breaks; it also requires obtaining a text transcript of the audio.
- Music recognition works well only with a limited number of information sources, i.e., information sources that use well-defined musical transitions.
- markers similar to those discussed above with respect to closed caption text data
- the invention contemplates use of such markers to segment audio and/or video data.
- If a set of audiovisual data also includes text data (e.g., a closed caption transcript of the spoken audio), it is possible to partition the audiovisual data by partitioning the text data, then using the partitioned text data to partition the audio data and video data in a corresponding manner. Even if the audiovisual data does not initially include text data, the text data can be produced using a speech recognition method. The text data can be partitioned using any appropriate method, as described above.
- the text data, audio data and video data are each time-stamped. Theoretically, then, once segment breaks are determined in the text data, the time-stamps of the beginning and end of each segment within the text data could be used directly to identify segment breaks within the audio data and/or video data.
- the text data is typically not exactly synchronized with the audio data and video data (e.g., the text data of a particular segment may begin or end several seconds after the corresponding audio or video data), making such a straightforward approach infeasible.
- the time-stamps of the segment breaks in the text data can be used to enable synchronization of those segment breaks with the corresponding segment breaks in the audio and video data. Such synchronization can be accomplished using any appropriate technique.
- One way to partition the audio and video data based upon the partition of the text data is to use a synchronization of the complete set of audio data with the complete set of text data, and a synchronization of the complete set of audio data with the complete set of video data to identify the partitions in the audio and video data.
- the latter synchronization typically exists as a consequence of the manner in which the audio and video data are obtained.
- synchronization between the text data and the audio data frequently does not already exist, and, if it does not, obtaining such synchronization can be computationally expensive.
- a simpler approach is to determine the segment breaks in the audio and video data from the segment breaks in the text data based upon a rule or rules that exploit one or more characteristics of the body of information.
- a rule might be based on an observation that segment breaks in the audio and/or video data of a set of audiovisual data bear a relatively fixed relationship to the corresponding segment breaks in the corresponding text data. For example, it was observed that the video data of a news story from an audiovisual news program frequently begins about 5 to 10 seconds before the closed caption text data of the news story. Thus, in one embodiment of the news browser implementation of the invention, the beginning of the video data of a news story is assumed to be 4 seconds prior to the closed-caption text data. This enables most of the relevant video data to be captured, while reducing the possibility of capturing extraneous video. This approach was found to be accurate within 2 seconds for CNN Headline News and the news programs of the NBC, ABC and CBS television broadcasting networks.
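The fixed-offset rule can be illustrated with a minimal Python sketch. The function names, the clamp at time zero, and the assumption that each story's video ends where the next story's video begins are illustrative choices, not details given in the description; only the 4-second default comes from the embodiment described above.

```python
def estimate_video_start(caption_start_s, lead_s=4.0):
    """Estimate where a story's video begins from its caption time-stamp.

    lead_s is the assumed fixed lead of the video over the closed-caption
    text (4 seconds in the embodiment described above).
    """
    # Clamp at zero so a story near the start of the program stays in range
    # (the clamp is an assumption of this sketch).
    return max(0.0, caption_start_s - lead_s)


def video_segment_bounds(caption_starts_s, program_end_s, lead_s=4.0):
    """Return (start, end) times of each story's video, in seconds.

    Assumes, for illustration only, that each story ends where the next
    story begins and that the last story runs to the end of the program.
    """
    starts = [estimate_video_start(t, lead_s) for t in caption_starts_s]
    ends = starts[1:] + [program_end_s]
    return list(zip(starts, ends))


bounds = video_segment_bounds([2.0, 65.0, 130.0], program_end_s=180.0)
```

A rule of this kind costs almost nothing to apply, which is the advantage over computing a full text-to-audio synchronization.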
- FIG. 3 is a flow chart of a method 300 , in accordance with this aspect of the invention, for identifying the boundaries of segments in a body of information.
- the time-stamps associated with the segment breaks in the text data can be used to approximate the location of the corresponding segment breaks in the audio and video data, as described above.
- a window of data e.g., audio or video data in the context of the current discussion
- This can be accomplished, for example, by specifying a time range that includes the time associated with the segment break in the text data (e.g., the time of occurrence of the segment break in the text data plus or minus several seconds) and identifying audio and/or video data that falls within that time range from the time-stamps associated with the audio and/or video data.
- the fine partitioning step 303 can then be used to identify breaks within the audio and/or video data.
- the fine partitioning can be accomplished using any appropriate method, such as one of the above-discussed methods (i.e., scene break identification, pause recognition, voice recognition, word recognition, or music recognition) to identify breaks in audio and video data.
- the fine partitioning can be performed on the entire set of audio data or video data, or only on the audio or video data that occurs within the time range.
- the data within the time range can then be examined to identify the location of a break or breaks within the time range. If more than one break is identified, the “best” break, measured according to the criteria of the partitioning method used, can be identified as the segment break, or the break occurring closest in time to the approximate segment break can be identified as the segment break.
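As a rough sketch of this selection step, the Python below picks one break from a list of candidate breaks produced by some fine-partitioning method. The `(time, score)` representation, the function names, and the fallback to the approximate time when no candidate falls inside the window are assumptions made for illustration.

```python
def breaks_in_window(candidates, approx_time_s, half_width_s):
    """Keep candidate breaks whose time lies within the window.

    candidates: list of (time_s, score) pairs, where score is whatever
    quality measure the partitioning method provides (hypothetical).
    """
    lo = approx_time_s - half_width_s
    hi = approx_time_s + half_width_s
    return [c for c in candidates if lo <= c[0] <= hi]


def pick_break(candidates, approx_time_s, half_width_s, prefer="closest"):
    """Choose the segment break near the approximate break time.

    prefer="best" selects the highest-scoring break in the window;
    prefer="closest" selects the break nearest the approximate time.
    """
    window = breaks_in_window(candidates, approx_time_s, half_width_s)
    if not window:
        # Assumed fallback: keep the approximation from the text data.
        return approx_time_s
    if prefer == "best":
        return max(window, key=lambda c: c[1])[0]
    return min(window, key=lambda c: abs(c[0] - approx_time_s))[0]


candidates = [(10.0, 0.2), (58.5, 0.9), (61.0, 0.4), (90.0, 0.8)]
closest = pick_break(candidates, 60.0, 3.0)
best = pick_break(candidates, 60.0, 3.0, prefer="best")
```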
- segment breaks in the other of the audio or video data can be determined using a synchronization of the audio and video data, as discussed above. Pointers to the segment breaks in the text data, audio data and/or video data can be maintained to indicate the beginning and end of each segment, thus enabling random access to segments within a body of information (e.g., news stories within a news program), as discussed in more detail above.
- the identified segments can also be used to enable other features of the invention, as described in more detail below.
- the related secondary information region 204 of the GUI 200 is used to provide the user, from a secondary information source or sources, information that is related to the primary information currently being displayed.
- An important aspect of the invention is the capability to determine relatedness of segments of information represented by two different types of data.
- the invention can enable the determination of relatedness between segments of information represented by audiovisual data (such as is frequently the case for the primary information that can be displayed by the invention) and segments represented by text data (such as is generally the case for the secondary information as described particularly herein).
- This aspect of the invention enables the display of the related secondary information region 204 to be generated. It can also enable categorization of uncategorized segments, as described further below.
- FIG. 4 is a flow chart of a method 400 , in accordance with this aspect of the invention, for determining whether a first set of information represented by a first set of data of a first type (e.g., audiovisual data) is relevant to a second set of information represented by a second set of data of a second type (e.g., text data).
- a set of data of the second type is derived from the first set of data of the first type.
- step 401 causes a set of text data to be produced from a set of audiovisual data.
- the set of text data can be produced in any appropriate manner.
- “production” of the set of text data may be as simple as extracting a pre-existing text transcript (e.g., a closed caption transcript) from the set of audiovisual data.
- the set of text data can be produced from the set of audio data using a conventional speech recognition method.
- the derived set of data (of the second type) is compared to the second set of data of the second type to determine the degree of similarity between the derived set of data and the second set of data.
- a determination is made as to whether the first set of data is relevant to the second set of data, based on the comparison of step 402 .
- a threshold level of similarity (the expression of which depends upon the method used to determine similarity) is specified so that only sets of information that are sufficiently related to each other are identified as related. (This means, when the method 400 is used to generate the related secondary information region 204 , that fewer than the allotted number of secondary information segments, or even no secondary information segments, may be displayed.)
- the degree of similarity can be determined using any appropriate method, such as, for example, relevance feedback.
- In relevance feedback, a text representation of each segment to be compared (e.g., each audiovisual news story or text story) is represented as a vector, each component of the vector corresponding to a word, the value of each component being the number of occurrences of the word in the segment.
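The word-count vector can be sketched in Python as follows. The description above does not say how two such vectors are compared, so the cosine measure used here is an assumption, as are the tokenization rule and the function names.

```python
import math
import re
from collections import Counter


def word_vector(text):
    """Map each word in the text to its number of occurrences."""
    # Lower-casing and this token pattern are illustrative choices.
    return Counter(re.findall(r"[a-z0-9']+", text.lower()))


def cosine_similarity(a, b):
    """Cosine of the angle between two word-count vectors (assumed measure)."""
    dot = sum(count * b[word] for word, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


story = word_vector("The cat saw the dog")
other = word_vector("Markets rallied on Friday")
```

Two segments would then be treated as related when the similarity exceeds the specified threshold.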
- the related secondary information region 204 of the GUI 200 can display a predetermined number of relevant secondary information segments. Generally, it is desirable to display the secondary information segments that are most similar to the primary information segment that is being displayed. While this can be accomplished straightforwardly by displaying those secondary information segments having the highest determined degree of similarity, such an approach may not be desirable in some situations.
- the secondary information source may include segments that are identical or nearly identical (e.g., news stories are often repeated in a variety of text news sources with little or no change), so that display of the secondary information segments having the highest determined degree of similarity can result in undesirable redundancy.
- This problem can be overcome by further determining the degree of similarity between each of a predetermined number of the secondary information segments having the highest determined degree of similarity (in one embodiment of the news browser implementation of the invention, the 10 most similar text stories are compared), and displaying only one of each pair of secondary information segments having a degree of similarity above a specified threshold, i.e., redundant secondary information segments are eliminated.
- a particular segment may have greater than the threshold degree of similarity when compared to each of second and third segments, but the second and third segments may have less than the threshold degree of similarity when compared to each other. From the three segments, it would be desirable to show both the second and third segments.
- If the first segment is compared to the second segment or the third segment, and the second or third segment discarded, before comparison of the first segment to the other of the second or third segment (which will also result in discarding of one of the compared segments), then only one of the three segments will be shown.
- Such a situation could be handled by, for example, calculating the similarity between all pairs of the predetermined number of secondary information segments, and performing comparisons that reveal the situation described above before discarding any of the secondary information segments.
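One way to realize the all-pairs approach is sketched below in Python. The greedy policy (repeatedly discard the segment that is redundant with the most remaining segments, breaking ties by discarding the lower-ranked one) is an assumption of this sketch, not a procedure stated in the description; `similarity` is any caller-supplied function.

```python
from itertools import combinations


def remove_redundant(segments, similarity, threshold):
    """Drop redundant segments after computing all pairwise similarities.

    segments: candidates ordered from most to least similar to the
    primary segment. Returns the kept segments in their original order.
    """
    n = len(segments)
    partners = {i: set() for i in range(n)}
    # Compute every pairwise similarity before discarding anything.
    for i, j in combinations(range(n), 2):
        if similarity(segments[i], segments[j]) > threshold:
            partners[i].add(j)
            partners[j].add(i)

    kept = set(range(n))
    while kept:
        degree = {i: len(partners[i] & kept) for i in kept}
        # Most redundant partners first; ties go to the lower-ranked segment.
        worst = max(kept, key=lambda i: (degree[i], i))
        if degree[worst] == 0:
            break
        kept.remove(worst)
    return [segments[i] for i in sorted(kept)]


# The situation described above: A is redundant with B and with C,
# but B and C are not redundant with each other.
pairs = {frozenset("AB"), frozenset("AC")}
sim = lambda x, y: 1.0 if frozenset((x, y)) in pairs else 0.0
result = remove_redundant(["A", "B", "C", "D"], sim, threshold=0.5)
```

With this policy the first segment is discarded and both the second and third segments survive, which is the desired outcome in the three-segment case.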
- An important aspect of the invention is the capability to categorize uncategorized segments of information based upon the categorization of previously categorized segments of information.
- the degree of similarity between the subject matter content of segments of the primary information (e.g., news stories in audiovisual news programs) and segments of the secondary information (e.g., news stories from text news sources) can also be used to categorize the primary information according to subject matter. This can be useful to enable determination of which primary information segments fall within a particular subject matter category that corresponds to one of the topic buttons 215 (FIG.
- While this aspect of the invention has particular utility in categorizing primary information segments based upon the categorization of preexisting secondary information segments, it can generally enable any categorized segments to be used to categorize uncategorized segments.
- FIG. 5 is a flow chart of a method 500 , in accordance with this aspect of the invention, for categorizing according to subject matter an uncategorized segment of a body of information based on the subject matter categorization of other previously categorized segments of the body of information.
- each story from the Clarinet™ news service is categorized according to the subject matter of the story by associating one or more predefined subject matter categories (e.g., sports, travel, computers, business, international news) with the story.
- This subject matter categorization can be used to categorize news stories from audiovisual news programs based on the similarity between each audiovisual news story and text stories from the Clarinet™ news service.
- categorization of audiovisual news stories is described as an example of how categorizing segments of primary information can be accomplished in accordance with the invention.
- the subject matter category or categories associated with each Clarinet™ text story are acquired as part of the acquisition of the text stories themselves and can, for example, be stored in a relational database in a memory that is part of the system controller 103 (FIG. 1). It may be desirable to associate only one subject matter category with each text story. For example, the most salient subject matter category can be identified in any appropriate manner and used as the sole subject matter category associated with the story. This may be done, for example, to increase the likelihood that the subject matter category eventually associated with each news story accurately describes the subject matter content of that news story.
- In step 501 of the method 500 , a determination is made as to the degree of similarity between the subject matter content of an uncategorized segment and that of previously categorized segments.
- the degree of similarity can be determined using any appropriate method, such as, for example, relevance feedback. When relevance feedback is used, it is necessary to obtain a textual representation of audiovisual data, if appropriate (i.e., if one or both of the segments is represented as audiovisual data) and not already existent.
- In step 502 , previously categorized segments that are relevant to the uncategorized segment are identified. Relevant segments can be identified based upon the degree of similarity in the same manner as that described above with respect to correlation of segments, e.g., segments having greater than a threshold level of similarity can be designated as relevant. Step 501 can also include elimination of redundant segments (in the same manner as described above) from among those that have the required degree of similarity to the uncategorized segment.
- the uncategorized segment is categorized based upon the subject matter categories associated with the relevant previously categorized segments.
- One or more subject matter categories can be associated with the uncategorized segment.
- the subject matter category or categories can be selected from the subject matter categories associated with the relevant previously categorized segments using any desired method.
- the subject matter category or categories of the most similar previously categorized segment could be selected as the subject matter category or categories of the uncategorized segment.
- the most frequently occurring subject matter category or categories associated with a predefined number of the most similar previously categorized segments (or previously categorized segments having greater than a threshold degree of similarity) could be selected as the subject matter category of the uncategorized segment.
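The "most frequently occurring category" option can be sketched in Python as below. The input format (a list of `(similarity, categories)` pairs), the parameter names and defaults, and the alphabetical ordering of tied categories are assumptions made for illustration.

```python
from collections import Counter


def categorize(scored_segments, top_n=5, threshold=0.0):
    """Pick categories for an uncategorized segment by majority vote.

    scored_segments: list of (similarity, categories) pairs, one per
    previously categorized segment, where similarity is the degree of
    similarity to the uncategorized segment.
    Returns the most frequent category or categories (ties are all kept).
    """
    relevant = sorted(
        (s for s in scored_segments if s[0] > threshold),
        key=lambda s: s[0],
        reverse=True,
    )[:top_n]
    votes = Counter(cat for _, cats in relevant for cat in cats)
    if not votes:
        return []
    top = max(votes.values())
    return sorted(cat for cat, n in votes.items() if n == top)


scored = [
    (0.9, ["sports"]),
    (0.8, ["business"]),
    (0.7, ["sports", "travel"]),
    (0.1, ["computers"]),
]
chosen = categorize(scored, top_n=3, threshold=0.2)
```

Setting `top_n=1` reduces this to the simpler option of copying the categories of the single most similar segment.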
- the apparent display rate with which the primary information is displayed by the primary display device 102 can be varied by the user. Variation in the apparent display rate of an audiovisual display can be implemented by appropriately programming a digital computer to accomplish the functions of a method for varying the apparent display rate. Generally, any method for varying the apparent display rate can be used with the invention. As described elsewhere herein, the primary information will often be represented by coextensive sets of data of several types (audio, video and, possibly, text).
- the particular method used to vary the apparent display rate of the primary information will typically depend upon the type of the set of data (e.g., audio, video, text) that is directly modified to produce appropriately modified data for use in generating a display of the primary information at the new apparent display rate.
- the method also preferably synchronizes the sets of data that are not directly modified with the set of data that is.
- the audio data can be modified to cause the apparent display rate of the audio display to be varied (either slowed down or speeded up) from a normal display rate and the video data synchronized with the modified audio data (resulting in a variation of the apparent video display rate that corresponds to the variation in the apparent audio display rate).
- variations in the apparent display rate of an audiovisual display are described in detail in the commonly owned, co-pending U.S. patent application entitled “Variable Rate Video Playback with Synchronized Audio,” by Neal A. Bhadkamkar, Subutai Ahmad and Michelle Covell, attorney docket number I0359-991160, filed on the same day as the present application, the disclosure of which is incorporated by reference herein.
- At least some of the methods described therein have the advantage that the apparent display rate of the audio can be varied while maintaining proper pitch (i.e., the voices don't sound stupefied when the display is slowed down or like chipmunks when the display is speeded up) and, therefore, intelligibility.
- a brief description of a general method described therein is given immediately below, followed by a brief description of one particular method for modifying the audio data.
- a correspondence between an original audio data set and an original video data set is first established. For example, the number of audio samples that have the same duration as a frame of video data can be determined and that number of audio samples defined to be an audio segment.
- Here, “segment” refers to a contiguous portion of a set of audio data that occurs during a specified duration of time; elsewhere herein, “segment” refers to a contiguous related set of information within the primary or secondary information that typically concerns a single theme or subject and that can be delineated in some manner from adjacent information.
- the audio segments can be defined, for example, so that each audio segment corresponds to a single particular video frame.
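The audio-to-video correspondence amounts to simple arithmetic, sketched below in Python. The sample rates and frame rates in the example are assumptions, as is the rounding scheme used when the number of samples per frame is not an integer.

```python
def samples_per_frame(sample_rate_hz, frame_rate_fps):
    """Number of audio samples spanning the duration of one video frame."""
    return sample_rate_hz / frame_rate_fps


def audio_segment_for_frame(frame_index, sample_rate_hz, frame_rate_fps):
    """Return the [start, end) sample range paired with one video frame.

    Boundaries are rounded from the exact position so that consecutive
    audio segments stay contiguous even when samples-per-frame is
    fractional (e.g., 44100 Hz audio with 29.97 fps video).
    """
    spf = samples_per_frame(sample_rate_hz, frame_rate_fps)
    start = round(frame_index * spf)
    end = round((frame_index + 1) * spf)
    return start, end


first = audio_segment_for_frame(0, 48000, 30)
third = audio_segment_for_frame(2, 48000, 30)
```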
- a target display rate (which can be faster or slower than a normal display rate at which an audiovisual display system generates an audiovisual display from the unmodified, original sets of audio and video data) is also determined.
- the target display rate can be a single value which remains unchanged throughout the display or a sequence of values such that the target display rate changes during the display.
- the original audio data set is manipulated, based upon the target display rate and an evaluation of the original audio data set, to produce a modified audio data set.
- the modified audio data set is produced so that, generally, when the modified audio data set is used to generate an audio display, the audio display appears to be speeded up or slowed down by an amount that is approximately equal to the target display rate.
- the correspondence between the modified audio data set and the original audio data set, and the correspondence between the original audio data set and the original video data set are used to create a correspondence between the modified audio data set and the original video data set, which, in turn, is used to delete video data from, or add video data to, as appropriate, the original video data set to create a modified video data set.
- an audiovisual display can be generated from those modified data sets by an audiovisual display system, or the modified audio and video data sets can be stored on a conventional data storage device for use in generating a display at a later time.
- the audio and video data of the modified audio and video data sets are processed at the same rate as before (i.e., when the original audio and video data sets were used to generate a display at the normal display rate) by the audiovisual display system.
- the modified audio and video data sets in the usual case
- the apparent display rate of the audiovisual display generated from the modified audio and video data sets is different than the normal display rate.
- the modified video data set is created based upon the content of the modified audio data set and a correspondence between the modified audio data set and the original video data set, the modified video data set is synchronized (at least approximately and, possibly, exactly) with the modified audio data set and produces a display of the same or approximately the same duration.
- the audio data can be modified in any suitable manner; one way is described following.
- An audio data set is divided into non-overlapping segments of equal length. Generally, the beginning and end of each segment are overlapped with the end and beginning, respectively, of adjacent segments. (Note that the overlap can be negative, such that the length of the adjacent segments is extended.)
- the audio data of corresponding overlapped portions of adjacent segments are blended and replaced by the blended audio data.
- the possible lengths of each overlap are constrained in accordance with a target overlap that corresponds to the specified target display rate. However, within this constraint, the length of each particular overlap is chosen so that the pitch pulses of the overlapped portions closely resemble each other.
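The overlap-and-blend idea can be illustrated with a simplified Python sketch. It is not the method of the referenced application: it handles only speed-up (positive overlap), uses normalized correlation as a stand-in for "pitch pulses closely resemble each other", uses a linear cross-fade for blending, and derives the target overlap as `segment_len * (1 - 1/rate)`. All of those choices, and every name below, are assumptions.

```python
import math


def blend(tail, head):
    """Linear cross-fade between two equal-length lists of samples."""
    n = len(tail)
    return [tail[i] * (1 - (i + 1) / (n + 1)) + head[i] * ((i + 1) / (n + 1))
            for i in range(n)]


def normalized_correlation(a, b):
    """Similarity of two equal-length sample lists, in the range [-1, 1]."""
    dot = sum(x * y for x, y in zip(a, b))
    energy = math.sqrt(sum(x * x for x in a) * sum(y * y for y in b))
    return dot / energy if energy else 0.0


def best_overlap(output, segment, target, slack):
    """Pick the overlap length near the target whose two sides match best."""
    lo = max(1, target - slack)
    hi = min(len(output), len(segment), target + slack)
    best, best_score = None, None
    for k in range(lo, hi + 1):
        score = normalized_correlation(output[-k:], segment[:k])
        if best is None or score > best_score:
            best, best_score = k, score
    return best


def time_compress(samples, segment_len, rate, slack):
    """Speed up audio by overlapping and blending adjacent segments (rate > 1)."""
    target = round(segment_len * (1 - 1 / rate))
    segments = [samples[i:i + segment_len]
                for i in range(0, len(samples), segment_len)]
    output = list(segments[0])
    for seg in segments[1:]:
        k = best_overlap(output, seg, target, slack)
        if k is None:
            output.extend(seg)  # nothing to overlap; append unchanged
            continue
        output[-k:] = blend(output[-k:], seg[:k])
        output.extend(seg[k:])
    return output


tone = [math.sin(2 * math.pi * i / 20) for i in range(400)]
faster = time_compress(tone, segment_len=100, rate=2.0, slack=10)
```

Because each overlap may differ from the target by up to `slack` samples, the output length only approximates `len(samples) / rate`, which mirrors the short-term rate fluctuation discussed in the text.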
- the invention enables the audio data set to be condensed or expanded a desired amount (i.e., the display of an audio data set can be speeded up or slowed down as desired), while minimizing the amount of distortion associated with the modification of the audio data set (i.e., the audio display sounds “normal”).
- the actual apparent display rate can vary from the target display rate. Over relatively long periods of time (e.g., greater than approximately 0.5 seconds), the actual apparent display rate typically closely approximates the target display rate. Over shorter time periods (e.g., approximately 30 milliseconds), the actual apparent display rate can vary more substantially from the target display rate. However, these short term fluctuations are not perceptible to an observer. Thus, this method produces an actual apparent display rate that to an observer appears to faithfully track the target display rate over the entire range of the display.
- the computation required to produce a particular amount of variation in the apparent display rate is done at the time that the determination of a target display rate mandates such variation.
- This has the advantage of reducing the amount of data storage capacity required by a system of the invention.
- This also enables any magnitude of apparent display rate to be specified over a continuous range of allowed display rates, rather than restricting the magnitude of the apparent display rate to one of a set of discrete magnitudes within an allowed range, as would be necessary if all of the computations for each magnitude of apparent display rate were pre-computed. Additionally, this enables the apparent display rate of the display to be varied in real time.
- a system according to the invention can include another information presentation feature that enables the display of a primary segment or segments to be summarized.
- Summarization enables an observer to quickly get an overview of the content of a particular segment or segments of information.
- Summarization can be implemented by appropriately programming a digital computer to accomplish the functions of a summarization method.
- summarization can be accomplished using any appropriate method.
- the particular method used will typically depend upon the type of the set of data (e.g., audio, video, text) that is directly modified to produce appropriately modified data for use in generating a summary display of the primary information.
- the method also preferably synchronizes the sets of data that are not modified directly with the set of data that is.
- text data that is part of, or derived from, audiovisual data that represents a primary segment can be summarized, and the corresponding audio and video data summarized based upon the text summary.
- One method of accomplishing such summarization is described in detail in the commonly owned, co-pending U.S. patent application entitled “Indirect Manipulation Of Data Using Temporally Related Data, With Particular Application To Manipulation Of Audio Or Audiovisual Data,” by Emanuel E. Farber and Subutai Ahmad, attorney docket number I0359-991110, filed on the same day as the present application, the disclosure of which is incorporated by reference herein. A brief description of that method is given immediately below.
- the text data of a set of audiovisual data represents a transcription of the spoken portion of the audio data and is temporally related to each of the audio and video data.
- the text data can be obtained in any appropriate manner, e.g., the text data can be pre-existing text data such as closed-caption data or subtitles, or the text data can be obtained by using any of a number of known speech recognition methods to analyze the audio data to produce the text data.
- the text data is summarized using an appropriate summarization method.
- any text summarization method can be used; a particular example of a text summarization method that can be used with the invention is described in U.S. Pat. No. 5,384,703, issued to Withgott et al. on Jan. 24, 1995.
- the unsummarized text data is aligned with the unsummarized audio data. If the text data has been obtained from the audio data using a speech recognition method, then the alignment of the unsummarized text data with the unsummarized audio data typically exists as a byproduct of the speech recognition method. Otherwise, alignment is accomplished in three steps. First, the unsummarized text data is evaluated to generate a corresponding linguistic transcription network (e.g., a network describing the set of possible phonetic transcriptions). Second, a feature analysis is performed on the audio samples comprising the unsummarized audio data set to create a set of audio feature data.
- the linguistic transcription network is compared to the set of audio feature data (using Hidden Markov Models to describe the linguistic units of the linguistic transcription network in terms of audio features) to determine the linguistic transcription (from all of the possible linguistic transcriptions allowed by the linguistic transcription network) which best fits the set of audio feature data.
- the audio features of the best fit linguistic transcription are correlated with audio features in the set of audio feature data.
- the audio features of the best fit linguistic transcription can also be correlated with the linguistic units of the linguistic transcription network.
- the linguistic units of the linguistic transcription network can, in turn, be correlated with the unsummarized text data. As a consequence of these correlations, an alignment of the unsummarized text data with the unsummarized audio data can be obtained. Using the previously determined text summary and the alignment between the text data and audio data, an audio summary can be produced.
- a video summary can be produced from the audio summary using an alignment between the unsummarized audio data and the unsummarized video data.
- alignment can be pre-existing (because the audio data and video data were recorded together, the alignment being inherent because of the like time stamps associated with each of the audio and video data) or can be calculated easily (the time stamp for an audio sample or video frame can be calculated by multiplying the time duration of each sample or frame by the sequence number of the sample or frame within the audio data or video data).
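The time-stamp calculation, and its use in carrying an audio summary over to video, can be sketched in Python as follows. Zero-based sequence numbers, the half-open interval convention, and the small tolerance used to absorb floating-point error are assumptions of this sketch.

```python
import math


def time_stamp(sequence_number, unit_duration_s):
    """Time stamp of a sample or frame: duration multiplied by sequence number.

    Sequence numbers are assumed to start at 0.
    """
    return sequence_number * unit_duration_s


def frames_for_interval(start_s, end_s, frame_rate_fps):
    """Indices of video frames whose time stamps fall in [start_s, end_s).

    Used here to select the video frames matching one retained interval
    of the audio summary.
    """
    eps = 1e-9  # tolerance for floating-point rounding
    first = math.ceil(start_s * frame_rate_fps - eps)
    last = math.ceil(end_s * frame_rate_fps - eps)
    return range(first, last)


kept_frames = list(frames_for_interval(1.0, 2.0, 30))
```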
- Another method that can be used to summarize the display of a set of audiovisual information includes identifying and eliminating “sound bites” (defined below) in the audio portion of the primary information.
- the sound bites can be identified based upon analysis of a set of text data that corresponds to the spoken portion of the set of audio data.
- the text data can be obtained in any appropriate manner.
- the text data may be closed caption data that is provided with the audio and video data representing the primary information.
- the text data can be obtained from the set of audio data using conventional speech recognition techniques.
- the text data can be “pre-processed” using known methods to classify the words in the text data according to their characteristics, e.g., part of speech.
- a “sound bite” is a related set of contiguous audio information that conforms to one or more predetermined criteria that are intended to identify short spoken phrases that are not spoken by a previously identified primary speaker and that represent information of little interest and/or are redundant.
- the primary information includes the content of audiovisual news programs (e.g., television news programs)
- the predetermined criteria can be established so that spoken portions of the audio information that are likely not to have been spoken by a news anchorperson or a news reporter are identified as sound bites.
- Such criteria might include, for example, rules that tend to identify a spoken portion of the audio as a sound bite if the spoken portion includes slang words or the use of first person pronouns (e.g., I or we), both of which tend not to be present in the speech of an anchorperson or reporter.
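Rules of this kind can be sketched as a simple Python predicate. The slang word list, the length limit, and the tokenization are hypothetical; only the use of the first person pronouns "I" and "we" and of slang as cues comes from the description above.

```python
import re

FIRST_PERSON = {"i", "we"}                       # pronouns named in the text
SLANG = {"gonna", "wanna", "ain't", "yeah"}      # hypothetical example list


def looks_like_sound_bite(phrase, max_words=25):
    """Heuristically flag a short spoken phrase as a likely sound bite.

    A phrase is flagged when it is short (max_words is an assumed limit)
    and contains a first person pronoun or a slang word.
    """
    words = re.findall(r"[a-z']+", phrase.lower())
    if not words or len(words) > max_words:
        return False
    for word in words:
        # Treat contractions such as "I'm" or "we've" as their pronoun.
        if word in SLANG or word.split("'")[0] in FIRST_PERSON:
            return True
    return False


flagged = looks_like_sound_bite("I saw the whole thing happen")
```

A practical system would more likely combine such cues with the part-of-speech pre-processing mentioned above rather than rely on fixed word lists.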
- elimination of such audio portions will typically not significantly adversely affect the presentation of the essential content of a set of audio information, but will enable the set of audio information to be presented more quickly. (It should be noted that the summarization method of Withgott et al. was also found to be incidentally effective at eliminating sound bites.)
- the set of modified audio data must be aligned (synchronized) with the video data (if present) to enable the video data to be modified to produce a speeded-up video display.
- the audio/video alignment can either be pre-existing or calculated easily.
- a summarization method such as one of those described above could be used in combination with a method for increasing the apparent display rate as described above (see section IV.C.1. above on Skimming) to even further condense the display of a set of primary information.
- the set or sets of data representing the primary information could be modified to increase the apparent display rate, then the modified set or sets of data could be summarized to produce a speeded-up summary of the set of primary information.
- the set or sets of data representing the primary information could be summarized, then the summarized set or sets of data modified to increase the apparent display rate, thus producing a speeded-up summary of the set of primary information.
- the methods described above for manipulating audiovisual data to produce a summarized display of the audiovisual data can also be used, with appropriate modification (e.g., instead of producing a summary of the text data, the text data could be manipulated in some other desired fashion), to manipulate the audiovisual data for some other purpose, such as rearranging, editing, selectively accessing or searching the audiovisual data.
- a system according to the invention can include yet another information presentation feature that enables the display of an image to be paused, then, at the end of the pause, resumed at an accelerated rate (i.e., a rate that is faster than a normal display rate) until a time at which the content of the display corresponds to the content that would have been displayed had the image been displayed at the normal display rate without the pause, at which time display of the image at the normal display rate resumes.
- the image display is speeded up so that the display “catches up” to where it would have been without the pause, then slowed back down to the normal display rate.
- the image to be displayed is represented by an ordered set of display data.
- This display data is acquired from a data source at a first rate.
- the display data is transferred to a display device at the first rate as the display data is acquired.
- An image is generated from the display data transferred to the display device and displayed on the display device.
- the user instructs the system to pause the display.
- the system identifies the pause instruction from the user and, in response, stops the transfer of display data to the display device and begins storing the acquired display data at the first rate.
- the user instructs the system to resume the display.
- the system identifies the resume instruction from the user and, in response, begins transferring stored display data to the display device at a second, effective rate that is greater than the first rate.
- An image is generated from the stored display data transferred to the display device and displayed on the display device. While the stored display data is being transferred to the display device, the newly acquired data continues to be stored. The storage of display data finally stops when there is no more stored display data to be transferred to the display device, the amount of stored display data having gradually been reduced by transferral of the stored display data to the display device at the second, effective rate that is greater than the first rate at which the display data is stored. Once the storage of display data stops, the display data is again transferred to the display device at the first rate as the display data is acquired.
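The pause, store, and catch-up behavior can be modeled with a small Python simulation. It is a toy model under stated assumptions: time advances in discrete ticks, one unit of display data is acquired per tick (the first rate), and during catch-up a fixed whole number of stored units is shown per tick. A real system would instead vary the apparent display rate as described earlier; all names here are illustrative.

```python
from collections import deque


def catch_up_time(pause_s, speedup):
    """Time needed to catch up after a pause of pause_s seconds.

    While stored data drains at `speedup` times the acquisition rate,
    the backlog shrinks at (speedup - 1) seconds per second.
    """
    if speedup <= 1:
        raise ValueError("speedup must be greater than 1")
    return pause_s / (speedup - 1.0)


def simulate(total_ticks, pause_at, resume_at, fast_units_per_tick=2):
    """Return (tick, unit) pairs showing which unit is displayed when."""
    buffer = deque()
    shown = []
    for tick in range(total_ticks):
        unit = tick  # one unit is acquired on every tick
        if pause_at <= tick < resume_at:
            buffer.append(unit)            # paused: store, display nothing
        elif buffer:
            buffer.append(unit)            # catching up: keep storing new data
            for _ in range(min(fast_units_per_tick, len(buffer))):
                shown.append((tick, buffer.popleft()))
        else:
            shown.append((tick, unit))     # normal: display as acquired
    return shown


shown = simulate(total_ticks=12, pause_at=2, resume_at=5)
caught_up_tick = next(t for t, u in shown if t >= 5 and t == u)
```

In this run the display is paused for 3 ticks and drains at twice the acquisition rate, so the backlog is cleared 3 ticks after the resume, consistent with `catch_up_time(3, 2)`.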
- This feature of the invention enables a great deal of flexibility in observing a real-time display of audiovisual information.
- the invention enables an observer to pause and resume the display as desired so that, if the observer wants to temporarily stop watching to go to the bathroom or to take a phone call, the observer can pause the display, then, after resuming the display upon return, watch the audiovisual information at an accelerated display rate until the display of the program catches up to where it would have been without the pause.
- the user can attend to other matters while the audiovisual information is being viewed, without sacrificing viewing any of the content of the audiovisual information or enduring the inconvenience of spending additional time to finish watching the audiovisual program.
- This feature of the invention can also be tailored to enable a user who has begun viewing the audiovisual information at a time later than desired to observe the audiovisual information at an accelerated rate until the display catches up to the point at which the display would have been if the audiovisual information had been viewed at a normal display rate beginning at the desired start time.
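The pause-and-catch-up behavior described above can be illustrated with a small simulation. This is a minimal sketch under stated assumptions, not the implementation described in the specification: time is divided into discrete ticks, display data is modeled as integer units, and the function names (`simulate_pause_and_catch_up`, `catch_up_time`) and rate parameters are hypothetical. Acquired data is stored while the display is paused, stored data is transferred at the faster second rate after the resume instruction, and normal transfer resumes once the store is empty.

```python
from collections import deque


def catch_up_time(pause_duration, first_rate, second_rate):
    """Time of accelerated display needed to empty the store.

    During the pause the store grows by first_rate * pause_duration.
    After resuming, it shrinks at (second_rate - first_rate) per unit time,
    because newly acquired data keeps arriving at first_rate.
    """
    if second_rate <= first_rate:
        raise ValueError("second_rate must be greater than first_rate")
    return first_rate * pause_duration / (second_rate - first_rate)


def simulate_pause_and_catch_up(total_ticks, pause_start, pause_end,
                                first_rate=1, second_rate=2):
    """Return, for each tick, the list of data units sent to the display."""
    if second_rate <= first_rate:
        raise ValueError("second_rate must be greater than first_rate")
    store = deque()      # acquired data waiting to be displayed
    displayed = []       # units transferred to the display at each tick
    next_unit = 0
    for tick in range(total_ticks):
        # Data is always acquired from the source at the first rate.
        acquired = list(range(next_unit, next_unit + first_rate))
        next_unit += first_rate
        if pause_start <= tick < pause_end:
            # Paused: stop transfer to the display and store acquired data.
            store.extend(acquired)
            displayed.append([])
        elif store:
            # Catching up: keep storing new data, drain at the second rate.
            store.extend(acquired)
            count = min(second_rate, len(store))
            displayed.append([store.popleft() for _ in range(count)])
        else:
            # Normal operation: transfer data as it is acquired.
            displayed.append(acquired)
    return displayed


if __name__ == "__main__":
    for tick, units in enumerate(simulate_pause_and_catch_up(10, 2, 5)):
        print(tick, units)
```

In this model, no data unit is skipped and the order is preserved; only the rate of transfer changes. How a real display device would present data at the second, effective rate (for example, by dropping frames or time-compressing audio) is outside the scope of the sketch.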
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Multimedia (AREA)
- Databases & Information Systems (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Library & Information Science (AREA)
- Signal Processing (AREA)
- Computational Linguistics (AREA)
- User Interface Of Digital Computer (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention facilitates and enhances review of a body of information (that can be represented by a set of audio data, video data, text data or some combination of the three), enabling the body of information to be quickly reviewed to obtain an overview of the content of the body of information and allowing flexibility in the manner in which the body of information is reviewed. In a particular application of the invention, the content of audiovisual news programs is acquired from a first set of one or more information sources (e.g., television news programs) and text news stories are acquired from a second set of one or more information sources (e.g., online news services or news wire services). In such a particular application, the invention can enable the user to access the news stories of audiovisual news programs in a random manner so that the user can move quickly among news stories or news programs. The invention can also enable the user to quickly locate news stories pertaining to a particular subject. Additionally, when the user is observing a particular news story in a news program, the invention can identify and display related news stories. The invention can also enable the user to control the display of the news programs by, for example, speeding up the display, causing a summary of one or more news stories to be displayed, or pausing the display of the news stories. Additionally, the invention can indicate to the user which news story is currently being viewed, as well as which news stories have previously been viewed.
Description
- 1. Field of the Invention
- This invention relates to systems and methods that enable observation of a body of information and, in particular, a body of information that can be represented, at least in part, by audiovisual data. Most particularly, the invention relates to systems and methods for accessing and reviewing a body of information represented by one or more sets of audiovisual data that can be used to generate an audiovisual display and one or more related sets of text data that can be used to generate a text display.
- 2. Related Art
- The increasing complexity of the modern world, and the concomitant explosion in the amount of information available to describe that world, has placed competing demands on people. There is more subject matter that people find necessary or desirable to master or, at least, be familiar with. At the same time, there is less time to spend delving into any particular subject. Too, there is a much larger universe of information from which the desired information must be extracted. Trying to get just an overview of a large body of information can be overwhelming, and attempting to find specific material within the body of information can be like searching for a needle in a haystack.
- Thus, there is a continuing and growing need for methods and systems for enabling bodies of information to be accessed and reviewed in a useful manner, e.g., a manner that allows the scope and content of available information to be quickly ascertained and that enables quick access to information of particular interest. In particular, there is a need for systems and methods of organizing, categorizing and relating the various segments of a large body of information to facilitate the access and review of the body of information. For example, while some previous systems for enabling observation of a large body of information enable identification of one or more segments of information that are related to a specified segment of information, these systems do not automatically display such related segments of information. Moreover, the previous systems either require that related segments have previously been determined or, at least, that the segments have been categorized according to subject matter content so that whether two segments are related can readily be determined. Further, previous systems have not enabled determination of relatedness between segments of information represented by different types of data, e.g., such systems cannot determine whether a segment represented by audiovisual data is related to a segment represented by text data.
- There is also a need for systems and methods for enabling observation of a body of information that are user-friendly, e.g., that can be used with little training, that are convenient to use, that enable information to be quickly and easily accessed, and that present the information in an accessible format via a high quality display medium. It would also be desirable for such systems and methods to be adapted for use with bodies of information represented by different types of data (i.e., audio data, video data, text data or some combination of the three). It would further be desirable for such systems and methods to be adapted for use with bodies of information represented by data acquired from a wide variety of media (e.g., print media such as newspapers or magazines, television and radio broadcasts, online computer information services and pre-recorded audiovisual programs, to name a few). Previous systems and methods for accessing and reviewing a body of information are deficient in one or more of these respects.
- For example, many previous systems are computer-based. Typically, the display device of these systems (e.g., conventional computer display monitor) does not provide a high quality display of time-varying audiovisual information (such as produced by a television, for example). On the other hand, display devices that do display such information well (e.g., televisions), typically do not provide a high quality display of text information (such as produced by a computer display monitor). A system that can provide a high quality display of both types of information is needed.
- Additionally, previous systems for reviewing a body of information are not as flexible or convenient to use as is desirable. For example, in many such systems (e.g., computers), the mechanism for controlling the operation of the system is physically coupled to the display device of the system. Therefore, the system cannot be operated remotely, thus constraining the user's freedom of movement while operating the system. Additionally, even in those systems where remote operation is possible (e.g., remotely controlled televisions), the remote control device often does not have a user interface that is as readily accessible as desired (as many consumer electronics users can testify, the keypads of many remote control devices are an impenetrable array of cryptic control keys, often requiring non-intuitive key combinations to effect particular control instructions) or the remote control device does not contain a rich set of control features. Moreover, the remote control devices used with previous systems do not have the capability of themselves displaying a part of the body of information.
- Further, previous systems often do not enable real-time acquisition and review of some or all of the body of information. For example, many computer-based systems acquire and store data representing a body of information. The stored data can then be accessed to enable display of segments of the body of information. However, insofar as previous systems for observing a body of information allow real-time acquisition and review of the body of information, these systems generally do not analyze the data to enable the data to be organized, categorized and related so that, for example, segments of the body of information can be related to other segments for which data is acquired in the future or for which data has previously been acquired. Moreover, such systems do not enable the real-time display of some or all of a body of information while also displaying related information in response to the real-time display.
- Thus, there is a need for improved systems and methods for enabling observation of a body of information and, in particular, such systems and methods that address the above-identified inadequacies in previous systems and methods for enabling observation of a body of information.
- The invention enables a body of information to be displayed by electronic devices (e.g., a television, a computer display monitor) in a manner that allows the body of information to be reviewed quickly and in a flexible manner. Typically, the body of information will be represented by a set of audio data, video data, text data or some combination of the three. In a particular embodiment, the invention enables generation of an audiovisual display of one or more segments of information, as well as a display (a text display, an audio display, a video display, or an audiovisual display), for each of the segments, of one or more related segments of information. In a particular application of the invention, referred to herein as a “news browser”, the invention enables acquisition, and subsequent review, of news stories obtained over a specified period of time from a specified group of news sources. For example, as a news browser, the invention can be used to review news stories acquired during one day from several television news programs (e.g., CNN Headline News, NBC Nightly News), as well as from text news sources (e.g., news wire services, traditional print media such as newspapers and magazines, and online news services such as Clarinet™).
- The invention enables some or all of a body of information to be skimmed quickly, enabling a quick overview of the content of the body of information to be obtained. The invention also enables quick identification of information that pertains to a particular subject. The invention further enables quick movement from one segment of a body of information to another, so that observation of particular information of interest can be accomplished quickly. In a news browser according to the invention, for example, each of a set of television news programs can be skimmed to quickly ascertain the subject matter content of the news stories contained therein. Additionally, a particular category (e.g., subject matter category) can be specified and news stories having content that fits within the specified subject matter category can be immediately identified and either displayed or identified as pertinent to the subject matter category and available for display. Further, a user of the news browser can move arbitrarily among news stories within the same or different news programs.
- The invention also enables automatic identification of information that is related to information that is being displayed, so that the related information can be observed, thereby enabling information about a particular subject to be examined in depth. In particular, the invention enables such identification of related segments to be made between segments of different types (e.g., a segment represented by audiovisual data can be compared to a segment represented by text data to enable a determination of whether the segments are related). A portion or a representation of the related information can be displayed in response to (e.g., simultaneous with) the original information display. For instance, in a news browser according to the invention, one or more text news stories (e.g., news stories that are obtained from traditional print media or from electronic publications) that are related (i.e., which cover the same or similar subject matter) to a television news story being displayed can be automatically identified and a portion of the related text news story or stories displayed so that the story or stories can be reviewed for additional information regarding the subject matter of the television news story. Additionally, in a news browser according to the invention, one or more other television news stories that are related to a television news story being displayed can be automatically identified and a single representative video frame displayed for each such news story.
- Additionally, the invention enables automatic categorization of uncategorized segments of the body of information based upon comparison to other segments of the body of information that have been categorized. In particular, the subject matter category of a segment of information can be determined by comparing the segment to one or more previously categorized segments and categorizing the segment in accordance with the subject matter categorization of one or more previously categorized segments that are determined to be relevant to the uncategorized segment. In a news browser according to the invention, for example, this can be used to categorize the news stories of a television news program based upon the categorization of text news stories that are found to be relevant to the television news stories.
- The invention can be implemented in a system that is convenient to use, that presents the body of information in a readily accessible way, and that presents the information via one or more display devices that are tailored for use with the particular type of data that is used to generate the display. For example, a system according to the invention can include a control device that enables remote, untethered control of a primary display device of the system. The remote control device can also be implemented so that some or all of the body of information can also be displayed on the remote control device. The system can include, for example, a television for display of audiovisual information and a computer display monitor for display of text information.
- Additionally, a control device of a system according to the invention can be implemented with a graphical user interface that facilitates user interaction with the system. For example, such an interface can include a region that provides an indication of a user's past progression through, and present location within, the body of information. In a news browser according to the invention, for example, a program map is displayed that facilitates navigation through the news programs that can be selected for display.
- The invention also enables real-time acquisition and review of some or all of the body of information. The invention enables on-the-fly analysis of data as the data is acquired, so that the data can be organized, categorized and related to other data. The invention also enables the real-time display of some or all of a body of information while also displaying related information in response to the real-time display. For example, in a news browser according to the invention, television news programs can be acquired and displayed as they occur. Related news stories, either from previously acquired television news programs or text news sources can be displayed as each television news story is displayed in real time.
- The invention also enables control of the manner in which the information is displayed (e.g., the apparent display rate of the display can be controlled, the display can be paused, a summary of a portion of the body of information can be displayed). For example, in a news browser according to the invention, the user can cause a summary of one or more television news stories to be displayed (rather than the entire news story or stories), the user can speed up (or slow down) the display of a television news story, and the user can pause and resume the display of a television news story such that the display resumes at an accelerated rate until the display of the news story “catches up” to where the display would have been without the pause (a useful feature when the television news story is being acquired and displayed in real time).
- In one aspect of the invention, a system enables acquisition and review of a body of information that includes a multiplicity of segments that each represent a defined set of information (frequently, a contiguous related set of information) in the body of information. The system includes: i) a mechanism for acquiring data representing the body of information; ii) a mechanism for storing the data; iii) a first display mechanism for generating a display of a first segment of the body of information from data that is part of the stored data; iv) a mechanism for comparing the data representing a segment of the body of information to the data representing a different segment of the body of information to determine whether, according to one or more predetermined criteria, the compared segments are related; and v) a second display mechanism for generating a display of a portion of, or a representation of, a second segment of the body of information from data that is part of the stored data. (A method according to the invention, and a computer readable medium encoded with one or more computer programs according to the invention, both enable similar capability.) The second display mechanism displays a portion or representation of the second segment in response to the display by the first display mechanism of a first segment to which the second segment is related. The second display mechanism can display a portion or representation of the second segment substantially coextensive in time with the display of the related first segment by the first display mechanism. The system can further include a mechanism for identifying the subject matter content of a segment of the body of information, so that the mechanism for comparing can determine the similarity of the subject matter content of a segment to the subject matter content of a different segment (using, for example, relevance feedback) and use that result to determine the relatedness of the compared segments. 
The system can also include a mechanism for identifying an instruction from a user to begin displaying at least some of the body of information, the first display mechanism beginning display of a segment in response to the user instruction. When a portion or representation of a second segment is being displayed, the system can enable such a second segment to be selected for display by the first display mechanism. Often, the segments displayed by the first display mechanism are represented by audiovisual data (and, in particular, audiovisual data that can be used to generate an audiovisual display that can vary with time), such as, for example, data produced from television or radio broadcast signals. The segments displayed by the second display mechanism can be represented by audiovisual data (e.g., a single representative video image, or “keyframe”) or by text data (e.g., text excerpts), such as, for example, data from computer-readable data files acquired over a computer network from an information providing site that is part of that network. In particular applications for which use of the invention is contemplated, the first display mechanism can be an analog display device (such as a television) and the second display means can be a digital display device (such as a computer display monitor). The system can advantageously be implemented so that the various devices are interconnected to a conventional computer bus that enables the devices to communicate with each other such that the devices do not require wire communication over network communication lines to communicate with each other (the devices are “untethered”).
- In another aspect of the invention, a system for reviewing a body of audiovisual information that can vary with time (e.g., the content from one or more news broadcasts) includes: i) a mechanism for displaying the audiovisual information; and ii) a mechanism for controlling operation of the system, the mechanism for controlling being physically separate from the mechanism for displaying and including a graphical user interface for enabling specification of control instructions. The mechanism can advantageously be made portable. Further, the system can advantageously include a mechanism for 2-way wireless communication between the mechanism for displaying and the mechanism for controlling. The graphical user interface can include one or more of the following: i) a playback control region for enabling specification of control instructions that control the manner in which the audiovisual information is displayed on the means for displaying; ii) a map region for providing a description of the subject matter content of the audiovisual information and for enabling specification of control instructions that enable navigation within the audiovisual information; iii) a related information region for displaying a portion of, or a representation of, a segment that is related to a segment being displayed by the mechanism for displaying; and iv) a secondary information display region for displaying a secondary information segment that is related to a segment of the audiovisual information that is being displayed by the mechanism for displaying. 
In particular, the playback control region can include one or more of the following: i) an interface that enables selection of one of a plurality of subject matter categories, all of the segments of the audiovisual information corresponding to a particular subject matter category being displayed in response to the selection of that subject matter category; ii) an interface that enables variation of the apparent display rate at which the audiovisual information is displayed; iii) an interface that enables specification of the display of a summary of a segment of the audiovisual information; iv) an interface that enables the display to be paused, then resumed at an accelerated rate that continues until the display of the audiovisual information coincides with the display that would have appeared had the display not been paused; v) an interface that enables termination of the current segment display and beginning of a new segment display; and vi) an interface that enables repetition of the current segment display. The map region can further identify a segment of the audiovisual information that is currently being displayed and/or identify each segment of the audiovisual information that has previously been displayed.
- In still another aspect of the invention, a system enables review of a body of information, the body of information including a first portion that is represented by audiovisual data that can vary with time and a second portion that is represented by text data. The system includes a first display device for displaying the first portion of information and a second display device for displaying the second portion of information. The first display device is particularly adapted for generation of a display from time-varying audiovisual data, while the second display device is particularly adapted for generation of a display from text data. The first display device can be, for example, an analog display device such as a television. The second display device can be, for example, a digital display device such as a computer display monitor. The two devices can interact with each other so that related information can be displayed at the same time on the two devices, in the same manner as that described above.
- In another aspect of the invention, a method categorizes according to subject matter a segment of a body of information (that includes a plurality of segments), the segment not previously having been categorized according to subject matter, based upon the subject matter category or categories associated with one or more previously categorized segments of the body of information. The uncategorized segment can have been acquired from a first data source (that supplies, for example, television or radio broadcast signals) and the previously categorized segment or segments can have been acquired from a second data source (that supplies, for example, computer-readable data files) that is different than the first data source. The method includes the steps of: i) determining the degree of similarity between the subject matter content of the uncategorized segment and the subject matter content of each of the previously categorized segments; ii) identifying one or more of the previously categorized segments as relevant to the uncategorized segment based upon the determined degrees of similarity of subject matter content between the uncategorized segment and the previously categorized segments; and iii) selecting one or more subject matter categories with which to identify the uncategorized segment based upon the subject matter category or categories used to identify the relevant previously categorized segment or segments. (A computer readable medium encoded with one or more computer programs according to the invention enables similar capability.) The step of determining the degree of similarity can be accomplished using a relevance feedback method. 
The step of identifying one or more of the previously categorized segments as relevant to the uncategorized segment can include the steps of: i) identifying a multiplicity of the previously categorized segments that are the most similar to the uncategorized segment; ii) determining the degree of similarity between each of the multiplicity of previously categorized segments and each other of the plurality of previously categorized segments; iii) for each pair of previously categorized segments of the multiplicity of previously categorized segments having greater than a predefined degree of similarity, eliminating one of the pair of previously categorized segments from the multiplicity of previously categorized segments, wherein the previously categorized segment or segments remaining after the step of eliminating are similar and distinct previously categorized segments; and iv) identifying one or more of the similar and distinct previously categorized segments as relevant previously categorized segments.
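The categorization steps above can be sketched in a few lines. This is an illustrative sketch only: it substitutes a plain bag-of-words cosine similarity for the relevance feedback method the text mentions, and the function names, the `top_k` and `duplicate_threshold` parameters, and the majority-vote rule for picking a category are all assumptions that do not come from the specification.

```python
import math
from collections import Counter


def vectorize(text):
    """Bag-of-words term counts (a stand-in for a real text representation)."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two term-count vectors."""
    dot = sum(a[word] * b[word] for word in a if word in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def categorize(uncategorized_text, categorized, top_k=3, duplicate_threshold=0.9):
    """Pick a category for a segment from previously categorized segments.

    categorized is a list of (text, category) pairs.
    """
    query = vectorize(uncategorized_text)
    # Step i: degree of similarity to every previously categorized segment.
    scored = sorted(
        ((cosine(query, vectorize(text)), text, category)
         for text, category in categorized),
        key=lambda item: item[0],
        reverse=True,
    )
    # Keep the most similar segments (ignoring segments with no overlap).
    candidates = [item for item in scored[:top_k] if item[0] > 0]
    # Eliminate one of each pair of near-duplicate segments, leaving
    # "similar and distinct" segments.
    distinct = []
    for score, text, category in candidates:
        vector = vectorize(text)
        if all(cosine(vector, vectorize(kept_text)) <= duplicate_threshold
               for _, kept_text, _ in distinct):
            distinct.append((score, text, category))
    # Step iii: choose a category from the remaining relevant segments
    # (majority vote is an assumption made for this sketch).
    votes = Counter(category for _, _, category in distinct)
    return votes.most_common(1)[0][0] if votes else None


if __name__ == "__main__":
    stories = [
        ("senate passes budget bill after long debate", "politics"),
        ("senate passes budget bill after long debate", "politics"),
        ("home team wins championship game in overtime", "sports"),
    ]
    print(categorize("the senate debate over the budget bill", stories))
```

Removing near-duplicates before voting keeps a single widely republished story from dominating the category choice, which appears to be the purpose of the "similar and distinct" requirement.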
- In another aspect of the invention, a method determines whether a first set of information represented by a set of data of a first type (e.g., text data) is relevant to a second set of information (that is different than the first set of information) represented by a set of data of a second type (e.g., audiovisual data). The method includes the steps of: i) deriving a set of data of the second type from the set of data of the first type, the derived set of data of the second type also being representative of the first set of information; ii) determining the degree of similarity between the set of data of the second type representing the second set of information and the derived set of data of the second type representing the first set of information; and iii) determining whether the first set of information is relevant to the second set of information based upon the degree of similarity between the set of data of the second type representing the second set of information and the derived set of data of the second type representing the first set of information. (A computer readable medium encoded with one or more computer programs according to the invention enables similar capability.) The step of determining the degree of similarity can be accomplished using a relevance feedback method. Still further in accordance with this aspect of the invention, a method can determine which, if any, of a multiplicity of sets of information represented by an associated set of data of a first type (each of the multiplicity of sets of information being different from other of the multiplicity of sets of information) are relevant to the second set of information represented by the set of data of the second type. 
This method includes the steps of, in addition to those discussed above: i) determining the degree of similarity between each set of data of the first type representing one of the multiplicity of sets of information and the derived set of data of the first type representing the second set of information; ii) identifying which, if any, of the sets of data of the first type representing one of the multiplicity of sets of information have greater than a predefined degree of similarity to the derived set of data of the first type representing the second set of information, the sets of data of the first type so identified being termed similar sets of data of the first type; iii) determining the degree of similarity between each similar set of data of the first type and each other similar set of data of the first type; iv) for each pair of similar sets of data of the first type having greater than a predefined degree of similarity, eliminating one of the pair of similar sets of data of the first type from the set of similar sets of data of the first type, wherein the set or sets of similar data of the first type remaining after the step of eliminating are similar and distinct sets of data of the first type; and v) identifying the set or sets of information corresponding to one or more of the similar and distinct sets of data of the first type as relevant to the second set of information.
- In still another aspect of the invention, a method enables the identification of the boundaries of segments in a body of information that is represented by a set of text data and at least one of a set of audio data or a set of video data, each segment representing a contiguous related set of information in the body of information. (A computer readable medium encoded with one or more computer programs according to the invention enables similar capability.) The segment boundaries are identified by first performing a coarse partitioning method to approximately locate the segment boundaries, then performing a fine partitioning method to more precisely locate the segment boundaries. In the coarse partitioning method, time-stamped markers in the set of text data are identified and used to determine approximate segment boundaries within the body of information. For each time of occurrence of an approximate segment boundary in the text data, a range of time is specified that includes the time of occurrence. Subsets of audio data or subsets of video data that occur during the specified ranges of time are extracted from the complete set of audio data or the complete set of video data. The fine partitioning method is then performed to identify one or more breaks in each of the subsets of audio data or each of the subsets of video data. The best break that occurs in each subset of audio data or each subset of video data is selected, and the time of occurrence of the best break in each subset is designated as a boundary of a segment in the body of information. The fine partitioning can be performed using any appropriate method. For example, when segment boundaries are being determined in video data, scene break identification can be used to implement the fine partitioning. When segment boundaries are being determined in audio data, the fine partitioning can be implemented by, for example, pause recognition, voice recognition, word recognition or music recognition. 
Once segment boundaries have been determined in the audio data or the video data, a synchronization of the audio data and the video data can be used to determine the boundaries of the segment in the other of the audio data or video data.
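- The coarse-then-fine partitioning described above can be sketched as follows. The sketch assumes that the fine partitioning method (e.g., scene break or pause detection) has already produced candidate breaks as (time, score) pairs, that the "best" break is the one with the highest score, that the range of time is symmetric about the approximate boundary, and that the approximate boundary is kept when no break falls in the range. None of these details is fixed by the text, and `refine_boundaries` and `half_window` are hypothetical names.

```python
from typing import List, Tuple


def refine_boundaries(
    coarse_times: List[float],
    breaks: List[Tuple[float, float]],
    half_window: float,
) -> List[float]:
    """Snap each approximate boundary to the best break found near it.

    coarse_times: boundary times taken from time-stamped text markers.
    breaks: (time, score) candidates from the fine partitioning method.
    half_window: half-width of the time range around each coarse time.
    """
    refined = []
    for t in coarse_times:
        lo, hi = t - half_window, t + half_window
        # Only breaks inside the specified range of time are considered.
        in_window = [(bt, s) for bt, s in breaks if lo <= bt <= hi]
        if in_window:
            best_time, _ = max(in_window, key=lambda b: b[1])
            refined.append(best_time)
        else:
            # Assumption: fall back to the coarse estimate.
            refined.append(t)
    return refined
```

- Restricting the fine analysis to short ranges around the text markers means that only a small part of the audio or video data needs to be examined.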
- FIG. 1 is a block diagram illustrating a system according to the invention for acquiring and reviewing a body of information.
- FIG. 2A is a diagrammatic representation of a graphical user interface according to the invention that can be used to enable control of the operation of a system according to the invention, display information regarding operation of the system of the invention and display information acquired by the system of the invention.
- FIG. 2B is a view of an illustrative graphical user interface in accordance with the diagrammatic representation of FIG. 2A.
- FIG. 3 is a flow chart of a method in accordance with the invention for identifying the boundaries of segments in a body of information.
- FIG. 4 is a flow chart of a method in accordance with the invention for determining whether a first set of information represented by data of a first type is relevant to a second set of information represented by data of a second type.
- FIG. 5 is a flow chart of a method in accordance with the invention for categorizing according to subject matter an uncategorized segment of a body of information based on the categorization of other previously categorized segments of the body of information.
- I. Overview
- Generally, the invention enables the acquisition of a body of information and review of the content of the body of information. In particular, the invention includes various features that facilitate and enhance review of the body of information. The invention enables the body of information to be quickly reviewed to obtain an overview of the content of the body of information or some portion of the body of information. The invention also allows flexibility in the manner in which the body of information is reviewed. For example, the invention enables a user to move quickly from one segment of a body of information to another, enabling the user to rapidly begin observing particular information of interest. Further, the invention enables a user to quickly locate information within the body of information that pertains to a particular subject in which the user has an interest. The invention also enables a user, when observing particular information, to quickly find and review other information that is related to the information that the user is observing. Additionally, the invention enables the user to control the manner in which the information is displayed (e.g., the apparent display rate of the display can be controlled, the display can be paused, a summary of a portion of the body of information can be displayed). The invention also provides the user with an indication of the user's past progression through, and present location within, the body of information, such indications aiding the user in selecting further segments (described below) of the body of information for review.
- The body of information can be represented by one or more sets of audio data, one or more sets of video data, one or more sets of text data or some combination of the three. Herein, “audio data” refers to data used to generate an audio display, “video data” refers to data used to generate a video display substantially including images other than text images, “text data” refers to data used to generate a video (or audio, though typically video) display of text images, and “audiovisual data” refers to data that includes audio and/or video data, and may include text data. In a particular embodiment, the invention enables the acquisition and review of one or more sets of information represented by audiovisual data, as well as related sets of information represented by text data.
- For example, in a particular application of the invention, the content of one or more audiovisual news programs is acquired from a first set of one or more information sources and news stories (or “articles”) from text news sources are acquired from a second set of one or more information sources. The first set of information sources could be, for example, CNN Headline News or network (e.g., ABC, NBC, CBS) news programs. The second set of information sources could be, for example, on-line news services such as Clarinet™ or news wire services such as AP or UPI. It is contemplated that this application of the invention can be particularly useful as a means of enhancing the viewing of conventional television news programs. For example, in this application, the invention can enable the user to access the news stories of audiovisual news programs in a random manner so that the user can move quickly from one news program to another, or from one news story in a news program to another news story in the same or another news program. The invention can also enable the user to quickly locate news stories pertaining to a particular subject. Additionally, when the user is observing a particular news story in an audiovisual news program, the invention can identify and display a related text news story or stories. The invention can also enable the user to control the display of the audiovisual news programs by, for example, speeding up the display, causing a summary of one or more news stories to be displayed, or pausing the display of the news stories, thereby enabling the user to quickly ascertain the content of one or more news stories or entire news programs. Additionally, the invention can indicate to the user which audiovisual news program is currently being viewed (and, further, which news story within the news program is being viewed), as well as which news stories and/or news programs have previously been viewed.
- II. System Configuration
- FIG. 1 is a block diagram illustrating a system 100 according to the invention for acquiring and reviewing a body of information. A user 109 interacts with a control device 101 to cause information to be displayed on a primary display device 102. The control device 101 includes an appropriate user interface (e.g., a graphical user interface, as discussed in more detail below) that allows the user 109 to specify control instructions for effecting control of the system 100. Communication between the control device 101 and the primary display device 102 is mediated by a system controller 103. The system controller 103 causes primary information to be acquired from a primary information source 107 via a primary information data acquisition device 105. Herein, “primary information” is any information the display of which the user can directly control. The system controller 103 also causes secondary information (which is typically related to the primary information) to be acquired from a secondary information source 108 via a secondary information data acquisition device 106. Herein, “secondary information” is any information other than primary information that is acquired by a system according to the invention and that can be displayed by the system and/or used by the system to manipulate or categorize (as described in more detail below) the primary information. A data storage device 104 stores the acquired primary and secondary information. The primary information is displayed on the primary display device 102. The secondary information can be displayed (e.g., by the control device 101 or by the primary display device 102 in addition to the primary information) or not (i.e., the secondary information may be used only for categorizing and/or manipulation of the primary information). Illustratively, the primary information can be videotape (or other audiovisual data representation) of an audiovisual news program or programs and the secondary information can be the text of news stories from text news sources.
- The
control device 101, the primary display device 102, the system controller 103 and the data storage device 104 can be embodied in one or more devices that can be interconnected to a conventional computer bus that enables the devices to communicate with each other. In particular, the devices devices other devices system 100, the primary and secondary information can be accessed and displayed at a relatively fast speed, thus providing quick response to control instructions from the user and enabling generation of displays with acceptable fidelity. In contrast, a networked system in which the devices must communicate with each other over a network via wire communication lines (in particular, a system in which the control device and display device or devices must communicate over such wire communication lines with the data storage device on which the information is stored) may not produce acceptable performance. In the networked system, the operation of the system is limited by the communications bandwidth and latency of the network communications medium. For example, the bandwidth of the network communications medium may not be adequate to enable transfer of data from the data storage device 104 to the primary display device 102 quickly enough to enable a display with acceptable fidelity to be generated by the primary display device 102. Or, the response to a control instruction from the control device 101 may be undesirably slow because of inadequate speed of the network communications medium.
- The primary information
data acquisition device 105 and secondary information data acquisition device 106 can be implemented by any appropriate such devices. Where the primary information source 107 is comprised of television news broadcasts, for example, the primary information data acquisition device 105 can be a conventional television tuner and video capture device that acquires the data representing the primary information via conventional cable connections, satellite dish or television antenna. Where the secondary information is comprised of online text sources (i.e., text sources available over a computer network such as the Internet), for example, the secondary information data acquisition device 106 can be a conventional modem or other communications adapter, as known by those skilled in the art of data communications, that enables acquisition of data representing the secondary information via one or more conventional communication lines, such as telephone lines, ISDN lines or Ethernet connections. (It is also possible that the primary information can be acquired from online sources, such as via the Internet or other computer network.)
- The primary information
data acquisition device 105 and the secondary information data acquisition device 106 can communicate with the system controller 103 in any appropriate manner. As described below, the system controller 103 can be implemented as part of a digital computer. Where this is the case, the communication between the system controller 103 and the devices devices device device system controller 103 can be implemented using well-known methods and apparatus. For other types of devices, such communication must be implemented in another manner. For example, when the device 105 is a television tuner, communication between the system controller 103 and the device 105 can be implemented using a VISCA (Video System Control Architecture) connection.
- As will be apparent from the description below, the processing of the data representing the primary and secondary information generally requires that the data be in digital form. Text data acquired from online text sources, for example, is acquired in digital form and so can be used directly in such processing. Analog television signals, however, must be digitized before being used in digital processing. This can be accomplished using conventional A/D conversion methods and apparatus. Further, it is desirable to compress the data to increase the amount of data (i.e., primary and secondary information) that can be stored on the
data storage device 104. For example, the television data can be compressed according to the MPEG, JPEG or MJPEG video compression standards, as known by those skilled in the art of audio and video data compression. The text data can also be compressed, using conventional text file compression programs, such as PKZIP, though, typically, such compression provides a relatively small benefit because the amount of text data is small compared to the amount of audio and video data, and the amount of data required to represent the categorization information (described below). Finally, it may be desirable or necessary to transform digital data into an analog waveform again (e.g., convert digital video data into analog video data for display by a television). This can be accomplished using conventional D/A conversion methods and apparatus. - In the embodiment of the invention shown in FIG. 1, the
system 100 according to the invention makes use of two devices for display and control: a primary display device 102 for displaying the primary information and a control device 101 for controlling the operation of the primary display device 102. Preferably, the control device 101 is physically separate from the primary display device 102 and portable so that the user has flexibility in selecting a position relative to the primary display device 102 during use of the system 100. For example, such an embodiment could allow a user to use the invention while sitting in a chair or on a couch, reclining in bed, or sitting at a table or desk. Additionally, when the secondary information is textual (e.g., the text of news stories) and the control device 101 is used to display such secondary information, the portability of the control device 101 attendant such an embodiment increases the likelihood that the text is displayed on a device that can be held in close proximity to the user, thereby improving the ability of the user to view the text. Further, as discussed in greater detail below, the control device 101 preferably has sophisticated user interface capabilities.
- As previously mentioned, a system according to the invention (including the system 100) can be implemented so that the
primary display device 102 displays the primary information while a separate device (e.g., the control device 101) displays the secondary information. Further, as can be appreciated from the description herein, the invention can advantageously be used in situations in which the primary information is audiovisual information (and, in particular, audiovisual information that can vary with time, such as the content of a television program) and the secondary information is text information (some or all of which is, typically, likely to be related to the audiovisual information). In such an implementation of the invention, the use of two different devices for display allows the optimization of the display devices for the particular type of information to be displayed. (A system according to the invention can, in general, have any number of displays, as necessary or advantageous.) Thus, where the primary information is audiovisual information, the primary display device 102 is preferably a device that enables high quality audio and video images (in particular, time-varying audio and video images) to be produced, such as a television. However, while a television is good for displaying audiovisual information, the television doesn't do as good a job with the display of text, particularly at typical viewing distances. A computer display monitor, on the other hand, does a good job of displaying text. Thus, a computer display monitor can be used to display the secondary information. (Herein, a “computer display monitor” can display not only video, but also audio.) In particular, a portable computer (e.g., a notebook or subnotebook computer) can advantageously be used to implement such display. Moreover, the portable computer can also be used to implement the control device 101, thus allowing the display of the secondary information to be integrated with the user interface used to specify instructions for controlling operation of the system 100.
Where a portable computer is used to implement the control device 101, communication between the control device 101 and the rest of the system 100 is advantageously accomplished using a wireless local area network (LAN), infrared link, or other wireless communications system, so that the user will have more freedom of movement when using the control device 101.
- The
system controller 103 can be implemented by any conventional processing device or devices that can accomplish the functions of a system controller as described herein. For example, the system controller 103 can be implemented by a conventional microprocessor chip, as well as peripheral and other computer chips that can be configured to perform the functions of the system controller 103. The data storage device 104 can be implemented by any conventional storage devices. The data storage device 104 can be implemented, for example, by a conventional computer hard disk (to enable storage of digital data, including analog data, e.g., television or radio signals, that has been digitized), a conventional videotape (to enable storage of, for example, analog data corresponding to acquired television signals) or a conventional audiotape (to enable storage of, for example, analog data corresponding to acquired radio signals). In particular, the system controller 103 and data storage device 104 can be implemented, for example, in a conventional digital computer. The devices with which the system controller 103 and data storage device 104 are implemented should have the capability to compress and decompress the audio, video and text data quickly enough to enable real-time display of that data. The system controller 103 can communicate with the control device 101 and the primary display device 102 in any appropriate manner, including wire and wireless communications.
- In a particular embodiment of the invention, the
control device 101 can be embodied by a portable computer (e.g., a Thinkpad™ computer, made by IBM Corp. of Armonk, N.Y.). The portable computer and associated display screen facilitate the presentation of a graphical user interface, as will be apparent from the description below. Preferably, the portable computer has a color display screen. A color display screen further facilitates implementation of a graphical user interface by enabling color differentiation to be used to enhance the features provided in the graphical user interface. The Thinkpad™ can be configured (as known by those skilled in such art) to act as an X/windows terminal (client) that communicates with an X/windows host (server), using standard X/windows protocols (as also known by those skilled in such art), to enable generation and display of the graphical user interface. In this particular embodiment of the invention, the primary display device 102, as well as the system controller (X/windows host) 103, can be embodied, for example, by an Indigo2 workstation computer made by Silicon Graphics Incorporated (SGI) of Mountain View, Calif. The portable computer can communicate with the SGI Indigo2 computer via a wireless Ethernet link.
- Alternatively, both of the
primary display device 102 and control device 101 could be implemented in a digital computer with the system controller 103 and data storage device 104 (although such an implementation may not have some of the advantages of the embodiments of the invention described above). For example, the above-mentioned SGI Indigo2 computer or an IBM-compatible desktop computer could be used to implement a system of the invention in this manner. In particular, implementation of a system according to the invention in this manner could advantageously be accomplished on a portable computer such as a notebook computer.
- III. User Interface
- A. Graphical User Interface
- 1. Overview
- FIG. 2A is a diagrammatic representation of a graphical user interface (GUI) 200 according to the invention that can be used to enable control of the operation of a system according to the invention, display information regarding operation of the system of the invention and display information acquired by the system of the invention. Generally, a GUI according to the invention can be displayed using any suitable display device. Further, when a GUI according to the invention is displayed on a display monitor of a digital computer, the GUI can be implemented by appropriately tailoring conventional computer display software, as known to those skilled in the art in view of the discussion below. For example, the GUI 200 can be displayed on the screen of a portable computer.
- The
GUI 200 includes four regions: primary information playback control region 201, primary information map region 202, related primary information region 203, and related secondary information region 204. It is to be understood that the regions GUI 200 than shown in FIG. 2A. Additionally, it is to be understood that a GUI according to the invention need not include all or any of the regions primary information region 203. The GUI also need not, for example, include a primary information map region 202 or a primary information playback control region 201 having exactly the characteristics described below; other interfaces enabling similar functionality could also be used. The GUI could also be implemented so that user interaction with standard GUI mechanisms such as menus and dialog boxes is necessary to cause display of system controls, system operation information, and/or acquired information. For example, a GUI according to the invention could be implemented such that a display of the related secondary information region 204 is produced only upon appropriate interaction with one or more menus and/or dialog boxes.
- FIG. 2B is a view of an
illustrative GUI 210 in accordance with the diagrammatic representation of FIG. 2A. The GUI 210 is particularly tailored for use with an embodiment of the invention in which the primary information includes videotape of one or more news programs and the secondary information includes the text of news stories from text news sources. Below, the regions of the generic GUI 200 are described generally, while the corresponding regions of the particular GUI 210 are described in detail.
- 2. Control of Primary Information Display
- The primary information playback control region 201 of the GUI 200 is used to control the manner in which the primary information is displayed on the primary display device 102. The region 201 can be used, for example, to provide a mechanism to enable the user to begin, stop or pause display of the primary information, as well as rewind or fast forward the display. The region 201 can also be used, for example, to control the particular primary information that is displayed, as well as the apparent display rate at which the primary information is displayed.
- As seen in FIG. 2B, the primary information
playback control region 211 of the GUI 210 includes topic “buttons” 215, control “buttons” 216 and a speed control 217. It is to be understood that the functionality of the topic buttons 215, control buttons 216 and speed control 217, described below, could be accomplished in a manner other than that shown in FIG. 2B and described below.
- The
topic buttons 215 enable the user to select a subject matter category so that, for example, all news stories in the recorded news programs that pertain to the selected subject matter category are displayed one after the other by the primary display device 102. Alternatively, selection of a topic button 215 could cause a list of news stories pertaining to that subject matter category to appear, from which list the user could select one or more news stories for viewing. (The categorization of the primary information by subject matter category is discussed in more detail below.) The GUI 210 includes six topic buttons 215 to enable selection of news stories related to international news (“World”), national news (“National”), regional news (“Local”), business news (“Business”), sports news (“Sports”), and human interest news (“Living”); however, a GUI according to the invention can include any number of topic buttons and each button can correspond to any desired subject matter category designation.
- The
control buttons 216 enable the user to control which news story is displayed, as well as the manner in which a news story is displayed. Moving from left to right in FIG. 2B, the control buttons 216 respectively cause the display to activate a dialog box that enables the user to perform a keyword search of the text of news stories acquired by the system of the invention, return to the beginning of the currently displayed story to begin displaying the story again, stop the display, start the display, and skip ahead to the next story in a predetermined sequence of stories. A GUI according to the invention can include other control buttons that enable performance of other functions instead of, or in addition to, the functions enabled by the control buttons 216, such as fast forwarding the display, rewinding the display, pausing the display (a particular method according to the invention is described below), and displaying a summarized version of the primary information (a particular method according to the invention is described in more detail below).
- The speed control 217 can be used to increase or decrease the apparent display rate with which the primary information is displayed. The speed control display 217 shows a number that represents the amount by which a normal display rate is multiplied to produce the current apparent display rate, and includes a graphical slider bar that can be used to adjust the apparent display rate. The manner in which the apparent display rate can be changed is described in more detail below.
- 3. Map of Primary Information Display
- The primary information map region 202 of the GUI 200 provides the user with a description of the content of the primary information that is available for display, as well as information that facilitates navigation through the primary information, and can also be used to allow the user to select particular primary information for display. The description of the primary information can include, for example, an illustration or other description of the subdivision of the primary information into smaller portions (e.g., segments) of information. Such illustration or description can convey the number of portions, the length (i.e., time duration) of each portion and the subject matter of each portion. The region 202 can also be used to show the user the location within the primary information of the portion of the primary information that is currently being viewed, as well as which (if any) portions of the primary information have previously been viewed. Additionally, the region 202 can be used to enable the user to move freely among portions of the primary information by, for example, using a conventional mouse to point and click on a portion of the primary information that is illustrated in the region 202.
- As seen in FIG. 2B, the primary
information map region 212 of the GUI 210 includes several subdivided rows, each row representing a particular news program (e.g., CNN Headline News, NBC Nightly News, etc.). Each row is a map that illustrates to some level of detail the content of the corresponding news program. Each of the subdivisions of a row represents a break during the news program, such as a break between news stories. The region between each pair of adjacent subdivisions represents a news story (a region could also represent, for example, an advertisement). The duration of each news story is depicted graphically by the length of the region corresponding to that news story. Each region in a row can be displayed in a particular color, each color representing a particular predetermined subject matter category (i.e., topic), so that the color of each region denotes the subject matter category of the news story corresponding to that region.
- The
map region 212 can be further enhanced in any of a variety of ways. For example, the news program (row) that is currently being viewed can be marked, such as by, for example, shading the row of the currently viewed news program a particular color or causing a particular type of symbol to appear adjacent to the row of the currently viewed news program. Additionally, news stories that have already been viewed can be marked in an appropriate manner, such as by, for example, causing the regions of the viewed news stories to be cross-hatched or to be shaded a particular color. The current viewing location can also be shown: in FIG. 2B, this is shown by a vertical line.
- 4. Related Primary Information
- The related primary information region 203 of the GUI 200 displays “thumbnails” which identify segments of the primary information that are related to the primary information that is currently being displayed. Though the region 203 includes four thumbnails region 203 can be used to display any number of thumbnails. Further, the thumbnails can take any form, such as a display of a portion of the segment or a display of a representation of the segment. For example, the thumbnails primary information region 213 of the GUI 210 includes three single video images that each represent a news story from a news program.) Alternatively, the thumbnails thumbnails
- To enable display of thumbnails, primary information segments that are related to the primary information segment that is being displayed must be determined. A threshold of relatedness (the expression of the threshold depending upon the method used to determine relatedness) is preferably specified so that only segments that are sufficiently related to the displayed segment are displayed in the related
primary information region 203, even if that means that fewer than the allotted number of segments (including no segments) are displayed. If appropriate, redundant segments can be eliminated from the primary information segments to be displayed in the related primary information region 203, using techniques similar to those described below for eliminating redundant segments from a set of segments identified as similar to a designated segment (e.g., eliminating redundant secondary information segments that are similar to a displayed primary information segment).
- Identification of the relatedness of primary information segments can be accomplished by determining the degree of similarity between the primary information segment being displayed and each other primary information segment. The degree of similarity can be determined using any appropriate method, such as, for example, relevance feedback. The use of relevance feedback to determine the similarity between two segments is discussed in more detail below with respect to the determination of the relatedness of primary and secondary information segments (see, in particular, section IV.B.2. below). The use of relevance feedback necessitates that sets of text data that represent the primary information segments be created (by, for example, using a conventional speech recognition method to create a transcript of the spoken portion of the audio data set) if such sets of text data do not already exist (e.g., a closed-caption transcript).
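- The threshold of relatedness described above can be illustrated with a short sketch. It assumes that each candidate segment already has a numeric similarity score with respect to the displayed segment (however that score is computed), that a score at or above the threshold counts as sufficiently related, and that the most similar segments are shown first. The names `select_related`, `threshold` and `max_thumbnails` are hypothetical.

```python
from typing import Dict, List


def select_related(
    scores: Dict[str, float], threshold: float, max_thumbnails: int
) -> List[str]:
    """Return the ids of segments to show as thumbnails, most related first."""
    # Only segments that meet the threshold of relatedness are eligible.
    eligible = [(seg_id, s) for seg_id, s in scores.items() if s >= threshold]
    eligible.sort(key=lambda pair: pair[1], reverse=True)
    # Fewer than max_thumbnails (possibly none) may be returned.
    return [seg_id for seg_id, _ in eligible[:max_thumbnails]]
```

- With this approach, unused thumbnail positions are left empty rather than being filled with weakly related segments.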
- When the thumbnails
- For example, the keyframe can be a video frame that occurs at a specified location within the video data of the segment. In a particular embodiment of the invention in which the primary information comprises television news stories, a video frame that occurs one tenth of the way through the video data representing the news story is selected. One tenth was chosen because it was determined empirically that video frames of particular relevance to the content of a television news story tend to occur at about that point in the television news story.
- Alternatively, the keyframe can be selected based upon an analysis of the content of the video data. One method of accomplishing this is described in detail in the commonly owned, co-pending U.S. patent application entitled “A Method of Compressing a Plurality of Video Images for Efficiently Storing, Displaying and Searching the Plurality of Video Images,” by Subutai Ahmad, Ser. No. 08/528,891, filed on Sep. 15, 1995, the disclosure of which is incorporated by reference herein. In that method, the content of each video frame is represented by a vector. The vector can comprise, for example, the discrete cosine transform (DCT) coefficients for the video frame, as known to those skilled in the art of video image analysis. (The DCT coefficients indicate, for example, how much objects in a video frame have moved since the previous video frame.) From the vectors for all of the video frames of the video data of the segment an average vector is determined. The keyframe is selected as the video frame that is represented by a vector that is closest to the average vector for the video data. This method of selecting a keyframe can be advantageous as compared to the arbitrary selection of a video frame that occurs at a specified location within the video data, since it is likely to result in the selection of a video frame that is more representative of the video content of the segment.
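- The two keyframe selection approaches described above (a frame at a fixed fractional position, and the frame whose vector is closest to the average vector) can be sketched as follows. The sketch assumes that each frame is already represented by a numeric vector (for example, DCT coefficients) and that "closest" means smallest Euclidean distance; the text does not name a distance measure. The function names are hypothetical.

```python
import math
from typing import List, Sequence


def keyframe_by_position(num_frames: int, fraction: float = 0.1) -> int:
    """Index of the frame located `fraction` of the way through the segment."""
    if num_frames <= 0:
        raise ValueError("segment has no frames")
    return min(num_frames - 1, int(num_frames * fraction))


def keyframe_by_average(vectors: Sequence[Sequence[float]]) -> int:
    """Index of the frame whose vector is closest to the average vector."""
    if not vectors:
        raise ValueError("segment has no frames")
    n, dim = len(vectors), len(vectors[0])
    # Average vector over all frames of the segment.
    mean: List[float] = [sum(v[i] for v in vectors) / n for i in range(dim)]

    def distance(v: Sequence[float]) -> float:
        # Assumption: Euclidean distance to the average vector.
        return math.sqrt(sum((a - b) ** 2 for a, b in zip(v, mean)))

    return min(range(n), key=lambda i: distance(vectors[i]))
```

- The position-based choice needs no analysis of the video data, while the average-based choice examines every frame of the segment in order to pick a more representative one.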
- Rather than selecting a single video frame from the video data to be the keyframe, multiple keyframes can be identified from the video data and the keyframes “tiled,” i.e., presented together adjacent to each other. Or, the video data can be analyzed and a composite video frame synthesized from the video data. Any technique for synthesizing a video frame or frames can be used.
- The keyframe may also be a video frame or frames that are not selected from the video data. For example, a representative video image (e.g., one or more video frames) can be selected from a library of video images. For instance, a news story about baseball could be represented by a keyframe showing a batter swinging at a pitch. Such selection can be done manually, i.e., at some point, a person reviews or is made aware of the content of the segment and, based upon that knowledge, associates a video image from the library with the segment. Alternatively, such selection can be accomplished automatically (meaning, here, without human intervention, except to establish the criteria for the selection process) by analyzing the audiovisual data of the segment (e.g., with an appropriately programmed digital computer) to ascertain the content of the segment and, based upon that analysis, associating a video image from the library with the segment. The content of the segment could be determined, for example, using a categorization method as described in more detail below. The segment to be categorized could either be compared to previously categorized segments that can be displayed by the system of the invention, or to a library of “control segments”, each of which contain words germane to a particular subject.
- The GUI 200 can be implemented, using conventional interface methods, so that a user of a system of the invention can select (e.g., by pointing and clicking with a mouse) one of the thumbnails information map region 202 is adjusted accordingly.)
- 5. Related Secondary Information
- The related secondary information region 204 of the GUI 200 provides the user with information from a secondary information source or sources, the secondary information being related to the primary information currently being displayed. Though the region 204 includes two secondary information displays 204a, 204b, generally, the region 204 can include any number of secondary information displays. Further, as with the thumbnails in the related primary information region 203, the secondary information displays 204a, 204b can take any form. For example, the secondary information displays 204a, 204b could be single video images, moving video images or sets of text. (As shown in FIG. 2B, the related secondary information region 214 of the GUI 210 includes three sets of text that each are a story from a text news source.) Other possibilities exist for the secondary information displays 204a, 204b, as known to those skilled in the art. As the segment of primary information being displayed changes, the secondary information displays 204a, 204b typically change as well. As indicated above, segments of secondary information that are related to the primary information that is being displayed can be identified in a manner discussed in more detail below. The system according to the invention can also be implemented so that the user can cause various parts of the secondary information displays 204a, 204b to be displayed, e.g., the user can be enabled to scroll up and down through a set of text or move back and forth through a video clip, using conventional GUI tools such as mouse pointing and clicking.
- B. Other User Interface Techniques
- User interface techniques other than a GUI can be used with the invention. For example, rather than using GUI “buttons” (as illustrated in the primary information playback control region 211 of the GUI 210 of FIG. 2B), the manner in which the primary information is displayed could be controlled using a rotating knob device. Rotation of the knob in one direction could cause the display of the primary information to move forward (play); rotation of the knob in the other direction could cause the display of the primary information to move backward (rewind). Further, the knob could be constructed so that as the knob is rotated the user feels detents at certain points in the rotation. Each detent could correspond to a particular apparent display rate of the display. For example, when the knob is positioned in a home position, the display is stopped. When the knob is rotated clockwise, the display moves forward, the first detent in the clockwise direction causing the display to occur at a normal display rate, the second detent specifying a target apparent display rate of, for example, 1.5 times the normal display rate, the third detent specifying a target apparent display rate of, for example, 2.0 times the normal display rate, and so on. Similarly, when the knob is rotated counterclockwise, the display moves backward (i.e., in a chronological direction opposite that in which the display normally progresses). The first detent corresponds to the normal display rate, the second detent specifies a target display rate of, for example, 1.5 times the normal display rate, and so on. The maximum rotation of the knob in either direction could be limited, the maximum rotation corresponding to a maximum target apparent display rate. The knob could be positioned at any position in between, thus allowing the target apparent display rate to be varied continuously between the maximum forward and backward display rates. The knob could also include a centrally located pushbutton to, for example, enable skipping from the display of one segment of the primary information to a next segment of the primary information. The knob could be constructed so that the position of the knob (or activation of the pushbutton) is transmitted to the remainder of the system using wireless communications, thus providing the user with relatively large freedom of movement during use of the system.
- IV. Processing of Obtained Information
- A. Information Acquisition
- 1. In General
- Returning to FIG. 1, the system controller 103 causes data to be acquired from the primary information source 107 and the secondary information source 108, as described above. The data is acquired using methods and apparatus that are appropriate to the type of data being acquired. For example, the system controller 103 can acquire data representing television broadcasts using conventional equipment for receiving (e.g., a television set and antenna) and recording (e.g., a conventional videocassette recorder) television signals. Or, the system controller 103 can acquire data representing radio broadcasts using conventional equipment for receiving (e.g., a radio and antenna) and recording (e.g., a conventional audiotape recorder) radio signals. Or, the system controller 103 can acquire computer-readable data files (that can include text data, audio data, video data or some combination of two or more of those types of data), using conventional communications hardware and techniques, over a computer network (e.g., a public network such as the Internet or a proprietary network such as America Online™, CompuServe™ or Prodigy™) from an information providing site that is part of that network. In one particular embodiment of the invention, the system controller 103 acquires primary information including the television signals representing the content of designated television news broadcasts, and secondary information including computer-readable data files that represent the content of designated news stories from text news sources.
- The data can be acquired according to a pre-established schedule (that can be stored, for example, by the data storage device 104). Data can be acquired at any desired frequency and the scheduled acquisition times specified in any desired manner (e.g., hourly, daily at a specified time, weekly on a specified day at a specified time, or after the occurrence of a specified event). The schedule can be used, for example, to program a videocassette recorder to record particular television programs at particular times. Likewise, the schedule can be used, for example, to appropriately program a computer to retrieve desired data files from particular network sites (e.g., by specifying an appropriate network address, such as a URL) of a computer network at specified times. In the latter case, if the device with which the system controller 103 is implemented is not operating (e.g., the computer is not turned on) at a time when a scheduled acquisition of data is to take place, the system controller 103 can be implemented so that all such data is immediately retrieved upon beginning operation of the device (e.g., turning the computer on). Further, connection over the network to the site or sites from which data is to be obtained can be accomplished by, for example, inserting a communications daemon into a startup file that is executed at the beginning of operation of the operating system of a computer used to implement the system controller 103. For example, if the computer uses a Windows operating system, the daemon can initiate a WinSock TCP/IP connection to enable connection to be made to the network site.
- The acquired data must be stored. As indicated above, analog data (such as television or radio signals) can be stored on an appropriate medium, such as videotape or audiotape. Additionally, some or all of the data acquired by a system according to the invention is, if not already in that form, converted to digital data. The digital data can be stored on a conventional hard disk having adequate capacity, as described above. To minimize the amount of data storage capacity required, the digital data can be compressed using conventional techniques and equipment. Illustratively, a half hour television news program requires approximately 250 MB of hard disk storage capacity when the video is recorded using Adobe Premiere with Radius Studio compression at 15 fps and “high” quality capture at 240×180 resolution, and the audio is recorded at approximately 22 kHz.
- Appropriate rules can be established to handle situations in which the data storage device 104 (whether single or multiple devices) has insufficient data storage capacity to store new data. For example, the oldest data can be deleted, as necessary, to make room for new data. Illustratively, in the particular embodiment of the invention in which the primary information is the content of designated television news programs and the secondary information is the content of designated text news stories, as new television news programs are recorded, the oldest stored programs can be deleted as necessary to make space to store the new programs, and text stories that are older than a specified length of time (e.g., several days) are automatically deleted.
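- The "delete the oldest data first" rule described above might be sketched as follows. The `StoredItem` record, its fields and the `make_room` function are hypothetical names introduced only for illustration; sizes are assumed to be tracked in megabytes, and raising an error when the new data cannot fit at all is an assumed behavior not stated in the description.

```python
from dataclasses import dataclass


@dataclass
class StoredItem:
    name: str           # hypothetical identifier, e.g. a program title
    size_mb: int        # storage consumed by the item
    acquired_at: float  # acquisition time, e.g. seconds since some epoch


def make_room(items, capacity_mb, incoming_mb):
    """Delete the oldest items until incoming_mb fits within capacity_mb.

    Returns (kept, deleted). Raises ValueError if the incoming data cannot
    fit even after every stored item is deleted (an assumed policy).
    """
    kept = sorted(items, key=lambda it: it.acquired_at)  # oldest first
    deleted = []
    used = sum(it.size_mb for it in kept)
    while kept and used + incoming_mb > capacity_mb:
        oldest = kept.pop(0)
        deleted.append(oldest)
        used -= oldest.size_mb
    if used + incoming_mb > capacity_mb:
        raise ValueError("incoming data exceeds total capacity")
    return kept, deleted
```

With three stored 250 MB programs on a 1000 MB device, a request to store 500 MB of new data deletes only the oldest program.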
- The GUI 200 (FIG. 2A) can also include a mechanism for enabling the user to specify the particular information desired, i.e., specify particular information providers (e.g., news networks, such as CNN, NBC, ABC or CBS, or information services, such as Clarinet™) and data acquisition schedules for both the primary information source 107 and the secondary information source 108. This could be implemented, for example, using a set of nested menus, as known by those skilled in the art.
- 2. Recording/Playback Mediation
- A system according to the invention may be instructed to acquire new information at the same time that the system is instructed to display other information. However, limitations of the devices or configuration of the system of the invention can impede or prevent such simultaneous acquisition and display. For example, the operating speed of a hard disk used to store the data describing the acquired information can limit the capacity of the system for such simultaneous operation: for typical amounts of audiovisual data, current conventional hard disks may not operate at a speed that is adequate to enable the simultaneous storing of data to, and accessing of stored data from, the hard disk.
- Thus, in one embodiment of the invention, when data acquisition is scheduled to begin at a time when the system of the invention is being used for information display, a conventional graphical user interface mechanism (e.g., a dialog box) is used to alert the user of the system to the conflict and offer a choice between continuing with the display (thus delaying or eliminating the data acquisition) or ending the display and allowing the data acquisition to occur.
- In another embodiment of the invention, the user can be alerted of an impending data acquisition at some predetermined time before the data acquisition is scheduled to begin. Similar to the choice described above, the user can be presented with a choice to continue with the display at that time or allow the data acquisition to occur. The system of the invention can default to one or the other modes of operation (i.e., data acquisition or display) if the user does not make a selection.
- Or, the hard disk operating speed limitation described above can be alleviated or overcome by using multiple hard disks so that if data acquisition begins at a time when data is being accessed for use in generating a display, the newly acquired data is stored to a hard disk that does not contain any previously stored data (or that, based upon evaluation of one or more predetermined rules, does not contain data that is expected to be accessed during the time that the new data is being acquired), thus ensuring that data access and data storage will not occur simultaneously for a single hard disk. Alternatively, the hard disk operating speed limitation can be addressed by using only some portion of the available data to generate the information display, thus freeing more time for use in storing data to the hard disk. However, this latter approach may decrease the fidelity of the display unacceptably.
- In a similar approach to the two hard disk approach described above, the data being acquired can be stored on a data storage device of one type, while the data to be used for generating a display is accessed from a data storage device of another type. For example, incoming television signals could be stored on a videocassette tape by a VCR, while digital data from previous television transmissions is retrieved from a hard disk for use in generating a television display of the previously acquired data. The data recorded by the VCR could be digitized at a later time and stored on the hard disk for subsequent use (which use may also occur at a time at which incoming television signals are being acquired by the VCR).
- B. Information Structuring
- Typically, the data representing the primary and secondary information are not provided from the primary and secondary information sources in a form that enables the various aspects of the invention described herein to be realized. Thus, it is necessary or desirable to “structure” the data (i.e., to organize and categorize the data, and relate particular data to other data) in useful ways. Below are described several aspects of such data structuring that can be implemented as part of the invention.
- 1. Partitioning
- The primary and secondary information can be, and typically are, divided (“partitioned”) into smaller related sets of information. Of particular utility for the invention is the identification within the primary and secondary information of contiguous related sets of information that typically concern a single theme or subject and that can be delineated in some manner from adjacent information. Herein, each such contiguous related set of information can be referred to as a “segment” of the primary or secondary information. (Note that, in the description of skimming an audiovisual display in section IV.C.1. below, “segment” is used in a different way; there, “segment” represents a contiguous portion of a set of audio data that occurs during a specified duration of time.) Segments within the primary information are “primary information segments” while segments within the secondary information are “secondary information segments.” For example, if the primary information includes the content of several news programs, the primary information can be divided into particular news programs and each news program can further be broken down into particular news stories within the news program, each news story being denoted as a segment. Similarly, if the secondary information includes content from several text sources, the secondary information can be divided into particular text sources and each text source can be further divided into separate text stories, each text story being denoted as a segment. Note that a “segment” may sometimes, strictly speaking, not be contiguous in time (though it is contiguous in content). 
For example, a news story that is interrupted by a commercial break, then continues after the commercial break, may be defined as a single segment, particularly if the body of information is modified so that commercial breaks—and other extraneous portions of the body of information—are eliminated (an approach that, generally, is preferred, though such portions could also be treated as segments).
- Partitioning the primary and secondary information into segments is useful for a variety of reasons. For example, each segment of the primary information can be identified within the data storage device which stores the data representing the primary information, in a manner known by those skilled in the art (e.g., by maintaining a table of segment identifiers and associated locations of the beginning of the identified segment), thus enabling the primary information segments to be accessed randomly so that the user can change the displayed segment freely among the primary information segments. Such identification of primary information segments also enables the creation of the map region 202 of the GUI 200 (FIG. 2). Further, each segment of the primary information can be correlated, as described in more detail below, with segments of the secondary information, thereby enabling one or more secondary information segments that are sufficiently related to a primary information segment to be displayed at the same time that the primary information segment is displayed. As also described in more detail below, the correlation of primary information segments with secondary information segments can also be used to categorize the primary information segments according to subject matter, thus enabling the user to sort or to cause display of segments of the primary information that pertain to a particular subject matter category (see the discussion of the topic buttons 215 in the playback control region 211 of the GUI 210 shown in FIG. 2A).
- Generally, partitioning of a set of data requires some analysis of the data to identify “breaks” within the data, i.e., differences between adjacent data that are of sufficient magnitude to indicate a significant change in the content of the information represented by the data. A break may signify a demarcation of one segment from another, but need not necessarily do so: a break may also signify, for example, a change in the video image within a segment or a change of speakers within a segment. Methods for enabling identification of breaks that constitute segment demarcation are discussed in more detail below.
- Partitioning of text data is often straightforward. For example, bodies of information that are collections of segments (e.g., stories) from text sources that are represented as computer-readable data typically include markers that identify the breaks between segments. Similarly, text transcripts of bodies of information represented as a set of audiovisual information also frequently include markers that identify breaks between segments of the information. For example, closed caption text data that can accompany the audio and video data of a set of audiovisual data often includes characters that indicate breaks in the text data (most news broadcasts, for example, include closed caption text data containing markers that designate story and paragraph boundaries, the beginning and end of advertisements, and changes in speaker) and, in particular, characters that explicitly designate breaks between segments (e.g., markers that identify story boundaries). Partitioning of such text data, then, requires only the identification of the location (e.g., if the text transcript of a set of audiovisual data is time-stamped, the time of occurrence) of the markers within the text data.
- Where such markers are not present, the text data can be partitioned based upon analysis of the content of the text data. In a set of audiovisual data, breaks between segments can be determined, for example, based upon identification of the occurrence of a particular word, sequence of words, or pattern of words (particularly words that typically indicate a transition), and identification of changes in speaker. As one illustration, in a news program, phrases of the form, “Jane Doe, WXYZ news, reporting live from Anytown, USA,” can indicate a break between segments.
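- As a rough illustration of content-based partitioning of text, the sketch below scans a time-stamped transcript for a sign-off phrase of the form given above. The regular expression `SIGN_OFF`, the function name `find_text_breaks` and the `(time, text)` input layout are all assumptions made for this example; a practical system would presumably use a larger set of patterns and speaker-change cues.

```python
import re

# Hypothetical pattern for sign-offs such as
# "Jane Doe, WXYZ news, reporting live from Anytown, USA".
SIGN_OFF = re.compile(
    r"\b\w+ \w+, \w+ news, reporting live from\b", re.IGNORECASE
)


def find_text_breaks(lines):
    """Return the time-stamps of transcript lines that contain a sign-off.

    lines: list of (time_in_seconds, text) pairs, in chronological order.
    """
    return [t for t, text in lines if SIGN_OFF.search(text)]
```

Each returned time-stamp marks a candidate break between segments at the line where the sign-off occurs.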
- Partitioning of audio and video data typically requires some non-trivial analysis of the data. The partitioning of audio and video data in accordance with the invention can be accomplished in any suitable manner. Some examples of methods that can be used to accomplish partitioning of audio or video data are described below. (These methods are applicable to digital data; thus, if the primary information is initially analog, it must be digitized before partitioning.) Typically, the audio and video data are synchronized as a result of having been recorded together. Thus, partitioning of either the audio or the video data will result in a corresponding partitioning of the other of the audio and video data. However, if the audio and video data are not synchronized, then such synchronization must be accomplished, in addition to partitioning one of the audio or video data, so that the other of the audio and video data can be partitioned in like manner.
- Partitioning of audio data can be accomplished in any of a number of ways. For example, the audio data can be partitioned using a known voice recognition method. A voice recognition method that could be used with the invention is described in “A Gaussian Mixture Modeling Approach to Text-Independent Speaker Identification,” by Douglas Reynolds, PhD thesis, Dept. of Electrical Engineering, Georgia Institute of Technology, 1992, the disclosure of which is incorporated by reference herein. Voice recognition methods can be tailored to, for example, identify a break in the audio data when a particular voice speaks, when a particular sequence of voices speak, or when a more complicated occurrence of voices is identified (e.g., the occurrence of two voices within a specified time of each other, or the occurrence of a voice followed by a silence of specified duration). Illustratively, when the invention is implemented as a news browser, a break between news stories could be identified when a particular newscaster's voice is followed or preceded by a silence of specified duration.
- Or, the audio data can be partitioned using a known word recognition method. For example, a conventional speech recognition method (a large variety of which are known to those skilled in that art) can be used to enable identification of words. The identified words can then be analyzed in the same manner as that described above for analysis of text data, e.g., transition words or speaker changes can be used to indicate breaks. Illustratively, when the invention is implemented as a news browser, a break between news stories could be identified when one of a set of particular word patterns occurs (e.g., “we go now to”, “update from”, “more on that”).
- Audio data can also be partitioned using music recognition, i.e., a break is identified when specified music occurs. A method for partitioning audio data in this way is described in detail in the commonly owned, co-pending U.S. patent application entitled “System and Method for Selective Recording of Information,” by Michelle Covell and Meg Withgott, Ser. No. 08/399,482, filed on Mar. 7, 1995, the disclosure of which is incorporated by reference herein. Partitioning of audio data using music recognition can be particularly useful when transitions between segments of the body of information are sometimes made using standard musical phrases. Illustratively, when the invention is implemented as a news browser, music recognition can be used to partition certain news programs (e.g., The MacNeil/Lehrer news hour) which use one or more standard musical phrases to transition between news stories.
- Another method for partitioning audio data is pause recognition. Pause recognition is based on the assumption that a pause occurs at the time of a significant change in the content of the primary information. For many types of information, such as news programs, this is a workable assumption. A break is identified each time a pause occurs. A pause can be defined as any period of silence having greater than a specified magnitude.
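- Pause recognition of the kind described above can be sketched as a scan for runs of quiet audio samples. The sketch assumes that the "specified magnitude" of a pause refers to its duration, and that silence is judged by comparing the absolute sample value to a level `silence_level`; both are interpretations, and the function and parameter names are hypothetical.

```python
def find_pauses(samples, sample_rate, silence_level, min_pause_s):
    """Return (start_s, end_s) for each quiet run lasting at least min_pause_s.

    samples: sequence of audio sample amplitudes.
    sample_rate: samples per second.
    silence_level: amplitudes at or below this absolute value count as silent.
    """
    pauses = []
    run_start = None  # index at which the current quiet run began, if any
    for i, s in enumerate(samples):
        if abs(s) <= silence_level:
            if run_start is None:
                run_start = i
        elif run_start is not None:
            # A quiet run just ended; keep it only if it is long enough.
            if (i - run_start) / sample_rate >= min_pause_s:
                pauses.append((run_start / sample_rate, i / sample_rate))
            run_start = None
    # Handle a quiet run that extends to the end of the data.
    if run_start is not None:
        if (len(samples) - run_start) / sample_rate >= min_pause_s:
            pauses.append((run_start / sample_rate, len(samples) / sample_rate))
    return pauses
```

Each returned interval is a candidate break; as noted in the description, further analysis is needed to decide which breaks are segment boundaries.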
- Video data can be partitioned, for example, by searching for scene breaks, a method similar to the pause recognition method for partitioning audio data discussed immediately above. One method of accomplishing this is described in detail in the above-mentioned U.S. patent application entitled “A Method of Compressing a Plurality of Video Images for Efficiently Storing, Displaying and Searching the Plurality of Video Images,” by Subutai Ahmad. In that method, the content of each video frame is represented by a vector, as described above. The vector for each video frame is compared to the vector of the immediately previous video frame and the immediately subsequent video frame, i.e., vectors of adjacent video frames are compared. In one approach, a break is identified each time the difference between the vectors of adjacent video frames is greater than a predetermined threshold. In another approach, a predetermined number of partitions is specified and the video frames are partitioned to produce that number of partitions (the partitioning can be accomplished by considering each video frame to be initially partitioned from all other video frames and recursively eliminating the partition between partitioned video frames having the least difference, or considering none of the video frames to be partitioned and recursively establishing partitions between unpartitioned video frames having the greatest difference).
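- The two approaches described above can be sketched as follows, assuming each frame is already represented by a list of numbers and that Euclidean distance measures the difference between adjacent frames (the description does not fix a particular measure). The function names are hypothetical. The fixed-count version picks the largest adjacent differences directly, which corresponds to the variant that starts with no partitions and repeatedly establishes a partition at the greatest remaining difference.

```python
import math


def adjacent_differences(vectors):
    """Distance between each frame vector and the one that follows it."""
    return [math.dist(a, b) for a, b in zip(vectors, vectors[1:])]


def breaks_by_threshold(vectors, threshold):
    """Indices i such that a break falls between frame i-1 and frame i."""
    diffs = adjacent_differences(vectors)
    return [i + 1 for i, d in enumerate(diffs) if d > threshold]


def breaks_by_count(vectors, num_partitions):
    """Place num_partitions - 1 breaks at the largest adjacent differences."""
    if num_partitions < 1:
        raise ValueError("num_partitions must be at least 1")
    diffs = adjacent_differences(vectors)
    ranked = sorted(range(len(diffs)), key=lambda i: diffs[i], reverse=True)
    return sorted(i + 1 for i in ranked[: num_partitions - 1])
```

The threshold version needs a tuned threshold but no prior knowledge of the number of scenes; the fixed-count version needs the opposite.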
- Other approaches to scene break identification could be used, as known by those skilled in the art of processing video images. Some other approaches to scene break identification are discussed in “Automatic Parsing of News Video,” by HongJiang Zhang, Gong Yihong, Stephen W. Smoliar, and Tan Ching Yong, IEEE Conference on Multimedia Computing and Systems, Boston, May 1994, the disclosure of which is incorporated by reference herein. For example, scene breaks could be identified based upon the magnitude of the overall changes in color of the pixels of adjacent video frames (a color change having a magnitude above a specified threshold is identified as a scene break). Or, scene breaks could be identified based upon the magnitude of the compression ratio for a particular set of adjacent video frames (a relatively small amount of compression indicates a relatively large change between video frames and, likely, a change in scenes, i.e., a scene break).
- The above-described methods for partitioning audio or video data directly may not, by themselves, enable identification of segment breaks to be accomplished easily or at all. For example, without augmentation, pause recognition or scene break identification typically are not implemented in a manner that enables distinguishing between segment breaks and other breaks. Voice recognition may not, alone, be a reliable indicator of segment breaks, since switches in speaker often occur for reasons unrelated to a segment break. Word recognition, too, may be erratic in determining segment breaks; it also requires obtaining a text transcript of the audio. Music recognition works well only with a limited number of information sources, i.e., information sources that use well-defined musical transitions.
- It may be possible to include markers (similar to those discussed above with respect to closed caption text data) in either audio or video data that directly identify segment or other breaks within the audio or video data. The invention contemplates use of such markers to segment audio and/or video data.
- If a set of audiovisual data also includes text data (e.g., a closed caption transcript of the spoken audio), it is possible to partition the audiovisual data by partitioning the text data, then using the partitioned text data to partition the audio data and video data in a corresponding manner. Even if the audiovisual data does not initially include text data, the text data can be produced using a speech recognition method. The text data can be partitioned using any appropriate method, as described above.
- Typically, the text data, audio data and video data are each time-stamped. Theoretically, then, once segment breaks are determined in the text data, the time-stamps of the beginning and end of each segment within the text data could be used directly to identify segment breaks within the audio data and/or video data. However, in practice, the text data is typically not exactly synchronized with the audio data and video data (e.g., the text data of a particular segment may begin or end several seconds after the corresponding audio or video data), making such a straightforward approach infeasible. Nevertheless, the time-stamps of the segment breaks in the text data can be used to enable synchronization of those segment breaks with the corresponding segment breaks in the audio and video data. Such synchronization can be accomplished using any appropriate technique. Some possible approaches are described below.
- One way to partition the audio and video data based upon the partition of the text data is to use a synchronization of the complete set of audio data with the complete set of text data, and a synchronization of the complete set of audio data with the complete set of video data to identify the partitions in the audio and video data. The latter synchronization typically exists as a consequence of the manner in which the audio and video data is obtained. However, synchronization between the text data and the audio data frequently does not already exist, and, if it does not, obtaining such synchronization can be computationally expensive. Further, it is not necessary to synchronize all of the text data with the audio and video data, but, rather, only the locations of the segment breaks.
- A simpler approach is to determine the segment breaks in the audio and video data from the segment breaks in the text data based upon a rule or rules that exploit one or more characteristics of the body of information. Such a rule might be based on an observation that segment breaks in the audio and/or video data of a set of audiovisual data bear a relatively fixed relationship to the corresponding segment breaks in the corresponding text data. For example, it was observed that the video data of a news story from an audiovisual news program frequently begins about 5 to 10 seconds before the closed caption text data of the news story. Thus, in one embodiment of a news browser implementation of the invention, the beginning of the video data of a news story is assumed to be 4 seconds prior to the closed-caption text data. This enables most of the relevant video data to be captured, while reducing the possibility of capturing extraneous video. This approach was found to be accurate within 2 seconds for CNN Headline News and the news programs of the NBC, ABC and CBS television broadcasting networks.
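- The fixed-offset rule described above reduces to a small calculation. In the sketch below, the function name and the clamping of results at zero (so that a story near the start of a recording does not get a negative start time) are assumptions; the 4 second default is the value given for the news browser embodiment.

```python
def estimate_video_starts(caption_starts_s, lead_s=4.0):
    """Estimate where each story's video begins from its caption start time.

    caption_starts_s: closed caption start times of the stories, in seconds.
    lead_s: assumed lead of the video over the caption text, in seconds.
    """
    return [max(0.0, t - lead_s) for t in caption_starts_s]
```

A different body of information would presumably call for a different value of `lead_s`, determined by observation.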
- In some cases, the approach may still not produce as good a result as desired, i.e., the segmentation of the audio and video data is not as crisp as desired, either deleting part of the beginning or end of the audio or video segment, or including extraneous audio or video as part of the segment. Thus, according to another particular embodiment of the invention, partitioning of audiovisual data that includes text data in which segment breaks are explicitly designated by markers within the text data can be accomplished in two steps: a first, coarse partitioning followed by a second, fine partitioning. FIG. 3 is a flow chart of a method 300, in accordance with this aspect of the invention, for identifying the boundaries of segments in a body of information. In the coarse partitioning step 301 of the method 300, the time-stamps associated with the segment breaks in the text data can be used to approximate the location of the corresponding segment breaks in the audio and video data, as described above. In step 302, a window of data (e.g., audio or video data in the context of the current discussion) that includes the approximate segment boundary is specified. This can be accomplished, for example, by specifying a time range that includes the time associated with the segment break in the text data (e.g., the time of occurrence of the segment break in the text data plus or minus several seconds) and identifying audio and/or video data that falls within that time range from the time-stamps associated with the audio and/or video data. The fine partitioning step 303 can then be used to identify breaks within the audio and/or video data. The fine partitioning can be accomplished using any appropriate method, such as one of the above-discussed methods (i.e., scene break identification, pause recognition, voice recognition, word recognition, or music recognition) to identify breaks in audio and video data. The fine partitioning can be performed on the entire set of audio data or video data, or only on the audio or video data that occurs within the time range. In step 304, the data within the time range can then be examined to identify the location of a break or breaks within the time range. If more than one break is identified, the “best” break, measured according to the criteria of the partitioning method used, can be identified as the segment break, or the break occurring closest in time to the approximate segment break can be identified as the segment break.
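- The windowing and selection of steps 302 and 304 might look roughly like the sketch below, which uses the "closest in time" selection rule. It assumes the fine partitioning (e.g., scene break identification) has already produced a list of candidate break times. The 5 second half-window and the fallback to the coarse estimate when no candidate falls in the window are assumptions, not details taken from the method 300.

```python
from typing import Sequence


def refine_break(approx_s: float,
                 candidates_s: Sequence[float],
                 half_window_s: float = 5.0) -> float:
    """Pick the candidate break closest in time to the coarse estimate.

    approx_s      -- coarse segment break derived from the text time-stamps
    candidates_s  -- break times found by some fine partitioning method
    half_window_s -- assumed half-width of the time range around approx_s
    """
    in_window = [c for c in candidates_s if abs(c - approx_s) <= half_window_s]
    if not in_window:
        # Assumed fallback: keep the coarse estimate when nothing is found.
        return approx_s
    return min(in_window, key=lambda c: abs(c - approx_s))


# Hypothetical scene-break times (seconds) and a coarse estimate of 61.0 s.
scene_breaks = [12.0, 58.2, 63.9, 71.0, 140.0]
refined = refine_break(61.0, scene_breaks)
```

Selecting the "best" break according to the criteria of the partitioning method would instead require a per-candidate score, which this sketch does not model.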
- Once the segment breaks in the audio or video data are identified, segment breaks in the other of the audio or video data can be determined using a synchronization of the audio and video data, as discussed above. Pointers to the segment breaks in the text data, audio data and/or video data can be maintained to indicate the beginning and end of each segment, thus enabling random access to segments within a body of information (e.g., news stories within a news program), as discussed in more detail above. The identified segments can also be used to enable other features of the invention, as described in more detail below.
- 2. Correlation
- As mentioned above, the related secondary information region 204 of the GUI 200 is used to provide the user, from a secondary information source or sources, information that is related to the primary information currently being displayed. Thus, it is necessary to determine which of the segments of the secondary information are sufficiently related to the primary information segment displayed on the primary display device 102 to be displayed in the related secondary information region 204. This can be accomplished by determining the degree of similarity between each segment of the primary information (e.g., news story from an audiovisual news program) and each segment of the secondary information (e.g., text story from a text news source), and displaying in the related secondary information region 204 of the GUI 200 certain secondary information segments that are most similar to the primary information segment that is being displayed by the primary display device 102.
- An important aspect of the invention is the capability to determine relatedness of segments of information represented by two different types of data. In particular, the invention can enable the determination of relatedness between segments of information represented by audiovisual data (such as is frequently the case for the primary information that can be displayed by the invention) and segments represented by text data (such as is generally the case for the secondary information as described particularly herein). This aspect of the invention enables the display of the related secondary information region 204 to be generated. It can also enable categorization of uncategorized segments, as described further below.
- FIG. 4 is a flow chart of a method 400, in accordance with this aspect of the invention, for determining whether a first set of information represented by a first set of data of a first type (e.g., audiovisual data) is relevant to a second set of information represented by a second set of data of a second type (e.g., text data). In step 401, a set of data of the second type is derived from the first set of data of the first type. In a typical application of the method 400, step 401 causes a set of text data to be produced from a set of audiovisual data. The set of text data can be produced in any appropriate manner. For example, “production” of the set of text data may be as simple as extracting a pre-existing text transcript (e.g., a closed caption transcript) from the set of audiovisual data. Or, the set of text data can be produced from the set of audio data using a conventional speech recognition method. In step 402, the derived set of data (of the second type) is compared to the second set of data of the second type to determine the degree of similarity between the derived set of data and the second set of data. One way of making this determination is described in more detail below. In step 403, a determination is made as to whether the first set of data is relevant to the second set of data, based on the comparison of step 402. Typically, a threshold level of similarity (the expression of which depends upon the method used to determine similarity) is specified so that only sets of information that are sufficiently related to each other are identified as related. (This means, when the method 400 is used to generate the related secondary information region 204, that less than the allotted number of secondary information segments, or even no secondary information segments, may be displayed.)
- The degree of similarity can be determined using any appropriate method, such as, for example, relevance feedback. In relevance feedback, a text representation of each segment to be compared (e.g., each audiovisual news story or text story) is represented as a vector, each component of the vector corresponding to a word, the value of each component being the number of occurrences of the word in the segment. (Two words are considered identical, i.e., are amalgamated for purposes of ascribing a magnitude to each component of the vector representing the textual content of a segment, if the words have the same stem; for example, “play”, “played” and “player” are all considered to be the same word for purposes of forming the segment vector.) For each pair of segments, the normalized dot product of the vectors corresponding to the segments is calculated, yielding a number between 0 and 1. The degree of similarity between two segments is represented by the magnitude of the normalized dot product, 1 representing two segments with identical words and 0 representing two segments having no matching words. The use of relevance feedback to determine the similarity between two text segments is well-known, and is described in more detail in, for example, the textbook entitled Introduction to Modern Information Retrieval, by Gerard Salton, McGraw-Hill, New York, 1983, the pertinent disclosure of which is incorporated by reference herein. Relevance feedback is also described in detail in “Improving Retrieval Performance by Relevance Feedback,” Salton, G., Journal of the American Society for Information Science, vol. 41, no. 4, pp. 288-297, June 1990, as well as “The Effect of Adding Relevance Information in a Relevance Feedback Environment,” Buckley, C. et al., Proceedings of 17th International Conference on Research and Development in Information Retrieval, SIGIR 94, Springer-Verlag (Germany), 1994, pp. 292-300, the disclosures of which are incorporated by reference herein.
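- The word-count vectors and normalized dot product described above can be sketched as follows. The suffix-stripping function is only a crude stand-in for a real stemmer, and all function names are hypothetical; the sketch is meant to show the shape of the computation, not the implementation used in the news browser.

```python
import math
import re
from collections import Counter


def crude_stem(word: str) -> str:
    """Rough stand-in for a stemmer: strip a few common English suffixes."""
    for suffix in ("ing", "ed", "er", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def segment_vector(text: str) -> Counter:
    """Map each word stem in the segment to its number of occurrences."""
    words = re.findall(r"[a-z]+", text.lower())
    return Counter(crude_stem(w) for w in words)


def similarity(text_a: str, text_b: str) -> float:
    """Normalized dot product of the two word-count vectors (0 to 1)."""
    va, vb = segment_vector(text_a), segment_vector(text_b)
    dot = sum(count * vb[stem] for stem, count in va.items())
    norm = (math.sqrt(sum(c * c for c in va.values()))
            * math.sqrt(sum(c * c for c in vb.values())))
    return dot / norm if norm else 0.0
```

Because the counts are never negative, the result always falls between 0 and 1, matching the range stated above. A relevance threshold as in step 403 would then be applied to the returned value.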
- The related secondary information region 204 of the GUI 200 can display a predetermined number of relevant secondary information segments. Generally, it is desirable to display the secondary information segments that are most similar to the primary information segment that is being displayed. While this can be accomplished straightforwardly by displaying those secondary information segments having the highest determined degree of similarity, such an approach may not be desirable in some situations. For example, the secondary information source may include segments that are identical or nearly identical (e.g., news stories are often repeated in a variety of text news sources with little or no change), so that display of the secondary information segments having the highest determined degree of similarity can result in undesirable redundancy.
- This problem can be overcome by further determining the degree of similarity between each of a predetermined number of the secondary information segments having the highest determined degree of similarity (in one embodiment of the news browser implementation of the invention, the 10 most similar text stories are compared), and displaying only one of each pair of secondary information segments having a degree of similarity above a specified threshold, i.e., redundant secondary information segments are eliminated. Again, this can be more problematic than it first appears. For example, a particular segment may have greater than the threshold degree of similarity when compared to each of second and third segments, but the second and third segments may have less than the threshold degree of similarity when compared to each other. From the three segments, it would be desirable to show both the second and third segments. However, if the first segment is compared to the second segment or the third segment, and the second or third segment discarded, before comparison of the first segment to the other of the second or third segment (which will also result in discarding of one of the compared segments), then only one of the three segments will be shown. Such a situation could be handled by, for example, calculating the similarity between all pairs of the predetermined number of secondary information segments, and performing comparisons that reveal the situation described above before discarding any of the secondary information segments.
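- One possible way to handle the three-segment situation described above is sketched below. All pairwise similarities are computed first; then the segment involved in the most redundant pairs is discarded repeatedly until no redundant pair remains. This greedy rule, the tie-break in favor of keeping higher-ranked segments, and the names used are assumptions; the text only requires that all pairs be examined before anything is discarded.

```python
from itertools import combinations
from typing import Callable, List, Sequence


def drop_redundant(segments: Sequence[str],
                   pair_similarity: Callable[[str, str], float],
                   threshold: float) -> List[str]:
    """Return the segments to display, most similar to the primary first.

    segments        -- segment identifiers, ordered by similarity to the
                       primary segment (best first)
    pair_similarity -- similarity between two secondary segments
    """
    # Compute every pairwise comparison before discarding anything.
    redundant = {frozenset((a, b))
                 for a, b in combinations(segments, 2)
                 if pair_similarity(a, b) > threshold}
    kept = list(segments)
    while redundant:
        degree = {s: sum(1 for pair in redundant if s in pair) for s in kept}
        # Ties are broken toward the lowest-ranked segment (assumed rule).
        worst = max(reversed(kept), key=lambda s: degree[s])
        kept.remove(worst)
        redundant = {pair for pair in redundant if worst not in pair}
    return kept


# Hypothetical similarities: s1 resembles both s2 and s3, which differ.
SIMS = {frozenset(("s1", "s2")): 0.90,
        frozenset(("s1", "s3")): 0.85,
        frozenset(("s2", "s3")): 0.40}
shown = drop_redundant(["s1", "s2", "s3"],
                       lambda a, b: SIMS[frozenset((a, b))], 0.8)
```

In this example the first segment is dropped and the second and third are both shown, which is the outcome the paragraph above describes as desirable.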
- 3. Categorizing
- An important aspect of the invention is the capability to categorize uncategorized segments of information based upon the categorization of previously categorized segments of information. In particular, if the segments of the secondary information have been categorized according to subject matter, then the degree of similarity between the subject matter content of segments of the primary information (e.g., news stories in audiovisual news programs) and segments of the secondary information (e.g., news stories from text news sources) can also be used to categorize the primary information according to subject matter. This can be useful to enable determination of which primary information segments fall within a particular subject matter category that corresponds to one of the topic buttons 215 (FIG. 2) that a user can select to cause all primary information segments that pertain to the selected subject matter category to be displayed one after the other by the primary display device 102 (FIG. 1). Though this aspect of the invention has particular utility in categorizing primary information segments based upon the categorization of preexisting secondary information segments, it can generally enable any categorized segments to be used to categorize uncategorized segments.
- FIG. 5 is a flow chart of a method 500, in accordance with this aspect of the invention, for categorizing according to subject matter an uncategorized segment of a body of information based on the subject matter categorization of other previously categorized segments of the body of information. For example, each story from the Clarinet™ news service is categorized according to the subject matter of the story by associating one or more predefined subject matter categories (e.g., sports, travel, computers, business, international news) with the story. This subject matter categorization can be used to categorize news stories from audiovisual news programs based on the similarity between each audiovisual news story and text stories from the Clarinet™ news service. Below, such categorization of audiovisual news stories is described as an example of how categorizing segments of primary information can be accomplished in accordance with the invention.
- The subject matter category or categories associated with each Clarinet™ text story are acquired as part of the acquisition of the text stories themselves and can, for example, be stored in a relational database in a memory that is part of the system controller 103 (FIG. 1). It may be desirable to associate only one subject matter category with each text story. For example, the most salient subject matter category can be identified in any appropriate manner and used as the sole subject matter category associated with the story. This may be done, for example, to increase the likelihood that the subject matter category eventually associated with each news story accurately describes the subject matter content of that news story.
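- As a rough illustration of similarity-based categorization, the sketch below assigns an uncategorized story the category that occurs most often among its most similar categorized text stories, assuming one category per text story. The threshold, the number of stories considered, and the function name are assumptions chosen for the example.

```python
from collections import Counter
from typing import Optional, Sequence, Tuple


def categorize(scored: Sequence[Tuple[float, str]],
               top_k: int = 5,
               threshold: float = 0.2) -> Optional[str]:
    """Choose a subject matter category for an uncategorized segment.

    scored -- (similarity to the uncategorized segment, category) for each
              previously categorized segment
    Returns None when no categorized segment is similar enough.
    """
    relevant = sorted((s for s in scored if s[0] > threshold),
                      key=lambda s: s[0], reverse=True)[:top_k]
    if not relevant:
        return None
    votes = Counter(category for _, category in relevant)
    return votes.most_common(1)[0][0]


# Hypothetical similarities between one audiovisual story and text stories.
scores = [(0.62, "sports"), (0.55, "business"),
          (0.48, "sports"), (0.05, "travel")]
category = categorize(scores)
```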
- In step 501 of the method 500, a determination is made as to the degree of similarity between the subject matter content of an uncategorized segment and that of previously categorized segments. The degree of similarity can be determined using any appropriate method, such as, for example, relevance feedback. When relevance feedback is used, it is necessary to obtain a textual representation of audiovisual data, if appropriate (i.e., if one or both of the segments is represented as audiovisual data) and not already existent.
- In step 502, previously categorized segments that are relevant to the uncategorized segment are identified. Relevant segments can be identified based upon the degree of similarity in the same manner as that described above with respect to correlation of segments, e.g., segments having greater than a threshold level of similarity can be designated as relevant. Step 502 can also include elimination of redundant segments (in the same manner as described above) from among those that have the required degree of similarity to the uncategorized segment.
- In step 503, the uncategorized segment is categorized based upon the subject matter categories associated with the relevant previously categorized segments. One or more subject matter categories can be associated with the uncategorized segment. Generally, the subject matter category or categories can be selected from the subject matter categories associated with the relevant previously categorized segments using any desired method. For example, the subject matter category or categories of the most similar previously categorized segment could be selected as the subject matter category or categories of the uncategorized segment. Or, the most frequently occurring subject matter category or categories associated with a predefined number of the most similar previously categorized segments (or previously categorized segments having greater than a threshold degree of similarity) could be selected as the subject matter category of the uncategorized segment. In the latter case, it may be particularly desirable, as described above, to determine the similarity between the relevant previously categorized segments, so that only one of a set of previously categorized segments that are substantially identical to each other influences the categorization of the uncategorized segment.
- C. Information Presentation
- Above, the acquisition of information and the structuring of acquired information have been described. The information must, of course, also be displayed to a user. The information display has been described generally above with respect to FIGS. 2A and 2B. However, a system according to the invention can also include one or more of a variety of additional features that enhance the information display.
- 1. Skimming
- As indicated above with respect to FIGS. 2A and 2B, the apparent display rate with which the primary information is displayed by the primary display device 102 can be varied by the user. Variation in the apparent display rate of an audiovisual display can be implemented by appropriately programming a digital computer to accomplish the functions of a method for varying the apparent display rate. Generally, any method for varying the apparent display rate can be used with the invention. As described elsewhere herein, the primary information will often be represented by coextensive sets of data of several types (audio, video and, possibly, text). The particular method used to vary the apparent display rate of the primary information will typically depend upon the type of the set of data (e.g., audio, video, text) that is directly modified to produce appropriately modified data for use in generating a display of the primary information at the new apparent display rate. The method also preferably synchronizes the sets of data that are not directly modified with the set of data that is.
- For example, the audio data can be modified to cause the apparent display rate of the audio display to be varied (either slowed down or speeded up) from a normal display rate and the video data synchronized with the modified audio data (resulting in a variation of the apparent video display rate that corresponds to the variation in the apparent audio display rate). Several methods of accomplishing such variation in the apparent display rate of an audiovisual display are described in detail in the commonly owned, co-pending U.S. patent application entitled “Variable Rate Video Playback with Synchronized Audio,” by Neal A. Bhadkamkar, Subutai Ahmad and Michelle Covell, attorney docket number I0359-991160, filed on the same day as the present application, the disclosure of which is incorporated by reference herein. At least some of the methods described therein have the advantage that the apparent display rate of the audio can be varied while maintaining proper pitch (i.e., the voices don't sound stupefied when the display is slowed down or like chipmunks when the display is speeded up) and, therefore, intelligibility. A brief description of a general method described therein is given immediately below, followed by a brief description of one particular method for modifying the audio data.
- Generally, in the methods described in the above-mentioned patent application, a correspondence between an original audio data set and an original video data set is first established. For example, the number of audio samples that have the same duration as a frame of video data can be determined and that number of audio samples defined to be an audio segment. (Note that, as mentioned above, as used here in the description of skimming, “segment” refers to a contiguous portion of a set of audio data that occurs during a specified duration of time; elsewhere herein, “segment” refers to a contiguous related set of information within the primary or secondary information that typically concerns a single theme or subject and that can be delineated in some manner from adjacent information.) The audio segments can be defined, for example, so that each audio segment corresponds to a single particular video frame. A target display rate (which can be faster or slower than a normal display rate at which an audiovisual display system generates an audiovisual display from the unmodified, original sets of audio and video data) is also determined. The target display rate can be a single value which remains unchanged throughout the display or a sequence of values such that the target display rate changes during the display. The original audio data set is manipulated, based upon the target display rate and an evaluation of the original audio data set, to produce a modified audio data set. As described below, the modified audio data set is produced so that, generally, when the modified audio data set is used to generate an audio display, the audio display appears to be speeded up or slowed down by an amount that is approximately equal to the target display rate. 
The correspondence between the modified audio data set and the original audio data set, and the correspondence between the original audio data set and the original video data set, are used to create a correspondence between the modified audio data set and the original video data set, which, in turn, is used to delete video data from, or add video data to, as appropriate, the original video data set to create a modified video data set. Once the modified audio and video data sets have been created, an audiovisual display can be generated from those modified data sets by an audiovisual display system, or the modified audio and video data sets can be stored on a conventional data storage device for use in generating a display at a later time. The audio and video data of the modified audio and video data sets are processed at the same rate as before (i.e., when the original audio and video data sets were used to generate a display at the normal display rate) by the audiovisual display system. However, since the modified audio and video data sets (in the usual case) have a different amount (either more or less) of data than the original audio and video data sets, the apparent display rate of the audiovisual display generated from the modified audio and video data sets is different than the normal display rate. Further, since the modified video data set is created based upon the content of the modified audio data set and a correspondence between the modified audio data set and the original video data set, the modified video data set is synchronized (at least approximately and, possibly, exactly) with the modified audio data set and produces a display of the same or approximately the same duration.
- The audio data can be modified in any suitable manner; one way is described below. An audio data set is divided into non-overlapping segments of equal length. Generally, the beginning and end of each segment are overlapped with the end and beginning, respectively, of adjacent segments. (Note that the overlap can be negative, such that the length of the adjacent segments is extended.) The audio data of corresponding overlapped portions of adjacent segments are blended and replaced by the blended audio data. The possible lengths of each overlap are constrained in accordance with a target overlap that corresponds to the specified target display rate. However, within this constraint, the length of each particular overlap is chosen so that the pitch pulses of the overlapped portions closely resemble each other. Consequently, the blending of the audio data of the overlapped portions does not greatly distort the sound corresponding to the overlapped portions of audio data. Thus, the invention enables the audio data set to be condensed or expanded a desired amount (i.e., the display of an audio data set can be speeded up or slowed down as desired), while minimizing the amount of distortion associated with the modification of the audio data set (i.e., the audio display sounds “normal”).
- Since the actual amount of overlap of segments can vary from the target overlap that corresponds to the specified target display rate, the actual apparent display rate can vary from the target display rate. Over relatively long periods of time (e.g., greater than approximately 0.5 seconds), the actual apparent display rate typically closely approximates the target display rate. Over shorter time periods (e.g., approximately 30 milliseconds), the actual apparent display rate can vary more substantially from the target display rate. However, these short term fluctuations are not perceptible to an observer. Thus, this method produces an actual apparent display rate that to an observer appears to faithfully track the target display rate over the entire range of the display.
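- The overlap-and-blend idea can be sketched in a few functions. This is a simplified illustration, not the method of the referenced application: it handles only speed-up (positive overlap), uses a normalized correlation as the measure of how closely the overlapped portions resemble each other, and blends with a linear cross-fade. The relation between target overlap and target rate, and the `slack` parameter bounding the search, are assumptions.

```python
import math
from typing import List, Sequence


def target_overlap(segment_len: int, rate: float) -> int:
    """Overlap (samples) at which each segment adds segment_len / rate of
    new output. Negative for rate < 1, i.e. the segments are spread apart."""
    return round(segment_len * (1.0 - 1.0 / rate))


def match_score(a: Sequence[float], b: Sequence[float]) -> float:
    """Normalized correlation of two equal-length runs of samples."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a) * sum(y * y for y in b))
    return dot / norm if norm else 0.0


def choose_overlap(out: Sequence[float], seg: Sequence[float],
                   target: int, slack: int) -> int:
    """Overlap within target +/- slack whose overlapped portions match best."""
    lo = max(1, target - slack)
    hi = min(target + slack, len(out), len(seg))
    if hi < lo:
        return 0  # no usable positive overlap; fall back to concatenation
    return max(range(lo, hi + 1),
               key=lambda k: match_score(out[-k:], seg[:k]))


def blend_append(out: List[float], seg: Sequence[float], k: int) -> None:
    """Cross-fade the last k samples of out into the first k of seg,
    then append the remainder of seg."""
    base = len(out) - k
    for i in range(k):
        w = (i + 1) / (k + 1)
        out[base + i] = (1.0 - w) * out[base + i] + w * seg[i]
    out.extend(seg[k:])


def speed_up(samples: Sequence[float], segment_len: int,
             rate: float, slack: int) -> List[float]:
    """Condense samples by roughly the given rate (rate > 1 assumed)."""
    target = target_overlap(segment_len, rate)
    out = list(samples[:segment_len])
    for start in range(segment_len, len(samples), segment_len):
        seg = samples[start:start + segment_len]
        blend_append(out, seg, choose_overlap(out, seg, target, slack))
    return out


# A 400-sample sine wave with a 20-sample period, condensed about 2x.
tone = [math.sin(2 * math.pi * n / 20) for n in range(400)]
fast = speed_up(tone, segment_len=100, rate=2.0, slack=10)
```

Because each overlap may differ from the target by up to `slack` samples, the output length only approximates `len(tone) / rate`, which mirrors the short-term variation of the actual apparent display rate discussed above.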
- Preferably, the computation required to produce a particular amount of variation in the apparent display rate is done at the time that the determination of a target display rate mandates such variation. This has the advantage of reducing the amount of data storage capacity required by a system of the invention. This also enables any magnitude of apparent display rate to be specified over a continuous range of allowed display rates, rather than restricting the magnitude of the apparent display rate to one of a set of discrete magnitudes within an allowed range, as would be necessary if all of the computations for each magnitude of apparent display rate were pre-computed. Additionally, this enables the apparent display rate of the display to be varied in real time.
- 2. Summarization
- A system according to the invention can include another information presentation feature that enables the display of a primary segment or segments to be summarized. Summarization enables an observer to quickly get an overview of the content of a particular segment or segments of information. Summarization can be implemented by appropriately programming a digital computer to accomplish the functions of a summarization method. Generally, summarization can be accomplished using any appropriate method. As with skimming, discussed above, the particular method used will typically depend upon the type of the set of data (e.g., audio, video, text) that is directly modified to produce appropriately modified data for use in generating a summary display of the primary information. The method also preferably synchronizes the sets of data that are not modified directly with the set of data that is.
- For example, text data that is part of, or derived from, audiovisual data that represents a primary segment can be summarized, and the corresponding audio and video data summarized based upon the text summary. One method of accomplishing such summarization is described in detail in the commonly owned, co-pending U.S. patent application entitled “Indirect Manipulation Of Data Using Temporally Related Data, With Particular Application To Manipulation Of Audio Or Audiovisual Data,” by Emanuel E. Farber and Subutai Ahmad, attorney docket number I0359-991110, filed on the same day as the present application, the disclosure of which is incorporated by reference herein. A brief description of that method is given immediately below.
- The text data of a set of audiovisual data represents a transcription of the spoken portion of the audio data and is temporally related to each of the audio and video data. The text data can be obtained in any appropriate manner, e.g., the text data can be pre-existing text data such as closed-caption data or subtitles, or the text data can be obtained by using any of a number of known speech recognition methods to analyze the audio data to produce the text data.
- The text data is summarized using an appropriate summarization method. Generally, any text summarization method can be used; a particular example of a text summarization method that can be used with the invention is described in U.S. Pat. No. 5,384,703, issued to Withgott et al. on Jan. 24, 1995.
- The unsummarized text data is aligned with the unsummarized audio data. If the text data has been obtained from the audio data using a speech recognition method, then the alignment of the unsummarized text data with the unsummarized audio data typically exists as a byproduct of the speech recognition method. Otherwise, alignment is accomplished in three steps. First, the unsummarized text data is evaluated to generate a corresponding linguistic transcription network (e.g., a network describing the set of possible phonetic transcriptions). Second, a feature analysis is performed on the audio samples comprising the unsummarized audio data set to create a set of audio feature data. Third, the linguistic transcription network is compared to the set of audio feature data (using Hidden Markov Models to describe the linguistic units of the linguistic transcription network in terms of audio features) to determine the linguistic transcription (from all of the possible linguistic transcriptions allowed by the linguistic transcription network) which best fits the set of audio feature data. As a result of this comparison, the audio features of the best fit linguistic transcription are correlated with audio features in the set of audio feature data. The audio features of the best fit linguistic transcription can also be correlated with the linguistic units of the linguistic transcription network. The linguistic units of the linguistic transcription network can, in turn, be correlated with the unsummarized text data. As a consequence of these correlations, an alignment of the unsummarized text data with the unsummarized audio data can be obtained. Using the previously determined text summary and the alignment between the text data and audio data, an audio summary can be produced.
- A video summary can be produced from the audio summary using an alignment between the unsummarized audio data and the unsummarized video data. Such alignment can be pre-existing (because the audio data and video data were recorded together, the alignment being inherent because of the like time stamps associated with each of the audio and video data) or can be calculated easily (the time stamp for an audio sample or video frame can be calculated by multiplying the time duration of each sample or frame by the sequence number of the sample or frame within the audio data or video data).
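- The time-stamp calculation mentioned above, and its use to find the video frames that accompany a retained span of audio, can be sketched as follows. The sketch assumes constant, integer sample and frame rates and that both data sets start at time zero; the function names are hypothetical.

```python
def timestamp_s(sequence_number: int, unit_duration_s: float) -> float:
    """Time stamp of a sample or frame: its duration times its sequence
    number within the audio data or video data."""
    return sequence_number * unit_duration_s


def frames_for_audio_span(first_sample: int, last_sample: int,
                          sample_rate_hz: int, frame_rate_hz: int) -> range:
    """Video frames displayed while audio samples first..last are played.

    Integer arithmetic avoids rounding problems at frame boundaries.
    """
    first_frame = first_sample * frame_rate_hz // sample_rate_hz
    last_frame = last_sample * frame_rate_hz // sample_rate_hz
    return range(first_frame, last_frame + 1)


# Hypothetical audio summary span: samples 16000..31999 at 48 kHz,
# accompanied by video at 30 frames per second.
kept_frames = frames_for_audio_span(16000, 31999, 48000, 30)
```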
- Another method that can be used to summarize the display of a set of audiovisual information includes identifying and eliminating “sound bites” (defined below) in the audio portion of the primary information. The sound bites can be identified based upon analysis of a set of text data that corresponds to the spoken portion of the set of audio data. The text data can be obtained in any appropriate manner. For example, the text data may be closed caption data that is provided with the audio and video data representing the primary information. Or, the text data can be obtained from the set of audio data using conventional speech recognition techniques. Once the text data is obtained, the text data can be “pre-processed” using known methods to classify the words in the text data according to their characteristics, e.g., part of speech.
- Herein, a “sound bite” is a related set of contiguous audio information that conforms to one or more predetermined criteria that are intended to identify short spoken phrases that are not spoken by a previously identified primary speaker and that represent information of little interest and/or are redundant. For example, in a news browser according to the invention, where the primary information includes the content of audiovisual news programs (e.g., television news programs), the predetermined criteria can be established so that spoken portions of the audio information that are likely not to have been spoken by a news anchorperson or a news reporter are identified as sound bites. Such criteria might include, for example, rules that tend to identify a spoken portion of the audio as a sound bite if the spoken portion includes slang words or the use of first person pronouns (e.g., I or we), both of which tend not to be present in the speech of an anchorperson or reporter. As can be appreciated, elimination of such audio portions will typically not significantly adversely affect the presentation of the essential content of a set of audio information, but will enable the set of audio information to be presented more quickly. (It should be noted that the summarization method of Withgott et al. was also found to be incidentally effective at eliminating sound bites.)
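- A rule of the kind described above might be sketched as a simple word test on the text corresponding to a spoken phrase. The word lists and the length limit are purely illustrative assumptions; the text names only slang words and first person pronouns such as "I" or "we" as example criteria.

```python
import re

# Illustrative word lists; both are assumptions, not taken from the text.
FIRST_PERSON = {"i", "we", "me", "my", "our"}
SLANG = {"gonna", "wanna", "yeah", "ain't"}


def looks_like_sound_bite(phrase: str, max_words: int = 25) -> bool:
    """Flag a short spoken phrase that contains slang or a first person
    pronoun, which an anchorperson or reporter would tend not to use."""
    words = re.findall(r"[a-z']+", phrase.lower())
    if not words or len(words) > max_words:
        return False
    return any(w in FIRST_PERSON or w in SLANG for w in words)
```

Contractions such as "I'm" are not matched by this simple test; a fuller rule set would rely on the part-of-speech pre-processing mentioned earlier.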
- Once the audio data has been modified by eliminating the audio data corresponding to the sound bites, the set of modified audio data must be aligned (synchronized) with the video data (if present) to enable the video data to be modified to produce a speeded-up video display. As described above with respect to the summarization method of Farber and Ahmad, the audio/video alignment can either be pre-existing or calculated easily.
- As can be appreciated, a summarization method such as one of those described above could be used in combination with a method for increasing the apparent display rate as described above (see section IV.C.1. above on Skimming) to even further condense the display of a set of primary information. For example, the set or sets of data representing the primary information could be modified to increase the apparent display rate, then the modified set or sets of data could be summarized to produce a speeded-up summary of the set of primary information. Or, conversely, the set or sets of data representing the primary information could be summarized, then the summarized set or sets of data modified to increase the apparent display rate, thus producing a speeded-up summary of the set of primary information.
- As can be appreciated, the methods described above for manipulating audiovisual data to produce a summarized display of the audiovisual data can also be used, with appropriate modification (e.g., instead of producing a summary of the text data, the text data could be manipulated in some other desired fashion), to manipulate the audiovisual data for some other purpose, such as rearranging, editing, selectively accessing or searching the audiovisual data.
- 3. Display Pause with Elastic Playback
- A system according to the invention can include yet another information presentation feature that enables the display of an image to be paused, then, at the end of the pause, resumed at an accelerated rate (i.e., a rate that is faster than a normal display rate) until a time at which the content of the display corresponds to the content that would have been displayed had the image been displayed at the normal display rate without the pause, at which time display of the image at the normal display rate resumes. In other words, after a pause, the image display is speeded up so that the display “catches up” to where it would have been without the pause, then slowed back down to the normal display rate. The implementation of this feature is described in detail in the commonly owned, co-pending U.S. patent application entitled “Display Pause with Elastic Playback,” by Subutai Ahmad, Neal A. Bhadkamkar, Steve B. Cousins, Paul A. Freiberger and Brygg A. Ullmer, attorney docket number I0359-991150, filed on the same day as the present application, the disclosure of which is incorporated by reference herein. A brief description of the implementation is given immediately below.
- The image to be displayed is represented by an ordered set of display data. This display data is acquired from a data source at a first rate. The display data is transferred to a display device at the first rate as the display data is acquired. An image is generated from the display data transferred to the display device and displayed on the display device. At some point, the user instructs the system to pause the display. The system identifies the pause instruction from the user and, in response, stops the transfer of display data to the display device and begins storing the acquired display data at the first rate. At some later time, the user instructs the system to resume the display. The system identifies the resume instruction from the user and, in response, begins transferring stored display data to the display device at a second, effective rate that is greater than the first rate. An image is generated from the stored display data transferred to the display device and displayed on the display device. While the stored display data is being transferred to the display device, the newly acquired data continues to be stored. The storage of display data finally stops when there is no more stored display data to be transferred to the display device, the amount of stored display data having gradually been reduced by transferral of the stored display data to the display device at the second, effective rate that is greater than the first rate at which the display data is stored. Once the storage of display data stops, the display data is again transferred to the display device at the first rate as the display data is acquired.
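- The buffering behavior described above can be sketched as a small simulation. This is a simplified model under stated assumptions, not the implementation of the referenced co-pending application: time advances in discrete ticks, one unit of display data (a "frame") is acquired per tick at the first rate, and the second, effective rate is modeled as `speedup` frames per tick. The function and parameter names are hypothetical.

```python
from collections import deque


def elastic_playback(total_frames, pause_at, resume_at, speedup=2):
    """Simulate pause and catch-up; return a list of (tick, frames_shown) pairs."""
    buffer = deque()      # display data stored while paused or catching up
    log = []
    catching_up = False
    for tick in range(total_frames):
        frame = tick      # one frame is acquired per tick (the first rate)
        if tick == resume_at:
            catching_up = True
        if pause_at <= tick < resume_at:
            # Paused: nothing is displayed, acquired data is stored.
            buffer.append(frame)
            log.append((tick, []))
        elif catching_up:
            # Resumed: keep storing new data, drain the store at the faster rate.
            buffer.append(frame)
            count = min(speedup, len(buffer))
            log.append((tick, [buffer.popleft() for _ in range(count)]))
            if not buffer:
                catching_up = False   # caught up: storage stops
        else:
            # Normal display: data goes to the display as it is acquired.
            log.append((tick, [frame]))
    return log


log = elastic_playback(total_frames=12, pause_at=2, resume_at=5, speedup=2)
```

- In this model the store shrinks by `speedup - 1` frames per tick after the resume instruction, so a pause of P ticks is made up in P / (speedup - 1) ticks. With the three-tick pause and `speedup=2` used above, the display has caught up three ticks after it resumes, and every frame is still shown exactly once and in order.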
- This feature of the invention enables a great deal of flexibility in observing a real-time display of audiovisual information. For example, the invention enables an observer to pause and resume the display as desired so that, if the observer wants to temporarily stop watching to go to the bathroom or to take a phone call, the observer can pause the display, then, after resuming the display upon return, watch the audiovisual information at an accelerated display rate until the display of the program catches up to where it would have been without the pause. Thus, the user can attend to other matters while the audiovisual information is being viewed, without sacrificing viewing any of the content of the audiovisual information or enduring the inconvenience of spending additional time to finish watching the audiovisual program. This feature of the invention can also be tailored to enable a user who has begun viewing the audiovisual information at a time later than desired to observe the audiovisual information at an accelerated rate until the display catches up to the point at which the display would have been if the audiovisual information had been viewed at a normal display rate beginning at the desired start time.
- Various embodiments of the invention have been described.
- The descriptions are intended to be illustrative, not limitative. Thus, it will be apparent to one skilled in the art that certain modifications may be made to the invention as described without departing from the scope of the claims set out below.
Claims (62)
1. A system for acquiring and reviewing a body of information, wherein the body of information includes a plurality of segments, each segment representing a defined set of information in the body of information, the system comprising:
means for acquiring data representing the body of information;
means for storing the acquired data;
first display means for generating a display of a first segment of the body of information from data that is part of the stored data;
means for comparing data representing a segment of the body of information to data representing a different segment of the body of information to determine whether, according to one or more predetermined criteria, the compared segments are related; and
second display means for generating a display of a portion of, or a representation of, a second segment of the body of information from data that is part of the stored data, wherein the second display means displays the portion or representation of the second segment in response to the display by the first display means of a first segment to which the second segment is related.
2. A system as in claim 1, wherein the second display means displays the portion or representation of the second segment substantially coextensive in time with the display of the related first segment by the first display means.
3. A system as in claim 1, wherein:
at least a portion of the body of information is represented by audiovisual data;
the first segment is represented by audiovisual data;
the first display means displays an audiovisual display of the first segment; and
the second segment is represented by audiovisual data.
4. A system as in claim 3, further comprising means for selecting a segment for which a portion or representation is displayed by the second display means, wherein selection of such segment causes the first display means to display an audiovisual display of the selected segment.
5. A system as in claim 1, wherein:
at least a portion of the body of information is represented by audiovisual data;
the first display means displays an audiovisual display of the first segment; and
the second display means displays a text display of a portion or representation of the second segment.
6. A system as in claim 1, wherein:
the first display means is an analog display device; and
the second display means is a digital display device.
7. A system as in claim 1, wherein:
the first display means is a television; and
the second display means is a computer display monitor.
8. A system as in claim 1, further comprising means for identifying the subject matter content of a segment of the body of information, wherein the means for comparing further comprises means for determining the similarity of the subject matter content of a segment to the subject matter content of a different segment, the predetermined criteria including a predefined degree of similarity with respect to which the relatedness of the compared segments is determined.
9. A system as in claim 8, wherein the means for determining the similarity of the subject matter of segments further comprises means for performing a relevance feedback method.
10. A system as in claim 1, wherein the means for acquiring data further comprises means for acquiring television broadcast signals.
11. A system as in claim 1, wherein the means for acquiring data further comprises means for acquiring radio broadcast signals.
12. A system as in claim 1, wherein the means for acquiring data further comprises means for acquiring computer-readable data files over a computer network from an information providing site that is part of that network.
13. A system as in claim 1, wherein the means for acquiring data further comprises:
means for acquiring television broadcast signals; and
means for acquiring computer-readable data files over a computer network from an information providing site that is part of that network.
14. A system as in claim 13, wherein:
the first segment is represented by data produced from the television broadcast signals; and
the second segment is represented by data from the computer-readable data files.
15. A system as in claim 1, further comprising means for identifying an instruction from a user to begin displaying at least some of the body of information, wherein the first display means begins displaying a segment in response to the user instruction.
16. A system as in claim 1, wherein the first and second display means are physically separate.
17. A system as in claim 1, wherein the means for storing the acquired data, the first display means and the second display means are interconnected to a conventional computer bus that enables the devices to communicate with each other such that the devices do not require wire communication over network communication lines to communicate with each other.
18. A system for reviewing a body of audiovisual information that can vary with time, the system comprising:
means for displaying the audiovisual information; and
means for controlling operation of the system, the means for controlling being physically separate from the means for displaying, the means for controlling including a graphical user interface for enabling specification of control instructions.
19. A system as in claim 18, wherein the means for controlling is portable.
20. A system as in claim 19, further comprising means for 2-way wireless communication between the means for displaying and the means for controlling.
21. A system as in claim 18, wherein the graphical user interface includes a playback control region for enabling specification of control instructions that control the manner in which the audiovisual information is displayed by the means for displaying.
22. A system as in claim 21, wherein:
the audiovisual information includes a plurality of segments, each segment representing a defined set of information in the audiovisual information;
the playback control region includes an interface that enables selection of one of a plurality of subject matter categories; and
the means for controlling further comprises:
means for identifying the subject matter category of a segment; and
means for controlling the system to display each of the segments that correspond to a selected subject matter category.
23. A system as in claim 21, wherein:
the playback control region includes an interface that enables variation of the apparent display rate at which the audiovisual information is displayed; and
the means for controlling further comprises means for controlling the means for displaying to cause the audiovisual information to be displayed at an apparent display rate other than a normal display rate.
24. A system as in claim 21, wherein:
the audiovisual information includes a plurality of segments, each segment representing a defined set of information in the audiovisual information;
the playback control region includes an interface that enables specification of the display of a summary of a segment of the audiovisual information; and
the means for controlling further comprises:
means for summarizing a segment of the audiovisual information; and
means for controlling the means for displaying to cause the summary of the segment to be displayed.
25. A system as in claim 21, wherein:
the playback control region includes:
an interface that enables specification of a pause instruction; and
an interface that enables specification of a resume instruction; and
the means for controlling further comprises:
means for identifying a pause instruction from a user;
means for controlling the means for displaying to stop the display of the audiovisual information in response to identification of the pause instruction;
means for identifying a resume instruction from a user; and
means for controlling the means for displaying to restart the display of the audiovisual information in response to identification of the resume instruction, wherein the audiovisual information is displayed at an accelerated rate that is greater than the rate at which the audiovisual information was previously displayed, such accelerated rate continuing until the display of the audiovisual information coincides with the display that would have appeared had the display not been paused.
26. A system as in claim 21, wherein:
the audiovisual information includes a plurality of segments, each segment representing a defined set of information in the audiovisual information;
the playback control region includes an interface that enables specification of a termination instruction; and
the means for controlling further comprises:
means for identifying a termination instruction from a user; and
means for terminating display of the segment currently being displayed and beginning display of a new segment in response to identification of a termination instruction.
27. A system as in claim 21, wherein:
the audiovisual information includes a plurality of segments, each segment representing a defined set of information in the audiovisual information;
the playback control region includes an interface that enables specification of a repeat instruction; and
the means for controlling further comprises:
means for identifying a repeat instruction from a user; and
means for repeating the display of the segment currently being displayed in response to identification of a repeat instruction.
28. A system as in claim 18, wherein the graphical user interface includes a map region for providing a description of the subject matter content of the audiovisual information and for enabling specification of control instructions that enable navigation within the audiovisual information.
29. A system as in claim 28, wherein:
the audiovisual information includes a plurality of segments, each segment representing a defined set of information in the audiovisual information; and
the map region further identifies a segment of the audiovisual information that is currently being displayed.
30. A system as in claim 28, wherein:
the audiovisual information includes a plurality of segments, each segment representing a defined set of information in the audiovisual information; and
the map region further identifies each segment of the audiovisual information that has previously been displayed.
31. A system as in claim 18, wherein:
the audiovisual information includes a plurality of segments, each segment representing a defined set of information in the audiovisual information; and
the graphical user interface includes a related information region for displaying a portion of, or a representation of, a segment that is related to a segment being displayed by the means for displaying.
32. A system as in claim 18, wherein:
the audiovisual information includes a plurality of segments, each segment representing a defined set of information in the audiovisual information; and
the graphical user interface includes a secondary information display region for displaying a secondary information segment that is related to a segment of the audiovisual information that is being displayed by the means for displaying.
33. A system as in claim 18, wherein the audiovisual information further comprises the content from one or more news programs.
34. A system for reviewing a body of information, the body of information including a first portion that is represented by audiovisual data that can vary with time and a second portion that is represented by text data, comprising:
a first display device for displaying the first portion of information, the first display device particularly adapted for generation of a display from time-varying audiovisual data; and
a second display device for displaying the second portion of information, the second display device particularly adapted for generation of a display from text data.
35. A method for acquiring and reviewing a body of information, wherein the body of information includes a plurality of segments, each segment representing a defined set of information in the body of information, the method comprising the steps of:
acquiring data representing the body of information;
storing the acquired data;
generating a display of a first segment of the body of information from data that is part of the stored data;
comparing data representing a segment of the body of information to data representing a different segment of the body of information to determine whether, according to one or more predetermined criteria, the compared segments are related; and
generating a display of a portion of, or a representation of, a second segment of the body of information from data that is part of the stored data, wherein the display of the portion or representation of the second segment is generated in response to the display of a first segment to which the second segment is related.
36. A method for categorizing according to subject matter an uncategorized segment of a body of information that includes a plurality of segments, each segment representing a defined set of information in the body of information, one or more segments of the body of information having previously been categorized by identifying each of the one or more segments with one or more subject matter categories, the method comprising the steps of:
determining the degree of similarity between the subject matter content of the uncategorized segment and the subject matter content of each of the previously categorized segments;
identifying one or more of the previously categorized segments as relevant to the uncategorized segment based upon the determined degrees of similarity of subject matter content between the uncategorized segment and the previously categorized segments; and
selecting one or more subject matter categories with which to identify the uncategorized segment based upon the subject matter categories used to identify the relevant previously categorized segments.
37. A method as in claim 36, wherein the step of determining the degree of similarity is accomplished using a relevance feedback method.
38. A method as in claim 36, wherein the step of identifying one or more of the previously categorized segments as relevant to the uncategorized segment further comprises the steps of:
identifying a plurality of the previously categorized segments that are the most similar to the uncategorized segment;
determining the degree of similarity between each of the plurality of previously categorized segments and each other of the plurality of previously categorized segments;
for each pair of previously categorized segments of the plurality of previously categorized segments having greater than a predefined degree of similarity, eliminating one of the pair of previously categorized segments from the plurality of previously categorized segments, wherein the previously categorized segment or segments remaining after the step of eliminating are similar and distinct previously categorized segments; and
identifying one or more of the similar and distinct previously categorized segments as relevant previously categorized segments.
39. A method as in claim 36, wherein the step of selecting one or more subject matter categories further comprises selecting the most frequently occurring subject matter category or categories associated with the relevant previously categorized segments.
40. A system as in claim 36, wherein the uncategorized segment has been acquired from a first data source and the previously categorized segment or segments have been acquired from a second data source that is different than the first data source.
41. A system as in claim 40, wherein:
the data acquired from the first data source are television or radio broadcast signals; and
the data acquired from the second data source are computer-readable data files.
42. A method for determining whether a first set of information represented by a set of data of a first type is relevant to a second set of information represented by a set of data of a second type, the first and second sets of information being different from each other, the method comprising the steps of:
deriving a set of data of the second type from the set of data of the first type, the derived set of data of the second type also being representative of the first set of information;
determining the degree of similarity between the set of data of the second type representing the second set of information and the derived set of data of the second type representing the first set of information; and
determining whether the first set of information is relevant to the second set of information based upon the degree of similarity between the set of data of the second type representing the second set of information and the derived set of data of the second type representing the first set of information.
43. A method as in claim 42, wherein the first type of data is audiovisual data and the second type of data is text data.
44. A method as in claim 43, wherein the step of determining the degree of similarity is accomplished using a relevance feedback method.
45. A method as in claim 42, wherein a plurality of sets of information, each different from the other sets of the plurality of sets of information, are each represented by an associated set of data of the second type, the method enabling determination of which, if any, of the plurality of sets of information represented by a set of data of the second type are relevant to the first set of information represented by the set of data of the first type, the method further comprising the steps of:
determining the degree of similarity between each set of data of the second type representing one of the plurality of sets of information and the derived set of data of the second type representing the first set of information;
identifying which, if any, of the sets of data of the second type representing one of the plurality of sets of information have greater than a predefined degree of similarity to the derived set of data of the second type representing the first set of information, the sets of data of the second type so identified being termed similar sets of data of the second type;
determining the degree of similarity between each similar set of data of the second type and each other similar set of data of the second type;
for each pair of similar sets of data of the second type having greater than a predefined degree of similarity, eliminating one of the pair of similar sets of data of the second type from the set of similar sets of data of the second type, wherein the set or sets of similar data of the second type remaining after the step of eliminating are similar and distinct sets of data of the second type; and
identifying the set or sets of information corresponding to one or more of the similar and distinct sets of data of the second type as relevant to the second set of information.
46. A method as in claim 45, wherein the step of identifying the relevant set or sets of information further comprises identifying no more than a predetermined number of relevant sets of information, the predetermined number of relevant sets of information corresponding to the sets of data of the second type having the greatest degree of similarity to the derived set of data of the second type.
47. A method for identifying the boundaries of segments in a body of information, each segment comprising a contiguous related set of information in the body of information, wherein the body of information is represented by at least a set of text data and a set of video data, the method comprising the steps of:
performing a coarse partitioning method, the coarse partitioning method further comprising the steps of:
identifying time-stamped markers in the set of text data; and
determining approximate segment boundaries within the body of information as the times of occurrence of the time-stamp markers;
for each approximate segment boundary, specifying a range of time that includes the time of occurrence of the approximate segment boundary;
extracting subsets of video data from the set of video data that occur during the specified ranges of time;
performing a fine partitioning method to identify one or more breaks in the set of video data; and
selecting the best break that occurs in each subset of video data, the time of occurrence of the best break in each subset being designated as a boundary of a segment in the body of information.
48. A method as in claim 47, wherein the step of performing a fine partitioning method further comprises identifying the best breaks using a process that includes scene break identification.
49. A method as in claim 47, wherein the step of fine partitioning is performed on the entire set of video data to identify all of the breaks in the set of video data.
50. A method as in claim 47, wherein the step of fine partitioning is performed only on the subsets of video data to identify only breaks that occur in the subsets.
51. A method as in claim 47, wherein the best break of each subset is determined according to the criteria of the fine partitioning method used.
52. A method as in claim 47, wherein the best break of each subset is the break occurring closest in time to the time of occurrence of the segment boundary in the text data that corresponds to that subset.
53. A method as in claim 47, wherein the body of information is represented by a set of text data, a set of audio data and a set of video data, the method further comprising the steps of:
ascertaining a synchronization of the audio data and the video data; and
determining the location of the segment boundaries in the set of audio data using the previously determined location of the segment boundaries in the set of video data and the synchronization of the audio data and video data.
54. A method for identifying the boundaries of segments in a body of information, each segment comprising a contiguous related set of information in the body of information, wherein the body of information is represented by a set of text data, a set of video data, and a set of audio data, the method comprising the steps of:
performing a coarse partitioning method, the coarse partitioning method further comprising the steps of:
identifying time-stamped markers in the set of text data; and
determining approximate segment boundaries within the body of information as the times of occurrence of the time-stamp markers;
for each approximate segment boundary, specifying a range of time that includes the time of occurrence of the approximate segment boundary;
extracting subsets of audio data from the set of audio data that occur during the specified ranges of time;
performing a fine partitioning method to identify one or more breaks in the set of audio data;
selecting the best break that occurs in each subset of audio data, the time of occurrence of the best break in each subset being designated as a boundary of a segment in the body of information;
ascertaining a synchronization of the audio data and the video data; and
determining the location of the segment boundaries in the set of video data using the previously determined location of the segment boundaries in the set of audio data and the synchronization of the audio data and video data.
55. A method as in claim 54, wherein the step of performing fine partitioning further comprises identifying the best breaks using a process that includes pause recognition.
56. A method as in claim 54, wherein the step of performing fine partitioning further comprises identifying the best breaks using a process that includes voice recognition.
57. A method as in claim 54, wherein the step of performing fine partitioning further comprises identifying the best breaks using a process that includes word recognition.
58. A method as in claim 54, wherein the step of performing fine partitioning further comprises identifying the best breaks using a process that includes music recognition.
59. A computer readable medium encoded with one or more computer programs for enabling acquisition and review of a body of information, wherein the body of information includes a plurality of segments, each segment representing a defined set of information in the body of information, comprising:
instructions for acquiring data representing the body of information;
instructions for storing the acquired data;
instructions for generating a display of a first segment of the body of information from data that is part of the stored data;
instructions for comparing data representing a segment of the body of information to data representing a different segment of the body of information to determine whether, according to one or more predetermined criteria, the compared segments are related; and
instructions for generating a display of a portion of, or a representation of, a second segment of the body of information from data that is part of the stored data, wherein the display of the portion or representation of the second segment is generated in response to the display of a first segment to which the second segment is related.
60. A computer readable medium encoded with one or more computer programs for enabling categorization according to subject matter of an uncategorized segment of a body of information that includes a plurality of segments, each segment representing a defined set of information in the body of information, one or more segments having previously been categorized by identifying each of the one or more segments with one or more subject matter categories, comprising:
instructions for determining the degree of similarity between the subject matter content of the uncategorized segment and the subject matter content of each of the previously categorized segments;
instructions for identifying one or more of the previously categorized segments as relevant to the uncategorized segment based upon the determined degrees of similarity of subject matter content between the uncategorized segment and the previously categorized segments; and
instructions for selecting one or more subject matter categories with which to identify the uncategorized segment based upon the subject matter categories used to identify the relevant previously categorized segments.
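The three steps of claim 60 resemble a nearest-neighbor categorization, which can be sketched as follows. This is an illustrative sketch under stated assumptions, not the method of the specification: the bag-of-words cosine similarity, the choice of the `k` most similar segments as the "relevant" ones, the similarity-weighted vote, and all names are hypothetical.

```python
import math
from collections import Counter

def term_vector(text):
    """Bag-of-words term counts for a segment's text."""
    return Counter(text.lower().split())

def cosine(u, v):
    """Cosine similarity between two term-count vectors."""
    dot = sum(u[t] * v[t] for t in u if t in v)
    norm = (math.sqrt(sum(c * c for c in u.values()))
            * math.sqrt(sum(c * c for c in v.values())))
    return dot / norm if norm else 0.0

def categorize(uncategorized_text, categorized, k=2):
    """categorized is a list of (text, set_of_categories) pairs."""
    query = term_vector(uncategorized_text)
    # Step 1: degree of similarity to each previously categorized segment.
    scored = [(cosine(query, term_vector(text)), cats)
              for text, cats in categorized]
    # Step 2: treat the k most similar segments as the relevant ones.
    relevant = sorted(scored, key=lambda pair: pair[0], reverse=True)[:k]
    # Step 3: select categories used by the relevant segments,
    # weighting each category by the similarity of its segments.
    votes = Counter()
    for score, cats in relevant:
        for cat in cats:
            votes[cat] += score
    return [cat for cat, weight in votes.most_common() if weight > 0]

# Hypothetical previously categorized segments.
categorized = [
    ("stocks fell as markets reacted to interest rates", {"business"}),
    ("the senate passed the budget bill", {"politics"}),
    ("interest rates rise and bank stocks slide", {"business", "finance"}),
]
chosen = categorize("bank stocks and interest rates", categorized)
```

In this example the first and third segments are the two most similar, so the uncategorized segment is identified with "business" (supported by both) and then "finance" (supported by one).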
61. A computer readable medium encoded with one or more computer programs for enabling determination of whether a first set of information represented by a set of data of a first type is relevant to a second set of information represented by a set of data of a second type, the first and second sets of information being different from each other, comprising:
instructions for deriving a set of data of the second type from the set of data of the first type, the derived set of data of the second type also being representative of the first set of information;
instructions for determining the degree of similarity between the set of data of the second type representing the second set of information and the derived set of data of the second type representing the first set of information; and
instructions for determining whether the first set of information is relevant to the second set of information based upon the degree of similarity between the set of data of the second type representing the second set of information and the derived set of data of the second type representing the first set of information.
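Claim 61 can be read as a derive-then-compare pipeline, sketched below under explicit assumptions. The `derive` callable stands in for whatever conversion produces data of the second type (for instance, a speech recognizer turning audio into text); here it is a hypothetical dictionary lookup so that the example runs on its own. The cosine measure, the `threshold`, and all names are likewise illustrative.

```python
import math
from collections import Counter

def cosine(text_a, text_b):
    """Cosine similarity between bag-of-words vectors of two texts."""
    u, v = Counter(text_a.lower().split()), Counter(text_b.lower().split())
    dot = sum(u[t] * v[t] for t in u if t in v)
    norm = (math.sqrt(sum(c * c for c in u.values()))
            * math.sqrt(sum(c * c for c in v.values())))
    return dot / norm if norm else 0.0

def is_relevant(first_data, second_text, derive, threshold=0.3):
    """Decide relevance between data of a first type and text data.

    derive: callable mapping first-type data to second-type (text) data.
    """
    # Step 1: derive second-type data representing the first set.
    derived_text = derive(first_data)
    # Step 2: degree of similarity between the two second-type data sets.
    similarity = cosine(derived_text, second_text)
    # Step 3: relevance decision based on that degree of similarity.
    return similarity >= threshold

# Hypothetical stand-in for a real converter such as a speech recognizer.
FAKE_TRANSCRIPTS = {
    b"clip-001": "the mayor announces a bridge project downtown",
}

def fake_derive(audio_bytes):
    return FAKE_TRANSCRIPTS[audio_bytes]

match = is_relevant(b"clip-001", "mayor announces new bridge project", fake_derive)
no_match = is_relevant(b"clip-001", "weather forecast calls for rain", fake_derive)
```

Passing the converter in as a parameter keeps the comparison logic independent of how the second-type data is derived.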
62. A computer readable medium encoded with one or more computer programs for enabling identification of the boundaries of segments in a body of information, each segment comprising a contiguous related set of information in the body of information, wherein the body of information is represented by at least a set of text data and a set of video data, comprising:
instructions for performing a coarse partitioning method, the coarse partitioning instructions further comprising:
instructions for identifying time-stamped markers in the set of text data; and
instructions for determining approximate segment boundaries within the body of information as the times of occurrence of the time-stamped markers;
instructions for specifying, for each approximate segment boundary, a range of time that includes the time of occurrence of the approximate segment boundary;
instructions for extracting subsets of video data from the set of video data that occur during the specified ranges of time;
instructions for performing a fine partitioning method to identify one or more breaks in the set of video data; and
instructions for selecting the best break that occurs in each subset of video data, the time of occurrence of the best break in each subset being designated as a boundary of a segment in the body of information.
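The coarse-then-fine partitioning of claim 62 can be sketched as follows. Several details are assumptions made only so the example is concrete: captions are `(time, text)` pairs in which a line beginning with `>>>` acts as the time-stamped marker, each video frame is reduced to a single numeric feature, a "break" is the absolute feature change between consecutive frames, the "best" break is the largest such change, and the window sizes `before` and `after` are arbitrary.

```python
def coarse_boundaries(captions, marker=">>>"):
    """Approximate boundaries: times of caption lines starting with marker."""
    return [t for t, text in captions if text.startswith(marker)]

def time_window(boundary, before=3.0, after=1.0):
    """Range of time that includes the approximate boundary."""
    return (boundary - before, boundary + after)

def fine_breaks(frames):
    """Breaks in time-sorted (time, feature) frames, returned as
    (time, strength) for each pair of consecutive frames."""
    return [(t1, abs(f1 - f0))
            for (t0, f0), (t1, f1) in zip(frames, frames[1:])]

def refine(captions, frames, marker=">>>", before=3.0, after=1.0):
    """Designate one segment boundary per approximate boundary."""
    boundaries = []
    for approx in coarse_boundaries(captions, marker):
        start, end = time_window(approx, before, after)
        # Extract the subset of video data inside the window.
        subset = [(t, f) for t, f in frames if start <= t <= end]
        breaks = fine_breaks(subset)
        if breaks:
            best_time, _ = max(breaks, key=lambda b: b[1])
            boundaries.append(best_time)
        else:
            # Assumption: keep the approximate time if no break is found.
            boundaries.append(approx)
    return boundaries

# Hypothetical data: one frame per second, feature = mean brightness.
frames = [(0.0, 10), (1.0, 11), (2.0, 10), (3.0, 80), (4.0, 81),
          (5.0, 80), (6.0, 79), (7.0, 20), (8.0, 21), (9.0, 20)]
captions = [(2.0, "our top story tonight"),
            (4.5, ">>> In sports"),
            (8.5, ">>> Weather is next")]
segment_boundaries = refine(captions, frames)
```

The window extends further before each marker than after it on the assumption that caption text lags the video; with the data above, the markers at 4.5 s and 8.5 s are refined to the scene changes at 3.0 s and 7.0 s.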
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/866,956 US20010025375A1 (en) | 1996-12-05 | 2001-05-29 | Browser for use in navigating a body of information, with particular application to browsing information represented by audiovisual data |
US11/682,201 US8176515B2 (en) | 1996-12-05 | 2007-03-05 | Browser for use in navigating a body of information, with particular application to browsing information represented by audiovisual data |
US13/465,920 US20120293522A1 (en) | 1996-12-05 | 2012-05-07 | Browser for Use in Navigating a Body of Information, with Particular Application to Browsing Information Represented by Audiovisual Data |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US08/761,030 US6263507B1 (en) | 1996-12-05 | 1996-12-05 | Browser for use in navigating a body of information, with particular application to browsing information represented by audiovisual data |
US09/866,956 US20010025375A1 (en) | 1996-12-05 | 2001-05-29 | Browser for use in navigating a body of information, with particular application to browsing information represented by audiovisual data |
Related Parent Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US08/761,030 Continuation US6263507B1 (en) | 1996-12-05 | 1996-12-05 | Browser for use in navigating a body of information, with particular application to browsing information represented by audiovisual data |
Related Child Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US11/682,201 Continuation US8176515B2 (en) | 1996-12-05 | 2007-03-05 | Browser for use in navigating a body of information, with particular application to browsing information represented by audiovisual data |
Publications (1)
Publication Number | Publication Date |
---|---|
US20010025375A1 (en) | 2001-09-27 |
Family
ID=25060899
Family Applications (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US08/761,030 Expired - Lifetime US6263507B1 (en) | 1996-12-05 | 1996-12-05 | Browser for use in navigating a body of information, with particular application to browsing information represented by audiovisual data |
US09/344,213 Expired - Fee Related US6880171B1 (en) | 1996-12-05 | 1999-06-25 | Browser for use in navigating a body of information, with particular application to browsing information represented by audiovisual data |
US09/866,956 Abandoned US20010025375A1 (en) | 1996-12-05 | 2001-05-29 | Browser for use in navigating a body of information, with particular application to browsing information represented by audiovisual data |
US11/682,201 Expired - Fee Related US8176515B2 (en) | 1996-12-05 | 2007-03-05 | Browser for use in navigating a body of information, with particular application to browsing information represented by audiovisual data |
US13/465,920 Abandoned US20120293522A1 (en) | 1996-12-05 | 2012-05-07 | Browser for Use in Navigating a Body of Information, with Particular Application to Browsing Information Represented by Audiovisual Data |
Family Applications Before (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US08/761,030 Expired - Lifetime US6263507B1 (en) | 1996-12-05 | 1996-12-05 | Browser for use in navigating a body of information, with particular application to browsing information represented by audiovisual data |
US09/344,213 Expired - Fee Related US6880171B1 (en) | 1996-12-05 | 1999-06-25 | Browser for use in navigating a body of information, with particular application to browsing information represented by audiovisual data |
Family Applications After (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US11/682,201 Expired - Fee Related US8176515B2 (en) | 1996-12-05 | 2007-03-05 | Browser for use in navigating a body of information, with particular application to browsing information represented by audiovisual data |
US13/465,920 Abandoned US20120293522A1 (en) | 1996-12-05 | 2012-05-07 | Browser for Use in Navigating a Body of Information, with Particular Application to Browsing Information Represented by Audiovisual Data |
Country Status (3)
Country | Link |
---|---|
US (5) | US6263507B1 (en) |
AU (1) | AU5515498A (en) |
WO (1) | WO1998027497A1 (en) |
Cited By (25)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20030210350A1 (en) * | 2002-05-08 | 2003-11-13 | Fujitsu Ten Limited | Program information display apparatus |
US6782551B1 (en) * | 1999-07-15 | 2004-08-24 | Pace Micro Technology Plc | System for indicating when a program has been selected from a program guide display |
US20040177317A1 (en) * | 2003-03-07 | 2004-09-09 | John Bradstreet | Closed caption navigation |
US20040199906A1 (en) * | 2003-04-01 | 2004-10-07 | Mcknight Russell F. | Systems and methods for saving files having different media types |
EP1482727A2 (en) * | 2003-05-28 | 2004-12-01 | Thomson Licensing S.A. | Process of navigation for the selection of documents associated with identifiers, and apparatus implementing the process. |
US20060210248A1 (en) * | 2005-03-18 | 2006-09-21 | Kabushiki Kaisha Toshiba | Information recording apparatus and information |
US20070226640A1 (en) * | 2000-11-15 | 2007-09-27 | Holbrook David M | Apparatus and methods for organizing and/or presenting data |
US20070245242A1 (en) * | 2006-04-12 | 2007-10-18 | Yagnik Jay N | Method and apparatus for automatically summarizing video |
US20080086453A1 (en) * | 2006-10-05 | 2008-04-10 | Fabian-Baber, Inc. | Method and apparatus for correlating the results of a computer network text search with relevant multimedia files |
US20110042824A1 (en) * | 2009-08-20 | 2011-02-24 | Fujitsu Limited | Multi-chip module and method of manufacturing the same |
US8087050B2 (en) * | 1998-08-21 | 2011-12-27 | United Video Properties, Inc. | Client-server electronic program guide |
US8229156B1 (en) | 2006-08-08 | 2012-07-24 | Google Inc. | Using curve invariants to automatically characterize videos |
US8230470B2 (en) | 2003-01-15 | 2012-07-24 | Robertson Neil C | Full duplex wideband communications system for a local coaxial network |
US20120311434A1 (en) * | 2003-12-17 | 2012-12-06 | Richard Skrenta | System and method for automating categorization and aggregation of content from network sites |
US20140223482A1 (en) * | 2013-02-05 | 2014-08-07 | Redux, Inc. | Video preview creation with link |
US8806536B2 (en) | 1998-03-04 | 2014-08-12 | United Video Properties, Inc. | Program guide system with preference profiles |
US9075861B2 (en) | 2006-03-06 | 2015-07-07 | Veveo, Inc. | Methods and systems for segmenting relative user preferences into fine-grain and coarse-grain collections |
US9166714B2 (en) | 2009-09-11 | 2015-10-20 | Veveo, Inc. | Method of and system for presenting enriched video viewing analytics |
US9191722B2 (en) | 1997-07-21 | 2015-11-17 | Rovi Guides, Inc. | System and method for modifying advertisement responsive to EPG information |
US9319735B2 (en) | 1995-06-07 | 2016-04-19 | Rovi Guides, Inc. | Electronic television program guide schedule system and method with data feed access |
US9326025B2 (en) | 2007-03-09 | 2016-04-26 | Rovi Technologies Corporation | Media content search results ranked by popularity |
US9736524B2 (en) | 2011-01-06 | 2017-08-15 | Veveo, Inc. | Methods of and systems for content search based on environment sampling |
US9749693B2 (en) | 2006-03-24 | 2017-08-29 | Rovi Guides, Inc. | Interactive media guidance application with intelligent navigation and display features |
USRE46651E1 (en) | 2000-11-15 | 2017-12-26 | Callahan Cellular L.L.C. | Apparatus and methods for organizing and/or presenting data |
US10631066B2 (en) | 2009-09-23 | 2020-04-21 | Rovi Guides, Inc. | Systems and method for automatically detecting users within detection regions of media devices |
Families Citing this family (309)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8352400B2 (en) | 1991-12-23 | 2013-01-08 | Hoffberg Steven M | Adaptive pattern recognition based controller apparatus and method and human-factored interface therefore |
EP0688488A1 (en) * | 1993-03-05 | 1995-12-27 | MANKOVITZ, Roy J. | Apparatus and method using compressed codes for television program record scheduling |
US6239794B1 (en) | 1994-08-31 | 2001-05-29 | E Guide, Inc. | Method and system for simultaneously displaying a television program and information about the program |
US8793738B2 (en) | 1994-05-04 | 2014-07-29 | Starsight Telecast Incorporated | Television system with downloadable features |
ES2196087T3 (en) * | 1994-10-27 | 2003-12-16 | Index Systems Inc | SYSTEM AND METHOD FOR DOWNLOADING PROGRAMMING DATA FROM A RECORDER ON A VIDEO SIGNAL. |
WO1996027983A1 (en) | 1995-03-07 | 1996-09-12 | Interval Research Corporation | System and method for selective recording of information |
US8850477B2 (en) | 1995-10-02 | 2014-09-30 | Starsight Telecast, Inc. | Systems and methods for linking television viewers with advertisers and broadcasters |
US6002394A (en) | 1995-10-02 | 1999-12-14 | Starsight Telecast, Inc. | Systems and methods for linking television viewers with advertisers and broadcasters |
US6323911B1 (en) | 1995-10-02 | 2001-11-27 | Starsight Telecast, Inc. | System and method for using television schedule information |
US5940073A (en) | 1996-05-03 | 1999-08-17 | Starsight Telecast Inc. | Method and system for displaying other information in a TV program guide |
US6263507B1 (en) * | 1996-12-05 | 2001-07-17 | Interval Research Corporation | Browser for use in navigating a body of information, with particular application to browsing information represented by audiovisual data |
US5893062A (en) | 1996-12-05 | 1999-04-06 | Interval Research Corporation | Variable rate video playback with synchronized audio |
US8635649B2 (en) | 1996-12-19 | 2014-01-21 | Gemstar Development Corporation | System and method for modifying advertisement responsive to EPG information |
US6687906B1 (en) | 1996-12-19 | 2004-02-03 | Index Systems, Inc. | EPG with advertising inserts |
WO1998043406A1 (en) * | 1997-03-21 | 1998-10-01 | Walker Asset Management Limited Partnership | System and method for supplying supplemental audio and visual information for video programs |
US6209028B1 (en) | 1997-03-21 | 2001-03-27 | Walker Digital, Llc | System and method for supplying supplemental audio information for broadcast television programs |
JP4072233B2 (en) * | 1997-03-24 | 2008-04-09 | キヤノン株式会社 | Information processing device |
US6564383B1 (en) * | 1997-04-14 | 2003-05-13 | International Business Machines Corporation | Method and system for interactively capturing organizing and presenting information generated from television programs to viewers |
GB9712724D0 (en) * | 1997-06-18 | 1997-08-20 | Holmes Steven | Method and apparatus for interaction with broadcast television content |
US8073921B2 (en) * | 1997-07-01 | 2011-12-06 | Advanced Technology Company, LLC | Methods for remote monitoring and control of appliances over a computer network |
JP3413065B2 (en) * | 1997-07-03 | 2003-06-03 | 松下電器産業株式会社 | Program information processing device |
IL125141A0 (en) * | 1998-06-29 | 1999-01-26 | Nds Ltd | Advanced television system |
US6604240B2 (en) | 1997-10-06 | 2003-08-05 | United Video Properties, Inc. | Interactive television program guide system with operator showcase |
US6219837B1 (en) * | 1997-10-23 | 2001-04-17 | International Business Machines Corporation | Summary frames in video |
US6961954B1 (en) * | 1997-10-27 | 2005-11-01 | The Mitre Corporation | Automated segmentation, information extraction, summarization, and presentation of broadcast news |
US6938073B1 (en) * | 1997-11-14 | 2005-08-30 | Yahoo! Inc. | Method and apparatus for re-formatting web pages |
US7257589B1 (en) * | 1997-12-22 | 2007-08-14 | Ricoh Company, Ltd. | Techniques for targeting information to users |
US7124093B1 (en) * | 1997-12-22 | 2006-10-17 | Ricoh Company, Ltd. | Method, system and computer code for content based web advertising |
US6665835B1 (en) * | 1997-12-23 | 2003-12-16 | Verizon Laboratories, Inc. | Real time media journaler with a timing event coordinator |
US20020002039A1 (en) | 1998-06-12 | 2002-01-03 | Safi Qureshey | Network-enabled audio device |
MXPA00008584A (en) * | 1998-03-04 | 2002-05-08 | United Video Properties Inc | Program guide system with targeted advertising. |
EP1060617B1 (en) * | 1998-03-04 | 2004-05-06 | United Video Properties Inc. | Program guide system with monitoring of advertisement usage and user activities |
US6564379B1 (en) | 1998-04-30 | 2003-05-13 | United Video Properties, Inc. | Program guide system with flip and browse advertisements |
US20020095676A1 (en) | 1998-05-15 | 2002-07-18 | Robert A. Knee | Interactive television program guide system for determining user values for demographic categories |
JP2000013777A (en) * | 1998-06-26 | 2000-01-14 | Matsushita Electric Ind Co Ltd | Video reproducing device and video storage device |
US6442755B1 (en) | 1998-07-07 | 2002-08-27 | United Video Properties, Inc. | Electronic program guide using markup language |
CN1867068A (en) | 1998-07-14 | 2006-11-22 | 联合视频制品公司 | Client-server based interactive television program guide system with remote server recording |
MX355835B (en) | 1998-07-17 | 2018-05-02 | Rovi Guides Inc | Interactive television program guide system having multiple devices within a household. |
AR020608A1 (en) | 1998-07-17 | 2002-05-22 | United Video Properties Inc | A METHOD AND A PROVISION TO SUPPLY A USER REMOTE ACCESS TO AN INTERACTIVE PROGRAMMING GUIDE BY A REMOTE ACCESS LINK |
US6505348B1 (en) | 1998-07-29 | 2003-01-07 | Starsight Telecast, Inc. | Multiple interactive electronic program guide system and methods |
US6714909B1 (en) * | 1998-08-13 | 2004-03-30 | At&T Corp. | System and method for automated multimedia content indexing and retrieval |
TW447221B (en) | 1998-08-26 | 2001-07-21 | United Video Properties Inc | Television message system |
TW463503B (en) | 1998-08-26 | 2001-11-11 | United Video Properties Inc | Television chat system |
JP3935276B2 (en) * | 1998-10-21 | 2007-06-20 | キヤノン株式会社 | Network device management method, apparatus, storage medium, and transmission apparatus |
JP2000156031A (en) * | 1998-11-17 | 2000-06-06 | Sony Corp | Information process system, information processor and information processing method |
US6859799B1 (en) * | 1998-11-30 | 2005-02-22 | Gemstar Development Corporation | Search engine for video and graphics |
US7082397B2 (en) * | 1998-12-01 | 2006-07-25 | Nuance Communications, Inc. | System for and method of creating and browsing a voice web |
US6243676B1 (en) * | 1998-12-23 | 2001-06-05 | Openwave Systems Inc. | Searching and retrieving multimedia information |
US6473778B1 (en) * | 1998-12-24 | 2002-10-29 | At&T Corporation | Generating hypermedia documents from transcriptions of television programs using parallel text alignment |
US20020080273A1 (en) * | 1999-01-06 | 2002-06-27 | Harrison Robert G. | Appliance with TV and INTERNET modes of operation |
KR100296967B1 (en) * | 1999-01-30 | 2001-09-26 | 구자홍 | Method for representing multi-level digest segment information in order to provide efficient multi-level digest streams of a multimedia stream and digest stream browsing/recording/editing system using multi-level digest segment information scheme. |
US7966078B2 (en) * | 1999-02-01 | 2011-06-21 | Steven Hoffberg | Network media appliance system and method |
US6236395B1 (en) * | 1999-02-01 | 2001-05-22 | Sharp Laboratories Of America, Inc. | Audiovisual information management system |
US6882709B1 (en) * | 1999-04-14 | 2005-04-19 | General Instrument Corporation | Enhanced broadband telephony services |
KR100624865B1 (en) * | 1999-06-02 | 2006-09-18 | 엘지전자 주식회사 | Video system based on user profile |
US6266094B1 (en) * | 1999-06-14 | 2001-07-24 | Medialink Worldwide Incorporated | Method and apparatus for the aggregation and selective retrieval of television closed caption word content originating from multiple geographic locations |
CN1359591A (en) | 1999-06-28 | 2002-07-17 | 英戴克系统公司 | System and method for utilizing EPG database for modifying advertisements |
AU5775900A (en) | 1999-06-29 | 2001-01-31 | United Video Properties, Inc. | Method and system for a video-on-demand-related interactive display within an interactive television application |
US7293280B1 (en) * | 1999-07-08 | 2007-11-06 | Microsoft Corporation | Skimming continuous multimedia content |
US7313808B1 (en) * | 1999-07-08 | 2007-12-25 | Microsoft Corporation | Browsing continuous multimedia content |
US6442518B1 (en) | 1999-07-14 | 2002-08-27 | Compaq Information Technologies Group, L.P. | Method for refining time alignments of closed captions |
US6845485B1 (en) * | 1999-07-15 | 2005-01-18 | Hotv, Inc. | Method and apparatus for indicating story-line changes by mining closed-caption-text |
US7075591B1 (en) * | 1999-09-22 | 2006-07-11 | Lg Electronics Inc. | Method of constructing information on associate meanings between segments of multimedia stream and method of browsing video using the same |
US6598074B1 (en) * | 1999-09-23 | 2003-07-22 | Rocket Network, Inc. | System and method for enabling multimedia production collaboration over a network |
US7155735B1 (en) | 1999-10-08 | 2006-12-26 | Vulcan Patents Llc | System and method for the broadcast dissemination of time-ordered data |
SG94350A1 (en) * | 1999-10-21 | 2003-02-18 | Matsushita Electric Ind Co Ltd | Control content transmission method and storage-based broadcasting system |
FR2800958A1 (en) | 1999-11-10 | 2001-05-11 | Thomson Multimedia Sa | PROCESS FOR TRANSMISSION AND PROCESSING OF SERVICE INFORMATION IN A TELEVISION SYSTEM, RECEIVER AND TRANSMITTER IN SUCH A SYSTEM |
US7412643B1 (en) | 1999-11-23 | 2008-08-12 | International Business Machines Corporation | Method and apparatus for linking representation and realization data |
US7779436B1 (en) | 1999-11-24 | 2010-08-17 | Jlb Ventures Llc | Method for using banner advertisements during commercial breaks |
EP1309901B1 (en) | 1999-12-02 | 2008-05-21 | Western Digital Technologies, Inc. | System for remote recording of television programs |
JP2001168923A (en) * | 1999-12-08 | 2001-06-22 | Toshiba Corp | Multimedia service system, multimedia conversion server, and multimedia terminal |
KR20020062961A (en) * | 1999-12-10 | 2002-07-31 | 유나이티드 비디오 프로퍼티즈, 인크. | Features for use with advanced set-top applications on interactive television systems |
US7542068B2 (en) * | 2000-01-13 | 2009-06-02 | Polycom, Inc. | Method and system for controlling multimedia video communication |
US6757682B1 (en) * | 2000-01-28 | 2004-06-29 | Interval Research Corporation | Alerting users to items of current interest |
US6868440B1 (en) | 2000-02-04 | 2005-03-15 | Microsoft Corporation | Multi-level skimming of multimedia content using playlists |
JP2001230994A (en) * | 2000-02-15 | 2001-08-24 | Fujitsu Ltd | Data processor |
US6891566B2 (en) * | 2000-03-14 | 2005-05-10 | Joseph Robert Marchese | Digital video system using networked cameras |
EP1275253A2 (en) | 2000-03-31 | 2003-01-15 | United Video Properties, Inc. | Systems and methods for improved audience measuring |
CN101493919B (en) | 2000-03-31 | 2019-01-04 | 乐威指南公司 | The system and method for meta-data-linked advertisements |
JP3810268B2 (en) * | 2000-04-07 | 2006-08-16 | シャープ株式会社 | Audio visual system |
US6505153B1 (en) | 2000-05-22 | 2003-01-07 | Compaq Information Technologies Group, L.P. | Efficient method for producing off-line closed captions |
US6829781B1 (en) * | 2000-05-24 | 2004-12-07 | At&T Corp. | Network-based service to provide on-demand video summaries of television programs |
EP1293094A1 (en) | 2000-05-25 | 2003-03-19 | Thomson Licensing S.A. | Device and method for synchronising broadcast audio-visual programmes and complementary data |
US8028314B1 (en) | 2000-05-26 | 2011-09-27 | Sharp Laboratories Of America, Inc. | Audiovisual information management system |
US20040080528A1 (en) * | 2000-06-21 | 2004-04-29 | Watchit.Com,Inc. | Systems and methods for presenting interactive programs over the internet |
JP2002044572A (en) | 2000-07-21 | 2002-02-08 | Sony Corp | Information signal processor, information signal processing method and information signal recorder |
WO2002011428A2 (en) * | 2000-07-28 | 2002-02-07 | Koninklijke Philips Electronics N.V. | Visualization and playback of television shows at a sub-show level |
US20020059629A1 (en) * | 2000-08-21 | 2002-05-16 | Markel Steven O. | Detection and recognition of data receiver to facilitate proper transmission of enhanced data |
AU2001288453B2 (en) * | 2000-08-25 | 2006-05-18 | Opentv, Inc. | Personalized remote control |
US20020065678A1 (en) * | 2000-08-25 | 2002-05-30 | Steven Peliotis | iSelect video |
US20020057286A1 (en) * | 2000-08-25 | 2002-05-16 | Markel Steven O. | Device independent video enhancement scripting language |
US7421729B2 (en) * | 2000-08-25 | 2008-09-02 | Intellocity Usa Inc. | Generation and insertion of indicators using an address signal applied to a database |
US7840691B1 (en) | 2000-09-07 | 2010-11-23 | Zamora Radio, Llc | Personal broadcast server system for providing a customized broadcast |
US8020183B2 (en) | 2000-09-14 | 2011-09-13 | Sharp Laboratories Of America, Inc. | Audiovisual management system |
US6993246B1 (en) | 2000-09-15 | 2006-01-31 | Hewlett-Packard Development Company, L.P. | Method and system for correlating data streams |
US20030037332A1 (en) * | 2000-09-20 | 2003-02-20 | Chapin Paul W. | System and method for storyboard interactive television advertisements |
US20020069405A1 (en) * | 2000-09-20 | 2002-06-06 | Chapin Paul W. | System and method for spokesperson interactive television advertisements |
US7349946B2 (en) * | 2000-10-02 | 2008-03-25 | Canon Kabushiki Kaisha | Information processing system |
CN101715109A (en) | 2000-10-11 | 2010-05-26 | 联合视频制品公司 | Systems and methods for providing storage of data on servers in an on-demand media delivery system |
GB2370188A (en) * | 2000-11-01 | 2002-06-19 | Orange Personal Comm Serv Ltd | Mixed-media telecommunication call set-up |
EP1380170A2 (en) * | 2000-11-14 | 2004-01-14 | Koninklijke Philips Electronics N.V. | Summarization and/or indexing of programs |
EP1346559A4 (en) * | 2000-11-16 | 2006-02-01 | Mydtv Inc | System and methods for determining the desirability of video programming events |
US8255791B2 (en) | 2000-11-29 | 2012-08-28 | Dov Koren | Collaborative, flexible, interactive real-time displays |
US6798912B2 (en) | 2000-12-18 | 2004-09-28 | Koninklijke Philips Electronics N.V. | Apparatus and method of program classification based on syntax of transcript information |
US20020083471A1 (en) * | 2000-12-21 | 2002-06-27 | Philips Electronics North America Corporation | System and method for providing a multimedia summary of a video program |
US20020083473A1 (en) * | 2000-12-21 | 2002-06-27 | Philips Electronics North America Corporation | System and method for accessing a multimedia summary of a video program |
US20030167465A1 (en) * | 2001-01-17 | 2003-09-04 | Davis T. Ron | Method and system for supplementing television programming with e-mailed magazines |
WO2002058393A1 (en) * | 2001-01-17 | 2002-07-25 | I-Request, Inc. | A method and system for supplementing television programming with e-mailed magazines |
US20020100039A1 (en) * | 2001-01-19 | 2002-07-25 | Nicholas Iatropoulos | Media interactivity method and architecture |
US6788196B2 (en) * | 2001-01-26 | 2004-09-07 | Komatsu Ltd. | Display controller for switching display device of vehicle between monitor display and trouble display |
US6870469B2 (en) * | 2001-01-26 | 2005-03-22 | Komatsu Ltd. | Display controller for display device of vehicle |
EP1364533A1 (en) * | 2001-02-20 | 2003-11-26 | Intellocity USA, Inc. | Content based video selection |
US8103737B2 (en) * | 2001-03-07 | 2012-01-24 | International Business Machines Corporation | System and method for previewing hyperlinks with ‘flashback’ images |
US20020129383A1 (en) * | 2001-03-08 | 2002-09-12 | Wasilewski Louise Mary | Apparatus for a cosumer controlled selective recording device for interactive television |
US6903782B2 (en) * | 2001-03-28 | 2005-06-07 | Koninklijke Philips Electronics N.V. | System and method for performing segmentation-based enhancements of a video image |
US7904814B2 (en) * | 2001-04-19 | 2011-03-08 | Sharp Laboratories Of America, Inc. | System for presenting audio-video content |
CA2386303C (en) | 2001-05-14 | 2005-07-05 | At&T Corp. | Method for content-based non-linear control of multimedia playback |
US7016968B2 (en) * | 2001-06-22 | 2006-03-21 | International Business Machines Corporation | Method and apparatus for facilitating the providing of content |
US7296231B2 (en) * | 2001-08-09 | 2007-11-13 | Eastman Kodak Company | Video structuring by probabilistic merging of video segments |
US20030081249A1 (en) * | 2001-08-21 | 2003-05-01 | Yesvideo, Inc. | Easy printing of visual images extracted from a collection of visual images |
US20070226763A1 (en) * | 2001-08-24 | 2007-09-27 | Hempleman James D | System And Method Of Provising User Specified Information And Advertising |
US20030206710A1 (en) * | 2001-09-14 | 2003-11-06 | Ferman Ahmet Mufit | Audiovisual management system |
US7474698B2 (en) | 2001-10-19 | 2009-01-06 | Sharp Laboratories Of America, Inc. | Identification of replay segments |
US20030105794A1 (en) * | 2001-11-09 | 2003-06-05 | Jasinschi Radu S. | Systems for sensing similarity in monitored broadcast content streams and methods of operating the same |
US7095947B2 (en) * | 2001-11-13 | 2006-08-22 | Koninklijke Philips Electronics N.V. | System for synchronizing the playback of two or more connected playback devices using closed captioning |
US7610358B2 (en) * | 2001-11-26 | 2009-10-27 | Time Warner Cable | System and method for effectively presenting multimedia information materials |
KR100411437B1 (en) * | 2001-12-28 | 2003-12-18 | 엘지전자 주식회사 | Intelligent news video browsing system |
US20030131362A1 (en) * | 2002-01-09 | 2003-07-10 | Koninklijke Philips Electronics N.V. | Method and apparatus for multimodal story segmentation for linking multimedia content |
US20030142129A1 (en) * | 2002-01-31 | 2003-07-31 | Kleven Michael L. | Content processing and distribution systems and processes |
US20030154481A1 (en) * | 2002-02-11 | 2003-08-14 | Andersen David B. | Identification of programming having supplementary content |
US7286651B1 (en) * | 2002-02-12 | 2007-10-23 | Sprint Spectrum L.P. | Method and system for multi-modal interaction |
FR2836321B1 (en) * | 2002-02-18 | 2006-02-24 | Cit Alcatel | SELECTIVE RECEIVER OF INFORMATION ELEMENTS |
US20030159153A1 (en) * | 2002-02-20 | 2003-08-21 | General Instrument Corporation | Method and apparatus for processing ATVEF data to control the display of text and images |
US8453189B2 (en) * | 2002-02-25 | 2013-05-28 | Koninklijke Philips Electronics N.V. | Method and system for retrieving information about television programs |
US20030167174A1 (en) * | 2002-03-01 | 2003-09-04 | Koninlijke Philips Electronics N.V. | Automatic audio recorder-player and operating method therefor |
JP2003257159A (en) * | 2002-03-05 | 2003-09-12 | Sanyo Electric Co Ltd | Information editing device, information editing method, program for editing information and information recording medium |
US8214741B2 (en) | 2002-03-19 | 2012-07-03 | Sharp Laboratories Of America, Inc. | Synchronization of video and data |
US7668901B2 (en) * | 2002-04-15 | 2010-02-23 | Avid Technology, Inc. | Methods and system using a local proxy server to process media data for local area users |
US7155674B2 (en) | 2002-04-29 | 2006-12-26 | Seachange International, Inc. | Accessing television services |
US20040024780A1 (en) * | 2002-08-01 | 2004-02-05 | Koninklijke Philips Electronics N.V. | Method, system and program product for generating a content-based table of contents |
US7657907B2 (en) | 2002-09-30 | 2010-02-02 | Sharp Laboratories Of America, Inc. | Automatic user profiling |
US7539086B2 (en) * | 2002-10-23 | 2009-05-26 | J2 Global Communications, Inc. | System and method for the secure, real-time, high accuracy conversion of general-quality speech into text |
US7716312B2 (en) | 2002-11-13 | 2010-05-11 | Avid Technology, Inc. | Method and system for transferring large data files over parallel connections |
WO2004055630A2 (en) * | 2002-12-12 | 2004-07-01 | Scientific-Atlanta, Inc. | Data enhanced multi-media system for a headend |
US7895337B2 (en) * | 2002-12-26 | 2011-02-22 | Oracle International Corporation | Systems and methods of generating a content aware interface |
US7725544B2 (en) | 2003-01-24 | 2010-05-25 | Aol Inc. | Group based spam classification |
US7493646B2 (en) | 2003-01-30 | 2009-02-17 | United Video Properties, Inc. | Interactive television systems with digital video recording and adjustable reminders |
KR20050106097A (en) * | 2003-03-07 | 2005-11-08 | 닛본 덴끼 가부시끼가이샤 | Scroll display control |
US20040197088A1 (en) * | 2003-03-31 | 2004-10-07 | Ferman Ahmet Mufit | System for presenting audio-video content |
US8019656B2 (en) * | 2003-05-07 | 2011-09-13 | Cbs Interactive Inc. | System and method for generating an alternative product recommendation |
US7840448B2 (en) * | 2003-05-07 | 2010-11-23 | Cbs Interactive Inc. | System and method for automatically generating a narrative product summary |
US7590695B2 (en) | 2003-05-09 | 2009-09-15 | Aol Llc | Managing electronic messages |
US7505969B2 (en) * | 2003-08-05 | 2009-03-17 | Cbs Interactive, Inc. | Product placement engine and method |
US7788696B2 (en) * | 2003-10-15 | 2010-08-31 | Microsoft Corporation | Inferring information about media stream objects |
US7984468B2 (en) | 2003-11-06 | 2011-07-19 | United Video Properties, Inc. | Systems and methods for providing program suggestions in an interactive television program guide |
US8024755B2 (en) * | 2003-11-17 | 2011-09-20 | Sony Corporation | Interactive program guide with preferred items list apparatus and method |
US20050108752A1 (en) * | 2003-11-17 | 2005-05-19 | Sony Corporation, A Japanese Corporation | 3-Dimensional browsing and selection apparatus and method |
US20050108750A1 (en) * | 2003-11-17 | 2005-05-19 | Sony Corporation, A Japanese Corporation | Candidate data selection and display apparatus and method |
US20050108755A1 (en) * | 2003-11-17 | 2005-05-19 | Sony Corporation, A Japanese Corporation | Multi-source programming guide apparatus and method |
US20050108749A1 (en) * | 2003-11-17 | 2005-05-19 | Sony Corporation, A Japanese Corporation | Automatic content display apparatus and method |
EP1538536A1 (en) * | 2003-12-05 | 2005-06-08 | Sony International (Europe) GmbH | Visualization and control techniques for multimedia digital content |
US7293066B1 (en) | 2004-01-21 | 2007-11-06 | Cisco Technology, Inc. | Methods and apparatus supporting access to stored data |
US8949899B2 (en) | 2005-03-04 | 2015-02-03 | Sharp Laboratories Of America, Inc. | Collaborative recommendation system |
US8356317B2 (en) | 2004-03-04 | 2013-01-15 | Sharp Laboratories Of America, Inc. | Presence based technology |
US7594245B2 (en) * | 2004-03-04 | 2009-09-22 | Sharp Laboratories Of America, Inc. | Networked video devices |
US20050219366A1 (en) * | 2004-03-31 | 2005-10-06 | Hollowbush Richard R | Digital audio-video differential delay and channel analyzer |
US20050234961A1 (en) * | 2004-04-16 | 2005-10-20 | Pinnacle Systems, Inc. | Systems and Methods for providing a proxy for a shared file system |
US7836389B2 (en) * | 2004-04-16 | 2010-11-16 | Avid Technology, Inc. | Editing system for audiovisual works and corresponding text for television news |
EP1743258A1 (en) * | 2004-04-23 | 2007-01-17 | Koninklijke Philips Electronics N.V. | Method and apparatus to catch up with a running broadcast or stored content |
US20060053470A1 (en) * | 2004-04-30 | 2006-03-09 | Vulcan Inc. | Management and non-linear presentation of augmented broadcasted or streamed multimedia content |
US7818444B2 (en) | 2004-04-30 | 2010-10-19 | Move Networks, Inc. | Apparatus, system, and method for multi-bitrate content streaming |
US20060031885A1 (en) * | 2004-04-30 | 2006-02-09 | Vulcan Inc. | Management and non-linear presentation of music-related broadcasted or streamed multimedia content |
US20060031916A1 (en) * | 2004-04-30 | 2006-02-09 | Vulcan Inc. | Management and non-linear presentation of broadcasted or streamed multimedia content |
US20060031879A1 (en) * | 2004-04-30 | 2006-02-09 | Vulcan Inc. | Management and non-linear presentation of news-related broadcasted or streamed multimedia content |
US8028323B2 (en) | 2004-05-05 | 2011-09-27 | Dryden Enterprises, Llc | Method and system for employing a first device to direct a networked audio device to obtain a media item |
US7386542B2 (en) * | 2004-08-30 | 2008-06-10 | The Mitre Corporation | Personalized broadcast news navigator |
KR100704620B1 (en) * | 2004-09-07 | 2007-04-10 | 삼성전자주식회사 | Digital broadcasting receiving device and video on demand receiving method using digital broadcasting receiving device |
WO2006035450A1 (en) * | 2004-09-29 | 2006-04-06 | Hewlett-Packard Development Company L.P. | Systems and methods for soliciting feedback using print-augmented broadcast signal |
US9021520B2 (en) * | 2004-09-29 | 2015-04-28 | Hewlett-Packard Development Company, L.P. | Systems and methods for providing and processing print-augmented broadcast signals |
US8806533B1 (en) | 2004-10-08 | 2014-08-12 | United Video Properties, Inc. | System and method for using television information codes |
US20060088145A1 (en) * | 2004-10-27 | 2006-04-27 | Bellsouth Intellectual Property Corporation | Methods and systems for an interactive communications directory and directory channel |
US7286955B1 (en) * | 2004-11-01 | 2007-10-23 | Laser Atlanta, Llc | Ship proximity measurement display system including the remotely viewable billboard and bridge display with data communication and attachment framing |
JP2006165824A (en) * | 2004-12-03 | 2006-06-22 | Fuji Xerox Co Ltd | Image display program, image display method and image display device |
US20060173916A1 (en) * | 2004-12-22 | 2006-08-03 | Verbeck Sibley Timothy J R | Method and system for automatically generating a personalized sequence of rich media |
JP4981026B2 (en) * | 2005-03-31 | 2012-07-18 | コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ | Composite news story synthesis |
US8234679B2 (en) | 2005-04-01 | 2012-07-31 | Time Warner Cable, Inc. | Technique for selecting multiple entertainment programs to be provided over a communication network |
WO2006109260A2 (en) * | 2005-04-15 | 2006-10-19 | Koninklijke Philips Electronics N.V. | Method and device for searching a video movie at a variable speed using an additional file containing screen shots |
WO2006114797A1 (en) * | 2005-04-25 | 2006-11-02 | Hewlett-Packard Development Company, L.P. | Systems and methods for providing summaries on segments of broadcast program not rendered to audience |
US7788266B2 (en) | 2005-08-26 | 2010-08-31 | Veveo, Inc. | Method and system for processing ambiguous, multi-term search queries |
JP2007081594A (en) * | 2005-09-13 | 2007-03-29 | Sony Corp | Imaging apparatus and recording method |
US9083564B2 (en) * | 2005-10-13 | 2015-07-14 | At&T Intellectual Property I, L.P. | System and method of delivering notifications |
US7697827B2 (en) | 2005-10-17 | 2010-04-13 | Konicek Jeffrey C | User-friendlier interfaces for a camera |
US8856118B2 (en) * | 2005-10-31 | 2014-10-07 | Qwest Communications International Inc. | Creation and transmission of rich content media |
US7840898B2 (en) * | 2005-11-01 | 2010-11-23 | Microsoft Corporation | Video booklet |
US20070101394A1 (en) * | 2005-11-01 | 2007-05-03 | Yesvideo, Inc. | Indexing a recording of audiovisual content to enable rich navigation |
US20070101264A1 (en) * | 2005-11-01 | 2007-05-03 | Microsoft Corporation | Position-and length-sensitive video timeline behavior |
US9113107B2 (en) | 2005-11-08 | 2015-08-18 | Rovi Guides, Inc. | Interactive advertising and program promotion in an interactive television system |
US20070139189A1 (en) * | 2005-12-05 | 2007-06-21 | Helmig Kevin S | Multi-platform monitoring system and method |
US8613024B2 (en) | 2005-12-13 | 2013-12-17 | United Video Properties, Inc. | Cross-platform predictive popularity ratings for use in interactive television applications |
US20070156521A1 (en) | 2005-12-29 | 2007-07-05 | United Video Properties, Inc. | Systems and methods for commerce in media program related merchandise |
WO2008108750A1 (en) * | 2006-02-06 | 2008-09-12 | Cnet Networks, Inc. | Controllable automated generator of optimized allied product content |
US8689253B2 (en) | 2006-03-03 | 2014-04-01 | Sharp Laboratories Of America, Inc. | Method and system for configuring media-playing sets |
US7890849B2 (en) * | 2006-09-15 | 2011-02-15 | Microsoft Corporation | Concurrent presentation of media and related content lists |
US8832742B2 (en) | 2006-10-06 | 2014-09-09 | United Video Properties, Inc. | Systems and methods for acquiring, categorizing and delivering media in interactive media guidance applications |
US20080181513A1 (en) * | 2007-01-31 | 2008-07-31 | John Almeida | Method, apparatus and algorithm for indexing, searching, retrieval of digital stream by the use of summed partitions |
US20080201158A1 (en) | 2007-02-15 | 2008-08-21 | Johnson Mark D | System and method for visitation management in a controlled-access environment |
US8542802B2 (en) | 2007-02-15 | 2013-09-24 | Global Tel*Link Corporation | System and method for three-way call detection |
WO2008106365A1 (en) * | 2007-02-22 | 2008-09-04 | Nexidia Inc. | Accessing multimedia |
JP2008219343A (en) * | 2007-03-02 | 2008-09-18 | Sony Corp | Information processor and method, and program |
US8312492B2 (en) | 2007-03-19 | 2012-11-13 | At&T Intellectual Property I, L.P. | Systems and methods of providing modified media content |
US8418206B2 (en) | 2007-03-22 | 2013-04-09 | United Video Properties, Inc. | User defined rules for assigning destinations of content |
US8503523B2 (en) * | 2007-06-29 | 2013-08-06 | Microsoft Corporation | Forming a representation of a video item and use thereof |
CA2597200A1 (en) * | 2007-08-13 | 2009-02-13 | Semiconductor Insights Inc. | Method and apparatus for organizing claim elements |
US8943539B2 (en) | 2007-11-21 | 2015-01-27 | Rovi Guides, Inc. | Enabling a friend to remotely modify user data |
US20090150784A1 (en) * | 2007-12-07 | 2009-06-11 | Microsoft Corporation | User interface for previewing video items |
US9015147B2 (en) | 2007-12-20 | 2015-04-21 | Porto Technology, Llc | System and method for generating dynamically filtered content results, including for audio and/or video channels |
US8117193B2 (en) * | 2007-12-21 | 2012-02-14 | Lemi Technology, Llc | Tunersphere |
US8316015B2 (en) | 2007-12-21 | 2012-11-20 | Lemi Technology, Llc | Tunersphere |
TW200937313A (en) * | 2008-02-18 | 2009-09-01 | Univ Nat Chiao Tung | Method and system for increasing license plate detection efficiency in successively inputting image |
US20090249406A1 (en) * | 2008-03-31 | 2009-10-01 | Broadcom Corporation | Mobile video device with enhanced video navigation |
AU2008354332B2 (en) * | 2008-04-09 | 2013-04-18 | The Nielsen Company (Us), Llc | Methods and apparatus to play and control playing of media content in a web page |
US9639531B2 (en) | 2008-04-09 | 2017-05-02 | The Nielsen Company (Us), Llc | Methods and apparatus to play and control playing of media in a web page |
KR101478620B1 (en) | 2008-04-22 | 2015-01-05 | 삼성전자주식회사 | Method and apparatus for segmenting recorded news program according to articles |
US8601526B2 (en) | 2008-06-13 | 2013-12-03 | United Video Properties, Inc. | Systems and methods for displaying media content and media guidance information |
US8775454B2 (en) | 2008-07-29 | 2014-07-08 | James L. Geer | Phone assisted ‘photographic memory’ |
US9128981B1 (en) | 2008-07-29 | 2015-09-08 | James L. Geer | Phone assisted ‘photographic memory’ |
US20100104258A1 (en) * | 2008-10-28 | 2010-04-29 | International Business Machines Corporation | User-specified event-based remote triggering of a consumer digital video recording device |
US8593570B2 (en) | 2008-11-07 | 2013-11-26 | Looxcie, Inc. | Video recording camera headset |
US8526779B2 (en) | 2008-11-07 | 2013-09-03 | Looxcie, Inc. | Creating and editing video recorded by a hands-free video recording device |
US10063934B2 (en) | 2008-11-25 | 2018-08-28 | Rovi Technologies Corporation | Reducing unicast session duration with restart TV |
US8494899B2 (en) | 2008-12-02 | 2013-07-23 | Lemi Technology, Llc | Dynamic talk radio program scheduling |
US9390167B2 (en) | 2010-07-29 | 2016-07-12 | Soundhound, Inc. | System and methods for continuous audio matching |
US20100185616A1 (en) * | 2009-01-14 | 2010-07-22 | Cbs Interactive, Inc. | Systems and methods for predictive recommendations |
US9225838B2 (en) | 2009-02-12 | 2015-12-29 | Value-Added Communications, Inc. | System and method for detecting three-way call circumvention attempts |
US8630726B2 (en) * | 2009-02-12 | 2014-01-14 | Value-Added Communications, Inc. | System and method for detecting three-way call circumvention attempts |
US7657337B1 (en) | 2009-04-29 | 2010-02-02 | Lemi Technology, Llc | Skip feature for a broadcast or multicast media station |
US8806047B2 (en) | 2009-04-29 | 2014-08-12 | Lemi Technology, Llc | Skip feature for a broadcast or multicast media station |
US20100293072A1 (en) * | 2009-05-13 | 2010-11-18 | David Murrant | Preserving the Integrity of Segments of Audio Streams |
US8799253B2 (en) * | 2009-06-26 | 2014-08-05 | Microsoft Corporation | Presenting an assembled sequence of preview videos |
US8391673B2 (en) * | 2009-06-26 | 2013-03-05 | Intel Corporation | Method, system, and apparatus to derive content related to a multimedia stream and dynamically combine and display the stream with the related content |
US8359616B2 (en) | 2009-09-30 | 2013-01-22 | United Video Properties, Inc. | Systems and methods for automatically generating advertisements using a media guidance application |
KR101164353B1 (en) * | 2009-10-23 | 2012-07-09 | 삼성전자주식회사 | Method and apparatus for browsing and executing media contents |
US20110106594A1 (en) * | 2009-11-05 | 2011-05-05 | Cbs Interactive, Inc. | Expandable product feature and relation comparison system |
CN102063414A (en) * | 2009-11-13 | 2011-05-18 | 新奥特(北京)视频技术有限公司 | Method and device for positioning file contents |
US8645134B1 (en) * | 2009-11-18 | 2014-02-04 | Google Inc. | Generation of timed text using speech-to-text technology and applications thereof |
KR20110062982A (en) * | 2009-12-04 | 2011-06-10 | 삼성전자주식회사 | Method and apparatus for generating program summary information of broadcasting content on real-time, providing method thereof, and broadcasting receiver |
US20110161153A1 (en) * | 2009-12-30 | 2011-06-30 | Cbs Interactive Inc. | Method and system for recommending assets based on recently viewed assets basket |
US9361130B2 (en) * | 2010-05-03 | 2016-06-07 | Apple Inc. | Systems, methods, and computer program products providing an integrated user interface for reading content |
US9122701B2 (en) | 2010-05-13 | 2015-09-01 | Rovi Guides, Inc. | Systems and methods for providing media content listings according to points of interest |
US9204193B2 (en) | 2010-05-14 | 2015-12-01 | Rovi Guides, Inc. | Systems and methods for media detection and filtering using a parental control logging application |
US8739019B1 (en) | 2011-07-01 | 2014-05-27 | Joel Nevins | Computer-implemented methods and computer program products for integrating and synchronizing multimedia content, including content displayed via interactive televisions, smartphones, electronic book readers, holographic imagery projectors, and other computerized devices |
USRE47059E1 (en) | 2010-07-24 | 2018-09-25 | Joel Nevins | Computer-implemented methods and computer program products for integrating and synchronizing multimedia content, including content displayed via interactive televisions, smartphones, electronic book readers, holographic imagery projectors, and other computerized devices |
US9047371B2 (en) | 2010-07-29 | 2015-06-02 | Soundhound, Inc. | System and method for matching a query against a broadcast stream |
US9699503B2 (en) | 2010-09-07 | 2017-07-04 | Opentv, Inc. | Smart playlist |
US10210160B2 (en) | 2010-09-07 | 2019-02-19 | Opentv, Inc. | Collecting data from different sources |
US8949871B2 (en) | 2010-09-08 | 2015-02-03 | Opentv, Inc. | Smart media selection based on viewer user presence |
US9134873B2 (en) * | 2010-09-28 | 2015-09-15 | Qualcomm Incorporated | Apparatus and methods for presenting interaction information |
US8682740B2 (en) | 2010-10-26 | 2014-03-25 | Cbs Interactive Inc. | Systems and methods using a manufacturer line, series, model hierarchy |
BRPI1003568A2 (en) | 2010-10-29 | 2012-06-12 | Log On Multimidia Ltda | Dynamic audiovisual browser and method |
US20120158527A1 (en) * | 2010-12-21 | 2012-06-21 | Class6Ix, Llc | Systems, Methods and/or Computer Readable Storage Media Facilitating Aggregation and/or Personalized Sequencing of News Video Content |
US20120272171A1 (en) * | 2011-04-21 | 2012-10-25 | Panasonic Corporation | Apparatus, Method and Computer-Implemented Program for Editable Categorization |
US9035163B1 (en) | 2011-05-10 | 2015-05-19 | Soundhound, Inc. | System and method for targeting content based on identified audio and multimedia |
US8737803B2 (en) * | 2011-05-27 | 2014-05-27 | Looxcie, Inc. | Method and apparatus for storing and streaming audiovisual content |
US20120306930A1 (en) * | 2011-06-05 | 2012-12-06 | Apple Inc. | Techniques for zooming in and out with dynamic content |
US20130036442A1 (en) * | 2011-08-05 | 2013-02-07 | Qualcomm Incorporated | System and method for visual selection of elements in video content |
US9069743B2 (en) | 2011-10-13 | 2015-06-30 | Microsoft Technology Licensing, Llc | Application of comments in multiple application functionality content |
US9176933B2 (en) * | 2011-10-13 | 2015-11-03 | Microsoft Technology Licensing, Llc | Application of multiple content items and functionality to an electronic content item |
US8805418B2 (en) | 2011-12-23 | 2014-08-12 | United Video Properties, Inc. | Methods and systems for performing actions based on location-based rules |
US10592596B2 (en) | 2011-12-28 | 2020-03-17 | Cbs Interactive Inc. | Techniques for providing a narrative summary for fantasy games |
US10540430B2 (en) | 2011-12-28 | 2020-01-21 | Cbs Interactive Inc. | Techniques for providing a natural language narrative |
KR101906150B1 (en) * | 2011-12-30 | 2018-10-11 | 삼성전자 주식회사 | Display apparatus and image displaying method |
KR101902320B1 (en) | 2011-12-30 | 2018-10-02 | 삼성전자 주식회사 | Display apparatus, external peripheral device connectable thereof and image displaying method |
US20130205213A1 (en) * | 2012-02-06 | 2013-08-08 | edX Inc. | Caption-based navigation for a video player |
CA2878554A1 (en) * | 2012-06-18 | 2013-12-27 | Matthias Rath | Integrated interactive system and method for visualizing human physiology, disease, treatment options and use |
US10957310B1 (en) | 2012-07-23 | 2021-03-23 | Soundhound, Inc. | Integrated programming framework for speech and text understanding with meaning parsing |
US8821271B2 (en) | 2012-07-30 | 2014-09-02 | Cbs Interactive, Inc. | Techniques for providing narrative content for competitive gaming events |
CN103634362A (en) * | 2012-08-28 | 2014-03-12 | 金蝶软件(中国)有限公司 | File transfer method, file server and file transfer system |
CN102917265A (en) * | 2012-10-25 | 2013-02-06 | 深圳创维-Rgb电子有限公司 | Information browsing method and system based on network television |
CN107274653B (en) | 2012-11-20 | 2019-07-09 | 华为终端有限公司 | The key value information processing method and control equipment, remote controler of remote controler |
KR101289085B1 (en) * | 2012-12-12 | 2013-07-30 | 오드컨셉 주식회사 | Images searching system based on object and method thereof |
CN103914491B (en) * | 2013-01-09 | 2017-11-17 | 腾讯科技(北京)有限公司 | To the data digging method and system of high-quality user-generated content |
US20140195334A1 (en) | 2013-01-10 | 2014-07-10 | United Video Properties, Inc. | Systems and methods for optimizing data driven media placement |
US9848276B2 (en) | 2013-03-11 | 2017-12-19 | Rovi Guides, Inc. | Systems and methods for auto-configuring a user equipment device with content consumption material |
US20140281980A1 (en) | 2013-03-15 | 2014-09-18 | Chad A. Hage | Methods and Apparatus to Identify a Type of Media Presented by a Media Player |
US9743124B2 (en) | 2013-09-12 | 2017-08-22 | Wideorbit Inc. | Systems and methods to deliver a personalized mediacast with an uninterrupted lead-in portion |
US9507849B2 (en) | 2013-11-28 | 2016-11-29 | Soundhound, Inc. | Method for combining a query and a communication command in a natural language computer system |
US9292488B2 (en) | 2014-02-01 | 2016-03-22 | Soundhound, Inc. | Method for embedding voice mail in a spoken utterance using a natural language processing computer system |
US11295730B1 (en) | 2014-02-27 | 2022-04-05 | Soundhound, Inc. | Using phonetic variants in a local context to improve natural language understanding |
US9564123B1 (en) | 2014-05-12 | 2017-02-07 | Soundhound, Inc. | Method and system for building an integrated user profile |
US10986379B2 (en) * | 2015-06-08 | 2021-04-20 | Wideorbit Llc | Content management and provisioning system |
US20170004859A1 (en) * | 2015-06-30 | 2017-01-05 | Coursera, Inc. | User created textbook |
US10867256B2 (en) * | 2015-07-17 | 2020-12-15 | Knoema Corporation | Method and system to provide related data |
US10108907B2 (en) * | 2015-07-17 | 2018-10-23 | Knoema Corporation | Method and system to provide related data |
US10055489B2 (en) * | 2016-02-08 | 2018-08-21 | Ebay Inc. | System and method for content-based media analysis |
US10572961B2 (en) | 2016-03-15 | 2020-02-25 | Global Tel*Link Corporation | Detection and prevention of inmate to inmate message relay |
US9609121B1 (en) | 2016-04-07 | 2017-03-28 | Global Tel*Link Corporation | System and method for third party monitoring of voice and video calls |
US10091549B1 (en) | 2017-03-30 | 2018-10-02 | Rovi Guides, Inc. | Methods and systems for recommending media assets based on the geographic location at which the media assets are frequently consumed |
US10027797B1 (en) | 2017-05-10 | 2018-07-17 | Global Tel*Link Corporation | Alarm control for inmate call monitoring |
US10225396B2 (en) | 2017-05-18 | 2019-03-05 | Global Tel*Link Corporation | Third party monitoring of a activity within a monitoring platform |
US10860786B2 (en) | 2017-06-01 | 2020-12-08 | Global Tel*Link Corporation | System and method for analyzing and investigating communication data from a controlled environment |
US9930088B1 (en) | 2017-06-22 | 2018-03-27 | Global Tel*Link Corporation | Utilizing VoIP codec negotiation during a controlled environment call |
US20200237622A1 (en) * | 2017-10-16 | 2020-07-30 | Eric Campos | Chambered dispensing devices and methods |
US10733984B2 (en) | 2018-05-07 | 2020-08-04 | Google Llc | Multi-modal interface in a voice-activated network |
CN109151543A (en) * | 2018-07-27 | 2019-01-04 | 北京优酷科技有限公司 | Playing frame, display methods, device and the storage medium of media content |
WO2020148875A1 (en) * | 2019-01-17 | 2020-07-23 | オリンパス株式会社 | Centralized control device and centralized control system |
US11270061B2 (en) * | 2020-02-25 | 2022-03-08 | International Business Machines Corporation | Automatic generation of training data for scientific paper summarization using videos |
CN111460180B (en) * | 2020-03-30 | 2024-03-15 | 维沃移动通信有限公司 | Information display method, information display device, electronic equipment and storage medium |
US11829710B2 (en) | 2022-01-25 | 2023-11-28 | Adobe Inc. | Deriving global intent from a composite document to facilitate editing of the composite document |
US20230359325A1 (en) * | 2022-05-05 | 2023-11-09 | Adobe Inc. | User interface for editing of a composite document through intelligently zoomed previews |
Citations (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5109482A (en) * | 1989-01-11 | 1992-04-28 | David Bohrman | Interactive video control system for displaying user-selectable clips |
US5537530A (en) * | 1992-08-12 | 1996-07-16 | International Business Machines Corporation | Video editing by locating segment boundaries and reordering segment sequences |
US5613909A (en) * | 1994-07-21 | 1997-03-25 | Stelovsky; Jan | Time-segmented multimedia game playing and authoring system |
US5635982A (en) * | 1994-06-27 | 1997-06-03 | Zhang; Hong J. | System for automatic video segmentation and key frame extraction for video sequences having both sharp and gradual transitions |
US5664227A (en) * | 1994-10-14 | 1997-09-02 | Carnegie Mellon University | System and method for skimming digital audio/video data |
US5703655A (en) * | 1995-03-24 | 1997-12-30 | U S West Technologies, Inc. | Video programming retrieval using extracted closed caption data which has been partitioned and stored to facilitate a search and retrieval process |
US5758181A (en) * | 1996-01-22 | 1998-05-26 | International Business Machines Corporation | Method and system for accelerated presentation of segmented data |
US5818439A (en) * | 1995-02-20 | 1998-10-06 | Hitachi, Ltd. | Video viewing assisting method and a video playback system therefor |
US5835667A (en) * | 1994-10-14 | 1998-11-10 | Carnegie Mellon University | Method and apparatus for creating a searchable digital video library and a system and method of using such a library |
US6263507B1 (en) * | 1996-12-05 | 2001-07-17 | Interval Research Corporation | Browser for use in navigating a body of information, with particular application to browsing information represented by audiovisual data |
Family Cites Families (275)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US2583652A (en) * | 1949-12-15 | 1952-01-29 | Keagy Nellie Wannetta | Bunion cushion |
US3884403A (en) | 1973-12-07 | 1975-05-20 | Robert A Brewer | Article carrying strap |
US3919475A (en) | 1974-10-09 | 1975-11-11 | Honeywell Inc | Head attached television |
US4033335A (en) | 1975-06-12 | 1977-07-05 | Wisconsin Alumni Research Foundation | Method and apparatus for multiplexing of physiological sensor signals with gamma ray camera data signals |
US4051534A (en) | 1976-10-27 | 1977-09-27 | Honeywell Inc. | Head attached television |
US4131919A (en) | 1977-05-20 | 1978-12-26 | Eastman Kodak Company | Electronic still camera |
US4260229A (en) | 1978-01-23 | 1981-04-07 | Bloomstein Richard W | Creating visual images of lip movements |
JPS54114920A (en) | 1978-02-28 | 1979-09-07 | Kokusai Denshin Denwa Co Ltd | Television signal adaptive forecasting encoding system |
US4782401A (en) | 1979-05-11 | 1988-11-01 | Nelson A. Faerber | Editing method and apparatus for commercials during video recording |
FR2462728A1 (en) | 1979-07-30 | 1981-02-13 | Moviecam Kinematograph | CAMERA |
US4390904A (en) | 1979-09-20 | 1983-06-28 | Shelton Video Editors, Inc. | Automatic circuit and method for editing commercial messages from television signals |
US4283735A (en) | 1979-09-21 | 1981-08-11 | David Jagger | Method and apparatus for selectively deleting during video tape recording |
US4319286A (en) | 1980-01-07 | 1982-03-09 | Muntz Electronics, Inc. | System for detecting fades in television signals to delete commercials from recorded television broadcasts |
US4750052A (en) | 1981-02-13 | 1988-06-07 | Zenith Electronics Corporation | Apparatus and method for automatically deleting selected program intervals from recorded television broadcasts |
JPH0642740B2 (en) | 1981-05-12 | 1994-06-01 | 富士写真フイルム株式会社 | Image recording / reproducing device |
US4965825A (en) | 1981-11-03 | 1990-10-23 | The Personalized Mass Media Corporation | Signal processing apparatus and methods |
WO1983002208A1 (en) | 1981-12-19 | 1983-06-23 | Frederick William Chard | Method and apparatus for editing the output of a television set |
US5105285A (en) | 1982-03-19 | 1992-04-14 | Canon Kabushiki Kaisha | Image transmission system |
US4605973A (en) | 1982-08-23 | 1986-08-12 | Kohorn H Von | System, apparatus and method for recording and editing broadcast transmissions |
US4520404A (en) | 1982-08-23 | 1985-05-28 | Kohorn H Von | System, apparatus and method for recording and editing broadcast transmissions |
US4574354A (en) | 1982-11-19 | 1986-03-04 | Tektronix, Inc. | Method and apparatus for time-aligning data |
US4446997A (en) | 1983-01-26 | 1984-05-08 | Elliot Himberg | Convertible camera-supporting belt device |
US4527201A (en) | 1983-03-29 | 1985-07-02 | Panavision, Inc. | Zoom indicating apparatus for video camera or the like |
US4618895A (en) | 1983-08-31 | 1986-10-21 | Wright Bruce R | Video editing system |
US4526308A (en) | 1984-01-09 | 1985-07-02 | Dovey Dennis J | Camera support |
US4750053A (en) | 1984-02-02 | 1988-06-07 | Broadcast Advertisers Reports, Inc. | Method and system for enabling television commerical monitoring using a marking signal superimposed over an audio signal |
JPS60250784A (en) | 1984-05-28 | 1985-12-11 | Fuji Photo Optical Co Ltd | Electronic camera |
US4602297A (en) | 1985-01-22 | 1986-07-22 | Morris Reese | System for editing commercial messages from recorded television broadcasts |
US4600281A (en) | 1985-03-29 | 1986-07-15 | Bloomstein Richard W | Altering facial displays in cinematic works |
US4777537A (en) | 1985-10-21 | 1988-10-11 | Sony Corporation | Signal recording apparatus and method |
GB8528143D0 (en) | 1985-11-14 | 1985-12-18 | British Telecomm | Image encoding & synthesis |
JPS62171267U (en) | 1986-04-18 | 1987-10-30 | | |
US4739398A (en) | 1986-05-02 | 1988-04-19 | Control Data Corporation | Method, apparatus and system for recognizing broadcast segments |
DE3628743C2 (en) | 1986-08-23 | 1994-05-11 | Grundig Emv | Device for recording and quickly retrieving video signal sections on a magnetic tape |
US4843484A (en) | 1986-09-12 | 1989-06-27 | Pioneer Electronic Corporation | Information recording disc with composite index codes and its playback method |
US5040081A (en) | 1986-09-23 | 1991-08-13 | Mccutchen David | Audiovisual synchronization signal generator using audio signature comparison |
US4714184A (en) | 1987-03-13 | 1987-12-22 | Fotima International Ltd. | Camera carrier |
US4947265A (en) | 1987-06-11 | 1990-08-07 | Sony Corporation | Apparatus and method for recording or reproducing still video and audio information and having after recording editing capability |
US4930158A (en) | 1987-09-02 | 1990-05-29 | Vogel Peter S | Selective video playing system |
JP2565209B2 (en) | 1987-12-28 | 1996-12-18 | ソニー株式会社 | Television signal processing device |
US4913539A (en) | 1988-04-04 | 1990-04-03 | New York Institute Of Technology | Apparatus and method for lip-synching animation |
US4847543A (en) | 1988-04-08 | 1989-07-11 | Ultimatte Corporation | Motion control drive interface |
US5514861A (en) | 1988-05-11 | 1996-05-07 | Symbol Technologies, Inc. | Computer and/or scanner system mounted on a glove |
US5012335A (en) | 1988-06-27 | 1991-04-30 | Alija Cohodar | Observation and recording system for a police vehicle |
US5025394A (en) | 1988-09-09 | 1991-06-18 | New York Institute Of Technology | Method and apparatus for generating animated images |
JP2977829B2 (en) | 1989-01-11 | 1999-11-15 | 株式会社東芝 | Moving picture reproducing apparatus and moving picture reproducing method |
JP2518683B2 (en) | 1989-03-08 | 1996-07-24 | 国際電信電話株式会社 | Image combining method and apparatus thereof |
US5253066C1 (en) | 1989-06-01 | 2001-05-22 | United Video Properties Inc | Tv recording and viewing control system |
US4934821A (en) | 1989-06-26 | 1990-06-19 | Eastman Kodak Company | Technique for scanning a microfilm image moving at a variable speed |
US5421031A (en) | 1989-08-23 | 1995-05-30 | Delta Beta Pty. Ltd. | Program transmission optimisation |
US5701582A (en) | 1989-08-23 | 1997-12-23 | Delta Beta Pty. Ltd. | Method and apparatus for efficient transmissions of programs |
US5249289A (en) | 1989-09-28 | 1993-09-28 | International Business Machines Corporation | System and method for rebuilding edited digital audio files |
JP3225356B2 (en) | 1989-11-29 | 2001-11-05 | コニカ株式会社 | Still video camera |
US5012334B1 (en) | 1990-01-29 | 1997-05-13 | Grass Valley Group | Video image bank for storing and retrieving video image sequences |
JPH03252287A (en) | 1990-02-28 | 1991-11-11 | Victor Co Of Japan Ltd | Moving picture compressor |
US5136655A (en) | 1990-03-26 | 1992-08-04 | Hewlett-Packard Company | Method and apparatus for indexing and retrieving audio-video data |
JP2958048B2 (en) | 1990-05-16 | 1999-10-06 | シャープ株式会社 | Television image processing device |
JPH0427280A (en) | 1990-05-22 | 1992-01-30 | Canon Inc | Camera integrated video recorder device |
US5477331A (en) | 1990-09-14 | 1995-12-19 | Canon Kabushiki Kaisha | Image recording apparatus with index information recording feature |
US5177796A (en) | 1990-10-19 | 1993-01-05 | International Business Machines Corporation | Image data processing of correlated images |
JPH04209384A (en) | 1990-11-30 | 1992-07-30 | Sharp Corp | Magnetic tape recording and reproducing device |
JPH04207788A (en) | 1990-11-30 | 1992-07-29 | Sony Corp | Band compression device |
US5305400A (en) | 1990-12-05 | 1994-04-19 | Deutsche Itt Industries Gmbh | Method of encoding and decoding the video data of an image sequence |
US5172281A (en) | 1990-12-17 | 1992-12-15 | Ardis Patrick M | Video transcript retriever |
US5253275A (en) | 1991-01-07 | 1993-10-12 | H. Lee Browne | Audio and video transmission and receiving system |
JPH04250436A (en) | 1991-01-11 | 1992-09-07 | Pioneer Electron Corp | Image pickup device |
US5317730A (en) | 1991-01-11 | 1994-05-31 | International Business Machines Corporation | System for modifying persistent database based upon set of data elements formed after selective insertion or deletion |
US5684514A (en) | 1991-01-11 | 1997-11-04 | Advanced Interaction, Inc. | Apparatus and method for assembling content addressable video |
US5187571A (en) | 1991-02-01 | 1993-02-16 | Bell Communications Research, Inc. | Television system for displaying multiple views of a remote location |
US5430835A (en) | 1991-02-15 | 1995-07-04 | Sierra On-Line, Inc. | Method and means for computer sychronization of actions and sounds |
US5241428A (en) | 1991-03-12 | 1993-08-31 | Goldwasser Eric P | Variable-delay video recorder |
CA2057961C (en) | 1991-05-06 | 2000-06-13 | Robert Paff | Graphical workstation for integrated security system |
US5185667A (en) | 1991-05-13 | 1993-02-09 | Telerobotics International, Inc. | Omniview motionless camera orientation system |
US5265180A (en) | 1991-06-13 | 1993-11-23 | Intel Corporation | Method of encoding a sequence of images of a digital motion video signal |
US5182641A (en) | 1991-06-17 | 1993-01-26 | The United States Of America As Represented By The Administrator Of The National Aeronautics And Space Administration | Composite video and graphics display for camera viewing systems in robotics and teleoperation |
DE69222580T2 (en) | 1991-07-15 | 1998-04-16 | Hitachi Ltd | Image encoder decoder and teleconferencing terminal |
US5175769A (en) | 1991-07-23 | 1992-12-29 | Rolm Systems | Method for time-scale modification of signals |
US5488409A (en) | 1991-08-19 | 1996-01-30 | Yuen; Henry C. | Apparatus and method for tracking the playing of VCR programs |
US5524193A (en) | 1991-10-15 | 1996-06-04 | And Communications | Interactive multimedia annotation method and apparatus |
JPH05145818A (en) | 1991-11-21 | 1993-06-11 | Sony Corp | Image pickup device |
US5861881A (en) * | 1991-11-25 | 1999-01-19 | Actv, Inc. | Interactive computer system for providing an interactive presentation with personalized video, audio and graphics responses for multiple viewers |
US5689648A (en) * | 1992-01-31 | 1997-11-18 | Raychem Corporation | Method and apparatus for publication of information |
US6208805B1 (en) | 1992-02-07 | 2001-03-27 | Max Abecassis | Inhibiting a control function from interfering with a playing of a video |
US5396287A (en) | 1992-02-25 | 1995-03-07 | Fuji Photo Optical Co., Ltd. | TV camera work control apparatus using tripod head |
KR100206261B1 (en) | 1992-02-28 | 1999-07-01 | 윤종용 | Video signal band compression device for a digital vtr |
ATE203844T1 (en) | 1992-03-20 | 2001-08-15 | Commw Scient Ind Res Org | OBJECT MONITORING SYSTEM |
JPH0756652B2 (en) | 1992-03-24 | 1995-06-14 | インターナショナル・ビジネス・マシーンズ・コーポレイション | Search for video frame sequence |
US5999173A (en) | 1992-04-03 | 1999-12-07 | Adobe Systems Incorporated | Method and apparatus for video editing with video clip representations displayed along a time line |
US5467288A (en) | 1992-04-10 | 1995-11-14 | Avid Technology, Inc. | Digital audio workstations providing digital storage and display of video information |
US5436653A (en) | 1992-04-30 | 1995-07-25 | The Arbitron Company | Method and system for recognition of broadcast segments |
US5692661A (en) | 1992-05-22 | 1997-12-02 | Kellerman; Theodore J. | Sports harness for a portable radio/cassette player |
US5295089A (en) | 1992-05-28 | 1994-03-15 | Emilio Ambasz | Soft, foldable consumer electronic products |
US5262856A (en) | 1992-06-04 | 1993-11-16 | Massachusetts Institute Of Technology | Video image compositing techniques |
US5703795A (en) * | 1992-06-22 | 1997-12-30 | Mankovitz; Roy J. | Apparatus and methods for accessing information relating to radio and television programs |
JPH0678307A (en) | 1992-07-06 | 1994-03-18 | Sanyo Electric Co Ltd | Remote controller and electronic device control system |
US5404316A (en) | 1992-08-03 | 1995-04-04 | Spectra Group Ltd., Inc. | Desktop digital video processing system |
DE69327220T2 (en) | 1992-10-09 | 2000-06-21 | Sony Corp., Tokio/Tokyo | Creation and recording of images |
US5396583A (en) | 1992-10-13 | 1995-03-07 | Apple Computer, Inc. | Cylindrical to planar image mapping using scanline coherence |
US5420801A (en) | 1992-11-13 | 1995-05-30 | International Business Machines Corporation | System and method for synchronization of multimedia streams |
US5329320A (en) | 1992-12-03 | 1994-07-12 | Aharon Yifrach | TV receiver and buffer system therefor |
KR940017747A (en) | 1992-12-29 | 1994-07-27 | 에프. 제이. 스미트 | Image processing device |
JP3382276B2 (en) | 1993-01-07 | 2003-03-04 | キヤノン株式会社 | Electronic device and control method thereof |
US5333091B2 (en) | 1993-01-08 | 1996-12-17 | Arthur D Little Enterprises | Method and apparatus for controlling a videotape player to automatically scan past recorded commercial messages |
US5377051A (en) | 1993-01-13 | 1994-12-27 | Hitachi America, Ltd. | Digital video recorder compatible receiver with trick play image enhancement |
FR2700908B1 (en) | 1993-01-26 | 1995-02-24 | Thomson Consumer Electronics | Buffer television receiver. |
US5406626A (en) | 1993-03-15 | 1995-04-11 | Macrovision Corporation | Radio receiver for information dissemenation using subcarrier |
US5590195A (en) | 1993-03-15 | 1996-12-31 | Command Audio Corporation | Information dissemination using various transmission modes |
US8046800B2 (en) * | 1993-03-29 | 2011-10-25 | Microsoft Corporation | Remotely controlling a video recorder |
US5440348A (en) | 1993-04-16 | 1995-08-08 | Avid Technology, Inc. | Method and user interface for creating, specifying and adjusting motion picture transitions |
WO1994026061A1 (en) | 1993-04-29 | 1994-11-10 | Michael Friedland | Hands free video camera system |
US5343251A (en) | 1993-05-13 | 1994-08-30 | Pareto Partners, Inc. | Method and apparatus for classifying patterns of television programs and commercials based on discerning of broadcast audio and video signals |
EP0625857B1 (en) | 1993-05-19 | 1998-06-24 | ALCATEL BELL Naamloze Vennootschap | Video server |
US5416310A (en) | 1993-05-28 | 1995-05-16 | Symbol Technologies, Inc. | Computer and/or scanner system incorporated into a garment |
GB2278907A (en) | 1993-06-08 | 1994-12-14 | Vinten Group Plc | Manual control system for camera mountings |
US5438423C1 (en) | 1993-06-25 | 2002-08-27 | Grass Valley Us Inc | Time warping for video viewing |
US5384703A (en) | 1993-07-02 | 1995-01-24 | Xerox Corporation | Method and apparatus for summarizing documents according to theme |
US5608839A (en) | 1994-03-18 | 1997-03-04 | Lucent Technologies Inc. | Sound-synchronized video system |
WO1995011566A1 (en) | 1993-10-20 | 1995-04-27 | Videoconferencing Systems, Inc. | Adaptive videoconferencing system |
US5886739A (en) | 1993-11-01 | 1999-03-23 | Winningstad; C. Norman | Portable automatic tracking video recording system |
US5473379A (en) | 1993-11-04 | 1995-12-05 | At&T Corp. | Method and apparatus for improving motion compensation in digital video coding |
US5438357A (en) | 1993-11-23 | 1995-08-01 | Mcnelley; Steve H. | Image manipulating teleconferencing system |
US5828786A (en) | 1993-12-02 | 1998-10-27 | General Instrument Corporation | Analyzer and methods for detecting and processing video data types in a video data stream |
US5467271A (en) | 1993-12-17 | 1995-11-14 | Trw, Inc. | Mapping and analysis system for precision farming applications |
JPH07219970A (en) | 1993-12-20 | 1995-08-18 | Xerox Corp | Method and apparatus for reproduction in acceleration format |
US5436542A (en) | 1994-01-28 | 1995-07-25 | Surgix, Inc. | Telescopic camera mount with remotely controlled positioning |
JPH07264452A (en) | 1994-02-03 | 1995-10-13 | Samsung Electron Co Ltd | Built-in camera type magnetic recording/reproducing device, and method therefor |
US5592626A (en) | 1994-02-07 | 1997-01-07 | The Regents Of The University Of California | System and method for selecting cache server based on transmission and storage factors for efficient delivery of multimedia information in a hierarchical network of servers |
US5537151A (en) * | 1994-02-16 | 1996-07-16 | Ati Technologies Inc. | Close caption support with timewarp |
DE4408131A1 (en) | 1994-03-10 | 1995-07-06 | Otto Marchner | Auxiliary appts. built into TV receiver |
US5623173A (en) | 1994-03-18 | 1997-04-22 | Lucent Technologies Inc. | Bus structure for power system |
CA2144795A1 (en) | 1994-03-18 | 1995-09-19 | Homer H. Chen | Audio visual dubbing system and method |
JPH07274049A (en) | 1994-03-30 | 1995-10-20 | Sony Corp | Electronic equipment provided with memory for function information |
US5524051A (en) | 1994-04-06 | 1996-06-04 | Command Audio Corporation | Method and system for audio information dissemination using various modes of transmission |
DE69517647T2 (en) | 1994-04-25 | 2001-02-22 | Sony Corp., Tokio/Tokyo | VIDEO SIGNAL PLAYER |
US5583652A (en) | 1994-04-28 | 1996-12-10 | International Business Machines Corporation | Synchronized, variable-speed playback of digitally recorded audio and video |
US6069621A (en) * | 1994-05-10 | 2000-05-30 | Schupak; Donald | Distributed computer system for providing audio, video, and information signals to plural modules throughout a home |
US5550754A (en) | 1994-05-13 | 1996-08-27 | Videoptic Research | Teleconferencing camcorder |
US5796426A (en) | 1994-05-27 | 1998-08-18 | Warp, Ltd. | Wide-angle image dewarping method and apparatus |
US5606359A (en) | 1994-06-30 | 1997-02-25 | Hewlett-Packard Company | Video on demand system with multiple data sources configured to provide vcr-like services |
US5546145A (en) | 1994-08-30 | 1996-08-13 | Eastman Kodak Company | Camera on-board voice recognition |
JPH0879685A (en) | 1994-08-31 | 1996-03-22 | Sony Corp | Program reproducing device for near-video-on-demand system |
US5613032A (en) | 1994-09-02 | 1997-03-18 | Bell Communications Research, Inc. | System and method for recording, playing back and searching multimedia events wherein video, audio and text can be searched and retrieved |
JPH0879626A (en) | 1994-09-05 | 1996-03-22 | Sony Corp | Video device |
US5805156A (en) | 1994-09-19 | 1998-09-08 | Intel Corporation | Automated media capturing system |
US5598352A (en) | 1994-09-30 | 1997-01-28 | Cirrus Logic, Inc. | Method and apparatus for audio and video synchronizing in MPEG playback systems |
US5575443A (en) | 1994-10-04 | 1996-11-19 | Honeycutt; Jay W. | Quick release accessory mount on a bicycle |
US5920842A (en) | 1994-10-12 | 1999-07-06 | Pixel Instruments | Signal synchronization |
US5594498A (en) | 1994-10-14 | 1997-01-14 | Semco, Inc. | Personal audio/video surveillance system |
US5926205A (en) | 1994-10-19 | 1999-07-20 | Imedia Corporation | Method and apparatus for encoding and formatting data representing a video program to provide multiple overlapping presentations of the video program |
US5612742A (en) | 1994-10-19 | 1997-03-18 | Imedia Corporation | Method and apparatus for encoding and formatting data representing a video program to provide multiple overlapping presentations of the video program |
US5614940A (en) * | 1994-10-21 | 1997-03-25 | Intel Corporation | Method and apparatus for providing broadcast information with indexing |
US5687095A (en) | 1994-11-01 | 1997-11-11 | Lucent Technologies Inc. | Video transmission rate matching for multimedia communication systems |
TW301101B (en) | 1994-11-17 | 1997-03-21 | Matsushita Electric Ind Co Ltd | |
US6266085B1 (en) | 1994-11-17 | 2001-07-24 | Canon Kabushiki Kaisha | Camera imaging and magnification device |
US5758257A (en) * | 1994-11-29 | 1998-05-26 | Herz; Frederick | System and method for scheduling broadcast of and access to video programs and other data using customer profiles |
US5617565A (en) * | 1994-11-29 | 1997-04-01 | Hitachi America, Ltd. | Broadcast interactive multimedia system |
GB9426165D0 (en) * | 1994-12-23 | 1995-02-22 | Anthony Andre C | Method of retrieving and displaying data |
JP3392967B2 (en) | 1994-12-27 | 2003-03-31 | ペンタックス株式会社 | Still video camera |
JP3804070B2 (en) | 1994-12-28 | 2006-08-02 | ソニー株式会社 | Data transmission apparatus and method |
WO1996027983A1 (en) | 1995-03-07 | 1996-09-12 | Interval Research Corporation | System and method for selective recording of information |
JPH08249348A (en) | 1995-03-13 | 1996-09-27 | Hitachi Ltd | Method and device for video retrieval |
IT1279171B1 (en) | 1995-03-17 | 1997-12-04 | Ist Trentino Di Cultura | CONTINUOUS SPEECH RECOGNITION SYSTEM |
JP3315555B2 (en) | 1995-04-07 | 2002-08-19 | キヤノン株式会社 | Camera control device |
US5729741A (en) | 1995-04-10 | 1998-03-17 | Golden Enterprises, Inc. | System for storage and retrieval of diverse types of information obtained from different media sources which includes video, audio, and text transcriptions |
US5666159A (en) | 1995-04-24 | 1997-09-09 | Eastman Kodak Company | Electronic camera system with programmable transmission capability |
US5838874A (en) | 1995-05-08 | 1998-11-17 | Kabushiki Kaisha Toshiba | Audiovisual encoding system with a reduced number of audio encoders |
US5572261A (en) | 1995-06-07 | 1996-11-05 | Cooper; J. Carl | Automatic audio to video timing measurement device and method |
US7302638B1 (en) * | 1995-06-07 | 2007-11-27 | Wolfe Mark A | Efficiently displaying and researching information about the interrelationships between documents |
US5724646A (en) | 1995-06-15 | 1998-03-03 | International Business Machines Corporation | Fixed video-on-demand |
US5682597A (en) | 1995-06-15 | 1997-10-28 | International Business Machines Corporation | Hybrid video-on-demand based on a near-video-on-demand system |
JPH0916457A (en) | 1995-06-28 | 1997-01-17 | Fujitsu Ltd | Multimedia data retrieval system |
US5539483A (en) | 1995-06-30 | 1996-07-23 | At&T Corp. | Panoramic projection apparatus |
DE69637452D1 (en) * | 1995-07-31 | 2008-04-17 | Toshiba Kawasaki Kk | Interactive television system |
US5907836A (en) * | 1995-07-31 | 1999-05-25 | Kabushiki Kaisha Toshiba | Information filtering apparatus for selecting predetermined article from plural articles to present selected article to user, and method therefore |
US5649186A (en) * | 1995-08-07 | 1997-07-15 | Silicon Graphics Incorporated | System and method for a computer-based dynamic information clipping service |
US5742517A (en) | 1995-08-29 | 1998-04-21 | Integrated Computer Utilities, Llc | Method for randomly accessing stored video and a field inspection system employing the same |
WO1997010564A1 (en) | 1995-09-15 | 1997-03-20 | Interval Research Corporation | A method of compressing a plurality of video images |
US5694474A (en) | 1995-09-18 | 1997-12-02 | Interval Research Corporation | Adaptive filter for signal processing and method therefor |
US5721823A (en) | 1995-09-29 | 1998-02-24 | Hewlett-Packard Co. | Digital layout method suitable for near video on demand system |
US5751336A (en) | 1995-10-12 | 1998-05-12 | International Business Machines Corporation | Permutation based pyramid block transmission scheme for broadcasting in video-on-demand storage systems |
JPH09121358A (en) | 1995-10-25 | 1997-05-06 | Matsushita Electric Ind Co Ltd | Picture coding/decoding device and its method |
US5768640A (en) | 1995-10-27 | 1998-06-16 | Konica Corporation | Camera having an information recording function |
US5678793A (en) | 1995-10-30 | 1997-10-21 | Hill; Gregory Hill | Bracket for mounting a hand holdable appliance or the like |
US5717869A (en) | 1995-11-03 | 1998-02-10 | Xerox Corporation | Computer controlled display system using a timeline to control playback of temporal data representing collaborative activities |
US6282362B1 (en) | 1995-11-07 | 2001-08-28 | Trimble Navigation Limited | Geographical position/image digital recording and display system |
US6118925A (en) | 1995-11-14 | 2000-09-12 | Hitachi Denshi Kabushiki Kaisha | Method of and system for confirming program materials to be broadcasted and then broadcasting the program materials, and recording medium having recorded therein a procedure for implementing the method |
US5726660A (en) | 1995-12-01 | 1998-03-10 | Purdy; Peter K. | Personal data collection and reporting system |
US5752113A (en) | 1995-12-22 | 1998-05-12 | Borden; John | Panoramic indexing camera mount |
US5740037A (en) | 1996-01-22 | 1998-04-14 | Hughes Aircraft Company | Graphical user interface system for manportable applications |
US5936659A (en) | 1996-01-31 | 1999-08-10 | Telcordia Technologies, Inc. | Method for video delivery using pyramid broadcasting |
US6061056A (en) * | 1996-03-04 | 2000-05-09 | Telexis Corporation | Television monitoring system with automatic selection of program material of interest and subsequent display under user control |
CN1130076C (en) | 1996-03-04 | 2003-12-03 | 松下电器产业株式会社 | Image selecting/displaying appts. |
US5791907A (en) * | 1996-03-08 | 1998-08-11 | Ramshaw; Bruce J. | Interactive medical training system |
US5774664A (en) * | 1996-03-08 | 1998-06-30 | Actv, Inc. | Enhanced video programming system and method for incorporating and displaying retrieved integrated internet information segments |
US5778181A (en) * | 1996-03-08 | 1998-07-07 | Actv, Inc. | Enhanced video programming system and method for incorporating and displaying retrieved integrated internet information segments |
US5826206A (en) | 1996-03-12 | 1998-10-20 | Training Innovations Group, Llc | Debriefing systems and methods for retrieving and presenting multiple datastreams with time indication marks in time synchronism |
US5880788A (en) | 1996-03-25 | 1999-03-09 | Interval Research Corporation | Automated synchronization of video image sequences to new soundtracks |
US6025837A (en) * | 1996-03-29 | 2000-02-15 | Microsoft Corporation | Electronic program guide with hyperlinks to target resources |
US5737009A (en) | 1996-04-04 | 1998-04-07 | Hughes Electronics | On-demand digital information delivery system and method using signal fragmentation and linear/fractal sequencing. |
US5831662A (en) | 1996-04-04 | 1998-11-03 | Hughes Electronics Corporation | Near on-demand digital information delivery system and method using signal fragmentation and sequencing to reduce average bandwidth and peak bandwidth variability |
US6404811B1 (en) | 1996-05-13 | 2002-06-11 | Tektronix, Inc. | Interactive multimedia system |
US6141693A (en) | 1996-06-03 | 2000-10-31 | Webtv Networks, Inc. | Method and apparatus for extracting digital data from a video stream and using the digital data to configure the video stream for display on a television set |
US5828994A (en) | 1996-06-05 | 1998-10-27 | Interval Research Corporation | Non-uniform time scale modification of recorded audio |
US6098082A (en) * | 1996-07-15 | 2000-08-01 | At&T Corp | Method for automatically providing a compressed rendition of a video program in a format suitable for electronic searching and retrieval |
US6160950A (en) | 1996-07-18 | 2000-12-12 | Matsushita Electric Industrial Co., Ltd. | Method and apparatus for automatically generating a digest of a program |
US5928327A (en) | 1996-08-08 | 1999-07-27 | Wang; Pong-Sheng | System and process for delivering digital data on demand |
US20030093790A1 (en) | 2000-03-28 | 2003-05-15 | Logan James D. | Audio and video program recording, editing and playback systems using metadata |
US6199076B1 (en) * | 1996-10-02 | 2001-03-06 | James Logan | Audio program player including a dynamic program selection controller |
US5892536A (en) * | 1996-10-03 | 1999-04-06 | Personal Audio | Systems and methods for computer enhanced broadcast monitoring |
US20020120925A1 (en) | 2000-03-28 | 2002-08-29 | Logan James D. | Audio and video program recording, editing and playback systems using metadata |
US5946050A (en) * | 1996-10-04 | 1999-08-31 | Samsung Electronics Co., Ltd. | Keyword listening device |
US5974235A (en) | 1996-10-31 | 1999-10-26 | Sensormatic Electronics Corporation | Apparatus having flexible capabilities for analysis of video information |
US6512551B1 (en) * | 1996-11-12 | 2003-01-28 | Compaq Computer Corporation | Platform for displaying information from multiple sources |
US5893062A (en) | 1996-12-05 | 1999-04-06 | Interval Research Corporation | Variable rate video playback with synchronized audio |
US6172675B1 (en) | 1996-12-05 | 2001-01-09 | Interval Research Corporation | Indirect manipulation of data using temporally related data, with particular application to manipulation of audio or audiovisual data |
US6005564A (en) | 1996-12-05 | 1999-12-21 | Interval Research Corporation | Display pause with elastic playback |
US5917542A (en) | 1997-02-18 | 1999-06-29 | Eastman Kodak Company | System and method for digital image capture and transmission |
US6061055A (en) | 1997-03-21 | 2000-05-09 | Autodesk, Inc. | Method of tracking objects with an imaging device |
US5749010A (en) | 1997-04-18 | 1998-05-05 | Mccumber Enterprises, Inc. | Camera support |
US6243725B1 (en) | 1997-05-21 | 2001-06-05 | Premier International, Ltd. | List building system |
JP3528524B2 (en) | 1997-07-10 | 2004-05-17 | ソニー株式会社 | Recording / reproducing apparatus, recording / reproducing method, and recording medium |
US6624846B1 (en) | 1997-07-18 | 2003-09-23 | Interval Research Corporation | Visual user interface for use in controlling the interaction of a device with a spatial region |
US20020031331A1 (en) | 1997-08-12 | 2002-03-14 | Index Systems, Inc. | Apparatus and methods for voice titles |
US6360234B2 (en) | 1997-08-14 | 2002-03-19 | Virage, Inc. | Video cataloger system with synchronized encoders |
US5768648A (en) | 1997-09-05 | 1998-06-16 | Roy Isaia | Camera mount for controlled and steady rolling movement |
US6961954B1 (en) | 1997-10-27 | 2005-11-01 | The Mitre Corporation | Automated segmentation, information extraction, summarization, and presentation of broadcast news |
US6072542A (en) | 1997-11-25 | 2000-06-06 | Fuji Xerox Co., Ltd. | Automatic video segmentation using hidden markov model |
US5940004A (en) | 1997-12-18 | 1999-08-17 | Fulton; John G. | Personal recovery system |
US6272231B1 (en) * | 1998-11-06 | 2001-08-07 | Eyematic Interfaces, Inc. | Wavelet-based facial motion capture for avatar animation |
US6018359A (en) | 1998-04-24 | 2000-01-25 | Massachusetts Institute Of Technology | System and method for multicast video-on-demand delivery system |
US6163510A (en) | 1998-06-30 | 2000-12-19 | International Business Machines Corporation | Multimedia search and indexing system and method of operation using audio cues with signal thresholds |
US6366296B1 (en) | 1998-09-11 | 2002-04-02 | Xerox Corporation | Media browser using multimodal analysis |
US6452969B1 (en) | 1998-09-28 | 2002-09-17 | Thomson Licensing S.A. | Transform domain inverse motion compensation having fractional pel accuracy |
US6317039B1 (en) | 1998-10-19 | 2001-11-13 | John A. Thomason | Wireless video audio data remote system |
US6993787B1 (en) | 1998-10-29 | 2006-01-31 | Matsushita Electric Industrial Co., Ltd. | Providing VCR functionality for data-centered video multicast |
US7024678B2 (en) | 1998-11-30 | 2006-04-04 | Sedna Patent Services, Llc | Method and apparatus for producing demand real-time television |
US6297845B1 (en) | 1998-12-29 | 2001-10-02 | International Business Machines Corporation | System and method of in-service testing of compressed digital broadcast video |
US6825875B1 (en) | 1999-01-05 | 2004-11-30 | Interval Research Corporation | Hybrid recording unit including portable video recorder and auxillary device |
US6934461B1 (en) | 1999-01-05 | 2005-08-23 | Interval Research Corporation | Low attention recording, with particular application to social recording |
US6236395B1 (en) | 1999-02-01 | 2001-05-22 | Sharp Laboratories Of America, Inc. | Audiovisual information management system |
US6934759B2 (en) | 1999-05-26 | 2005-08-23 | Enounce, Inc. | Method and apparatus for user-time-alignment for broadcast works |
US6502139B1 (en) | 1999-06-01 | 2002-12-31 | Technion Research And Development Foundation Ltd. | System for optimizing video on demand transmission by partitioning video program into multiple segments, decreasing transmission rate for successive segments and repeatedly, simultaneously transmission |
US6986156B1 (en) | 1999-06-11 | 2006-01-10 | Scientific Atlanta, Inc | Systems and methods for adaptive scheduling and dynamic bandwidth resource allocation management in a digital broadband delivery system |
US7143431B1 (en) | 1999-08-06 | 2006-11-28 | Wisconsin Alumni Research Foundation | Method for reduced bandwidth for on-demand data streaming using mini-clusters |
US6868452B1 (en) | 1999-08-06 | 2005-03-15 | Wisconsin Alumni Research Foundation | Method for caching of media files to reduce delivery cost |
US7155735B1 (en) | 1999-10-08 | 2006-12-26 | Vulcan Patents Llc | System and method for the broadcast dissemination of time-ordered data |
US20020157103A1 (en) | 2000-01-07 | 2002-10-24 | Deyang Song | Method for digital media playback in a broadcast network |
KR100317303B1 (en) | 2000-01-10 | 2001-12-22 | 구자홍 | apparatus for synchronizing video indexing between A/V and data at writing and reading of broadcasting program using metadata |
EP2076036A2 (en) | 2000-01-14 | 2009-07-01 | NDS Limited | Advertisement in an end-user controlled playback environment |
US6701528B1 (en) | 2000-01-26 | 2004-03-02 | Hughes Electronics Corporation | Virtual video on demand using multiple encrypted video segments |
US6622305B1 (en) | 2000-02-25 | 2003-09-16 | Opentv, Inc. | System and method for displaying near video on demand |
US20040123324A1 (en) | 2000-03-07 | 2004-06-24 | Sazzad Sharif M. | Methods and apparatus for providing video services such as Video-on-Demand, news and advertising services |
US20020049975A1 (en) * | 2000-04-05 | 2002-04-25 | Thomas William L. | Interactive wagering system with multiple display support |
JP2001306581A (en) | 2000-04-18 | 2001-11-02 | Sony Corp | Middleware and media data audiovisual equipment using the middleware |
US7266771B1 (en) | 2000-04-21 | 2007-09-04 | Vulcan Patents Llc | Video stream representation and navigation using inherent data |
US7194186B1 (en) | 2000-04-21 | 2007-03-20 | Vulcan Patents Llc | Flexible marking of recording data by a recording unit |
KR100547317B1 (en) | 2000-07-14 | 2006-01-26 | 엘지전자 주식회사 | Simultaneous recording and playback apparatus with indexing/searching/browsing functionality |
TWI230858B (en) | 2000-12-12 | 2005-04-11 | Matsushita Electric Ind Co Ltd | File management method, content recording/playback apparatus and content recording program |
MY147018A (en) | 2001-01-04 | 2012-10-15 | Thomson Licensing Sa | A method and apparatus for acquiring media services available from content aggregators |
US20020170068A1 (en) | 2001-03-19 | 2002-11-14 | Rafey Richter A. | Virtual and condensed television programs |
US20020159750A1 (en) | 2001-04-26 | 2002-10-31 | Koninklijke Philips Electronics N.V. | Method for segmenting and indexing TV programs using multi-media cues |
US7055103B2 (en) | 2001-08-28 | 2006-05-30 | Itzhak Lif | Method of matchmaking service |
US7610358B2 (en) * | 2001-11-26 | 2009-10-27 | Time Warner Cable | System and method for effectively presenting multimedia information materials |
US20030149574A1 (en) | 2002-02-05 | 2003-08-07 | Rudman Daniel E. | Method for providing media consumers with total choice and total control |
US7130528B2 (en) | 2002-03-01 | 2006-10-31 | Thomson Licensing | Audio data deletion and silencing during trick mode replay |
KR100447200B1 (en) | 2002-07-30 | 2004-09-04 | 엘지전자 주식회사 | System for decoding video with PVR function |
FR2855705A1 (en) * | 2003-05-28 | 2004-12-03 | Thomson Licensing Sa | NAVIGATION METHOD FOR SELECTING DOCUMENTS ASSOCIATED WITH IDENTIFIERS, AND RECEIVER IMPLEMENTING THE METHOD. |
US7788696B2 (en) * | 2003-10-15 | 2010-08-31 | Microsoft Corporation | Inferring information about media stream objects |
US20060053470A1 (en) | 2004-04-30 | 2006-03-09 | Vulcan Inc. | Management and non-linear presentation of augmented broadcasted or streamed multimedia content |
US20060031879A1 (en) | 2004-04-30 | 2006-02-09 | Vulcan Inc. | Management and non-linear presentation of news-related broadcasted or streamed multimedia content |
US20060031885A1 (en) | 2004-04-30 | 2006-02-09 | Vulcan Inc. | Management and non-linear presentation of music-related broadcasted or streamed multimedia content |
US20060031916A1 (en) | 2004-04-30 | 2006-02-09 | Vulcan Inc. | Management and non-linear presentation of broadcasted or streamed multimedia content |
US7627890B2 (en) * | 2006-02-21 | 2009-12-01 | At&T Intellectual Property, I,L.P. | Methods, systems, and computer program products for providing content synchronization or control among one or more devices |
US7890849B2 (en) * | 2006-09-15 | 2011-02-15 | Microsoft Corporation | Concurrent presentation of media and related content lists |
FR2910769B1 (en) * | 2006-12-21 | 2009-03-06 | Thomson Licensing Sas | METHOD FOR CREATING A SUMMARY OF AUDIOVISUAL DOCUMENT COMPRISING A SUMMARY AND REPORTS, AND RECEIVER IMPLEMENTING THE METHOD |
- 1996-12-05: US application US08/761,030, patent US6263507B1 (not active: Expired - Lifetime)
- 1997-12-03: WO application PCT/US1997/022145, publication WO1998027497A1 (active: Application Filing)
- 1997-12-03: AU application AU55154/98A, publication AU5515498A (not active: Abandoned)
- 1999-06-25: US application US09/344,213, patent US6880171B1 (not active: Expired - Fee Related)
- 2001-05-29: US application US09/866,956, publication US20010025375A1 (not active: Abandoned)
- 2007-03-05: US application US11/682,201, patent US8176515B2 (not active: Expired - Fee Related)
- 2012-05-07: US application US13/465,920, publication US20120293522A1 (not active: Abandoned)
Patent Citations (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5109482A (en) * | 1989-01-11 | 1992-04-28 | David Bohrman | Interactive video control system for displaying user-selectable clips |
US5537530A (en) * | 1992-08-12 | 1996-07-16 | International Business Machines Corporation | Video editing by locating segment boundaries and reordering segment sequences |
US5635982A (en) * | 1994-06-27 | 1997-06-03 | Zhang; Hong J. | System for automatic video segmentation and key frame extraction for video sequences having both sharp and gradual transitions |
US5613909A (en) * | 1994-07-21 | 1997-03-25 | Stelovsky; Jan | Time-segmented multimedia game playing and authoring system |
US5664227A (en) * | 1994-10-14 | 1997-09-02 | Carnegie Mellon University | System and method for skimming digital audio/video data |
US5835667A (en) * | 1994-10-14 | 1998-11-10 | Carnegie Mellon University | Method and apparatus for creating a searchable digital video library and a system and method of using such a library |
US5818439A (en) * | 1995-02-20 | 1998-10-06 | Hitachi, Ltd. | Video viewing assisting method and a video playback system therefor |
US5703655A (en) * | 1995-03-24 | 1997-12-30 | U S West Technologies, Inc. | Video programming retrieval using extracted closed caption data which has been partitioned and stored to facilitate a search and retrieval process |
US5758181A (en) * | 1996-01-22 | 1998-05-26 | International Business Machines Corporation | Method and system for accelerated presentation of segmented data |
US6263507B1 (en) * | 1996-12-05 | 2001-07-17 | Interval Research Corporation | Browser for use in navigating a body of information, with particular application to browsing information represented by audiovisual data |
US6880171B1 (en) * | 1996-12-05 | 2005-04-12 | Interval Research Corporation | Browser for use in navigating a body of information, with particular application to browsing information represented by audiovisual data |
Cited By (44)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9319735B2 (en) | 1995-06-07 | 2016-04-19 | Rovi Guides, Inc. | Electronic television program guide schedule system and method with data feed access |
US9191722B2 (en) | 1997-07-21 | 2015-11-17 | Rovi Guides, Inc. | System and method for modifying advertisement responsive to EPG information |
US8806536B2 (en) | 1998-03-04 | 2014-08-12 | United Video Properties, Inc. | Program guide system with preference profiles |
US9854321B2 (en) | 1998-08-21 | 2017-12-26 | Rovi Guides, Inc. | Client-server electronic program guide |
US8087050B2 (en) * | 1998-08-21 | 2011-12-27 | United Video Properties, Inc. | Client-server electronic program guide |
US9426509B2 (en) | 1998-08-21 | 2016-08-23 | Rovi Guides, Inc. | Client-server electronic program guide |
US6782551B1 (en) * | 1999-07-15 | 2004-08-24 | Pace Micro Technology Plc | System for indicating when a program has been selected from a program guide display |
US20070226640A1 (en) * | 2000-11-15 | 2007-09-27 | Holbrook David M | Apparatus and methods for organizing and/or presenting data |
USRE46651E1 (en) | 2000-11-15 | 2017-12-26 | Callahan Cellular L.L.C. | Apparatus and methods for organizing and/or presenting data |
US7627884B2 (en) * | 2002-05-08 | 2009-12-01 | Fujitsu Ten Limited | Program information display apparatus with program selection input |
US20030210350A1 (en) * | 2002-05-08 | 2003-11-13 | Fujitsu Ten Limited | Program information display apparatus |
US8230470B2 (en) | 2003-01-15 | 2012-07-24 | Robertson Neil C | Full duplex wideband communications system for a local coaxial network |
US20040177317A1 (en) * | 2003-03-07 | 2004-09-09 | John Bradstreet | Closed caption navigation |
US20040199906A1 (en) * | 2003-04-01 | 2004-10-07 | Mcknight Russell F. | Systems and methods for saving files having different media types |
EP1482727A3 (en) * | 2003-05-28 | 2010-06-23 | Thomson Licensing | Process of navigation for the selection of documents associated with identifiers, and apparatus implementing the process. |
US7823067B2 (en) | 2003-05-28 | 2010-10-26 | Thomson Licensing | Process of navigation for the selection of documents associated with identifiers, and apparatus implementing the process |
FR2855705A1 (en) * | 2003-05-28 | 2004-12-03 | Thomson Licensing Sa | NAVIGATION METHOD FOR SELECTING DOCUMENTS ASSOCIATED WITH IDENTIFIERS, AND RECEIVER IMPLEMENTING THE METHOD. |
EP1482727A2 (en) * | 2003-05-28 | 2004-12-01 | Thomson Licensing S.A. | Process of navigation for the selection of documents associated with identifiers, and apparatus implementing the process. |
US20120311434A1 (en) * | 2003-12-17 | 2012-12-06 | Richard Skrenta | System and method for automating categorization and aggregation of content from network sites |
US8645385B2 (en) * | 2003-12-17 | 2014-02-04 | Topix Llc | System and method for automating categorization and aggregation of content from network sites |
US20060210248A1 (en) * | 2005-03-18 | 2006-09-21 | Kabushiki Kaisha Toshiba | Information recording apparatus and information |
US9075861B2 (en) | 2006-03-06 | 2015-07-07 | Veveo, Inc. | Methods and systems for segmenting relative user preferences into fine-grain and coarse-grain collections |
US9092503B2 (en) | 2006-03-06 | 2015-07-28 | Veveo, Inc. | Methods and systems for selecting and presenting content based on dynamically identifying microgenres associated with the content |
US9128987B2 (en) | 2006-03-06 | 2015-09-08 | Veveo, Inc. | Methods and systems for selecting and presenting content based on a comparison of preference signatures from multiple users |
US10984037B2 (en) | 2006-03-06 | 2021-04-20 | Veveo, Inc. | Methods and systems for selecting and presenting content on a first system based on user preferences learned on a second system |
US9749693B2 (en) | 2006-03-24 | 2017-08-29 | Rovi Guides, Inc. | Interactive media guidance application with intelligent navigation and display features |
US8699806B2 (en) * | 2006-04-12 | 2014-04-15 | Google Inc. | Method and apparatus for automatically summarizing video |
US20070245242A1 (en) * | 2006-04-12 | 2007-10-18 | Yagnik Jay N | Method and apparatus for automatically summarizing video |
US8229156B1 (en) | 2006-08-08 | 2012-07-24 | Google Inc. | Using curve invariants to automatically characterize videos |
US20080086453A1 (en) * | 2006-10-05 | 2008-04-10 | Fabian-Baber, Inc. | Method and apparatus for correlating the results of a computer network text search with relevant multimedia files |
US10694256B2 (en) | 2007-03-09 | 2020-06-23 | Rovi Technologies Corporation | Media content search results ranked by popularity |
US9326025B2 (en) | 2007-03-09 | 2016-04-26 | Rovi Technologies Corporation | Media content search results ranked by popularity |
US20110042824A1 (en) * | 2009-08-20 | 2011-02-24 | Fujitsu Limited | Multi-chip module and method of manufacturing the same |
US9166714B2 (en) | 2009-09-11 | 2015-10-20 | Veveo, Inc. | Method of and system for presenting enriched video viewing analytics |
US10631066B2 (en) | 2009-09-23 | 2020-04-21 | Rovi Guides, Inc. | Systems and method for automatically detecting users within detection regions of media devices |
US9736524B2 (en) | 2011-01-06 | 2017-08-15 | Veveo, Inc. | Methods of and systems for content search based on environment sampling |
US9852762B2 (en) | 2013-02-05 | 2017-12-26 | Alc Holdings, Inc. | User interface for video preview creation |
US20140223482A1 (en) * | 2013-02-05 | 2014-08-07 | Redux, Inc. | Video preview creation with link |
US9881646B2 (en) | 2013-02-05 | 2018-01-30 | Alc Holdings, Inc. | Video preview creation with audio |
US10373646B2 (en) | 2013-02-05 | 2019-08-06 | Alc Holdings, Inc. | Generation of layout of videos |
US9767845B2 (en) | 2013-02-05 | 2017-09-19 | Alc Holdings, Inc. | Activating a video based on location in screen |
US10643660B2 (en) | 2013-02-05 | 2020-05-05 | Alc Holdings, Inc. | Video preview creation with audio |
US9589594B2 (en) | 2013-02-05 | 2017-03-07 | Alc Holdings, Inc. | Generation of layout of videos |
US9530452B2 (en) * | 2013-02-05 | 2016-12-27 | Alc Holdings, Inc. | Video preview creation with link |
Also Published As
Publication number | Publication date |
---|---|
WO1998027497A1 (en) | 1998-06-25 |
US20070204319A1 (en) | 2007-08-30 |
AU5515498A (en) | 1998-07-15 |
US6880171B1 (en) | 2005-04-12 |
US6263507B1 (en) | 2001-07-17 |
US8176515B2 (en) | 2012-05-08 |
US20120293522A1 (en) | 2012-11-22 |
Similar Documents
Publication | Title |
---|---|
US6263507B1 (en) | Browser for use in navigating a body of information, with particular application to browsing information represented by audiovisual data |
US10034028B2 (en) | Caption and/or metadata synchronization for replay of previously or simultaneously recorded live programs |
US8528019B1 (en) | Method and apparatus for audio/data/visual information |
CA2202540C (en) | System and method for skimming digital audio/video data |
US5703655A (en) | Video programming retrieval using extracted closed caption data which has been partitioned and stored to facilitate a search and retrieval process |
JP4905103B2 (en) | Movie playback device |
US5613032A (en) | System and method for recording, playing back and searching multimedia events wherein video, audio and text can be searched and retrieved |
US8448068B2 (en) | Information processing apparatus, information processing method, program, and storage medium |
US20010049826A1 (en) | Method of searching video channels by content |
US9576581B2 (en) | Metatagging of captions |
US20050080631A1 (en) | Information processing apparatus and method therefor |
JP2002533841A (en) | Personal video classification and search system |
JP2003510625A (en) | Method and apparatus for preparing a creation filtered by listener interest |
WO2000007310A1 (en) | System for analyzing television programs |
Gauch et al. | The VISION digital video library |
CN1976430B (en) | Method for realizing previewing mobile multimedia program in terminal |
Amir et al. | Efficient Video Browsing: Using Multiple Synchronized Views |
KR100678895B1 (en) | Apparatus and method for creating model-based segment metadata |
GB2349764A (en) | 2-D Moving image database |
US9400842B2 (en) | Method for selection of a document shot using graphic paths and receiver implementing the method |
JP2000308017A (en) | Video audience device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
 | AS | Assignment | Owner name: VULCAN PATENTS LLC, WASHINGTON; Free format text: ASSIGNMENT OF ASSIGNORS INTEREST; ASSIGNOR: INTERVAL RESEARCH CORPORATION; REEL/FRAME: 016227/0693; Effective date: 20041229 |
 | STCB | Information on status: application discontinuation | Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION |