EP1571579A1 - Principes et procédés pour la personnalisation de newsfeeds par une analyse de la nouveauté et de la dynamique de l'information - Google Patents
Principes et procédés pour la personnalisation de newsfeeds par une analyse de la nouveauté et de la dynamique de l'information Download PDFInfo
- Publication number
- EP1571579A1 EP1571579A1 EP05101400A EP05101400A EP1571579A1 EP 1571579 A1 EP1571579 A1 EP 1571579A1 EP 05101400 A EP05101400 A EP 05101400A EP 05101400 A EP05101400 A EP 05101400A EP 1571579 A1 EP1571579 A1 EP 1571579A1
- Authority
- EP
- European Patent Office
- Prior art keywords
- articles
- news
- information
- novelty
- documents
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9535—Search customisation based on user profiles and personalisation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9536—Search customisation based on social or collaborative filtering
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9538—Presentation of query results
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99933—Query processing, i.e. searching
- Y10S707/99935—Query augmenting and refining, e.g. inexact access
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99933—Query processing, i.e. searching
- Y10S707/99936—Pattern matching access
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99941—Database schema or data structure
- Y10S707/99942—Manipulating data structure, e.g. compression, compaction, compilation
Definitions
- the present invention relates generally to computer systems and more particularly, the present invention relates to systems and methods that personalize temporal streams of information such as news via an automated analysis of information dynamics.
- the present invention provides systems and methods for identifying information novelty and on how these methods can be applied to manage information content that evolves over time.
- a general framework is provided for comparing collections of documents, whereby documents can be assumed to be organized into groups by their content or source, and analyzed for inter-group and intra-group differences and commonalities. For example, juxtaposing two groups of documents devoted to the same topic but derived from two distinct sources, e.g., news coverage of an event in different parts of the world can reveal interesting differences of opinions and overall interpretations of situations.
- the evolution of content can be examined. For example, a stream of news articles can be examined over time on a common story, with the goal of highlighting truly informative updates and filtering out a large mass of articles that largely relay "more of the same.”
- Detailed statistics can be gathered on word occurrence across sets of documents in order to characterize differences and similarities among these sets.
- Various word models can be enhanced by extracting named entities that denote names of people, organizations, and geographical locations, for example.
- phrases and collocations whose discriminative semantic properties are usually outweighed by lack of sufficient statistics--named entities identify relatively stable tokens that are used in a common manner by many writers on a given topic, and thus their use contributes a considerable amount of information.
- one type of analysis provided represents articles using the named entities found in them. Analysis can be focused on live streams of news or other topics. Live news streams pose tantalizing challenges and opportunities for research.
- News feeds span enormous amounts of data, present a cornucopia of opinions and views, and include a wide spectrum of formats and content from short updates on breaking news, to major recaps of story developments, to mere reiterations of "the same old facts" reported over and over again.
- Algorithms can be developed that identify significant updates on stories being tracked, relieving the users from having to sift through long lists of similar articles arriving from different sources.
- the methods provided in accordance with the present invention provide the basis for personalized news portal and news alerting services that seek to minimize the time and disruptions to users who desire to follow evolving news stories.
- the subject invention provides various architectural components for analyzing information and filtering content for users.
- a framework is provided for identifying differences in sets of documents by analyzing the distributions of words and recognized named entities. This framework can be applied to compare individual documents, sets of documents, or a document and a set (for example, a new article vs. the union of previously reviewed news articles on the topic).
- Second, a collection of algorithms that operate on live news streams (or other temporally evolving streams) provide users with a personalized news experience. These algorithms have been implemented in an example system called News Junkie that presents users with maximally informative news updates. Users can request updates per user-defined periods or per each burst of reports about a story.
- Users can also tune the desired degree of relevance of these updates to the core story, allowing delivery of offshoot articles that report on related or similar stories. Also, an evaluation method is provided which presents users with a single seed story and sets of articles ranked by different novelty-assessing metrics, and seeks to understand how participants perceive the novelty of these sets in the context of the seed story.
- the present invention relates to a system and method to identify information novelty and manage information content as it evolves over time.
- a system for distributing personalized information.
- the system includes a component that determines differences between two or more information items.
- An analyzer automatically determines a subset of the information items based in part on the determined differences and as data relating to the information items evolves over time.
- various methods are provided.
- a method for creating personalized information includes automatically analyzing documents from different information sources and automatically determining novelty of the documents. A personalized feed of information is then provided to the user based on the novelty of the documents.
- the systems and methods of the present invention can be applied to a plurality of different applications. These can include applications that assist with the design of ideal reading sequences or paths through currently unread news stories on a topic, within different time-horizons of recency from present time. For designing sequences for catching up on news , applications consider the most recent news as well as news bursts over time, to help people understand the evolution of a news story and navigate the history of stories by major events / updates. Other applications include developing different types of display designs and metaphors, such as the use of a time-line view or other aspects such as the notion of clusters in time.
- alerts can be provided when a news story appears with keywords if the information novelty is great enough, thus being more useful than simple keyword-centric alerting schemes.
- a component may be, but is not limited to being, a process running on a processor, a processor, an object, an executable, a thread of execution, a program, and/or a computer.
- an application running on a server and the server can be a component.
- One or more components may reside within a process and/or thread of execution and a component may be localized on one computer and/or distributed between two or more computers. Also, these components can execute from various computer readable media having various data structures stored thereon.
- the components may communicate via local and/or remote processes such as in accordance with a signal having one or more data packets (e.g. , data from one component interacting with another component in a local system, distributed system, and/or across a network such as the Internet with other systems via the signal).
- a signal having one or more data packets (e.g. , data from one component interacting with another component in a local system, distributed system, and/or across a network such as the Internet with other systems via the signal).
- an information dynamics system 100 is illustrated in accordance with an aspect of the present invention.
- the present invention provides systems and methods for identifying information novelty and on how these methods can be applied to manage information content that evolves over time.
- a general framework 100 is provided for comparing collections of documents 110 via a comparator 114, whereby documents can be organized into groups by their content or source 120, and analyzed by an analyzer 130 for inter-group and intra-group differences and commonalities. For example, juxtaposing two or more groups of documents or files devoted to the same topic but derived from two distinct sources, e.g., news coverage of an event in different parts of the world, can reveal interesting differences of opinions and overall interpretations of situations.
- the evolution of content can be examined. For example, a stream of news articles can be examined over time on a common story, with the goal of highlighting truly informative updates and filtering out a large mass of articles via a filter 140 that cooperates with the analyzer 130 to deliver personalized information at 150.
- a model based on words can be enhanced by extracting named entities that denote names of people, organizations, and geographical locations, for example.
- named entities that denote names of people, organizations, and geographical locations, for example.
- phrases and collocations whose discriminative semantic properties are usually outweighed by lack of sufficient statistics--named entities identify relatively stable tokens that are used in a common manner by many writers on a given topic, and so their use contributes a considerable amount of information.
- One type of analysis provided represents articles using the named entities found in them. Analysis can be focused on live streams of news or other temporal streams of data. In one example news feeds span enormous amounts of data, present a plurality of opinions and views, and include a wide spectrum of formats and content from short updates on breaking news, to major recaps of story developments, to mere reiterations of old facts reported over and over again.
- Algorithms which are described in more detail below can be provided in the comparator 114, analyzer 130 and/or filter 140 that identify updates on stories or streams being tracked, relieving users from having to sift through long lists of similar articles arriving from different news sources.
- Various methods provide the basis for a personalized news portal and news alerting services at 150 that seek to minimize the time and disruptions to users who desire to follow evolving stories. It is to be appreciated that although one example aspect of the present invention can be applied to analyzing and filtering information such as news, substantially any temporally evolving stream of information can be processed in accordance with the present invention.
- data can be collected from a plurality of different information sources such as from a user's laptop, mobile device, desktop computer, wherein such data can be cached (e.g., centralized server) and analyzed according to what data the user has previously observed.
- information can be generated from a plurality of sources such as from the Internet, for example, or in local contexts such as an internal company Intranet.
- a framework 210 for comparing text collections is illustrated in accordance with an aspect of the present invention. Given two or more sets of textual content, it is to be determined how differences are characterized between the sets. Determining differences is useful in a variety of applications, including automatic profiling and comparison of text collections, automatic identification of different views, scopes and interests reflected in the texts, and automatic identification of novel information. In general, several aspects of "difference" may be investigated as follows:
- Temporal differences include automatically assessing the novelty over time of news articles (or other type information) originating from live news feeds. Specifically, the following aspects are considered:
- Fig. 3 is a methodology 300 illustrating a process of characterizing novelty in accordance with an aspect of the present invention. While, for purposes of simplicity of explanation, the methodology is shown and described as a series of acts, it is to be understood and appreciated that the present invention is not limited by the order of acts, as some acts may, in accordance with the present invention, occur in different orders and/or concurrently with other acts from that shown and described herein. For example, those skilled in the art will understand and appreciate that a methodology could alternatively be represented as a series of interrelated states or events, such as in a state diagram. Moreover, not all illustrated acts may be required to implement a methodology in accordance with the present invention.
- NewsJunkie that implements a collection of algorithms and a number of visualization options for comparing text collections.
- NewsJunkie represents documents as a set of words augmented with named entities extracted from the text.
- Common extraction tools were also used for this purpose, which identified names of people, organizations and geographical locations.
- document groups contain documents with some common property, and constitute the basic unit of comparison. Examples of such common properties can be a particular topic or source of news (e.g ., blackout stories coming from the East Coast news agencies). Inferences are drawn about the differences between document groups by building a model for each group, and then comparing the models using a similarity metric as described below.
- NewsJunkie represents documents either as smoothed probability distributions over all the features (words + named entities), or as vectors of weighted features (in the same feature space).
- Weights can be assigned by the popular family of TF.IDF functions which use components representing the frequency of term occurrence in a document and the inverse frequency of term occurrence across documents. Probabilistic weighting functions can also be used. Different smoothing options can also be implemented to improve the term weighting estimates. For example, Laplace's law of succession, or linear smoothing with word probabilities in the entire text collection; the latter option was used throughout the experiments described below. It is noted that more than one smoothing option can be implemented within the system.
- Similarity metrics are determined for determining differences between information items such as a document or text.
- a common situation occurs where something interesting happens in the world, and the event is picked up by the news media. If the event is of sufficient public interest, the ensuing developments are tracked in the news as well.
- an initial report is read and, at some later time, users are interested in catching up with the story.
- the user's acute information-seeking goal can be satisfied in many ways and with many more updates than even the most avid news junkie has the time to review.
- Automated tools for sifting through a large quantity of documents on a topic that work to identify elements of genuinely new information can provide great value.
- a number of document similarity metrics can be employed to identify documents that are most different from a given set of documents (e.g., the union of those read previously), wherein a term distance metric is defined to emphasize the fact that documents are sought that are generally most dissimilar from a set of documents.
- Normalization by document length is typically essential, as, without normalization the NE score will tend to rise with length, because of the probabilistic influence of length on seeing additional named entities; the longer the document is, the higher the chance it contains more named entities.
- the distance metrics can be harnessed to identify novel information content for presentation to users.
- a novelty ranking algorithm is applied iteratively to produce a small set of articles that a reader may be interested in.
- a greedy, incremental analysis is employed. The algorithm initially compares substantially all the available updates to a seed story that the user has read, and selects the article least similar to it. This article is then added to the seed story (forming a group of two documents), and the algorithm looks for the next update most dissimilar to these articles combined, and so on.
- the pseudocode for the ranking algorithm is outlined below in Algorithm RANKNEWSBYNOVELTY.
- judging novelty is a subjective task.
- One way to obtain statistically meaningful results is to average the judgments of a set of users.
- participants were asked to read several sets of articles ordered by alternate metrics, and to decide which sets carried the most novel information. Note that this scenario generally requires the evaluators to keep in mind all the article sets they read until they rate them. Since it is difficult to keep several sets of articles on an unfamiliar topic in memory, the experiment was limited to evaluating the following three metrics:
- the first story was selected as the seed story, and used the three metrics described above to order the rest of the stories by novelty using the algorithm RANKNEWSBYNOVELTY.
- the algorithm first selects the most novel article relative to the seed story. This article is then added to the seed story to form a new model of what the user is familiar with, and the next most novel article selected.
- Three articles were selected in this manner for each of the three metrics and each of the 12 topics.
- the subjects were first asked to read the seed story to get background about the topic. They were then shown the three sets of articles (each set chosen by one of the metrics), and asked to rate the sets from most novel to least novel set. They were instructed to think of the task as identifying the set of articles that they would choose for a friend who had reviewed the seed story, and now desired to learn what was new.
- the presentation order of the sets generated by the three metrics was randomized across participants.
- Fig. 4 is a graph 400 illustrating results ranking in accordance with an aspect of the present invention. Overall, 111 user judgments on 12 topics were obtained, averaging 9-10 judgments per topic. Fig. 4 shows the number of times each metric was rated the most, medium and least novel. As can be observed from the graph 400, the sets generated by the KL and NE metrics were rated more novel than those produced by the baseline metric (ORG). Results by topic.
- Topic id Topic description #times most novel Mean rank KL NE ORG KL NE ORG topic 1 Pizza robbery 5 4 1 1.7 1.6 2.7 topic 2 RIAA rues MP3 users 2 7 0 1.8 1.2 3.0 topic 3 Sharon visits India 2 3 4 2.6 1.7 1.8 topic 4 Pope visits Slovakia 9 0 0 1.0 2.2 2.8 topic 5 Swedish FM killed 5 4 0 1.4 1.6 3.0 topic 6 Al-gori 8 1 0 1.1 2.1 2.8 topic 7 CA governor recall 4 2 3 1.7 2.2 2.1 topic 8 MS bugs 3 5 1 1.9 1.6 2.6 topic 9 SARS in Singapore 7 1 1 1.3 2.0 2.7 topic 10 Iran develops A-bomb 3 5 2 2.2 1.7 2.1 topic 11 NASA investigation 2 5 3 2.1 1.6 2.3 topic 12 Hurricane Isabel 4 5 0 1.9 1.6 2.6 2.6
- Table 1 presents per-topic results.
- the three penultimate columns show the number of times each metric was rated the most novel for each topic.
- the last three columns show mean ranks of the metrics, assuming the most novel is assigned the rank of 1, medium novel - 2, and least novel - 3.
- Fig. 5 illustrates a personalized update process 500 in accordance with an aspect of the present invention.
- the algorithm RANKNEWSBYNOVELTY presented and evaluated in the previous section tends to work under the assumption that a user wants to catch up with latest story developments some time after initially reading about it.
- the algorithm orders the recent articles by their novelty compared to the seed story, and then the user can read a number of highest-scoring articles depending on how much spare time he or she can allocate for the reading.
- Logistic support such as a collection server would keep track of the articles the user reads in order to estimate the novelty of the new articles streaming in the news or information feed. Based on user's personal preferences, for example, how often the user is interested in getting updates on the story, the server decides which articles to display. Therefore, an online decision mechanism can be provided that determines whether an article contains sufficiently new information to warrant its delivery to the user.
- an online decision mechanism can be provided that determines whether an article contains sufficiently new information to warrant its delivery to the user.
- the original novelty algorithm is modified as shown below relating to pick a periodic update.
- a period of a day was used, so the algorithm identifies daily updates for a user.
- algorithm PICKDAILYUPDATE compares the articles received today with the union of all the articles received the day before .
- the algorithm attempts to select the most informative update compared to what was known yesterday, and shows it to the user, provided that the update carries enough new information ( i.e ., its estimated novelty is above the user's personalized threshold).
- Such conditioning endows the system with the ability to relay to the user informative updates and to filter out articles that only recap previously known details.
- the algorithm can be generalized to identify n most informative updates per day.
- the algorithm presented above at 510 can be largely an "offline" procedure, as it updates users at predefined time intervals. Hardcore news junkies may find it frustrating to wait for daily scheduled news updates. For some, a more responsive form of analysis may be desired.
- breaking news events may be processed at 520 of Fig. 5 where a sliding window is used covering a number of preceding articles to estimate the novelty of the current one. It is noted that estimating distances between articles and a preceding window of fixed-length facilitates the comparison of scores, and different window lengths of 20-60 articles were evaluated. It was found that lengths of approximately 40 typically worked well in practice.
- a median filter provides this functionality by reducing the amount of noise in the signal.
- the filter successively considers each data point in the signal and adapts it to better resemble its surroundings, effectively smoothing the original signal and removing outliers.
- a median filter of width w first sorts w data points within the window centered on the current point, and then replaces the latter with the median value of these points.
- the resultant signal is passed through a median filter.
- filters include widths of 3-7, for example; the filter of width 5 appears to work well in the majority of cases.
- dist is the distance metric
- D a sequence of relevant articles
- l sliding window length
- fw median filter width
- thresh user-defined sensitivity threshold.
- a median filter may delay the routing of novel articles to users, since several following articles may need to be considered to reliably detect the beginning of a new burst.
- delays are rather small (half the width of the median filter used), and the utility of the filter more than compensates for this inconvenience.
- the algorithm can scan forward several dozens of articles from the moment a burst is detected, in order to select the most informative update instead of simply picking the one that starts the burst.
- Combination approaches are also feasible such as the rendering of an early update on breaking news, and then waiting for a more informed burst analysis to send the best article on the development.
- the algorithm above shows the pseudocode for IDENTIFYBREAKINGNEWS that implements burst analysis for news alerting.
- Fig. 6 shows the application of the algorithm IDENTIFYBREAKINGNEWS to a sample topic.
- the topic in question is devoted to a bank robbery case in Erie, Pennsylvania, USA, where a group of criminals apparently seized a pizza delivery man, locked a bomb device to his neck and, according to statements made by the delivery man, forced him to rob a local bank. The man was promptly apprehended by police, but soon afterwards the device detonated and killed him. The strange initial story and ensuing investigation were tracked by many news sources for several weeks starting in September 2003.
- the x-axis of the figure corresponds to the sequence of articles as they arrived in time, and the y-axis plots (raw and median-filtered) distance values for each article given the preceding sliding window.
- Raw distance scores are represented by a dotted line, and filtered scores are plotted with a solid line.
- the text boxes accompanying Fig. 6 comment on the actual events that correspond to the identified novelty bursts, and show which potentially spurious peaks have been discarded by the filter.
- the smoothed novelty score which incorporates the median filter, captures the main developments in the story (interviews with friends, details about the weapon, FBI bulletin for two suspects, and a copycat case), while at the same time filtering out spurious peaks of novelty.
- characterization of article types and user controls are considered.
- novelty scores alone should not be relied upon as a sole selection criterion; some articles are identified as novel by virtue a change in topic.
- a classification of types of novelty is formulated, based on different relationships between an article and a seed story or topic of interest. Examples of these classes of relationships include:
- relationship types 2 and 3 are probably what most users want to see when they are tracking a topic.
- a new type of document analysis can be provided that scrutinizes intra-document dynamics. As opposed to previous types of analysis that compared entire documents to one another, this technique "zooms into” documents estimating the relevance of their parts.
- a model is constructed for every document, and a fixed distance metric is used, e.g. , KL divergence. Then, for each document, a distance score, of a sliding window of words within the document versus the seed story, is computed.
- the score of a window of words can be construed as a sum of point-wise scores of each word in the window vs. the seed story, as stipulated by comparing the model of the within document window with that of the seed story using the selected metric.
- a useful property of this technique is that it goes beyond the proverbial bag of words , and considers the document words in their original context. It was opted for using sliding contextual windows rather than apparently more appealing paragraph units, since using a fixed-length window makes distance scores directly comparable. Another obvious choice of the comparison unit would be individual sentences. However, it was believed that performing this analysis at the sentence level would consider too little information, and the range of possible scores would be too large to be useful.
- Fig. 7 shows sample results of intra-document analysis.
- a seed story for this analysis was a report on a new case of SARS in Singapore. Articles that mostly recap what has already been said typically have a very limited dynamic range and low absolute scores. Elaboration articles usually have higher absolute scores that reflect the new information they carry. One elaboration for this story reported that the patient's wife was being held under quarantine. Further along this spectrum, articles that may qualify as offshoots but are still anchored to the events described in the seed story have a much wider dynamic range.
- One offshoot was a story that focused on the impact of SARS on the Asian stock market, and another was on progress on a SARS vaccine. Both offshoot articles used the recent case as a starting point, but were really about a related topic. It is believed that analyzing intra-document dynamics such as the dynamic range and patterns of novelty scores are useful in identifying different types of information that readers would like to follow.
- the Web has been providing users with a rich set of news sources. It is deceptively easy for Internet surfers to browse multitudes of sources in pursuit of news updates, yet sifting through large quantities of news can involve the reading of large quantities of redundant material.
- a collection of algorithms have been presented that analyze news feeds and identify articles that carry most novel information given a model of what the user has read before.
- a word-based representation has been extended with named entities extracted from the text. Using this representation a variety of distance metrics are employed to estimate the dissimilarity between each news article and a collection of articles (e.g., previously read stories).
- the techniques underlying the algorithms analyze inter- and intra-document dynamics by studying how the delivery of information evolves over time from article to article, as well as within each individual article at the level of contextual word windows.
- News browsers or server-based services incorporating these algorithms can offer users a personalized news experience, giving users the ability to tune both the desired frequency of news updates and the degree to which these updates should be similar to the seed story, via exercising control over the novelty constraint. More sophisticated distance metrics can be provided that incorporate some of the basic metrics described herein, as well as more detailed profiles of within-document patterns.
- Figs. 8-11 illustrate example user interfaces in accordance with an aspect of the present invention.
- Fig. 8 illustrates a list of news stories at 810, wherein a particular topic is selected from the news stories at 810 and displayed at 820 (e.g., Investigators Probe).
- the display 820 displays news items of interest relating to the selected topic.
- a particular news item is displayed which is selected from the list at 820.
- Fig. 9 illustrates that after a topic is selected, it can be listed under an already read section at 910.
- Fig. 10 illustrates how a subsequent novel article appears at 1010 that is then inspected or read at 1020.
- Fig. 11 shows how the read item of 1020 is then placed into an already read location at 1110.
- an exemplary environment 1210 for implementing various aspects of the invention includes a computer 1212.
- the computer 1212 includes a processing unit 1214, a system memory 1216, and a system bus 1218.
- the system bus 1218 couples system components including, but not limited to, the system memory 1216 to the processing unit 1214.
- the processing unit 1214 can be any of various available processors. Dual microprocessors and other multiprocessor architectures also can be employed as the processing unit 1214.
- the system bus 1218 can be any of several types of bus structure(s) including the memory bus or memory controller, a peripheral bus or external bus, and/or a local bus using any variety of available bus architectures including, but not limited to, 16-bit bus, Industrial Standard Architecture (ISA), Micro-Channel Architecture (MSA), Extended ISA (EISA), Intelligent Drive Electronics (IDE), VESA Local Bus (VLB), Peripheral Component Interconnect (PCI), Universal Serial Bus (USB), Advanced Graphics Port (AGP), Personal Computer Memory Card International Association bus (PCMCIA), and Small Computer Systems Interface (SCSI).
- ISA Industrial Standard Architecture
- MSA Micro-Channel Architecture
- EISA Extended ISA
- IDE Intelligent Drive Electronics
- VLB VESA Local Bus
- PCI Peripheral Component Interconnect
- USB Universal Serial Bus
- AGP Advanced Graphics Port
- PCMCIA Personal Computer Memory Card International Association bus
- SCSI Small Computer Systems Interface
- the system memory 1216 includes volatile memory 1220 and nonvolatile memory 1222.
- the basic input/output system (BIOS) containing the basic routines to transfer information between elements within the computer 1212, such as during start-up, is stored in nonvolatile memory 1222.
- nonvolatile memory 1222 can include read only memory (ROM), programmable ROM (PROM), electrically programmable ROM (EPROM), electrically erasable ROM (EEPROM), or flash memory.
- Volatile memory 1220 includes random access memory (RAM), which acts as external cache memory.
- RAM is available in many forms such as synchronous RAM (SRAM), dynamic RAM (DRAM), synchronous DRAM (SDRAM), double data rate SDRAM (DDR SDRAM), enhanced SDRAM (ESDRAM), Synchlink DRAM (SLDRAM), and direct Rambus RAM (DRRAM).
- SRAM synchronous RAM
- DRAM dynamic RAM
- SDRAM synchronous DRAM
- DDR SDRAM double data rate SDRAM
- ESDRAM enhanced SDRAM
- SLDRAM Synchlink DRAM
- DRRAM direct Rambus RAM
- Computer 1212 also includes removable/non-removable, volatile/non-volatile computer storage media.
- Fig. 12 illustrates, for example a disk storage 1224.
- Disk storage 1224 includes, but is not limited to, devices like a magnetic disk drive, floppy disk drive, tape drive, Jaz drive, Zip drive, LS-100 drive, flash memory card, or memory stick.
- disk storage 1224 can include storage media separately or in combination with other storage media including, but not limited to, an optical disk drive such as a compact disk ROM device (CD-ROM), CD recordable drive (CD-R Drive), CD rewritable drive (CD-RW Drive) or a digital versatile disk ROM drive (DVD-ROM).
- CD-ROM compact disk ROM device
- CD-R Drive CD recordable drive
- CD-RW Drive CD rewritable drive
- DVD-ROM digital versatile disk ROM drive
- a removable or non-removable interface is typically used such as interface 1226.
- Fig 12 describes software that acts as an intermediary between users and the basic computer resources described in suitable operating environment 1210.
- Such software includes an operating system 1228.
- Operating system 1228 which can be stored on disk storage 1224, acts to control and allocate resources of the computer system 1212.
- System applications 1230 take advantage of the management of resources by operating system 1228 through program modules 1232 and program data 1234 stored either in system memory 1216 or on disk storage 1224. It is to be appreciated that the present invention can be implemented with various operating systems or combinations of operating systems.
- Input devices 1236 include, but are not limited to, a pointing device such as a mouse, trackball, stylus, touch pad, keyboard, microphone, joystick, game pad, satellite dish, scanner, TV tuner card, digital camera, digital video camera, web camera, and the like. These and other input devices connect to the processing unit 1214 through the system bus 1218 via interface port(s) 1238.
- Interface port(s) 1238 include, for example, a serial port, a parallel port, a game port, and a universal serial bus (USB).
- Output device(s) 1240 use some of the same type of ports as input device(s) 1236.
- a USB port may be used to provide input to computer 1212, and to output information from computer 1212 to an output device 1240.
- Output adapter 1242 is provided to illustrate that there are some output devices 1240 like monitors, speakers, and printers, among other output devices 1240, that require special adapters.
- the output adapters 1242 include, by way of illustration and not limitation, video and sound cards that provide a means of connection between the output device 1240 and the system bus 1218. It should be noted that other devices and/or systems of devices provide both input and output capabilities such as remote computer(s) 1244.
- Computer 1212 can operate in a networked environment using logical connections to one or more remote computers, such as remote computer(s) 1244.
- the remote computer(s) 1244 can be a personal computer, a server, a router, a network PC, a workstation, a microprocessor based appliance, a peer device or other common network node and the like, and typically includes many or all of the elements described relative to computer 1212. For purposes of brevity, only a memory storage device 1246 is illustrated with remote computer(s) 1244.
- Remote computer(s) 1244 is logically connected to computer 1212 through a network interface 1248 and then physically connected via communication connection 1250.
- Network interface 1248 encompasses communication networks such as local-area networks (LAN) and wide-area networks (WAN).
- LAN technologies include Fiber Distributed Data Interface (FDDI), Copper Distributed Data Interface (CDDI), Ethernet/IEEE 1102.3, Token Ring/IEEE 1102.5 and the like.
- WAN technologies include, but are not limited to, point-to-point links, circuit switching networks like Integrated Services Digital Networks (ISDN) and variations thereon, packet switching networks, and Digital Subscriber Lines (DSL).
- ISDN Integrated Services Digital Networks
- DSL Digital Subscriber Lines
- Communication connection(s) 1250 refers to the hardware/software employed to connect the network interface 1248 to the bus 1218. While communication connection 1250 is shown for illustrative clarity inside computer 1212, it can also be external to computer 1212.
- the hardware/software necessary for connection to the network interface 1248 includes, for exemplary purposes only, internal and external technologies such as, modems including regular telephone grade modems, cable modems and DSL modems, ISDN adapters, and Ethernet cards.
- Fig. 13 is a schematic block diagram of a sample-computing environment 1300 with which the present invention can interact.
- the system 1300 includes one or more client(s) 1310.
- the client(s) 1310 can be hardware and/or software (e.g., threads, processes, computing devices).
- the system 1300 also includes one or more server(s) 1330.
- the server(s) 1330 can also be hardware and/or software (e.g., threads, processes, computing devices).
- the servers 1330 can house threads to perform transformations by employing the present invention, for example.
- One possible communication between a client 1310 and a server 1330 may be in the form of a data packet adapted to be transmitted between two or more computer processes.
- the system 1300 includes a communication framework 1350 that can be employed to facilitate communications between the client(s) 1310 and the server(s) 1330.
- the client(s) 1310 are operably connected to one or more client data store(s) 1360 that can be employed to store information local to the client(s) 1310.
- the server(s) 1330 are operably connected to one or more server data store(s) 1340 that can be employed to store information local to the servers 1330.
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US549371 | 2000-04-14 | ||
US54937104P | 2004-03-02 | 2004-03-02 | |
US827729 | 2004-04-20 | ||
US10/827,729 US7293019B2 (en) | 2004-03-02 | 2004-04-20 | Principles and methods for personalizing newsfeeds via an analysis of information novelty and dynamics |
Publications (1)
Publication Number | Publication Date |
---|---|
EP1571579A1 true EP1571579A1 (fr) | 2005-09-07 |
Family
ID=34915631
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP05101400A Ceased EP1571579A1 (fr) | 2004-03-02 | 2005-02-24 | Principes et procédés pour la personnalisation de newsfeeds par une analyse de la nouveauté et de la dynamique de l'information |
Country Status (9)
Country | Link |
---|---|
US (1) | US7293019B2 (fr) |
EP (1) | EP1571579A1 (fr) |
JP (1) | JP4845392B2 (fr) |
KR (1) | KR101114012B1 (fr) |
CN (2) | CN101256591B (fr) |
AU (1) | AU2005200877B2 (fr) |
BR (1) | BRPI0500612A (fr) |
CA (1) | CA2498376C (fr) |
RU (1) | RU2382401C2 (fr) |
Cited By (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2007098398A1 (fr) * | 2006-02-16 | 2007-08-30 | Newsgator Technologies, Inc. | Système et procédé de synchronisation d'un contenu de fil de syndication |
EP2211281A3 (fr) * | 2009-01-27 | 2011-04-20 | Palo Alto Research Center Incorporated | Système et procédé pour utiliser une pertinence d'objet enrubanné et période de priorisation de l'article |
EP2211282A3 (fr) * | 2009-01-27 | 2011-05-18 | Palo Alto Research Center Incorporated | Système et procédé pour gérer l'attention de l'utilisateur en détectant les sujets chauds et froids dans les indices sociaux |
CN101373351B (zh) * | 2007-08-21 | 2011-06-22 | 京瓷美达株式会社 | 自动供稿装置、图像读取装置及图像形成装置 |
US8010545B2 (en) | 2008-08-28 | 2011-08-30 | Palo Alto Research Center Incorporated | System and method for providing a topic-directed search |
US8073682B2 (en) | 2007-10-12 | 2011-12-06 | Palo Alto Research Center Incorporated | System and method for prospecting digital information |
US8165985B2 (en) | 2007-10-12 | 2012-04-24 | Palo Alto Research Center Incorporated | System and method for performing discovery of digital information in a subject area |
US8209616B2 (en) | 2008-08-28 | 2012-06-26 | Palo Alto Research Center Incorporated | System and method for interfacing a web browser widget with social indexing |
US8356044B2 (en) | 2009-01-27 | 2013-01-15 | Palo Alto Research Center Incorporated | System and method for providing default hierarchical training for social indexing |
US8549016B2 (en) | 2008-11-14 | 2013-10-01 | Palo Alto Research Center Incorporated | System and method for providing robust topic identification in social indexes |
US8671104B2 (en) | 2007-10-12 | 2014-03-11 | Palo Alto Research Center Incorporated | System and method for providing orientation into digital information |
US9031944B2 (en) | 2010-04-30 | 2015-05-12 | Palo Alto Research Center Incorporated | System and method for providing multi-core and multi-level topical organization in social indexes |
EP2664997A3 (fr) * | 2012-05-18 | 2015-08-12 | Xerox Corporation | Système et procédé de résolution de coréférence d'entité nommée |
US9665828B2 (en) | 2014-01-16 | 2017-05-30 | International Business Machines Corporation | Using physicochemical correlates of perceptual flavor similarity to enhance, balance and substitute flavors |
Families Citing this family (151)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8706747B2 (en) | 2000-07-06 | 2014-04-22 | Google Inc. | Systems and methods for searching using queries written in a different character-set and/or language from the target pages |
US8959019B2 (en) * | 2002-10-31 | 2015-02-17 | Promptu Systems Corporation | Efficient empirical determination, computation, and use of acoustic confusability measures |
US20060240851A1 (en) * | 2003-03-21 | 2006-10-26 | Vocel, Inc. | Interactive messaging system |
US7599938B1 (en) | 2003-07-11 | 2009-10-06 | Harrison Jr Shelton E | Social news gathering, prioritizing, tagging, searching, and syndication method |
CA2475189C (fr) * | 2003-07-17 | 2009-10-06 | At&T Corp. | Methode et dispositif de couplage de fenetres dans des compresseurs delta |
US7293019B2 (en) * | 2004-03-02 | 2007-11-06 | Microsoft Corporation | Principles and methods for personalizing newsfeeds via an analysis of information novelty and dynamics |
US20050289147A1 (en) * | 2004-06-25 | 2005-12-29 | Jessica Kahn | News feed viewer |
US8972444B2 (en) * | 2004-06-25 | 2015-03-03 | Google Inc. | Nonstandard locality-based text entry |
US8392453B2 (en) * | 2004-06-25 | 2013-03-05 | Google Inc. | Nonstandard text entry |
US7865511B2 (en) * | 2004-06-25 | 2011-01-04 | Apple Inc. | News feed browser |
US7596571B2 (en) * | 2004-06-30 | 2009-09-29 | Technorati, Inc. | Ecosystem method of aggregation and search and related techniques |
AU2006225078B2 (en) * | 2005-03-16 | 2008-11-06 | Airscape Technology Pty. Limited | Method for distributing computing between server and client |
US20060235885A1 (en) * | 2005-04-18 | 2006-10-19 | Virtual Reach, Inc. | Selective delivery of digitally encoded news content |
US7958446B2 (en) * | 2005-05-17 | 2011-06-07 | Yahoo! Inc. | Systems and methods for language translation in network browsing applications |
US20060265472A1 (en) * | 2005-05-17 | 2006-11-23 | Yahoo! Inc. | Systems and methods for providing short message service features and user interfaces therefor in network browsing applications |
US20070174286A1 (en) * | 2005-05-17 | 2007-07-26 | Yahoo!, Inc. | Systems and methods for providing features and user interface in network browsing applications |
US9582602B2 (en) * | 2005-05-17 | 2017-02-28 | Excalibur Ip, Llc | Systems and methods for improving access to syndication feeds in network browsing applications |
US20070033290A1 (en) * | 2005-08-03 | 2007-02-08 | Valen Joseph R V Iii | Normalization and customization of syndication feeds |
US9268867B2 (en) * | 2005-08-03 | 2016-02-23 | Aol Inc. | Enhanced favorites service for web browsers and web applications |
US8739020B2 (en) | 2005-08-03 | 2014-05-27 | Aol Inc. | Enhanced favorites service for web browsers and web applications |
US7702675B1 (en) * | 2005-08-03 | 2010-04-20 | Aol Inc. | Automated categorization of RSS feeds using standardized directory structures |
US8190997B2 (en) * | 2005-10-07 | 2012-05-29 | Google Inc. | Personalized content feed suggestions page |
US8949154B2 (en) * | 2005-10-07 | 2015-02-03 | Google Inc. | Content feed user interface with gallery display of same-type items |
US7853485B2 (en) | 2005-11-22 | 2010-12-14 | Nec Laboratories America, Inc. | Methods and systems for utilizing content, dynamic patterns, and/or relational information for data analysis |
KR100684160B1 (ko) | 2005-12-08 | 2007-02-20 | 한국전자통신연구원 | 개체명 인식을 이용한 대화 분석 장치 및 방법 |
US8327297B2 (en) | 2005-12-16 | 2012-12-04 | Aol Inc. | User interface system for handheld devices |
US20070143300A1 (en) * | 2005-12-20 | 2007-06-21 | Ask Jeeves, Inc. | System and method for monitoring evolution over time of temporal content |
US9459622B2 (en) | 2007-01-12 | 2016-10-04 | Legalforce, Inc. | Driverless vehicle commerce network and community |
US8874489B2 (en) | 2006-03-17 | 2014-10-28 | Fatdoor, Inc. | Short-term residential spaces in a geo-spatial environment |
US20070218900A1 (en) | 2006-03-17 | 2007-09-20 | Raj Vasant Abhyanker | Map based neighborhood search and community contribution |
JP4542993B2 (ja) * | 2006-01-13 | 2010-09-15 | 株式会社東芝 | 構造化文書抽出装置、構造化文書抽出方法および構造化文書抽出プログラム |
US8965409B2 (en) | 2006-03-17 | 2015-02-24 | Fatdoor, Inc. | User-generated community publication in an online neighborhood social network |
US9373149B2 (en) | 2006-03-17 | 2016-06-21 | Fatdoor, Inc. | Autonomous neighborhood vehicle commerce network and community |
US20080201156A1 (en) * | 2007-02-21 | 2008-08-21 | Fatdoor, Inc. | User-generated community publication in a geo-spatial environment |
US9098545B2 (en) | 2007-07-10 | 2015-08-04 | Raj Abhyanker | Hot news neighborhood banter in a geo-spatial social network |
US9071367B2 (en) | 2006-03-17 | 2015-06-30 | Fatdoor, Inc. | Emergency including crime broadcast in a neighborhood social network |
US8738545B2 (en) | 2006-11-22 | 2014-05-27 | Raj Abhyanker | Map based neighborhood search and community contribution |
US9002754B2 (en) | 2006-03-17 | 2015-04-07 | Fatdoor, Inc. | Campaign in a geo-spatial environment |
US8732091B1 (en) | 2006-03-17 | 2014-05-20 | Raj Abhyanker | Security in a geo-spatial environment |
US9070101B2 (en) | 2007-01-12 | 2015-06-30 | Fatdoor, Inc. | Peer-to-peer neighborhood delivery multi-copter and method |
US9037516B2 (en) | 2006-03-17 | 2015-05-19 | Fatdoor, Inc. | Direct mailing in a geo-spatial environment |
US9064288B2 (en) | 2006-03-17 | 2015-06-23 | Fatdoor, Inc. | Government structures and neighborhood leads in a geo-spatial environment |
US7451120B1 (en) * | 2006-03-20 | 2008-11-11 | Google Inc. | Detecting novel document content |
US20070265870A1 (en) * | 2006-04-19 | 2007-11-15 | Nec Laboratories America, Inc. | Methods and systems for utilizing a time factor and/or asymmetric user behavior patterns for data analysis |
US20070260586A1 (en) * | 2006-05-03 | 2007-11-08 | Antonio Savona | Systems and methods for selecting and organizing information using temporal clustering |
US8010645B2 (en) * | 2006-05-12 | 2011-08-30 | Sharp Laboratories Of America, Inc. | Method and apparatus for providing feeds to users |
US7831928B1 (en) | 2006-06-22 | 2010-11-09 | Digg, Inc. | Content visualization |
US7865513B2 (en) * | 2006-06-30 | 2011-01-04 | Rearden Commerce, Inc. | Derivation of relationships between data sets using structured tags or schemas |
US20080005148A1 (en) * | 2006-06-30 | 2008-01-03 | Rearden Commerce, Inc. | Automated knowledge base of feed tags |
US20080026742A1 (en) * | 2006-07-28 | 2008-01-31 | Sony Ericsson Mobile Communications Ab | Information nugget sharing among mobile phones |
US8271429B2 (en) | 2006-09-11 | 2012-09-18 | Wiredset Llc | System and method for collecting and processing data |
US7801901B2 (en) * | 2006-09-15 | 2010-09-21 | Microsoft Corporation | Tracking storylines around a query |
US8230361B2 (en) | 2006-09-28 | 2012-07-24 | Google Inc. | Content feed user interface |
US8645497B2 (en) * | 2006-09-28 | 2014-02-04 | Google Inc. | Bookmark-based access to content feeds |
US8694607B2 (en) * | 2006-10-06 | 2014-04-08 | Google Inc. | Recursive subscriptions to content feeds |
US20080091828A1 (en) * | 2006-10-16 | 2008-04-17 | Rearden Commerce, Inc. | Method and system for fine and course-grained authorization of personal feed contents |
US7752328B2 (en) * | 2006-10-16 | 2010-07-06 | Rearden Commerce, Inc. | System and method for view of transactions and events with dynamic updates |
US8863245B1 (en) | 2006-10-19 | 2014-10-14 | Fatdoor, Inc. | Nextdoor neighborhood social network method, apparatus, and system |
US7979425B2 (en) * | 2006-10-25 | 2011-07-12 | Google Inc. | Server-side match |
US8025220B2 (en) * | 2006-11-10 | 2011-09-27 | Fair Isaac Corporation | Cardholder localization based on transaction data |
US8316000B2 (en) | 2006-12-07 | 2012-11-20 | At&T Intellectual Property Ii, L.P. | Method and apparatus for using tag topology |
US20080155118A1 (en) * | 2006-12-21 | 2008-06-26 | International Business Machines Corporation | Really simple syndication (rss) feed customization |
US7562088B2 (en) * | 2006-12-27 | 2009-07-14 | Sap Ag | Structure extraction from unstructured documents |
US20080294663A1 (en) * | 2007-05-14 | 2008-11-27 | Heinley Brandon J | Creation and management of visual timelines |
US8290921B2 (en) * | 2007-06-28 | 2012-10-16 | Microsoft Corporation | Identification of similar queries based on overall and partial similarity of time series |
US20090030889A1 (en) * | 2007-07-25 | 2009-01-29 | Ehud Chatow | Viewing of feeds |
US8442969B2 (en) * | 2007-08-14 | 2013-05-14 | John Nicholas Gross | Location based news and search engine |
WO2009023828A1 (fr) * | 2007-08-15 | 2009-02-19 | Indiana University Research & Technology Corporation | Système et procédé de mesure de la clarté d'images utilisée dans un système de reconnaissance de l'iris |
US20090070346A1 (en) * | 2007-09-06 | 2009-03-12 | Antonio Savona | Systems and methods for clustering information |
US8060634B1 (en) | 2007-09-26 | 2011-11-15 | Google Inc. | Determining and displaying a count of unread items in content feeds |
US10025871B2 (en) | 2007-09-27 | 2018-07-17 | Google Llc | Setting and displaying a read status for items in content feeds |
US8024347B2 (en) | 2007-09-27 | 2011-09-20 | International Business Machines Corporation | Method and apparatus for automatically differentiating between types of names stored in a data collection |
US20090089380A1 (en) * | 2007-09-28 | 2009-04-02 | Microsoft Corporation | Aggregating and Delivering Information |
US20090100031A1 (en) * | 2007-10-12 | 2009-04-16 | Tele Atlas North America, Inc. | Method and System for Detecting Changes in Geographic Information |
US8280885B2 (en) | 2007-10-29 | 2012-10-02 | Cornell University | System and method for automatically summarizing fine-grained opinions in digital text |
US11263543B2 (en) | 2007-11-02 | 2022-03-01 | Ebay Inc. | Node bootstrapping in a social graph |
US7958066B2 (en) | 2007-11-02 | 2011-06-07 | Hunch Inc. | Interactive machine learning advice facility |
US8032480B2 (en) * | 2007-11-02 | 2011-10-04 | Hunch Inc. | Interactive computing advice facility with learning based on user feedback |
US8484142B2 (en) * | 2007-11-02 | 2013-07-09 | Ebay Inc. | Integrating an internet preference learning facility into third parties |
US8666909B2 (en) | 2007-11-02 | 2014-03-04 | Ebay, Inc. | Interestingness recommendations in a computing advice facility |
US8494978B2 (en) | 2007-11-02 | 2013-07-23 | Ebay Inc. | Inferring user preferences from an internet based social interactive construct |
US9159034B2 (en) | 2007-11-02 | 2015-10-13 | Ebay Inc. | Geographically localized recommendations in a computing advice facility |
US8375073B1 (en) | 2007-11-12 | 2013-02-12 | Google Inc. | Identification and ranking of news stories of interest |
US20090144226A1 (en) * | 2007-12-03 | 2009-06-04 | Kei Tateno | Information processing device and method, and program |
US7814108B2 (en) * | 2007-12-21 | 2010-10-12 | Microsoft Corporation | Search engine platform |
US7996379B1 (en) | 2008-02-01 | 2011-08-09 | Google Inc. | Document ranking using word relationships |
US7970739B2 (en) * | 2008-04-30 | 2011-06-28 | International Business Machines Corporation | Method and system for maintaining profiles of information channels |
US20090292688A1 (en) * | 2008-05-23 | 2009-11-26 | Yahoo! Inc. | Ordering relevant content by time for determining top picks |
US8725716B1 (en) * | 2008-05-30 | 2014-05-13 | Google Inc. | Customized web summaries and alerts based on custom search engines |
US20100057536A1 (en) * | 2008-08-28 | 2010-03-04 | Palo Alto Research Center Incorporated | System And Method For Providing Community-Based Advertising Term Disambiguation |
US20100057577A1 (en) * | 2008-08-28 | 2010-03-04 | Palo Alto Research Center Incorporated | System And Method For Providing Topic-Guided Broadening Of Advertising Targets In Social Indexing |
US9411877B2 (en) | 2008-09-03 | 2016-08-09 | International Business Machines Corporation | Entity-driven logic for improved name-searching in mixed-entity lists |
CN102257487B (zh) * | 2008-10-07 | 2015-07-01 | 惠普开发有限公司 | 分析事件 |
US8293114B2 (en) * | 2008-10-10 | 2012-10-23 | Gambro Lundia Ab | Heat exchanger and method for heat exchanging |
US20100094822A1 (en) * | 2008-10-13 | 2010-04-15 | Rohit Dilip Kelapure | System and method for determining a file save location |
US20100114887A1 (en) * | 2008-10-17 | 2010-05-06 | Google Inc. | Textual Disambiguation Using Social Connections |
US8914359B2 (en) * | 2008-12-30 | 2014-12-16 | Microsoft Corporation | Ranking documents with social tags |
US8583603B2 (en) | 2009-04-02 | 2013-11-12 | Microsoft Corporation | Employing user-context in connection with backup or restore of data |
KR100910718B1 (ko) * | 2009-04-28 | 2009-08-04 | 황건하 | 동적 순위 갱신 시스템 및 그 갱신 방법 |
US9026641B2 (en) * | 2009-05-20 | 2015-05-05 | Genieo Innovation Ltd. | System and method for management of information streams delivered for use by a user |
US8407212B2 (en) * | 2009-05-20 | 2013-03-26 | Genieo Innovation Ltd. | System and method for generation of a customized web page based on user identifiers |
US8560575B2 (en) | 2009-11-12 | 2013-10-15 | Salesforce.Com, Inc. | Methods and apparatus for selecting updates to associated records to publish on an information feed in an on-demand database service environment |
US8429170B2 (en) * | 2010-02-05 | 2013-04-23 | Yahoo! Inc. | System and method for discovering story trends in real time from user generated content |
US20110219016A1 (en) * | 2010-03-04 | 2011-09-08 | Src, Inc. | Stream Mining via State Machine and High Dimensionality Database |
US8260789B2 (en) * | 2010-04-01 | 2012-09-04 | Microsoft Corporation | System and method for authority value obtained by defining ranking functions related to weight and confidence value |
US9361130B2 (en) * | 2010-05-03 | 2016-06-07 | Apple Inc. | Systems, methods, and computer program products providing an integrated user interface for reading content |
US8566348B2 (en) * | 2010-05-24 | 2013-10-22 | Intersect Ptp, Inc. | Systems and methods for collaborative storytelling in a virtual space |
WO2011149961A2 (fr) | 2010-05-24 | 2011-12-01 | Intersect Ptp, Inc. | Systèmes et procédés permettant d'identifier des intersections à l'aide de métadonnées de contenu |
US20110302103A1 (en) * | 2010-06-08 | 2011-12-08 | International Business Machines Corporation | Popularity prediction of user-generated content |
US8560554B2 (en) * | 2010-09-23 | 2013-10-15 | Salesforce.Com, Inc. | Methods and apparatus for selecting updates to associated records to publish on an information feed using importance weights in an on-demand database service environment |
US9076146B2 (en) | 2010-10-15 | 2015-07-07 | At&T Intellectual Property I, L.P. | Personal customer care agent |
US9286643B2 (en) | 2011-03-01 | 2016-03-15 | Applaud, Llc | Personalized memory compilation for members of a group and collaborative method to build a memory compilation |
US8615518B2 (en) * | 2011-04-11 | 2013-12-24 | Yahoo! Inc. | Real time association of related breaking news stories across different content providers |
EP2702481A4 (fr) * | 2011-04-26 | 2014-10-01 | Hewlett Packard Development Co | Procédé et système de prévision hiérarchique |
US9195771B2 (en) | 2011-08-09 | 2015-11-24 | Christian George STRIKE | System for creating and method for providing a news feed website and application |
US8782042B1 (en) * | 2011-10-14 | 2014-07-15 | Firstrain, Inc. | Method and system for identifying entities |
US8713028B2 (en) * | 2011-11-17 | 2014-04-29 | Yahoo! Inc. | Related news articles |
US8572107B2 (en) * | 2011-12-09 | 2013-10-29 | International Business Machines Corporation | Identifying inconsistencies in object similarities from multiple information sources |
US9633118B2 (en) | 2012-03-13 | 2017-04-25 | Microsoft Technology Licensing, Llc. | Editorial service supporting contrasting content |
US10275521B2 (en) * | 2012-10-13 | 2019-04-30 | John Angwin | System and method for displaying changes in trending topics to a user |
US9146969B2 (en) * | 2012-11-26 | 2015-09-29 | The Boeing Company | System and method of reduction of irrelevant information during search |
US9300492B2 (en) * | 2013-01-14 | 2016-03-29 | Dropbox, Inc. | Notification feed across multiple client devices |
WO2014172494A1 (fr) | 2013-04-16 | 2014-10-23 | Imageware Systems, Inc. | Admission et authentification biométriques soumises à des conditions et des situations |
US9286528B2 (en) | 2013-04-16 | 2016-03-15 | Imageware Systems, Inc. | Multi-modal biometric database searching methods |
US10635732B2 (en) | 2013-09-19 | 2020-04-28 | Facebook, Inc. | Selecting content items for presentation to a social networking system user in a newsfeed |
EP3058437A4 (fr) * | 2013-10-17 | 2017-06-07 | Samsung Electronics Co., Ltd. | Contextualisation de données de capteurs, de services et de dispositifs au moyen de dispositifs mobiles |
US9439367B2 (en) | 2014-02-07 | 2016-09-13 | Arthi Abhyanker | Network enabled gardening with a remotely controllable positioning extension |
US9457901B2 (en) | 2014-04-22 | 2016-10-04 | Fatdoor, Inc. | Quadcopter with a printable payload extension system and method |
US9004396B1 (en) | 2014-04-24 | 2015-04-14 | Fatdoor, Inc. | Skyteboard quadcopter and method |
US9022324B1 (en) | 2014-05-05 | 2015-05-05 | Fatdoor, Inc. | Coordination of aerial vehicles through a central server |
JP6209492B2 (ja) * | 2014-06-11 | 2017-10-04 | 日本電信電話株式会社 | イベント同一性判定方法、イベント同一性判定装置、イベント同一性判定プログラム |
US9971985B2 (en) | 2014-06-20 | 2018-05-15 | Raj Abhyanker | Train based community |
US9441981B2 (en) | 2014-06-20 | 2016-09-13 | Fatdoor, Inc. | Variable bus stops across a bus route in a regional transportation network |
US10601749B1 (en) | 2014-07-11 | 2020-03-24 | Twitter, Inc. | Trends in a messaging platform |
US10592539B1 (en) | 2014-07-11 | 2020-03-17 | Twitter, Inc. | Trends in a messaging platform |
US9451020B2 (en) | 2014-07-18 | 2016-09-20 | Legalforce, Inc. | Distributed communication of independent autonomous vehicles to provide redundancy and performance |
US20160055164A1 (en) * | 2014-08-25 | 2016-02-25 | Tll, Llc | News alert system and method |
US9984166B2 (en) * | 2014-10-10 | 2018-05-29 | Salesforce.Com, Inc. | Systems and methods of de-duplicating similar news feed items |
US10592841B2 (en) | 2014-10-10 | 2020-03-17 | Salesforce.Com, Inc. | Automatic clustering by topic and prioritizing online feed items |
CN105335467A (zh) * | 2015-09-25 | 2016-02-17 | 苏州天梯卓越传媒有限公司 | 一种用于出版行业热点选题的新颖性判断方法与系统 |
WO2017060795A1 (fr) * | 2015-10-07 | 2017-04-13 | Koninklijke Philips N.V. | Dispositif, système et procédé de détermination d'informations relatives à un clinicien |
US10372813B2 (en) * | 2017-01-17 | 2019-08-06 | International Business Machines Corporation | Selective content dissemination |
US10621177B2 (en) * | 2017-03-23 | 2020-04-14 | International Business Machines Corporation | Leveraging extracted entity and relation data to automatically filter data streams |
US10459450B2 (en) | 2017-05-12 | 2019-10-29 | Autonomy Squared Llc | Robot delivery system |
US10698876B2 (en) | 2017-08-11 | 2020-06-30 | Micro Focus Llc | Distinguish phrases in displayed content |
US11244013B2 (en) | 2018-06-01 | 2022-02-08 | International Business Machines Corporation | Tracking the evolution of topic rankings from contextual data |
CN109635089B (zh) * | 2018-12-14 | 2023-09-05 | 李华康 | 一种基于语义网络的文学作品新颖度评价系统和方法 |
CN111507110B (zh) * | 2019-01-30 | 2022-10-18 | 国家计算机网络与信息安全管理中心 | 一种突发事件检测方法、装置、设备及存储介质 |
RU2698916C1 (ru) * | 2019-03-14 | 2019-09-02 | Публичное Акционерное Общество "Сбербанк России" (Пао Сбербанк) | Способ и система поиска релевантных новостей |
CN112597269A (zh) * | 2020-12-25 | 2021-04-02 | 西南电子技术研究所(中国电子科技集团公司第十研究所) | 流式数据事件文本专题及检测系统 |
CN112668726B (zh) * | 2020-12-25 | 2023-07-11 | 中山大学 | 一种高效通信且保护隐私的个性化联邦学习方法 |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20020007364A1 (en) * | 2000-05-02 | 2002-01-17 | Mei Kobayashi | Detecting and tracking new events/classes of documents in a data base |
EP1378838A2 (fr) * | 2002-07-04 | 2004-01-07 | Hewlett-Packard Development Company | Evalution du netteté d'un document |
Family Cites Families (55)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US54130A (en) * | 1866-04-24 | Improvement in lever-power of windlasses | ||
US52930A (en) * | 1866-02-27 | Improvement in wrenches | ||
US83158A (en) * | 1868-10-20 | Frank a | ||
US154476A (en) * | 1874-08-25 | Improvement in sulky-plows | ||
US7364A (en) * | 1850-05-14 | Preventing fibers from winding ob drawing rollers in spinning | ||
US43231A (en) * | 1864-06-21 | Improved tire or hoop bender | ||
US32689A (en) * | 1861-07-02 | Improvement in projectiles for ordnance | ||
US46401A (en) * | 1865-02-14 | Improved milling-machine | ||
US80155A (en) * | 1868-07-21 | brisk ell | ||
US40591A (en) * | 1863-11-10 | Improvement in gas-heating apparatus | ||
US54174A (en) * | 1866-04-24 | Improved sad-iron | ||
US78204A (en) * | 1868-05-26 | Improved lounge | ||
US34078A (en) * | 1862-01-07 | Improvement in scroll-saws | ||
US80156A (en) * | 1868-07-21 | James k | ||
US40590A (en) * | 1863-11-10 | Improvement in wrenches | ||
US83025A (en) * | 1868-10-13 | Improved sofa-bedstead | ||
US52963A (en) * | 1866-03-06 | Improvement in carts | ||
US43232A (en) * | 1864-06-21 | Improvement in preserving fruits | ||
US44152A (en) * | 1864-09-13 | Improved tile-machine | ||
US99817A (en) * | 1870-02-15 | of chicago | ||
US87525A (en) * | 1869-03-02 | George tefft | ||
US5434777A (en) * | 1992-05-27 | 1995-07-18 | Apple Computer, Inc. | Method and apparatus for processing natural language |
US6311157B1 (en) * | 1992-12-31 | 2001-10-30 | Apple Computer, Inc. | Assigning meanings to utterances in a speech recognition system |
US5384892A (en) * | 1992-12-31 | 1995-01-24 | Apple Computer, Inc. | Dynamic language model for speech recognition |
US5555376A (en) * | 1993-12-03 | 1996-09-10 | Xerox Corporation | Method for granting a user request having locational and contextual attributes consistent with user policies for devices having locational attributes consistent with the user request |
US5812865A (en) * | 1993-12-03 | 1998-09-22 | Xerox Corporation | Specifying and establishing communication data paths between particular media devices in multiple media device computing systems based on context of a user or users |
US5493692A (en) * | 1993-12-03 | 1996-02-20 | Xerox Corporation | Selective delivery of electronic messages in a multiple computer system based on context and environment of a user |
US6035104A (en) | 1996-06-28 | 2000-03-07 | Data Link Systems Corp. | Method and apparatus for managing electronic documents by alerting a subscriber at a destination other than the primary destination |
GB9701866D0 (en) * | 1997-01-30 | 1997-03-19 | British Telecomm | Information retrieval |
US6418431B1 (en) * | 1998-03-30 | 2002-07-09 | Microsoft Corporation | Information retrieval and speech recognition based on language models |
US6209023B1 (en) * | 1998-04-24 | 2001-03-27 | Compaq Computer Corporation | Supporting a SCSI device on a non-SCSI transport medium of a network |
US6421711B1 (en) * | 1998-06-29 | 2002-07-16 | Emc Corporation | Virtual ports for data transferring of a data storage system |
US6470397B1 (en) * | 1998-11-16 | 2002-10-22 | Qlogic Corporation | Systems and methods for network and I/O device drivers |
US6466232B1 (en) * | 1998-12-18 | 2002-10-15 | Tangis Corporation | Method and system for controlling presentation of information to a user based on the user's condition |
US6363427B1 (en) * | 1998-12-18 | 2002-03-26 | Intel Corporation | Method and apparatus for a bulletin board system |
US7137069B2 (en) | 1998-12-18 | 2006-11-14 | Tangis Corporation | Thematic response to a computer user's context, such as by a wearable personal computer |
US6791580B1 (en) * | 1998-12-18 | 2004-09-14 | Tangis Corporation | Supplying notifications related to supply and consumption of user context data |
US6812937B1 (en) * | 1998-12-18 | 2004-11-02 | Tangis Corporation | Supplying enhanced computer user's context data |
US7055101B2 (en) | 1998-12-18 | 2006-05-30 | Tangis Corporation | Thematic response to a computer user's context, such as by a wearable personal computer |
US7076737B2 (en) | 1998-12-18 | 2006-07-11 | Tangis Corporation | Thematic response to a computer user's context, such as by a wearable personal computer |
US6513046B1 (en) | 1999-12-15 | 2003-01-28 | Tangis Corporation | Storing and recalling information to augment human memories |
US6842877B2 (en) * | 1998-12-18 | 2005-01-11 | Tangis Corporation | Contextual responses based on automated learning techniques |
US6801223B1 (en) | 1998-12-18 | 2004-10-05 | Tangis Corporation | Managing interactions between computer users' context models |
US6747675B1 (en) | 1998-12-18 | 2004-06-08 | Tangis Corporation | Mediating conflicts in computer user's context data |
US7107539B2 (en) | 1998-12-18 | 2006-09-12 | Tangis Corporation | Thematic response to a computer user's context, such as by a wearable personal computer |
US6389432B1 (en) * | 1999-04-05 | 2002-05-14 | Auspex Systems, Inc. | Intelligent virtual volume access |
AU2001249768A1 (en) | 2000-04-02 | 2001-10-15 | Tangis Corporation | Soliciting information based on a computer user's context |
CN1336610A (zh) * | 2000-07-27 | 2002-02-20 | 国际商业机器公司 | 网上商务交易的广告方法及其系统 |
US20020054130A1 (en) | 2000-10-16 | 2002-05-09 | Abbott Kenneth H. | Dynamically displaying current status of tasks |
US20020044152A1 (en) | 2000-10-16 | 2002-04-18 | Abbott Kenneth H. | Dynamic integration of computer generated and real world images |
US20030046401A1 (en) | 2000-10-16 | 2003-03-06 | Abbott Kenneth H. | Dynamically determing appropriate computer user interfaces |
GB2381638B (en) * | 2001-11-03 | 2004-02-04 | Dremedia Ltd | Identifying audio characteristics |
US6801917B2 (en) * | 2001-11-13 | 2004-10-05 | Koninklijke Philips Electronics N.V. | Method and apparatus for partitioning a plurality of items into groups of similar items in a recommender of such items |
JP2003162639A (ja) * | 2001-11-28 | 2003-06-06 | Fujitsu Ltd | 銘柄選択支援装置 |
US7293019B2 (en) * | 2004-03-02 | 2007-11-06 | Microsoft Corporation | Principles and methods for personalizing newsfeeds via an analysis of information novelty and dynamics |
-
2004
- 2004-04-20 US US10/827,729 patent/US7293019B2/en not_active Expired - Fee Related
-
2005
- 2005-02-23 AU AU2005200877A patent/AU2005200877B2/en not_active Ceased
- 2005-02-24 EP EP05101400A patent/EP1571579A1/fr not_active Ceased
- 2005-02-24 CA CA2498376A patent/CA2498376C/fr not_active Expired - Fee Related
- 2005-03-01 BR BR0500612-0A patent/BRPI0500612A/pt not_active IP Right Cessation
- 2005-03-01 RU RU2005105751/09A patent/RU2382401C2/ru not_active IP Right Cessation
- 2005-03-02 JP JP2005057282A patent/JP4845392B2/ja not_active Expired - Fee Related
- 2005-03-02 KR KR1020050017311A patent/KR101114012B1/ko not_active IP Right Cessation
- 2005-03-02 CN CN2008100907009A patent/CN101256591B/zh not_active Expired - Fee Related
- 2005-03-02 CN CN2005100531853A patent/CN1664819A/zh active Pending
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20020007364A1 (en) * | 2000-05-02 | 2002-01-17 | Mei Kobayashi | Detecting and tracking new events/classes of documents in a data base |
EP1378838A2 (fr) * | 2002-07-04 | 2004-01-07 | Hewlett-Packard Development Company | Evalution du netteté d'un document |
Non-Patent Citations (3)
Title |
---|
JAMES ALLAN, RON PAPKA, VICTOR LAVRENKO: "On-line New Event Detection and Tracking", PROCEEDINGS OF THE 21ST ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, MELBOURNE, AUSTRALIA, 24-28 AUGUST 1998, August 1998 (1998-08-01), ACM Press, New York, NY, USA, pages 37 - 45, XP002334941 * |
THORSTEN BRANTS, FRANCINE CHEN, AYMAN FARAHAT: "A System for New Event Detection", PROCEEDINGS OF THE 26TH ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL; TORONTO, CANADA, 26 JULY-1 AUGUST 2003, July 2003 (2003-07-01), ACM Press, New York, NY, USA, pages 330 - 337, XP002334940 * |
YIMING YANG, JIAN ZHANG, JAIME CARBONELL, CHUN JIN: "Topic-conditioned Novelty Detection", PROCEEDINGS OF THE EIGHTH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING; EDMONTON, ALBERTA, CANADA, 23-26 JULY 2002, July 2002 (2002-07-01), ACM Press, New York, NY, USA, pages 688 - 693, XP002334939 * |
Cited By (22)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2007098398A1 (fr) * | 2006-02-16 | 2007-08-30 | Newsgator Technologies, Inc. | Système et procédé de synchronisation d'un contenu de fil de syndication |
CN101373351B (zh) * | 2007-08-21 | 2011-06-22 | 京瓷美达株式会社 | 自动供稿装置、图像读取装置及图像形成装置 |
US8930388B2 (en) | 2007-10-12 | 2015-01-06 | Palo Alto Research Center Incorporated | System and method for providing orientation into subject areas of digital information for augmented communities |
US8073682B2 (en) | 2007-10-12 | 2011-12-06 | Palo Alto Research Center Incorporated | System and method for prospecting digital information |
US8165985B2 (en) | 2007-10-12 | 2012-04-24 | Palo Alto Research Center Incorporated | System and method for performing discovery of digital information in a subject area |
US8190424B2 (en) | 2007-10-12 | 2012-05-29 | Palo Alto Research Center Incorporated | Computer-implemented system and method for prospecting digital information through online social communities |
US8706678B2 (en) | 2007-10-12 | 2014-04-22 | Palo Alto Research Center Incorporated | System and method for facilitating evergreen discovery of digital information |
US8671104B2 (en) | 2007-10-12 | 2014-03-11 | Palo Alto Research Center Incorporated | System and method for providing orientation into digital information |
US8010545B2 (en) | 2008-08-28 | 2011-08-30 | Palo Alto Research Center Incorporated | System and method for providing a topic-directed search |
US8209616B2 (en) | 2008-08-28 | 2012-06-26 | Palo Alto Research Center Incorporated | System and method for interfacing a web browser widget with social indexing |
US8549016B2 (en) | 2008-11-14 | 2013-10-01 | Palo Alto Research Center Incorporated | System and method for providing robust topic identification in social indexes |
US8452781B2 (en) | 2009-01-27 | 2013-05-28 | Palo Alto Research Center Incorporated | System and method for using banded topic relevance and time for article prioritization |
EP2211282A3 (fr) * | 2009-01-27 | 2011-05-18 | Palo Alto Research Center Incorporated | Système et procédé pour gérer l'attention de l'utilisateur en détectant les sujets chauds et froids dans les indices sociaux |
US8356044B2 (en) | 2009-01-27 | 2013-01-15 | Palo Alto Research Center Incorporated | System and method for providing default hierarchical training for social indexing |
US8239397B2 (en) | 2009-01-27 | 2012-08-07 | Palo Alto Research Center Incorporated | System and method for managing user attention by detecting hot and cold topics in social indexes |
EP2211281A3 (fr) * | 2009-01-27 | 2011-04-20 | Palo Alto Research Center Incorporated | Système et procédé pour utiliser une pertinence d'objet enrubanné et période de priorisation de l'article |
US9031944B2 (en) | 2010-04-30 | 2015-05-12 | Palo Alto Research Center Incorporated | System and method for providing multi-core and multi-level topical organization in social indexes |
EP2664997A3 (fr) * | 2012-05-18 | 2015-08-12 | Xerox Corporation | Système et procédé de résolution de coréférence d'entité nommée |
US9665828B2 (en) | 2014-01-16 | 2017-05-30 | International Business Machines Corporation | Using physicochemical correlates of perceptual flavor similarity to enhance, balance and substitute flavors |
US9852380B2 (en) | 2014-01-16 | 2017-12-26 | International Business Machines Corporation | Computing personalized probabilistic familiarity based on known artifact data |
US9858530B2 (en) | 2014-01-16 | 2018-01-02 | International Business Machines Corporation | Generating novel work products using computational creativity |
US11107008B2 (en) | 2014-01-16 | 2021-08-31 | International Business Machines Corporation | Computing personalized probabilistic familiarity based on known artifact data |
Also Published As
Publication number | Publication date |
---|---|
AU2005200877A1 (en) | 2005-09-22 |
BRPI0500612A (pt) | 2005-11-08 |
CA2498376C (fr) | 2013-10-22 |
JP4845392B2 (ja) | 2011-12-28 |
RU2005105751A (ru) | 2006-08-10 |
US20050198056A1 (en) | 2005-09-08 |
KR20060043331A (ko) | 2006-05-15 |
AU2005200877B2 (en) | 2011-02-03 |
CN101256591B (zh) | 2011-02-23 |
CN1664819A (zh) | 2005-09-07 |
US7293019B2 (en) | 2007-11-06 |
CN101256591A (zh) | 2008-09-03 |
CA2498376A1 (fr) | 2005-09-02 |
KR101114012B1 (ko) | 2012-03-13 |
JP2005251203A (ja) | 2005-09-15 |
RU2382401C2 (ru) | 2010-02-20 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US7293019B2 (en) | Principles and methods for personalizing newsfeeds via an analysis of information novelty and dynamics | |
Gabrilovich et al. | Newsjunkie: providing personalized newsfeeds via analysis of information novelty | |
US7962510B2 (en) | Using content analysis to detect spam web pages | |
US8886641B2 (en) | Incorporating recency in network search using machine learning | |
US8239380B2 (en) | Systems and methods to tune a general-purpose search engine for a search entry point | |
US9870405B2 (en) | System and method for evaluating results of a search query in a network environment | |
US20130018967A1 (en) | System and method for deriving user expertise based on data propagating in a network environment | |
KR101284875B1 (ko) | 사용자의 웹 히스토리를 분석하기 위한 시스템 및 방법 | |
US8538959B2 (en) | Personalized data search utilizing social activities | |
US8478737B2 (en) | Segmentation of search topics in query logs | |
US7809716B2 (en) | Method and apparatus for establishing relationship between documents | |
US8965893B2 (en) | System and method for grouping multiple streams of data | |
JP4633162B2 (ja) | インデックス生成システム、情報検索システム、及びインデックス生成方法 | |
US20070112719A1 (en) | System and method for dynamically generating and managing an online context-driven interactive social network | |
US20070038646A1 (en) | Ranking blog content | |
US9311395B2 (en) | Systems and methods for manipulating electronic content based on speech recognition | |
US20090125549A1 (en) | Method and system for calculating competitiveness metric between objects | |
WO2010141429A1 (fr) | Fourniture de suggestions de requêtes de recherche sur le web, fondée sur des données de clic de requêtes de recherche stockées | |
JP2006107473A (ja) | パーソナル化された検索および情報アクセスを提供するシステム、方法、およびインターフェース | |
CN109947902B (zh) | 一种数据查询方法、装置和可读介质 | |
MXPA05002372A (en) | Principles and methods for personalizing newsfeeds via an analysis of information novelty and dynamics | |
Lin et al. | Accelerating web content filtering by the early decision algorithm | |
Mladenić | Web browsing using machine learning on text data | |
WO2008030568A2 (fr) | Système et procédé d'exploration de transmissions et filtre anti-spam | |
KR100645711B1 (ko) | 다수의 정보 블록으로 구분된 웹 페이지를 이용한 정보검색 서비스 제공 서버, 방법 및 시스템 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
AK | Designated contracting states |
Kind code of ref document: A1 Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LI LT LU MC NL PL PT RO SE SI SK TR |
|
AX | Request for extension of the european patent |
Extension state: AL BA HR LV MK YU |
|
17P | Request for examination filed |
Effective date: 20060209 |
|
AKX | Designation fees paid |
Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LI LT LU MC NL PL PT RO SE SI SK TR |
|
17Q | First examination report despatched |
Effective date: 20060803 |
|
RAP1 | Party data changed (applicant data changed or rights of an application transferred) |
Owner name: MICROSOFT TECHNOLOGY LICENSING, LLC |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE APPLICATION HAS BEEN REFUSED |
|
18R | Application refused |
Effective date: 20150417 |