US20180203927A1 - System and method for determining an authority rank for real time searching - Google Patents

System and method for determining an authority rank for real time searching Download PDF

Info

Publication number
US20180203927A1
US20180203927A1 US15/919,391 US201815919391A US2018203927A1 US 20180203927 A1 US20180203927 A1 US 20180203927A1 US 201815919391 A US201815919391 A US 201815919391A US 2018203927 A1 US2018203927 A1 US 2018203927A1
Authority
US
United States
Prior art keywords
search
search engine
content
processing capacity
reliability factor
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US15/919,391
Inventor
Vik Singh
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Excalibur IP LLC
Original Assignee
Excalibur IP LLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Excalibur IP LLC filed Critical Excalibur IP LLC
Priority to US15/919,391 priority Critical patent/US20180203927A1/en
Assigned to YAHOO! INC. reassignment YAHOO! INC. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: SINGH, VIK
Assigned to EXCALIBUR IP, LLC reassignment EXCALIBUR IP, LLC ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: YAHOO! INC.
Publication of US20180203927A1 publication Critical patent/US20180203927A1/en
Abandoned legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques
    • G06F17/30864
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • G06F17/30867
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/332Query formulation
    • G06F17/30637

Definitions

  • the invention described herein generally relates to search engines and more specifically to systems and methods for processing and improving search results for current real-time trends and/or events.
  • search engines deal with multiple sets of information corpora.
  • the search engine In response to a search request, the search engine returns result sets in an ordered listing.
  • the reliability of search results often depends on various factors, including the collection of the information, processing of the information, the information source and user feedback on the veracity of this information.
  • a spike in activity typically represents a corresponding real world occurrence and users seeking information as it becomes available. For example, suppose a natural disaster occurs or a rumor emerges that a company is about to launch a ground-breaking new product, there will be a corresponding in spike in people searching for this information.
  • contemporaneous information may include data feeds, such as social media feeds, really simple syndication (RSS) feeds, web logs, etc.
  • RSS really simple syndication
  • the present invention is directed towards a method and system for processing a real time increase in search requests for a common event.
  • the method and system includes detecting an activity spike in user search request activity based on monitoring of user search requests over a defined period of time and determining source locations associated with the activity spike based on user search result activities.
  • the method and system further includes associating the source locations with the user search request and thereupon applying a machine-learning model to determine a plurality of common features operative to cause the activity spike, including determining associations between the source locations and the activity spike.
  • the present invention further includes determining a plurality of fresh web content for a search engine and measuring a real-time authority for the fresh content using the machine-learning model.
  • the method and system includes adjusting a reliability factor for the fresh web content based on the measured real-time authority.
  • the adjustment of the reliability factor based on the measured real-time authority is performed instead of at least one of a link flux calculation and a page rank adjustment.
  • the present invention further includes determining the search terms of the user search requests associated with the activity spike and determining a plurality of additional content sources associated with the search terms.
  • the system and method further includes ranking the plurality of additional content sources based on the measured real-time authority.
  • the system and method additionally includes determining an additional processing capacity in the search engine caused by the activity spike and allocating available processing capacity of the search engine proportional to the additional processing capacity.
  • FIG. 1 illustrates one embodiment of a block diagram of a search result system that includes capabilities of providing search results for contemporaneous information
  • FIG. 2 illustrates one embodiment of a search engine including capabilities for processing contemporaneous information from real time sources and determining an authority rank for the real-time sources;
  • FIG. 3 illustrates a flowchart of the steps of one embodiment of a method for determining an authority rank for real time sources
  • FIG. 4 illustrates a flowchart of the steps of another embodiment of a method for determining an authority rank for real time sources.
  • the search engine technology recognizes a spike or dramatic increase in user search activity for a particular event or theme. From that spike, the search engine is able to process search result options relating to real time sources. From the processing of those sources, the search engine therefore generates an authority rank for the real time sources. Using this authority rank improves the ordering of the search results.
  • FIG. 1 illustrates one embodiment of a system 100 that includes a user device 102 , a network 104 and a search engine 106 .
  • the system 100 further includes a database population module 108 and a content database 110 usable by the search engine 106 .
  • additional content sources 120 a , 120 b and 120 c are additionally connected via the network 104 .
  • the user device 102 may be any suitable type of user processing device as recognized by one skilled in the art.
  • the user device 102 is a personal or mobile computing device that includes local processing capabilities, as well networking capabilities to interact and engage the network 104 .
  • the network 104 may be any suitable type of network allowing data communication thereacross.
  • the network 104 is the Internet, following known Internet protocols for data communication thereacross.
  • the search engine 106 is one or more processing components disposed on one or more processing devices or systems in a networked environment.
  • the search engine 106 may operate similar to known search engine technologies, but with the inclusion of additional processing capabilities describes herein.
  • the search engine is operative to receive search requests and process the requests to generate search results to the user device 102 across the network 104 .
  • the search engine 106 is additionally capable of recognizing a spike in search activity, recognizing contemporaneous sources of information, processing various details of information and thereby ranking the sources.
  • the database population module 108 is illustrated as being separate from the search engine 106 , but it is recognized that this module may be incorporated therein.
  • the database population module 108 is a processing device or system operative to perform processing operations in response to executable instructions, instructions for extracting search information relating to web-based content and then populating the content database 110 .
  • the module 108 may include technology crawling Internet content to populate the database 110 .
  • the module 108 includes processing operations for determining contemporaneous sources for real time information.
  • the system 100 illustrates three sample contemporaneous sources 120 a , 120 b and 120 c (collectively referred to as 120 ).
  • the sources 120 can be any type of source that provides real-time information.
  • a typical example may be a social network feed.
  • a Twitter® feed from various account users can be a real-time source.
  • This real-time data feed provides large amounts of contemporaneous information, with significant uncertainty regarding the veracity of this information.
  • Another source could be a really simple syndication (RSS) feed or other type of news or data feed, e.g. a stock ticker feed.
  • RSS really simple syndication
  • the user may enter a search request to the search engine 106 via the network 104 .
  • the search engine accesses the database 110 to find content results that answer the search inquiry, where based on the population module 108 , the database 110 includes real-time information from the contemporaneous sources 120 .
  • FIG. 2 describes in further detail one embodiment of processing operations and subsequent methodologies for processing the real time increase in search requests and providing improved search results to the user device 102 .
  • FIG. 2 illustrates a system 140 including an interface 142 , search term monitor 144 , search processing engine 146 , a source locator, 148 , a spike content database 150 and a machine learning processing device 152 .
  • the system 140 may be embedded within or part of a larger search engine, such as the search engine 106 of FIG. 1 , for example.
  • the interface 142 represents the computer executable code that provides the front-end user experience for search operations, such as the user entering search terms and receiving search results in response thereto.
  • the search term monitor 144 may be a processing device or module that monitors search terms over periods of time to determine if there is a particular trend or a spike in activity. For example, a spike in activity may be determined by specific standards, such as if there are X number of search requests to the same common theme within Y seconds. Merely by way of example, a spike may including noting there are in excess of 10,000 searches for the same or common terms within a period of 30 seconds.
  • the search processing engine 146 is a processing device operative to process various aspects of the search engine operations.
  • the engine 146 may include receipt of the search term, accessing a database of search results and then generating the search results page in response thereto.
  • the engine 146 includes additional processing capabilities for real-time rank authority as described in further detail below.
  • the source locator 148 may be a processing device or a module of executable instructions, operative to perform operations relating to determining a content source and allowing for various processing operations relating to that source. As described in further detail below, it is important to find various information sources, typically contemporaneous sources, in real-time event search response scenarios. And when those sources are discovered, the system 140 is operative to thereby rank the sources to quantify the reliability of these sources, typically illustrated via the search result rankings.
  • the spike content database 150 is any suitable type of data storage device that stores spike information.
  • This database 150 may include the storage of search query information, e.g. search query terms, search query rewrites, search result actions, etc. This information is then usable for tracking search query information over a period of time, as described in further detail below.
  • the machine learning processing device 152 is one or more processing devices operative to generate authority ranks
  • the device 152 uses machine learning operations to evaluate the authority of the determined sources, where in one embodiment the device 152 may use known machine learning techniques for ranking source authority, techniques used in existing search engines for evaluating sources with existing crawling techniques to crawl web content.
  • the timeliness of the real time authority rank complicates the machine learning process, such that the machine learning is modified to be performed in a more expedited manner.
  • FIGS. 1 and 2 For further illustration of the systems of FIGS. 1 and 2 , the operations of these systems are described in further detail regarding the methodologies of FIGS. 3 and 4 .
  • FIG. 3 illustrates a flowchart of one embodiment of a method processing a real time increase in search requests for a common event.
  • the method begins, step 160 , by detecting an activity spike in user search request activity based on monitoring of user search requests over a defined period of time.
  • a spike in activity relates to an immediate jump in searching for information as may be caused by an immediate, typically unplanned event.
  • an unplanned event may be a natural disaster, e.g. an earthquake in Haiti.
  • the search engine 106 using the search term monitoring processing device 144 , is therefore able to determine an activity spike by detecting that over a very short period of time there is an increase in the number of common searches.
  • the activity spike may be recognized as tens of thousands of searches for same or common terms, such as “Earthquake” and “Haiti.”
  • the period of time can readily be adjusted to determine differences between a trend and an activity spike.
  • a trend is more likely over an extended period of time, whereas an activity spike occurs in a truncated time period, whether it be seconds, minutes, hours, etc.
  • a next step, step 162 is determining source locations associated with the activity spike based on user search result activities.
  • the source locations for real-time activities include contemporaneous sources. Examples of contemporaneous sources may be any source that includes information in real time or in a timely fashion likely to be missed by web crawling techniques, for example a social network feed.
  • one example may be an event of a rumor of a high-tech product launch.
  • a source location may be a technology blog dedicated to tracking and reporting on high tech rumors and news releases.
  • Another source location may be a technical journal reporting on the blog article.
  • This step may include determining that this web log and the journal articles are the sources of the activity spike.
  • the source of the activity spike may be a news web location reporting on the event in a breaking news fashion.
  • Other sources could be social network feeds, for example, from individuals at the specification location.
  • a next step, step 164 is associating the source locations with the user search requests.
  • the source locations 148 may determine these sources and the locations are stored in the spike content database 150 .
  • the locations can be stored and referenced by the associated search request and/or search terms.
  • the web log and journal article may be cataloged in the spike content database 150 referenced by the search terms used in the spike of activity.
  • the machine learning processing device 152 is operative to perform the next step, step 166 , of FIG. 3 .
  • the step includes applying a machine-learning model to determine a plurality of common features operative to cause the activity spike including determining associations between the source locations and the activity spike. It is recognized that there are any number of suitable machine-learning techniques operative to process the associations described herein, including in one embodiment machine-learning techniques applicable to web-crawling techniques in non-real time data processing and cataloging operations.
  • the method further includes measuring a real-time authority rank for search result items based on the machine learning models. Based on this real time machine learning, the real-time authority indicates an authority ranking determinative of the veracity of the source.
  • the web log may be given a high authority ranking based on the machine-learning factors indicating it is a highly trustworthy source.
  • Another source could be a secondary, less reliable web log indicating the product rumor.
  • This less reliable web log may be less reliable for any number of reasons, such as it regularly broadcasts various rumors, is associated with a competing business, is associated with an illegal stock manipulation scheme, just by way of example.
  • this particular web location is then given a low authority for search results.
  • the search processing engine 146 is operative to generate search results for users performing search requests.
  • Step 170 of the method of FIG. 3 includes generating or updating search results including updating the ranking of search result items based on the authority rank.
  • the client 102 submits the search request to the search engine 106 , the search request being directed to a real time event causing a spike in user search activity.
  • the search engine 106 is thereby operative to provide updated search results included results adjusted based on the authority rank generated via the methodology of the steps of FIG. 3 .
  • FIG. 4 illustrates another embodiment, whereby processing operations provides for improving search results relating to real time event and activity spikes in searching, including processing information from contemporaneous sources.
  • the method of FIG. 4 may be processed in the systems of FIGS. 1 and 2 or any other suitable processing environment.
  • a first step, step 180 is determining a plurality of fresh web content for a search engine based on the active spike.
  • the method of FIG. 4 may be predicated on the events of the method of FIG. 3 , including the determination of an activity spike.
  • the method of FIG. 4 provides additional processing operations to augment and/or supplement the search results with new and/or additional search results, such as may be found from contemporaneous sources, or at least via sources not yet captured using the web crawling techniques.
  • the determination of fresh web content may include direct web crawling techniques or searching contemporaneous feeds.
  • one technique may include searching a social network data feed of user submissions.
  • the user submissions may be short messages, such as status updates or real-time messages, also colloquially known as a “tweet.”
  • a next step, step 182 is measuring real-time authority for the fresh content using the machine-learning model. The measuring of the real-time authority for the fresh content may be performed using the machine learning processing device 152 of FIG. 2 as described in further detail above.
  • a next step, step 184 is adjusting a reliability factor for the fresh web content based on the measured real-time authority.
  • the adjustment of the reliability factor includes utilizing the authority information from step 182 . If the authority information indicates a high degree of trustworthiness, the reliability factor can be improved and if the authority information indicates a low degree of trustworthiness, the reliability factor can be lowered.
  • a next step, step 186 is adjusting the reliability factor based on the real-time authority instead of either a link flux calculation or a page rank adjustment. Similar to step 184 , the reliability factor is adjusted, but in step 186 , there is a reduction in factors used for this calculation.
  • the methodology of FIG. 4 does not require the steps to be in sequential order and various embodiments can include step 186 instead of step 184 or vice versa. Step 186 further modifies step 184 by eliminating the adjustment factors of the link flux calculation and the page rank adjustment.
  • a next step, step 188 is determining an additional processing capacity in the search engine caused by the activity spike.
  • a processing operation examines the processing load search engine 106 of FIG. 1 to account for this activity spike and the processing of real-time authority ranks as described above. This determination may be performed using any suitable processing technique, such as monitoring processing load allocations to search operations on a search processing environment and measuring the delta between before the activity spike and during the activity spike.
  • an additional step is allocating available processing capacity of the search engine proportional to the additional processing capacity. Therefore, in this embodiment, the search engine provides for processing capacity as needed, without seeking to lose or otherwise compromise processing operations relating to the non-activity spike information. It is recognized that just because there is a spike in user activity, there is still a need to maintain standard search engine operations, therefore by the allocation of step 190 , attempts are made to maintain the search engine but also efficiently and effectively provide real-time source information whereby there is a real-time authority ranking for this information.
  • search engines provides effective solutions to standard searching operations, but based on the crawling data cataloging nature of these systems, problems can arise in real time activities.
  • the present method and system Based on the detection of the activity spike and machine-learning processing, the present method and system provides not only time sensitive search results, but also performs machine-learning authority rank to improve the accuracy and benefit of the search results.
  • the authority ranking allows for presentation of users with highest quality results in primary result positions, including account for contemporaneous sources as described above.
  • FIGS. 1 through 4 are conceptual illustrations allowing for an explanation of the present invention. It should be understood that various aspects of the embodiments of the present invention could be implemented in hardware, firmware, software, or combinations thereof. In such embodiments, the various components and/or steps would be implemented in hardware, firmware, and/or software to perform the functions of the present invention. That is, the same piece of hardware, firmware, or module of software could perform one or more of the illustrated blocks (e.g., components or steps).
  • computer software e.g., programs or other instructions
  • data is stored on a machine readable medium as part of a computer program product, and is loaded into a computer system or other device or machine via a removable storage drive, hard drive, or communications interface.
  • Computer programs also called computer control logic or computer readable program code
  • processors controllers, or the like
  • machine readable medium “computer program medium” and “computer usable medium” are used to generally refer to media such as a random access memory (RAM); a read only memory (ROM); a removable storage unit (e.g., a magnetic or optical disc, flash memory device, or the like); a hard disk; or the like.
  • RAM random access memory
  • ROM read only memory
  • removable storage unit e.g., a magnetic or optical disc, flash memory device, or the like
  • hard disk or the like.

Abstract

The present invention is directed towards a method and system for processing a real time increase in search requests for a common event. The method and system includes detecting an activity spike in user search request activity based on monitoring of user search requests over a defined period of time and determining source locations associated with the activity spike based on user search result activities. The method and system further includes associating the source locations with the user search request and thereupon applying a machine-learning model to determine a plurality of common features operative to cause the activity spike, including determining associations between the source locations and the activity spike.

Description

    CROSS-REFERENCE TO RELATED APPLICATIONS
  • This application is a continuation of U.S. patent application Ser. No. 12/706,349 filed Feb. 16, 2010, which is incorporated herein by reference in its entirety.
  • COPYRIGHT NOTICE
  • A portion of the disclosure of this patent document contains material, which is subject to copyright protection. The copyright owner has no objection to the facsimile reproduction by anyone of the patent document or the patent disclosure, as it appears in the Patent and Trademark Office patent files or records, but otherwise reserves all copyright rights whatsoever.
  • FIELD OF THE INVENTION
  • The invention described herein generally relates to search engines and more specifically to systems and methods for processing and improving search results for current real-time trends and/or events.
  • BACKGROUND OF THE INVENTION
  • Traditional search engines deal with multiple sets of information corpora. In response to a search request, the search engine returns result sets in an ordered listing. The reliability of search results often depends on various factors, including the collection of the information, processing of the information, the information source and user feedback on the veracity of this information.
  • Problems can arise when there is a spike in activity for a particular search trend because of problems with not only determining the right contemporaneous information, but also the reliability of this information. Standard search terms can be easily and readily handled using existing search technology, for example a user conducting a search to find information on a vacation to Las Vegas.
  • But a spike in activity typically represents a corresponding real world occurrence and users seeking information as it becomes available. For example, suppose a natural disaster occurs or a rumor emerges that a company is about to launch a ground-breaking new product, there will be a corresponding in spike in people searching for this information.
  • Current web searching technology suffers from an ability to successfully account for contemporaneous information. There is a growing trend for highly contemporaneous information achieving a critical mass of distribution in a very short time frame. This increase in contemporaneous information is predicated on the wide use and quick dissemination of information occurring in the current electronic world.
  • The conversion of the Internet from a passive online informational source to a de facto medium for information distribution, combined with the new tools for increases contemporaneous content generation, complicates existing web searching technology. Examples of contemporaneous information may include data feeds, such as social media feeds, really simple syndication (RSS) feeds, web logs, etc. Prior techniques of crawling the Internet, cataloging and then searching these corpora suffer from a lack of proper accounting for these contemporaneous data sources.
  • With developments in search engine technology to account for these feeds, problems can arise in the reliability of this information. For example, just because a search engine may describe a social media feed that includes information relating to the event, there is no way to trust the source of this feed. Therefore, there exists a need for improving search results correlating to spikes in real time search activities by accounting for the authority of sources in the search result.
  • SUMMARY OF THE INVENTION
  • The present invention is directed towards a method and system for processing a real time increase in search requests for a common event. The method and system includes detecting an activity spike in user search request activity based on monitoring of user search requests over a defined period of time and determining source locations associated with the activity spike based on user search result activities. The method and system further includes associating the source locations with the user search request and thereupon applying a machine-learning model to determine a plurality of common features operative to cause the activity spike, including determining associations between the source locations and the activity spike.
  • The present invention further includes determining a plurality of fresh web content for a search engine and measuring a real-time authority for the fresh content using the machine-learning model. Therein, the method and system includes adjusting a reliability factor for the fresh web content based on the measured real-time authority. In one embodiment, the method and system, the adjustment of the reliability factor based on the measured real-time authority is performed instead of at least one of a link flux calculation and a page rank adjustment.
  • The present invention further includes determining the search terms of the user search requests associated with the activity spike and determining a plurality of additional content sources associated with the search terms. The system and method further includes ranking the plurality of additional content sources based on the measured real-time authority. The system and method additionally includes determining an additional processing capacity in the search engine caused by the activity spike and allocating available processing capacity of the search engine proportional to the additional processing capacity.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • The invention is illustrated in the figures of the accompanying drawings which are meant to be exemplary and not limiting, in which like references are intended to refer to like or corresponding parts, and in which:
  • FIG. 1 illustrates one embodiment of a block diagram of a search result system that includes capabilities of providing search results for contemporaneous information;
  • FIG. 2 illustrates one embodiment of a search engine including capabilities for processing contemporaneous information from real time sources and determining an authority rank for the real-time sources;
  • FIG. 3 illustrates a flowchart of the steps of one embodiment of a method for determining an authority rank for real time sources; and
  • FIG. 4 illustrates a flowchart of the steps of another embodiment of a method for determining an authority rank for real time sources.
  • DETAILED DESCRIPTION OF THE EMBODIMENTS
  • In the following description of the embodiments of the invention, reference is made to the accompanying drawings that form a part hereof, and in which is shown by way of illustration, exemplary embodiments in which the invention may be practiced. It is to be understood that other embodiments may be utilized and structural changes may be made without departing from the scope of the present invention.
  • As described herein, the search engine technology recognizes a spike or dramatic increase in user search activity for a particular event or theme. From that spike, the search engine is able to process search result options relating to real time sources. From the processing of those sources, the search engine therefore generates an authority rank for the real time sources. Using this authority rank improves the ordering of the search results.
  • FIG. 1 illustrates one embodiment of a system 100 that includes a user device 102, a network 104 and a search engine 106. The system 100 further includes a database population module 108 and a content database 110 usable by the search engine 106. In the system 100, additional content sources 120 a, 120 b and 120 c are additionally connected via the network 104.
  • The user device 102 may be any suitable type of user processing device as recognized by one skilled in the art. In a typical embodiment, the user device 102 is a personal or mobile computing device that includes local processing capabilities, as well networking capabilities to interact and engage the network 104.
  • The network 104 may be any suitable type of network allowing data communication thereacross. In a typical embodiment, the network 104 is the Internet, following known Internet protocols for data communication thereacross.
  • The search engine 106 is one or more processing components disposed on one or more processing devices or systems in a networked environment. The search engine 106 may operate similar to known search engine technologies, but with the inclusion of additional processing capabilities describes herein. The search engine is operative to receive search requests and process the requests to generate search results to the user device 102 across the network 104. Whereas, the search engine 106 is additionally capable of recognizing a spike in search activity, recognizing contemporaneous sources of information, processing various details of information and thereby ranking the sources.
  • It is recognized that various details of the user device 102, network 104 and search engine 106 have been omitted. Many details, such as techniques for engaging and communication therebetween, not described herein are known within the knowledge of one skilled in the art and are omitted for brevity purposes only.
  • The database population module 108 is illustrated as being separate from the search engine 106, but it is recognized that this module may be incorporated therein. The database population module 108 is a processing device or system operative to perform processing operations in response to executable instructions, instructions for extracting search information relating to web-based content and then populating the content database 110. In one embodiment, the module 108 may include technology crawling Internet content to populate the database 110. Additionally, the module 108 includes processing operations for determining contemporaneous sources for real time information.
  • The system 100 illustrates three sample contemporaneous sources 120 a, 120 b and 120 c (collectively referred to as 120). The sources 120 can be any type of source that provides real-time information. A typical example may be a social network feed. For example, a Twitter® feed from various account users can be a real-time source. This real-time data feed provides large amounts of contemporaneous information, with significant uncertainty regarding the veracity of this information. Another source could be a really simple syndication (RSS) feed or other type of news or data feed, e.g. a stock ticker feed.
  • It is recognized that there are other types of information sources that provide real-time content and the sources 120 are not limited by the examples listed above. As the speed of information is received, there is the uncertainty of the trustworthiness of this information.
  • In the system 100, the user may enter a search request to the search engine 106 via the network 104. The search engine accesses the database 110 to find content results that answer the search inquiry, where based on the population module 108, the database 110 includes real-time information from the contemporaneous sources 120.
  • Search results are provided back to the user device 102. The ranking of these results are affected by a recognition in spike in user search activity and generating an authority rank. FIG. 2 describes in further detail one embodiment of processing operations and subsequent methodologies for processing the real time increase in search requests and providing improved search results to the user device 102.
  • FIG. 2 illustrates a system 140 including an interface 142, search term monitor 144, search processing engine 146, a source locator, 148, a spike content database 150 and a machine learning processing device 152. The system 140 may be embedded within or part of a larger search engine, such as the search engine 106 of FIG. 1, for example.
  • The interface 142 represents the computer executable code that provides the front-end user experience for search operations, such as the user entering search terms and receiving search results in response thereto. The search term monitor 144 may be a processing device or module that monitors search terms over periods of time to determine if there is a particular trend or a spike in activity. For example, a spike in activity may be determined by specific standards, such as if there are X number of search requests to the same common theme within Y seconds. Merely by way of example, a spike may including noting there are in excess of 10,000 searches for the same or common terms within a period of 30 seconds.
  • The search processing engine 146 is a processing device operative to process various aspects of the search engine operations. The engine 146 may include receipt of the search term, accessing a database of search results and then generating the search results page in response thereto. The engine 146 includes additional processing capabilities for real-time rank authority as described in further detail below.
  • The source locator 148 may be a processing device or a module of executable instructions, operative to perform operations relating to determining a content source and allowing for various processing operations relating to that source. As described in further detail below, it is important to find various information sources, typically contemporaneous sources, in real-time event search response scenarios. And when those sources are discovered, the system 140 is operative to thereby rank the sources to quantify the reliability of these sources, typically illustrated via the search result rankings.
  • The spike content database 150 is any suitable type of data storage device that stores spike information. This database 150 may include the storage of search query information, e.g. search query terms, search query rewrites, search result actions, etc. This information is then usable for tracking search query information over a period of time, as described in further detail below.
  • In this system 140, the machine learning processing device 152 is one or more processing devices operative to generate authority ranks The device 152 uses machine learning operations to evaluate the authority of the determined sources, where in one embodiment the device 152 may use known machine learning techniques for ranking source authority, techniques used in existing search engines for evaluating sources with existing crawling techniques to crawl web content. Whereas, in the present system 140, the timeliness of the real time authority rank complicates the machine learning process, such that the machine learning is modified to be performed in a more expedited manner.
  • For further illustration of the systems of FIGS. 1 and 2, the operations of these systems are described in further detail regarding the methodologies of FIGS. 3 and 4.
  • FIG. 3 illustrates a flowchart of one embodiment of a method processing a real time increase in search requests for a common event. The method begins, step 160, by detecting an activity spike in user search request activity based on monitoring of user search requests over a defined period of time.
  • As described above, in a typical search engine, various amounts of searches are conducted on a regular basis. There are larger trends relating to common events, such as for example there may be an increase in searches for an actor or actress around the time a movie premieres, or general searches to a sporting event around the time the sporting event occurs.
  • By contrast, a spike in activity relates to an immediate jump in searching for information as may be caused by an immediate, typically unplanned event. Simply by way of example, an unplanned event may be a natural disaster, e.g. an earthquake in Haiti. The search engine 106, using the search term monitoring processing device 144, is therefore able to determine an activity spike by detecting that over a very short period of time there is an increase in the number of common searches. Using the example of a Haitian earthquake, the activity spike may be recognized as tens of thousands of searches for same or common terms, such as “Earthquake” and “Haiti.”
  • The period of time can readily be adjusted to determine differences between a trend and an activity spike. A trend is more likely over an extended period of time, whereas an activity spike occurs in a truncated time period, whether it be seconds, minutes, hours, etc.
  • In the method of FIG. 3, a next step, step 162, is determining source locations associated with the activity spike based on user search result activities. The source locations for real-time activities include contemporaneous sources. Examples of contemporaneous sources may be any source that includes information in real time or in a timely fashion likely to be missed by web crawling techniques, for example a social network feed.
  • In this step, one example may be an event of a rumor of a high-tech product launch. A source location may be a technology blog dedicated to tracking and reporting on high tech rumors and news releases. Another source location may be a technical journal reporting on the blog article. This step may include determining that this web log and the journal articles are the sources of the activity spike. In the example of a natural disaster, the source of the activity spike may be a news web location reporting on the event in a breaking news fashion. Other sources could be social network feeds, for example, from individuals at the specification location.
  • A next step, step 164, is associating the source locations with the user search requests. Reference back to FIG. 2, the source locations 148 may determine these sources and the locations are stored in the spike content database 150. The locations can be stored and referenced by the associated search request and/or search terms. For example, in the example of a product launch, the web log and journal article may be cataloged in the spike content database 150 referenced by the search terms used in the spike of activity.
  • With a database cataloging the information, the machine learning processing device 152 is operative to perform the next step, step 166, of FIG. 3. The step includes applying a machine-learning model to determine a plurality of common features operative to cause the activity spike including determining associations between the source locations and the activity spike. It is recognized that there are any number of suitable machine-learning techniques operative to process the associations described herein, including in one embodiment machine-learning techniques applicable to web-crawling techniques in non-real time data processing and cataloging operations.
  • In step 168, the method further includes measuring a real-time authority rank for search result items based on the machine learning models. Based on this real time machine learning, the real-time authority indicates an authority ranking determinative of the veracity of the source. Using the above example of a rumored product launch, the web log may be given a high authority ranking based on the machine-learning factors indicating it is a highly trustworthy source.
  • By contrast, it is also possible that another source could be a secondary, less reliable web log indicating the product rumor. This less reliable web log may be less reliable for any number of reasons, such as it regularly broadcasts various rumors, is associated with a competing business, is associated with an illegal stock manipulation scheme, just by way of example. Using the machine learning operations of the machine learning processing device 152, this particular web location is then given a low authority for search results.
  • Based on determination of various sources, machine-learning processing and generating authority ranks, the search processing engine 146 is operative to generate search results for users performing search requests.
  • Step 170 of the method of FIG. 3 includes generating or updating search results including updating the ranking of search result items based on the authority rank. Thereby, in the system of FIG. 1, the client 102 submits the search request to the search engine 106, the search request being directed to a real time event causing a spike in user search activity. The search engine 106 is thereby operative to provide updated search results included results adjusted based on the authority rank generated via the methodology of the steps of FIG. 3.
  • FIG. 4 illustrates another embodiment, whereby processing operations provides for improving search results relating to real time event and activity spikes in searching, including processing information from contemporaneous sources. The method of FIG. 4 may be processed in the systems of FIGS. 1 and 2 or any other suitable processing environment.
  • In this embodiment, a first step, step 180, is determining a plurality of fresh web content for a search engine based on the active spike. The method of FIG. 4 may be predicated on the events of the method of FIG. 3, including the determination of an activity spike. The method of FIG. 4 provides additional processing operations to augment and/or supplement the search results with new and/or additional search results, such as may be found from contemporaneous sources, or at least via sources not yet captured using the web crawling techniques.
  • The determination of fresh web content may include direct web crawling techniques or searching contemporaneous feeds. For example, one technique may include searching a social network data feed of user submissions. The user submissions may be short messages, such as status updates or real-time messages, also colloquially known as a “tweet.” A next step, step 182, is measuring real-time authority for the fresh content using the machine-learning model. The measuring of the real-time authority for the fresh content may be performed using the machine learning processing device 152 of FIG. 2 as described in further detail above.
  • A next step, step 184, is adjusting a reliability factor for the fresh web content based on the measured real-time authority. The adjustment of the reliability factor includes utilizing the authority information from step 182. If the authority information indicates a high degree of trustworthiness, the reliability factor can be improved and if the authority information indicates a low degree of trustworthiness, the reliability factor can be lowered.
  • In this embodiment, a next step, step 186, is adjusting the reliability factor based on the real-time authority instead of either a link flux calculation or a page rank adjustment. Similar to step 184, the reliability factor is adjusted, but in step 186, there is a reduction in factors used for this calculation. The methodology of FIG. 4 does not require the steps to be in sequential order and various embodiments can include step 186 instead of step 184 or vice versa. Step 186 further modifies step 184 by eliminating the adjustment factors of the link flux calculation and the page rank adjustment.
  • A next step, step 188, is determining an additional processing capacity in the search engine caused by the activity spike. In this embodiment, a processing operation examines the processing load search engine 106 of FIG. 1 to account for this activity spike and the processing of real-time authority ranks as described above. This determination may be performed using any suitable processing technique, such as monitoring processing load allocations to search operations on a search processing environment and measuring the delta between before the activity spike and during the activity spike.
  • In the embodiment including step 188, an additional step, step 190, is allocating available processing capacity of the search engine proportional to the additional processing capacity. Therefore, in this embodiment, the search engine provides for processing capacity as needed, without seeking to lose or otherwise compromise processing operations relating to the non-activity spike information. It is recognized that just because there is a spike in user activity, there is still a need to maintain standard search engine operations, therefore by the allocation of step 190, attempts are made to maintain the search engine but also efficiently and effectively provide real-time source information whereby there is a real-time authority ranking for this information.
  • It is understood that search engines provides effective solutions to standard searching operations, but based on the crawling data cataloging nature of these systems, problems can arise in real time activities. Based on the detection of the activity spike and machine-learning processing, the present method and system provides not only time sensitive search results, but also performs machine-learning authority rank to improve the accuracy and benefit of the search results. The authority ranking allows for presentation of users with highest quality results in primary result positions, including account for contemporaneous sources as described above.
  • FIGS. 1 through 4 are conceptual illustrations allowing for an explanation of the present invention. It should be understood that various aspects of the embodiments of the present invention could be implemented in hardware, firmware, software, or combinations thereof. In such embodiments, the various components and/or steps would be implemented in hardware, firmware, and/or software to perform the functions of the present invention. That is, the same piece of hardware, firmware, or module of software could perform one or more of the illustrated blocks (e.g., components or steps).
  • In software implementations, computer software (e.g., programs or other instructions) and/or data is stored on a machine readable medium as part of a computer program product, and is loaded into a computer system or other device or machine via a removable storage drive, hard drive, or communications interface. Computer programs (also called computer control logic or computer readable program code) are stored in a main and/or secondary memory, and executed by one or more processors (controllers, or the like) to cause the one or more processors to perform the functions of the invention as described herein. In this document, the terms “machine readable medium,” “computer program medium” and “computer usable medium” are used to generally refer to media such as a random access memory (RAM); a read only memory (ROM); a removable storage unit (e.g., a magnetic or optical disc, flash memory device, or the like); a hard disk; or the like.
  • Notably, the figures and examples above are not meant to limit the scope of the present invention to a single embodiment, as other embodiments are possible by way of interchange of some or all of the described or illustrated elements. Moreover, where certain elements of the present invention can be partially or fully implemented using known components, only those portions of such known components that are necessary for an understanding of the present invention are described, and detailed descriptions of other portions of such known components are omitted so as not to obscure the invention. In the present specification, an embodiment showing a singular component should not necessarily be limited to other embodiments including a plurality of the same component, and vice-versa, unless explicitly stated otherwise herein. Moreover, applicants do not intend for any term in the specification or claims to be ascribed an uncommon or special meaning unless explicitly set forth as such. Further, the present invention encompasses present and future known equivalents to the known components referred to herein by way of illustration.
  • The foregoing description of the specific embodiments will so fully reveal the general nature of the invention that others can, by applying knowledge within the skill of the relevant art(s) (including the contents of the documents cited and incorporated by reference herein), readily modify and/or adapt for various applications such specific embodiments, without undue experimentation, without departing from the general concept of the present invention. Such adaptations and modifications are therefore intended to be within the meaning and range of equivalents of the disclosed embodiments, based on the teaching and guidance presented herein. It is to be understood that the phraseology or terminology herein is for the purpose of description and not of limitation, such that the terminology or phraseology of the present specification is to be interpreted by the skilled artisan in light of the teachings and guidance presented herein, in combination with the knowledge of one skilled in the relevant art(s).
  • While various embodiments of the present invention have been described above, it should be understood that they have been presented by way of example, and not limitation. It would be apparent to one skilled in the relevant art(s) that various changes in form and detail could be made therein without departing from the spirit and scope of the invention. Thus, the present invention should not be limited by any of the above-described exemplary embodiments, but should be defined only in accordance with the following claims and their equivalents.

Claims (20)

What is claimed:
1. A method, implemented one or more machines having at least one processor, memory, and a communication platform connected to a network for allocating processing capacity of a search engine, the method comprising:
determining a plurality of content for a search engine based on user search activity satisfying a condition;
measuring a real-time authority for the plurality of content using a machine-learning model;
adjusting a reliability factor for the plurality of content;
determining additional processing capacity of the search engine as a result of the user search activity satisfying the condition; and
allocating available processing capacity to the search engine based on the additional processing capacity.
2. The method of claim 1, wherein the user search activity satisfying the condition comprises:
determining that, during a period of time, user search request activity included an increased amount of that one or more search terms within one or more search requests, the one or more search terms being associated with an event.
3. The method of claim 1, wherein the plurality of content comprises fresh web content, the method further comprises:
searching one or more contemporaneous data feeds; and
identifying the plurality of content within the one or more contemporaneous data feeds.
4. The method of claim 1, wherein measuring the real-time authority comprises:
generating a rank for a source authority by evaluating the source authority of a source associated with each of the plurality of content.
5. The method of claim 1, wherein adjusting the reliability factor comprises:
utilizing the real-time authority to determine whether each of the plurality of content indicates a high degree of trustworthiness or a low degree of trustworthiness, wherein:
the reliability factor is increased for content that indicates the high degree of trustworthiness, and
the reliability factor is decreased for content that indicating the low degree of trustworthiness.
6. The method of claim 1, wherein adjusting the reliability factor comprises at least one of:
adjusting the reliability factor based on the real-time authority; and
adjusting the reliability factor by eliminating adjustment factors of link flux calculation and page rank adjustment.
7. The method of claim 1, wherein determining the additional processing capacity of the search engine comprises:
monitoring processing load allocations by measuring a difference between search operations of the search engine prior to the user search activity satisfying the condition being detected and during the user search activity satisfying the condition.
8. The method of claim 1, wherein allocating the available processing capacity to the search engine comprises:
allocating the available processing capacity to the search engine such the available processing capacity of the search engine is proportional to the additional processing capacity of the search engine.
9. A system for allocating processing capacity of a search engine, comprising:
a computer readable medium comprising instructions stored thereon; and
a processing device that, in response to executing the instructions, is operative to:
determine a plurality of content for a search engine based on user search activity satisfying the condition;
measure a real-time authority for the plurality of content using a machine-learning model;
adjust a reliability factor for the plurality of content;
determine additional processing capacity of the search engine as a result of the user search activity satisfying the condition; and
allocate available processing capacity to the search engine based on the additional processing capacity.
10. The system of claim 9, wherein the processing device, in response to executing the instructions corresponding to the user search activity satisfying the condition, is further operative to:
determine that, during a period of time, user search request activity included an increased amount of that one or more search terms within one or more search requests, the one or more search terms being associated with an event.
11. The system of claim 9, wherein the plurality of content comprises fresh web content, the processing device, in response to executing the instructions, is further operative to:
search one or more contemporaneous data feeds; and
identify the plurality of content within the one or more contemporaneous data feeds.
12. The system of claim 9, wherein the processing device, in response to executing the instructions corresponding to the real-time authority being measured, is operative to:
generate a rank for a source authority by evaluating the source authority of a source associated with each of the plurality of content.
13. The system of claim 9, wherein the processing device, in response to executing the instructions corresponding to the reliability factor being adjusted, is operative to:
utilize the real-time authority to determine whether each of the plurality of content indicates a high degree of trustworthiness or a low degree of trustworthiness, wherein:
the reliability factor is increased for content that indicates the high degree of trustworthiness, and
the reliability factor is decreased for content that indicating the low degree of trustworthiness.
14. The system of claim 9, wherein the processing device, in response to executing the instructions corresponding to the reliability factor being adjusted, is operative to at least one of:
adjust the reliability factor based on the real-time authority; and
adjust the reliability factor by eliminating adjustment factors of link flux calculation and page rank adjustment.
15. The system of claim 9, wherein the processing device, in response to executing the instructions corresponding to the additional processing capacity of the search engine being determined, is operative to:
monitor processing load allocations by measuring a difference between search operations of the search engine prior to the user search activity satisfying the condition being detected and during the user search activity satisfying the condition.
16. The system of claim 9, wherein the processing device, in response to executing the instructions corresponding to the available processing capacity to the search engine being allocated, is operative to:
allocate the available processing capacity to the search engine such the available processing capacity of the search engine is proportional to the additional processing capacity of the search engine.
17. A non-transitory computer readable medium comprising instructions that, when executing by at least one processor, cause a device to:
determine a plurality of content for a search engine based on user search activity satisfying a condition;
measure a real-time authority for the plurality of content using a machine-learning model;
adjust a reliability factor for the plurality of content;
determine additional processing capacity of the search engine as a result of the user search activity satisfying the condition; and
allocate available processing capacity to the search engine based on the additional processing capacity.
18. The non-transitory computer readable medium of claim 17, wherein the instructions corresponding to adjusting the reliability factor, when executed by the at least one processor, cause the device to:
utilize the real-time authority to determine whether each of the plurality of content indicates a high degree of trustworthiness or a low degree of trustworthiness, wherein:
the reliability factor is increased for content that indicates the high degree of trustworthiness, and
the reliability factor is decreased for content that indicating the low degree of trustworthiness.
19. The non-transitory computer readable medium of claim 17, wherein the instructions comprising the additional processing capacity of the search engine being determined, when executed by the at least one processor, cause the device to:
monitor processing load allocations by measuring a difference between search operations of the search engine prior to the user search activity satisfying the condition being detected and during the user search activity satisfying the condition.
20. The non-transitory computer readable medium of claim 17, wherein the instructions comprising the available processing capacity to the search engine being allocated, when executed by the at least one processor, cause the device to:
allocate the available processing capacity to the search engine such the available processing capacity of the search engine is proportional to the additional processing capacity of the search engine.
US15/919,391 2010-02-16 2018-03-13 System and method for determining an authority rank for real time searching Abandoned US20180203927A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US15/919,391 US20180203927A1 (en) 2010-02-16 2018-03-13 System and method for determining an authority rank for real time searching

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US12/706,349 US9953083B2 (en) 2010-02-16 2010-02-16 System and method for determining an authority rank for real time searching
US15/919,391 US20180203927A1 (en) 2010-02-16 2018-03-13 System and method for determining an authority rank for real time searching

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US12/706,349 Continuation US9953083B2 (en) 2010-02-16 2010-02-16 System and method for determining an authority rank for real time searching

Publications (1)

Publication Number Publication Date
US20180203927A1 true US20180203927A1 (en) 2018-07-19

Family

ID=44370349

Family Applications (2)

Application Number Title Priority Date Filing Date
US12/706,349 Expired - Fee Related US9953083B2 (en) 2010-02-16 2010-02-16 System and method for determining an authority rank for real time searching
US15/919,391 Abandoned US20180203927A1 (en) 2010-02-16 2018-03-13 System and method for determining an authority rank for real time searching

Family Applications Before (1)

Application Number Title Priority Date Filing Date
US12/706,349 Expired - Fee Related US9953083B2 (en) 2010-02-16 2010-02-16 System and method for determining an authority rank for real time searching

Country Status (1)

Country Link
US (2) US9953083B2 (en)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9224177B2 (en) * 2010-11-05 2015-12-29 Market Data Service Llc Systems and methods for searching for and translating real estate descriptions from diverse sources utilizing an operator-based product definition
US10134063B2 (en) 2010-11-05 2018-11-20 Market Data Service Llc Systems and methods for searching for and translating real estate descriptions from diverse sources utilizing an operator-based product definition
US9218629B2 (en) * 2012-01-20 2015-12-22 Blackberry Limited Prioritizing and providing information about user contacts
US11232163B2 (en) * 2018-08-23 2022-01-25 Walmart Apollo, Llc Method and apparatus for ecommerce search ranking
US11127064B2 (en) 2018-08-23 2021-09-21 Walmart Apollo, Llc Method and apparatus for ecommerce search ranking
CN112307360B (en) * 2019-07-30 2023-08-25 百度在线网络技术(北京)有限公司 Regional event detection method and device based on search engine and search engine

Family Cites Families (36)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5832447A (en) * 1994-05-24 1998-11-03 Envoy Corporation Automated system and method for providing real-time verification of health insurance eligibility
US6366925B1 (en) * 1999-01-04 2002-04-02 American Services Network, Pc Network based legal services system
US6801909B2 (en) * 2000-07-21 2004-10-05 Triplehop Technologies, Inc. System and method for obtaining user preferences and providing user recommendations for unseen physical and information goods and services
US6754366B2 (en) * 2001-03-27 2004-06-22 Pitney Bowes Inc. Method for determining if mail contains life harming materials
CN1564994B (en) * 2001-08-10 2010-09-29 松下电器产业株式会社 Electronic device
US7577655B2 (en) * 2003-09-16 2009-08-18 Google Inc. Systems and methods for improving the ranking of news articles
EP1671235A4 (en) * 2003-09-23 2007-10-24 Amazon Com Inc Personalized searchable library with highlighting capabilities and access to electronic images of text based on user ownership of corresponding physical text
US9396212B2 (en) * 2004-04-07 2016-07-19 Visible World, Inc. System and method for enhanced video selection
US8335785B2 (en) * 2004-09-28 2012-12-18 Hewlett-Packard Development Company, L.P. Ranking results for network search query
JP4977624B2 (en) * 2005-01-18 2012-07-18 ヤフー! インコーポレイテッド Matching and ranking of sponsored search listings that incorporate web search technology and web content
US7386105B2 (en) * 2005-05-27 2008-06-10 Nice Systems Ltd Method and apparatus for fraud detection
US7496588B2 (en) * 2005-06-27 2009-02-24 Siperian, Inc. Method and apparatus for data integration and management
US20100076994A1 (en) * 2005-11-05 2010-03-25 Adam Soroca Using Mobile Communication Facility Device Data Within a Monetization Platform
US8015484B2 (en) * 2006-02-09 2011-09-06 Alejandro Backer Reputation system for web pages and online entities
US8386469B2 (en) * 2006-02-16 2013-02-26 Mobile Content Networks, Inc. Method and system for determining relevant sources, querying and merging results from multiple content sources
US7739280B2 (en) * 2006-03-06 2010-06-15 Veveo, Inc. Methods and systems for selecting and presenting content based on user preference information extracted from an aggregate preference signature
US20070233777A1 (en) * 2006-03-28 2007-10-04 International Business Machines Corporation Methods, systems, and computer program products for dynamically classifying web pages
US20080072264A1 (en) * 2006-08-02 2008-03-20 Aaron Crayford Distribution of content on a network
WO2008057268A2 (en) * 2006-10-26 2008-05-15 Mobile Content Networks, Inc. Techniques for determining relevant advertisements in response to queries
US8224905B2 (en) * 2006-12-06 2012-07-17 Microsoft Corporation Spam filtration utilizing sender activity data
US8126820B1 (en) * 2007-01-30 2012-02-28 Intuit Inc. Community to support the definition and sharing of source trust level configurations
US20080208820A1 (en) * 2007-02-28 2008-08-28 Psydex Corporation Systems and methods for performing semantic analysis of information over time and space
US7672937B2 (en) * 2007-04-11 2010-03-02 Yahoo, Inc. Temporal targeting of advertisements
US20080320087A1 (en) * 2007-06-22 2008-12-25 Microsoft Corporation Swarm sensing and actuating
US20090265205A1 (en) * 2008-03-11 2009-10-22 Incentalign, Inc. Pricing, Allocating, accounting and distributing internal resources using a market mechanism
US8543592B2 (en) * 2008-05-30 2013-09-24 Microsoft Corporation Related URLs for task-oriented query results
US8483382B2 (en) * 2008-07-24 2013-07-09 Grape Technology Group, Inc. System and method for real-time comparison of news events against directory assistance queries
US20100036771A1 (en) * 2008-08-11 2010-02-11 Laru Corporation System, method an computer program product for processing automated clearing house (ACH) historical data
US9979737B2 (en) * 2008-12-30 2018-05-22 Genesys Telecommunications Laboratories, Inc. Scoring persons and files for trust in digital communication
US8667026B1 (en) * 2009-01-22 2014-03-04 American Express Travel Related Services Company, Inc. Method and system for ranking multiple data sources
US7966383B2 (en) * 2009-03-27 2011-06-21 Business Objects Software Ltd. Client-server systems and methods for accessing metadata information across a network using proxies
US20100281061A1 (en) * 2009-04-30 2010-11-04 International Business Machines Corporation Semantic Data Validation of Disjoint Data
US20100306235A1 (en) * 2009-05-28 2010-12-02 Yahoo! Inc. Real-Time Detection of Emerging Web Search Queries
US20110087647A1 (en) * 2009-10-13 2011-04-14 Alessio Signorini System and method for providing web search results to a particular computer user based on the popularity of the search results with other computer users
US8150841B2 (en) * 2010-01-20 2012-04-03 Microsoft Corporation Detecting spiking queries
US8260789B2 (en) * 2010-04-01 2012-09-04 Microsoft Corporation System and method for authority value obtained by defining ranking functions related to weight and confidence value

Also Published As

Publication number Publication date
US20110202513A1 (en) 2011-08-18
US9953083B2 (en) 2018-04-24

Similar Documents

Publication Publication Date Title
US20180203927A1 (en) System and method for determining an authority rank for real time searching
JP6122199B2 (en) System, method and storage medium for improving access to search results
US10496815B1 (en) System, method, and computer program for classifying monitored assets based on user labels and for detecting potential misuse of monitored assets based on the classifications
EP3063652B1 (en) Application programable interface (api) discovery
US20180365254A1 (en) Method and apparatus for processing information flow data
US20180054499A1 (en) Content delivery acceleration system
US8504569B2 (en) Apparatus and methods for providing answers to queries respective of a user based on user uniquifiers
US10887261B2 (en) Dynamic attachment delivery in emails for advanced malicious content filtering
CN110059269B (en) Page tracking method and device, electronic equipment and computer readable storage medium
US10979532B2 (en) Resource download method, electronic device, and apparatus
US9514176B2 (en) Database update notification method
CN112948397A (en) Data processing system, method, device and storage medium
US7970760B2 (en) System and method for automatic detection of needy queries
JP6683681B2 (en) Determining the contribution of various user interactions to conversions
CN109981533B (en) DDoS attack detection method, device, electronic equipment and storage medium
US10033737B2 (en) System and method for cross-cloud identity matching
US9251273B2 (en) Delivering a filtered search result
US9092338B1 (en) Multi-level caching event lookup
CN115065520A (en) Anti-crawler processing method and device, electronic equipment and readable storage medium
US11706117B1 (en) Message-based monitoring and action system
US11748354B2 (en) Data shape confidence
RU2775591C2 (en) Method and system for detecting abnormal crowdsourcing label
US20210056145A1 (en) Method and system for determining abnormal crowd-sourced label
US20190179809A1 (en) System and method for meaningful research investment metrology
CN113449886A (en) Data processing method, processing device, equipment and storage medium

Legal Events

Date Code Title Description
AS Assignment

Owner name: YAHOO| INC., CALIFORNIA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:SINGH, VIK;REEL/FRAME:045185/0402

Effective date: 20100216

Owner name: EXCALIBUR IP, LLC, CALIFORNIA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:YAHOO| INC.;REEL/FRAME:045573/0079

Effective date: 20160531

STPP Information on status: patent application and granting procedure in general

Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION

STPP Information on status: patent application and granting procedure in general

Free format text: NON FINAL ACTION MAILED

STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION