US10489469B2 - Systems and methods for semantic keyword analysis for paid search - Google Patents

Systems and methods for semantic keyword analysis for paid search Download PDF

Info

Publication number
US10489469B2
US10489469B2 US14/928,220 US201514928220A US10489469B2 US 10489469 B2 US10489469 B2 US 10489469B2 US 201514928220 A US201514928220 A US 201514928220A US 10489469 B2 US10489469 B2 US 10489469B2
Authority
US
United States
Prior art keywords
keywords
tool
content
algorithms
ensemble
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active, expires
Application number
US14/928,220
Other versions
US20160125462A1 (en
Inventor
Richard Israel Mallah
Akos Lajos Balogh
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Informite Inc
Marketmuse Inc
Original Assignee
Marketmuse Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Marketmuse Inc filed Critical Marketmuse Inc
Priority to US14/928,220 priority Critical patent/US10489469B2/en
Publication of US20160125462A1 publication Critical patent/US20160125462A1/en
Assigned to Informite Inc. reassignment Informite Inc. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: MALLAH, Richard Israel, BALOGH, Akos Lajos
Assigned to MARKETMUSE, INC. reassignment MARKETMUSE, INC. CHANGE OF NAME (SEE DOCUMENT FOR DETAILS). Assignors: Informite Inc.
Application granted granted Critical
Publication of US10489469B2 publication Critical patent/US10489469B2/en
Active legal-status Critical Current
Adjusted expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9538Presentation of query results
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2457Query processing with adaptation to user needs
    • G06F16/24575Query processing with adaptation to user needs using context
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2457Query processing with adaptation to user needs
    • G06F16/24578Query processing with adaptation to user needs using ranking
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/284Relational databases
    • G06F16/285Clustering or classification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/903Querying
    • G06F16/9032Query formulation
    • G06F16/90324Query formulation using system suggestions
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/903Querying
    • G06F16/9032Query formulation
    • G06F16/90332Natural language query formulation or dialogue systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • G06Q30/0241Advertisements
    • G06Q30/0251Targeted advertisements
    • G06Q30/0255Targeted advertisements based on user history
    • G06Q30/0256User search

Definitions

  • This disclosure generally relates to systems and methods for keyword research and analysis, and in particular, to keyword research and analysis with respect to paid search optimization.
  • search engine optimization entails considerations of how search engines work, what people search for, how people search (e.g., what terms people use to search for various topics), and the like.
  • owners of web pages may also attempt to optimize the effectiveness of targeting advertisements towards users of social media (e.g., optimizing targeting based on keywords used by users of social media).
  • owners of websites may attempt to manually research keywords typically related to topics of their websites, and attempt to utilize these manually researched keywords in their targeted internet advertising.
  • such methods may be cumbersome, time-consuming, and provide minimal beneficial effect for web visibility.
  • the present solution provides a new tool for keyword research and analysis for optimization of paid searches.
  • Various embodiments of the tool provide an efficient and user-friendly mechanism for identifying recommended keywords that may replace or supplement existing keywords associated with a user's paid searches.
  • a method for generating from one or more keywords a list of recommended keywords for using in paid search advertising includes identifying, via a tool, one or more keywords to be used in a paid search advertising campaign at an identified website.
  • the method may further include acquiring, by a crawler, content from a plurality of different web content sources via one or more networks.
  • the method may also include applying, by the tool, to the acquired content an ensemble of one or more key phrase extraction algorithms, one or more graph analyses algorithms and one or more natural language processing algorithms to identify a set of semantically relevant keywords ranked by a relevance score.
  • the method may further include generating, by the tool from the set of semantically relevant keywords, a knowledge graph of recommended keywords to replace or supplement the one of more keywords.
  • the method may further include outputting, by the tool based at least partially on the knowledge graph, an enumerated list of recommended keywords to one of replace or supplement the one or more keywords to be used for the paid search advertising campaign at the identified website.
  • the method further includes receiving, by the tool, the one or more keywords from the identified website.
  • the method further includes acquiring, by the crawler, content from the identified website.
  • the method further includes applying, by the tool, the ensemble to the content from the identified website.
  • the method further includes acquiring content, by a crawler, from the plurality of different web content sources including web sites, news articles, blog posts and keyword data.
  • the one or more key phrase extraction algorithms includes a Bayesian statistical ensemble.
  • the method further includes outputting, by the tool, the enumerated list of recommend keywords ranked by at least one of an attractiveness score, a volume score and a competition score.
  • the method further includes outputting, by the tool, the enumerated list of recommend keywords ranked by at least one of a relevance score.
  • the method further includes outputting, by the tool, the enumerated list of recommend keywords ranked by a cost-per-click value.
  • the one or more key phrase extraction algorithms include a Bayesian statistical ensemble and the ensemble performs a plurality of term ranking functions are performed including one or more of a core phrase term ranking function, a tail phrase term ranking function, a hyperdictionary graph traversal algorithm and/or a semantic knowledgebase path traversal score.
  • a system for generating from one or more keywords a list of recommended keywords for using in paid search advertising includes a tool configured to execute on a processor to receive an input of one or more keywords for which to generate a list of recommended keywords to use via paid search advertising at an identified website and one or more targeting attributes for targeting the paid search advertising at the identified website.
  • the system may further include a crawler configured to acquire content from the identified website based at least in part on the one or more target attributes.
  • the tool may be configured to apply to the acquired content an ensemble of one or more key phrase extraction algorithms, one or more graph analyses algorithms and one or more natural language processing algorithms to identify a set of semantically relevant keywords ranked by a relevance score.
  • the tool may be configured to generate from the set of semantically relevant keywords, a knowledge graph of recommended keywords to replace or supplement the one of more keywords.
  • the tool may further be configured to output, based at least partially on the knowledge graph, an enumerated list of recommended keywords to one of replace or supplement the one or more keywords to be used for the paid search advertising campaign at the identified website.
  • the tool is further configured to receive the one or more keywords from the identified website.
  • the crawler is further configured to acquire content from the identified website.
  • the tool is further configured to apply the ensemble to the content from the identified website.
  • the crawler is further configured to acquire from the plurality of different web content sources including web sites, news articles, blog posts and keyword data.
  • the one or more key phrase extraction algorithms includes a Bayesian statistical ensemble.
  • the tool is further configured to output the enumerated list of recommend keywords ranked by at least one of an attractiveness score, a volume score and a competition score.
  • the tool is further configured to output the enumerated list of recommend keywords ranked by at least one of a relevance score.
  • the tool is further configured to output the enumerated list of recommend keywords ranked by a cost-per-click value.
  • the ensemble is further configured to perform a plurality of term ranking functions including one or more of a core phrase term ranking function, a tail phrase term ranking function, a hyperdictionary graph traversal algorithm and/or a semantic knowledgebase path traversal score.
  • FIG. 1A is a block diagram depicting an embodiment of a network environment comprising client device in communication with server device;
  • FIG. 1B is a block diagram depicting a cloud computing environment comprising client device in communication with cloud service providers;
  • FIGS. 1C and 1D are block diagrams depicting embodiments of computing devices useful in connection with the methods and systems described herein.
  • FIG. 2A is an embodiment of a system comprising a keyword research and analysis tool
  • FIG. 2B is an embodiment of a screen shot of a tool page
  • FIG. 2C is an embodiment of a screen shot of a recommended keywords page
  • FIG. 2D is another embodiment of a screen shot of the tool page
  • FIG. 2E is yet another embodiment of a screen shot of the tool page.
  • FIG. 3 is a flow diagram depicting an embodiment of a method of using the tool.
  • Section A describes a network environment and computing environment which may be useful for practicing embodiments described herein.
  • Section B describes embodiments of systems and methods for a keyword research and analysis tool.
  • Section C describes embodiments of systems and methods for a crawler.
  • Section D describes embodiments of a storage medium including an ensemble of algorithms.
  • Section E describes embodiments of systems and methods for a tool.
  • FIG. 1A an embodiment of a network environment is depicted.
  • the network environment includes one or more clients 102 a - 102 n (also generally referred to as local machine(s) 102 , client(s) 102 , client node(s) 102 , client machine(s) 102 , client computer(s) 102 , client device(s) 102 , endpoint(s) 102 , or endpoint node(s) 102 ) in communication with one or more servers 106 a - 106 n (also generally referred to as server(s) 106 , node 106 , or remote machine(s) 106 ) via one or more networks 104 .
  • a client 102 has the capacity to function as both a client node seeking access to resources provided by a server and as a server providing access to hosted resources for other clients 102 a - 102 n.
  • FIG. 1A shows a network 104 between the clients 102 and the servers 106
  • the clients 102 and the servers 106 may be on the same network 104 .
  • a network 104 ′ (not shown) may be a private network and a network 104 may be a public network.
  • a network 104 may be a private network and a network 104 ′ a public network.
  • networks 104 and 104 ′ may both be private networks.
  • the network 104 may be connected via wired or wireless links.
  • Wired links may include Digital Subscriber Line (DSL), coaxial cable lines, or optical fiber lines.
  • the wireless links may include BLUETOOTH, Wi-Fi, Worldwide Interoperability for Microwave Access (WiMAX), an infrared channel or satellite band.
  • the wireless links may also include any cellular network standards used to communicate among mobile devices, including standards that qualify as 1G, 2G, 3G, or 4G.
  • the network standards may qualify as one or more generation of mobile telecommunication standards by fulfilling a specification or standards such as the specifications maintained by International Telecommunication Union.
  • the 3G standards may correspond to the International Mobile Telecommunications-2000 (IMT-2000) specification, and the 4G standards may correspond to the International Mobile Telecommunications Advanced (IMT-Advanced) specification.
  • cellular network standards include AMPS, GSM, GPRS, UMTS, LTE, LTE Advanced, Mobile WiMAX, and WiMAX-Advanced.
  • Cellular network standards may use various channel access methods e.g. FDMA, TDMA, CDMA, or SDMA.
  • different types of data may be transmitted via different links and standards.
  • the same types of data may be transmitted via different links and standards.
  • the network 104 may be any type and/or form of network.
  • the geographical scope of the network 104 may vary widely and the network 104 can be a body area network (BAN), a personal area network (PAN), a local-area network (LAN), e.g. Intranet, a metropolitan area network (MAN), a wide area network (WAN), or the Internet.
  • the topology of the network 104 may be of any form and may include, e.g., any of the following: point-to-point, bus, star, ring, mesh, or tree.
  • the network 104 may be an overlay network which is virtual and sits on top of one or more layers of other networks 104 ′.
  • the network 104 may be of any such network topology as known to those ordinarily skilled in the art capable of supporting the operations described herein.
  • the network 104 may utilize different techniques and layers or stacks of protocols, including, e.g., the Ethernet protocol, the internet protocol suite (TCP/IP), the ATM (Asynchronous Transfer Mode) technique, the SONET (Synchronous Optical Networking) protocol, or the SDH (Synchronous Digital Hierarchy) protocol.
  • the TCP/IP internet protocol suite may include application layer, transport layer, internet layer (including, e.g., IPv6), or the link layer.
  • the network 104 may be a type of a broadcast network, a telecommunications network, a data communication network, or a computer network.
  • the system may include multiple, logically-grouped servers 106 .
  • the logical group of servers may be referred to as a server farm 38 or a machine farm 38 .
  • the servers 106 may be geographically dispersed.
  • a machine farm 38 may be administered as a single entity.
  • the machine farm 38 includes a plurality of machine farms 38 .
  • the servers 106 within each machine farm 38 can be heterogeneous—one or more of the servers 106 or machines 106 can operate according to one type of operating system platform (e.g., WINDOWS NT, manufactured by Microsoft Corp. of Redmond, Wash.), while one or more of the other servers 106 can operate on according to another type of operating system platform (e.g., Unix, Linux, or Mac OS X).
  • operating system platform e.g., Unix, Linux, or Mac OS X
  • servers 106 in the machine farm 38 may be stored in high-density rack systems, along with associated storage systems, and located in an enterprise data center. In this embodiment, consolidating the servers 106 in this way may improve system manageability, data security, the physical security of the system, and system performance by locating servers 106 and high performance storage systems on localized high performance networks. Centralizing the servers 106 and storage systems and coupling them with advanced system management tools allows more efficient use of server resources.
  • the servers 106 of each machine farm 38 do not need to be physically proximate to another server 106 in the same machine farm 38 .
  • the group of servers 106 logically grouped as a machine farm 38 may be interconnected using a wide-area network (WAN) connection or a metropolitan-area network (MAN) connection.
  • WAN wide-area network
  • MAN metropolitan-area network
  • a machine farm 38 may include servers 106 physically located in different continents or different regions of a continent, country, state, city, campus, or room. Data transmission speeds between servers 106 in the machine farm 38 can be increased if the servers 106 are connected using a local-area network (LAN) connection or some form of direct connection.
  • LAN local-area network
  • a heterogeneous machine farm 38 may include one or more servers 106 operating according to a type of operating system, while one or more other servers 106 execute one or more types of hypervisors rather than operating systems.
  • hypervisors may be used to emulate virtual hardware, partition physical hardware, virtualize physical hardware, and execute virtual machines that provide access to computing environments, allowing multiple operating systems to run concurrently on a host computer.
  • Native hypervisors may run directly on the host computer.
  • Hypervisors may include VMware ESX/ESXi, manufactured by VMWare, Inc., of Palo Alto, Calif.; the Xen hypervisor, an open source product whose development is overseen by Citrix Systems, Inc.; the HYPER-V hypervisors provided by Microsoft or others.
  • Hosted hypervisors may run within an operating system on a second software level. Examples of hosted hypervisors may include VMware Workstation and VIRTUALBOX.
  • Management of the machine farm 38 may be de-centralized.
  • one or more servers 106 may comprise components, subsystems and modules to support one or more management services for the machine farm 38 .
  • one or more servers 106 provide functionality for management of dynamic data, including techniques for handling failover, data replication, and increasing the robustness of the machine farm 38 .
  • Each server 106 may communicate with a persistent store and, in some embodiments, with a dynamic store.
  • Server 106 may be a file server, application server, web server, proxy server, appliance, network appliance, gateway, gateway server, virtualization server, deployment server, SSL VPN server, or firewall.
  • the server 106 may be referred to as a remote machine or a node.
  • a plurality of nodes 290 may be in the path between any two communicating servers.
  • a cloud computing environment may provide client 102 with one or more resources provided by a network environment.
  • the cloud computing environment may include one or more clients 102 a - 102 n , in communication with the cloud 108 over one or more networks 104 .
  • Clients 102 may include, e.g., thick clients, thin clients, and zero clients.
  • a thick client may provide at least some functionality even when disconnected from the cloud 108 or servers 106 .
  • a thin client or a zero client may depend on the connection to the cloud 108 or server 106 to provide functionality.
  • a zero client may depend on the cloud 108 or other networks 104 or servers 106 to retrieve operating system data for the client device.
  • the cloud 108 may include back end platforms, e.g., servers 106 , storage, server farms or data centers.
  • the cloud 108 may be public, private, or hybrid.
  • Public clouds may include public servers 106 that are maintained by third parties to the clients 102 or the owners of the clients.
  • the servers 106 may be located off-site in remote geographical locations as disclosed above or otherwise.
  • Public clouds may be connected to the servers 106 over a public network.
  • Private clouds may include private servers 106 that are physically maintained by clients 102 or owners of clients.
  • Private clouds may be connected to the servers 106 over a private network 104 .
  • Hybrid clouds 108 may include both the private and public networks 104 and servers 106 .
  • the cloud 108 may also include a cloud based delivery, e.g. Software as a Service (SaaS) 110 , Platform as a Service (PaaS) 112 , and Infrastructure as a Service (IaaS) 114 .
  • SaaS Software as a Service
  • PaaS Platform as a Service
  • IaaS Infrastructure as a Service
  • IaaS may refer to a user renting the use of infrastructure resources that are needed during a specified time period.
  • IaaS providers may offer storage, networking, servers or virtualization resources from large pools, allowing the users to quickly scale up by accessing more resources as needed. Examples of IaaS include AMAZON WEB SERVICES provided by Amazon.com, Inc., of Seattle, Wash., RACKSPACE CLOUD provided by Rackspace US, Inc., of San Antonio, Tex., Google Compute Engine provided by Google Inc.
  • PaaS providers may offer functionality provided by IaaS, including, e.g., storage, networking, servers or virtualization, as well as additional resources such as, e.g., the operating system, middleware, or runtime resources. Examples of PaaS include WINDOWS AZURE provided by Microsoft Corporation of Redmond, Wash., Google App Engine provided by Google Inc., and HEROKU provided by Heroku, Inc. of San Francisco, Calif. SaaS providers may offer the resources that PaaS provides, including storage, networking, servers, virtualization, operating system, middleware, or runtime resources.
  • SaaS providers may offer additional resources including, e.g., data and application resources.
  • SaaS include GOOGLE APPS provided by Google Inc., SALESFORCE provided by Salesforce.com Inc. of San Francisco, Calif., or OFFICE 365 provided by Microsoft Corporation.
  • Examples of SaaS may also include data storage providers, e.g. DROPBOX provided by Dropbox, Inc. of San Francisco, Calif., Microsoft SKYDRIVE provided by Microsoft Corporation, Google Drive provided by Google Inc., or Apple ICLOUD provided by Apple Inc. of Cupertino, Calif.
  • Clients 102 may access IaaS resources with one or more IaaS standards, including, e.g., Amazon Elastic Compute Cloud (EC2), Open Cloud Computing Interface (OCCI), Cloud Infrastructure Management Interface (CIMI), or OpenStack standards.
  • IaaS standards may allow clients access to resources over HTTP, and may use Representational State Transfer (REST) protocol or Simple Object Access Protocol (SOAP).
  • REST Representational State Transfer
  • SOAP Simple Object Access Protocol
  • Clients 102 may access PaaS resources with different PaaS interfaces.
  • PaaS interfaces use HTTP packages, standard Java APIs, JavaMail API, Java Data Objects (JDO), Java Persistence API (JPA), Python APIs, web integration APIs for different programming languages including, e.g., Rack for Ruby, WSGI for Python, or PSGI for Perl, or other APIs that may be built on REST, HTTP, XML, or other protocols.
  • Clients 102 may access SaaS resources through the use of web-based user interfaces, provided by a web browser (e.g. GOOGLE CHROME, Microsoft INTERNET EXPLORER, or Mozilla Firefox provided by Mozilla Foundation of Mountain View, Calif.).
  • Clients 102 may also access SaaS resources through smartphone or tablet applications, including, e.g., Salesforce Sales Cloud, or Google Drive app. Clients 102 may also access SaaS resources through the client operating system, including, e.g., Windows file system for DROPBOX.
  • access to IaaS, PaaS, or SaaS resources may be authenticated.
  • a server or authentication server may authenticate a user via security certificates, HTTPS, or API keys.
  • API keys may include various encryption standards such as, e.g., Advanced Encryption Standard (AES).
  • Data resources may be sent over Transport Layer Security (TLS) or Secure Sockets Layer (SSL).
  • TLS Transport Layer Security
  • SSL Secure Sockets Layer
  • the client 102 and server 106 may be deployed as and/or executed on any type and form of computing device, e.g. a computer, network device or appliance capable of communicating on any type and form of network and performing the operations described herein.
  • FIGS. 1C and 1D depict block diagrams of a computing device 100 useful for practicing an embodiment of the client 102 or a server 106 .
  • each computing device 100 includes a central processing unit 121 , and a main memory unit 122 .
  • main memory unit 122 main memory
  • a computing device 100 may include a storage device 128 , an installation device 116 , a network interface 118 , an I/O controller 123 , display devices 124 a - 124 n , a keyboard 126 and a pointing device 127 , e.g. a mouse.
  • the storage device 128 may include, without limitation, an operating system, software, and a software of a tool for keyword research and analysis 120 .
  • each computing device 100 may also include additional optional elements, e.g. a memory port 103 , a bridge 170 , one or more input/output devices 130 a - 130 n (generally referred to using reference numeral 130 ), and a cache memory 140 in communication with the central processing unit 121 .
  • the central processing unit 121 is any logic circuitry that responds to and processes instructions fetched from the main memory unit 122 .
  • the central processing unit 121 is provided by a microprocessor unit, e.g.: those manufactured by Intel Corporation of Mountain View, Calif.; those manufactured by Motorola Corporation of Schaumburg, Ill.; the ARM processor and TEGRA system on a chip (SoC) manufactured by Nvidia of Santa Clara, Calif.; the POWER7 processor, those manufactured by International Business Machines of White Plains, N.Y.; or those manufactured by Advanced Micro Devices of Sunnyvale, Calif.
  • the computing device 100 may be based on any of these processors, or any other processor capable of operating as described herein.
  • the central processing unit 121 may utilize instruction level parallelism, thread level parallelism, different levels of cache, and multi-core processors.
  • a multi-core processor may include two or more processing units on a single computing component. Examples of a multi-core processors include the AMD PHENOM IIX2, INTEL CORE i5 and INTEL CORE i7.
  • Main memory unit 122 may include one or more memory chips capable of storing data and allowing any storage location to be directly accessed by the microprocessor 121 .
  • Main memory unit 122 may be volatile and faster than storage 128 memory.
  • Main memory units 122 may be Dynamic random access memory (DRAM) or any variants, including static random access memory (SRAM), Burst SRAM or SynchBurst SRAM (BSRAM), Fast Page Mode DRAM (FPM DRAM), Enhanced DRAM (EDRAM), Extended Data Output RAM (EDO RAM), Extended Data Output DRAM (EDO DRAM), Burst Extended Data Output DRAM (BEDO DRAM), Single Data Rate Synchronous DRAM (SDR SDRAM), Double Data Rate SDRAM (DDR SDRAM), Direct Rambus DRAM (DRDRAM), or Extreme Data Rate DRAM (XDR DRAM).
  • DRAM Dynamic random access memory
  • SRAM static random access memory
  • BSRAM Burst SRAM or SynchBurst SRAM
  • FPM DRAM Fast Page Mode DRAM
  • the main memory 122 or the storage 128 may be non-volatile; e.g., non-volatile read access memory (NVRAM), flash memory non-volatile static RAM (nvSRAM), Ferroelectric RAM (FeRAM), Magnetoresistive RAM (MRAM), Phase-change memory (PRAM), conductive-bridging RAM (CBRAM), Silicon-Oxide-Nitride-Oxide-Silicon (SONOS), Resistive RAM (RRAM), Racetrack, Nano-RAM (NRAM), or Millipede memory.
  • NVRAM non-volatile read access memory
  • nvSRAM flash memory non-volatile static RAM
  • FeRAM Ferroelectric RAM
  • MRAM Magnetoresistive RAM
  • PRAM Phase-change memory
  • CBRAM conductive-bridging RAM
  • SONOS Silicon-Oxide-Nitride-Oxide-Silicon
  • Resistive RAM RRAM
  • Racetrack Nano-RAM
  • Millipede memory Millipede memory
  • FIG. 1C depicts an embodiment of a computing device 100 in which the processor communicates directly with main memory 122 via a memory port 103 .
  • the main memory 122 may be DRDRAM.
  • FIG. 1D depicts an embodiment in which the main processor 121 communicates directly with cache memory 140 via a secondary bus, sometimes referred to as a backside bus.
  • the main processor 121 communicates with cache memory 140 using the system bus 150 .
  • Cache memory 140 typically has a faster response time than main memory 122 and is typically provided by SRAM, BSRAM, or EDRAM.
  • the processor 121 communicates with various I/O devices 130 via a local system bus 150 .
  • Various buses may be used to connect the central processing unit 121 to any of the I/O devices 130 , including a PCI bus, a PCI-X bus, or a PCI-Express bus, or a NuBus.
  • the processor 121 may use an Advanced Graphics Port (AGP) to communicate with the display 124 or the I/O controller 123 for the display 124 .
  • FIG. 1D depicts an embodiment of a computer 100 in which the main processor 121 communicates directly with I/O device 130 b or other processors 121 ′ via HYPERTRANSPORT, RAPIDIO, or INFINIBAND communications technology.
  • FIG. 1D also depicts an embodiment in which local busses and direct communication are mixed: the processor 121 communicates with I/O device 130 a using a local interconnect bus while communicating with I/O device 130 b directly.
  • I/O devices 130 a - 130 n may be present in the computing device 100 .
  • Input devices may include keyboards, mice, trackpads, trackballs, touchpads, touch mice, multi-touch touchpads and touch mice, microphones, multi-array microphones, drawing tablets, cameras, single-lens reflex camera (SLR), digital SLR (DSLR), CMOS sensors, accelerometers, infrared optical sensors, pressure sensors, magnetometer sensors, angular rate sensors, depth sensors, proximity sensors, ambient light sensors, gyroscopic sensors, or other sensors.
  • Output devices may include video displays, graphical displays, speakers, headphones, inkjet printers, laser printers, and 3D printers.
  • Devices 130 a - 130 n may include a combination of multiple input or output devices, including, e.g., Microsoft KINECT, Nintendo Wiimote for the WII, Nintendo WII U GAMEPAD, or Apple IPHONE. Some devices 130 a - 130 n allow gesture recognition inputs through combining some of the inputs and outputs. Some devices 130 a - 130 n provides for facial recognition which may be utilized as an input for different purposes including authentication and other commands. Some devices 130 a - 130 n provides for voice recognition and inputs, including, e.g., Microsoft KINECT, SIRI for IPHONE by Apple, Google Now or Google Voice Search.
  • Additional devices 130 a - 130 n have both input and output capabilities, including, e.g., haptic feedback devices, touchscreen displays, or multi-touch displays.
  • Touchscreen, multi-touch displays, touchpads, touch mice, or other touch sensing devices may use different technologies to sense touch, including, e.g., capacitive, surface capacitive, projected capacitive touch (PCT), in-cell capacitive, resistive, infrared, waveguide, dispersive signal touch (DST), in-cell optical, surface acoustic wave (SAW), bending wave touch (BWT), or force-based sensing technologies.
  • PCT surface capacitive, projected capacitive touch
  • DST dispersive signal touch
  • SAW surface acoustic wave
  • BWT bending wave touch
  • Some multi-touch devices may allow two or more contact points with the surface, allowing advanced functionality including, e.g., pinch, spread, rotate, scroll, or other gestures.
  • Some touchscreen devices including, e.g., Microsoft PIXELSENSE or Multi-Touch Collaboration Wall, may have larger surfaces, such as on a table-top or on a wall, and may also interact with other electronic devices.
  • Some I/O devices 130 a - 130 n , display devices 124 a - 124 n or group of devices may be augment reality devices.
  • the I/O devices may be controlled by an I/O controller 123 as shown in FIG. 1C .
  • the I/O controller may control one or more I/O devices, such as, e.g., a keyboard 126 and a pointing device 127 , e.g., a mouse or optical pen. Furthermore, an I/O device may also provide storage and/or an installation medium 116 for the computing device 100 . In still other embodiments, the computing device 100 may provide USB connections (not shown) to receive handheld USB storage devices. In further embodiments, an I/O device 130 may be a bridge between the system bus 150 and an external communication bus, e.g. a USB bus, a SCSI bus, a FireWire bus, an Ethernet bus, a Gigabit Ethernet bus, a Fibre Channel bus, or a Thunderbolt bus.
  • an external communication bus e.g. a USB bus, a SCSI bus, a FireWire bus, an Ethernet bus, a Gigabit Ethernet bus, a Fibre Channel bus, or a Thunderbolt bus.
  • Display devices 124 a - 124 n may be connected to I/O controller 123 .
  • Display devices may include, e.g., liquid crystal displays (LCD), thin film transistor LCD (TFT-LCD), blue phase LCD, electronic papers (e-ink) displays, flexile displays, light emitting diode displays (LED), digital light processing (DLP) displays, liquid crystal on silicon (LCOS) displays, organic light-emitting diode (OLED) displays, active-matrix organic light-emitting diode (AMOLED) displays, liquid crystal laser displays, time-multiplexed optical shutter (TMOS) displays, or 3D displays. Examples of 3D displays may use, e.g.
  • Display devices 124 a - 124 n may also be a head-mounted display (HMD). In some embodiments, display devices 124 a - 124 n or the corresponding I/O controllers 123 may be controlled through or have hardware support for OPENGL or DIRECTX API or other graphics libraries.
  • HMD head-mounted display
  • the computing device 100 may include or connect to multiple display devices 124 a - 124 n , which each may be of the same or different type and/or form.
  • any of the I/O devices 130 a - 130 n and/or the I/O controller 123 may include any type and/or form of suitable hardware, software, or combination of hardware and software to support, enable or provide for the connection and use of multiple display devices 124 a - 124 n by the computing device 100 .
  • the computing device 100 may include any type and/or form of video adapter, video card, driver, and/or library to interface, communicate, connect or otherwise use the display devices 124 a - 124 n .
  • a video adapter may include multiple connectors to interface to multiple display devices 124 a - 124 n .
  • the computing device 100 may include multiple video adapters, with each video adapter connected to one or more of the display devices 124 a - 124 n .
  • any portion of the operating system of the computing device 100 may be configured for using multiple displays 124 a - 124 n .
  • one or more of the display devices 124 a - 124 n may be provided by one or more other computing devices 100 a or 100 b connected to the computing device 100 , via the network 104 .
  • software may be designed and constructed to use another computer's display device as a second display device 124 a for the computing device 100 .
  • a second display device 124 a for the computing device 100 .
  • an Apple iPad may connect to a computing device 100 and use the display of the device 100 as an additional display screen that may be used as an extended desktop.
  • a computing device 100 may be configured to have multiple display devices 124 a - 124 n.
  • the computing device 100 may comprise a storage device 128 (e.g. one or more hard disk drives or redundant arrays of independent disks) for storing an operating system or other related software, and for storing application software programs such as any program related to the software 120 for the experiment tracker system.
  • storage device 128 include, e.g., hard disk drive (HDD); optical drive including CD drive, DVD drive, or BLU-RAY drive; solid-state drive (SSD); USB flash drive; or any other device suitable for storing data.
  • Some storage devices may include multiple volatile and non-volatile memories, including, e.g., solid state hybrid drives that combine hard disks with solid state cache.
  • Some storage device 128 may be non-volatile, mutable, or read-only. Some storage device 128 may be internal and connect to the computing device 100 via a bus 150 . Some storage device 128 may be external and connect to the computing device 100 via a I/O device 130 that provides an external bus. Some storage device 128 may connect to the computing device 100 via the network interface 118 over a network 104 , including, e.g., the Remote Disk for MACBOOK AIR by Apple. Some client devices 100 may not require a non-volatile storage device 128 and may be thin clients or zero clients 102 . Some storage device 128 may also be used as a installation device 116 , and may be suitable for installing software and programs.
  • the operating system and the software can be run from a bootable medium, for example, a bootable CD, e.g. KNOPPIX, a bootable CD for GNU/Linux that is available as a GNU/Linux distribution from knoppix.net.
  • a bootable CD e.g. KNOPPIX
  • a bootable CD for GNU/Linux that is available as a GNU/Linux distribution from knoppix.net.
  • Client device 100 may also install software or application from an application distribution platform.
  • application distribution platforms include the App Store for iOS provided by Apple, Inc., the Mac App Store provided by Apple, Inc., GOOGLE PLAY for Android OS provided by Google Inc., Chrome Webstore for CHROME OS provided by Google Inc., and Amazon Appstore for Android OS and KINDLE FIRE provided by Amazon.com, Inc.
  • An application distribution platform may facilitate installation of software on a client device 102 .
  • An application distribution platform may include a repository of applications on a server 106 or a cloud 108 , which the clients 102 a - 102 n may access over a network 104 .
  • An application distribution platform may include application developed and provided by various developers. A user of a client device 102 may select, purchase and/or download an application via the application distribution platform.
  • the computing device 100 may include a network interface 118 to interface to the network 104 through a variety of connections including, but not limited to, standard telephone lines LAN or WAN links (e.g., 802.11, T1, T3, Gigabit Ethernet, Infiniband), broadband connections (e.g., ISDN, Frame Relay, ATM, Gigabit Ethernet, Ethernet-over-SONET, ADSL, VDSL, BPON, GPON, fiber optical including FiOS), wireless connections, or some combination of any or all of the above.
  • standard telephone lines LAN or WAN links e.g., 802.11, T1, T3, Gigabit Ethernet, Infiniband
  • broadband connections e.g., ISDN, Frame Relay, ATM, Gigabit Ethernet, Ethernet-over-SONET, ADSL, VDSL, BPON, GPON, fiber optical including FiOS
  • wireless connections or some combination of any or all of the above.
  • Connections can be established using a variety of communication protocols (e.g., TCP/IP, Ethernet, ARCNET, SONET, SDH, Fiber Distributed Data Interface (FDDI), IEEE 802.11a/b/g/n/ac CDMA, GSM, WiMax and direct asynchronous connections).
  • the computing device 100 communicates with other computing devices 100 ′ via any type and/or form of gateway or tunneling protocol e.g. Secure Socket Layer (SSL) or Transport Layer Security (TLS), or the Citrix Gateway Protocol manufactured by Citrix Systems, Inc. of Ft. Lauderdale, Fla.
  • SSL Secure Socket Layer
  • TLS Transport Layer Security
  • Citrix Gateway Protocol manufactured by Citrix Systems, Inc. of Ft. Lauderdale, Fla.
  • the network interface 118 may comprise a built-in network adapter, network interface card, PCMCIA network card, EXPRESSCARD network card, card bus network adapter, wireless network adapter, USB network adapter, modem or any other device suitable for interfacing the computing device 100 to any type of network capable of communication and performing the operations described herein.
  • a computing device 100 of the sort depicted in FIGS. 1B and 1C may operate under the control of an operating system, which controls scheduling of tasks and access to system resources.
  • the computing device 100 can be running any operating system such as any of the versions of the MICROSOFT WINDOWS operating systems, the different releases of the Unix and Linux operating systems, any version of the MAC OS for Macintosh computers, any embedded operating system, any real-time operating system, any open source operating system, any proprietary operating system, any operating systems for mobile computing devices, or any other operating system capable of running on the computing device and performing the operations described herein.
  • Typical operating systems include, but are not limited to: WINDOWS 2000, WINDOWS Server 2012, WINDOWS CE, WINDOWS Phone, WINDOWS XP, WINDOWS VISTA, and WINDOWS 7, WINDOWS RT, and WINDOWS 8 all of which are manufactured by Microsoft Corporation of Redmond, Wash.; MAC OS and iOS, manufactured by Apple, Inc. of Cupertino, Calif.; and Linux, a freely-available operating system, e.g. Linux Mint distribution (“distro”) or Ubuntu, distributed by Canonical Ltd. of London, United Kingdom; or Unix or other Unix-like derivative operating systems; and Android, designed by Google, of Mountain View, Calif., among others.
  • Some operating systems including, e.g., the CHROME OS by Google, may be used on zero clients or thin clients, including, e.g., CHROMEBOOKS.
  • the computer system 100 can be any workstation, telephone, desktop computer, laptop or notebook computer, netbook, ULTRABOOK, tablet, server, handheld computer, mobile telephone, smartphone or other portable telecommunications device, media playing device, a gaming system, mobile computing device, or any other type and/or form of computing, telecommunications or media device that is capable of communication.
  • the computer system 100 has sufficient processor power and memory capacity to perform the operations described herein.
  • the computing device 100 may have different processors, operating systems, and input devices consistent with the device.
  • the Samsung GALAXY smartphones e.g., operate under the control of Android operating system developed by Google, Inc. GALAXY smartphones receive input via a touch interface.
  • the computing device 100 is a gaming system.
  • the computer system 100 may comprise a PLAYSTATION 3, or PERSONAL PLAYSTATION PORTABLE (PSP), or a PLAYSTATION VITA device manufactured by the Sony Corporation of Tokyo, Japan, a NINTENDO DS, NINTENDO 3DS, NINTENDO WII, or a NINTENDO WII U device manufactured by Nintendo Co., Ltd., of Kyoto, Japan, an XBOX 360 device manufactured by the Microsoft Corporation of Redmond, Wash.
  • the computing device 100 is a digital audio player such as the Apple IPOD, IPOD Touch, and IPOD NANO lines of devices, manufactured by Apple Computer of Cupertino, Calif.
  • Some digital audio players may have other functionality, including, e.g., a gaming system or any functionality made available by an application from a digital application distribution platform.
  • the IPOD Touch may access the Apple App Store.
  • the computing device 100 is a portable media player or digital audio player supporting file formats including, but not limited to, MP3, WAV, M4A/AAC, WMA Protected AAC, RIFF, Audible audiobook, Apple Lossless audio file formats and .mov, .m4v, and .mp4 MPEG-4 (H.264/MPEG-4 AVC) video file formats.
  • file formats including, but not limited to, MP3, WAV, M4A/AAC, WMA Protected AAC, RIFF, Audible audiobook, Apple Lossless audio file formats and .mov, .m4v, and .mp4 MPEG-4 (H.264/MPEG-4 AVC) video file formats.
  • the computing device 100 is a tablet e.g. the IPAD line of devices by Apple; GALAXY TAB family of devices by Samsung; or KINDLE FIRE, by Amazon.com, Inc. of Seattle, Wash.
  • the computing device 100 is a eBook reader, e.g. the KINDLE family of devices by Amazon.com, or NOOK family of devices by Barnes & Noble, Inc. of New York City, N.Y.
  • the communications device 102 includes a combination of devices, e.g. a smartphone combined with a digital audio player or portable media player.
  • a smartphone e.g. the IPHONE family of smartphones manufactured by Apple, Inc.; a Samsung GALAXY family of smartphones manufactured by Samsung, Inc; or a Motorola DROID family of smartphones.
  • the communications device 102 is a laptop or desktop computer equipped with a web browser and a microphone and speaker system, e.g. a telephony headset.
  • the communications devices 102 are web-enabled and can receive and initiate phone calls.
  • a laptop or desktop computer is also equipped with a webcam or other video capture device that enables video chat and video call.
  • the status of one or more machines 102 , 106 in the network 104 is monitored, generally as part of network management.
  • the status of a machine may include an identification of load information (e.g., the number of processes on the machine, CPU and memory utilization), of port information (e.g., the number of available communication ports and the port addresses), or of session status (e.g., the duration and type of processes, and whether a process is active or idle).
  • this information may be identified by a plurality of metrics, and the plurality of metrics can be applied at least in part towards decisions in load distribution, network traffic management, and network failure recovery as well as any aspects of operations of the present solution described herein.
  • Systems and methods of the present solution are directed to performing keyword research and analysis to generate a list of topically-related and conceptually related keywords to a specific topic, for example, for purposes of search engine optimization (SEO) for paid searches.
  • SEO search engine optimization
  • a keyword tool can perform semantic keyword research and recommendations.
  • semantic keyword research e.g., semantic SEO keyword research
  • a tool may generate keywords that are conceptually related to the topic that a user is marketing.
  • the tool can generate recommended keywords such as “pet food” and “doggy treats.”
  • the keywords suggestions may be ranked by relevance scores, a measure of topical relevance, saving a user from hours of manual keyword research effort.
  • traditional applications include keyword research for prospecting and new clients, identifying new keywords for existing clients, and identifying relevant head terms for semantic SEO applications.
  • the systems and methods described herein can be used alongside a long-tail keyword tool (e.g., Google Keyword Planner) to identify new long-tail opportunities. Search engine marketing campaigns can be optimized with semantic machine learning.
  • the present disclosure is directed to system and methods for performing social advertising research.
  • the keyword tool can identify high-value keyword buys on various social media sites such as Facebook, Twitter, LinkedIn, and other social media advertising.
  • the present keyword tool can generate suggestions that are topically-related, allowing advertisers to quickly identify the most relevant and best-performing terms to be used, for example, for Facebook Interest Group targeting, Twitter keyword/hashtag targeting, and any other display advertising system that utilizes keyword-based targeting.
  • aspects of the present disclosure are directed to a keyword tool that can generate a list of topically-related and conceptually related keywords for use in user-targeted marketing and advertising.
  • the keyword tool may be used to build keyword strategy for advertising.
  • the keyword search tool can generate a target keyword list with respect to paid searches and targeted advertising or identify keyword candidates.
  • a server 106 including a keyword research and analysis tool (“keyword tool”) 202 may also be connected to the network 104 . Accordingly, the keyword tool 202 may be connected to the client(s) 102 a - 102 n and the server(s) 106 a - 106 n via the network 104 .
  • the server(s) 106 a - 106 n may each be a content source 206 A for providing content for research and analysis by the keyword tool 202 .
  • the content may include data corresponding to websites, news articles, blog posts, social networking sites, keyword data, and/or any other suitable information for use with the keyword tool 202 .
  • the keyword tool 202 may include one or more components or modules for performing various functions corresponding to keyword research and analysis.
  • the keyword tool 202 includes a tool 202 a , an ensemble of algorithms 202 b , a crawler 202 c , and a database 202 d . Each of these components are described in further detail below.
  • the keyword tool 202 may include less or more components, depending on the desired functions to be implemented by the keyword tool 202 .
  • one embodiment of the keyword tool 202 may include the tool 202 a and the ensemble of algorithms 202 b only.
  • the crawler 202 c may be a tool for parsing and collecting relevant content over the web.
  • each of the other tools at the keyword tool 202 such as, but not limited to, the tool 202 a may be connected to the crawler 202 c and may operate in conjunction with the crawler 202 c .
  • the other tools of the keyword tool 202 may receive the content acquired by the crawler 202 c and perform various operations on the content.
  • the crawler 202 c at the keyword tool 202 may use one or more keywords to search the web for a plurality of different relevant web content sources.
  • the crawler 202 c may acquire content by crawling and searching the web for the one or more keywords and any content related to the keywords.
  • the crawler 202 c may be configured to acquire content from a variety of media, such as, but not limited to, websites, news articles, blog posts, keyword data (e.g., stored in the database 202 d of the keyword tool 202 ), internet forums, social networking sites, advertising sites, and/or the like.
  • the content to be acquired by the crawler 202 c may be located at servers 106 a - 106 n .
  • the keyword tool 202 may analyze the acquired content to build a collection of sources (e.g., articles, blogs, social media, etc.) related to the input keywords.
  • the crawler 202 c may be configured to perform advanced semantic processing, such as, but not limited to, stemming and lemmatization.
  • the database 202 d may include data acquired from one or more other sources (e.g., data acquired by the crawler 202 c ).
  • the database 202 d may include keyword data acquired or obtained from a search engine.
  • the database 202 d may include keyword data acquired or obtained from a web site.
  • the database 202 d may include keyword data acquired or obtained from a social networking site.
  • the database 202 d may include keyword data acquired or obtained from a third party that aggregates data and provides data for use or purchase. Any of the systems and methods described herein, such as any of the algorithms may use the data in the database 202 d for any of the computations described herein, such as keyword recommendations.
  • the crawler 202 c may execute any suitable search software for crawling a given website.
  • the crawler 202 c may be configured to crawl the web by jumping from webpage to webpage.
  • the crawler 202 c may be configured to capture all pages on a single website, as opposed to jumping from webpage to webpage.
  • the crawler 202 e is capable of capturing all pages on a single website
  • the crawler 202 c may have particularly configured parameters, such as, but not limited to, which directories to exclude, which special directories to include, directories and/or pages that the crawler 202 c should merely visit but not index, and special customizations in terms of what pattern of pages to include or exclude.
  • the crawler 202 c may be configured to exclude certain elements of a website, while including certain other elements of the website.
  • the crawler 202 c may be configured to execute crawls so as not to weigh down or hinder servers.
  • the crawler 202 c may be configured to be limited in the number of webpages it fetches during a predetermined amount of time. As a non-limiting example, the crawler 202 c may be configured to fetch no more than five pages per second.
  • the crawler 202 c may be configured to call the header of a page (e.g., HTTP HEAD request) before fetching the body of the page to retrieve information about the page to determine if the page contains content that should be downloaded or not.
  • the crawler 202 c may be configured to monitor a response time of a server.
  • the crawler 202 c may monitor how long it takes a server to send a response to a request, and if the response time drops below a predetermined threshold (e.g., five seconds), the crawler 202 c may be configured to stand by for a predetermined amount of time (e.g., 20 seconds) before resuming crawling. In still further embodiments, after waiting for the predetermined amount of time, the crawler 202 c may resume crawling at a slightly more cautious pace than previously exhibited. As a non-limiting example, the crawler 202 c may crawl at a rate of four pages per second, as opposed to a previous rate of five pages per second.
  • a predetermined threshold e.g., five seconds
  • the crawler 202 c may be configured to interact with any suitable indexing technology, such as, but not limited to, an open source software (e.g., Solr) for indexing of the content acquired by the crawler 202 c .
  • the indexing technology for use with the crawler 202 c may be configured to capture necessary fields, clean data, and/or perform statistical analyses on the acquired content.
  • the ensemble of algorithms 202 b may be a storage medium for storing a plurality of algorithms to be accessed by each of the other tools at the keyword tool 202 , such as, but not limited to, the tool 202 a , which may be connected to the ensemble of algorithms 202 b and which may operate in conjunction with the ensemble of algorithms 202 b .
  • the other tools of the keyword tool 202 may access the ensemble of algorithms 202 b and perform operations based on the instructions stored at the ensemble of algorithms 202 b.
  • the ensemble of algorithms 202 b takes as input a set or corpus of pseudo-relevant documents, such as content acquired by the crawler 202 c and provides an output of a set of related keyphrases or keywords (e.g., a set of semantic-relevance-scored related keyphrases or keywords).
  • a tool in generating a set of semantically relevant topics scored or ranked by relevance, may be configured to receive data corresponding to a corpus of pseudo-relevant documents (e.g., acquired by the crawler 202 c ), and may be instructed by the ensemble of algorithms 202 d to cleanse and normalize the received documents and information.
  • the crawler 202 c may be configured to lemmatize the documents as well.
  • the ensemble of algorithms 202 b may instruct all descriptive phrase n-grams in the corpus up to some length to be identified.
  • the length of the n-grams may be predetermined by a user or administrator. Thresh-holding may then be performed using some function of frequency and available memory.
  • phrases starting or ending with conjunctives or other stop words may be discarded.
  • the method includes building term frequency and inverse document frequency matrices to act as a shared resource for subsequent phrases. The most frequent morphological phrase forms may be re-allocated based upon a weight value assigned to the phrase.
  • a lemmatization-equivalent of unigrams or phrases may be used to group.
  • the method includes estimating the bayesian prior of phrases. This may be done optionally as an adjunct to the term frequency matrix.
  • a linear combination of rarity-ranked bayesian priors of the constituent unigrams of the phrase is used.
  • the ensemble of algorithms 202 b includes instructions for applying an ensemble of a plurality of different classes of algorithms such as four constituent classes of algorithm.
  • term-ranking functions TRFs
  • a core phrase TRF may be performed (e.g., Robertson selection value and Kullback-Leibler distance).
  • a tail phrase TRF may be performed (e.g., Rocchio's weights, chi-square, and binary independence model).
  • a hyperdictionary graph traversal algorithm may be performed (e.g. TextRank).
  • a semantic knowledgebases path traversal score based on terseness of semantic path and some curated predicate weights may be generated (e.g., using ConceptNet or Yago knowledgebases).
  • Each constituent class of algorithm may itself be an ensemble of algorithms under that class, such that the ensemble of algorithms comprises a plurality of ensembles.
  • the ensemble method may be a weighted combination of the sets of scored phrases from each of the above constituents (e.g. a linear combination, or a linear combination of some normalization of the constituents).
  • the weights may be tuned by how tail-oriented the desired output is (e.g. where tail TRFs' output are weighed more heavily if more tail-like concepts are desired).
  • the weights may be tuned by how much n-grams of various n are desired (e.g. by multiplication by a function on desired lengths and phrase length under consideration).
  • unigrams with high bayesian priors are then reduced in the distribution's weights by some function of the unigram's prior.
  • the output of this method may be a set of semantic-relevance-scored phrases.
  • a key phrase of one phrase is a keyword.
  • a keyword is a single phrase is a key phrase.
  • a plurality of keywords is a key phrase.
  • the screen shot may represent a keyword user interface (UI) 210 to be presented to a user at the client device 102 for selecting parameters for a keyword search.
  • UI keyword user interface
  • the UI 210 and the keyword tool 202 are compatible for use with Twitter.
  • the UI 210 and the keyword tool 202 may be compatible for use with one or more other social media websites, such as, but not limited to, Facebook, Instagram, MySpace, and the like.
  • the tool 202 a may receive the entered information via the network 104 to perform one or more operations with respect to the user entries. The functions of the tool 202 a are discussed further below.
  • the keyword UI 210 may include an input portion including an input field 213 having a keyword field 213 a for receiving and entering one or more user-entered keywords and an import field 213 b for importing keywords into the UI 210 .
  • the one or more keywords may be a keyword, topic, or phrase that a user wishes to have analyzed by the tool 202 a .
  • a user may enter a keyword or phrase via the keyword field 213 a or may import multiple keywords via the import field 213 b .
  • a user may select a file (e.g., a file locally stored on the client 102 ) via the UI 210 and upload the file to the UI 210 .
  • the UI 210 (e.g., via the tool 202 a ) may read the file and populate one or more keywords into the keyword field 213 a in accordance with the uploaded file.
  • the keywords that a user entered or imported via the input field 213 is displayed directly below the keyword field 213 a and the import field 213 b .
  • the UI 210 may include a parameters display 212 for indicating the parameters of the operation.
  • parameters of the operation provide the rules by which the analysis of the one or more keywords will abide, and a summary of these parameters may be displayed at 212 .
  • parameters may include, but not be limited to, geographic locations, gender of social media users, language of the users, social media website(s) to be searched, type of advertisements shown to the social media users, level of activity of the users, and/or the like.
  • the keyword UI 210 may also include an output portion including an activity indicator 211 and a suggestions box 214 .
  • the activity indicator 211 may display a volume of activity associated with the entered keywords.
  • the activity indicator 211 may indicate a recent global daily volume of instances of social media user activity (e.g., tweets on Twitter) containing each keyword.
  • the suggestions box 214 may indicate recommended keywords, based on the input keywords, that a user may wish to add to a targeted advertising campaign to reach more quality customers.
  • the suggestions box 214 may display a preview of the recommended keywords, and may have the option of expanding the list so that a user may review the full list of recommended keywords.
  • the keyword UI 210 may include any suitable variation of the layout illustrated in FIG. 2B , or even different layouts.
  • the user input interface may take on other suitable formats, for example, but not limited to, a list for user selection and/or the like.
  • the keyword field 213 may auto-populate the one or more keywords as a user types into the keyword field 213 .
  • a list of keywords may be stored in the database 202 d , for example, to auto-populate the one or more keywords.
  • the keyword recommendations may be displayed in other suitable formats, for example, but not limited to, a chart, a graph, etc., or combinations thereof.
  • the tool 202 a may identify keywords that are relevant to the entered one or more keywords for purposes of targeted advertising or marketing, including paid searches.
  • a user of keyword UI 210 may enter into the keyword field 211 “keyword research” and “seo keyword research.” The user may wish to find recommended keywords for supplementing or replacing the input keywords for paid searches.
  • the tool 202 a after undergoing its operations with respect to the input keywords, may output a list in the form of suggestions box 212 of recommended keywords for use in paid searches.
  • the recommended keywords include “search engine optimization” and “methodology,” which are related to the input keywords. Accordingly, a user may review and interpret the information embodied in suggestions box 214 and may identify further keywords from the recommended keywords to target for paid searches to increase traffic to the user's website via the paid searches.
  • the entered information may be transmitted to the keyword research and analysis tool 202 via the network 104 , at which one or more operations may be performed corresponding to the entered keywords.
  • the submitted keywords are received by the crawler 202 c .
  • the crawler 202 c may then acquire content from various web sources to build a corpus of collected content that is relevant to the input keywords.
  • the crawler 202 c may parse Twitter, or any other social networking site, to search for relevant terms or hash tags used by the users of the social media sites. Further details regarding the operation of the crawler 202 c are described above.
  • a knowledge graph 229 may be generated by applying, by the tool 202 a , the ensemble of algorithms 202 b to the content acquired by the crawler 202 c to identify and organize a set of semantically relevant topics or keywords from the acquired content.
  • the knowledge graph is a knowledge base that organizes information gathered from the various web sources to provide structured and detailed information regarding each of the user-entered one or more keywords.
  • the knowledge graph may be built by the tool 202 a based on the semantically relevant topics scored by relevance.
  • the knowledge graph may be a representation and ranking of the semantically relevant topics or keywords.
  • the knowledge graph includes degrees of conceptual relevance to the initially-entered keywords, each of which serve as the seed terms from which the knowledge graph is built. For example, one of the input keywords is “keyword research,” which may be placed at a first level (or seed level) of the knowledge graph.
  • the tool 202 a may be configured to populate lower levels of the knowledge graph branching out from the seed level. For example, if some topics or keywords are identified as having relatively high relevance, based on the acquired content, and as determined using the above-described algorithms, to the input keyword “keyword research,” those highly relevant keywords may be placed closest to the seed term on the knowledge graph (e.g., at the second level of the knowledge graph, that is, one level beneath the seed keyword). The tool 202 a may continue placing keywords at different levels of the knowledge graph based on the keywords' degree of relevance to the seed keyword. As such, a hierarchy of relevant keywords with respect to the input keywords may be built in the form of the knowledge graph. In some embodiments, the step of generating a knowledge graph may be optional, and the tool 202 a may directly output the recommended keyword results to the user based on the results of application of the ensemble of algorithms to the acquired content, and skip the building of any knowledge graph.
  • keyword suggestions may be served to the user by the tool 202 a and displayed at the UI 210 via suggestions box 214 .
  • the keyword suggestions are served based directly on the results of the tool's application of the ensemble of algorithms to the acquired content, rather than based on the knowledge graph.
  • the tool 202 a outputs to the UI 210 all of the keywords of the knowledge graph.
  • the tool 202 a may utilize relevancy thresholds in determining whether or not to output a keyword as a recommended keyword.
  • the tool 202 a may include a predetermined relevancy threshold, and if a relevance score (as determined by the tool 202 a ) of a given keyword is below the relevancy threshold, then the tool 202 a will not output that keyword as a suggested keyword.
  • the relevancy threshold corresponds to locations on the knowledge graph (e.g., the levels or tiers of the knowledge graph).
  • the recommended keywords page 220 may include a first page 221 and a second page 222 .
  • the recommended keywords page 220 may include a plurality of recommended keywords 223 that are generated and output by the topic tool 202 a .
  • the suggestions box 214 may display a select few of the suggested keywords returned by the topic tool 202 a .
  • the keyword UI 210 may further provide the expanded recommended keywords list via the recommended keywords page 220 .
  • the recommended keywords page 220 is accessed via the suggestions box 214 (e.g., by clicking on the suggestions box 214 ).
  • the list of recommended keywords 223 may be ranked in the order of their relevance, from more relevant to less relevant.
  • the recommended keywords page 220 may further provide an add keyword interface 224 for allowing a user to add one or more of the recommended keywords 223 to the input field 213 or to their paid search keyword list. Accordingly, the recommended keywords 223 allow a user to supplement or replace the input keywords for purposes of paid searches and targeted social media advertising.
  • FIG. 2D illustrated is another embodiment of a screen shot of a tool page.
  • numerous keywords are entered into the input field 213 , including “#digitalmarketing,” “authorship,” “adwords,” etc.
  • the topic tool 202 a returns recommended keywords via the suggestions box 214 .
  • the recommended keywords include “engine optimization” and “#socialmediamarketing.”
  • FIG. 2E illustrated is yet another embodiment of a screen shot of a tool page.
  • numerous keywords are entered into the input field 213 , including “free seo,” “paid search,” “seo tools,” etc.
  • the topic tool 202 a returns recommended keywords via the suggestions box 214 .
  • the recommended keywords include “search engine marketing” and “seotools.”
  • a method 300 for generating from one or more keywords a list of recommended keywords for using in paid search advertising includes identifying, via a tool, one or more keywords to be used in a paid search advertising campaign at an identified website.
  • the method 300 includes acquiring, by a crawler, content from a plurality of different web content sources via one or more networks.
  • the method 300 includes applying, by the tool, to the acquired content an ensemble of algorithms to identify a set of semantically relevant keywords ranked by a relevance score.
  • the method 300 includes generating, by the tool, from the set of semantically relevant keywords, a knowledge graph of recommended keywords to replace or supplement the one or more keywords.
  • the method 300 includes outputting, by the tool based at least partially on the knowledge graph, an enumerated list of recommended keywords to one of replace or supplement the one or more keywords to be used for the paid search advertising campaign at the identified website.
  • step 301 may be performed by the tool 202 a .
  • the tool 202 a may receive the one or more keywords from a user interface at a client 102 and via the network 104 .
  • the user interface may correspond to UI 210 .
  • the one or more keywords may be various words or phrases that a user wishes to acquire related keywords for.
  • step 302 may be performed by the crawler 202 c .
  • the crawler 202 c may be configured to acquire content from various sources, such as, but not limited to, websites, blogs, articles, social networking sites, and/or the like.
  • the acquired content may relate to the one or more keywords.
  • step 303 may be performed by the tool 202 a , which may receive the content acquired from the crawler 202 ac
  • the tool 202 a may access and work in conjunction with the ensemble algorithm 202 b in applying the algorithms to the acquired content.
  • the ensemble of algorithms 202 b may include one or more key phrase extraction algorithms, one or more graph analyses algorithms, and one or more natural language processing algorithms.
  • step 304 may be performed by the tool 202 a after the tool 202 a identifies the set of semantically relevant keywords ranked by relevance score.
  • the ranking of the keywords may be embodied by the knowledge graph.
  • the generation of the knowledge graph is an optional step and may be skipped.
  • step 305 may be performed by the tool 202 a .
  • the tool 202 a may output the list of recommended keywords to the client 102 via the network 104 .
  • the list of recommended keywords may be displayed at the UI at the client 102 .
  • the list of recommended keywords may be organized based on corresponding relevance scores associated with each recommended keyword.
  • the recommended keywords may be incorporated into a paid search campaign such as by submission to a paid search tool or interface, such as Google AdWords for example.
  • the tool may be designed and constructed to facilitate the use of any one or more recommended keywords in paid advertising tool via any type and form of application programming interface to the paid advertising tool.
  • the user may use the recommended keyword list to manually configure the paid search campaign in the paid advertising tool. The user may use the systems and methods herein multiple times to replace and/or supplement keywords in the paid search campaign until the desired results or acceptable results are achieved.

Abstract

According to various embodiments, a method for generating from one or more keywords a list of recommended keywords for using in paid search advertising includes identifying, via a tool, one or more keywords to be used in a paid search advertising campaign at an identified website. The method may further include acquiring, by a crawler, content from a plurality of different web content sources via one or more networks. The method may also include applying, by the tool, to the acquired content an ensemble of one or more key phrase extraction algorithms, one or more graph analyses algorithms and one or more natural language processing algorithms to identify a set of semantically relevant keywords ranked by a relevance score. The method may further include generating, by the tool from the set of semantically relevant keywords, a knowledge graph of recommended keywords to replace or supplement the one of more keywords. The method may further include outputting, by the tool based at least partially on the knowledge graph, an enumerated list of recommended keywords to one of replace or supplement the one or more keywords to be used for the paid search advertising campaign at the identified website.

Description

CROSS-REFERENCES TO RELATED APPLICATIONS
The present application claims priority to and the benefit of U.S. Provisional Application No. 62/073,560, titled “Systems And Methods For Semantic Keyword Analysis For Organic Search” and filed on Oct. 31, 2014, which is incorporated by reference herein in its entirety for all purposes.
FIELD OF THE DISCLOSURE
This disclosure generally relates to systems and methods for keyword research and analysis, and in particular, to keyword research and analysis with respect to paid search optimization.
BACKGROUND OF THE DISCLOSURE
In efforts to increase advertising visibility and traffic to web pages (e.g., blogs, news sites, shopping sites, etc.) associated with the advertising, owners of web pages may engage in search engine optimization (SEO) corresponding to paid searches. Search engine optimization entails considerations of how search engines work, what people search for, how people search (e.g., what terms people use to search for various topics), and the like. In addition, owners of web pages may also attempt to optimize the effectiveness of targeting advertisements towards users of social media (e.g., optimizing targeting based on keywords used by users of social media). As an example of one method of optimization of paid searches, owners of websites may attempt to manually research keywords typically related to topics of their websites, and attempt to utilize these manually researched keywords in their targeted internet advertising. However, such methods may be cumbersome, time-consuming, and provide minimal beneficial effect for web visibility.
BRIEF SUMMARY OF THE DISCLOSURE
The present solution provides a new tool for keyword research and analysis for optimization of paid searches. Various embodiments of the tool provide an efficient and user-friendly mechanism for identifying recommended keywords that may replace or supplement existing keywords associated with a user's paid searches.
According to various embodiments, a method for generating from one or more keywords a list of recommended keywords for using in paid search advertising includes identifying, via a tool, one or more keywords to be used in a paid search advertising campaign at an identified website. The method may further include acquiring, by a crawler, content from a plurality of different web content sources via one or more networks. The method may also include applying, by the tool, to the acquired content an ensemble of one or more key phrase extraction algorithms, one or more graph analyses algorithms and one or more natural language processing algorithms to identify a set of semantically relevant keywords ranked by a relevance score. The method may further include generating, by the tool from the set of semantically relevant keywords, a knowledge graph of recommended keywords to replace or supplement the one of more keywords. The method may further include outputting, by the tool based at least partially on the knowledge graph, an enumerated list of recommended keywords to one of replace or supplement the one or more keywords to be used for the paid search advertising campaign at the identified website.
In some embodiments, the method further includes receiving, by the tool, the one or more keywords from the identified website.
In some embodiments, the method further includes acquiring, by the crawler, content from the identified website.
In some embodiments, the method further includes applying, by the tool, the ensemble to the content from the identified website.
In some embodiments, the method further includes acquiring content, by a crawler, from the plurality of different web content sources including web sites, news articles, blog posts and keyword data.
In some embodiments, the one or more key phrase extraction algorithms includes a Bayesian statistical ensemble.
In some embodiments, the method further includes outputting, by the tool, the enumerated list of recommend keywords ranked by at least one of an attractiveness score, a volume score and a competition score.
In some embodiments, the method further includes outputting, by the tool, the enumerated list of recommend keywords ranked by at least one of a relevance score.
In some embodiments, the method further includes outputting, by the tool, the enumerated list of recommend keywords ranked by a cost-per-click value.
In some embodiments, the one or more key phrase extraction algorithms include a Bayesian statistical ensemble and the ensemble performs a plurality of term ranking functions are performed including one or more of a core phrase term ranking function, a tail phrase term ranking function, a hyperdictionary graph traversal algorithm and/or a semantic knowledgebase path traversal score.
According to various embodiments, a system for generating from one or more keywords a list of recommended keywords for using in paid search advertising includes a tool configured to execute on a processor to receive an input of one or more keywords for which to generate a list of recommended keywords to use via paid search advertising at an identified website and one or more targeting attributes for targeting the paid search advertising at the identified website. The system may further include a crawler configured to acquire content from the identified website based at least in part on the one or more target attributes. The tool may be configured to apply to the acquired content an ensemble of one or more key phrase extraction algorithms, one or more graph analyses algorithms and one or more natural language processing algorithms to identify a set of semantically relevant keywords ranked by a relevance score. The tool may be configured to generate from the set of semantically relevant keywords, a knowledge graph of recommended keywords to replace or supplement the one of more keywords. The tool may further be configured to output, based at least partially on the knowledge graph, an enumerated list of recommended keywords to one of replace or supplement the one or more keywords to be used for the paid search advertising campaign at the identified website.
In some embodiments, the tool is further configured to receive the one or more keywords from the identified website.
In some embodiments, the crawler is further configured to acquire content from the identified website.
In some embodiments, the tool is further configured to apply the ensemble to the content from the identified website.
In some embodiments, the crawler is further configured to acquire from the plurality of different web content sources including web sites, news articles, blog posts and keyword data.
In some embodiments, the one or more key phrase extraction algorithms includes a Bayesian statistical ensemble.
In some embodiments, the tool is further configured to output the enumerated list of recommend keywords ranked by at least one of an attractiveness score, a volume score and a competition score.
In some embodiments, the tool is further configured to output the enumerated list of recommend keywords ranked by at least one of a relevance score.
In some embodiments, the tool is further configured to output the enumerated list of recommend keywords ranked by a cost-per-click value.
In some embodiments, the ensemble is further configured to perform a plurality of term ranking functions including one or more of a core phrase term ranking function, a tail phrase term ranking function, a hyperdictionary graph traversal algorithm and/or a semantic knowledgebase path traversal score.
BRIEF DESCRIPTION OF THE DRAWINGS
The foregoing and other objects, aspects, features, and advantages of the disclosure will become more apparent and better understood by referring to the following description taken in conjunction with the accompanying drawings, in which:
FIG. 1A is a block diagram depicting an embodiment of a network environment comprising client device in communication with server device;
FIG. 1B is a block diagram depicting a cloud computing environment comprising client device in communication with cloud service providers;
FIGS. 1C and 1D are block diagrams depicting embodiments of computing devices useful in connection with the methods and systems described herein.
FIG. 2A is an embodiment of a system comprising a keyword research and analysis tool;
FIG. 2B is an embodiment of a screen shot of a tool page;
FIG. 2C is an embodiment of a screen shot of a recommended keywords page;
FIG. 2D is another embodiment of a screen shot of the tool page;
FIG. 2E is yet another embodiment of a screen shot of the tool page; and
FIG. 3 is a flow diagram depicting an embodiment of a method of using the tool.
DETAILED DESCRIPTION
For purposes of reading the description of the various embodiments below, the following descriptions of the sections of the specification and their respective contents may be helpful:
Section A describes a network environment and computing environment which may be useful for practicing embodiments described herein.
Section B describes embodiments of systems and methods for a keyword research and analysis tool.
Section C describes embodiments of systems and methods for a crawler.
Section D describes embodiments of a storage medium including an ensemble of algorithms.
Section E describes embodiments of systems and methods for a tool.
A. Computing and Network Environment
Prior to discussing specific embodiments of the present solution, it may be helpful to describe aspects of the operating environment as well as associated system components (e.g., hardware elements) in connection with the methods and systems described herein. Referring to FIG. 1A, an embodiment of a network environment is depicted. In brief overview, the network environment includes one or more clients 102 a-102 n (also generally referred to as local machine(s) 102, client(s) 102, client node(s) 102, client machine(s) 102, client computer(s) 102, client device(s) 102, endpoint(s) 102, or endpoint node(s) 102) in communication with one or more servers 106 a-106 n (also generally referred to as server(s) 106, node 106, or remote machine(s) 106) via one or more networks 104. In some embodiments, a client 102 has the capacity to function as both a client node seeking access to resources provided by a server and as a server providing access to hosted resources for other clients 102 a-102 n.
Although FIG. 1A shows a network 104 between the clients 102 and the servers 106, the clients 102 and the servers 106 may be on the same network 104. In some embodiments, there are multiple networks 104 between the clients 102 and the servers 106. In one of these embodiments, a network 104′ (not shown) may be a private network and a network 104 may be a public network. In another of these embodiments, a network 104 may be a private network and a network 104′ a public network. In still another of these embodiments, networks 104 and 104′ may both be private networks.
The network 104 may be connected via wired or wireless links. Wired links may include Digital Subscriber Line (DSL), coaxial cable lines, or optical fiber lines. The wireless links may include BLUETOOTH, Wi-Fi, Worldwide Interoperability for Microwave Access (WiMAX), an infrared channel or satellite band. The wireless links may also include any cellular network standards used to communicate among mobile devices, including standards that qualify as 1G, 2G, 3G, or 4G. The network standards may qualify as one or more generation of mobile telecommunication standards by fulfilling a specification or standards such as the specifications maintained by International Telecommunication Union. The 3G standards, for example, may correspond to the International Mobile Telecommunications-2000 (IMT-2000) specification, and the 4G standards may correspond to the International Mobile Telecommunications Advanced (IMT-Advanced) specification. Examples of cellular network standards include AMPS, GSM, GPRS, UMTS, LTE, LTE Advanced, Mobile WiMAX, and WiMAX-Advanced. Cellular network standards may use various channel access methods e.g. FDMA, TDMA, CDMA, or SDMA. In some embodiments, different types of data may be transmitted via different links and standards. In other embodiments, the same types of data may be transmitted via different links and standards.
The network 104 may be any type and/or form of network. The geographical scope of the network 104 may vary widely and the network 104 can be a body area network (BAN), a personal area network (PAN), a local-area network (LAN), e.g. Intranet, a metropolitan area network (MAN), a wide area network (WAN), or the Internet. The topology of the network 104 may be of any form and may include, e.g., any of the following: point-to-point, bus, star, ring, mesh, or tree. The network 104 may be an overlay network which is virtual and sits on top of one or more layers of other networks 104′. The network 104 may be of any such network topology as known to those ordinarily skilled in the art capable of supporting the operations described herein. The network 104 may utilize different techniques and layers or stacks of protocols, including, e.g., the Ethernet protocol, the internet protocol suite (TCP/IP), the ATM (Asynchronous Transfer Mode) technique, the SONET (Synchronous Optical Networking) protocol, or the SDH (Synchronous Digital Hierarchy) protocol. The TCP/IP internet protocol suite may include application layer, transport layer, internet layer (including, e.g., IPv6), or the link layer. The network 104 may be a type of a broadcast network, a telecommunications network, a data communication network, or a computer network.
In some embodiments, the system may include multiple, logically-grouped servers 106. In one of these embodiments, the logical group of servers may be referred to as a server farm 38 or a machine farm 38. In another of these embodiments, the servers 106 may be geographically dispersed. In other embodiments, a machine farm 38 may be administered as a single entity. In still other embodiments, the machine farm 38 includes a plurality of machine farms 38. The servers 106 within each machine farm 38 can be heterogeneous—one or more of the servers 106 or machines 106 can operate according to one type of operating system platform (e.g., WINDOWS NT, manufactured by Microsoft Corp. of Redmond, Wash.), while one or more of the other servers 106 can operate on according to another type of operating system platform (e.g., Unix, Linux, or Mac OS X).
In one embodiment, servers 106 in the machine farm 38 may be stored in high-density rack systems, along with associated storage systems, and located in an enterprise data center. In this embodiment, consolidating the servers 106 in this way may improve system manageability, data security, the physical security of the system, and system performance by locating servers 106 and high performance storage systems on localized high performance networks. Centralizing the servers 106 and storage systems and coupling them with advanced system management tools allows more efficient use of server resources.
The servers 106 of each machine farm 38 do not need to be physically proximate to another server 106 in the same machine farm 38. Thus, the group of servers 106 logically grouped as a machine farm 38 may be interconnected using a wide-area network (WAN) connection or a metropolitan-area network (MAN) connection. For example, a machine farm 38 may include servers 106 physically located in different continents or different regions of a continent, country, state, city, campus, or room. Data transmission speeds between servers 106 in the machine farm 38 can be increased if the servers 106 are connected using a local-area network (LAN) connection or some form of direct connection. Additionally, a heterogeneous machine farm 38 may include one or more servers 106 operating according to a type of operating system, while one or more other servers 106 execute one or more types of hypervisors rather than operating systems. In these embodiments, hypervisors may be used to emulate virtual hardware, partition physical hardware, virtualize physical hardware, and execute virtual machines that provide access to computing environments, allowing multiple operating systems to run concurrently on a host computer. Native hypervisors may run directly on the host computer. Hypervisors may include VMware ESX/ESXi, manufactured by VMWare, Inc., of Palo Alto, Calif.; the Xen hypervisor, an open source product whose development is overseen by Citrix Systems, Inc.; the HYPER-V hypervisors provided by Microsoft or others. Hosted hypervisors may run within an operating system on a second software level. Examples of hosted hypervisors may include VMware Workstation and VIRTUALBOX.
Management of the machine farm 38 may be de-centralized. For example, one or more servers 106 may comprise components, subsystems and modules to support one or more management services for the machine farm 38. In one of these embodiments, one or more servers 106 provide functionality for management of dynamic data, including techniques for handling failover, data replication, and increasing the robustness of the machine farm 38. Each server 106 may communicate with a persistent store and, in some embodiments, with a dynamic store.
Server 106 may be a file server, application server, web server, proxy server, appliance, network appliance, gateway, gateway server, virtualization server, deployment server, SSL VPN server, or firewall. In one embodiment, the server 106 may be referred to as a remote machine or a node. In another embodiment, a plurality of nodes 290 may be in the path between any two communicating servers.
Referring to FIG. 1B, a cloud computing environment is depicted. A cloud computing environment may provide client 102 with one or more resources provided by a network environment. The cloud computing environment may include one or more clients 102 a-102 n, in communication with the cloud 108 over one or more networks 104. Clients 102 may include, e.g., thick clients, thin clients, and zero clients. A thick client may provide at least some functionality even when disconnected from the cloud 108 or servers 106. A thin client or a zero client may depend on the connection to the cloud 108 or server 106 to provide functionality. A zero client may depend on the cloud 108 or other networks 104 or servers 106 to retrieve operating system data for the client device. The cloud 108 may include back end platforms, e.g., servers 106, storage, server farms or data centers.
The cloud 108 may be public, private, or hybrid. Public clouds may include public servers 106 that are maintained by third parties to the clients 102 or the owners of the clients. The servers 106 may be located off-site in remote geographical locations as disclosed above or otherwise. Public clouds may be connected to the servers 106 over a public network. Private clouds may include private servers 106 that are physically maintained by clients 102 or owners of clients. Private clouds may be connected to the servers 106 over a private network 104. Hybrid clouds 108 may include both the private and public networks 104 and servers 106.
The cloud 108 may also include a cloud based delivery, e.g. Software as a Service (SaaS) 110, Platform as a Service (PaaS) 112, and Infrastructure as a Service (IaaS) 114. IaaS may refer to a user renting the use of infrastructure resources that are needed during a specified time period. IaaS providers may offer storage, networking, servers or virtualization resources from large pools, allowing the users to quickly scale up by accessing more resources as needed. Examples of IaaS include AMAZON WEB SERVICES provided by Amazon.com, Inc., of Seattle, Wash., RACKSPACE CLOUD provided by Rackspace US, Inc., of San Antonio, Tex., Google Compute Engine provided by Google Inc. of Mountain View, Calif., or RIGHTSCALE provided by RightScale, Inc., of Santa Barbara, Calif. PaaS providers may offer functionality provided by IaaS, including, e.g., storage, networking, servers or virtualization, as well as additional resources such as, e.g., the operating system, middleware, or runtime resources. Examples of PaaS include WINDOWS AZURE provided by Microsoft Corporation of Redmond, Wash., Google App Engine provided by Google Inc., and HEROKU provided by Heroku, Inc. of San Francisco, Calif. SaaS providers may offer the resources that PaaS provides, including storage, networking, servers, virtualization, operating system, middleware, or runtime resources. In some embodiments, SaaS providers may offer additional resources including, e.g., data and application resources. Examples of SaaS include GOOGLE APPS provided by Google Inc., SALESFORCE provided by Salesforce.com Inc. of San Francisco, Calif., or OFFICE 365 provided by Microsoft Corporation. Examples of SaaS may also include data storage providers, e.g. DROPBOX provided by Dropbox, Inc. of San Francisco, Calif., Microsoft SKYDRIVE provided by Microsoft Corporation, Google Drive provided by Google Inc., or Apple ICLOUD provided by Apple Inc. of Cupertino, Calif.
Clients 102 may access IaaS resources with one or more IaaS standards, including, e.g., Amazon Elastic Compute Cloud (EC2), Open Cloud Computing Interface (OCCI), Cloud Infrastructure Management Interface (CIMI), or OpenStack standards. Some IaaS standards may allow clients access to resources over HTTP, and may use Representational State Transfer (REST) protocol or Simple Object Access Protocol (SOAP). Clients 102 may access PaaS resources with different PaaS interfaces. Some PaaS interfaces use HTTP packages, standard Java APIs, JavaMail API, Java Data Objects (JDO), Java Persistence API (JPA), Python APIs, web integration APIs for different programming languages including, e.g., Rack for Ruby, WSGI for Python, or PSGI for Perl, or other APIs that may be built on REST, HTTP, XML, or other protocols. Clients 102 may access SaaS resources through the use of web-based user interfaces, provided by a web browser (e.g. GOOGLE CHROME, Microsoft INTERNET EXPLORER, or Mozilla Firefox provided by Mozilla Foundation of Mountain View, Calif.). Clients 102 may also access SaaS resources through smartphone or tablet applications, including, e.g., Salesforce Sales Cloud, or Google Drive app. Clients 102 may also access SaaS resources through the client operating system, including, e.g., Windows file system for DROPBOX.
In some embodiments, access to IaaS, PaaS, or SaaS resources may be authenticated. For example, a server or authentication server may authenticate a user via security certificates, HTTPS, or API keys. API keys may include various encryption standards such as, e.g., Advanced Encryption Standard (AES). Data resources may be sent over Transport Layer Security (TLS) or Secure Sockets Layer (SSL).
The client 102 and server 106 may be deployed as and/or executed on any type and form of computing device, e.g. a computer, network device or appliance capable of communicating on any type and form of network and performing the operations described herein. FIGS. 1C and 1D depict block diagrams of a computing device 100 useful for practicing an embodiment of the client 102 or a server 106. As shown in FIGS. 1C and 1D, each computing device 100 includes a central processing unit 121, and a main memory unit 122. As shown in FIG. 1C, a computing device 100 may include a storage device 128, an installation device 116, a network interface 118, an I/O controller 123, display devices 124 a-124 n, a keyboard 126 and a pointing device 127, e.g. a mouse. The storage device 128 may include, without limitation, an operating system, software, and a software of a tool for keyword research and analysis 120. As shown in FIG. 1D, each computing device 100 may also include additional optional elements, e.g. a memory port 103, a bridge 170, one or more input/output devices 130 a-130 n (generally referred to using reference numeral 130), and a cache memory 140 in communication with the central processing unit 121.
The central processing unit 121 is any logic circuitry that responds to and processes instructions fetched from the main memory unit 122. In many embodiments, the central processing unit 121 is provided by a microprocessor unit, e.g.: those manufactured by Intel Corporation of Mountain View, Calif.; those manufactured by Motorola Corporation of Schaumburg, Ill.; the ARM processor and TEGRA system on a chip (SoC) manufactured by Nvidia of Santa Clara, Calif.; the POWER7 processor, those manufactured by International Business Machines of White Plains, N.Y.; or those manufactured by Advanced Micro Devices of Sunnyvale, Calif. The computing device 100 may be based on any of these processors, or any other processor capable of operating as described herein. The central processing unit 121 may utilize instruction level parallelism, thread level parallelism, different levels of cache, and multi-core processors. A multi-core processor may include two or more processing units on a single computing component. Examples of a multi-core processors include the AMD PHENOM IIX2, INTEL CORE i5 and INTEL CORE i7.
Main memory unit 122 may include one or more memory chips capable of storing data and allowing any storage location to be directly accessed by the microprocessor 121. Main memory unit 122 may be volatile and faster than storage 128 memory. Main memory units 122 may be Dynamic random access memory (DRAM) or any variants, including static random access memory (SRAM), Burst SRAM or SynchBurst SRAM (BSRAM), Fast Page Mode DRAM (FPM DRAM), Enhanced DRAM (EDRAM), Extended Data Output RAM (EDO RAM), Extended Data Output DRAM (EDO DRAM), Burst Extended Data Output DRAM (BEDO DRAM), Single Data Rate Synchronous DRAM (SDR SDRAM), Double Data Rate SDRAM (DDR SDRAM), Direct Rambus DRAM (DRDRAM), or Extreme Data Rate DRAM (XDR DRAM). In some embodiments, the main memory 122 or the storage 128 may be non-volatile; e.g., non-volatile read access memory (NVRAM), flash memory non-volatile static RAM (nvSRAM), Ferroelectric RAM (FeRAM), Magnetoresistive RAM (MRAM), Phase-change memory (PRAM), conductive-bridging RAM (CBRAM), Silicon-Oxide-Nitride-Oxide-Silicon (SONOS), Resistive RAM (RRAM), Racetrack, Nano-RAM (NRAM), or Millipede memory. The main memory 122 may be based on any of the above described memory chips, or any other available memory chips capable of operating as described herein. In the embodiment shown in FIG. 1C, the processor 121 communicates with main memory 122 via a system bus 150 (described in more detail below). FIG. 1D depicts an embodiment of a computing device 100 in which the processor communicates directly with main memory 122 via a memory port 103. For example, in FIG. 1D the main memory 122 may be DRDRAM.
FIG. 1D depicts an embodiment in which the main processor 121 communicates directly with cache memory 140 via a secondary bus, sometimes referred to as a backside bus. In other embodiments, the main processor 121 communicates with cache memory 140 using the system bus 150. Cache memory 140 typically has a faster response time than main memory 122 and is typically provided by SRAM, BSRAM, or EDRAM. In the embodiment shown in FIG. 1D, the processor 121 communicates with various I/O devices 130 via a local system bus 150. Various buses may be used to connect the central processing unit 121 to any of the I/O devices 130, including a PCI bus, a PCI-X bus, or a PCI-Express bus, or a NuBus. For embodiments in which the I/O device is a video display 124, the processor 121 may use an Advanced Graphics Port (AGP) to communicate with the display 124 or the I/O controller 123 for the display 124. FIG. 1D depicts an embodiment of a computer 100 in which the main processor 121 communicates directly with I/O device 130 b or other processors 121′ via HYPERTRANSPORT, RAPIDIO, or INFINIBAND communications technology. FIG. 1D also depicts an embodiment in which local busses and direct communication are mixed: the processor 121 communicates with I/O device 130 a using a local interconnect bus while communicating with I/O device 130 b directly.
A wide variety of I/O devices 130 a-130 n may be present in the computing device 100. Input devices may include keyboards, mice, trackpads, trackballs, touchpads, touch mice, multi-touch touchpads and touch mice, microphones, multi-array microphones, drawing tablets, cameras, single-lens reflex camera (SLR), digital SLR (DSLR), CMOS sensors, accelerometers, infrared optical sensors, pressure sensors, magnetometer sensors, angular rate sensors, depth sensors, proximity sensors, ambient light sensors, gyroscopic sensors, or other sensors. Output devices may include video displays, graphical displays, speakers, headphones, inkjet printers, laser printers, and 3D printers.
Devices 130 a-130 n may include a combination of multiple input or output devices, including, e.g., Microsoft KINECT, Nintendo Wiimote for the WII, Nintendo WII U GAMEPAD, or Apple IPHONE. Some devices 130 a-130 n allow gesture recognition inputs through combining some of the inputs and outputs. Some devices 130 a-130 n provides for facial recognition which may be utilized as an input for different purposes including authentication and other commands. Some devices 130 a-130 n provides for voice recognition and inputs, including, e.g., Microsoft KINECT, SIRI for IPHONE by Apple, Google Now or Google Voice Search.
Additional devices 130 a-130 n have both input and output capabilities, including, e.g., haptic feedback devices, touchscreen displays, or multi-touch displays. Touchscreen, multi-touch displays, touchpads, touch mice, or other touch sensing devices may use different technologies to sense touch, including, e.g., capacitive, surface capacitive, projected capacitive touch (PCT), in-cell capacitive, resistive, infrared, waveguide, dispersive signal touch (DST), in-cell optical, surface acoustic wave (SAW), bending wave touch (BWT), or force-based sensing technologies. Some multi-touch devices may allow two or more contact points with the surface, allowing advanced functionality including, e.g., pinch, spread, rotate, scroll, or other gestures. Some touchscreen devices, including, e.g., Microsoft PIXELSENSE or Multi-Touch Collaboration Wall, may have larger surfaces, such as on a table-top or on a wall, and may also interact with other electronic devices. Some I/O devices 130 a-130 n, display devices 124 a-124 n or group of devices may be augment reality devices. The I/O devices may be controlled by an I/O controller 123 as shown in FIG. 1C. The I/O controller may control one or more I/O devices, such as, e.g., a keyboard 126 and a pointing device 127, e.g., a mouse or optical pen. Furthermore, an I/O device may also provide storage and/or an installation medium 116 for the computing device 100. In still other embodiments, the computing device 100 may provide USB connections (not shown) to receive handheld USB storage devices. In further embodiments, an I/O device 130 may be a bridge between the system bus 150 and an external communication bus, e.g. a USB bus, a SCSI bus, a FireWire bus, an Ethernet bus, a Gigabit Ethernet bus, a Fibre Channel bus, or a Thunderbolt bus.
In some embodiments, display devices 124 a-124 n may be connected to I/O controller 123. Display devices may include, e.g., liquid crystal displays (LCD), thin film transistor LCD (TFT-LCD), blue phase LCD, electronic papers (e-ink) displays, flexile displays, light emitting diode displays (LED), digital light processing (DLP) displays, liquid crystal on silicon (LCOS) displays, organic light-emitting diode (OLED) displays, active-matrix organic light-emitting diode (AMOLED) displays, liquid crystal laser displays, time-multiplexed optical shutter (TMOS) displays, or 3D displays. Examples of 3D displays may use, e.g. stereoscopy, polarization filters, active shutters, or autostereoscopy. Display devices 124 a-124 n may also be a head-mounted display (HMD). In some embodiments, display devices 124 a-124 n or the corresponding I/O controllers 123 may be controlled through or have hardware support for OPENGL or DIRECTX API or other graphics libraries.
In some embodiments, the computing device 100 may include or connect to multiple display devices 124 a-124 n, which each may be of the same or different type and/or form. As such, any of the I/O devices 130 a-130 n and/or the I/O controller 123 may include any type and/or form of suitable hardware, software, or combination of hardware and software to support, enable or provide for the connection and use of multiple display devices 124 a-124 n by the computing device 100. For example, the computing device 100 may include any type and/or form of video adapter, video card, driver, and/or library to interface, communicate, connect or otherwise use the display devices 124 a-124 n. In one embodiment, a video adapter may include multiple connectors to interface to multiple display devices 124 a-124 n. In other embodiments, the computing device 100 may include multiple video adapters, with each video adapter connected to one or more of the display devices 124 a-124 n. In some embodiments, any portion of the operating system of the computing device 100 may be configured for using multiple displays 124 a-124 n. In other embodiments, one or more of the display devices 124 a-124 n may be provided by one or more other computing devices 100 a or 100 b connected to the computing device 100, via the network 104. In some embodiments software may be designed and constructed to use another computer's display device as a second display device 124 a for the computing device 100. For example, in one embodiment, an Apple iPad may connect to a computing device 100 and use the display of the device 100 as an additional display screen that may be used as an extended desktop. One ordinarily skilled in the art will recognize and appreciate the various ways and embodiments that a computing device 100 may be configured to have multiple display devices 124 a-124 n.
Referring again to FIG. 1C, the computing device 100 may comprise a storage device 128 (e.g. one or more hard disk drives or redundant arrays of independent disks) for storing an operating system or other related software, and for storing application software programs such as any program related to the software 120 for the experiment tracker system. Examples of storage device 128 include, e.g., hard disk drive (HDD); optical drive including CD drive, DVD drive, or BLU-RAY drive; solid-state drive (SSD); USB flash drive; or any other device suitable for storing data. Some storage devices may include multiple volatile and non-volatile memories, including, e.g., solid state hybrid drives that combine hard disks with solid state cache. Some storage device 128 may be non-volatile, mutable, or read-only. Some storage device 128 may be internal and connect to the computing device 100 via a bus 150. Some storage device 128 may be external and connect to the computing device 100 via a I/O device 130 that provides an external bus. Some storage device 128 may connect to the computing device 100 via the network interface 118 over a network 104, including, e.g., the Remote Disk for MACBOOK AIR by Apple. Some client devices 100 may not require a non-volatile storage device 128 and may be thin clients or zero clients 102. Some storage device 128 may also be used as a installation device 116, and may be suitable for installing software and programs. Additionally, the operating system and the software can be run from a bootable medium, for example, a bootable CD, e.g. KNOPPIX, a bootable CD for GNU/Linux that is available as a GNU/Linux distribution from knoppix.net.
Client device 100 may also install software or application from an application distribution platform. Examples of application distribution platforms include the App Store for iOS provided by Apple, Inc., the Mac App Store provided by Apple, Inc., GOOGLE PLAY for Android OS provided by Google Inc., Chrome Webstore for CHROME OS provided by Google Inc., and Amazon Appstore for Android OS and KINDLE FIRE provided by Amazon.com, Inc. An application distribution platform may facilitate installation of software on a client device 102. An application distribution platform may include a repository of applications on a server 106 or a cloud 108, which the clients 102 a-102 n may access over a network 104. An application distribution platform may include application developed and provided by various developers. A user of a client device 102 may select, purchase and/or download an application via the application distribution platform.
Furthermore, the computing device 100 may include a network interface 118 to interface to the network 104 through a variety of connections including, but not limited to, standard telephone lines LAN or WAN links (e.g., 802.11, T1, T3, Gigabit Ethernet, Infiniband), broadband connections (e.g., ISDN, Frame Relay, ATM, Gigabit Ethernet, Ethernet-over-SONET, ADSL, VDSL, BPON, GPON, fiber optical including FiOS), wireless connections, or some combination of any or all of the above. Connections can be established using a variety of communication protocols (e.g., TCP/IP, Ethernet, ARCNET, SONET, SDH, Fiber Distributed Data Interface (FDDI), IEEE 802.11a/b/g/n/ac CDMA, GSM, WiMax and direct asynchronous connections). In one embodiment, the computing device 100 communicates with other computing devices 100′ via any type and/or form of gateway or tunneling protocol e.g. Secure Socket Layer (SSL) or Transport Layer Security (TLS), or the Citrix Gateway Protocol manufactured by Citrix Systems, Inc. of Ft. Lauderdale, Fla. The network interface 118 may comprise a built-in network adapter, network interface card, PCMCIA network card, EXPRESSCARD network card, card bus network adapter, wireless network adapter, USB network adapter, modem or any other device suitable for interfacing the computing device 100 to any type of network capable of communication and performing the operations described herein.
A computing device 100 of the sort depicted in FIGS. 1B and 1C may operate under the control of an operating system, which controls scheduling of tasks and access to system resources. The computing device 100 can be running any operating system such as any of the versions of the MICROSOFT WINDOWS operating systems, the different releases of the Unix and Linux operating systems, any version of the MAC OS for Macintosh computers, any embedded operating system, any real-time operating system, any open source operating system, any proprietary operating system, any operating systems for mobile computing devices, or any other operating system capable of running on the computing device and performing the operations described herein. Typical operating systems include, but are not limited to: WINDOWS 2000, WINDOWS Server 2012, WINDOWS CE, WINDOWS Phone, WINDOWS XP, WINDOWS VISTA, and WINDOWS 7, WINDOWS RT, and WINDOWS 8 all of which are manufactured by Microsoft Corporation of Redmond, Wash.; MAC OS and iOS, manufactured by Apple, Inc. of Cupertino, Calif.; and Linux, a freely-available operating system, e.g. Linux Mint distribution (“distro”) or Ubuntu, distributed by Canonical Ltd. of London, United Kingdom; or Unix or other Unix-like derivative operating systems; and Android, designed by Google, of Mountain View, Calif., among others. Some operating systems, including, e.g., the CHROME OS by Google, may be used on zero clients or thin clients, including, e.g., CHROMEBOOKS.
The computer system 100 can be any workstation, telephone, desktop computer, laptop or notebook computer, netbook, ULTRABOOK, tablet, server, handheld computer, mobile telephone, smartphone or other portable telecommunications device, media playing device, a gaming system, mobile computing device, or any other type and/or form of computing, telecommunications or media device that is capable of communication. The computer system 100 has sufficient processor power and memory capacity to perform the operations described herein. In some embodiments, the computing device 100 may have different processors, operating systems, and input devices consistent with the device. The Samsung GALAXY smartphones, e.g., operate under the control of Android operating system developed by Google, Inc. GALAXY smartphones receive input via a touch interface.
In some embodiments, the computing device 100 is a gaming system. For example, the computer system 100 may comprise a PLAYSTATION 3, or PERSONAL PLAYSTATION PORTABLE (PSP), or a PLAYSTATION VITA device manufactured by the Sony Corporation of Tokyo, Japan, a NINTENDO DS, NINTENDO 3DS, NINTENDO WII, or a NINTENDO WII U device manufactured by Nintendo Co., Ltd., of Kyoto, Japan, an XBOX 360 device manufactured by the Microsoft Corporation of Redmond, Wash.
In some embodiments, the computing device 100 is a digital audio player such as the Apple IPOD, IPOD Touch, and IPOD NANO lines of devices, manufactured by Apple Computer of Cupertino, Calif. Some digital audio players may have other functionality, including, e.g., a gaming system or any functionality made available by an application from a digital application distribution platform. For example, the IPOD Touch may access the Apple App Store. In some embodiments, the computing device 100 is a portable media player or digital audio player supporting file formats including, but not limited to, MP3, WAV, M4A/AAC, WMA Protected AAC, RIFF, Audible audiobook, Apple Lossless audio file formats and .mov, .m4v, and .mp4 MPEG-4 (H.264/MPEG-4 AVC) video file formats.
In some embodiments, the computing device 100 is a tablet e.g. the IPAD line of devices by Apple; GALAXY TAB family of devices by Samsung; or KINDLE FIRE, by Amazon.com, Inc. of Seattle, Wash. In other embodiments, the computing device 100 is a eBook reader, e.g. the KINDLE family of devices by Amazon.com, or NOOK family of devices by Barnes & Noble, Inc. of New York City, N.Y.
In some embodiments, the communications device 102 includes a combination of devices, e.g. a smartphone combined with a digital audio player or portable media player. For example, one of these embodiments is a smartphone, e.g. the IPHONE family of smartphones manufactured by Apple, Inc.; a Samsung GALAXY family of smartphones manufactured by Samsung, Inc; or a Motorola DROID family of smartphones. In yet another embodiment, the communications device 102 is a laptop or desktop computer equipped with a web browser and a microphone and speaker system, e.g. a telephony headset. In these embodiments, the communications devices 102 are web-enabled and can receive and initiate phone calls. In some embodiments, a laptop or desktop computer is also equipped with a webcam or other video capture device that enables video chat and video call.
In some embodiments, the status of one or more machines 102, 106 in the network 104 is monitored, generally as part of network management. In one of these embodiments, the status of a machine may include an identification of load information (e.g., the number of processes on the machine, CPU and memory utilization), of port information (e.g., the number of available communication ports and the port addresses), or of session status (e.g., the duration and type of processes, and whether a process is active or idle). In another of these embodiments, this information may be identified by a plurality of metrics, and the plurality of metrics can be applied at least in part towards decisions in load distribution, network traffic management, and network failure recovery as well as any aspects of operations of the present solution described herein. Aspects of the operating environments and components described above will become apparent in the context of the systems and methods disclosed herein.
B. Keyword Research and Analysis Tool
Systems and methods of the present solution are directed to performing keyword research and analysis to generate a list of topically-related and conceptually related keywords to a specific topic, for example, for purposes of search engine optimization (SEO) for paid searches.
One aspect of the present disclosure is directed to systems and methods for performing a keyword search to generate a list of topically-related and conceptually related keywords for use in a marketing or advertising campaign of a website. In some embodiments, a keyword tool can perform semantic keyword research and recommendations. In one aspect of the present disclosure, there is provided a system and method for performing semantic keyword research (e.g., semantic SEO keyword research), which may quickly and efficiently generate a list of topically-related recommended keywords. Unlike the traditional keyword planner and research tools, a tool may generate keywords that are conceptually related to the topic that a user is marketing. For example, for the term “dog food,” the tool can generate recommended keywords such as “pet food” and “doggy treats.” The keywords suggestions may be ranked by relevance scores, a measure of topical relevance, saving a user from hours of manual keyword research effort. In some embodiments, traditional applications include keyword research for prospecting and new clients, identifying new keywords for existing clients, and identifying relevant head terms for semantic SEO applications. In some embodiments, the systems and methods described herein can be used alongside a long-tail keyword tool (e.g., Google Keyword Planner) to identify new long-tail opportunities. Search engine marketing campaigns can be optimized with semantic machine learning.
In another aspect, the present disclosure is directed to system and methods for performing social advertising research. The keyword tool can identify high-value keyword buys on various social media sites such as Facebook, Twitter, LinkedIn, and other social media advertising. Unlike existing keyword research tools, the present keyword tool can generate suggestions that are topically-related, allowing advertisers to quickly identify the most relevant and best-performing terms to be used, for example, for Facebook Interest Group targeting, Twitter keyword/hashtag targeting, and any other display advertising system that utilizes keyword-based targeting.
Accordingly, aspects of the present disclosure are directed to a keyword tool that can generate a list of topically-related and conceptually related keywords for use in user-targeted marketing and advertising. In some embodiments, the keyword tool may be used to build keyword strategy for advertising. For example, the keyword search tool can generate a target keyword list with respect to paid searches and targeted advertising or identify keyword candidates.
Referring to FIG. 2A, an embodiment of a computer environment similar to that illustrated in FIG. 1A is depicted. In addition to the elements previously described in connection with FIG. 1A, a server 106 including a keyword research and analysis tool (“keyword tool”) 202 may also be connected to the network 104. Accordingly, the keyword tool 202 may be connected to the client(s) 102 a-102 n and the server(s) 106 a-106 n via the network 104. In addition, the server(s) 106 a-106 n may each be a content source 206A for providing content for research and analysis by the keyword tool 202. In various embodiments, the content may include data corresponding to websites, news articles, blog posts, social networking sites, keyword data, and/or any other suitable information for use with the keyword tool 202.
In the embodiment shown in FIG. 2A, the keyword tool 202 may include one or more components or modules for performing various functions corresponding to keyword research and analysis. In the embodiment of FIG. 2A, the keyword tool 202 includes a tool 202 a, an ensemble of algorithms 202 b, a crawler 202 c, and a database 202 d. Each of these components are described in further detail below. In other embodiments, the keyword tool 202 may include less or more components, depending on the desired functions to be implemented by the keyword tool 202. For example, one embodiment of the keyword tool 202 may include the tool 202 a and the ensemble of algorithms 202 b only.
C. Crawler
According to various embodiments, the crawler 202 c may be a tool for parsing and collecting relevant content over the web. In some embodiments, each of the other tools at the keyword tool 202, such as, but not limited to, the tool 202 a may be connected to the crawler 202 c and may operate in conjunction with the crawler 202 c. In other words, the other tools of the keyword tool 202 may receive the content acquired by the crawler 202 c and perform various operations on the content.
In some embodiments, the crawler 202 c at the keyword tool 202 may use one or more keywords to search the web for a plurality of different relevant web content sources. For example, the crawler 202 c may acquire content by crawling and searching the web for the one or more keywords and any content related to the keywords. In various embodiments, the crawler 202 c may be configured to acquire content from a variety of media, such as, but not limited to, websites, news articles, blog posts, keyword data (e.g., stored in the database 202 d of the keyword tool 202), internet forums, social networking sites, advertising sites, and/or the like. The content to be acquired by the crawler 202 c may be located at servers 106 a-106 n. The keyword tool 202 (e.g., the crawler 202 c) may analyze the acquired content to build a collection of sources (e.g., articles, blogs, social media, etc.) related to the input keywords. In some embodiments, the crawler 202 c may be configured to perform advanced semantic processing, such as, but not limited to, stemming and lemmatization.
In some embodiments, the database 202 d may include data acquired from one or more other sources (e.g., data acquired by the crawler 202 c). In some embodiments, the database 202 d may include keyword data acquired or obtained from a search engine. In some embodiments, the database 202 d may include keyword data acquired or obtained from a web site. In some embodiments, the database 202 d may include keyword data acquired or obtained from a social networking site. In some embodiments, the database 202 d may include keyword data acquired or obtained from a third party that aggregates data and provides data for use or purchase. Any of the systems and methods described herein, such as any of the algorithms may use the data in the database 202 d for any of the computations described herein, such as keyword recommendations.
According to some embodiments, the crawler 202 c may execute any suitable search software for crawling a given website. In particular embodiments, the crawler 202 c may be configured to crawl the web by jumping from webpage to webpage. In other embodiments, the crawler 202 c may be configured to capture all pages on a single website, as opposed to jumping from webpage to webpage. In such embodiments where the crawler 202 e is capable of capturing all pages on a single website, the crawler 202 c may have particularly configured parameters, such as, but not limited to, which directories to exclude, which special directories to include, directories and/or pages that the crawler 202 c should merely visit but not index, and special customizations in terms of what pattern of pages to include or exclude. In other words, the crawler 202 c may be configured to exclude certain elements of a website, while including certain other elements of the website.
In further embodiments, the crawler 202 c may be configured to execute crawls so as not to weigh down or hinder servers. In particular embodiments, the crawler 202 c may be configured to be limited in the number of webpages it fetches during a predetermined amount of time. As a non-limiting example, the crawler 202 c may be configured to fetch no more than five pages per second. In additional embodiments, the crawler 202 c may be configured to call the header of a page (e.g., HTTP HEAD request) before fetching the body of the page to retrieve information about the page to determine if the page contains content that should be downloaded or not. In yet further embodiments, the crawler 202 c may be configured to monitor a response time of a server. In other words, the crawler 202 c may monitor how long it takes a server to send a response to a request, and if the response time drops below a predetermined threshold (e.g., five seconds), the crawler 202 c may be configured to stand by for a predetermined amount of time (e.g., 20 seconds) before resuming crawling. In still further embodiments, after waiting for the predetermined amount of time, the crawler 202 c may resume crawling at a slightly more cautious pace than previously exhibited. As a non-limiting example, the crawler 202 c may crawl at a rate of four pages per second, as opposed to a previous rate of five pages per second. According to various embodiments, once the crawler 202 c has retrieved sources relevant to the focus keywords, the crawler 202 c may be configured to interact with any suitable indexing technology, such as, but not limited to, an open source software (e.g., Solr) for indexing of the content acquired by the crawler 202 c. In such embodiments, the indexing technology for use with the crawler 202 c may be configured to capture necessary fields, clean data, and/or perform statistical analyses on the acquired content.
D. Ensemble of Algorithms
According to various embodiments, the ensemble of algorithms 202 b may be a storage medium for storing a plurality of algorithms to be accessed by each of the other tools at the keyword tool 202, such as, but not limited to, the tool 202 a, which may be connected to the ensemble of algorithms 202 b and which may operate in conjunction with the ensemble of algorithms 202 b. In other words, the other tools of the keyword tool 202 may access the ensemble of algorithms 202 b and perform operations based on the instructions stored at the ensemble of algorithms 202 b.
In some embodiments, the ensemble of algorithms 202 b takes as input a set or corpus of pseudo-relevant documents, such as content acquired by the crawler 202 c and provides an output of a set of related keyphrases or keywords (e.g., a set of semantic-relevance-scored related keyphrases or keywords). Accordingly, in various embodiments, in generating a set of semantically relevant topics scored or ranked by relevance, a tool may be configured to receive data corresponding to a corpus of pseudo-relevant documents (e.g., acquired by the crawler 202 c), and may be instructed by the ensemble of algorithms 202 d to cleanse and normalize the received documents and information. In some embodiments, the crawler 202 c may be configured to lemmatize the documents as well. In some embodiments, the ensemble of algorithms 202 b may instruct all descriptive phrase n-grams in the corpus up to some length to be identified. In some embodiments, the length of the n-grams may be predetermined by a user or administrator. Thresh-holding may then be performed using some function of frequency and available memory. Next, in some embodiments, phrases starting or ending with conjunctives or other stop words may be discarded. In some embodiments, the method includes building term frequency and inverse document frequency matrices to act as a shared resource for subsequent phrases. The most frequent morphological phrase forms may be re-allocated based upon a weight value assigned to the phrase. In some embodiments, a lemmatization-equivalent of unigrams or phrases may be used to group. In further embodiments, the method includes estimating the bayesian prior of phrases. This may be done optionally as an adjunct to the term frequency matrix. In an embodiment, a linear combination of rarity-ranked bayesian priors of the constituent unigrams of the phrase is used.
In some embodiments, the ensemble of algorithms 202 b includes instructions for applying an ensemble of a plurality of different classes of algorithms such as four constituent classes of algorithm. For example, term-ranking functions (TRFs) may be performed based on analysis of phrase distribution in corpus and the estimated phrase priors. First, a core phrase TRF may be performed (e.g., Robertson selection value and Kullback-Leibler distance). Second, a tail phrase TRF may be performed (e.g., Rocchio's weights, chi-square, and binary independence model). Third, a hyperdictionary graph traversal algorithm may be performed (e.g. TextRank). Fourth, a semantic knowledgebases path traversal score based on terseness of semantic path and some curated predicate weights may be generated (e.g., using ConceptNet or Yago knowledgebases). Each constituent class of algorithm may itself be an ensemble of algorithms under that class, such that the ensemble of algorithms comprises a plurality of ensembles.
In some embodiments, the ensemble method may be a weighted combination of the sets of scored phrases from each of the above constituents (e.g. a linear combination, or a linear combination of some normalization of the constituents). The weights may be tuned by how tail-oriented the desired output is (e.g. where tail TRFs' output are weighed more heavily if more tail-like concepts are desired). In other embodiments, the weights may be tuned by how much n-grams of various n are desired (e.g. by multiplication by a function on desired lengths and phrase length under consideration). In some embodiments, unigrams with high bayesian priors are then reduced in the distribution's weights by some function of the unigram's prior. As such, the output of this method may be a set of semantic-relevance-scored phrases.
Although, generally, at times, the word keyword may be used to described one or more keywords to be used by the systems and methods, key phrases may be used interchangeably with keywords. In some embodiments, a key phrase of one phrase is a keyword. In some embodiments, a keyword is a single phrase is a key phrase. As such, in some embodiments, a plurality of keywords is a key phrase.
E. Tool
Referring to FIG. 2B, illustrated is an embodiment of a screen shot of a topic tool page. In various embodiments, the screen shot may represent a keyword user interface (UI) 210 to be presented to a user at the client device 102 for selecting parameters for a keyword search. In the embodiments shown in FIG. 2B, the UI 210 and the keyword tool 202 are compatible for use with Twitter. In other embodiments, the UI 210 and the keyword tool 202 may be compatible for use with one or more other social media websites, such as, but not limited to, Facebook, Instagram, MySpace, and the like. After a user populates entries into the keyword UI 210 and submits the entries, the tool 202 a may receive the entered information via the network 104 to perform one or more operations with respect to the user entries. The functions of the tool 202 a are discussed further below.
According to the embodiment shown in FIG. 2B, the keyword UI 210 may include an input portion including an input field 213 having a keyword field 213 a for receiving and entering one or more user-entered keywords and an import field 213 b for importing keywords into the UI 210. The one or more keywords may be a keyword, topic, or phrase that a user wishes to have analyzed by the tool 202 a. In some embodiments, a user may enter a keyword or phrase via the keyword field 213 a or may import multiple keywords via the import field 213 b. In embodiments where a user chooses to import multiple keywords, a user may select a file (e.g., a file locally stored on the client 102) via the UI 210 and upload the file to the UI 210. In such embodiments, the UI 210 (e.g., via the tool 202 a) may read the file and populate one or more keywords into the keyword field 213 a in accordance with the uploaded file. In the embodiment shown in FIG. 2B, the keywords that a user entered or imported via the input field 213 is displayed directly below the keyword field 213 a and the import field 213 b. In some embodiments, the UI 210 may include a parameters display 212 for indicating the parameters of the operation. The parameters of the operation provide the rules by which the analysis of the one or more keywords will abide, and a summary of these parameters may be displayed at 212. In some embodiments, parameters may include, but not be limited to, geographic locations, gender of social media users, language of the users, social media website(s) to be searched, type of advertisements shown to the social media users, level of activity of the users, and/or the like.
In addition, the keyword UI 210 may also include an output portion including an activity indicator 211 and a suggestions box 214. The activity indicator 211 may display a volume of activity associated with the entered keywords. For example, the activity indicator 211 may indicate a recent global daily volume of instances of social media user activity (e.g., tweets on Twitter) containing each keyword. The suggestions box 214 may indicate recommended keywords, based on the input keywords, that a user may wish to add to a targeted advertising campaign to reach more quality customers. The suggestions box 214 may display a preview of the recommended keywords, and may have the option of expanding the list so that a user may review the full list of recommended keywords.
In other embodiments, the keyword UI 210 may include any suitable variation of the layout illustrated in FIG. 2B, or even different layouts. In alternative embodiments, the user input interface may take on other suitable formats, for example, but not limited to, a list for user selection and/or the like. In some embodiments, the keyword field 213 may auto-populate the one or more keywords as a user types into the keyword field 213. In further embodiments, a list of keywords may be stored in the database 202 d, for example, to auto-populate the one or more keywords. In additional embodiments, the keyword recommendations may be displayed in other suitable formats, for example, but not limited to, a chart, a graph, etc., or combinations thereof.
After entering the one or more keywords, setting the parameters for the keyword research and analysis, and submitting the entries, the tool 202 a may identify keywords that are relevant to the entered one or more keywords for purposes of targeted advertising or marketing, including paid searches. By way of example, according to the embodiment shown in FIG. 2A, a user of keyword UI 210 may enter into the keyword field 211 “keyword research” and “seo keyword research.” The user may wish to find recommended keywords for supplementing or replacing the input keywords for paid searches. Accordingly, the tool 202 a, after undergoing its operations with respect to the input keywords, may output a list in the form of suggestions box 212 of recommended keywords for use in paid searches. In the present embodiment, the recommended keywords include “search engine optimization” and “methodology,” which are related to the input keywords. Accordingly, a user may review and interpret the information embodied in suggestions box 214 and may identify further keywords from the recommended keywords to target for paid searches to increase traffic to the user's website via the paid searches.
After the fields of the UI 210 are populated and submitted by a user, the entered information may be transmitted to the keyword research and analysis tool 202 via the network 104, at which one or more operations may be performed corresponding to the entered keywords.
According to various embodiments, the submitted keywords are received by the crawler 202 c. The crawler 202 c may then acquire content from various web sources to build a corpus of collected content that is relevant to the input keywords. For example, the crawler 202 c may parse Twitter, or any other social networking site, to search for relevant terms or hash tags used by the users of the social media sites. Further details regarding the operation of the crawler 202 c are described above.
According to various embodiments, after collecting and indexing the relevant sources from the web (e.g., via the crawler 202 c), a knowledge graph 229 may be generated by applying, by the tool 202 a, the ensemble of algorithms 202 b to the content acquired by the crawler 202 c to identify and organize a set of semantically relevant topics or keywords from the acquired content. In various embodiments, the knowledge graph is a knowledge base that organizes information gathered from the various web sources to provide structured and detailed information regarding each of the user-entered one or more keywords. In some embodiments, after the tool 202 a applies the ensemble of algorithms 202 b, as described above, to the content acquired by the crawler 202 c to generate the semantically relevant keywords scored by relevance, the knowledge graph may be built by the tool 202 a based on the semantically relevant topics scored by relevance. In other words, the knowledge graph may be a representation and ranking of the semantically relevant topics or keywords. In some embodiments, the knowledge graph includes degrees of conceptual relevance to the initially-entered keywords, each of which serve as the seed terms from which the knowledge graph is built. For example, one of the input keywords is “keyword research,” which may be placed at a first level (or seed level) of the knowledge graph. After applying the above-described algorithms to the content previously acquired by the crawler 202 c, the tool 202 a may be configured to populate lower levels of the knowledge graph branching out from the seed level. For example, if some topics or keywords are identified as having relatively high relevance, based on the acquired content, and as determined using the above-described algorithms, to the input keyword “keyword research,” those highly relevant keywords may be placed closest to the seed term on the knowledge graph (e.g., at the second level of the knowledge graph, that is, one level beneath the seed keyword). The tool 202 a may continue placing keywords at different levels of the knowledge graph based on the keywords' degree of relevance to the seed keyword. As such, a hierarchy of relevant keywords with respect to the input keywords may be built in the form of the knowledge graph. In some embodiments, the step of generating a knowledge graph may be optional, and the tool 202 a may directly output the recommended keyword results to the user based on the results of application of the ensemble of algorithms to the acquired content, and skip the building of any knowledge graph.
According to various embodiments, based on the built knowledge graph, keyword suggestions may be served to the user by the tool 202 a and displayed at the UI 210 via suggestions box 214. In other embodiments, the keyword suggestions are served based directly on the results of the tool's application of the ensemble of algorithms to the acquired content, rather than based on the knowledge graph. In some embodiments, the tool 202 a outputs to the UI 210 all of the keywords of the knowledge graph. In other embodiments, the tool 202 a may utilize relevancy thresholds in determining whether or not to output a keyword as a recommended keyword. For example, the tool 202 a may include a predetermined relevancy threshold, and if a relevance score (as determined by the tool 202 a) of a given keyword is below the relevancy threshold, then the tool 202 a will not output that keyword as a suggested keyword. In some embodiments, the relevancy threshold corresponds to locations on the knowledge graph (e.g., the levels or tiers of the knowledge graph).
Referring to FIG. 2C, illustrated is an embodiment of a screen shot of a recommended keywords page 220. The recommended keywords page 220 may include a first page 221 and a second page 222. The recommended keywords page 220 may include a plurality of recommended keywords 223 that are generated and output by the topic tool 202 a. As noted above, the suggestions box 214 may display a select few of the suggested keywords returned by the topic tool 202 a. The keyword UI 210 may further provide the expanded recommended keywords list via the recommended keywords page 220. In some embodiments, the recommended keywords page 220 is accessed via the suggestions box 214 (e.g., by clicking on the suggestions box 214). In some embodiments, the list of recommended keywords 223 may be ranked in the order of their relevance, from more relevant to less relevant. The recommended keywords page 220 may further provide an add keyword interface 224 for allowing a user to add one or more of the recommended keywords 223 to the input field 213 or to their paid search keyword list. Accordingly, the recommended keywords 223 allow a user to supplement or replace the input keywords for purposes of paid searches and targeted social media advertising.
Referring to FIG. 2D, illustrated is another embodiment of a screen shot of a tool page. According to the embodiment shown in FIG. 2D, numerous keywords are entered into the input field 213, including “#digitalmarketing,” “authorship,” “adwords,” etc. After these input keywords are entered, the topic tool 202 a returns recommended keywords via the suggestions box 214. In the present embodiment, the recommended keywords include “engine optimization” and “#socialmediamarketing.”
Referring to FIG. 2E, illustrated is yet another embodiment of a screen shot of a tool page. According to the embodiment shown in FIG. 2E, numerous keywords are entered into the input field 213, including “free seo,” “paid search,” “seo tools,” etc. After these input keywords are entered, the topic tool 202 a returns recommended keywords via the suggestions box 214. In the present embodiment, the recommended keywords include “search engine marketing” and “seotools.”
Referring to FIG. 3, a method 300 for generating from one or more keywords a list of recommended keywords for using in paid search advertising. At step 301, the method 300 includes identifying, via a tool, one or more keywords to be used in a paid search advertising campaign at an identified website. At step 302, the method 300 includes acquiring, by a crawler, content from a plurality of different web content sources via one or more networks. At step 303, the method 300 includes applying, by the tool, to the acquired content an ensemble of algorithms to identify a set of semantically relevant keywords ranked by a relevance score. At step 304, the method 300 includes generating, by the tool, from the set of semantically relevant keywords, a knowledge graph of recommended keywords to replace or supplement the one or more keywords. At step 305, the method 300 includes outputting, by the tool based at least partially on the knowledge graph, an enumerated list of recommended keywords to one of replace or supplement the one or more keywords to be used for the paid search advertising campaign at the identified website.
In some embodiments, step 301 may be performed by the tool 202 a. In such embodiments, the tool 202 a may receive the one or more keywords from a user interface at a client 102 and via the network 104. For example, the user interface may correspond to UI 210. The one or more keywords may be various words or phrases that a user wishes to acquire related keywords for.
In some embodiments, step 302 may be performed by the crawler 202 c. In such embodiments, the crawler 202 c may be configured to acquire content from various sources, such as, but not limited to, websites, blogs, articles, social networking sites, and/or the like. The acquired content may relate to the one or more keywords.
In some embodiments, step 303 may be performed by the tool 202 a, which may receive the content acquired from the crawler 202 ac In further embodiments, the tool 202 a may access and work in conjunction with the ensemble algorithm 202 b in applying the algorithms to the acquired content. In particular embodiments, the ensemble of algorithms 202 b may include one or more key phrase extraction algorithms, one or more graph analyses algorithms, and one or more natural language processing algorithms.
In some embodiments, step 304 may be performed by the tool 202 a after the tool 202 a identifies the set of semantically relevant keywords ranked by relevance score. The ranking of the keywords may be embodied by the knowledge graph. In other embodiments, the generation of the knowledge graph is an optional step and may be skipped.
In some embodiments, step 305 may be performed by the tool 202 a. In particular embodiments, the tool 202 a may output the list of recommended keywords to the client 102 via the network 104. As such, the list of recommended keywords may be displayed at the UI at the client 102. The list of recommended keywords may be organized based on corresponding relevance scores associated with each recommended keyword.
In some embodiments, the recommended keywords may be incorporated into a paid search campaign such as by submission to a paid search tool or interface, such as Google AdWords for example. In some embodiments, the tool may be designed and constructed to facilitate the use of any one or more recommended keywords in paid advertising tool via any type and form of application programming interface to the paid advertising tool. In some embodiments, the user may use the recommended keyword list to manually configure the paid search campaign in the paid advertising tool. The user may use the systems and methods herein multiple times to replace and/or supplement keywords in the paid search campaign until the desired results or acceptable results are achieved.
While the invention has been particularly shown and described with reference to specific embodiments, it should be understood by those skilled in the art that various changes in form and detail may be made therein without departing from the spirit and scope of the invention described in this disclosure.

Claims (20)

What is claimed is:
1. A method for generating from keywords recommended keywords for selecting content in search engine querying, the method comprising:
(a) receiving, by a tool executing on a server having one or more processors, an input of one or more keywords for which to generate a list of recommended keywords to use via content selection in search engine queries at an identified website;
(b) receiving, by the tool, one or more targeting attributes for targeting in the content selection in search engine queries at the identified website;
(c) acquiring, by a crawler executing on the server, content from the identified website based at least in part on the one or more target attributes;
(d) applying, by the tool, to the acquired content, an ensemble of algorithms, the ensemble comprising a predetermined sequence of:
one or more key phrase extraction algorithms to generate a set of keywords based on at least the acquired content,
one or more graph analyses algorithms to identify a set of topics semantically relevant to the set of keywords generated using the one or more key phrase extract algorithms, and
one or more natural language processing algorithms to determine a relevance score for each topic of the set of semantically relevant keywords identified using the one or more graph analyses algorithm, the relevance score indicating semantic relevance of the topic to the input of the one of more keywords;
(e) generating, by the tool from the set of semantically relevant keywords, a knowledge graph of recommended keywords for the input one of more keywords ranked by at least the relevance scores; and
(f) outputting, by the tool based at least partially on the knowledge graph, an enumerated list of recommended keywords to use in the content selection in search engine queries at the identified website based at least in part on the one or more targeting attributes.
2. The method of claim 1, wherein (b) further comprises receiving, by the tool, one or more targeting attributes identifying a volume of content at the identified website for which to target.
3. The method of claim 1, wherein (b) further comprises receiving, by the tool, one or more targeting attributes identifying a timeline of content at the identified website for which to target.
4. The method of claim 1, wherein (b) further comprises receiving, by the tool, one or more targeting attributes identifying one or more of a gender, language or location of a user for which to target.
5. The method of claim 1, wherein (c) further comprise acquiring, by the crawler, additional content from the plurality of different web content sources comprising web sites, news articles, blog posts and keyword data.
6. The method of claim 5, wherein (d) further comprises applying, by the tool, the ensemble to both the acquired content and the additional content.
7. The method of claim 1, wherein (f) further comprising outputting, by the tool, the enumerated list of recommend keywords ranked by at least one of an attractiveness score, a volume score and a competition score.
8. The method of claim 1, wherein (f) further comprising outputting, by the tool, the enumerated list of recommend keywords ranked by at least a relevance score.
9. The method of claim 1, wherein (f) further comprising outputting, by the tool, the enumerated list of recommend keywords ranked by a cost-per-click value.
10. The method of claim 1, wherein the one or more key phrase extraction algorithms comprise a Bayesian statistical ensemble and the ensemble performs a plurality of term ranking functions are performed including one or more of the following: a core phrase term ranking function, a tail phrase term ranking function, a hyperdictionary graph traversal algorithm and a semantic knowledgebase path traversal score.
11. A system for generating from keywords a list of recommended keywords for selecting content in search engine querying, the system comprising:
a tool executable on a server having one or more processors, configured to:
receive an input of one or more keywords for which to generate a list of recommended keywords to use via content selection in search engine queries at an identified website and one or more targeting attributes for targeting at the identified website;
a crawler executable on the server, configured to acquire content from the identified website based at least in part on the one or more target attributes;
wherein the tool is further configured to:
apply to the acquired content an ensemble of algorithms, the ensemble comprising a predetermined sequence of:
one or more key phrase extraction algorithms to generate a set of keywords based on at least the acquired content,
one or more graph analyses algorithms to identify a set of topics semantically relevant to the set of keywords generated using the one or more key phrase extract algorithms, and
one or more natural language processing algorithms to determine a relevance score for each topic of the set of semantically relevant keywords identified using the one or more graph analyses algorithm, the relevance score indicating semantic relevance of the topic to the input of the one of more keywords;
generate from the set of semantically relevant keywords, a knowledge graph of recommended keywords for the input one of more keywords ranked by at least the relevance scores; and
output, based at least partially on the knowledge graph, an enumerated list of recommended keywords to use for content selection in search engine queries at the identified website based at least in part on the one or more targeting attributes.
12. The system of claim 11, wherein the one or more targeting attributes identify a volume of content at the identified website for which to target.
13. The system of claim 11, wherein the one or more targeting attributes identify a timeline of content at the identified website for which to target.
14. The system of claim 11, wherein the one or more targeting attributes identify one or more of a gender, language or location of a user for which to target.
15. The system of claim 11, wherein the crawler is further configured to acquire additional content from the plurality of different web content sources comprising web sites, news articles, blog posts and keyword data.
16. The system of claim 15, wherein the tool is further configured to apply the ensemble to both the acquired content and the additional content.
17. The system of claim 11, wherein the tool is further configured to output the enumerated list of recommend keywords ranked by at least one of an attractiveness score, a volume score and a competition score.
18. The system of claim 11, wherein the tool is further configured to output the enumerated list of recommend keywords ranked by at least a relevance score.
19. The system of claim 11, wherein the tool is further configured to output the enumerated list of recommend keywords ranked by a cost-per-click value.
20. The system of claim 11, wherein the one or more key phrase extraction algorithms comprise a Bayesian statistical ensemble and the ensemble is further configured to perform a plurality of term ranking functions are performed including one or more of the following: a core phrase term ranking function, a tail phrase term ranking function, a hyperdictionary graph traversal algorithm and a semantic knowledgebase path traversal score.
US14/928,220 2014-10-31 2015-10-30 Systems and methods for semantic keyword analysis for paid search Active 2038-08-07 US10489469B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US14/928,220 US10489469B2 (en) 2014-10-31 2015-10-30 Systems and methods for semantic keyword analysis for paid search

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US201462073560P 2014-10-31 2014-10-31
US14/928,220 US10489469B2 (en) 2014-10-31 2015-10-30 Systems and methods for semantic keyword analysis for paid search

Publications (2)

Publication Number Publication Date
US20160125462A1 US20160125462A1 (en) 2016-05-05
US10489469B2 true US10489469B2 (en) 2019-11-26

Family

ID=54478284

Family Applications (5)

Application Number Title Priority Date Filing Date
US14/928,220 Active 2038-08-07 US10489469B2 (en) 2014-10-31 2015-10-30 Systems and methods for semantic keyword analysis for paid search
US14/928,210 Active 2037-09-21 US10409875B2 (en) 2014-10-31 2015-10-30 Systems and methods for semantic keyword analysis
US16/523,267 Abandoned US20190347304A1 (en) 2014-10-31 2019-07-26 Systems and methods for semantic keyword analysis
US17/188,600 Active 2036-07-05 US11768901B2 (en) 2014-10-31 2021-03-01 Systems and methods for semantic keyword analysis
US18/454,132 Pending US20230394101A1 (en) 2014-10-31 2023-08-23 Systems and methods for semantic keyword analysis

Family Applications After (4)

Application Number Title Priority Date Filing Date
US14/928,210 Active 2037-09-21 US10409875B2 (en) 2014-10-31 2015-10-30 Systems and methods for semantic keyword analysis
US16/523,267 Abandoned US20190347304A1 (en) 2014-10-31 2019-07-26 Systems and methods for semantic keyword analysis
US17/188,600 Active 2036-07-05 US11768901B2 (en) 2014-10-31 2021-03-01 Systems and methods for semantic keyword analysis
US18/454,132 Pending US20230394101A1 (en) 2014-10-31 2023-08-23 Systems and methods for semantic keyword analysis

Country Status (2)

Country Link
US (5) US10489469B2 (en)
WO (2) WO2016070028A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108228556A (en) * 2016-12-14 2018-06-29 北京国双科技有限公司 Key phrase extracting method and device

Families Citing this family (36)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10489469B2 (en) * 2014-10-31 2019-11-26 Marketmuse, Inc. Systems and methods for semantic keyword analysis for paid search
US11334800B2 (en) * 2016-05-12 2022-05-17 International Business Machines Corporation Altering input search terms
CN106446146B (en) * 2016-09-21 2019-05-17 中国国防科技信息中心 The identification model of event duration follower is established and recognition methods in a kind of microblogging
CN107918616A (en) * 2016-10-09 2018-04-17 阿里巴巴集团控股有限公司 Search system, page display method and client
US10325033B2 (en) 2016-10-28 2019-06-18 Searchmetrics Gmbh Determination of content score
CN108280081B (en) * 2017-01-06 2023-02-24 百度在线网络技术(北京)有限公司 Method and device for generating webpage
US10496713B2 (en) 2017-02-01 2019-12-03 Google Llc Gain adjustment component for computer network routing infrastructure
US10140286B2 (en) * 2017-02-22 2018-11-27 Google Llc Optimized graph traversal
CN107229668B (en) * 2017-03-07 2020-04-21 桂林电子科技大学 Text extraction method based on keyword matching
CN108694178B (en) * 2017-04-06 2020-11-27 北京国双科技有限公司 Method and device for recommending judicial knowledge
US10467265B2 (en) 2017-05-22 2019-11-05 Searchmetrics Gmbh Method for extracting entries from a database
CN107239450B (en) * 2017-06-02 2021-11-23 上海对岸信息科技有限公司 Method for processing natural language based on interactive context
US10776409B2 (en) 2017-06-21 2020-09-15 International Business Machines Corporation Recommending responses to emergent conditions
US11232363B2 (en) 2017-08-29 2022-01-25 Jacov Jackie Baloul System and method of providing news analysis using artificial intelligence
CN107633044B (en) * 2017-09-14 2021-08-06 国家计算机网络与信息安全管理中心 Public opinion knowledge graph construction method based on hot events
CN110020132B (en) * 2017-11-03 2023-04-11 腾讯科技(北京)有限公司 Keyword recommendation method and device, computing equipment and storage medium
US10572596B2 (en) 2017-11-14 2020-02-25 International Business Machines Corporation Real-time on-demand auction based content clarification
WO2019106566A2 (en) * 2017-11-28 2019-06-06 Wix.Com Ltd. System and method for the generation and editing of text content in website building systems
US10942963B1 (en) * 2018-04-05 2021-03-09 Intuit Inc. Method and system for generating topic names for groups of terms
IL258689A (en) 2018-04-12 2018-05-31 Browarnik Abel A system and method for computerized semantic indexing and searching
US11294944B2 (en) * 2018-06-03 2022-04-05 Apple Inc. Correction and completion of search queries
US11182396B2 (en) * 2018-07-02 2021-11-23 Palo Alto Research Center Incorporated System and method for a graph search engine
CN110245242B (en) * 2019-06-20 2022-01-18 北京百度网讯科技有限公司 Medical knowledge graph construction method and device and terminal
US11250214B2 (en) * 2019-07-02 2022-02-15 Microsoft Technology Licensing, Llc Keyphrase extraction beyond language modeling
US11874882B2 (en) 2019-07-02 2024-01-16 Microsoft Technology Licensing, Llc Extracting key phrase candidates from documents and producing topical authority ranking
CN111104475B (en) * 2019-11-15 2020-11-03 广东数果科技有限公司 Data storage and intelligent construction method of knowledge graph and implementation system
CN110929043B (en) * 2019-11-28 2023-02-24 北京智齿博创科技有限公司 Service problem extraction method and device
CN111221303B (en) * 2019-12-08 2021-10-19 华中科技大学同济医学院附属协和医院 CAN bus-based automatic office conference demonstration system and control method thereof
CN111279394B (en) * 2020-01-06 2021-04-23 深圳信息职业技术学院 Task execution method and device of virtual role and terminal equipment
CN111444407B (en) * 2020-03-26 2023-05-16 桂林理工大学 Automatic extraction method and system for page list information of web crawlers
US11238052B2 (en) * 2020-06-08 2022-02-01 International Business Machines Corporation Refining a search request to a content provider
DE112020007373T5 (en) * 2020-08-14 2023-05-11 Siemens Aktiengesellschaft Method and device for intelligently providing recommendations
CN112364257B (en) * 2020-11-13 2021-05-18 四川省人工智能研究院(宜宾) Knowledge graph-based user group recommendation system and method
US11244005B1 (en) * 2021-07-12 2022-02-08 Jeffrey Boettcher System for amplifying user online visibility and credibility
CN113742591B (en) * 2021-09-07 2023-07-25 平安科技(深圳)有限公司 Learning partner recommendation method and device, electronic equipment and storage medium
US20230146998A1 (en) * 2021-11-09 2023-05-11 GSCORE Inc. Systems, devices, and methods for search engine optimization

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020129014A1 (en) 2001-01-10 2002-09-12 Kim Brian S. Systems and methods of retrieving relevant information
US20070174340A1 (en) 2005-11-30 2007-07-26 Gross John N System & Method of Delivering RSS Content Based Advertising
US20100030648A1 (en) 2008-08-01 2010-02-04 Microsoft Corporation Social media driven advertisement targeting
US20110213655A1 (en) * 2009-01-24 2011-09-01 Kontera Technologies, Inc. Hybrid contextual advertising and related content analysis and display techniques
US20110288931A1 (en) 2010-05-20 2011-11-24 Google Inc. Microsite models
US20120226713A1 (en) 2011-03-03 2012-09-06 Brightedge Technologies, Inc. Optimizing internet campaigns
US8676798B1 (en) 2009-09-30 2014-03-18 BloomReach Inc. Query generation for searchable content
US20180366013A1 (en) * 2014-08-28 2018-12-20 Ideaphora India Private Limited System and method for providing an interactive visual learning environment for creation, presentation, sharing, organizing and analysis of knowledge on subject matter
US10162882B2 (en) * 2014-07-14 2018-12-25 Nternational Business Machines Corporation Automatically linking text to concepts in a knowledge base

Family Cites Families (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8271473B2 (en) * 2007-06-25 2012-09-18 Jobs2Web, Inc. System and method for career website optimization
US9160837B2 (en) * 2011-06-29 2015-10-13 Gracenote, Inc. Interactive streaming content apparatus, systems and methods
US9461876B2 (en) * 2012-08-29 2016-10-04 Loci System and method for fuzzy concept mapping, voting ontology crowd sourcing, and technology prediction
WO2015039165A1 (en) * 2013-09-19 2015-03-26 Longtail Ux Pty Ltd Improvements in website traffic optimization
US11551567B2 (en) * 2014-08-28 2023-01-10 Ideaphora India Private Limited System and method for providing an interactive visual learning environment for creation, presentation, sharing, organizing and analysis of knowledge on subject matter
US10489469B2 (en) * 2014-10-31 2019-11-26 Marketmuse, Inc. Systems and methods for semantic keyword analysis for paid search
US10496713B2 (en) * 2017-02-01 2019-12-03 Google Llc Gain adjustment component for computer network routing infrastructure
JP6569846B1 (en) * 2018-03-02 2019-09-04 富士通株式会社 Response processing program, response processing method, response processing apparatus, and response processing system
US10095688B1 (en) * 2018-04-02 2018-10-09 Josh Schilling Adaptive network querying system
US11061913B2 (en) * 2018-11-30 2021-07-13 International Business Machines Corporation Automated document filtration and priority scoring for document searching and access
US11074262B2 (en) * 2018-11-30 2021-07-27 International Business Machines Corporation Automated document filtration and prioritization for document searching and access
US11886814B2 (en) * 2020-01-24 2024-01-30 Thomson Reuters Enterprise Centre Gmbh Systems and methods for deviation detection, information extraction and obligation deviation detection
US11270061B2 (en) * 2020-02-25 2022-03-08 International Business Machines Corporation Automatic generation of training data for scientific paper summarization using videos
US11132988B1 (en) * 2020-10-22 2021-09-28 PolyAI Limited Dialogue system, a dialogue method, and a method of training
US11574522B1 (en) * 2022-03-01 2023-02-07 DraftKings, Inc. Systems and methods for arranging interface elements based on broadcaster events

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020129014A1 (en) 2001-01-10 2002-09-12 Kim Brian S. Systems and methods of retrieving relevant information
US20070174340A1 (en) 2005-11-30 2007-07-26 Gross John N System & Method of Delivering RSS Content Based Advertising
US20100030648A1 (en) 2008-08-01 2010-02-04 Microsoft Corporation Social media driven advertisement targeting
US20110213655A1 (en) * 2009-01-24 2011-09-01 Kontera Technologies, Inc. Hybrid contextual advertising and related content analysis and display techniques
US8676798B1 (en) 2009-09-30 2014-03-18 BloomReach Inc. Query generation for searchable content
US20110288931A1 (en) 2010-05-20 2011-11-24 Google Inc. Microsite models
US20120226713A1 (en) 2011-03-03 2012-09-06 Brightedge Technologies, Inc. Optimizing internet campaigns
US10162882B2 (en) * 2014-07-14 2018-12-25 Nternational Business Machines Corporation Automatically linking text to concepts in a knowledge base
US20180366013A1 (en) * 2014-08-28 2018-12-20 Ideaphora India Private Limited System and method for providing an interactive visual learning environment for creation, presentation, sharing, organizing and analysis of knowledge on subject matter

Non-Patent Citations (10)

* Cited by examiner, † Cited by third party
Title
Combination of Unsupervised Keyphrase Extraction Algorithms. (Year: 2013). *
Ensemble learning for keyword extraction from event descriptions (Year: 2014). *
Final Office Action on U.S. Appl. No. 14/928,210 dated Oct. 25, 2018.
International Preliminary Report on Patentability for PCT/US2015/058289 dated May 11, 2017.
International Search Report and Written Opinion of the International Searching Authority for International Application No. PCT/US2015/058289 dated Feb. 2, 2016.
International Search Report and Written Opinion of the International Searching Authority on PCT/US2015/058285 dated Feb. 1, 2016.
Kim, et al. "Retrieving and Ranking Methods for Finding Match Candidates". 2011 International Conference on Complex, Intelligent, and Software Intensive Systems. 5 pages.
Notice of Allowance dated May 1, 2019 for U.S. Appl. No. 14/928,210, 34 pages.
Notice of Allowance on U.S. Appl. No. 14/928,210 dated May 1, 2019.
U.S. Office Action on U.S. Appl. No. 14/928,210 dated May 3, 2018.

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108228556A (en) * 2016-12-14 2018-06-29 北京国双科技有限公司 Key phrase extracting method and device

Also Published As

Publication number Publication date
US20230394101A1 (en) 2023-12-07
WO2016070026A1 (en) 2016-05-06
WO2016070028A1 (en) 2016-05-06
US20160125462A1 (en) 2016-05-05
US20160125087A1 (en) 2016-05-05
US11768901B2 (en) 2023-09-26
US20210182352A1 (en) 2021-06-17
US10409875B2 (en) 2019-09-10
US20190347304A1 (en) 2019-11-14

Similar Documents

Publication Publication Date Title
US11768901B2 (en) Systems and methods for semantic keyword analysis
US9819633B2 (en) Systems and methods for categorizing messages
US20230195823A1 (en) Systems and methods for screenshot linking
US20190266200A1 (en) Systems and methods for redirecting to track user identifiers across different websites
US20210119976A1 (en) Systems and methods for managing telecommunications
US20140143333A1 (en) Systems and methods for a social discovery platform
US11704132B2 (en) Computer resource management based on prioritization of computer executable events
US20160371446A1 (en) Methods and systems for providing medical decision support
US20160005128A1 (en) Systems and methods of applying high performance computational techniques to analysis and execution of financial strategies
US20170272550A1 (en) Systems and methods for establishing communication interfaces to monitor online interactions via event listeners
US20200112534A1 (en) Systems and methods for text to social networking site to buy
US9460457B1 (en) Automatically annotating content items with an entity
US20140107925A1 (en) Systems and methods for tracking a set of experiments
US20160188721A1 (en) Accessing Multi-State Search Results
US20140282025A1 (en) Matching website visits with content selection parameters
US11899656B2 (en) Systems and methods for dynamic media asset modification
US11620316B1 (en) Systems and methods for building an inventory database with automatic labeling
US20230142718A1 (en) Systems and methods for generating dynamic feed of educational content
US11741276B1 (en) Systems and methods for modeling live events
US11695993B1 (en) System and method for creating and organizing content
US11734441B2 (en) Systems and methods for tracing data across file-related operations
US11622156B1 (en) Systems and methods for synchronizing client data structures with broadcaster events
US11122105B2 (en) System and method for component based web services development
US20230140974A1 (en) Systems and methods for deeply integrated presentation gadgets

Legal Events

Date Code Title Description
AS Assignment

Owner name: INFORMITE INC., MASSACHUSETTS

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:MALLAH, RICHARD ISRAEL;BALOGH, AKOS LAJOS;SIGNING DATES FROM 20151119 TO 20151201;REEL/FRAME:046542/0913

Owner name: MARKETMUSE, INC., MASSACHUSETTS

Free format text: CHANGE OF NAME;ASSIGNOR:INFORMITE INC.;REEL/FRAME:046700/0358

Effective date: 20151001

STPP Information on status: patent application and granting procedure in general

Free format text: NON FINAL ACTION MAILED

STPP Information on status: patent application and granting procedure in general

Free format text: RESPONSE TO NON-FINAL OFFICE ACTION ENTERED AND FORWARDED TO EXAMINER

STPP Information on status: patent application and granting procedure in general

Free format text: NOTICE OF ALLOWANCE MAILED -- APPLICATION RECEIVED IN OFFICE OF PUBLICATIONS

STPP Information on status: patent application and granting procedure in general

Free format text: NOTICE OF ALLOWANCE MAILED -- APPLICATION RECEIVED IN OFFICE OF PUBLICATIONS

STPP Information on status: patent application and granting procedure in general

Free format text: PUBLICATIONS -- ISSUE FEE PAYMENT VERIFIED

STCF Information on status: patent grant

Free format text: PATENTED CASE

MAFP Maintenance fee payment

Free format text: PAYMENT OF MAINTENANCE FEE, 4TH YR, SMALL ENTITY (ORIGINAL EVENT CODE: M2551); ENTITY STATUS OF PATENT OWNER: SMALL ENTITY

Year of fee payment: 4