CN108280103B - Website index statistical method and device and server - Google Patents

Website index statistical method and device and server Download PDF

Info

Publication number
CN108280103B
CN108280103B CN201710074037.2A CN201710074037A CN108280103B CN 108280103 B CN108280103 B CN 108280103B CN 201710074037 A CN201710074037 A CN 201710074037A CN 108280103 B CN108280103 B CN 108280103B
Authority
CN
China
Prior art keywords
domain name
website
updated
database
site family
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201710074037.2A
Other languages
Chinese (zh)
Other versions
CN108280103A (en
Inventor
江锐洪
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Alibaba China Co Ltd
Original Assignee
Alibaba China Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alibaba China Co Ltd filed Critical Alibaba China Co Ltd
Priority to CN201710074037.2A priority Critical patent/CN108280103B/en
Publication of CN108280103A publication Critical patent/CN108280103A/en
Application granted granted Critical
Publication of CN108280103B publication Critical patent/CN108280103B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/958Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L61/00Network arrangements, protocols or services for addressing or naming
    • H04L61/45Network directories; Name-to-address mapping
    • H04L61/4505Network directories; Name-to-address mapping using standardised directories; using standardised directory access protocols
    • H04L61/4511Network directories; Name-to-address mapping using standardised directories; using standardised directory access protocols using domain name system [DNS]

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

The invention provides a website index statistical method, a device and a server, and relates to the field of website analysis. The website index statistical method and the website index statistical device are both applied to a server, and the server comprises a database which stores at least one site family, wherein the site family comprises a plurality of domain names which have skip relations with each other. The method queries a corresponding site family from the database according to the input domain name. And counting the website data of each domain name in the website family. The website data of each domain name represents website data generated by entering the corresponding website through each domain name. And obtaining more accurate website indexes according to the website data of each domain name in the website family. Therefore, the continuous tracking and accurate analysis of the website can be carried out under the condition that the domain name of the website is frequently changed.

Description

Website index statistical method and device and server
Technical Field
The invention relates to the technical field of website analysis, in particular to a website index statistical method, a device and a server.
Background
The website analysis is to analyze the data of the website in operation, and further obtain a website index which can accurately reflect the user trend and the actual status of the website. The website indexes obtained by analyzing the website can provide effective reference for a browser service provider in decision making; and the method can also play a role in guiding the website operator to adjust the website. Thus, website analysis is extremely important for the operator of the website, the browser provider, and the like.
Currently, the website analysis is mainly based on a separate domain name. When the domain name is frequently changed, because the domain names of the analyzed website used in different time periods are different, the website index data obtained by the independent domain name-based analysis method often has no reference property (the index data grows quickly and disappears quickly), and is not beneficial to tracking and analyzing the website index.
Disclosure of Invention
In order to solve the above problems, the embodiments of the present invention adopt the following technical solutions:
the embodiment of the invention provides a website index statistical method, which is applied to a server, wherein the server comprises a database which stores at least one site family, the site family comprises a plurality of domain names which have skip relations with each other, and the method comprises the following steps: obtaining an input domain name; inquiring a corresponding site family from the database according to the input domain name; counting website data of each domain name in the website family; and obtaining the website index according to the website data of each domain name in the website family.
The embodiment of the invention also provides a website index statistical device, which is applied to a server, wherein the server comprises a database which stores at least one site family, the site family comprises a plurality of domain names which have skip relations with each other, and the device comprises: an obtaining module for obtaining an input domain name; the query module is used for querying the corresponding site family from the database according to the input domain name; the statistical module is used for counting website data of each domain name in the site family; and the acquisition module is used for acquiring the website indexes according to the website data of each domain name in the website family.
An embodiment of the present invention further provides a server, where the server includes: a first memory; a first processor; the method comprises the steps that a database storing at least one site family is stored, wherein the site family comprises a plurality of domain names which have jumping relations with each other; and a website index statistical apparatus, the apparatus comprising: an obtaining module for obtaining an input domain name; the query module is used for querying the corresponding site family from the database according to the input domain name; the statistical module is used for counting website data of each domain name in the site family; and the acquisition module is used for acquiring the website indexes according to the website data of each domain name in the website family.
Compared with the prior art, the website index statistical method and device provided by the embodiment of the invention are applied to a server, wherein the server comprises a database which stores at least one site family, the site family comprises a plurality of domain names which have a skip relation with each other, and the domain names in the site family are domain names used by corresponding websites in different time periods. And the method queries a corresponding site family from the database according to the input domain name. And counting the website data of each domain name in the website family. The website data of each domain name represents website data generated by entering the corresponding website through each domain name. And obtaining the website index according to the website data of each domain name in the website family. The website indexes obtained by analyzing and counting based on the complete data of the website are more accurate. Therefore, the continuous tracking and accurate analysis of the website can be carried out under the condition that the domain name of the website is frequently changed.
In order to make the aforementioned and other objects, features and advantages of the present invention comprehensible, preferred embodiments accompanied with figures are described in detail below.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings needed to be used in the embodiments will be briefly described below, it should be understood that the following drawings only illustrate some embodiments of the present invention and therefore should not be considered as limiting the scope, and for those skilled in the art, other related drawings can be obtained according to the drawings without inventive efforts.
Fig. 1 is a block diagram of a server according to a preferred embodiment of the present invention.
Fig. 2 shows a functional module diagram of the network station index statistical device in fig. 1.
Fig. 3 is a functional sub-module diagram of the setup module shown in fig. 2.
Fig. 4 is a schematic diagram of a list of domain names in a newly generated site family.
Fig. 5 is a functional sub-module diagram of the update module shown in fig. 2.
Fig. 6 is a schematic diagram of the updated domain name list of the site family.
FIG. 7 is a flowchart illustrating a method for providing website index statistics according to an embodiment of the present invention.
FIG. 8 shows another portion of a flowchart of a method for providing website metrics, according to an embodiment of the invention.
Fig. 9 is a flowchart illustrating sub-steps of step S201 in fig. 7.
Fig. 10 is a flowchart illustrating sub-steps of step S202 in fig. 7.
Icon: 100-a server; 111-a memory; 112-a processor; 113-a communication unit; 300-a database; 200-website index statistical device; 201-an obtaining module; 202-a query module; 203-a statistics module; 204-an obtaining module; 205-a setup module; 2051-a receiving submodule; 2052-a first load submodule; 2053-a first comparison sub-module; 2054-generating a submodule; 2055-storage submodule; 206-an update module; 2061-obtaining submodules; 2062-second load submodule; 2063-a second comparison submodule; 2064-update sub-module.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. The components of embodiments of the present invention generally described and illustrated in the figures herein may be arranged and designed in a wide variety of different configurations. Thus, the following detailed description of the embodiments of the present invention, presented in the figures, is not intended to limit the scope of the invention, as claimed, but is merely representative of selected embodiments of the invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments of the present invention without making any creative effort, shall fall within the protection scope of the present invention.
It should be noted that: like reference numbers and letters refer to like items in the following figures, and thus, once an item is defined in one figure, it need not be further defined and explained in subsequent figures. Meanwhile, in the description of the present invention, the terms "first", "second", and the like are used only for distinguishing the description, and are not to be construed as indicating or implying relative importance.
Referring to fig. 1, fig. 1 is a block diagram of a server 100. The server 100 includes a website index statistics apparatus 200, a database 300, a memory 111, a processor 112, and a communication unit 113.
The memory 111, the processor 112 and the communication unit 113 are electrically connected to each other directly or indirectly to realize data transmission or interaction. For example, the components may be electrically connected to each other via one or more communication buses or signal lines. The website index statistical device 200 includes at least one software function module, which may be stored in the memory 111 in the form of software or Firmware (Firmware) or solidified in an Operating System (OS) of the server 100. The processor 112 is used for executing executable modules stored in the memory 111, such as software functional modules and computer programs included in the website index statistics apparatus 200.
The Memory 111 may be, but is not limited to, a Random Access Memory (RAM), a Read Only Memory (ROM), a Programmable Read-Only Memory (PROM), an Erasable Read-Only Memory (EPROM), an electrically Erasable Read-Only Memory (EEPROM), and the like. The memory 111 is used to store programs or data. For example, the memory 111 may store the database 300. The communication unit 113 is configured to establish a communication connection between the server 100 and another communication terminal via the network, and to transceive data via the network.
The database 300 stores at least one site family. The site family comprises a plurality of domain names which have jumping relations with each other. The site family is a group for dividing domain names with direct or indirect jump relation. It should be noted that the Domain Name (Domain Name) is the Name of a computer or group of computers on the Internet (e.g., 1688.com) composed of a string of characters separated by dots, and is used to identify the electronic position of the computer during data transmission. The domain name of a website is the address of the website server. Therefore, to access a website, the domain name corresponding to the website needs to be loaded to access the website. When a domain name loading website is opened through a browser, the browser jumps to another domain name for loading according to the domain name, and displays the jumped domain name after the website is loaded, the jumping relation between the two domain names is shown. In general, when the domain name of a website is changed from an old domain name to a new domain name, the old domain name is not immediately stopped, but when the website is loaded by inputting the old domain name, the website is automatically jumped from the old domain name to the new domain name so as to be loaded out. It is understood that the above-mentioned jumping relationship may be jumping between domains used by the same website at different times, or jumping between different websites. Thus, a domain name in a site family may be understood as a domain name used by one or more associated websites corresponding to the site family at different time periods. The jumping direction between two domain names and the jumping time can be recorded in the site family.
It should be understood that the configuration shown in fig. 1 is merely a schematic diagram of the configuration of the server 100, and that the server 100 may include more or less components than those shown in fig. 1, or have a different configuration than that shown in fig. 1. The components shown in fig. 1 may be implemented in hardware, software, or a combination thereof.
First embodiment
Fig. 2 is a functional block diagram of a website index statistics apparatus 200 according to a first embodiment of the present invention. Referring to fig. 2, the website index statistical apparatus 200 provided in the present embodiment is applied to the server 100. The website index statistical device 200 includes: an obtaining module 201, a query module 202, a statistic module 203 and an obtaining module 204.
An obtaining module 201, configured to obtain an input domain name.
Wherein, the input domain name is the analyzed domain name. Specifically, the obtaining of the input domain name may be, but is not limited to, the server 100 receiving the input domain name sent by other communication terminals through the communication unit 113, obtaining the input domain name according to a preset website analysis domain name list, and the like.
In this embodiment, the obtaining module 201 may further be configured to obtain an instruction whether to enable the website index analysis method based on the site family. The instruction may be sent to the server 100 through other communication terminals (for example, sent to the server 100 by a user through other communication devices according to a requirement), or may be directly obtained by the server 100 (for example, an instruction triggering rule preset in the server 100, such as an instruction for automatically triggering to enable a website index analysis method based on a website family if the daily access amount of the input domain name exceeds 200 times). The server 100 determines whether to analyze the analyzed domain name by using a website index analysis method based on the website family according to the instruction, so as to reduce the workload of the server 100 and improve the working efficiency of the server 100.
The query module 202 is configured to query the corresponding site family from the database 300 according to the input domain name.
In the present embodiment, when the server 100 obtains an instruction to use the site family-based website index analysis method, it is queried from the database 300 whether there is a corresponding site family according to an input domain name. Specifically, a domain name list of a site family is queried, and if it is found that the input domain name has a record in the domain name list of the site family, it indicates that the site family is the site family corresponding to the input domain name. If the input domain name is not recorded in the domain name list of the site family, it indicates that the site family corresponding to the input domain name is not queried in the database 300.
The counting module 203 is configured to count website data of each domain name in the site family when the database 300 queries the site family corresponding to the input domain name.
In this embodiment, the website data may be, but is not limited to, click volume, traffic volume, and jump volume. Specifically, the website data of each domain name in the site family is obtained and classified and counted (for example, the values of click rate, traffic, and jump rate of each domain name in each time period are counted and accumulated). When the statistics module 203 does not query the site family corresponding to the input domain name, the statistics module 203 is configured to directly obtain the website data of the input domain name. Since the corresponding site family is not queried, it indicates that the input domain name may be a domain name without a jump relationship, and it is also possible that the corresponding site family is not established in the database 300 and the change of the domain name is tracked.
The obtaining module 204 is configured to, when a site family corresponding to the input domain name is queried in the database 300, obtain the website index according to the website data of each domain name in the site family.
In this embodiment, the website index may be, but is not limited to, a website conversion rate, a user rate, and the like. The website indexes are obtained after analysis is carried out according to website data. In this embodiment, the obtaining module 204 overlaps data of the website data of each domain name in the site family on a time axis within a preset time period, and then analyzes the data to obtain the website index within the preset time period. For example, the browsing volumes of the domain name a and the domain name B in the same site family in 1 month and 10 days are 100 times and 200 times, respectively, the browsing volume of the website corresponding to the site family in 1 month and 10 days is 300, and the browsing volume 300 is the website index. When the site family corresponding to the input domain name is not queried, the obtaining module 204 directly analyzes according to the website data of the input domain name to obtain the website index.
The website index statistical apparatus 200 according to the embodiment of the present invention further includes an establishing module 205 and an updating module 206.
The building module 205 is used to build the database 300. At least one site family is stored in the database 300. The site family is a group for dividing domain names with direct or indirect jump relation. The method is used for recording the domain name change condition of the website corresponding to the website family and the domain name used in the change process.
In this embodiment, as shown in fig. 3, the establishing module 205 includes a receiving sub-module 2051, a first loading sub-module 2052, a first comparing sub-module 2053, a generating sub-module 2054, and a storing sub-module 2055.
The receiving sub-module 2051 is configured to receive an input first domain name corresponding to a website to be monitored. The first domain name is the domain name of the website to be monitored which is obtained for the first time. It should be noted that, because there are a large number of websites in the current network, not all websites have the value of tracking records. In order to ensure the working efficiency of the server 100. The website to be monitored may be, but not limited to, a focus-focused website set by a user, a website automatically listed in a row or column of websites to be monitored due to a surge in website indexes (such as traffic), a website automatically skipped to a website to be monitored when being accessed and added to a row or column of websites to be monitored, and the like.
In this embodiment, the receiving sub-module 2051 receives, through the browser, the input first domain name corresponding to the website to be monitored. The first domain name is the domain name corresponding to the website to be monitored when the website is determined as the website to be monitored. Which may be understood as the starting point of the domain name to be monitored for the website to be monitored.
The first loading sub-module 2052 is configured to load the website to be monitored according to the first domain name.
In this embodiment, the website to be monitored is loaded, specifically, the website server of the website to be monitored is accessed according to the first domain name, and the feedback data of the website server is obtained. It should be noted that the feedback data includes the current domain name information of the website to be monitored.
And a first comparing sub-module 2053, configured to compare the first domain name with a second domain name corresponding to the website to be monitored after the loading is completed.
In this embodiment, the second domain name refers to a current domain name of the website to be monitored in the process of establishing the database 300. The first comparison sub-module 2053 first obtains the second domain name according to the feedback data. Specifically, the browser may present a website page to be monitored according to the feedback data of the website server, and the second domain name may be queried from an address bar of the website page to be monitored. The first comparison sub-module 2053 then compares the first domain name to the second domain name. If the domain name of the website to be monitored is changed, the browser automatically jumps to a second domain name according to the first domain name when accessing the website server to be monitored, and then accesses the website server to be monitored according to the second domain name. At this time, the address data in the feedback data of the website server to be monitored is loaded to obtain a second domain name which is different from the first domain name; if the domain name of the website to be monitored is not changed, the browser can directly access the website server according to the first domain name. At this time, the second domain name loaded according to the feedback data is the same as the first domain name.
A generating sub-module 2054 is configured to, when the first domain name is different from the second domain name, generate the site family based on the first domain name and the second domain name. Specifically, the first domain name, the second domain name, the jump relationship therebetween, and the occurrence time of the jump relationship are recorded in a domain name list, so as to generate a site family, where the domain name list may refer to fig. 4 and will not be described in detail herein.
A storage submodule 2055, configured to store all the generated site families to obtain the database 300. Specifically, the generated domain name lists of all the site families are stored in the database 300 of the server 100, and the establishment of the database 300 is completed.
The update module 206 is used to update the database 300. In reality, the domain name of the website to be monitored may be changed at irregular time, so that the database 300 needs to be updated continuously in order to continuously track the website to be monitored and obtain the website data of the website to be monitored when different domain names are used. Specifically, the site family stored in the database 300 is updated. In this embodiment, the updating may be performed by updating the database 300 at a preset time interval. For example, the database 300 is updated once every 3 days. Referring to fig. 5, in the present embodiment, the update module 206 may include: a get submodule 2061, a second load submodule 2062, a second compare submodule 2063, and an update submodule 2064.
The obtaining sub-module 2061 is used for obtaining the domain name to be updated. In this embodiment, the domain name to be updated is the domain name that needs to be monitored whether a jump will occur. Specifically, the domain name to be updated includes each domain name in the domain name list of the site family stored in the database 300 and the first domain name of the website to be monitored of the site family not stored in the database 300.
The second loading submodule 2062 is configured to open the domain name to be updated, and load the website corresponding to the domain name to be updated. Specifically, the website server corresponding to the domain name to be updated is accessed according to the domain name to be updated, and feedback data of the website server is obtained. It should be noted that the feedback data includes the current domain name information of the website corresponding to the domain name to be updated.
A second comparing sub-module 2063, configured to compare the domain name to be updated with the third domain name after the website corresponding to the domain name to be updated is loaded. In this embodiment, the third domain name refers to the current domain name when the website corresponding to the domain name to be updated is accessed in the process of updating the database 300 by the updating module 206. The second comparing sub-module 2063 first obtains the third domain name according to the feedback data. Specifically, the browser may present a website page according to the feedback data, and the third domain name may be queried in an address bar of the website page. The second comparison sub-module 2063 then compares the domain name to be updated with the third domain name. In this embodiment, at least one domain name to be updated is provided, so that when more than one domain name to be updated is provided, after loading and comparing one domain name to be updated is completed, the next domain name is continued until each domain name to be updated is loaded and compared.
An update sub-module 2064, configured to update the database 300 according to the domain name to be updated and the third domain name when the third domain name is different from the domain name to be updated. The step of the update sub-module 2064 executing the update of the database 300 according to the domain name to be updated and the third domain name includes: and inquiring whether the domain name to be updated has the corresponding site family. And when the domain name to be updated has the corresponding site family, adding the domain name to be updated and a third domain name into the corresponding site family. Specifically, the domain name to be updated, the third domain name, the jump relationship between the two and the time when the jump relationship occurs are added into the domain name list of the corresponding site family. By constantly updating the site family, the list of domain names in the site family is also constantly expanding. For convenience of description, the domain names in the domain name list are replaced by a1.com to a6.com, and the change time interval and the update time interval are calculated in one day. A1.com is a first domain name of a website to be monitored, and when updating is carried out on 1 month and 1 day, the step of A1.com is skipped to A2. com; when updating for 1 month and 2 days, both A1.com and A2.com jump to A3.com; when the updating is carried out for 3 days in 1 month, A1.com, A2.com and A.com3 all jump to A4.com; when the updating is carried out on 4 days in 1 month, A1.com, A2.com, A3.com and A4.com all jump to A5.com; when updating for 1 month and 5 days, A1.com no longer jumps, and the website cannot be loaded, and A2.com, A3.com, A4.com and A5.com all jump to A6. com.
When the corresponding site family does not exist in the domain name to be updated, a site family is generated according to the domain name to be updated and the third domain name, and is stored in the database 300. Specifically, the domain name to be updated, the third domain name, the jump relationship between the two and the time when the jump relationship occurs are recorded in a new domain name list, so as to generate a new site family, and the site family is stored in the database 300.
Second embodiment
Fig. 7 is a flowchart of a website index statistics method according to an embodiment of the present invention. Referring to fig. 7, a website index statistics method is applied to the server 100, and the method includes the following steps:
step S101, obtaining an input domain name. Inputting the domain name refers to the domain name which needs to be analyzed in daily website analysis work.
In the embodiment of the present invention, step S101 may be performed by the obtaining module 201.
In other embodiments, the method may include a step of obtaining an instruction whether to enable the site family based website index analysis method after step S101. When the obtained instruction is to start the website index analysis method based on the website family, the flow goes to step S102; and when the obtained instruction is that the website index analysis method based on the site family is not started, directly obtaining the website data of the input domain name after the step S101, and directly analyzing the website data of the input domain name to obtain the website index.
Step S102, querying a corresponding site family from the database 300 according to the input domain name.
In an embodiment of the present invention, step S102 may be performed by the query module 202.
Step S103, counting the website data of each domain name in the site family. Specifically, the website data of all domain names in the website family queried according to the input domain name are respectively counted.
In the embodiment of the present invention, step S103 may be performed by the statistics module 203. It should be noted that, when it is not found in step S102 that the input domain name has a corresponding site family in the database 300, step S103 is to directly count the website data of the input domain name.
And step S104, obtaining the website index according to the website data of each domain name in the website family. Specifically, overlapping data of website data of each domain name in the site family on a time axis within a preset time period are overlapped, and then are analyzed to obtain website indexes within the preset time period.
In the embodiment of the present invention, step S104 may be executed by the obtaining module 204. It should be noted that, when it is not found in step S102 that the input domain name has a corresponding site family in the database 300, step S104 is to directly perform analysis according to the website data of the input domain name to obtain the website index.
As shown in fig. 8, the website index statistical method provided in the embodiment of the present invention may further include the following steps:
step S201, the database 300 is established. At least one site family is stored in the database 300.
In the embodiment of the present invention, step S201 may be performed by the establishing module 205. Specifically, as shown in fig. 8, step S201 includes the following sub-steps:
and a substep S2011 of receiving the input first domain name corresponding to the website to be monitored.
In an embodiment of the present invention, sub-step S2011 may be performed by the receiving sub-module 2051.
And a substep S2012, loading the website to be monitored according to the first domain name.
In the embodiment of the present invention, sub-step S2012 may be performed by the first loading sub-module 2052.
And a substep S2013 of comparing the first domain name with a corresponding second domain name after the loading of the website to be monitored is finished.
In an embodiment of the present invention, sub-step S2013 may be performed by the first comparison sub-module 2053.
And a substep S2014 of generating the site family based on the first domain name and the second domain name. Specifically, when the first domain name is different from the second domain name, the site family is generated based on the first domain name and the second domain name.
In an embodiment of the present invention, sub-step S2014 may be performed by generating sub-module 2054.
Substep S2015, storing all the generated site families to obtain the database 300.
In an embodiment of the present invention, sub-step S2015 may be performed by storage sub-module 2055.
Step S202, updating the database 300. In reality, the domain name of the website to be monitored may be changed at irregular time, so that the database 300 needs to be updated continuously in order to continuously track the website to be monitored and obtain the website data of the website to be monitored when different domain names are used. Specifically, the site family stored in the database 300 is updated. In the present embodiment, the database 300 is updated at a predetermined time interval.
In the embodiment of the present invention, step S202 may be performed by the updating module 206. Referring to fig. 9, step S202 includes the following sub-steps:
and a substep S2021, obtaining the domain name to be updated. In this embodiment, the domain name to be updated is the domain name that needs to be monitored whether a jump will occur. Specifically, the domain name to be updated includes each domain name in the domain name list of the site family stored in the database 300 and the first domain name of the website to be monitored of the site family not stored in the database 300.
In an embodiment of the present invention, the substep S2021 may be performed by the obtaining submodule 2061.
Substep S2022, open the domain name to be updated, and load the website corresponding to the domain name to be updated.
In an embodiment of the present invention, sub-step S2022 may be performed by the second loading sub-module 2062.
And a substep S2023 of comparing the domain name to be updated with the third domain name after the website corresponding to the domain name to be updated is loaded.
In an embodiment of the present invention, the substep S2023 may be performed by the second comparing submodule 2063.
In the substep S2024, when the third domain name is different from the domain name to be updated, the database 300 is updated according to the domain name to be updated and the third domain name. Specifically, the step of updating the database 300 according to the domain name to be updated and the third domain name includes: and inquiring whether the domain name to be updated has the corresponding site family. And when the domain name to be updated has the corresponding site family, adding the domain name to be updated and a third domain name into the corresponding site family. When the corresponding site family does not exist in the domain name to be updated, a site family is generated according to the domain name to be updated and the third domain name, and is stored in the database 300.
When the third domain name is the same as the domain name to be updated, the database 300 is not updated according to the domain name to be updated.
In the present embodiment, sub-step S2024 may be performed by the update sub-module 2064. It should be noted that there is often more than one domain name to be updated. Therefore, after the database 300 is updated once or the database 300 is not updated according to the comparison result between the third domain name and the domain name to be updated according to the domain name to be updated, if there is a domain name to be updated that has not been processed in sub-step S2022 and sub-step S2023, the process proceeds to sub-step S2022 again, and the domain name to be updated that has not been processed in sub-step S2022 and sub-step S2023 is processed until all the domain names to be updated are processed. For example, there are 2 domain names to be updated, and when the database 300 is updated according to the first domain name to be updated or the database 300 is not updated according to the comparison result, but the second domain name to be updated is not yet processed, the process returns to substep S2022. And processing the second domain name to be updated, specifically, loading the second domain name to be updated and comparing the second domain name to be updated with a third domain name corresponding to the second domain name to be updated. And updates or does not update the database 300 once according to the comparison result. When the second domain name to be updated is completely processed and no domain name to be updated is yet to be processed, the database 300 is updated.
In summary, embodiments of the present invention provide a website index statistical method, an apparatus, and a server. The website index statistical method and the website index statistical device can be applied to a server, wherein the server comprises a database which stores at least one website family, the website family comprises a plurality of domain names which have skip relation with each other, and the domain names in the website family are domain names used by corresponding websites in different time periods. And the method queries a corresponding site family from the database according to the input domain name. And counting the website data of each domain name in the website family. The website data of each domain name represents website data generated by entering the corresponding website through each domain name. And obtaining the website index according to the website data of each domain name in the website family. The website indexes obtained by analyzing and counting based on the complete data of the website are more accurate. Therefore, continuous tracking analysis can be carried out on the website under the condition that the domain name of the website is frequently changed.
In the embodiments provided in the present application, it should be understood that the disclosed apparatus and method can be implemented in other ways. The apparatus embodiments described above are merely illustrative, and for example, the flowchart and block diagrams in the figures illustrate the architecture, functionality, and operation of possible implementations of apparatus, methods and computer program products according to various embodiments of the present invention. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code, which comprises one or more executable instructions for implementing the specified logical function(s). It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based systems which perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.
In addition, the functional modules in the embodiments of the present invention may be integrated together to form an independent part, or each module may exist separately, or two or more modules may be integrated to form an independent part.
The functions, if implemented in the form of software functional modules and sold or used as a stand-alone product, may be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present invention may be embodied in the form of a software product, which is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: a U-disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk or an optical disk, and other various media capable of storing program codes.
It is noted that, herein, relational terms such as first and second, and the like may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other identical elements in a process, method, article, or apparatus that comprises the element.
The above description is only a preferred embodiment of the present invention and is not intended to limit the present invention, and various modifications and changes may be made by those skilled in the art. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention should be included in the protection scope of the present invention. It should be noted that: like reference numbers and letters refer to like items in the following figures, and thus, once an item is defined in one figure, it need not be further defined and explained in subsequent figures.

Claims (15)

1. A website index statistical method is applied to a server, and is characterized in that the server comprises a database which stores at least one site family, wherein the site family comprises a plurality of domain names which have skip relation with each other, and the domain names are domain names used by one or more associated websites corresponding to the site family in different time periods, and the method comprises the following steps:
obtaining an input domain name;
inquiring a corresponding site family from the database according to the input domain name;
counting website data of each domain name in the website family;
and obtaining the website index according to the website data of each domain name in the website family.
2. A web site metrics statistical method as set forth in claim 1 further comprising building said database, said building said database comprising:
receiving a first domain name corresponding to an input website to be monitored;
loading the website to be monitored according to the first domain name;
comparing the first domain name with a corresponding second domain name after the website to be monitored is loaded;
when the first domain name is different from the second domain name, generating the site family based on the first domain name and the second domain name;
storing all generated site families to obtain the database.
3. A website index statistics method as defined in claim 2, further comprising updating the database, the step of updating the database comprising:
obtaining a domain name to be updated;
opening the domain name to be updated, and loading a website corresponding to the domain name to be updated;
comparing the domain name to be updated with the third domain name after the website corresponding to the domain name to be updated is loaded;
and when the third domain name is different from the domain name to be updated, updating the database according to the domain name to be updated and the third domain name.
4. The website index statistical method according to claim 3, wherein the step of updating the database according to the domain name to be updated and the third domain name comprises:
inquiring whether the domain name to be updated has the corresponding site family;
when the domain name to be updated has the corresponding site family, adding the domain name to be updated and a third domain name into the corresponding site family;
and when the corresponding site family does not exist in the domain name to be updated, generating the site family according to the domain name to be updated and the third domain name, and storing the site family in the database.
5. The website index statistical method according to claim 3 or 4, wherein the step of updating the database comprises:
and updating the database at a preset time interval.
6. The website index statistical method according to claim 1, wherein when the site family corresponding to the input domain name is not queried, the method further comprises:
counting website data of the input domain name;
and obtaining the website index according to the website data of the input domain name.
7. The website index statistical method according to claim 1, wherein the step of obtaining the website index according to the website data of each domain name in the site family comprises:
and superposing the website data of each domain name in the site family within a preset time period to obtain the website indexes to be counted within the preset time period.
8. A website index statistical device applied to a server, wherein the server comprises a database storing at least one site family, wherein the site family comprises a plurality of domain names having skip relation with each other, and the domain names are domain names used by one or more associated websites corresponding to the site family in different time periods, the device comprises:
an obtaining module for obtaining an input domain name;
the query module is used for querying the corresponding site family from the database according to the input domain name;
the statistical module is used for counting website data of each domain name in the site family;
and the acquisition module is used for acquiring the website indexes according to the website data of each domain name in the website family.
9. The website index statistics apparatus of claim 8, wherein the apparatus further comprises a building module for building the database, the building module comprises:
the receiving submodule is used for receiving an input first domain name corresponding to a website to be monitored;
the first loading sub-module is used for loading the website to be monitored according to the first domain name;
the first comparison sub-module is used for comparing the first domain name with a corresponding second domain name after the loading of the website to be monitored is finished;
a generation sub-module, configured to generate the site family based on the first domain name and the second domain name when the first domain name is different from the second domain name;
and the storage submodule is used for storing all the generated site families to obtain the database.
10. The website index statistics apparatus of claim 9, wherein the apparatus further comprises an update module for updating the database, the update module comprising:
the obtaining submodule is used for obtaining the domain name to be updated;
the second loading submodule is used for opening the domain name to be updated and loading the website corresponding to the domain name to be updated;
the second comparison sub-module is used for comparing the domain name to be updated with a third domain name after the website corresponding to the domain name to be updated is loaded;
and the updating sub-module is used for updating the database according to the domain name to be updated and the third domain name when the third domain name is different from the domain name to be updated.
11. The website index statistics device according to claim 10, wherein the updating sub-module performs a manner of updating the database according to the domain name to be updated and the third domain name, and includes:
inquiring whether the domain name to be updated has the corresponding site family;
when the domain name to be updated has the corresponding site family, adding the domain name to be updated and a third domain name into the corresponding site family;
and when the corresponding site family does not exist in the domain name to be updated, generating the site family according to the domain name to be updated and the third domain name, and storing the site family in the database.
12. The website index statistics apparatus according to claim 10 or 11, wherein the manner of updating the database performed by the updating sub-module further comprises:
and updating the database at a preset time interval.
13. The website index statistics device according to claim 8, wherein when the site family corresponding to the input domain name is not queried, the statistics module is further configured to count website data of the input domain name; and the acquisition module is also used for acquiring the website index according to the website data of the input domain name.
14. The website index statistical device according to claim 8, wherein the step of the obtaining module obtaining the website index according to the website data of each domain name in the website family comprises:
and superposing the website data of each domain name in the site family within a preset time period to obtain the website index within the preset time period.
15. A server, characterized in that the server comprises:
a memory;
a processor;
the method comprises the steps that a database storing at least one site family is stored, wherein the site family comprises a plurality of domain names which have jumping relations with each other; the domain name is a domain name used by one or more associated websites corresponding to the site family in different time periods,
and a website index statistical apparatus, the apparatus comprising:
an obtaining module for obtaining an input domain name;
the query module is used for querying the corresponding site family from the database according to the input domain name;
the statistical module is used for counting website data of each domain name in the site family;
and the acquisition module is used for acquiring the website indexes according to the website data of each domain name in the website family.
CN201710074037.2A 2017-02-10 2017-02-10 Website index statistical method and device and server Active CN108280103B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710074037.2A CN108280103B (en) 2017-02-10 2017-02-10 Website index statistical method and device and server

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710074037.2A CN108280103B (en) 2017-02-10 2017-02-10 Website index statistical method and device and server

Publications (2)

Publication Number Publication Date
CN108280103A CN108280103A (en) 2018-07-13
CN108280103B true CN108280103B (en) 2022-01-07

Family

ID=62801119

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710074037.2A Active CN108280103B (en) 2017-02-10 2017-02-10 Website index statistical method and device and server

Country Status (1)

Country Link
CN (1) CN108280103B (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2001319129A (en) * 2000-05-04 2001-11-16 Apex Interactive Inc System, method, and computer program product for improving search engine ranking of internet web site
CN101383713A (en) * 2007-09-06 2009-03-11 上海艾瑞市场咨询有限公司 Internet advertisement information processing method
CN103605737A (en) * 2013-11-19 2014-02-26 北京国双科技有限公司 Website analyzing method and website analyzing device
CN103929330A (en) * 2014-04-22 2014-07-16 中国科学院计算技术研究所 Domain name service quality evaluation method and system
CN104182542A (en) * 2014-09-04 2014-12-03 北京国双科技有限公司 Data processing method and device for website clusters
CN104410691A (en) * 2014-11-27 2015-03-11 网宿科技股份有限公司 Method and system for website acceleration on basis of content delivery network

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7962523B2 (en) * 2008-04-11 2011-06-14 Yahoo! Inc. System and method for detecting templates of a website using hyperlink analysis

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2001319129A (en) * 2000-05-04 2001-11-16 Apex Interactive Inc System, method, and computer program product for improving search engine ranking of internet web site
CN101383713A (en) * 2007-09-06 2009-03-11 上海艾瑞市场咨询有限公司 Internet advertisement information processing method
CN103605737A (en) * 2013-11-19 2014-02-26 北京国双科技有限公司 Website analyzing method and website analyzing device
CN103929330A (en) * 2014-04-22 2014-07-16 中国科学院计算技术研究所 Domain name service quality evaluation method and system
CN104182542A (en) * 2014-09-04 2014-12-03 北京国双科技有限公司 Data processing method and device for website clusters
CN104410691A (en) * 2014-11-27 2015-03-11 网宿科技股份有限公司 Method and system for website acceleration on basis of content delivery network

Also Published As

Publication number Publication date
CN108280103A (en) 2018-07-13

Similar Documents

Publication Publication Date Title
US8370380B1 (en) Method and system for displaying real time trends
US10229160B2 (en) Search results based on a search history
CN112241506B (en) User behavior backtracking method, device, equipment and system
CN110503435B (en) Transaction early warning method, device and equipment based on blockchain
CN105989122A (en) Application recommendation method and device
US20130347127A1 (en) Database management by analyzing usage of database fields
US11314553B2 (en) Method, apparatus, and computer program product for determining usage change rate of storage system
CN113254472B (en) Parameter configuration method, device, equipment and readable storage medium
CN108197338B (en) Browser bookmark generation method and system and terminal device
CN112925783A (en) Service data processing method and device, electronic equipment and storage medium
CN113420211A (en) Package recommendation method and device and electronic equipment
WO2014187157A1 (en) Methods and systems for determining user online time
CN109240893B (en) Application running state query method and terminal equipment
CN111046081A (en) Access method and system for industrial time sequence data
CN108280103B (en) Website index statistical method and device and server
CN111835536A (en) Flow prediction method and device
CN109558403B (en) Data aggregation method and device, computer device and computer readable storage medium
CN108491315A (en) The page is resident statistical method, device and the computer readable storage medium of duration
CN111563094A (en) Data query method and device, electronic equipment and computer-readable storage medium
CN104391956B (en) The detection method and device of network upgrade content
CN113064801B (en) Data point burying method and device, readable storage medium and computer equipment
CN114860362A (en) Interface updating method and device
CN112966029A (en) Information display and sending method, device, equipment and readable medium
CN110851346B (en) Query statement boundary problem detection method, device, equipment and storage medium
CN110929177B (en) Electronic resource recommendation method, device and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right
TA01 Transfer of patent application right

Effective date of registration: 20200525

Address after: 310052 room 508, floor 5, building 4, No. 699, Wangshang Road, Changhe street, Binjiang District, Hangzhou City, Zhejiang Province

Applicant after: Alibaba (China) Co.,Ltd.

Address before: 510000 Guangdong city of Guangzhou province Whampoa Tianhe District Road No. 163 Xiping Yun Lu Yun Ping B radio square 14 storey tower

Applicant before: GUANGZHOU UCWEB COMPUTER TECHNOLOGY Co.,Ltd.

GR01 Patent grant
GR01 Patent grant