CN111010456B - Main domain name acquisition and verification method - Google Patents

Main domain name acquisition and verification method Download PDF

Info

Publication number
CN111010456B
CN111010456B CN201911153077.1A CN201911153077A CN111010456B CN 111010456 B CN111010456 B CN 111010456B CN 201911153077 A CN201911153077 A CN 201911153077A CN 111010456 B CN111010456 B CN 111010456B
Authority
CN
China
Prior art keywords
domain name
main
seed
main domain
target unit
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201911153077.1A
Other languages
Chinese (zh)
Other versions
CN111010456A (en
Inventor
张兆心
侯锐杰
郭长勇
陈若璇
刘英豪
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Harbin Institute of Technology Weihai
Original Assignee
Harbin Institute of Technology Weihai
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Harbin Institute of Technology Weihai filed Critical Harbin Institute of Technology Weihai
Priority to CN201911153077.1A priority Critical patent/CN111010456B/en
Publication of CN111010456A publication Critical patent/CN111010456A/en
Application granted granted Critical
Publication of CN111010456B publication Critical patent/CN111010456B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L61/00Network arrangements, protocols or services for addressing or naming
    • H04L61/45Network directories; Name-to-address mapping
    • H04L61/4505Network directories; Name-to-address mapping using standardised directories; using standardised directory access protocols
    • H04L61/4511Network directories; Name-to-address mapping using standardised directories; using standardised directory access protocols using domain name system [DNS]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L63/00Network architectures or network communication protocols for network security
    • H04L63/08Network architectures or network communication protocols for network security for authentication of entities
    • H04L63/0876Network architectures or network communication protocols for network security for authentication of entities based on the identity of the terminal or configuration, e.g. MAC address, hardware or software configuration or device fingerprint

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Power Engineering (AREA)
  • Computer Hardware Design (AREA)
  • Computer Security & Cryptography (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention relates to a main domain name acquisition and verification method, the purpose of the invention lies in solving the problem that the main domain name under the unit name is collected as completely as possible according to the basic information of the target unit, the invention has proposed the main domain name of the target unit based on the seed main domain name of the target unit acquired by the third-party website to find, expand, verify technology and method, the invention mainly constructs the domain name seed through the acquisition of the third-party website domain name; based on the domain name seeds, expanding the main domain name by combining the domain name prefix and the universal domain name suffix and extracting the domain name through the link in the page acquired by the accessible website of the target unit; and verifying the extended domain name by inquiring whois information and DNS information, and judging whether the extended domain name belongs to a target unit.

Description

Main domain name acquisition and verification method
Technical Field
The invention relates to the field of computers, in particular to a main domain name acquisition and verification method.
Background
Since the IP address is a pure number, it is difficult to remember, so the IP address is generally replaced by a domain name on the Internet. The domain name is composed of a plurality of English letters, numbers or minus signs, and is divided into a plurality of parts by decimal points. The domain name system adopts a hierarchical structure, and is layered according to a geographic domain or an organizational domain, each hierarchy is separated by decimal points, a highest domain name section, a second highest domain name section and the like are sequentially arranged from right to left, and the leftmost field is a main domain name. The main domain name, called domain name and network domain for short, is the name of a certain computer or computer group on the Internet composed of a string of names separated by points, and is used for identifying the electronic orientation of the computer during data transmission. However, in the prior art, when searching for the domain name under the unit name according to the basic information of the target unit, the searching is often limited, and the complete main domain name cannot be completely collected.
Disclosure of Invention
The invention aims to solve the problem of comprehensively collecting the main domain name under the unit name as much as possible according to the basic information of a target unit, and provides a target unit main domain name discovery, expansion and verification technology and method based on the seed main domain name of the target unit acquired by a third-party website.
The invention provides a main domain name acquisition and verification method, which comprises the following steps:
step 1: establishing a main domain name seed of a target unit through a third-party website;
step 2: establishing a domain name seed set by acquiring a secondary domain name of a main domain name seed, wherein the main domain name seed set and the secondary domain name seed set form a domain name seed set;
and step 3: acquiring whois information of a domain name seed, and constructing a whois information white list including registration information;
and 4, step 4: acquiring DNS records of domain name seeds, and constructing a DNS information white list; the CNAME record is added to the main domain name seed as a newly discovered main domain name; adding a source host and a contact mailbox in the SOA record into a white list;
and 5: obtaining an HTML document by accessing a domain name website based on a domain name seed, obtaining a webpage link in the accessible website, extracting a domain name in the link, further obtaining a main domain name set to be verified, and expanding the main domain name; if the obtained main domain name set to be verified is not empty, executing the step 6, otherwise, executing the step 9;
step 6: acquiring whois information of the domain name in the main domain name set to be verified in the step 5, comparing the registration information of the domain name to be verified with the information in the domain name whois information white list established in the step 3, and verifying whether the domain name to be verified belongs to a target unit; if the verification is successful, the domain name is the main domain name of the target unit, the domain name is added into a main domain name seed set, and the step 9 is executed; otherwise, executing step 7;
and 7: obtaining DNS information of domain names in the main domain name set to be verified in the step 5, and verifying whether the domain names belong to the target unit by judging whether the CNAME records are domain names in the domain name seeds of the target unit or not, whether suffixes of the NS and MX record host names are domain names in the main domain name seeds of the target unit or not, and comparing the SOA records with the DNS information white list established in the step 4; if the verification is successful, the domain name is the main domain name of the target unit, the domain name is added into a main domain name seed set, and the step 9 is executed; otherwise, executing step 8;
and 8: acquiring the filing information of the main domain name to be verified in the step 5, and verifying whether the main domain name is the main domain name of the target unit; if the verification is successful, executing step 9; if the verification fails, the main domain name, the domain name and the link are put in a warehouse for future reference, and the step 9 is executed;
and step 9: constructing a domain name prefix dictionary P, and executing the step 10;
step 10: constructing a domain name suffix dictionary L, and executing the step 11;
step 11: constructing a main domain name to be detected, and creating a new main domain name through a domain name prefix dictionary P and a domain name suffix dictionary L; the new main domain name forms a main domain name set DS1 to be detected;
step 12: performing DNS query on the seed domain name to obtain a corresponding IP set, requesting to obtain a corresponding certificate for a 443 port of each IP, and obtaining a main domain name set DS2 to be tested through a domain name served by the certificate; executing step 13;
step 13: the main domain name set to be tested DS = DS1+ DS2, the domain name in the main domain name set to be tested is verified, and the step 14 is executed;
step 14: acquiring the whois information of the domain name in the main domain name set to be verified in the step 13, comparing the registration information of the domain name to be verified with the information in the domain name whois information white list established in the step 3, and verifying whether the domain name belongs to a target unit; if the verification is successful, the domain name is the main domain name of the target unit, the domain name is added into the main domain name seed set, and the step 16 is executed on the newly added domain name; otherwise, executing step 15;
step 15: obtaining DNS information of domain names in the main domain name set to be verified in the step 13, if the CNAME is the domain name in the main domain name seed, or suffixes of host names recorded by the NS and the MX are the domain names in the main domain name seed, or the SOA record can be matched with the DNS information white list established in the step 4, considering the domain names as the main domain names of the target unit, and adding the domain names into the main domain name seed set; step 16 is executed;
step 16, if the new main domain name exists in the main domain name seed set, executing step 2; otherwise, the extension process ends.
Preferably, the domain name prefix dictionary P = P1+ P2, includes:
i. the prefix dictionary P1 is used for extracting a main domain name seed prefix;
ii. The prefix dictionary P2 is for collecting the full-simplified Chinese Pinyin, traditional Chinese and English writing method of the target unit.
Preferably, the domain name suffix dictionary L = L1+ L2+ L3, comprising:
a. constructing a top level domain dictionary L1 through a top level domain issued by IANA;
b. constructing a domain name suffix dictionary L2 from the seed domain name set;
c. the secondary domain constructs a domain name suffix dictionary L3.
Preferably, the registration information includes registrants, registration phones, registration mailboxes, and registration organization names.
The invention has the beneficial effects that: firstly, acquiring a main domain name seed through a third-party website, secondly, realizing domain name expansion through a page extraction link acquired by an accessible website of a target unit, a combination of a domain name prefix and a universal domain name suffix and the like on the basis of the main domain name seed, and then finishing accurate judgment on the affiliated relationship of the expanded main domain name and the target unit through whois information comparison verification, DNS information analysis verification and the like; the invention completes the discovery, the expansion and the verification of the main domain name of the target unit through various ways, provides guarantee for researching the domain name assets of the target unit, realizes automation by utilizing a computing technology, improves the efficiency and the feasibility of obtaining the main domain name of the target unit, and saves human resources.
Drawings
FIG. 1 is a diagrammatic illustration of a system workflow of the practice of the present invention;
fig. 2 is a system architecture diagram of the present invention.
Description of the symbols in the drawings
1. A database; 2. a domain name expansion submodule; 21. acquiring a domain name module based on the internet basic resource relation; 22. acquiring a domain name sub-module based on the webpage link; 23. a main domain name construction sub-module; 3. a domain name verification sub-module; whois information validation; DNS information verification; 4, a DNS information acquisition submodule; 5, a whois information acquisition submodule; 6. and a secondary domain name acquisition subsystem.
Detailed Description
The present invention is further described below with reference to the drawings and examples so that those skilled in the art can easily practice the present invention.
As shown in FIG. 1, when the system of the present invention works, the name of the target unit is inputted after the system starts, and the main domain name seed of the target unit is established through a third-party website such as an icp record, a sitter's home, a search engine, etc. And establishing a secondary domain name seed set by a secondary domain name module for acquiring the main domain name seed, and establishing the domain name seed set. And constructing a white list of the whois information through the whois information of the domain name seeds, wherein the white list comprises registrants, registered telephones, registered mailboxes and registered organization names. And constructing a DNS white list through DNS information of the domain name seed. And accessing a website corresponding to the domain name, acquiring a link of the website, extracting the domain name in the link, and further acquiring the main domain name to be verified. And judging whether the domain name belongs to a target unit or not by the whois information, and if so, adding the domain name to a newly added main domain name seed set. And if not, judging the relation of the record information verification target unit, wherein the record information comprises ICP record and public security record. Judging the relation of the record information verification target unit, if the record information verification target unit is unsuccessful, warehousing the unconfirmed domain name and the related link for being checked, and then constructing a prefix dictionary P; and if the prefix dictionary is successful, directly constructing a prefix dictionary P.
Prefix dictionary P = P1+ P2, including the following aspects:
i. extracting a main domain name seed prefix as a prefix dictionary P1;
ii. The full simplified Chinese Pinyin, traditional Chinese and English writing method of the target unit is collected as the domain name prefix dictionary P2.
After completing the construction of the prefix dictionary P, a suffix dictionary L = L1+ L2+ L3 is constructed, which includes
a. A top level domain dictionary L1 is constructed in a top level domain of https:// www.iana.org/domains/root/db/published by an IANA (The Internet Assigned number Authority);
b. constructing a domain name suffix dictionary L2 from the seed domain name set;
c. the secondary domain constructs a domain name suffix dictionary L3; for example, com.cn,. net.cn,. gov.cn,
Cn, etc.
The method comprises the steps of establishing a new main Domain name through a Domain name prefix dictionary P and a Domain name suffix dictionary L established by the Domain names, establishing a main Domain name Set DS1 to be tested by the new main Domain name, obtaining an IP Set corresponding to the Domain name Set through Domain name resolution, obtaining a Domain name Set CDS (namely a Certificate Domain Set) of Certificate service by sending a request to a 443 port of an IP, obtaining a corresponding main Domain name Set DS2 for the Domain name Set in the CDS, verifying each Domain name in the Domain name Set to be tested, and obtaining a Domain name Set DS = DS1+ DS2 to be tested. When the domain name to be detected is verified, whois information judges whether the domain name belongs to a target unit, if so, the whois information is added to a newly added main domain name seed set; if not, judging whether DNS information verification belongs to a target unit, if the DNS information verification is passed, adding the new main domain name seed set, and if the DNS information verification is not passed, judging whether the new main domain name set is empty. Meanwhile, judging whether the newly added main domain name seed set is empty, if the newly added main domain name seed set is empty, ending the system; and if not, adding the newly added main domain name set into the main domain name seed, constructing a secondary domain name set through the secondary domain name module again, and repeating the operation.
As shown in fig. 2, the system architecture diagram of the present invention includes a database 1, a domain name extension sub-module 2, a domain name verification sub-module 3, a DNS information acquisition sub-module 4, a whois information acquisition sub-module 5, and a secondary domain name acquisition sub-system 6. The database 1 comprises a main domain name seed, a DNS record library, a whois registration information white list, a whois registration information black list, a target unit information dictionary, a domain name prefix dictionary, a domain name suffix dictionary, a top level domain dictionary, a second level domain dictionary and a domain name black list. The domain name expansion sub-module 2 comprises a domain name obtaining module 21 based on the internet basic resource relation, a domain name obtaining sub-module 22 based on the web page link and a main domain name constructing sub-module 23; the domain name verification submodule 3 includes whois information verification 31 and DNS information verification 32.
The domain name obtaining module 21 based on the internet basic resource relationship includes obtaining an IP set of a main domain name, obtaining a certificate by requesting a 443 port of the IP, and further obtaining a certificate service domain name set as a main domain name set to be tested. The domain name obtaining sub-module 22 includes obtaining HTML documents of a domain name website, extracting a web link, extracting a main domain name, and further constructing a main domain name set to be tested. The main domain name constructing sub-module 23 constructs a main domain name set to be tested by combining a prefix, a suffix and a prefix and suffix.
The whois information validator 31 includes registrant validator, registered telephone validator, registered mailbox validator, and registered organization validator. The DNS information validation 32 comprises CNAME record validation, NS record validation, MX record validation and source host and mailbox validation in SOA; CNAME is whether the direct verification result is a main domain name or not, MX and NS record verification is whether the result suffix is a main domain name or not, source host verification in SOA record is actually verification of NS, and contact mailbox verification in SOA record is verified according to a history white list.
Example (b):
taking the business bank behavior example:
step 1: inquiring a Chinese industrial and commercial Bank limited company in an ICP filing website to obtain a main domain name seed, performing whois reverse check on the obtained main domain name seed in a station leader to expand the main domain name seed, and storing invalid information in a reverse check option in a blacklist.
Step 2: and solving a secondary domain name from the main domain name seed through a secondary domain name acquisition system, wherein the main domain name seed set and the secondary domain name seed set form a domain name seed set.
And step 3: and acquiring a white list of whois information of the domain name seeds. The white list was constructed as follows:
1. constructing registration information exposed in whois information of the seed domain name, wherein the registration information comprises a registrant, a registration mailbox, a registration telephone and a registration organization name;
2. the method is constructed by the full name, short name, English, pinyin and traditional Chinese of a target unit.
Therefore, there are industry and Commercial Bank of China, and China Industrial Bank of China, etc. in the whois white list that we build for the industry registrars, and the registration mailboxes contain the strings of icbc.
And 4, step 4: and acquiring a white list of DNS records of the domain name seed, wherein the white list comprises information such as SOA (service oriented architecture) and CNAME (network access management) records of the domain name. And E, putting the mailbox and the DNS in the SOA record into a white list, and putting the CNAME record into a domain name seed set.
And 5: and acquiring HTML (hypertext markup language) texts of websites corresponding to the known domain names of the workshops by using a selenium automation test tool based on the domain name seeds of the workshops. And extracting all webpage links from the HTML, acquiring the domain name from the webpage links and further acquiring the main domain name. If a new primary domain name is obtained, step 6 is performed, otherwise step 9 is performed.
Step 6: obtaining whois information of a domain name needing to be verified, verifying whether a main domain name exists or not and whether the main domain name is the main domain name of an industrial and commercial bank or not, wherein the verification process comprises the following steps:
1. and verifying whether the main domain name is registered or not, and inquiring whether registration time information exists in whois information of the domain name or not. If so, the domain name is registered.
2. Whether the domain name is the main domain name of the Chinese Industrial and commercial Bank, Inc. can be verified by the following three methods:
1) and judging through a registered mailbox in the whois information. If the registered mailbox suffix is the domain name of the Chinese Industrial and commercial Bank Ltd, the mail server assets of the Chinese Industrial and commercial Bank Ltd in use are proved to be the main domain name of the Chinese Industrial and commercial Bank Ltd. The authentication is successful, and the domain name is marked as the main domain name of the Chinese industrial and commercial Bank of China, and is added into the main domain name seed set of the industry;
2) authentication is performed by a registered phone in the whois message. The certification is successful, the certification is marked as the main domain name of the Chinese Industrial and commercial Bank resources company, the domain name is added into the main domain name seed set of the industry, and the white list sources of the registered telephones are as follows:
i. constructing a white list of registered telephones according to whois information of a main domain name seed set;
ii. And the telephone white list is constructed by the contact telephone information extracted from the webpage content of the seed domain name set.
3) Authentication is performed by registrants and registrars in the whois registry. If the verification is successful, the domain name is marked as the main domain name of the Chinese industrial and commercial Bank of China, and the domain name is added into the main domain name seed set of the industry. The specific process is as follows:
i. and verifying the registrant. Whether the registrant field contains: the complete name, short name, traditional Chinese, Chinese pinyin and English of the work bank and the name information of the registrant obtained by the query of the work bank seed domain name whois;
ii. And registering and organizing verification. Whether the registration organization field contains: the full name, simplified body, traditional body, Chinese phonetic alphabet and English of the industry and the trade.
And 7: acquiring a DNS record of a domain name needing to be verified, and verifying through the following four aspects:
1. acquiring a CNAME record of a domain name to be detected, and if the CNAME record is the domain name of a target unit, considering the domain name as a main domain name of a Chinese industrial and commercial Bank limited company;
2. acquiring NS records of domain names to be detected, and if the acquired host name suffix of the authoritative DNS is a main domain name in a main domain name seed set of a business bank, considering the domain name as the main domain name of the Chinese industrial and commercial bank corporation;
3. acquiring MX records of a domain name to be detected, and if the acquired host name suffix of the mailbox server is a main domain name in a main domain name seed set of a business bank, considering the domain name as the main domain name of a Chinese industrial and commercial banking company;
4. acquiring SOA records of a domain name to be detected, checking whether a mailbox and a DNS are in a white list, if so, proving that a host and an administrator are workers of a worker, and considering that the domain name is a main domain name of a Chinese industrial and commercial Bank limited company;
if one of the four aspects is met, the verification is successful, the domain name is added into the main domain name seed, and the step 9 is executed; otherwise, executing step 8, and verifying through the filing information.
And 8: and acquiring the filing information (ICP filing and public security filing) of the main domain name to be verified in the step 5, and verifying whether the main domain name is the main domain name of the target unit. If the verification is successful, executing step 9; and (5) if the verification fails, storing the main domain name, the domain name and the link for future reference, and executing the step 9.
And step 9: a prefix dictionary P is constructed. The prefix dictionary P includes the following aspects:
i. extracting a main domain name seed prefix as a prefix dictionary P1, e.g., an icbc join prefix dictionary in icbc.
ii. Collecting the simplified Chinese Pinyin, traditional Chinese AND English writing OF Chinese Industrial AND COMMERCIAL Bank, Inc., as the domain name prefix dictionary P2, such as ICBC, industry AND COMMERCIAL Bank, INDUSTRIAL AND COMMERCIAL BANK OF CHINA, INDUSTRIAL AND COMMERCIAL Bank OF China, LTD, INDUSTRIAL AND COMMERCIAL Bank OF China Co., Ltd;
iii, prefix dictionary P = P1+ P2.
Step 10: a suffix dictionary L is constructed. The suffix dictionary L includes the following aspects.
a. The top level domain serves as a suffix dictionary. A top level domain dictionary L1 is constructed by a top level domain published by IANA (The Internet Assigned Numbers Authority) at The website https:// www.iana.org/domains/root/db/which comprises 1581 suffixes in total;
b. constructing a domain name suffix dictionary L2, such as the suffix of icbc.com.cn, from the seed domain name set;
c. the secondary domain constructs a domain name suffix dictionary L3;
.com.cn、.net.cn、.gov.cn、.org.cn
d. then L = L1+ L2+ L3;
step 11: and constructing the main domain name. By the domain name prefix dictionary P and domain name suffix dictionary L constructed by the domain name, a new main domain name can be created, for example, the known icbc.com.cn is the domain name of the Chinese industrial and commercial Bank corporation, and the prefix of the icbc in the domain name is extracted and combined with any suffix such as asia to create the new main domain name icbc. Creating a new main domain name to form a main domain name set DS1 to be detected;
step 12: performing DNS query on a seed domain name of a Chinese industrial and commercial Bank (GmbH) company to obtain a corresponding IP set, requesting to obtain a corresponding certificate for a 443 port of each IP, and obtaining a main domain name set (DS 2) to be detected through a domain name served by the certificate; step 13 is performed.
Step 13: judging whether the domain name in the main domain name set S = DS1+ DS2 to be tested belongs to a target unit through whois verification and DNS verification, wherein the verification process is similar to the steps 6 and 7, and adding a main domain name seed set after the verification is successful; and if the verification fails, discarding the card. Step 14 is performed.
Step 14: acquiring the whois information of the domain name in the main domain name set to be verified in the step 13, comparing the registration information of the domain name to be verified with the information in the domain name whois information white list established in the step 3, and verifying whether the domain name belongs to a target unit; if the verification is successful, the domain name is the main domain name of the target unit, the domain name is added into the main domain name seed set, and the step 16 is executed on the newly added domain name; otherwise, executing step 15;
step 15: obtaining DNS information of domain names in the main domain name set to be verified in the step 13, if the CNAME is the domain name in the main domain name seed, or suffixes of host names recorded by the NS and the MX are the domain names in the main domain name seed, or the SOA record can be matched with the DNS information white list established in the step 4, considering the domain names as the main domain names of the target unit, and adding the domain names into the main domain name seed set; step 16 is executed;
step 16, if the new main domain name exists in the main domain name seed set, executing step 2; otherwise, the extension process ends.
The above description is only for the purpose of illustrating preferred embodiments of the present invention and is not to be construed as limiting the present invention, and it is apparent to those skilled in the art that various modifications and variations can be made in the present invention. All changes, equivalents, modifications and the like which come within the scope of the invention as defined by the appended claims are intended to be embraced therein.

Claims (4)

1. A main domain name obtaining and verifying method is characterized by comprising the following steps:
step 1: establishing a main domain name seed of a target unit through a third-party website;
step 2: establishing a domain name seed set by acquiring a secondary domain name of a main domain name seed, wherein the main domain name seed set and the secondary domain name seed set form a domain name seed set;
and step 3: acquiring whois information of a domain name seed, and constructing a whois information white list including registration information;
and 4, step 4: obtaining DNS records of domain name seeds, and constructing a DNS information white list: adding a source host and a contact mailbox in the SOA record into a white list; the CNAME record is added to the main domain name seed as a newly discovered main domain name;
and 5: obtaining an HTML document by accessing a domain name website based on a domain name seed, obtaining a webpage link in the accessible website, extracting a domain name in the link, further obtaining a main domain name set to be verified, and expanding the main domain name; if the obtained main domain name set to be verified is not empty, executing the step 6, otherwise, executing the step 9;
step 6: acquiring whois information of the domain name in the main domain name set to be verified in the step 5, comparing the registration information of the domain name to be verified with the information in the domain name whois information white list established in the step 3, and verifying whether the domain name to be verified belongs to a target unit; if the verification is successful, the domain name is the main domain name of the target unit, the domain name is added into a main domain name seed set, and the step 9 is executed; otherwise, executing step 7;
and 7: obtaining DNS information of domain names in the main domain name set to be verified in the step 5, and verifying whether the domain names belong to the target unit by judging whether the CNAME records are domain names in the domain name seeds of the target unit or not, whether suffixes of the NS and MX record host names are domain names in the main domain name seeds of the target unit or not, and comparing the SOA records with the DNS information white list established in the step 4; if the verification is successful, the domain name is the main domain name of the target unit, the domain name is added into a main domain name seed set, and the step 9 is executed; otherwise, executing step 8;
and 8: acquiring the filing information of the main domain name to be verified in the step 5, and verifying whether the main domain name is the main domain name of the target unit; if the verification is successful, executing step 9; if the verification fails, the main domain name, the domain name and the link are put in a warehouse for future reference, and the step 9 is executed;
and step 9: constructing a domain name prefix dictionary P, and executing the step 10;
step 10: constructing a domain name suffix dictionary L, and executing the step 11;
step 11: constructing a main domain name to be detected, and creating a new main domain name through a domain name prefix dictionary P and a domain name suffix dictionary L; the new main domain name forms a main domain name set DS1 to be detected;
step 12: performing DNS query on the seed domain name to obtain a corresponding IP set, requesting to obtain a corresponding certificate for a 443 port of each IP, and obtaining a main domain name set DS2 to be tested through a domain name served by the certificate; executing step 13;
step 13: the main domain name set to be tested DS = DS1+ DS2, the domain name in the main domain name set to be tested is verified, and the step 14 is executed;
step 14: acquiring the whois information of the domain name in the main domain name set to be verified in the step 13, comparing the registration information of the domain name to be verified with the information in the domain name whois information white list established in the step 3, and verifying whether the domain name belongs to a target unit; if the verification is successful, the domain name is the main domain name of the target unit, the domain name is added into the main domain name seed set, and the step 16 is executed on the newly added domain name; otherwise, executing step 15;
step 15: obtaining DNS information of domain names in the main domain name set to be verified in the step 13, if the CNAME is the domain name in the main domain name seed, or suffixes of host names recorded by the NS and the MX are the domain names in the main domain name seed, or the SOA record can be matched with the DNS information white list established in the step 4, considering the domain names as the main domain names of the target unit, and adding the domain names into the main domain name seed set; step 16 is executed;
step 16, if the new main domain name exists in the main domain name seed set, executing step 2; otherwise, the extension process ends.
2. The master domain name acquisition and verification method according to claim 1, wherein the prefix dictionary P = P1+ P2, comprising:
i. the prefix dictionary P1 is used for extracting a main domain name seed prefix;
ii. The prefix dictionary P2 is for collecting the full-simplified Chinese Pinyin, traditional Chinese and English writing method of the target unit.
3. The main domain name acquisition and verification method according to claim 2, wherein the suffix dictionary L = L1+ L2+ L3, comprising:
a. constructing a top level domain dictionary L1 through a top level domain issued by IANA;
b. constructing a domain name suffix dictionary L2 from the seed domain name set;
c. the secondary domain constructs a domain name suffix dictionary L3.
4. The primary domain name acquisition and validation method according to claim 3, wherein the registration information includes registrants, registered phones, registered mailboxes, and registered organization names.
CN201911153077.1A 2019-11-22 2019-11-22 Main domain name acquisition and verification method Active CN111010456B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911153077.1A CN111010456B (en) 2019-11-22 2019-11-22 Main domain name acquisition and verification method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911153077.1A CN111010456B (en) 2019-11-22 2019-11-22 Main domain name acquisition and verification method

Publications (2)

Publication Number Publication Date
CN111010456A CN111010456A (en) 2020-04-14
CN111010456B true CN111010456B (en) 2022-03-11

Family

ID=70113810

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911153077.1A Active CN111010456B (en) 2019-11-22 2019-11-22 Main domain name acquisition and verification method

Country Status (1)

Country Link
CN (1) CN111010456B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111885220B (en) * 2020-07-30 2023-04-07 哈尔滨工业大学(威海) Active acquisition and verification method for target unit IP assets
CN114629814A (en) * 2022-02-10 2022-06-14 互联网域名系统北京市工程研究中心有限公司 Website analysis method and device
CN116980234B (en) * 2023-09-25 2024-01-05 北京源堡科技有限公司 Domain name imitation detection method and system

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102843445A (en) * 2012-09-29 2012-12-26 北京奇虎科技有限公司 Browser and domain name resolution method thereof
CN102999514A (en) * 2011-09-14 2013-03-27 百度在线网络技术(北京)有限公司 Method, device and equipment for obtaining webpage and link prefix information thereof
CN103037029A (en) * 2012-12-10 2013-04-10 中国科学院计算机网络信息中心 Binding registration based domain name WHOIS query method and binding registration based domain name WHOIS service system
CN103618742A (en) * 2013-12-09 2014-03-05 北京奇虎科技有限公司 Method and system for acquiring sub domain names and webmaster permission verification method
CN104363309A (en) * 2012-06-28 2015-02-18 北京奇虎科技有限公司 Extensive domain name recognizing and processing devices and extensive domain name recognizing and processing methods

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102999514A (en) * 2011-09-14 2013-03-27 百度在线网络技术(北京)有限公司 Method, device and equipment for obtaining webpage and link prefix information thereof
CN104363309A (en) * 2012-06-28 2015-02-18 北京奇虎科技有限公司 Extensive domain name recognizing and processing devices and extensive domain name recognizing and processing methods
CN102843445A (en) * 2012-09-29 2012-12-26 北京奇虎科技有限公司 Browser and domain name resolution method thereof
CN103037029A (en) * 2012-12-10 2013-04-10 中国科学院计算机网络信息中心 Binding registration based domain name WHOIS query method and binding registration based domain name WHOIS service system
CN103618742A (en) * 2013-12-09 2014-03-05 北京奇虎科技有限公司 Method and system for acquiring sub domain names and webmaster permission verification method

Also Published As

Publication number Publication date
CN111010456A (en) 2020-04-14

Similar Documents

Publication Publication Date Title
CN111010456B (en) Main domain name acquisition and verification method
CN109241292B (en) Method for establishing domain name server system knowledge graph based on active and passive data
EP2446381B1 (en) Characterizing unregistered domain names
CN108574742B (en) Domain name information collection method and domain name information collection device
CN103685606A (en) Associated domain name acquisition method, associated domain name acquisition system and web administrator permission validation method
EP2860946A2 (en) Characterization of domain names based on changes of authoritative name servers
CN111405079A (en) Domain name resolution method and device, storage medium and electronic equipment
CN108737589B (en) Method for portraying domain name based on geographic information
CN106776657B (en) Domain name retrieval method and device
CN114978740B (en) Identification association and verification analysis method based on block chain
JP5588845B2 (en) Unauthorized domain detection device, unauthorized domain detection method, and program
CN105227599A (en) The recognition methods of Web application and device
CN115794780A (en) Method and device for collecting network space assets, electronic equipment and storage medium
CN115934058A (en) Method for rapidly acquiring domain name and extracting value
CN115460171A (en) Data processing method and device, electronic equipment and storage medium
CN112261134B (en) Network data access auditing method, device, equipment and storage medium
CN111885220B (en) Active acquisition and verification method for target unit IP assets
CN114765599B (en) Subdomain name acquisition method and device
CN105245626B (en) The method for realizing website addressing using quick domain name in private network
CN104363309A (en) Extensive domain name recognizing and processing devices and extensive domain name recognizing and processing methods
CN111814643A (en) Black and gray URL (Uniform resource locator) identification method and device, electronic equipment and medium
CN106686050B (en) Domain name based classification information issuing method and system
RU2002111687A (en) METHOD AND SYSTEM OF EXECUTION OF SERVICES FOR REGISTRATION OF DOMAIN NAMES USING NATIONAL LANGUAGES IN NON-ENGLISH COUNTRIES
KR101647278B1 (en) JSON Transformation of DNS Packet and Top Extraction Method
CN113065078B (en) Statistical analysis method for simulating user behavior to dial and test multistage domain names of WEB sites

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant