WO2018072363A1 - Procédé et dispositif d'extension d'une source de données - Google Patents

Procédé et dispositif d'extension d'une source de données Download PDF

Info

Publication number
WO2018072363A1
WO2018072363A1 PCT/CN2017/073611 CN2017073611W WO2018072363A1 WO 2018072363 A1 WO2018072363 A1 WO 2018072363A1 CN 2017073611 W CN2017073611 W CN 2017073611W WO 2018072363 A1 WO2018072363 A1 WO 2018072363A1
Authority
WO
WIPO (PCT)
Prior art keywords
resource locator
uniform resource
locator data
data
character
Prior art date
Application number
PCT/CN2017/073611
Other languages
English (en)
Chinese (zh)
Inventor
李晓东
李雪妮
耿光刚
陈勇
Original Assignee
中国互联网络信息中心
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中国互联网络信息中心 filed Critical 中国互联网络信息中心
Publication of WO2018072363A1 publication Critical patent/WO2018072363A1/fr

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/955Retrieval from the web using information identifiers, e.g. uniform resource locators [URL]
    • G06F16/9566URL specific, e.g. using aliases, detecting broken or misspelled links

Abstract

L'invention concerne un procédé et un dispositif d'extension d'une source de données, le procédé consistant : à obtenir des modèles de localisateur uniforme de ressources sur la base de toutes les données de localisateur uniforme de ressources (URL) connues et à étendre les modèles de localisateur uniforme de ressources afin d'obtenir des données de localisateur uniforme de ressources qui peuvent être considérées comme un site Web d'hameçonnage correspondant à chaque modèle de localisateur uniforme de ressources, ce qui permet d'obtenir l'acquisition automatique et active du site Web d'hameçonnage, de réduire de manière efficace les problèmes d'hystérésis de la découverte du site Web d'hameçonnage et de la dépendance humaine. Selon la manière susmentionnée, il est possible d'étendre la plage de détection, de réduire la perte d'intérêt et d'exécuter une extension sur la base des données de localisateur uniforme de ressources d'un site Web d'hameçonnage connu, ce qui permet d'améliorer le rapport de l'utilisation secondaire du site Web d'hameçonnage connu.
PCT/CN2017/073611 2016-10-19 2017-02-15 Procédé et dispositif d'extension d'une source de données WO2018072363A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201610911941.X 2016-10-19
CN201610911941.XA CN106503125B (zh) 2016-10-19 2016-10-19 一种数据源扩展方法及装置

Publications (1)

Publication Number Publication Date
WO2018072363A1 true WO2018072363A1 (fr) 2018-04-26

Family

ID=58294512

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2017/073611 WO2018072363A1 (fr) 2016-10-19 2017-02-15 Procédé et dispositif d'extension d'une source de données

Country Status (2)

Country Link
CN (1) CN106503125B (fr)
WO (1) WO2018072363A1 (fr)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109241483B (zh) * 2018-08-31 2021-10-12 中国科学院计算技术研究所 一种基于域名推荐的网站发现方法和系统
CN109672678B (zh) * 2018-12-24 2021-05-14 亚信科技(中国)有限公司 一种钓鱼网站识别方法及装置

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102222187A (zh) * 2011-06-02 2011-10-19 国家计算机病毒应急处理中心 基于域名构造特征的挂马网页检测方法
CN104202291A (zh) * 2014-07-11 2014-12-10 西安电子科技大学 基于多因素综合评定方法的反钓鱼方法
CN104765882A (zh) * 2015-04-29 2015-07-08 中国互联网络信息中心 一种基于网页特征字符串的互联网网站统计方法
US20150295942A1 (en) * 2012-12-26 2015-10-15 Sinan TAO Method and server for performing cloud detection for malicious information

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8438642B2 (en) * 2009-06-05 2013-05-07 At&T Intellectual Property I, L.P. Method of detecting potential phishing by analyzing universal resource locators
CN102082792A (zh) * 2010-12-31 2011-06-01 成都市华为赛门铁克科技有限公司 钓鱼网页检测方法及设备
CN103491101A (zh) * 2013-09-30 2014-01-01 北京金山网络科技有限公司 钓鱼网站检测方法、装置及客户端
CN103685307B (zh) * 2013-12-25 2017-08-11 北京奇虎科技有限公司 基于特征库检测钓鱼欺诈网页的方法及系统、客户端、服务器
CN104615760B (zh) * 2015-02-13 2018-04-13 北京瑞星网安技术股份有限公司 钓鱼网站识别方法和系统
CN105138912A (zh) * 2015-09-25 2015-12-09 北京奇虎科技有限公司 钓鱼网站检测规则的自动生成方法及装置

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102222187A (zh) * 2011-06-02 2011-10-19 国家计算机病毒应急处理中心 基于域名构造特征的挂马网页检测方法
US20150295942A1 (en) * 2012-12-26 2015-10-15 Sinan TAO Method and server for performing cloud detection for malicious information
CN104202291A (zh) * 2014-07-11 2014-12-10 西安电子科技大学 基于多因素综合评定方法的反钓鱼方法
CN104765882A (zh) * 2015-04-29 2015-07-08 中国互联网络信息中心 一种基于网页特征字符串的互联网网站统计方法

Also Published As

Publication number Publication date
CN106503125A (zh) 2017-03-15
CN106503125B (zh) 2019-10-15

Similar Documents

Publication Publication Date Title
CN107786575B (zh) 一种基于dns流量的自适应恶意域名检测方法
CN108737423B (zh) 基于网页关键内容相似性分析的钓鱼网站发现方法及系统
Xiang et al. Cantina+ a feature-rich machine learning framework for detecting phishing web sites
Kumar et al. Malicious URL detection using multi-layer filtering model
CN108023868B (zh) 恶意资源地址检测方法和装置
Balduzzi et al. Targeted attacks detection with spunge
CN105677661A (zh) 一种检测社交媒体重复数据的方法
Li et al. Phishing detection based on newly registered domains
Bai Phishing website detection based on machine learning algorithm
Madhubala et al. Survey on malicious URL detection techniques
Nowroozi et al. An adversarial attack analysis on malicious advertisement url detection framework
WO2018072363A1 (fr) Procédé et dispositif d'extension d'une source de données
Khan Detection of phishing websites using deep learning techniques
CN115442075A (zh) 一种基于异质图传播网络的恶意域名检测方法和系统
Wu et al. Malicious website detection based on urls static features
Khan et al. Hot zone identification: Analyzing effects of data sampling on spam clustering
Zhang et al. Detecting the DGA-based malicious domain names
Yazhmozhi et al. Natural language processing and Machine learning based phishing website detection system
Lei et al. Design and implementation of an automatic scanning tool of SQL injection vulnerability based on Web crawler
Lee et al. Collaborative cyberporn filtering with collective intelligence
Tariq et al. USING black-list and white-list technique to detect malicious URLs
Xiong et al. MIRD: trigram-based M alicious URL detection I mplanted with R andom D omain name recognition
Ma Research on black hat SEO behaviour measurement
TWI579717B (zh) Dynamic Web site HTTP network packet and database packet auditing system and method
Lee et al. Objectionable content filtering by click-through data

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17861640

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 17861640

Country of ref document: EP

Kind code of ref document: A1

122 Ep: pct application non-entry in european phase

Ref document number: 17861640

Country of ref document: EP

Kind code of ref document: A1

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 13/12/2019)

122 Ep: pct application non-entry in european phase

Ref document number: 17861640

Country of ref document: EP

Kind code of ref document: A1