KR20230046494A - Machine learning-based method and system for automatic URL category classification - Google Patents
- Publication number
- KR20230046494A
- Authority
- KR
- South Korea
- Prior art keywords
- url
- machine learning
- analysis target
- category
- target url
- Prior art date
Concepts
ID | Concept | Type | Query match | Sections | Count |
---|---|---|---|---|---|
238000010801 | machine learning | Methods | 0.000 | title, claims, abstract, description | 89 |
238000000034 | method | Methods | 0.000 | title, claims, abstract, description | 28 |
238000007781 | pre-processing | Methods | 0.000 | claims, description | 11 |
238000013473 | artificial intelligence | Methods | 0.000 | claims, description | 5 |
239000000284 | extract | Substances | 0.000 | claims, description | 5 |
230000009193 | crawling | Effects | 0.000 | description | 6 |
238000010586 | diagram | Methods | 0.000 | description | 4 |
238000013528 | artificial neural network | Methods | 0.000 | description | 2 |
230000000877 | morphologic effect | Effects | 0.000 | description | 2 |
230000008569 | process | Effects | 0.000 | description | 2 |
230000000306 | recurrent effect | Effects | 0.000 | description | 2 |
LFQSCWFLJHTTHZ-UHFFFAOYSA-N | Ethanol (CCO) | Chemical compound | 0.000 | description | 1 |
244000061176 | Nicotiana tabacum | Species | 0.000 | description | 1 |
235000002637 | Nicotiana tabacum | Nutrition | 0.000 | description | 1 |
230000003796 | beauty | Effects | 0.000 | description | 1 |
230000008859 | change | Effects | 0.000 | description | 1 |
238000004590 | computer program | Methods | 0.000 | description | 1 |
239000003814 | drug | Substances | 0.000 | description | 1 |
238000001914 | filtering | Methods | 0.000 | description | 1 |
230000036541 | health | Effects | 0.000 | description | 1 |
230000015654 | memory | Effects | 0.000 | description | 1 |
238000012986 | modification | Methods | 0.000 | description | 1 |
230000004048 | modification | Effects | 0.000 | description | 1 |
230000003287 | optical effect | Effects | 0.000 | description | 1 |
230000007115 | recruitment | Effects | 0.000 | description | 1 |
230000004044 | response | Effects | 0.000 | description | 1 |
230000006403 | short-term memory | Effects | 0.000 | description | 1 |
Classifications
- G—PHYSICS
  - G06—COMPUTING; CALCULATING OR COUNTING
    - G06F—ELECTRIC DIGITAL DATA PROCESSING
      - G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
        - G06F16/90—Details of database functions independent of the retrieved data types
          - G06F16/901—Indexing; Data structures therefor; Storage structures
          - G06F16/906—Clustering; Classification
          - G06F16/95—Retrieval from the web
            - G06F16/951—Indexing; Web crawling techniques
            - G06F16/955—Retrieval from the web using information identifiers, e.g. uniform resource locators [URL]
    - G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
      - G06N20/00—Machine learning
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Databases & Information Systems (AREA)
- General Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- General Physics & Mathematics (AREA)
- Software Systems (AREA)
- Evolutionary Computation (AREA)
- Medical Informatics (AREA)
- Computing Systems (AREA)
- Mathematical Physics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Artificial Intelligence (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020210129546A KR20230046494A (ko) | 2021-09-30 | 2021-09-30 | Machine learning-based method and system for automatic URL category classification |
PCT/KR2022/009723 WO2023054858A1 (fr) | 2021-09-30 | 2022-07-06 | Machine learning-based method and system for automatic URL category classification |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020210129546A KR20230046494A (ko) | 2021-09-30 | 2021-09-30 | Machine learning-based method and system for automatic URL category classification |
Publications (1)
Publication Number | Publication Date |
---|---|
KR20230046494A (ko) | 2023-04-06 |
Family
ID=85783049
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020210129546A KR20230046494A (ko) | 2021-09-30 | 2021-09-30 | Machine learning-based method and system for automatic URL category classification |
Country Status (2)
Country | Link |
---|---|
KR (1) | KR20230046494A (fr) |
WO (1) | WO2023054858A1 (fr) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR102580512B1 (ko) * | 2023-04-12 | 2023-09-20 | (주)유알피 | Automated RPA learning apparatus and method for training an automatic sentence clustering deep learning model |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116188120B (zh) * | 2023-04-28 | 2023-07-25 | 北京华阅嘉诚科技发展有限公司 | Audiobook recommendation method, apparatus, system, and storage medium |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR100848319B1 (ko) * | 2006-12-07 | 2008-07-24 | 한국전자통신연구원 | Method and apparatus for blocking harmful sites using web structure information |
US8521667B2 (en) * | 2010-12-15 | 2013-08-27 | Microsoft Corporation | Detection and categorization of malicious URLs |
KR101922144B1 (ko) * | 2017-04-12 | 2019-02-20 | 김승민 | Method for blocking illegal/harmful information through learning of keyword labeling, and apparatus for performing the same |
KR102169143B1 (ko) * | 2019-04-10 | 2020-10-23 | 인천대학교 산학협력단 | Apparatus for filtering URLs of harmful-content web pages |
KR102516454B1 (ko) * | 2019-11-06 | 2023-03-30 | 삼성에스디에스 주식회사 | Method and apparatus for generating URL summaries for URL clustering |
2021
- 2021-09-30: KR application KR1020210129546A (publication KR20230046494A), status unknown
2022
- 2022-07-06: WO application PCT/KR2022/009723 (publication WO2023054858A1), status unknown
Also Published As
Publication number | Publication date |
---|---|
WO2023054858A1 (fr) | 2023-04-06 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
Chawla et al. | Host based intrusion detection system with combined CNN/RNN model | |
US11275900B2 (en) | Systems and methods for automatically assigning one or more labels to discussion topics shown in online forums on the dark web | |
Buber et al. | NLP based phishing attack detection from URLs | |
Kiruthiga et al. | Phishing websites detection using machine learning | |
WO2023054858A1 (fr) | Machine learning-based method and system for automatic URL category classification | |
CN107547490B (zh) | 一种扫描器识别方法、装置及系统 | |
Wang et al. | Representing fine-grained co-occurrences for behavior-based fraud detection in online payment services | |
KR102060766B1 (ko) | 다크웹 범죄 사이트 모니터링 시스템 | |
Liu et al. | An efficient multistage phishing website detection model based on the CASE feature framework: Aiming at the real web environment | |
Halder et al. | Hands-On Machine Learning for Cybersecurity: Safeguard your system by making your machines intelligent using the Python ecosystem | |
Yang et al. | Scalable detection of promotional website defacements in black hat SEO campaigns | |
Wang et al. | Game of Missuggestions: Semantic Analysis of Search-Autocomplete Manipulations. | |
Pan et al. | Webshell detection based on executable data characteristics of php code | |
Abdulrahaman et al. | Phishing attack detection based on random forest with wrapper feature selection method | |
Bollinger | Analyzing cookies compliance with the GDPR | |
Alshammery et al. | Crawling and mining the dark web: A survey on existing and new approaches | |
Al-talak et al. | Detecting server-side request forgery (SSRF) attack by using deep learning techniques | |
Sushma et al. | Deep learning for phishing website detection | |
KR102357630B1 (ko) | 제어시스템 보안이벤트의 공격전략 분류 장치 및 방법 | |
Chen et al. | Retrieving potential cybersecurity information from hacker forums | |
Tong et al. | Detecting gambling sites from post behaviors | |
Millar et al. | Optimising vulnerability triage in dast with deep learning | |
Ram Naresh Yadav et al. | A vector space model approach for web attack classification using machine learning technique | |
Li et al. | A Malicious Webpage Detection Algorithm Based on Image Semantics. | |
Rajaram et al. | Scope of visual-based similarity approach using convolutional neural network on phishing website detection |