KR101506354B1 - 웹 페이지의 분류 및 대응 콘텐트의 조직화 방법 - Google Patents
웹 페이지의 분류 및 대응 콘텐트의 조직화 방법 Download PDFInfo
- Publication number
- KR101506354B1 KR101506354B1 KR1020097015036A KR20097015036A KR101506354B1 KR 101506354 B1 KR101506354 B1 KR 101506354B1 KR 1020097015036 A KR1020097015036 A KR 1020097015036A KR 20097015036 A KR20097015036 A KR 20097015036A KR 101506354 B1 KR101506354 B1 KR 101506354B1
- Authority
- KR
- South Korea
- Prior art keywords
- internet
- recording
- rti
- web page
- web pages
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9536—Search customisation based on social or collaborative filtering
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9535—Search customisation based on user profiles and personalisation
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9538—Presentation of query results
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Information Transfer Between Computers (AREA)
- Paper (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| IT002436A ITMI20062436A1 (it) | 2006-12-19 | 2006-12-19 | Metodo di classificazione di pagine web e di organizzazione dei corrispondenti contenuti |
| ITMI2006A002436 | 2006-12-19 | ||
| PCT/EP2007/011183 WO2008074486A2 (en) | 2006-12-19 | 2007-12-19 | Method for classifying web pages and organising corresponding contents |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| KR20090100417A KR20090100417A (ko) | 2009-09-23 |
| KR101506354B1 true KR101506354B1 (ko) | 2015-03-30 |
Family
ID=39427655
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| KR1020097015036A Expired - Fee Related KR101506354B1 (ko) | 2006-12-19 | 2007-12-19 | 웹 페이지의 분류 및 대응 콘텐트의 조직화 방법 |
Country Status (11)
| Country | Link |
|---|---|
| US (1) | US8255404B2 (https=) |
| EP (2) | EP2126750A2 (https=) |
| JP (1) | JP5227333B2 (https=) |
| KR (1) | KR101506354B1 (https=) |
| CN (1) | CN101617310A (https=) |
| BR (1) | BRPI0719477B1 (https=) |
| CA (1) | CA2672958C (https=) |
| IL (1) | IL199470A (https=) |
| IT (1) | ITMI20062436A1 (https=) |
| RU (1) | RU2487404C2 (https=) |
| WO (1) | WO2008074486A2 (https=) |
Families Citing this family (12)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN102063469B (zh) * | 2010-12-03 | 2013-04-24 | 百度在线网络技术(北京)有限公司 | 一种用于获取相关关键词信息的方法、装置和计算机设备 |
| US9104765B2 (en) * | 2011-06-17 | 2015-08-11 | Robert Osann, Jr. | Automatic webpage characterization and search results annotation |
| US9286390B2 (en) * | 2011-12-30 | 2016-03-15 | Microsoft Technology Licensing, Llc | Presentation of rich search results in delineated areas |
| US20150046468A1 (en) * | 2013-08-12 | 2015-02-12 | Alcatel Lucent | Ranking linked documents by modeling how links between the documents are used |
| CN104750692B (zh) * | 2013-12-25 | 2018-05-15 | 中国移动通信集团公司 | 一种信息处理方法、信息检索方法及其对应的装置 |
| US9569522B2 (en) | 2014-06-04 | 2017-02-14 | International Business Machines Corporation | Classifying uniform resource locators |
| RU2598789C2 (ru) | 2014-06-30 | 2016-09-27 | Общество С Ограниченной Ответственностью "Яндекс" | Способ представления результатов поиска в соответствии с поисковым запросом в сети интернет |
| WO2016103519A1 (ja) * | 2014-12-26 | 2016-06-30 | 株式会社Ubic | データ分析システム、データ分析方法、およびデータ分析プログラム |
| US10242112B2 (en) | 2015-07-15 | 2019-03-26 | Google Llc | Search result filters from resource content |
| US10318564B2 (en) | 2015-09-28 | 2019-06-11 | Microsoft Technology Licensing, Llc | Domain-specific unstructured text retrieval |
| US10354188B2 (en) | 2016-08-02 | 2019-07-16 | Microsoft Technology Licensing, Llc | Extracting facts from unstructured information |
| US12518111B2 (en) * | 2022-11-21 | 2026-01-06 | Oracle International Corporation | Automating large-scale data collection |
Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2002236661A (ja) | 2000-10-24 | 2002-08-23 | Dualname Inc | インターネット上でユーザが所望する言語を使用する仮想ドメインネームシステム |
| JP2003271670A (ja) | 2002-03-19 | 2003-09-26 | Mitsubishi Electric Corp | 情報収集装置、情報収集方法及びプログラム |
Family Cites Families (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6334145B1 (en) * | 1998-06-30 | 2001-12-25 | International Business Machines Corporation | Method of storing and classifying selectable web page links and sublinks thereof to a predetermined depth in response to a single user input |
| AU2003297523A1 (en) * | 2002-12-24 | 2004-07-22 | American Type Culture Collection | Systems and methods for enabling a user to find information of interest to the user |
| RU2236699C1 (ru) * | 2003-02-25 | 2004-09-20 | Открытое акционерное общество "Телепортал. Ру" | Способ поиска и выборки информации с повышенной релевантностью |
| US20050080770A1 (en) * | 2003-10-14 | 2005-04-14 | Microsoft Corporation | System and process for presenting search results in a tree format |
| US7707201B2 (en) * | 2004-12-06 | 2010-04-27 | Yahoo! Inc. | Systems and methods for managing and using multiple concept networks for assisted search processing |
| US7428533B2 (en) * | 2004-12-06 | 2008-09-23 | Yahoo! Inc. | Automatic generation of taxonomies for categorizing queries and search query processing using taxonomies |
| US7620628B2 (en) * | 2004-12-06 | 2009-11-17 | Yahoo! Inc. | Search processing with automatic categorization of queries |
-
2006
- 2006-12-19 IT IT002436A patent/ITMI20062436A1/it unknown
-
2007
- 2007-12-19 US US12/519,925 patent/US8255404B2/en not_active Expired - Fee Related
- 2007-12-19 BR BRPI0719477-3A patent/BRPI0719477B1/pt not_active IP Right Cessation
- 2007-12-19 JP JP2009541874A patent/JP5227333B2/ja not_active Expired - Fee Related
- 2007-12-19 EP EP07856906A patent/EP2126750A2/en not_active Ceased
- 2007-12-19 EP EP20120150981 patent/EP2466500A1/en not_active Ceased
- 2007-12-19 KR KR1020097015036A patent/KR101506354B1/ko not_active Expired - Fee Related
- 2007-12-19 WO PCT/EP2007/011183 patent/WO2008074486A2/en not_active Ceased
- 2007-12-19 CN CN200780047332A patent/CN101617310A/zh active Pending
- 2007-12-19 CA CA2672958A patent/CA2672958C/en not_active Expired - Fee Related
- 2007-12-19 RU RU2009127889/08A patent/RU2487404C2/ru active
-
2009
- 2009-06-21 IL IL199470A patent/IL199470A/en active IP Right Grant
Patent Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2002236661A (ja) | 2000-10-24 | 2002-08-23 | Dualname Inc | インターネット上でユーザが所望する言語を使用する仮想ドメインネームシステム |
| JP2003271670A (ja) | 2002-03-19 | 2003-09-26 | Mitsubishi Electric Corp | 情報収集装置、情報収集方法及びプログラム |
Also Published As
| Publication number | Publication date |
|---|---|
| CN101617310A (zh) | 2009-12-30 |
| BRPI0719477A2 (pt) | 2014-10-21 |
| WO2008074486A3 (en) | 2008-08-21 |
| WO2008074486A8 (en) | 2009-07-30 |
| RU2009127889A (ru) | 2011-01-27 |
| AU2007334863A1 (en) | 2008-06-26 |
| US8255404B2 (en) | 2012-08-28 |
| IL199470A (en) | 2014-04-30 |
| EP2126750A2 (en) | 2009-12-02 |
| KR20090100417A (ko) | 2009-09-23 |
| BRPI0719477B1 (pt) | 2018-11-27 |
| CA2672958C (en) | 2016-04-26 |
| EP2466500A1 (en) | 2012-06-20 |
| ITMI20062436A1 (it) | 2008-06-20 |
| JP2010514026A (ja) | 2010-04-30 |
| WO2008074486A2 (en) | 2008-06-26 |
| JP5227333B2 (ja) | 2013-07-03 |
| CA2672958A1 (en) | 2008-06-26 |
| RU2487404C2 (ru) | 2013-07-10 |
| US20100241633A1 (en) | 2010-09-23 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| KR101506354B1 (ko) | 웹 페이지의 분류 및 대응 콘텐트의 조직화 방법 | |
| JP4857075B2 (ja) | ウェブドキュメントの集合において効率的に日付を検索する方法、コンピュータプログラム | |
| AU2007243784B2 (en) | Propagating useful information among related web pages, such as web pages of a website | |
| US9558186B2 (en) | Unsupervised extraction of facts | |
| KR100505848B1 (ko) | 검색 시스템 | |
| US20090070322A1 (en) | Browsing knowledge on the basis of semantic relations | |
| WO2015084759A1 (en) | Systems and methods for in-memory database search | |
| WO2009061399A1 (en) | Method for crawling, mapping and extracting information associated with a business using heuristic and semantic analysis | |
| US8423885B1 (en) | Updating search engine document index based on calculated age of changed portions in a document | |
| JPH09265482A (ja) | データベース検索装置及びデータベース検索方法 | |
| WO2009035871A1 (en) | Browsing knowledge on the basis of semantic relations | |
| JP2006529044A (ja) | 定義付けシステムおよび方法 | |
| AU2007334863B2 (en) | Method for classifying web pages and organising corresponding contents | |
| US20080033953A1 (en) | Method to search transactional web pages | |
| Goode et al. | A Toolkit for the Analysis of the NIME Proceedings Archive | |
| Olsson | Using Elasticsearch for full-text searches on unstructured data | |
| Masanés | Archiving the hidden web | |
| Saleh et al. | Effective Web Page Crawler | |
| Meneghello et al. | Unlocking Analytical Value from Social Media and User Generated Content | |
| Syed Mudhasir et al. | An evaluation of provenance-based near-duplicates detection | |
| Boekelo | Automatic collection of Web discussions | |
| Grigalis | Structured data extraction from template-generated web pages | |
| Agichtein | To search or to crawl?: towards a query optimizer for text-centric tasks | |
| PRATHIBHA et al. | AUTOMATIC TEMPLATE DETECTION USING NOVEL APPROACH | |
| Su | HTML-QS: A query system for hypertext markup language documents. |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PA0105 | International application |
St.27 status event code: A-0-1-A10-A15-nap-PA0105 |
|
| P11-X000 | Amendment of application requested |
St.27 status event code: A-2-2-P10-P11-nap-X000 |
|
| P13-X000 | Application amended |
St.27 status event code: A-2-2-P10-P13-nap-X000 |
|
| PG1501 | Laying open of application |
St.27 status event code: A-1-1-Q10-Q12-nap-PG1501 |
|
| A201 | Request for examination | ||
| PA0201 | Request for examination |
St.27 status event code: A-1-2-D10-D11-exm-PA0201 |
|
| E902 | Notification of reason for refusal | ||
| PE0902 | Notice of grounds for rejection |
St.27 status event code: A-1-2-D10-D21-exm-PE0902 |
|
| T11-X000 | Administrative time limit extension requested |
St.27 status event code: U-3-3-T10-T11-oth-X000 |
|
| T11-X000 | Administrative time limit extension requested |
St.27 status event code: U-3-3-T10-T11-oth-X000 |
|
| T11-X000 | Administrative time limit extension requested |
St.27 status event code: U-3-3-T10-T11-oth-X000 |
|
| E13-X000 | Pre-grant limitation requested |
St.27 status event code: A-2-3-E10-E13-lim-X000 |
|
| P11-X000 | Amendment of application requested |
St.27 status event code: A-2-2-P10-P11-nap-X000 |
|
| P13-X000 | Application amended |
St.27 status event code: A-2-2-P10-P13-nap-X000 |
|
| E701 | Decision to grant or registration of patent right | ||
| PE0701 | Decision of registration |
St.27 status event code: A-1-2-D10-D22-exm-PE0701 |
|
| PR0701 | Registration of establishment |
St.27 status event code: A-2-4-F10-F11-exm-PR0701 |
|
| PR1002 | Payment of registration fee |
St.27 status event code: A-2-2-U10-U12-oth-PR1002 Fee payment year number: 1 |
|
| PG1601 | Publication of registration |
St.27 status event code: A-4-4-Q10-Q13-nap-PG1601 |
|
| PN2301 | Change of applicant |
St.27 status event code: A-5-5-R10-R11-asn-PN2301 |
|
| PR1001 | Payment of annual fee |
St.27 status event code: A-4-4-U10-U11-oth-PR1001 Fee payment year number: 4 |
|
| P22-X000 | Classification modified |
St.27 status event code: A-4-4-P10-P22-nap-X000 |
|
| PR1001 | Payment of annual fee |
St.27 status event code: A-4-4-U10-U11-oth-PR1001 Fee payment year number: 5 |
|
| PR1001 | Payment of annual fee |
St.27 status event code: A-4-4-U10-U11-oth-PR1001 Fee payment year number: 6 |
|
| PR1001 | Payment of annual fee |
St.27 status event code: A-4-4-U10-U11-oth-PR1001 Fee payment year number: 7 |
|
| PC1903 | Unpaid annual fee |
St.27 status event code: A-4-4-U10-U13-oth-PC1903 Not in force date: 20220321 Payment event data comment text: Termination Category : DEFAULT_OF_REGISTRATION_FEE |
|
| PC1903 | Unpaid annual fee |
St.27 status event code: N-4-6-H10-H13-oth-PC1903 Ip right cessation event data comment text: Termination Category : DEFAULT_OF_REGISTRATION_FEE Not in force date: 20220321 |