EP4052145A4 - Effizientes crawling unter verwendung von pfadplanung und anwendungen davon - Google Patents

Effizientes crawling unter verwendung von pfadplanung und anwendungen davon Download PDF

Info

Publication number
EP4052145A4
EP4052145A4 EP20883009.1A EP20883009A EP4052145A4 EP 4052145 A4 EP4052145 A4 EP 4052145A4 EP 20883009 A EP20883009 A EP 20883009A EP 4052145 A4 EP4052145 A4 EP 4052145A4
Authority
EP
European Patent Office
Prior art keywords
applications
path scheduling
crawling
efficient
efficient crawling
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
EP20883009.1A
Other languages
English (en)
French (fr)
Other versions
EP4052145A1 (de
Inventor
Carlos VERA-CIRO
Robert Raymond Lindner
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Veda Data Solutions Inc
Original Assignee
Veda Data Solutions Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US16/668,544 external-priority patent/US20210133275A1/en
Priority claimed from US16/668,524 external-priority patent/US20210134407A1/en
Application filed by Veda Data Solutions Inc filed Critical Veda Data Solutions Inc
Publication of EP4052145A1 publication Critical patent/EP4052145A1/de
Publication of EP4052145A4 publication Critical patent/EP4052145A4/de
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • G06Q30/0201Market modelling; Market analysis; Collecting market data
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/22Social work or social welfare, e.g. community support activities or counselling services

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Strategic Management (AREA)
  • Theoretical Computer Science (AREA)
  • Development Economics (AREA)
  • General Physics & Mathematics (AREA)
  • Accounting & Taxation (AREA)
  • Finance (AREA)
  • Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Health & Medical Sciences (AREA)
  • Economics (AREA)
  • Tourism & Hospitality (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Marketing (AREA)
  • General Business, Economics & Management (AREA)
  • General Engineering & Computer Science (AREA)
  • Child & Adolescent Psychology (AREA)
  • General Health & Medical Sciences (AREA)
  • Human Resources & Organizations (AREA)
  • Primary Health Care (AREA)
  • Game Theory and Decision Science (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
EP20883009.1A 2019-10-30 2020-10-30 Effizientes crawling unter verwendung von pfadplanung und anwendungen davon Pending EP4052145A4 (de)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US16/668,544 US20210133275A1 (en) 2019-10-30 2019-10-30 Extracting unstructured demographic information from a data source in a structured manner
US16/668,524 US20210134407A1 (en) 2019-10-30 2019-10-30 Efficient crawling using path scheduling, and applications thereof
PCT/US2020/058286 WO2021087308A1 (en) 2019-10-30 2020-10-30 Efficient crawling using path scheduling, and applications thereof

Publications (2)

Publication Number Publication Date
EP4052145A1 EP4052145A1 (de) 2022-09-07
EP4052145A4 true EP4052145A4 (de) 2023-11-01

Family

ID=75716503

Family Applications (1)

Application Number Title Priority Date Filing Date
EP20883009.1A Pending EP4052145A4 (de) 2019-10-30 2020-10-30 Effizientes crawling unter verwendung von pfadplanung und anwendungen davon

Country Status (3)

Country Link
EP (1) EP4052145A4 (de)
CN (1) CN114761945A (de)
WO (1) WO2021087308A1 (de)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130166207A1 (en) * 2011-12-21 2013-06-27 Telenav, Inc. Navigation system with point of interest harvesting mechanism and method of operation thereof
US20160188717A1 (en) * 2014-12-29 2016-06-30 Quixey, Inc. Network crawling prioritization
US20180150562A1 (en) * 2016-11-25 2018-05-31 Cognizant Technology Solutions India Pvt. Ltd. System and Method for Automatically Extracting and Analyzing Data

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6970882B2 (en) * 2002-04-04 2005-11-29 International Business Machines Corporation Unified relational database model for data mining selected model scoring results, model training results where selection is based on metadata included in mining model control table
US6941318B1 (en) * 2002-05-10 2005-09-06 Oracle International Corporation Universal tree interpreter for data mining models
EP1941432A4 (de) * 2005-10-25 2011-04-20 Angoss Software Corp Strategie-bäume zum daten-mining
US8831361B2 (en) * 2012-03-09 2014-09-09 Ancora Software Inc. Method and system for commercial document image classification
US9292797B2 (en) * 2012-12-14 2016-03-22 International Business Machines Corporation Semi-supervised data integration model for named entity classification
RU2571545C1 (ru) * 2014-09-30 2015-12-20 Общество с ограниченной ответственностью "Аби Девелопмент" Классификация изображений документов на основании контента

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130166207A1 (en) * 2011-12-21 2013-06-27 Telenav, Inc. Navigation system with point of interest harvesting mechanism and method of operation thereof
US20160188717A1 (en) * 2014-12-29 2016-06-30 Quixey, Inc. Network crawling prioritization
US20180150562A1 (en) * 2016-11-25 2018-05-31 Cognizant Technology Solutions India Pvt. Ltd. System and Method for Automatically Extracting and Analyzing Data

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of WO2021087308A1 *

Also Published As

Publication number Publication date
EP4052145A1 (de) 2022-09-07
WO2021087308A1 (en) 2021-05-06
CN114761945A (zh) 2022-07-15

Similar Documents

Publication Publication Date Title
EP3587399A4 (de) Verbindung, lichtemittierendes material und lichtemittierendes element
EP3845561A4 (de) Anti-cd47-antikörper und verwendung davon
IL291299A (en) Anti-TNFR2 antibodies, preparations containing them and their uses
EP3917934A4 (de) Verbindungen und verwendungen davon
EP3917526A4 (de) Verbindungen und verwendungen davon
EP3917529A4 (de) Verbindungen und verwendungen davon
EP4076448A4 (de) Fluoroalkyl-oxadiazole und verwendungen davon
EP3917517A4 (de) Verbindungen und verwendungen davon
EP3918323A4 (de) Anti-gal3-antikörper und verwendungen davon
EP3997127A4 (de) Gegen dll3 gerichtete antikörper und verwendungen davon
EP3917527A4 (de) Verbindungen und verwendungen davon
EP3838900A4 (de) 3-aryloxy-3-aryl-propylamin-verbindung und verwendungen davon
EP3810190A4 (de) Technisierte zellen und deren verwendungen
EP3959307A4 (de) Gentechnisch veränderte zellen und verwendungen davon
IL285651A (en) Anti-trem2 antibodies, preparations containing them and their uses
EP3829621A4 (de) Gezüchtete hemikanäle, gezüchtete vesikel und deren verwendung
EP3867353A4 (de) Proto-antigen-präsentierende synthetische oberflächen, aktivierte t-zellen und deren verwendungen
EP3962954A4 (de) Anti-galectin-9-antikörper und verwendungen davon
EP3941908A4 (de) Verbindungen und verwendungen davon
EP4025609A4 (de) Anti-steap1-antikörper und verwendungen davon
EP3978481A4 (de) Isoxazolinverbindung und anwendung davon
EP4071172A4 (de) Anti-lilrb1-antikörper und verwendungen davon
EP3848462A4 (de) Baicalein- und wildbaicalein-synthetisierender mikroorganismus, herstellungsverfahren dafür und anwendungen davon
EP4048697A4 (de) Neuartige anti-cd47-antikörper und verwendungen davon
EP3986935A4 (de) Anti-cd47-antikörper und verwendungen davon

Legal Events

Date Code Title Description
STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE INTERNATIONAL PUBLICATION HAS BEEN MADE

PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: REQUEST FOR EXAMINATION WAS MADE

17P Request for examination filed

Effective date: 20220504

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR

DAV Request for validation of the european patent (deleted)
DAX Request for extension of the european patent (deleted)
A4 Supplementary search report drawn up and despatched

Effective date: 20231005

RIC1 Information provided on ipc code assigned before grant

Ipc: G06Q 30/0201 20230101ALI20230928BHEP

Ipc: G06F 16/951 20190101ALI20230928BHEP

Ipc: G06F 17/00 20190101AFI20230928BHEP