MXPA05005209A - Metodo y sistema para indexar y buscar bases de datos. - Google Patents
Metodo y sistema para indexar y buscar bases de datos.Info
- Publication number
- MXPA05005209A MXPA05005209A MXPA05005209A MXPA05005209A MXPA05005209A MX PA05005209 A MXPA05005209 A MX PA05005209A MX PA05005209 A MXPA05005209 A MX PA05005209A MX PA05005209 A MXPA05005209 A MX PA05005209A MX PA05005209 A MXPA05005209 A MX PA05005209A
- Authority
- MX
- Mexico
- Prior art keywords
- index
- site
- domain
- database
- sampling
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/951—Indexing; Web crawling techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/22—Indexing; Data structures therefor; Storage structures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9532—Query formulation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9538—Presentation of query results
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/953—Organization of data
- Y10S707/962—Entity-attribute-value
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99933—Query processing, i.e. searching
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99933—Query processing, i.e. searching
- Y10S707/99934—Query formulation, input preparation, or translation
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99933—Query processing, i.e. searching
- Y10S707/99935—Query augmenting and refining, e.g. inexact access
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Software Systems (AREA)
- Mathematical Physics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Un sistema de busqueda genera un indice para bases de datos muestreando generativamente las bases de datos y utiliza ese indice para identificar y formular consultas para buscar las bases de datos. El indice generado es denominado como un indice de atributo de dominio y contiene un indice de nivel de dominio e indices de nivel de sitio. Un indice de nivel de sitio para una base de datos delinea los atributos de sitio para distintos valores de atributo dentro de la base de datos. El indice de nivel de dominio para un dominio delinea valores de atributo para la base de datos y pares de atributo de sitio que contienen esos valores de atributo. Para generar un indice de nivel de sitio para una base de datos dentro de cierto dominio, el sistema de busqueda comienza con un grupo inicial de los datos de muestreo para ese dominio. El sistema de busqueda genera consultas de muestreo con base en los datos de muestra y emite las consultas de muestra para una base de datos. El sistema de busqueda actualiza el indice de nivel de sitio basandose en los resultados del muestreo y utiliza los resultados para generar mas consultas de muestreo.
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US10/846,776 US7136851B2 (en) | 2004-05-14 | 2004-05-14 | Method and system for indexing and searching databases |
Publications (1)
Publication Number | Publication Date |
---|---|
MXPA05005209A true MXPA05005209A (es) | 2005-12-06 |
Family
ID=34939828
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
MXPA05005209A MXPA05005209A (es) | 2004-05-14 | 2005-05-13 | Metodo y sistema para indexar y buscar bases de datos. |
Country Status (10)
Country | Link |
---|---|
US (1) | US7136851B2 (es) |
EP (1) | EP1598756A3 (es) |
JP (1) | JP4249726B2 (es) |
KR (1) | KR101150112B1 (es) |
CN (1) | CN1741017B (es) |
AU (1) | AU2005202020A1 (es) |
BR (1) | BRPI0503221A (es) |
CA (1) | CA2507336C (es) |
MX (1) | MXPA05005209A (es) |
RU (1) | RU2398272C2 (es) |
Families Citing this family (56)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6282956B1 (en) | 1994-12-29 | 2001-09-04 | Kazuhiro Okada | Multi-axial angular velocity sensor |
US9537731B2 (en) | 2004-07-07 | 2017-01-03 | Sciencelogic, Inc. | Management techniques for non-traditional network and information system topologies |
US20060031224A1 (en) * | 2004-08-05 | 2006-02-09 | International Business Machines Corp. | Method, system and computer program product for managing database records with attributes located in multiple databases |
US20060031206A1 (en) | 2004-08-06 | 2006-02-09 | Christian Deubel | Searching for data objects |
US7606793B2 (en) * | 2004-09-27 | 2009-10-20 | Microsoft Corporation | System and method for scoping searches using index keys |
US8799107B1 (en) * | 2004-09-30 | 2014-08-05 | Google Inc. | Systems and methods for scoring documents |
US7469248B2 (en) * | 2005-05-17 | 2008-12-23 | International Business Machines Corporation | Common interface to access catalog information from heterogeneous databases |
US7769742B1 (en) * | 2005-05-31 | 2010-08-03 | Google Inc. | Web crawler scheduler that utilizes sitemaps from websites |
US7801881B1 (en) | 2005-05-31 | 2010-09-21 | Google Inc. | Sitemap generating client for web crawler |
US7599861B2 (en) | 2006-03-02 | 2009-10-06 | Convergys Customer Management Group, Inc. | System and method for closed loop decisionmaking in an automated care system |
US7809663B1 (en) | 2006-05-22 | 2010-10-05 | Convergys Cmg Utah, Inc. | System and method for supporting the utilization of machine language |
US8379830B1 (en) | 2006-05-22 | 2013-02-19 | Convergys Customer Management Delaware Llc | System and method for automated customer service with contingent live interaction |
US20080010238A1 (en) * | 2006-07-07 | 2008-01-10 | Microsoft Corporation | Index having short-term portion and long-term portion |
US7930400B1 (en) | 2006-08-04 | 2011-04-19 | Google Inc. | System and method for managing multiple domain names for a website in a website indexing system |
US8533226B1 (en) | 2006-08-04 | 2013-09-10 | Google Inc. | System and method for verifying and revoking ownership rights with respect to a website in a website indexing system |
US7599920B1 (en) | 2006-10-12 | 2009-10-06 | Google Inc. | System and method for enabling website owners to manage crawl rate in a website indexing system |
US20090089250A1 (en) * | 2007-10-02 | 2009-04-02 | Oracle International Corporation | Contract text search summarized by contract |
KR100859162B1 (ko) * | 2007-10-16 | 2008-09-19 | 펜타시큐리티시스템 주식회사 | 암호화된 칼럼을 포함하는 데이터베이스에서의 쿼리의 암호화 변조를 통한 사용자 쿼리 처리 장치 및 방법 |
US9348912B2 (en) | 2007-10-18 | 2016-05-24 | Microsoft Technology Licensing, Llc | Document length as a static relevance feature for ranking search results |
US8046353B2 (en) * | 2007-11-02 | 2011-10-25 | Citrix Online Llc | Method and apparatus for searching a hierarchical database and an unstructured database with a single search query |
US20090204610A1 (en) * | 2008-02-11 | 2009-08-13 | Hellstrom Benjamin J | Deep web miner |
US8812493B2 (en) | 2008-04-11 | 2014-08-19 | Microsoft Corporation | Search results ranking using editing distance and document information |
US8645391B1 (en) | 2008-07-03 | 2014-02-04 | Google Inc. | Attribute-value extraction from structured documents |
US9189537B2 (en) * | 2008-08-29 | 2015-11-17 | Red Hat, Inc. | Extraction of critical information from database |
CN101667183B (zh) * | 2008-09-02 | 2011-12-21 | 浙江大学 | 一种基于定制的索引建立方法、装置和系统 |
US20100076952A1 (en) * | 2008-09-05 | 2010-03-25 | Xuejun Wang | Self contained multi-dimensional traffic data reporting and analysis in a large scale search hosting system |
US8290923B2 (en) * | 2008-09-05 | 2012-10-16 | Yahoo! Inc. | Performing large scale structured search allowing partial schema changes without system downtime |
US20100076979A1 (en) * | 2008-09-05 | 2010-03-25 | Xuejun Wang | Performing search query dimensional analysis on heterogeneous structured data based on relative density |
CN101727465B (zh) * | 2008-11-03 | 2011-12-21 | 中国移动通信集团公司 | 分布式列存储数据库索引建立、查询方法及装置与系统 |
CN101751406B (zh) * | 2008-12-18 | 2012-01-04 | 赵伟 | 一种实现基于列存储的关系型数据库的方法及装置 |
US20100174719A1 (en) * | 2009-01-06 | 2010-07-08 | Jorge Alegre Vilches | System, method, and program product for personalization of an open network search engine |
CN102004633B (zh) * | 2009-09-03 | 2013-04-24 | 阿里巴巴集团控股有限公司 | 一种处理控件属性的方法及装置 |
US8738635B2 (en) | 2010-06-01 | 2014-05-27 | Microsoft Corporation | Detection of junk in search result ranking |
US9152683B2 (en) * | 2010-10-05 | 2015-10-06 | International Business Machines Corporation | Database-transparent near online archiving and retrieval of data |
CN102467536A (zh) * | 2010-11-12 | 2012-05-23 | 深圳市快易典电子技术有限公司 | 一种字符处理装置及其处理方法 |
US9244976B1 (en) * | 2010-12-16 | 2016-01-26 | The George Washington University and Board of Regents | Just-in-time analytics on large file systems and hidden databases |
US9244975B2 (en) | 2010-12-16 | 2016-01-26 | The George Washington University | Just-in-time analytics on large file systems |
WO2012099490A1 (ru) * | 2011-01-20 | 2012-07-26 | Tabun Peter Yurevich | Система интерактивного поиска и отображения информации |
US10872082B1 (en) * | 2011-10-24 | 2020-12-22 | NetBase Solutions, Inc. | Methods and apparatuses for clustered storage of information |
US20130117257A1 (en) * | 2011-11-03 | 2013-05-09 | Microsoft Corporation | Query result estimation |
US9495462B2 (en) | 2012-01-27 | 2016-11-15 | Microsoft Technology Licensing, Llc | Re-ranking search results |
US8751486B1 (en) * | 2013-07-31 | 2014-06-10 | Splunk Inc. | Executing structured queries on unstructured data |
US9798833B2 (en) * | 2013-12-30 | 2017-10-24 | Sap Ag | Accessing information content in a database platform using metadata |
CA2943513C (en) | 2014-03-29 | 2020-08-04 | Thomson Reuters Global Resources | Improved method, system and software for searching, identifying, retrieving and presenting electronic documents |
KR101565528B1 (ko) | 2014-05-16 | 2015-11-03 | (주)케이사인 | 델타 인덱싱 시스템 및 델타 인덱싱 시스템의 동작 방법 |
CN112417243A (zh) * | 2014-06-24 | 2021-02-26 | 谷歌有限责任公司 | 本地应用的搜索结果 |
US9047246B1 (en) | 2014-07-31 | 2015-06-02 | Splunk Inc. | High availability scheduler |
CN105205104A (zh) * | 2015-08-26 | 2015-12-30 | 成都布林特信息技术有限公司 | 一种云平台数据获取方法 |
WO2017203698A1 (ja) * | 2016-05-27 | 2017-11-30 | 三菱電機株式会社 | 登録先決定装置、登録装置、秘匿検索システム、登録先決定方法及び登録先決定プログラム |
US10909100B2 (en) * | 2017-09-05 | 2021-02-02 | Google Llc | Object identifier index |
RU2733482C2 (ru) | 2018-11-16 | 2020-10-01 | Общество С Ограниченной Ответственностью "Яндекс" | Способ и система для обновления базы данных поискового индекса |
CN109635203B (zh) * | 2018-12-19 | 2020-12-25 | 北京达佳互联信息技术有限公司 | 网页抓取请求处理方法、装置、服务器及存储介质 |
KR102210346B1 (ko) * | 2019-10-16 | 2021-02-02 | 네이버 주식회사 | 대량 알림 발송 방법 및 시스템 |
CN110955711A (zh) * | 2019-11-26 | 2020-04-03 | 南京甄视智能科技有限公司 | 可动态扩展的检索方法与装置 |
US11727014B2 (en) * | 2019-12-12 | 2023-08-15 | The Yes Platform, Inc. | Dynamic filter recommendations |
CN117355827A (zh) * | 2022-03-16 | 2024-01-05 | 库尔马甘贝托夫·阿努阿尔·莱哈诺维奇 | 一种在应用程序的非结构化数据库中组织文档搜索的方法 |
Family Cites Families (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5548770A (en) * | 1993-02-25 | 1996-08-20 | Data Parallel Systems, Inc. | Method and apparatus for improving retrieval of data from a database |
US5999928A (en) * | 1997-06-30 | 1999-12-07 | Informix Software, Inc. | Estimating the number of distinct values for an attribute in a relational database table |
AU7339700A (en) * | 1999-11-16 | 2001-05-30 | Searchcraft Corporation | Method for searching from a plurality of data sources |
US7020679B2 (en) * | 2000-05-12 | 2006-03-28 | Taoofsearch, Inc. | Two-level internet search service system |
JP2002183432A (ja) * | 2000-12-14 | 2002-06-28 | Ibm Japan Ltd | データ抽出方法、データ操作方法、債権情報抽出方法、データベースシステム、債権商品化処理装置、記憶媒体及びコンピュータプログラム |
US7693956B2 (en) * | 2002-03-25 | 2010-04-06 | Michael Z. Moricz | Accessing deep web information using a search engine |
-
2004
- 2004-05-14 US US10/846,776 patent/US7136851B2/en not_active Expired - Fee Related
-
2005
- 2005-05-11 AU AU2005202020A patent/AU2005202020A1/en not_active Abandoned
- 2005-05-13 CA CA2507336A patent/CA2507336C/en not_active Expired - Fee Related
- 2005-05-13 EP EP05104017A patent/EP1598756A3/en not_active Withdrawn
- 2005-05-13 MX MXPA05005209A patent/MXPA05005209A/es active IP Right Grant
- 2005-05-13 BR BR0503221-0A patent/BRPI0503221A/pt not_active IP Right Cessation
- 2005-05-13 KR KR1020050040126A patent/KR101150112B1/ko not_active IP Right Cessation
- 2005-05-13 JP JP2005141126A patent/JP4249726B2/ja not_active Expired - Fee Related
- 2005-05-13 RU RU2005114657/09A patent/RU2398272C2/ru not_active IP Right Cessation
- 2005-05-16 CN CN2005100922433A patent/CN1741017B/zh not_active Expired - Fee Related
Also Published As
Publication number | Publication date |
---|---|
CN1741017B (zh) | 2010-05-26 |
EP1598756A2 (en) | 2005-11-23 |
RU2005114657A (ru) | 2006-11-20 |
US7136851B2 (en) | 2006-11-14 |
AU2005202020A1 (en) | 2005-12-01 |
KR101150112B1 (ko) | 2012-06-08 |
EP1598756A3 (en) | 2006-07-26 |
JP2006012125A (ja) | 2006-01-12 |
CN1741017A (zh) | 2006-03-01 |
BRPI0503221A (pt) | 2006-01-10 |
US20050256865A1 (en) | 2005-11-17 |
CA2507336C (en) | 2013-12-24 |
JP4249726B2 (ja) | 2009-04-08 |
KR20060047882A (ko) | 2006-05-18 |
RU2398272C2 (ru) | 2010-08-27 |
CA2507336A1 (en) | 2005-11-14 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
MXPA05005209A (es) | Metodo y sistema para indexar y buscar bases de datos. | |
CN105868313B (zh) | 一种基于模板匹配技术的知识图谱问答系统及方法 | |
CN109800284B (zh) | 一种面向任务的非结构化信息智能问答系统构建方法 | |
WO2004114163A3 (en) | Method and system for enhanced data searching | |
DE69916272D1 (de) | Methode und verfahren um relevante dokumente in einer datenbank zu finden | |
WO2003017143A3 (en) | Method and system for enhanced data searching | |
WO2006060773A3 (en) | Computer systems and methods for visualizing data with generation of marks | |
MXPA05002608A (es) | Develamiento de intencion de usuario. | |
WO2011160140A8 (en) | System and method of semantic based searching | |
ATE358851T1 (de) | Datenbanksuche mit digitaler tinte unter verwendung von handschriftmerkmalsynthese | |
CN103577442B (zh) | 一种地图数据重要度计算方法及装置 | |
JP2006012125A5 (es) | ||
WO2005060684A3 (en) | Method and system for obtaining solutions to contradictional problems from a semantically indexed database | |
CN106227788A (zh) | 一种以Lucene为基础的数据库查询方法 | |
CN106202450A (zh) | 一种基于makefile文件依赖的源码分析方法 | |
CN104462399A (zh) | 搜索结果的处理方法及装置 | |
CN104915405A (zh) | 一种基于多层次的微博查询扩展方法 | |
CN106933869A (zh) | 一种操作数据库的方法和装置 | |
CN101937433A (zh) | 一种产品实时搜索方法 | |
WO2010117617A3 (en) | Using ontology to order records by relevance | |
US11487795B2 (en) | Template-based automatic software bug question and answer method | |
US20160070707A1 (en) | Keyword search on databases | |
CN106649879A (zh) | 一种图书馆专业书籍智能推荐方法 | |
WO2006031466A3 (en) | Functionality and system for converting data from a first to a second form | |
CN102609455B (zh) | 一种实现汉语同音字检索的方法 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
FG | Grant or registration |