CN1741017A - 用于索引和搜索数据库的方法和装置 - Google Patents
用于索引和搜索数据库的方法和装置 Download PDFInfo
- Publication number
- CN1741017A CN1741017A CNA2005100922433A CN200510092243A CN1741017A CN 1741017 A CN1741017 A CN 1741017A CN A2005100922433 A CNA2005100922433 A CN A2005100922433A CN 200510092243 A CN200510092243 A CN 200510092243A CN 1741017 A CN1741017 A CN 1741017A
- Authority
- CN
- China
- Prior art keywords
- database
- attribute
- property value
- item
- inquiry
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/951—Indexing; Web crawling techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/22—Indexing; Data structures therefor; Storage structures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9532—Query formulation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9538—Presentation of query results
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/953—Organization of data
- Y10S707/962—Entity-attribute-value
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99933—Query processing, i.e. searching
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99933—Query processing, i.e. searching
- Y10S707/99934—Query formulation, input preparation, or translation
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99933—Query processing, i.e. searching
- Y10S707/99935—Query augmenting and refining, e.g. inexact access
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Software Systems (AREA)
- Mathematical Physics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
Claims (27)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US10/846,776 US7136851B2 (en) | 2004-05-14 | 2004-05-14 | Method and system for indexing and searching databases |
US10/846,776 | 2004-05-14 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1741017A true CN1741017A (zh) | 2006-03-01 |
CN1741017B CN1741017B (zh) | 2010-05-26 |
Family
ID=34939828
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN2005100922433A Expired - Fee Related CN1741017B (zh) | 2004-05-14 | 2005-05-16 | 用于索引和搜索数据库的方法和装置 |
Country Status (10)
Country | Link |
---|---|
US (1) | US7136851B2 (zh) |
EP (1) | EP1598756A3 (zh) |
JP (1) | JP4249726B2 (zh) |
KR (1) | KR101150112B1 (zh) |
CN (1) | CN1741017B (zh) |
AU (1) | AU2005202020A1 (zh) |
BR (1) | BRPI0503221A (zh) |
CA (1) | CA2507336C (zh) |
MX (1) | MXPA05005209A (zh) |
RU (1) | RU2398272C2 (zh) |
Cited By (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2010048789A1 (zh) * | 2008-11-03 | 2010-05-06 | 中国移动通信集团公司 | 用于分布式列存储数据库索引建立、查询的方法、装置及系统 |
CN102004633A (zh) * | 2009-09-03 | 2011-04-06 | 阿里巴巴集团控股有限公司 | 一种处理控件属性的方法及装置 |
CN101667183B (zh) * | 2008-09-02 | 2011-12-21 | 浙江大学 | 一种基于定制的索引建立方法、装置和系统 |
CN101751406B (zh) * | 2008-12-18 | 2012-01-04 | 赵伟 | 一种实现基于列存储的关系型数据库的方法及装置 |
CN102467536A (zh) * | 2010-11-12 | 2012-05-23 | 深圳市快易典电子技术有限公司 | 一种字符处理装置及其处理方法 |
CN104750776A (zh) * | 2013-12-30 | 2015-07-01 | Sap欧洲公司 | 使用元数据访问数据库平台中的信息内容 |
CN105205104A (zh) * | 2015-08-26 | 2015-12-30 | 成都布林特信息技术有限公司 | 一种云平台数据获取方法 |
CN110955711A (zh) * | 2019-11-26 | 2020-04-03 | 南京甄视智能科技有限公司 | 可动态扩展的检索方法与装置 |
Families Citing this family (48)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6282956B1 (en) | 1994-12-29 | 2001-09-04 | Kazuhiro Okada | Multi-axial angular velocity sensor |
US9537731B2 (en) * | 2004-07-07 | 2017-01-03 | Sciencelogic, Inc. | Management techniques for non-traditional network and information system topologies |
US20060031224A1 (en) * | 2004-08-05 | 2006-02-09 | International Business Machines Corp. | Method, system and computer program product for managing database records with attributes located in multiple databases |
US20060031206A1 (en) * | 2004-08-06 | 2006-02-09 | Christian Deubel | Searching for data objects |
US7606793B2 (en) * | 2004-09-27 | 2009-10-20 | Microsoft Corporation | System and method for scoping searches using index keys |
US8799107B1 (en) * | 2004-09-30 | 2014-08-05 | Google Inc. | Systems and methods for scoring documents |
US7469248B2 (en) * | 2005-05-17 | 2008-12-23 | International Business Machines Corporation | Common interface to access catalog information from heterogeneous databases |
US7769742B1 (en) | 2005-05-31 | 2010-08-03 | Google Inc. | Web crawler scheduler that utilizes sitemaps from websites |
US7801881B1 (en) | 2005-05-31 | 2010-09-21 | Google Inc. | Sitemap generating client for web crawler |
US7599861B2 (en) | 2006-03-02 | 2009-10-06 | Convergys Customer Management Group, Inc. | System and method for closed loop decisionmaking in an automated care system |
US7809663B1 (en) | 2006-05-22 | 2010-10-05 | Convergys Cmg Utah, Inc. | System and method for supporting the utilization of machine language |
US8379830B1 (en) | 2006-05-22 | 2013-02-19 | Convergys Customer Management Delaware Llc | System and method for automated customer service with contingent live interaction |
US20080010238A1 (en) * | 2006-07-07 | 2008-01-10 | Microsoft Corporation | Index having short-term portion and long-term portion |
US8533226B1 (en) | 2006-08-04 | 2013-09-10 | Google Inc. | System and method for verifying and revoking ownership rights with respect to a website in a website indexing system |
US7930400B1 (en) | 2006-08-04 | 2011-04-19 | Google Inc. | System and method for managing multiple domain names for a website in a website indexing system |
US7599920B1 (en) | 2006-10-12 | 2009-10-06 | Google Inc. | System and method for enabling website owners to manage crawl rate in a website indexing system |
US20090089250A1 (en) * | 2007-10-02 | 2009-04-02 | Oracle International Corporation | Contract text search summarized by contract |
KR100859162B1 (ko) * | 2007-10-16 | 2008-09-19 | 펜타시큐리티시스템 주식회사 | 암호화된 칼럼을 포함하는 데이터베이스에서의 쿼리의 암호화 변조를 통한 사용자 쿼리 처리 장치 및 방법 |
US9348912B2 (en) | 2007-10-18 | 2016-05-24 | Microsoft Technology Licensing, Llc | Document length as a static relevance feature for ranking search results |
US8046353B2 (en) * | 2007-11-02 | 2011-10-25 | Citrix Online Llc | Method and apparatus for searching a hierarchical database and an unstructured database with a single search query |
US20090204610A1 (en) * | 2008-02-11 | 2009-08-13 | Hellstrom Benjamin J | Deep web miner |
US8812493B2 (en) | 2008-04-11 | 2014-08-19 | Microsoft Corporation | Search results ranking using editing distance and document information |
US8645391B1 (en) | 2008-07-03 | 2014-02-04 | Google Inc. | Attribute-value extraction from structured documents |
US9189537B2 (en) * | 2008-08-29 | 2015-11-17 | Red Hat, Inc. | Extraction of critical information from database |
US20100076979A1 (en) * | 2008-09-05 | 2010-03-25 | Xuejun Wang | Performing search query dimensional analysis on heterogeneous structured data based on relative density |
US8290923B2 (en) * | 2008-09-05 | 2012-10-16 | Yahoo! Inc. | Performing large scale structured search allowing partial schema changes without system downtime |
US20100076952A1 (en) * | 2008-09-05 | 2010-03-25 | Xuejun Wang | Self contained multi-dimensional traffic data reporting and analysis in a large scale search hosting system |
US20100174719A1 (en) * | 2009-01-06 | 2010-07-08 | Jorge Alegre Vilches | System, method, and program product for personalization of an open network search engine |
US8738635B2 (en) | 2010-06-01 | 2014-05-27 | Microsoft Corporation | Detection of junk in search result ranking |
US9152683B2 (en) * | 2010-10-05 | 2015-10-06 | International Business Machines Corporation | Database-transparent near online archiving and retrieval of data |
US9244976B1 (en) * | 2010-12-16 | 2016-01-26 | The George Washington University and Board of Regents | Just-in-time analytics on large file systems and hidden databases |
US9244975B2 (en) | 2010-12-16 | 2016-01-26 | The George Washington University | Just-in-time analytics on large file systems |
US20160210336A1 (en) * | 2011-01-20 | 2016-07-21 | Peter Yurevich TABUN | System for interactively searching for and displaying information |
US10872082B1 (en) * | 2011-10-24 | 2020-12-22 | NetBase Solutions, Inc. | Methods and apparatuses for clustered storage of information |
US20130117257A1 (en) * | 2011-11-03 | 2013-05-09 | Microsoft Corporation | Query result estimation |
US9495462B2 (en) | 2012-01-27 | 2016-11-15 | Microsoft Technology Licensing, Llc | Re-ranking search results |
US8751486B1 (en) | 2013-07-31 | 2014-06-10 | Splunk Inc. | Executing structured queries on unstructured data |
US10031913B2 (en) | 2014-03-29 | 2018-07-24 | Camelot Uk Bidco Limited | Method, system and software for searching, identifying, retrieving and presenting electronic documents |
KR101565528B1 (ko) | 2014-05-16 | 2015-11-03 | (주)케이사인 | 델타 인덱싱 시스템 및 델타 인덱싱 시스템의 동작 방법 |
CN106687949B (zh) * | 2014-06-24 | 2020-11-17 | 谷歌有限责任公司 | 本地应用的搜索结果 |
US9047246B1 (en) | 2014-07-31 | 2015-06-02 | Splunk Inc. | High availability scheduler |
JP6381861B2 (ja) * | 2016-05-27 | 2018-08-29 | 三菱電機株式会社 | 登録先決定装置、登録装置、秘匿検索システム、登録先決定方法及び登録先決定プログラム |
US10909100B2 (en) * | 2017-09-05 | 2021-02-02 | Google Llc | Object identifier index |
RU2733482C2 (ru) | 2018-11-16 | 2020-10-01 | Общество С Ограниченной Ответственностью "Яндекс" | Способ и система для обновления базы данных поискового индекса |
CN109635203B (zh) * | 2018-12-19 | 2020-12-25 | 北京达佳互联信息技术有限公司 | 网页抓取请求处理方法、装置、服务器及存储介质 |
KR102210346B1 (ko) * | 2019-10-16 | 2021-02-02 | 네이버 주식회사 | 대량 알림 발송 방법 및 시스템 |
US11727014B2 (en) * | 2019-12-12 | 2023-08-15 | The Yes Platform, Inc. | Dynamic filter recommendations |
CN117355827A (zh) * | 2022-03-16 | 2024-01-05 | 库尔马甘贝托夫·阿努阿尔·莱哈诺维奇 | 一种在应用程序的非结构化数据库中组织文档搜索的方法 |
Family Cites Families (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5548770A (en) * | 1993-02-25 | 1996-08-20 | Data Parallel Systems, Inc. | Method and apparatus for improving retrieval of data from a database |
US5999928A (en) * | 1997-06-30 | 1999-12-07 | Informix Software, Inc. | Estimating the number of distinct values for an attribute in a relational database table |
WO2001037134A1 (en) * | 1999-11-16 | 2001-05-25 | Searchcraft Corporation | Method for searching from a plurality of data sources |
US7020679B2 (en) * | 2000-05-12 | 2006-03-28 | Taoofsearch, Inc. | Two-level internet search service system |
JP2002183432A (ja) * | 2000-12-14 | 2002-06-28 | Ibm Japan Ltd | データ抽出方法、データ操作方法、債権情報抽出方法、データベースシステム、債権商品化処理装置、記憶媒体及びコンピュータプログラム |
AU2003228366A1 (en) * | 2002-03-25 | 2003-10-13 | Michael Z. Morciz | Accessing deep web information using a search engine |
-
2004
- 2004-05-14 US US10/846,776 patent/US7136851B2/en not_active Expired - Fee Related
-
2005
- 2005-05-11 AU AU2005202020A patent/AU2005202020A1/en not_active Abandoned
- 2005-05-13 CA CA2507336A patent/CA2507336C/en not_active Expired - Fee Related
- 2005-05-13 BR BR0503221-0A patent/BRPI0503221A/pt not_active IP Right Cessation
- 2005-05-13 MX MXPA05005209A patent/MXPA05005209A/es active IP Right Grant
- 2005-05-13 EP EP05104017A patent/EP1598756A3/en not_active Withdrawn
- 2005-05-13 KR KR1020050040126A patent/KR101150112B1/ko not_active IP Right Cessation
- 2005-05-13 JP JP2005141126A patent/JP4249726B2/ja not_active Expired - Fee Related
- 2005-05-13 RU RU2005114657/09A patent/RU2398272C2/ru not_active IP Right Cessation
- 2005-05-16 CN CN2005100922433A patent/CN1741017B/zh not_active Expired - Fee Related
Cited By (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101667183B (zh) * | 2008-09-02 | 2011-12-21 | 浙江大学 | 一种基于定制的索引建立方法、装置和系统 |
WO2010048789A1 (zh) * | 2008-11-03 | 2010-05-06 | 中国移动通信集团公司 | 用于分布式列存储数据库索引建立、查询的方法、装置及系统 |
CN101727465B (zh) * | 2008-11-03 | 2011-12-21 | 中国移动通信集团公司 | 分布式列存储数据库索引建立、查询方法及装置与系统 |
CN101751406B (zh) * | 2008-12-18 | 2012-01-04 | 赵伟 | 一种实现基于列存储的关系型数据库的方法及装置 |
CN102004633A (zh) * | 2009-09-03 | 2011-04-06 | 阿里巴巴集团控股有限公司 | 一种处理控件属性的方法及装置 |
CN102004633B (zh) * | 2009-09-03 | 2013-04-24 | 阿里巴巴集团控股有限公司 | 一种处理控件属性的方法及装置 |
CN102467536A (zh) * | 2010-11-12 | 2012-05-23 | 深圳市快易典电子技术有限公司 | 一种字符处理装置及其处理方法 |
CN104750776A (zh) * | 2013-12-30 | 2015-07-01 | Sap欧洲公司 | 使用元数据访问数据库平台中的信息内容 |
CN104750776B (zh) * | 2013-12-30 | 2019-08-30 | Sap欧洲公司 | 使用元数据访问数据库平台中的信息内容 |
CN105205104A (zh) * | 2015-08-26 | 2015-12-30 | 成都布林特信息技术有限公司 | 一种云平台数据获取方法 |
CN110955711A (zh) * | 2019-11-26 | 2020-04-03 | 南京甄视智能科技有限公司 | 可动态扩展的检索方法与装置 |
Also Published As
Publication number | Publication date |
---|---|
KR20060047882A (ko) | 2006-05-18 |
EP1598756A3 (en) | 2006-07-26 |
RU2398272C2 (ru) | 2010-08-27 |
JP4249726B2 (ja) | 2009-04-08 |
CN1741017B (zh) | 2010-05-26 |
AU2005202020A1 (en) | 2005-12-01 |
RU2005114657A (ru) | 2006-11-20 |
CA2507336A1 (en) | 2005-11-14 |
US7136851B2 (en) | 2006-11-14 |
EP1598756A2 (en) | 2005-11-23 |
KR101150112B1 (ko) | 2012-06-08 |
MXPA05005209A (es) | 2005-12-06 |
CA2507336C (en) | 2013-12-24 |
JP2006012125A (ja) | 2006-01-12 |
US20050256865A1 (en) | 2005-11-17 |
BRPI0503221A (pt) | 2006-01-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1741017B (zh) | 用于索引和搜索数据库的方法和装置 | |
US6944609B2 (en) | Search results using editor feedback | |
US7552109B2 (en) | System, method, and service for collaborative focused crawling of documents on a network | |
US7689647B2 (en) | Systems and methods for removing duplicate search engine results | |
US7257577B2 (en) | System, method and service for ranking search results using a modular scoring system | |
US7805432B2 (en) | Meta search engine | |
Green | The evolution of Web searching | |
US7949648B2 (en) | Compiling and accessing subject-specific information from a computer network | |
US7039631B1 (en) | System and method for providing search results with configurable scoring formula | |
CA2288745C (en) | Method and apparatus for searching a database of records | |
IL164723A (en) | Data store for knowledge-based data mining system | |
CA2450882A1 (en) | Automatic search method | |
CN1898667A (zh) | 根据结果与用户查询的相关性增强搜索索引 | |
US20070136248A1 (en) | Keyword driven search for questions in search targets | |
US20020103794A1 (en) | System and method for processing database queries | |
Ali et al. | Search engine effectiveness using query classification: a study | |
Winship | World‐Wide Web searching tools: an evaluation | |
US7680760B2 (en) | System and method for labeling a document | |
Glover et al. | Recommending web documents based on user preferences | |
Mukhopadhyay et al. | An approach to confidence based page ranking for user oriented web search | |
US20060059126A1 (en) | System and method for network searching | |
Deogun et al. | Structural abstractions of hypertext documents for web-based retrieval | |
Handschuh et al. | Deep Annotation for Information Integration. | |
Crean et al. | A weblog recommender using machine learning and semantic web technologies | |
Trevor et al. | A Modern Approach to Searching the World Wide Web: Ranking Pages by Inference over Content. |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
ASS | Succession or assignment of patent right |
Owner name: MICROSOFT TECHNOLOGY LICENSING LLC Free format text: FORMER OWNER: MICROSOFT CORP. Effective date: 20150430 |
|
C41 | Transfer of patent application or patent right or utility model | ||
TR01 | Transfer of patent right |
Effective date of registration: 20150430 Address after: Washington State Patentee after: Micro soft technique license Co., Ltd Address before: Washington State Patentee before: Microsoft Corp. |
|
CF01 | Termination of patent right due to non-payment of annual fee | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20100526 Termination date: 20180516 |