Document indexing method and data querying method based on search engine, and server

Info

Publication number
HK1150081A1
Authority
HK
Hong Kong
Prior art keywords
server
search engine
data querying
document indexing
method based
Prior art date
Application number
HK11103854.6A
Other languages
English (en)
Inventor
Lei Wei
Jiaxiang Shen
Original Assignee
Alibaba Group Holding Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alibaba Group Holding Ltd
Publication of HK1150081A1

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24 Querying
    • G06F16/245 Query processing
    • G06F16/2455 Query execution
    • G06F16/24553 Query execution of query operations
    • G06F16/24554 Unary operations; Data partitioning operations
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/31 Indexing; Data structures therefor; Storage structures
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22 Indexing; Data structures therefor; Storage structures
    • G06F16/2228 Indexing structures
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/23 Updating
    • G06F16/2379 Updates performed during online database operations; commit processing
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/31 Indexing; Data structures therefor; Storage structures
    • G06F16/316 Indexing structures
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33 Querying
    • G06F16/3331 Query processing
    • G06F16/3332 Query translation
    • G06F16/3337 Translation of the query language, e.g. Chinese to English
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/93 Document management systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Computational Linguistics (AREA)
  • Business, Economics & Management (AREA)
  • General Business, Economics & Management (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
HK11103854.6A 2009-07-23 2011-04-18 Document indexing method and data querying method based on search engine, and server HK1150081A1 (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2009101514872A CN101963965B (zh) 2009-07-23 2009-07-23 Document indexing method and data querying method based on search engine, and server

Publications (1)

Publication Number Publication Date
HK1150081A1 (en) 2011-10-28

Family

ID=43498187

Family Applications (1)

Application Number Title Priority Date Filing Date
HK11103854.6A HK1150081A1 (en) 2009-07-23 2011-04-18 Document indexing method and data querying method based on search engine, and server

Country Status (6)

Country Link
US (2) US9275128B2 (zh)
EP (1) EP2457185A4 (zh)
JP (1) JP5616444B2 (zh)
CN (1) CN101963965B (zh)
HK (1) HK1150081A1 (zh)
WO (1) WO2011011063A2 (zh)

Families Citing this family (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10311105B2 (en) * 2010-12-28 2019-06-04 Microsoft Technology Licensing, Llc Filtering queried data on data stores
US9129010B2 (en) * 2011-05-16 2015-09-08 Argo Data Resource Corporation System and method of partitioned lexicographic search
CN103064840A (zh) * 2011-10-20 2013-04-24 北京中搜网络技术股份有限公司 Indexing device, indexing method, retrieval device, retrieval method and retrieval system
US9536105B2 (en) 2012-01-26 2017-01-03 Nokia Technologies Oy Method and apparatus for providing data access via multi-user views
US8972715B2 (en) * 2012-07-13 2015-03-03 Securerf Corporation Cryptographic hash function
US9087055B2 (en) 2013-01-28 2015-07-21 International Business Machines Corporation Segmenting documents within a full text index
CN104376014B (zh) * 2013-08-15 2018-03-23 中国科学院声学研究所 Resource publishing and query method in a structured P2P network
US9715515B2 (en) * 2014-01-31 2017-07-25 Microsoft Technology Licensing, Llc External data access with split index
US10095807B2 (en) * 2015-04-28 2018-10-09 Microsoft Technology Licensing, Llc Linked data processor for database storage
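CN106844638B (zh) * 2017-01-19 2020-11-03 杭州汇数智通科技有限公司 Information retrieval method, apparatus and electronic device
CN107451122B (zh) * 2017-08-09 2020-11-13 南京华飞数据技术有限公司 Dynamic n-gram word segmentation method based on Lucene
CN110516141B (zh) * 2019-07-22 2022-08-30 视联动力信息技术股份有限公司 Data query method, apparatus, electronic device and readable storage medium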
US20240020330A1 (en) * 2022-07-18 2024-01-18 Providence St. Joseph Health Searching against attribute values of documents that are explicitly specified as part of the process of publishing the documents

Family Cites Families (37)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5235654A (en) 1992-04-30 1993-08-10 International Business Machines Corporation Advanced data capture architecture data processing system and method for scanned images of document forms
JP3081093B2 (ja) 1993-10-08 2000-08-28 松下電器産業株式会社 Index creation method and apparatus therefor, and document retrieval apparatus
US6076088A (en) * 1996-02-09 2000-06-13 Paik; Woojin Information extraction system and method using concept relation concept (CRC) triples
WO1998016889A1 (fr) * 1996-10-16 1998-04-23 Sharp Kabushiki Kaisha Character input apparatus and data medium in which the character input program is stored
JP4149544B2 (ja) 1997-03-10 2008-09-10 株式会社東芝 Full-text search system and recording medium recording a full-text search program
US6128613A (en) 1997-06-26 2000-10-03 The Chinese University Of Hong Kong Method and apparatus for establishing topic word classes based on an entropy cost function to retrieve documents represented by the topic words
US7039637B2 (en) * 1998-12-31 2006-05-02 International Business Machines Corporation System and method for evaluating characters in an inputted search string against a character table bank comprising a predetermined number of columns that correspond to a plurality of pre-determined candidate character sets in order to provide enhanced full text search
JP3696745B2 (ja) * 1999-02-09 2005-09-21 株式会社日立製作所 Document retrieval method, document retrieval system, and computer-readable recording medium recording a document retrieval program
US6631373B1 (en) 1999-03-02 2003-10-07 Canon Kabushiki Kaisha Segmented document indexing and search
JP2001109754A (ja) * 1999-09-30 2001-04-20 Internatl Business Mach Corp <Ibm> Search method using an index file and apparatus used therefor
US7725307B2 (en) * 1999-11-12 2010-05-25 Phoenix Solutions, Inc. Query engine for processing voice based queries including semantic decoding
US20020022953A1 (en) * 2000-05-24 2002-02-21 Bertolus Phillip Andre Indexing and searching ideographic characters on the internet
US6941513B2 (en) 2000-06-15 2005-09-06 Cognisphere, Inc. System and method for text structuring and text generation
US6687687B1 (en) 2000-07-26 2004-02-03 Zix Scm, Inc. Dynamic indexing information retrieval or filtering system
US6697801B1 (en) * 2000-08-31 2004-02-24 Novell, Inc. Methods of hierarchically parsing and indexing text
US7254269B2 (en) * 2000-08-31 2007-08-07 Hewlett-Packard Development Company, L.P. Character recognition system
CN1253815C (zh) * 2000-09-08 2006-04-26 百度在线网络技术(北京)有限公司 Method for a computer to identify Chinese names in Chinese data
US7860706B2 (en) * 2001-03-16 2010-12-28 Eli Abir Knowledge system method and apparatus
EP1417824A4 (en) * 2001-07-18 2006-09-13 Min-Kyum Kim DEVICE AND METHOD FOR ENTERING ALPHABETIC CHARACTERS
US7814043B2 (en) * 2001-11-26 2010-10-12 Fujitsu Limited Content information analyzing method and apparatus
JP4108337B2 (ja) 2002-01-10 2008-06-25 三菱電機株式会社 Electronic filing system and method for creating a search index therefor
CA2475319A1 (en) * 2002-02-04 2003-08-14 Cataphora, Inc. A method and apparatus to visually present discussions for data mining purposes
CA2501114A1 (en) * 2002-04-12 2003-10-23 Metainformatics System and method for semantics driven data processing
US7254580B1 (en) 2003-07-31 2007-08-07 Google Inc. System and method for selectively searching partitions of a database
US7617249B2 (en) 2003-09-06 2009-11-10 Oracle International Corporation Method and system of handling document operation requests on documents having large collections with constrained memory
US7493322B2 (en) * 2003-10-15 2009-02-17 Xerox Corporation System and method for computing a measure of similarity between documents
US7458022B2 (en) 2003-10-22 2008-11-25 Intel Corporation Hardware/software partition for high performance structured data transformation
GB2417103A (en) * 2004-08-11 2006-02-15 Sdl Plc Natural language translation system
US7487138B2 (en) 2004-08-25 2009-02-03 Symantec Operating Corporation System and method for chunk-based indexing of file system content
US20080077570A1 (en) 2004-10-25 2008-03-27 Infovell, Inc. Full Text Query and Search Systems and Method of Use
US7516125B2 (en) 2005-08-01 2009-04-07 Business Objects Americas Processor for fast contextual searching
US20080155239A1 (en) * 2006-10-10 2008-06-26 Honeywell International Inc. Automata based storage and execution of application logic in smart card like devices
CN101149739A (zh) * 2007-08-24 2008-03-26 中国科学院计算技术研究所 Internet-oriented method and system for mining meaningful strings
US9218166B2 (en) * 2008-02-20 2015-12-22 Embarcadero Technologies, Inc. Development system with improved methodology for creation and reuse of software assets
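JP5408128B2 (ja) * 2008-05-15 2014-02-05 株式会社ニコン Image processing apparatus, image processing method, processing apparatus, and program
JP2009104669A (ja) 2009-02-12 2009-05-14 Toshiba Corp Document retrieval method, system, and program
KR20120009446A (ko) * 2009-03-13 2012-01-31 인벤션 머신 코포레이션 System and method for automated semantic labeling of natural language text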

Also Published As

Publication number Publication date
WO2011011063A2 (en) 2011-01-27
WO2011011063A3 (en) 2014-03-13
US9275128B2 (en) 2016-03-01
CN101963965A (zh) 2011-02-02
US20160171052A1 (en) 2016-06-16
EP2457185A2 (en) 2012-05-30
JP2012533819A (ja) 2012-12-27
JP5616444B2 (ja) 2014-10-29
US9946753B2 (en) 2018-04-17
EP2457185A4 (en) 2015-04-08
CN101963965B (zh) 2013-03-20
US20110022596A1 (en) 2011-01-27

Similar Documents

Publication Publication Date Title
HK1150081A1 (en) Document indexing method and data querying method based on search engine, and server
GB201014670D0 (en) Object matching for tracking, indexing, and search
EP2560102A4 (en) INFORMATION RECOVERY METHOD, INFORMATION RECOVERY SERVER, AND INFORMATION RECOVERY SYSTEM
EP2472490A4 (en) SERVER, SYSTEM AND METHOD FOR MANAGING ENERGY EFFICIENCY INFORMATION
EP2427834A4 (en) METHOD AND SYSTEM FOR SEARCH ENGINE INDICATION AND SEARCH ENGINE WITH THE RELATED INDEX
EP2248006A4 (en) METHOD FOR SEARCHING AND INDEXING DATA AND SYSTEM FOR IMPLEMENTING THE SAME
BRPI1012891A2 (pt) Method, computer-readable storage medium, and server computer
EP2371663A4 (en) ROUTE SEARCH SYSTEM, ROUTE SEARCH SERVER AND ROUTE SEARCH METHOD
BRPI1106073A2 (pt) Electronic device, and method for determining a digital interface
EP2490131A4 (en) SEARCH ENGINE SYSTEM AND INFORMATION SEARCH PROCESS
EP2111590A4 (en) FEDERATED RESEARCH IMPLEMENTATION IN MULTIPLE SEARCH ENGINES
EP2057532A4 (en) METHOD, SYSTEM AND COMPUTER READABLE STORAGE FOR STORE SEARCH
EP2483816A4 (en) SYSTEM AND METHOD FOR BLOCK SEGMENTING, IDENTIFICATION AND INDICATION OF VISUAL ELEMENTS AND DOCUMENT SEARCHING
HK1151870A1 (en) Method and system for information searching
GB201209093D0 (en) Method of searching for document data files based on keywords,and computer system and computer program thereof
EP2165279A4 (en) METHOD AND SYSTEM FOR SEARCHING FOR DIGITAL ASSETS
GB201014954D0 (en) Media item clustering based on similarity data
EP2795486A4 (en) CLIENT-BASED SEARCH FOR LOCAL AND REMOTE DATA SOURCES FOR ANALYSIS, ORIENTATION AND RELEVANCE OF INTENTIONS
HK1162077A1 (en) Digital image retrieval by aggregating search results based on visual annotations
EP2680251A4 (en) SEARCHING SYSTEM, SEARCHING METHOD FOR SEARCHING SYSTEM, INFORMATION PROCESSING DEVICE, SEARCHING PROGRAM, CORRESPONDING KEYWORD MANAGEMENT DEVICE, AND CORRESPONDING KEYWORDS MANAGEMENT SYSTEM
EP2139229A4 (en) IP TELEVISION SYSTEM, MULTIMEDIA SERVER, AND METHOD FOR SEARCHING AND LOCATING IP TELEVISION PROGRAM
HK1153000A1 (en) Checking method, system and server for sql sentence sql
EP2408232A4 (en) PROCESS, SYSTEM AND POLICY SERVER TO ENSURE UNINTERRUPTIBLE DATA
EP2537358C0 (en) DEVICES AND METHODS FOR SEARCHING DATA ON DATA SOURCES ASSOCIATED WITH REGISTERED APPLICATIONS
IT1398993B1 (it) Battery device for an electronic motorcycle

Legal Events

Date Code Title Description
2020-07-27 PC Patent ceased (i.e. patent has lapsed due to the failure to pay the renewal fee)