SE0004043D0 - Method and apparatus for document indexing and searching - Google Patents

Method and apparatus for document indexing and searching

Info

Publication number
SE0004043D0
Authority
SE
Sweden
Prior art keywords
document
searching
scores
document indexing
subset
Prior art date
Application number
SE0004043A
Other languages
English (en)
Other versions
SE0004043L (sv)
Inventor
David Sharnoff
Matthew D Dillon
Original Assignee
David Sharnoff
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by David Sharnoff
Publication of SE0004043D0
Publication of SE0004043L

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33 Querying
    • G06F16/3331 Query processing
    • G06F16/334 Query execution
    • G06F16/3346 Query execution using probabilistic model
    • Y GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10 TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10S TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00 Data processing: database and file management or data structures
    • Y10S707/99931 Database or file accessing
    • Y10S707/99933 Query processing, i.e. searching
    • Y10S707/99935 Query augmenting and refining, e.g. inexact access

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Probability & Statistics with Applications (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

SE0004043A 1998-05-12 2000-11-06 Method and apparatus for indexing and searching for documents SE0004043L (sv)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US09/076,757 US6314421B1 (en) 1998-05-12 1998-05-12 Method and apparatus for indexing documents for message filtering
PCT/US1999/010627 WO1999059085A1 (en) 1998-05-12 1999-05-11 Method and apparatus for document indexing and searching

Publications (2)

Publication Number Publication Date
SE0004043D0 (sv) 2000-11-06
SE0004043L (sv) 2001-01-04

Family

Family ID: 22133996

Family Applications (1)

Application Number Title Priority Date Filing Date
SE0004043A SE0004043L (sv) 1998-05-12 2000-11-06 Method and apparatus for indexing and searching for documents

Country Status (6)

Country Link
US (1) US6314421B1 (sv)
AU (1) AU3989899A (sv)
DE (1) DE19983146T1 (sv)
GB (1) GB2353883B (sv)
SE (1) SE0004043L (sv)
WO (1) WO1999059085A1 (sv)

Families Citing this family (62)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6701347B1 (en) 1998-09-23 2004-03-02 John W. L. Ogilvie Method for including a self-removing code in a self-removing email message that contains an advertisement
US6324569B1 (en) * 1998-09-23 2001-11-27 John W. L. Ogilvie Self-removing email verified or designated as such by a message distributor for the convenience of a recipient
US6757713B1 (en) 1998-09-23 2004-06-29 John W. L. Ogilvie Method for including a self-removing indicator in a self-removing message
US7095843B1 (en) * 1999-02-09 2006-08-22 Rockwell Electronic Commerce Technologies, Llc Selective messaging in a multiple messaging link environment
US6592627B1 (en) * 1999-06-10 2003-07-15 International Business Machines Corporation System and method for organizing repositories of semi-structured documents such as email
JP4065381B2 (ja) 1999-11-10 2008-03-26 Yahoo! Inc. Internet radio and broadcast method
US6389467B1 (en) 2000-01-24 2002-05-14 Friskit, Inc. Streaming media search and continuous playback system of media resources located by multiple network addresses
US8352331B2 (en) 2000-05-03 2013-01-08 Yahoo! Inc. Relationship discovery engine
US7251665B1 (en) * 2000-05-03 2007-07-31 Yahoo! Inc. Determining a known character string equivalent to a query string
US7162482B1 (en) 2000-05-03 2007-01-09 Musicmatch, Inc. Information retrieval engine
US7376635B1 (en) * 2000-07-21 2008-05-20 Ford Global Technologies, Llc Theme-based system and method for classifying documents
US6772196B1 (en) * 2000-07-27 2004-08-03 Propel Software Corp. Electronic mail filtering system and methods
GB2366706B (en) * 2000-08-31 2004-11-03 Content Technologies Ltd Monitoring electronic mail messages digests
US8271333B1 (en) 2000-11-02 2012-09-18 Yahoo! Inc. Content-related wallpaper
US6898592B2 (en) * 2000-12-27 2005-05-24 Microsoft Corporation Scoping queries in a search engine
US20020198866A1 (en) * 2001-03-13 2002-12-26 Reiner Kraft Credibility rating platform
US20030046297A1 (en) * 2001-08-30 2003-03-06 Kana Software, Inc. System and method for a partially self-training learning system
JP4082059B2 (ja) * 2002-03-29 2008-04-30 Sony Corporation Information processing apparatus and method, recording medium, and program
US7707221B1 (en) 2002-04-03 2010-04-27 Yahoo! Inc. Associating and linking compact disc metadata
US8046832B2 (en) 2002-06-26 2011-10-25 Microsoft Corporation Spam detector with challenges
AU2003258037B2 (en) * 2002-08-05 2009-11-26 Nokia Corporation Desktop client interaction with a geographic text search system
US7249162B2 (en) 2003-02-25 2007-07-24 Microsoft Corporation Adaptive junk message filtering system
US7543053B2 (en) * 2003-03-03 2009-06-02 Microsoft Corporation Intelligent quarantining for spam prevention
US7219148B2 (en) * 2003-03-03 2007-05-15 Microsoft Corporation Feedback loop for spam prevention
US7483947B2 (en) * 2003-05-02 2009-01-27 Microsoft Corporation Message rendering for identification of content features
US7272853B2 (en) * 2003-06-04 2007-09-18 Microsoft Corporation Origination/destination features and lists for spam prevention
US7519668B2 (en) * 2003-06-20 2009-04-14 Microsoft Corporation Obfuscation of spam filter
US7711779B2 (en) * 2003-06-20 2010-05-04 Microsoft Corporation Prevention of outgoing spam
US8533270B2 (en) * 2003-06-23 2013-09-10 Microsoft Corporation Advanced spam detection techniques
US20050060643A1 (en) * 2003-08-25 2005-03-17 Miavia, Inc. Document similarity detection and classification system
KR20060120029A (ko) 2003-09-10 2006-11-24 Musicmatch, Inc. System and method for purchasing and playing music
US8214438B2 (en) * 2004-03-01 2012-07-03 Microsoft Corporation (More) advanced spam detection features
US20050204005A1 (en) * 2004-03-12 2005-09-15 Purcell Sean E. Selective treatment of messages based on junk rating
US20050204006A1 (en) * 2004-03-12 2005-09-15 Purcell Sean E. Message junk rating interface
US7664819B2 (en) * 2004-06-29 2010-02-16 Microsoft Corporation Incremental anti-spam lookup and update service
US7904517B2 (en) * 2004-08-09 2011-03-08 Microsoft Corporation Challenge response systems
US7660865B2 (en) * 2004-08-12 2010-02-09 Microsoft Corporation Spam filtering with probabilistic secure hashes
US7606793B2 (en) 2004-09-27 2009-10-20 Microsoft Corporation System and method for scoping searches using index keys
US7930353B2 (en) 2005-07-29 2011-04-19 Microsoft Corporation Trees of classifiers for detecting email spam
US8065370B2 (en) 2005-11-03 2011-11-22 Microsoft Corporation Proofs to filter spam
JP4251652B2 (ja) * 2006-06-09 2009-04-08 International Business Machines Corporation Search device, search program, and search method
US8224905B2 (en) 2006-12-06 2012-07-17 Microsoft Corporation Spam filtration utilizing sender activity data
US9348912B2 (en) 2007-10-18 2016-05-24 Microsoft Technology Licensing, Llc Document length as a static relevance feature for ranking search results
US8566256B2 (en) * 2008-04-01 2013-10-22 Certona Corporation Universal system and method for representing and predicting human behavior
US8812493B2 (en) 2008-04-11 2014-08-19 Microsoft Corporation Search results ranking using editing distance and document information
US8135930B1 (en) 2008-07-14 2012-03-13 Vizioncore, Inc. Replication systems and methods for a virtual computing environment
US8060476B1 (en) 2008-07-14 2011-11-15 Quest Software, Inc. Backup systems and methods for a virtual computing environment
US8046550B2 (en) 2008-07-14 2011-10-25 Quest Software, Inc. Systems and methods for performing backup operations of virtual machine files
US8429649B1 (en) 2008-09-25 2013-04-23 Quest Software, Inc. Systems and methods for data management in a virtual computing environment
US8996468B1 (en) 2009-04-17 2015-03-31 Dell Software Inc. Block status mapping system for reducing virtual machine backup storage
US9778946B2 (en) 2009-08-07 2017-10-03 Dell Software Inc. Optimized copy of virtual machine storage files
US8738635B2 (en) 2010-06-01 2014-05-27 Microsoft Corporation Detection of junk in search result ranking
US9569446B1 (en) 2010-06-08 2017-02-14 Dell Software Inc. Cataloging system for image-based backup
US8898114B1 (en) 2010-08-27 2014-11-25 Dell Software Inc. Multitier deduplication systems and methods
US9251508B2 (en) 2010-12-09 2016-02-02 At&T Intellectual Property I, L.P. Intelligent message processing
US8943071B2 (en) 2011-08-23 2015-01-27 At&T Intellectual Property I, L.P. Automatic sort and propagation associated with electronic documents
US9495462B2 (en) 2012-01-27 2016-11-15 Microsoft Technology Licensing, Llc Re-ranking search results
US9311375B1 (en) 2012-02-07 2016-04-12 Dell Software Inc. Systems and methods for compacting a virtual machine file
KR101264151B1 (ko) 2012-10-24 2013-05-14 Muhayu Inc. Apparatus and method for calculating a document plagiarism rate, and recording medium storing a program for implementing the same
US20160027119A1 (en) * 2014-07-24 2016-01-28 Madhu KOLACHINA Health or pharmacy plan benefit testing
KR101634681B1 (ko) * 2015-09-03 2016-06-29 Muhayu Inc. Method and program for finding quoted passages in a document under inspection
US10942909B2 (en) * 2018-09-25 2021-03-09 Salesforce.Com, Inc. Efficient production and consumption for data changes in a database under high concurrency

Family Cites Families (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5107419A (en) 1987-12-23 1992-04-21 International Business Machines Corporation Method of assigning retention and deletion criteria to electronic documents stored in an interactive information handling system
US5469354A (en) 1989-06-14 1995-11-21 Hitachi, Ltd. Document data processing method and apparatus for document retrieval
US5479654A (en) * 1990-04-26 1995-12-26 Squibb Data Systems, Inc. Apparatus and method for reconstructing a file from a difference signature and an original file
US5276869A (en) 1990-09-10 1994-01-04 International Business Machines Corporation System for selecting document recipients as determined by technical content of document and for electronically corroborating receipt of document
US5276741A (en) 1991-05-16 1994-01-04 Trw Financial Systems & Services, Inc. Fuzzy string matcher
US5204958A (en) 1991-06-27 1993-04-20 Digital Equipment Corporation System and method for efficiently indexing and storing a large database with high data insertion frequency
US5375235A (en) 1991-11-05 1994-12-20 Northern Telecom Limited Method of indexing keywords for searching in a database recorded on an information recording medium
GB9220404D0 (en) 1992-08-20 1992-11-11 Nat Security Agency Method of identifying,retrieving and sorting documents
US5701459A (en) 1993-01-13 1997-12-23 Novell, Inc. Method and apparatus for rapid full text index creation
JP3168756B2 (ja) 1993-02-24 2001-05-21 Minolta Co., Ltd. Mail management method for an electronic mail system
US5758257A (en) * 1994-11-29 1998-05-26 Herz; Frederick System and method for scheduling broadcast of and access to video programs and other data using customer profiles
US5659771A (en) * 1995-05-19 1997-08-19 Mitsubishi Electric Information Technology Center America, Inc. System for spelling correction in which the context of a target word in a sentence is utilized to determine which of several possible words was intended
WO1996041281A1 (en) * 1995-06-07 1996-12-19 International Language Engineering Corporation Machine assisted translation tools
US5764899A (en) * 1995-11-13 1998-06-09 Motorola, Inc. Method and apparatus for communicating an optimized reply
US5745899A (en) * 1996-08-09 1998-04-28 Digital Equipment Corporation Method for indexing information of a database
US6105023A (en) * 1997-08-18 2000-08-15 Dataware Technologies, Inc. System and method for filtering a document stream
US5991714A (en) * 1998-04-22 1999-11-23 The United States Of America As Represented By The National Security Agency Method of identifying data type and locating in a file

Also Published As

Publication number Publication date
GB2353883B (en) 2003-03-19
WO1999059085A1 (en) 1999-11-18
GB2353883A (en) 2001-03-07
AU3989899A (en) 1999-11-29
GB0027292D0 (en) 2000-12-27
US6314421B1 (en) 2001-11-06
SE0004043L (sv) 2001-01-04
DE19983146T1 (de) 2001-05-10

Similar Documents

Publication Publication Date Title
SE0004043D0 (sv) Method and apparatus for document indexing and searching
BRPI0411423A (pt) System and method for ranking search results, computer-readable storage medium holding code, and apparatus
CA2092629A1 (en) Database searching system and method using a two dimensional marking matrix
CA2329558A1 (en) Methods and apparatus for similarity text search based on conceptual indexing
SG142159A1 (en) Index structure of metadata, method for providing indices of metadata, and metadata searching method and apparatus using the indices of metadata
WO1997038390A3 (en) Browse by prompted keyword phrases
EP1367509A3 (en) Method and apparatus for categorizing and presenting documents of a distributed database
DE69916272D1 (de) Method and process for finding relevant documents in a database
ATE378643T1 (de) Index structure of metadata, method for providing indices of metadata, and metadata searching method and apparatus using the indices of metadata
WO2004114163A3 (en) Method and system for enhanced data searching
AU6327501A (en) Method and apparatus for identifying related searches in a database search system
DE69032712D1 (de) Hierarchical presearch-type document search method, apparatus therefor, and magnetic disk arrangement for the apparatus
NZ332479A (en) Searching for relevant hyperlinked documents
EP1411448A3 (en) Data searching apparatus
WO2003017133A3 (en) System and method for retrieving location based site data
GB9918611D0 (en) Music database searching
BR0111192A (pt) Sistema e método para automaticamente gerar consultas a bancos de dados
MY132104A (en) System and method for searching for duplicate data
WO2001069450A3 (en) Method for automated web site maintenance via searching
TWI266213B (en) Sequence based indexing and retrieval method for text documents
EA199900027A1 (ru) Method for producing certain azacyclohexapeptides
Chandrasekar et al. Institute for Research in Cognitive Science
WO2002069202A3 (en) Method for determining synthetic term senses using reference text
EA200100467A1 (ru) Method for searching electronic documents and fragments thereof stored on data storage devices
CA2253744A1 (en) Indexing databases for efficient relational querying

Legal Events

Date Code Title Description
NAV Patent application has lapsed