WO1994023386A3 - Probabilistic information retrieval networks - Google Patents

Probabilistic information retrieval networks Download PDF

Info

Publication number
WO1994023386A3
WO1994023386A3 PCT/US1994/002579 US9402579W WO9423386A3 WO 1994023386 A3 WO1994023386 A3 WO 1994023386A3 US 9402579 W US9402579 W US 9402579W WO 9423386 A3 WO9423386 A3 WO 9423386A3
Authority
WO
Grant status
Application
Patent type
Prior art keywords
documents
occurrence
representation
frequency
document
Prior art date
Application number
PCT/US1994/002579
Other languages
French (fr)
Other versions
WO1994023386A2 (en )
Inventor
Howard R Turtle
Gerald J Morton
F Kinley Larntz
Original Assignee
West Publishing Co
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor ; File system structures therefor
    • G06F17/3061Information retrieval; Database structures therefor ; File system structures therefor of unstructured textual data
    • G06F17/30634Querying
    • G06F17/30657Query processing
    • G06F17/30675Query execution
    • G06F17/30687Query execution using probabilistic model
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor ; File system structures therefor
    • G06F17/30011Document retrieval systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor ; File system structures therefor
    • G06F17/3061Information retrieval; Database structures therefor ; File system structures therefor of unstructured textual data
    • G06F17/30634Querying
    • G06F17/30657Query processing
    • G06F17/3066Query translation
    • G06F17/30666Syntactic pre-processing steps, e.g. stopword elimination, stemming
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99931Database or file accessing
    • Y10S707/99933Query processing, i.e. searching
    • Y10S707/99935Query augmenting and refining, e.g. inexact access

Abstract

The frequency of occurrence of a representation in a collection of documents is estimated for document retrieval purposes by identifying the actual frequency of occurrence (actual fi) of the representation in a sample (ni) of documents and calculating the difference between the maximum (fmax) and minimum (fmin) probable frequencies of occurrence of the representation in the collection. If the difference does not exceed a limit, a midpoint of the maximum and minimum probable frequencies (fmean) is the estimated frequency of occurrence of the representation. Document distribution probabilities are optimized and probability thresholds are established for the identification of documents. An initial probability threshold is established and is adjusted as the probabilities are scored for documents in samples. The document result list (170) is iteratively adjusted through the samples.
PCT/US1994/002579 1991-10-08 1994-03-10 Probabilistic information retrieval networks WO1994023386A3 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
US08039757 US5488725A (en) 1991-10-08 1993-03-30 System of document representation retrieval by successive iterated probability sampling
US08/039,757 1993-03-30

Publications (2)

Publication Number Publication Date
WO1994023386A2 true WO1994023386A2 (en) 1994-10-13
WO1994023386A3 true true WO1994023386A3 (en) 1994-11-10

Family

ID=21907211

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US1994/002579 WO1994023386A3 (en) 1991-10-08 1994-03-10 Probabilistic information retrieval networks

Country Status (2)

Country Link
US (1) US5488725A (en)
WO (1) WO1994023386A3 (en)

Families Citing this family (258)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5649183A (en) * 1992-12-08 1997-07-15 Microsoft Corporation Method for compressing full text indexes with document identifiers and location offsets
US5619709A (en) * 1993-09-20 1997-04-08 Hnc, Inc. System and method of context vector generation and retrieval
WO1995023388A1 (en) * 1994-02-28 1995-08-31 Non Stop Info Ab Method and control device for authentication
US7181758B1 (en) 1994-07-25 2007-02-20 Data Innovation, L.L.C. Information distribution and processing system
US7991347B1 (en) 1994-04-07 2011-08-02 Data Innovation Llc System and method for accessing set of digital data at a remote site
US6473860B1 (en) 1994-04-07 2002-10-29 Hark C. Chan Information distribution and processing system
US5704018A (en) * 1994-05-09 1997-12-30 Microsoft Corporation Generating improved belief networks
JPH07319918A (en) * 1994-05-24 1995-12-08 Fuji Xerox Co Ltd Device for specifying retrieving object in document
JPH07319917A (en) * 1994-05-24 1995-12-08 Fuji Xerox Co Ltd Document data base managing device and document data base system
US5745745A (en) * 1994-06-29 1998-04-28 Hitachi, Ltd. Text search method and apparatus for structured documents
JP3030533B2 (en) * 1994-07-26 2000-04-10 篤 今野 Information classification device
US5717913A (en) * 1995-01-03 1998-02-10 University Of Central Florida Method for detecting and extracting text data using database schemas
US5794050A (en) * 1995-01-04 1998-08-11 Intelligent Text Processing, Inc. Natural language understanding system
US5946678A (en) * 1995-01-11 1999-08-31 Philips Electronics North America Corporation User interface for document retrieval
US5694559A (en) * 1995-03-07 1997-12-02 Microsoft Corporation On-line help method and system utilizing free text query
US5855015A (en) * 1995-03-20 1998-12-29 Interval Research Corporation System and method for retrieval of hyperlinked information resources
US5748954A (en) * 1995-06-05 1998-05-05 Carnegie Mellon University Method for searching a queued and ranked constructed catalog of files stored on a network
US5675710A (en) * 1995-06-07 1997-10-07 Lucent Technologies, Inc. Method and apparatus for training a text classifier
US6067552A (en) * 1995-08-21 2000-05-23 Cnet, Inc. User interface system and method for browsing a hypertext database
JPH0981574A (en) * 1995-09-14 1997-03-28 Fujitsu Ltd Method and system for data base retrieval using retrieval set display picture
JP3040945B2 (en) * 1995-11-29 2000-05-15 松下電器産業株式会社 Document retrieval apparatus
US5787424A (en) * 1995-11-30 1998-07-28 Electronic Data Systems Corporation Process and system for recursive document retrieval
US5689696A (en) * 1995-12-28 1997-11-18 Lucent Technologies Inc. Method for maintaining information in a database used to generate high biased histograms using a probability function, counter and threshold values
US5819260A (en) * 1996-01-22 1998-10-06 Lexis-Nexis Phrase recognition method and apparatus
US5754840A (en) * 1996-01-23 1998-05-19 Smartpatents, Inc. System, method, and computer program product for developing and maintaining documents which includes analyzing a patent application with regards to the specification and claims
WO1997038377A1 (en) * 1996-04-10 1997-10-16 At & T Corp. A system and method for finding information in a distributed information system using query learning and meta search
JP3113814B2 (en) * 1996-04-17 2000-12-04 インターナショナル・ビジネス・マシーンズ・コーポレ−ション Information retrieval methods and information retrieval apparatus
US5995921A (en) * 1996-04-23 1999-11-30 International Business Machines Corporation Natural language help interface
US5721896A (en) * 1996-05-13 1998-02-24 Lucent Technologies Inc. Method for skew resistant join size estimation
US7555458B1 (en) * 1996-06-05 2009-06-30 Fraud Control System.Com Corporation Method of billing a purchase made over a computer network
US20030195847A1 (en) * 1996-06-05 2003-10-16 David Felger Method of billing a purchase made over a computer network
US8229844B2 (en) 1996-06-05 2012-07-24 Fraud Control Systems.Com Corporation Method of billing a purchase made over a computer network
US5778362A (en) * 1996-06-21 1998-07-07 Kdl Technologies Limted Method and system for revealing information structures in collections of data items
US6581056B1 (en) * 1996-06-27 2003-06-17 Xerox Corporation Information retrieval system providing secondary content analysis on collections of information objects
US5813002A (en) * 1996-07-31 1998-09-22 International Business Machines Corporation Method and system for linearly detecting data deviations in a large database
US5787435A (en) * 1996-08-09 1998-07-28 Digital Equipment Corporation Method for mapping an index of a database into an array of files
US5765158A (en) * 1996-08-09 1998-06-09 Digital Equipment Corporation Method for sampling a compressed index to create a summarized index
JP3099756B2 (en) * 1996-10-31 2000-10-16 富士ゼロックス株式会社 Document processing apparatus, a word extractor and a word extracting method
US5950189A (en) * 1997-01-02 1999-09-07 At&T Corp Retrieval system and method
US6128712A (en) 1997-01-31 2000-10-03 Macromedia, Inc. Method and apparatus for improving playback of interactive multimedia works
US5920859A (en) * 1997-02-05 1999-07-06 Idd Enterprises, L.P. Hypertext document retrieval system and method
DE29704393U1 (en) * 1997-03-11 1997-07-17 Aesculap Ag An apparatus for pre-operative determination of the position data of Endoprothesenteilen
US7308485B2 (en) * 1997-04-15 2007-12-11 Gracenote, Inc. Method and system for accessing web pages based on playback of recordings
US7167857B2 (en) 1997-04-15 2007-01-23 Gracenote, Inc. Method and system for finding approximate matches in database
US5895464A (en) * 1997-04-30 1999-04-20 Eastman Kodak Company Computer program product and a method for using natural language for the description, search and retrieval of multi-media objects
US6460034B1 (en) * 1997-05-21 2002-10-01 Oracle Corporation Document knowledge base research and retrieval system
US6128613A (en) * 1997-06-26 2000-10-03 The Chinese University Of Hong Kong Method and apparatus for establishing topic word classes based on an entropy cost function to retrieve documents represented by the topic words
US5873081A (en) * 1997-06-27 1999-02-16 Microsoft Corporation Document filtering via directed acyclic graphs
US5926808A (en) * 1997-07-25 1999-07-20 Claritech Corporation Displaying portions of text from multiple documents over multiple databases related to a search query in a computer network
US5950196A (en) * 1997-07-25 1999-09-07 Sovereign Hill Software, Inc. Systems and methods for retrieving tabular data from textual sources
US6105023A (en) * 1997-08-18 2000-08-15 Dataware Technologies, Inc. System and method for filtering a document stream
US6081805A (en) * 1997-09-10 2000-06-27 Netscape Communications Corporation Pass-through architecture via hash techniques to remove duplicate query results
US6018733A (en) * 1997-09-12 2000-01-25 Infoseek Corporation Methods for iteratively and interactively performing collection selection in full text searches
US5845278A (en) * 1997-09-12 1998-12-01 Inioseek Corporation Method for automatically selecting collections to search in full text searches
DE69730057T2 (en) * 1997-09-29 2005-08-04 Webplus Ltd., Road Town A multi-element equivalent trust system and process therefor
US5966702A (en) 1997-10-31 1999-10-12 Sun Microsystems, Inc. Method and apparatus for pre-processing and packaging class files
US5987457A (en) * 1997-11-25 1999-11-16 Acceleration Software International Corporation Query refinement method for searching documents
US6389436B1 (en) * 1997-12-15 2002-05-14 International Business Machines Corporation Enhanced hypertext categorization using hyperlinks
US5983221A (en) * 1998-01-13 1999-11-09 Wordstream, Inc. Method and apparatus for improved document searching
US6119124A (en) * 1998-03-26 2000-09-12 Digital Equipment Corporation Method for clustering closely resembling data objects
US7529756B1 (en) * 1998-07-21 2009-05-05 West Services, Inc. System and method for processing formatted text documents in a database
US7778954B2 (en) 1998-07-21 2010-08-17 West Publishing Corporation Systems, methods, and software for presenting legal case histories
US6363377B1 (en) * 1998-07-30 2002-03-26 Sarnoff Corporation Search data processor
US6405188B1 (en) * 1998-07-31 2002-06-11 Genuity Inc. Information retrieval system
US8380875B1 (en) 1998-09-18 2013-02-19 Oracle International Corporation Method and system for addressing a communication document for transmission over a network based on the content thereof
WO2000017727A3 (en) 1998-09-18 2000-08-03 Tacit Knowledge Systems Method and apparatus for querying a user knowledge profile
US6154783A (en) 1998-09-18 2000-11-28 Tacit Knowledge Systems Method and apparatus for addressing an electronic document for transmission over a network
US6253202B1 (en) 1998-09-18 2001-06-26 Tacit Knowledge Systems, Inc. Method, system and apparatus for authorizing access by a first user to a knowledge profile of a second user responsive to an access request from the first user
US6115709A (en) * 1998-09-18 2000-09-05 Tacit Knowledge Systems, Inc. Method and system for constructing a knowledge profile of a user having unrestricted and restricted access portions according to respective levels of confidence of content of the portions
WO2000017784A1 (en) * 1998-09-18 2000-03-30 Tacit Knowledge Systems Method of constructing and displaying an entity profile constructed utilizing input from entities other than the owner
US6377949B1 (en) * 1998-09-18 2002-04-23 Tacit Knowledge Systems, Inc. Method and apparatus for assigning a confidence level to a term within a user knowledge profile
US6549897B1 (en) * 1998-10-09 2003-04-15 Microsoft Corporation Method and system for calculating phrase-document importance
US6366910B1 (en) 1998-12-07 2002-04-02 Amazon.Com, Inc. Method and system for generation of hierarchical search results
US6430557B1 (en) * 1998-12-16 2002-08-06 Xerox Corporation Identifying a group of words using modified query words obtained from successive suffix relationships
US6327593B1 (en) * 1998-12-23 2001-12-04 Unisys Corporation Automated system and method for capturing and managing user knowledge within a search system
US7003719B1 (en) 1999-01-25 2006-02-21 West Publishing Company, Dba West Group System, method, and software for inserting hyperlinks into documents
US6360227B1 (en) * 1999-01-29 2002-03-19 International Business Machines Corporation System and method for generating taxonomies with applications to content-based recommendations
US6330564B1 (en) * 1999-02-10 2001-12-11 International Business Machines Corporation System and method for automated problem isolation in systems with measurements structured as a multidimensional database
CA2371688C (en) * 1999-05-05 2008-09-09 West Publishing Company D/B/A West Group Document-classification system, method and software
EP1124189A4 (en) * 1999-06-04 2004-07-21 Seiko Epson Corp Document sorting method, document sorter, and recorded medium on which document sorting program is recorded
WO2000077609A1 (en) * 1999-06-14 2000-12-21 The Thomson Corporation System for converting data to a markup language
US6381594B1 (en) * 1999-07-12 2002-04-30 Yahoo! Inc. System and method for personalized information filtering and alert generation
US6535865B1 (en) * 1999-07-14 2003-03-18 Hewlett Packard Company Automated diagnosis of printer systems using Bayesian networks
US6853950B1 (en) * 1999-07-20 2005-02-08 Newsedge Corporation System for determining changes in the relative interest of subjects
US6816857B1 (en) 1999-11-01 2004-11-09 Applied Semantics, Inc. Meaning-based advertising and document relevance determination
US6772149B1 (en) 1999-09-23 2004-08-03 Lexis-Nexis Group System and method for identifying facts and legal discussion in court case law documents
WO2001024053A3 (en) * 1999-09-28 2004-03-25 Xmlexpress Inc System and method for automatic context creation for electronic documents
US6876991B1 (en) 1999-11-08 2005-04-05 Collaborative Decision Platforms, Llc. System, method and computer program product for a collaborative decision platform
US6651059B1 (en) * 1999-11-15 2003-11-18 International Business Machines Corporation System and method for the automatic recognition of relevant terms by mining link annotations
US6980990B2 (en) * 1999-12-01 2005-12-27 Barry Fellman Internet domain name registration system
WO2001042981A3 (en) * 1999-12-07 2003-12-24 Otman Basir Natural english language search and retrieval system and method
GB0003411D0 (en) * 2000-02-15 2000-04-05 Dialog Corp The Plc Accessing data
US7428500B1 (en) * 2000-03-30 2008-09-23 Amazon. Com, Inc. Automatically identifying similar purchasing opportunities
US7120574B2 (en) * 2000-04-03 2006-10-10 Invention Machine Corporation Synonym extension of search queries with validation
US8095508B2 (en) * 2000-04-07 2012-01-10 Washington University Intelligent data storage and processing using FPGA devices
US7139743B2 (en) * 2000-04-07 2006-11-21 Washington University Associative database scanning and information retrieval using FPGA devices
US6711558B1 (en) * 2000-04-07 2004-03-23 Washington University Associative database scanning and information retrieval
US7962326B2 (en) * 2000-04-20 2011-06-14 Invention Machine Corporation Semantic answering system and method
US6701309B1 (en) 2000-04-21 2004-03-02 Lycos, Inc. Method and system for collecting related queries
JP2001337980A (en) * 2000-05-29 2001-12-07 Sony Corp Electronic program guide retrieving method and electronic program guide retrieving device
US6829601B2 (en) * 2000-10-04 2004-12-07 Hitachi, Ltd. Document management system
US6668251B1 (en) 2000-11-01 2003-12-23 Tacit Knowledge Systems, Inc. Rendering discriminator members from an initial set of result data
US6640228B1 (en) * 2000-11-10 2003-10-28 Verizon Laboratories Inc. Method for detecting incorrectly categorized data
US20050010604A1 (en) * 2001-12-05 2005-01-13 Digital Networks North America, Inc. Automatic identification of DVD title using internet technologies and fuzzy matching techniques
US20040111386A1 (en) * 2001-01-08 2004-06-10 Goldberg Jonathan M. Knowledge neighborhoods
US7043489B1 (en) 2001-02-23 2006-05-09 Kelley Hubert C Litigation-related document repository
US6823333B2 (en) * 2001-03-02 2004-11-23 The United States Of America As Represented By The Administrator Of The National Aeronautics And Space Administration System, method and apparatus for conducting a keyterm search
US6697793B2 (en) 2001-03-02 2004-02-24 The United States Of America As Represented By The Administrator Of The National Aeronautics And Space Administration System, method and apparatus for generating phrases from a database
US6741981B2 (en) 2001-03-02 2004-05-25 The United States Of America As Represented By The Administrator Of The National Aeronautics And Space Administration (Nasa) System, method and apparatus for conducting a phrase search
US6721728B2 (en) * 2001-03-02 2004-04-13 The United States Of America As Represented By The Administrator Of The National Aeronautics And Space Administration System, method and apparatus for discovering phrases in a database
US8117313B2 (en) * 2001-03-19 2012-02-14 International Business Machines Corporation System and method for adaptive formatting of image information for efficient delivery and presentation
US6820081B1 (en) 2001-03-19 2004-11-16 Attenex Corporation System and method for evaluating a structured message store for message redundancy
US8484177B2 (en) * 2001-03-21 2013-07-09 Eugene M. Lee Apparatus for and method of searching and organizing intellectual property information utilizing a field-of-search
US20030016250A1 (en) * 2001-04-02 2003-01-23 Chang Edward Y. Computer user interface for perception-based information retrieval
WO2002082224A3 (en) * 2001-04-04 2003-05-22 West Publishing Co System, method, and software for identifying historically related legal opinions
US7500017B2 (en) * 2001-04-19 2009-03-03 Microsoft Corporation Method and system for providing an XML binary format
US20020156778A1 (en) * 2001-04-24 2002-10-24 Beeferman Douglas H. Phrase-based text searching
US7552385B2 (en) * 2001-05-04 2009-06-23 International Business Machines Coporation Efficient storage mechanism for representing term occurrence in unstructured text documents
USRE46973E1 (en) 2001-05-07 2018-07-31 Ureveal, Inc. Method, system, and computer program product for concept-based multi-dimensional analysis of unstructured information
US7194483B1 (en) 2001-05-07 2007-03-20 Intelligenxia, Inc. Method, system, and computer program product for concept-based multi-dimensional analysis of unstructured information
US7627588B1 (en) 2001-05-07 2009-12-01 Ixreveal, Inc. System and method for concept based analysis of unstructured data
US6970881B1 (en) * 2001-05-07 2005-11-29 Intelligenxia, Inc. Concept-based method and system for dynamically analyzing unstructured information
US8589413B1 (en) 2002-03-01 2013-11-19 Ixreveal, Inc. Concept-based method and system for dynamically analyzing results from search engines
US7536413B1 (en) 2001-05-07 2009-05-19 Ixreveal, Inc. Concept-based categorization of unstructured objects
US6925433B2 (en) * 2001-05-09 2005-08-02 International Business Machines Corporation System and method for context-dependent probabilistic modeling of words and documents
US7269546B2 (en) * 2001-05-09 2007-09-11 International Business Machines Corporation System and method of finding documents related to other documents and of finding related words in response to a query to refine a search
US20020174111A1 (en) * 2001-05-21 2002-11-21 Panagiotis Kougiouris System and method for managing resources stored in a relational database system
US6725217B2 (en) 2001-06-20 2004-04-20 International Business Machines Corporation Method and system for knowledge repository exploration and visualization
US20030014405A1 (en) * 2001-07-09 2003-01-16 Jacob Shapiro Search engine designed for handling long queries
US7133862B2 (en) * 2001-08-13 2006-11-07 Xerox Corporation System with user directed enrichment and import/export control
US7284191B2 (en) * 2001-08-13 2007-10-16 Xerox Corporation Meta-document management system with document identifiers
US6978274B1 (en) 2001-08-31 2005-12-20 Attenex Corporation System and method for dynamically evaluating latent concepts in unstructured documents
US6778995B1 (en) 2001-08-31 2004-08-17 Attenex Corporation System and method for efficiently generating cluster groupings in a multi-dimensional concept space
US6888548B1 (en) * 2001-08-31 2005-05-03 Attenex Corporation System and method for generating a visualized data representation preserving independent variable geometric relationships
US7716330B2 (en) 2001-10-19 2010-05-11 Global Velocity, Inc. System and method for controlling transmission of data packets over an information network
US7062498B2 (en) * 2001-11-02 2006-06-13 Thomson Legal Regulatory Global Ag Systems, methods, and software for classifying text from judicial opinions and other documents
JP2003157376A (en) * 2001-11-21 2003-05-30 Ricoh Co Ltd Network system, identification information management method, server device, program and recording medium
US7333966B2 (en) * 2001-12-21 2008-02-19 Thomson Global Resources Systems, methods, and software for hyperlinking names
US6941293B1 (en) 2002-02-01 2005-09-06 Google, Inc. Methods and apparatus for determining equivalent descriptions for an information need
US20030157470A1 (en) * 2002-02-11 2003-08-21 Michael Altenhofen E-learning station and interface
US7343372B2 (en) * 2002-02-22 2008-03-11 International Business Machines Corporation Direct navigation for information retrieval
US7271804B2 (en) * 2002-02-25 2007-09-18 Attenex Corporation System and method for arranging concept clusters in thematic relationships in a two-dimensional visual display area
US20040205660A1 (en) * 2002-04-23 2004-10-14 Joe Acton System and method for generating and displaying attribute-enhanced documents
US7093023B2 (en) * 2002-05-21 2006-08-15 Washington University Methods, systems, and devices using reprogrammable hardware for high-speed processing of streaming data to find a redefinable pattern and respond thereto
JP2007524923A (en) * 2003-05-23 2007-08-30 ワシントン ユニヴァーシティー Intelligent data storage and processing using the Fpga device
US7130866B2 (en) 2002-07-30 2006-10-31 Koninklijke Philips Electronics N.V. Controlling the growth of a feature frequency profile by deleting selected frequency counts of features of events
US7711844B2 (en) * 2002-08-15 2010-05-04 Washington University Of St. Louis TCP-splitter: reliable packet monitoring methods and apparatus for high speed networks
JP2006503351A (en) * 2002-09-20 2006-01-26 ボード オブ リージェンツ ユニバーシティ オブ テキサス システム Computer program product for finding the relationship analysis of information, the system and method
WO2004036461A3 (en) * 2002-10-14 2004-11-11 Battelle Memorial Institute Information reservoir
US7085755B2 (en) * 2002-11-07 2006-08-01 Thomson Global Resources Ag Electronic document repository management and access system
US9805373B1 (en) 2002-11-19 2017-10-31 Oracle International Corporation Expertise services platform
US20050171948A1 (en) * 2002-12-11 2005-08-04 Knight William C. System and method for identifying critical features in an ordered scale space within a multi-dimensional feature space
WO2004061619A3 (en) * 2002-12-30 2005-02-03 Trace Liggett Knowledge-management systems for law firms
EP1457889A1 (en) * 2003-03-13 2004-09-15 Philips Electronics N.V. Improved fingerprint matching method and system
US7451129B2 (en) * 2003-03-31 2008-11-11 Google Inc. System and method for providing preferred language ordering of search results
US8306972B2 (en) 2003-03-31 2012-11-06 Google Inc. Ordering of search results based on language and/or country of the search results
US7917483B2 (en) 2003-04-24 2011-03-29 Affini, Inc. Search engine and method with improved relevancy, scope, and timeliness
US7451130B2 (en) * 2003-06-16 2008-11-11 Google Inc. System and method for providing preferred country biasing of search results
US20040260681A1 (en) * 2003-06-19 2004-12-23 Dvorak Joseph L. Method and system for selectively retrieving text strings
US20050005239A1 (en) * 2003-07-03 2005-01-06 Richards James L. System and method for automatic insertion of cross references in a document
US7599938B1 (en) 2003-07-11 2009-10-06 Harrison Jr Shelton E Social news gathering, prioritizing, tagging, searching, and syndication method
US7610313B2 (en) * 2003-07-25 2009-10-27 Attenex Corporation System and method for performing efficient document scoring and clustering
US8856163B2 (en) * 2003-07-28 2014-10-07 Google Inc. System and method for providing a user interface with search query broadening
US8086619B2 (en) * 2003-09-05 2011-12-27 Google Inc. System and method for providing search query refinements
US7505964B2 (en) 2003-09-12 2009-03-17 Google Inc. Methods and systems for improving a search ranking using related queries
US7231399B1 (en) 2003-11-14 2007-06-12 Google Inc. Ranking documents based on large data sets
WO2005066848A1 (en) 2003-12-31 2005-07-21 Thomson Global Resources Systems, methods, interfaces and software for automated collection and integration of entity data into online databases and professional directories
JP4995072B2 (en) * 2003-12-31 2012-08-08 トムソン ルーターズ グローバル リソーシーズ Precedents and legal briefs, litigation documents and / or other action proved documents and systems for integrating, a method, software, and interfaces
US7602785B2 (en) 2004-02-09 2009-10-13 Washington University Method and system for performing longest prefix matching for network address lookup using bloom filters
US7191175B2 (en) * 2004-02-13 2007-03-13 Attenex Corporation System and method for arranging concept clusters in thematic neighborhood relationships in a two-dimensional visual display space
US20050246308A1 (en) * 2004-03-12 2005-11-03 Barker Joel A Method of exploring (arc)
EP1738305A2 (en) * 2004-03-12 2007-01-03 Joel A. Barker Method of exploring (wheel)
US7366705B2 (en) * 2004-04-15 2008-04-29 Microsoft Corporation Clustering based text classification
US20050234973A1 (en) * 2004-04-15 2005-10-20 Microsoft Corporation Mining service requests for product support
US7289985B2 (en) 2004-04-15 2007-10-30 Microsoft Corporation Enhanced document retrieval
US7428529B2 (en) * 2004-04-15 2008-09-23 Microsoft Corporation Term suggestion for multi-sense query
US7260568B2 (en) * 2004-04-15 2007-08-21 Microsoft Corporation Verifying relevance between keywords and web site contents
US7305389B2 (en) * 2004-04-15 2007-12-04 Microsoft Corporation Content propagation for enhanced document retrieval
US7689585B2 (en) * 2004-04-15 2010-03-30 Microsoft Corporation Reinforced clustering of multi-type data objects for search term suggestion
BE1016079A6 (en) * 2004-06-17 2006-02-07 Vartec Nv A method for indexing and retrieving documents, computer program used thereby, and which information carrier is provided with the above-mentioned computer program.
JP2008504594A (en) * 2004-06-22 2008-02-14 コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ How to select a multimedia item and apparatus, portable preference storage device
JP4587163B2 (en) * 2004-07-13 2010-11-24 インターナショナル・ビジネス・マシーンズ・コーポレーションInternational Business Maschines Corporation Search system, search method, reporting systems, reporting method, and program
US7809695B2 (en) * 2004-08-23 2010-10-05 Thomson Reuters Global Resources Information retrieval systems with duplicate document detection and presentation functions
CA2577891A1 (en) * 2004-08-24 2006-03-02 Washington University Methods and systems for content detection in a reconfigurable hardware
GB0425321D0 (en) * 2004-11-17 2004-12-22 Transversal Corp Ltd An information handling system
US7533094B2 (en) * 2004-11-23 2009-05-12 Microsoft Corporation Method and system for determining similarity of items based on similarity objects and their features
US7356777B2 (en) 2005-01-26 2008-04-08 Attenex Corporation System and method for providing a dynamic user interface for a dense three-dimensional scene
US7404151B2 (en) * 2005-01-26 2008-07-22 Attenex Corporation System and method for providing a dynamic user interface for a dense three-dimensional scene
US7849062B1 (en) * 2005-03-18 2010-12-07 Beyondcore, Inc. Identifying and using critical fields in quality management
US8782087B2 (en) 2005-03-18 2014-07-15 Beyondcore, Inc. Analyzing large data sets to find deviation patterns
WO2011149608A1 (en) * 2010-05-25 2011-12-01 Beyondcore, Inc. Identifying and using critical fields in quality management
US7533088B2 (en) * 2005-05-04 2009-05-12 Microsoft Corporation Database reverse query matching
US7765214B2 (en) * 2005-05-10 2010-07-27 International Business Machines Corporation Enhancing query performance of search engines using lexical affinities
US7487147B2 (en) * 2005-07-13 2009-02-03 Sony Computer Entertainment Inc. Predictive user interface
US7788263B2 (en) * 2005-08-10 2010-08-31 Microsoft Corporation Probabilistic retrospective event detection
US8209335B2 (en) * 2005-09-20 2012-06-26 International Business Machines Corporation Extracting informative phrases from unstructured text
CA2625493C (en) * 2005-10-11 2014-12-16 Intelligenxia Inc. System, method & computer program product for concept based searching & analysis
US8572088B2 (en) * 2005-10-21 2013-10-29 Microsoft Corporation Automated rich presentation of a semantic topic
US7702629B2 (en) * 2005-12-02 2010-04-20 Exegy Incorporated Method and device for high performance regular expression pattern matching
US7676485B2 (en) * 2006-01-20 2010-03-09 Ixreveal, Inc. Method and computer program product for converting ontologies into concept semantic networks
US7954114B2 (en) 2006-01-26 2011-05-31 Exegy Incorporated Firmware socket module for FPGA-based pipeline processing
US20070179940A1 (en) * 2006-01-27 2007-08-02 Robinson Eric M System and method for formulating data search queries
WO2007103583A3 (en) * 2006-03-09 2008-05-08 Vadim Brenner Method and system for media navigation
US8943080B2 (en) * 2006-04-07 2015-01-27 University Of Southern California Systems and methods for identifying parallel documents and sentence fragments in multilingual document collections
US7636703B2 (en) * 2006-05-02 2009-12-22 Exegy Incorporated Method and apparatus for approximate pattern matching
US20080189273A1 (en) * 2006-06-07 2008-08-07 Digital Mandate, Llc System and method for utilizing advanced search and highlighting techniques for isolating subsets of relevant content data
US7921046B2 (en) * 2006-06-19 2011-04-05 Exegy Incorporated High speed processing of financial information using FPGA devices
US7840482B2 (en) * 2006-06-19 2010-11-23 Exegy Incorporated Method and system for high speed options pricing
US7996393B1 (en) * 2006-09-29 2011-08-09 Google Inc. Keywords associated with document categories
US8661029B1 (en) 2006-11-02 2014-02-25 Google Inc. Modifying search result ranking based on implicit user feedback
US8326819B2 (en) * 2006-11-13 2012-12-04 Exegy Incorporated Method and system for high performance data metatagging and data indexing using coprocessors
US7660793B2 (en) 2006-11-13 2010-02-09 Exegy Incorporated Method and system for high performance integration, processing and searching of structured and unstructured data using coprocessors
US9122674B1 (en) 2006-12-15 2015-09-01 Language Weaver, Inc. Use of annotations in statistical machine translation
US7822763B2 (en) * 2007-02-22 2010-10-26 Microsoft Corporation Synonym and similar word page search
US8938463B1 (en) 2007-03-12 2015-01-20 Google Inc. Modifying search result ranking based on implicit user feedback and a model of presentation bias
US8694374B1 (en) 2007-03-14 2014-04-08 Google Inc. Detecting click spam
US9092510B1 (en) 2007-04-30 2015-07-28 Google Inc. Modifying search result ranking based on a temporal element of user feedback
US8694511B1 (en) 2007-08-20 2014-04-08 Google Inc. Modifying search result ranking based on populations
US20090094209A1 (en) * 2007-10-05 2009-04-09 Fujitsu Limited Determining The Depths Of Words And Documents
US8909655B1 (en) 2007-10-11 2014-12-09 Google Inc. Time based ranking
US20090150906A1 (en) * 2007-12-07 2009-06-11 Sap Ag Automatic electronic discovery of heterogeneous objects for litigation
US20090161568A1 (en) * 2007-12-21 2009-06-25 Charles Kastner TCP data reassembly
US7831588B2 (en) * 2008-02-05 2010-11-09 Yahoo! Inc. Context-sensitive query expansion
US8374986B2 (en) * 2008-05-15 2013-02-12 Exegy Incorporated Method and system for accelerated stream processing
CA2730555C (en) * 2008-07-11 2017-01-03 Thomson Reuters Global Resources Systems, methods, and interfaces for researching contractual precedents
US8396865B1 (en) 2008-12-10 2013-03-12 Google Inc. Sharing search engine relevance data between corpora
EP2370946A4 (en) 2008-12-15 2012-05-30 Exegy Inc Method and apparatus for high-speed processing of financial market depth data
US8949265B2 (en) * 2009-03-05 2015-02-03 Ebay Inc. System and method to provide query linguistic service
US9009146B1 (en) 2009-04-08 2015-04-14 Google Inc. Ranking search results based on similar queries
US9245243B2 (en) * 2009-04-14 2016-01-26 Ureveal, Inc. Concept-based analysis of structured and unstructured data using concept inheritance
US8447760B1 (en) 2009-07-20 2013-05-21 Google Inc. Generating a related set of documents for an initial set of documents
US8635223B2 (en) 2009-07-28 2014-01-21 Fti Consulting, Inc. System and method for providing a classification suggestion for electronically stored information
US8990064B2 (en) 2009-07-28 2015-03-24 Language Weaver, Inc. Translating documents based on content
CA2772082A1 (en) 2009-08-24 2011-03-10 William C. Knight Generating a reference set for use during document review
US8498974B1 (en) 2009-08-31 2013-07-30 Google Inc. Refining search results
US8972391B1 (en) 2009-10-02 2015-03-03 Google Inc. Recent interest based relevance scoring
US8874555B1 (en) 2009-11-20 2014-10-28 Google Inc. Modifying scoring data based on historical changes
US8615514B1 (en) 2010-02-03 2013-12-24 Google Inc. Evaluating website properties by partitioning user feedback
US8924379B1 (en) 2010-03-05 2014-12-30 Google Inc. Temporal-based score adjustments
US8959093B1 (en) 2010-03-15 2015-02-17 Google Inc. Ranking search results based on anchors
US8429098B1 (en) 2010-04-30 2013-04-23 Global Eprocure Classification confidence estimating tool
US9623119B1 (en) 2010-06-29 2017-04-18 Google Inc. Accentuating search results
US20130159889A1 (en) * 2010-07-07 2013-06-20 Li-Wei Zheng Obtaining Rendering Co-ordinates Of Visible Text Elements
US8832083B1 (en) 2010-07-23 2014-09-09 Google Inc. Combining user feedback
JP6045505B2 (en) 2010-12-09 2016-12-14 アイピー レザボア,エルエルシー.IP Reservoir, LLC. A method and apparatus for managing the order in financial markets
US9002867B1 (en) 2010-12-30 2015-04-07 Google Inc. Modifying ranking data based on document changes
CN102646103B (en) * 2011-02-18 2016-03-16 腾讯科技(深圳)有限公司 Clustering the search term and means
US8543577B1 (en) 2011-03-02 2013-09-24 Google Inc. Cross-channel clusters of information
US9990393B2 (en) 2012-03-27 2018-06-05 Ip Reservoir, Llc Intelligent feed switch
US9075898B1 (en) * 2012-08-10 2015-07-07 Evernote Corporation Generating and ranking incremental search suggestions for personal content
US9633097B2 (en) 2012-10-23 2017-04-25 Ip Reservoir, Llc Method and apparatus for record pivoting to accelerate processing of data fields
US9633093B2 (en) 2012-10-23 2017-04-25 Ip Reservoir, Llc Method and apparatus for accelerated format translation of data in a delimited data format
US9152622B2 (en) 2012-11-26 2015-10-06 Language Weaver, Inc. Personalized machine translation via online adaptation
US9323721B1 (en) * 2013-02-27 2016-04-26 Google Inc. Quotation identification
US9183499B1 (en) 2013-04-19 2015-11-10 Google Inc. Evaluating quality based on neighbor features
US9213694B2 (en) 2013-10-10 2015-12-15 Language Weaver, Inc. Efficient online domain adaptation
US20150261850A1 (en) * 2014-03-17 2015-09-17 NLPCore LLC Corpus search systems and methods
US9747273B2 (en) 2014-08-19 2017-08-29 International Business Machines Corporation String comparison results for character strings using frequency data

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4422158A (en) * 1980-11-28 1983-12-20 System Development Corporation Method and means for interrogating a layered data base
US4554631A (en) * 1983-07-13 1985-11-19 At&T Bell Laboratories Keyword search automatic limiting method
US4843389A (en) * 1986-12-04 1989-06-27 International Business Machines Corp. Text compression and expansion method and apparatus
US4870568A (en) * 1986-06-25 1989-09-26 Thinking Machines Corporation Method for searching a database system including parallel processors
US5265065A (en) * 1991-10-08 1993-11-23 West Publishing Company Method and apparatus for information retrieval from a database by replacing domain specific stemmed phases in a natural language to create a search query
US5321833A (en) * 1990-08-29 1994-06-14 Gte Laboratories Incorporated Adaptive ranking system for information retrieval
US5335345A (en) * 1990-04-11 1994-08-02 Bell Communications Research, Inc. Dynamic query optimization using partial information

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4384329A (en) * 1980-12-19 1983-05-17 International Business Machines Corporation Retrieval of related linked linguistic expressions including synonyms and antonyms
JPH0510703B2 (en) * 1984-10-29 1993-02-10 Hitachi Ltd
US5159667A (en) * 1989-05-31 1992-10-27 Borrey Roland G Document identification by characteristics matching
US5220625A (en) * 1989-06-14 1993-06-15 Hitachi, Ltd. Information search terminal and system
JPH0675265B2 (en) * 1989-09-20 1994-09-21 インターナシヨナル・ビジネス・マシーンズ・コーポレーシヨン Information retrieval method and system
JPH03122770A (en) * 1989-10-05 1991-05-24 Ricoh Co Ltd Method for retrieving keyword associative document
US5301109A (en) * 1990-06-11 1994-04-05 Bell Communications Research, Inc. Computerized cross-language document retrieval using latent semantic indexing
US5325298A (en) * 1990-11-07 1994-06-28 Hnc, Inc. Methods for generating or revising context vectors for a plurality of word stems
US5278980A (en) * 1991-08-16 1994-01-11 Xerox Corporation Iterative technique for phrase query formation and an information retrieval system employing same

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4422158A (en) * 1980-11-28 1983-12-20 System Development Corporation Method and means for interrogating a layered data base
US4554631A (en) * 1983-07-13 1985-11-19 At&T Bell Laboratories Keyword search automatic limiting method
US4870568A (en) * 1986-06-25 1989-09-26 Thinking Machines Corporation Method for searching a database system including parallel processors
US4843389A (en) * 1986-12-04 1989-06-27 International Business Machines Corp. Text compression and expansion method and apparatus
US5335345A (en) * 1990-04-11 1994-08-02 Bell Communications Research, Inc. Dynamic query optimization using partial information
US5321833A (en) * 1990-08-29 1994-06-14 Gte Laboratories Incorporated Adaptive ranking system for information retrieval
US5265065A (en) * 1991-10-08 1993-11-23 West Publishing Company Method and apparatus for information retrieval from a database by replacing domain specific stemmed phases in a natural language to create a search query

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
PROC. ACM (SIGIR 85), 1985, BUCKLEY et al., "Optimization of Inverted Vector Searches", pages 97-110. *

Also Published As

Publication number Publication date Type
WO1994023386A2 (en) 1994-10-13 application
US5488725A (en) 1996-01-30 grant

Similar Documents

Publication Publication Date Title
Fumera et al. Spam filtering based on the analysis of text information embedded into images
US6928407B2 (en) System and method for the automatic discovery of salient segments in speech transcripts
US6181807B1 (en) Methods and related apparatus for fingerprint indexing and searching
US5732579A (en) Key having an air coil antenna and a method of construction
Suzuki Learning Bayesian belief networks based on the MDL principle: an efficient algorithm using the branch and bound technique
Augerat et al. Computational results with a branch-and-cut code for the capacitated vehicle routing problem
Mladenić Feature subset selection in text-learning
US20070005589A1 (en) Method and apparatus for document clustering and document sketching
US6778941B1 (en) Message and user attributes in a message filtering method and system
US6587850B2 (en) Method and apparatus for profile score threshold setting and updating
US7574409B2 (en) Method, apparatus, and system for clustering and classification
Gutman Asymptotically optimal classification for multiple tests with empirically observed statistics
US5329591A (en) Transmitter identification and validation system and method
US6185531B1 (en) Topic indexing method
US6766287B1 (en) System for genre-specific summarization of documents
US20060123083A1 (en) Adaptive spam message detector
van Rijsbergen et al. A test for the separation of relevant and non-relevant documents in experimental retrieval collections
Bratko et al. Spam filtering using statistical data compression models
US6304601B1 (en) Data compression apparatus
US20050114298A1 (en) System and method for indexing weighted-sequences in large databases
Littman et al. Efficient dynamic-programming updates in partially observable Markov decision processes
Dancer et al. On positive solutions of semilinear elliptic equations
Lee Measures of distributional similarity
Frank et al. Domain-specific keyphrase extraction
Stützle MAX-MIN ant system for quadratic assignment problems

Legal Events

Date Code Title Description
AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): AT BE CH DE DK ES FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN ML MR NE SN TD TG

AK Designated states

Kind code of ref document: A2

Designated state(s): AT AU BB BG BR BY CA CH CN CZ DE DK ES FI GB HU JP KP KR KZ LK LU LV MG MN MW NL NO NZ PL PT RO RU SD SE SI SK TT UA UZ VN

AK Designated states

Kind code of ref document: A3

Designated state(s): AT AU BB BG BR BY CA CH CN CZ DE DK ES FI GB HU JP KP KR KZ LK LU LV MG MN MW NL NO NZ PL PT RO RU SD SE SI SK TT UA UZ VN

AL Designated countries for regional patents

Kind code of ref document: A3

Designated state(s): AT BE CH DE DK ES FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

122 Ep: pct application non-entry in european phase
NENP Non-entry into the national phase in:

Ref country code: CA