WO2001027713A2 - Method of categorization and indexing of information - Google Patents

Method of categorization and indexing of information Download PDF

Info

Publication number
WO2001027713A2
WO2001027713A2 PCT/IN2000/000101 IN0000101W WO0127713A2 WO 2001027713 A2 WO2001027713 A2 WO 2001027713A2 IN 0000101 W IN0000101 W IN 0000101W WO 0127713 A2 WO0127713 A2 WO 0127713A2
Authority
WO
WIPO (PCT)
Prior art keywords
recited
information
uniquely identified
assigned
index
Prior art date
Application number
PCT/IN2000/000101
Other languages
French (fr)
Other versions
WO2001027713A3 (en
Inventor
Milind Kotwal
Original Assignee
Milind Kotwal
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Milind Kotwal filed Critical Milind Kotwal
Priority to AU27027/01A priority Critical patent/AU2702701A/en
Publication of WO2001027713A2 publication Critical patent/WO2001027713A2/en
Publication of WO2001027713A3 publication Critical patent/WO2001027713A3/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/31Indexing; Data structures therefor; Storage structures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • G06F16/355Class or cluster creation or modification

Definitions

  • This invention generally relates to method of organizing information and more particularly to Internet search engines and directories.
  • a typical product directory for machines may group the machines as per the types like Lathes, Milling Machines, Grinding Machines, and Shaping M/c etc.
  • Search engine like Yahoo, Excite, AltaVista, Google etc. stores and index key words extracted from the text of the document.
  • the documents are given relative ranking for a particular keyword based on the emphasis, occurrences and location of the word in the document.
  • search engine For searching desired document, user enters keywords that are likely to appear in the desired document in the search field.
  • the result is generally a list of document, which contains the entered key word.
  • the user has to browse through the documents before finding the required document.
  • the current search engines employ various techniques like ranking based on user activity, proximity grouping etc.
  • the invention disclosed here has three main components, an indexing system including a grouping system, a search system and a user interface.
  • the indexing system provides multiple fields for indexing the document. Every field, provided for indexing the document, has a defined relationship with the information contained in the indexed document.
  • the defined relationships are Information Type, Object of the Information, Source Sector Of the object of information, Source process of the object of information, Function of the object of information, Branch of knowledge, Application Or Process for which object of information is used, Relation of the information with the application, Category of the Process or application, Output of the Process Or Application, Sector of the Output.
  • the fields Information Type, Source Sector Of the object of information, Source process of the object of information, Branch of knowledge, Relation of the information with the application, Category of the Process or application, Sector of the Output are predefined and rest are user defined.
  • each unique set of field entries is defined as a distinct category under which documents are grouped. These categories are defined by a text expression generated from the field entries and joining the terms with suitable defining terms.
  • the search is carried out in two stages. In the stage one of the search, appropriate entries are made in the search fields if known, and the output is categories available in the database conforming to the search query. In second stage desired category is selected to view documents registered under it.
  • making entries in additional fields reduces the number of resultant categories and helps to locate the information quickly.
  • queries like raw materials required for manufacturing a given product or technologies for manufacturing a given product or materials going from a given sector to another given sector can be raised.
  • complex queries like alternatives for raw materials, machinery, technology etc., products going from one sector to another, etc. can be queried to the database.
  • Figure 1 Relationships of the expressions used for indexing information.
  • Table 1 Table describing fields.
  • Table 2 List of document type categories
  • the method for organising information disclosed here is useful for variety of applications, which include indexing of web pages on Internet, indexing of classified advertisements, indexing of interests to receive information by e-mail or instant messaging etc.
  • the information is organised in precisely searchable categories and stored in the database.
  • stage one the appropriate category is searched and in stage two, information stored under the category is viewed
  • the system is basically consists of a database preferably a relational database, a user interface, an information entry program and information search program.
  • the database has following tables:
  • a) Document type table This table stores standardized categories of the documents for validation.
  • Object of information table This table stores entries made in the field of object of information.
  • Branch of Knowledge table This table stores standardized branch of knowledge of the documents for validation for the purpose of validation d)
  • Function Table This table stores entries made in the field of function of the object of information.
  • Process Table This table stores entries made in the field of process names.
  • Process Category Table This table stores standardized process category of the processes for validation.
  • Process Output Table This table stores entries made in the field of process output, h) Sector Table: This table stores standardized names of sectors for validation.
  • Object to process relationship table This table stores standardized object to process relationships for validation.
  • Category Table This table stores the categories created by unique combination of entries in eleven fields, which uniquely describes the category.
  • Category to URL table This table records title of the document, URL of the document and category as described in category table.
  • Category to Classified Advertisement Table This table records title of the classified Advertisement, Classified advertisement and category as described in category table.
  • Category to email table This table records e-mail and category as described in category table.
  • the user interface is WebPages and has three main functions information registration and search, where information is WebPages or classified advertisements.
  • Accept information for registration Carry out initial validation of data, Transfer the data to registration module for information registration.
  • the user interface also communicates other related messages.
  • This program accepts information received from users through user interface and updates it to database.
  • the sequence of steps is as following: 1. Accept entries to indexing field for selection of categories
  • This program accepts user queries received through user interface and carries out search in the database.
  • the sequence of steps is as following:
  • user interface in particular is simplified to enable users to register and search a particular type of information, which is used very often, quickly. For example: Machines and equipment, Raw materials, Flats and apartments, Plots & real estate, Cars and vehicles, Tours and travels, Computers, Jobs and assignments, Representation and franchises, etc.
  • user interface is modified where necessary field entries are built up, irrelevant field entries are deleted, which provides user with minimum and relevant option.
  • the system described above can be used to find out information even if the title of the information is not known, as by selecting appropriate entries in other field relevant categories can be viewed. For example:
  • information can be searched for technology, consumables, machines and equipment etc. Also the information can be searched only describing function, sector, branch of knowledge etc.

Abstract

A method of organising information is disclosed. The system disclosed here organises the information in categories described by type of information, function of the object of information, branch of knowledge, relationship with the downstream process, process output, sector, source process and source sector. Information group is created based on entries in all the fields. The database captures information from the entries made, and helps to locate the information even when exact title of information is not known, by selectively entering the terms in the search fields.

Description

Title Of Invention:
Method Of Categorization And Indexing Of Information
Field Of The Invention:
This invention generally relates to method of organizing information and more particularly to Internet search engines and directories.
Background Of The Invention:
Computers have become a very useful tool to save information for the purpose of retrieval of the desired document as it can manage a very large collection of records. However with ever increasing collection of documents that is Internet, organisation of the information has become a very difficult task. Currently mechanisms that are used for the purpose are directories and search engines. The directories are hierarchical structures, which group the similar type of documents together in groups and subgroups. This type of structure is unable to take into consideration multiple relationships that normally exist in the documents and information.
For Example:
• A typical product directory for machines may group the machines as per the types like Lathes, Milling Machines, Grinding Machines, and Shaping M/c etc.
• These may also be organised based on application like Machines for wristwatch industry, M/c for Automobile industry, Machines for heavy industry etc., • These can also be categorized based on type of usage like large-scale production, medium scale production, workshop m/c etc.
• These can also be classifies based on type of raw material the machines are designed to work on.
• These can also be categorized based on types of controls used like Automatic, semiautomatic, manual etc.
• There can be further categorisation based on make, source, quality certification, special construction features etc.
All these relations cannot be effectively addressed in directory indexing. Search engine like Yahoo, Excite, AltaVista, Google etc. stores and index key words extracted from the text of the document. The documents are given relative ranking for a particular keyword based on the emphasis, occurrences and location of the word in the document. For searching desired document, user enters keywords that are likely to appear in the desired document in the search field. The result is generally a list of document, which contains the entered key word. The user has to browse through the documents before finding the required document. To improve precision of the document search the current search engines employ various techniques like ranking based on user activity, proximity grouping etc.
Still the search is far from satisfactory especially for technical and business information.
Summary Of The Invention:
The invention disclosed here has three main components, an indexing system including a grouping system, a search system and a user interface.
In one aspect of the invention, the indexing system provides multiple fields for indexing the document. Every field, provided for indexing the document, has a defined relationship with the information contained in the indexed document. The defined relationships are Information Type, Object of the Information, Source Sector Of the object of information, Source process of the object of information, Function of the object of information, Branch of knowledge, Application Or Process for which object of information is used, Relation of the information with the application, Category of the Process or application, Output of the Process Or Application, Sector of the Output.
In one aspect of the invention, the fields Information Type, Source Sector Of the object of information, Source process of the object of information, Branch of knowledge, Relation of the information with the application, Category of the Process or application, Sector of the Output are predefined and rest are user defined.
In one aspect of the invention, each unique set of field entries is defined as a distinct category under which documents are grouped. These categories are defined by a text expression generated from the field entries and joining the terms with suitable defining terms. In one aspect of the invention, the search is carried out in two stages. In the stage one of the search, appropriate entries are made in the search fields if known, and the output is categories available in the database conforming to the search query. In second stage desired category is selected to view documents registered under it.
In one aspect of the invention, making entries in additional fields reduces the number of resultant categories and helps to locate the information quickly.
In one aspect of the invention, queries like raw materials required for manufacturing a given product or technologies for manufacturing a given product or materials going from a given sector to another given sector can be raised.
In one aspect of the invention, complex queries like alternatives for raw materials, machinery, technology etc., products going from one sector to another, etc. can be queried to the database.
Brief Description Of Drawings and Tables:
Figure 1 : Relationships of the expressions used for indexing information.
Figure 2: General arrangement
Table 1 : Table describing fields.
Table 2: List of document type categories
Table 3: List of Branch of knowledge categories
Table 4: List of Sector categories
Table 5: List Process Categories
Table 6: List of Object to process relationships
Detailed Description Of Preferred Embodiments
The method for organising information disclosed here is useful for variety of applications, which include indexing of web pages on Internet, indexing of classified advertisements, indexing of interests to receive information by e-mail or instant messaging etc. The information is organised in precisely searchable categories and stored in the database.
In the system described here the required information is searched in two stages. In stage one the appropriate category is searched and in stage two, information stored under the category is viewed
The system is basically consists of a database preferably a relational database, a user interface, an information entry program and information search program.
A. The database:
The database has following tables:
a) Document type table: This table stores standardized categories of the documents for validation. b) Object of information table: This table stores entries made in the field of object of information. c) Branch of Knowledge table: This table stores standardized branch of knowledge of the documents for validation for the purpose of validation d) Function Table: This table stores entries made in the field of function of the object of information. e) Process Table: This table stores entries made in the field of process names. f) Process Category Table: This table stores standardized process category of the processes for validation. g) Process Output Table: This table stores entries made in the field of process output, h) Sector Table: This table stores standardized names of sectors for validation. i) Object to process relationship table: This table stores standardized object to process relationships for validation. These are basically expanded from man, money, machine, material, system, and organisations further expanded. j) Category Table: This table stores the categories created by unique combination of entries in eleven fields, which uniquely describes the category. k) Category to URL table: This table records title of the document, URL of the document and category as described in category table. I) Category to Classified Advertisement Table: This table records title of the classified Advertisement, Classified advertisement and category as described in category table. m) Category to email table: This table records e-mail and category as described in category table.
The details of standardised categories are described elsewhere in the document.
B. The User interface:
The user interface is WebPages and has three main functions information registration and search, where information is WebPages or classified advertisements.
Registration of information: Sequence of steps are as following: -
To communicate requirement for registration of information to the users, Accept information for registration, Carry out initial validation of data, Transfer the data to registration module for information registration.
Search of Information: Sequence of steps is as following: -
1. To communicate requirement for search of information to the users,
2. Accept search query from user.
3. Transfer the query to search module for search of categories. 4. Accept the categories information from the search module.
5. Display the categories information received from the search module to the user. Accept selection of categories from the users.
6. Transfer the categories selected by the user to the search module.
7. Accept search results from the search module. 8. Display search results to the users.
The user interface also communicates other related messages.
C. The Information input program:
This program accepts information received from users through user interface and updates it to database. The sequence of steps is as following: 1. Accept entries to indexing field for selection of categories
2. If all the fields are defined then save the information to the database.
3. If the user does not define all the fields, then offer categories available for registration conforming to the entries made in the fields for selection to the user. 4 Accept categories selected by the user for registration and save the information against all selected categories.
D. The information search program:
This program accepts user queries received through user interface and carries out search in the database. The sequence of steps is as following:
1 Accept entries to search field for search of categories
2 Display categories available in the database conforming to the entries made in the fields for search by the user.
3. Accept categories selected by the user for displaying the information.
4. Send information available against the categories to the user interface
The System described above, user interface in particular is simplified to enable users to register and search a particular type of information, which is used very often, quickly. For example: Machines and equipment, Raw materials, Flats and apartments, Plots & real estate, Cars and vehicles, Tours and travels, Computers, Jobs and assignments, Representation and franchises, etc. In such cases user interface is modified where necessary field entries are built up, irrelevant field entries are deleted, which provides user with minimum and relevant option.
The system described above can be used to find out information even if the title of the information is not known, as by selecting appropriate entries in other field relevant categories can be viewed. For example:
To search raw material for any product select "Product and service information" in field - document type, "Raw material" in field - object to process relationship, and name of the product in the field - process output. The output from the search query shall display all the raw materials required for manufacturing of the product along with the processes.
Similarly information can be searched for technology, consumables, machines and equipment etc. Also the information can be searched only describing function, sector, branch of knowledge etc.
The foregoing description of an implementation of the invention has been presented for purposes of illustration and description. It is not exhaustive and does not limit the invention to the precise form disclosed. Modifications and variations are possible in light of the above teachings or may be acquired from practicing of the invention. For example, the described implementation includes software but the present invention may be implemented as a combination of hardware and software or in hardware alone. The scope of the invention is defined by the claims and their equivalents.
* * * * *

Claims

Claims
1. A method for organizing information in a computer system, including an index database, a system to input information, a system to search information, and a user interface
2. The index database as recited in claim 1 , wherein the database being able to store one or more index record for every indexed document.
2.1 The system as recited in claim 2, wherein every index record is a unique category described by multiple index expressions. 2.2 The system as recited in claim 2.01 , wherein every index expression has a distinct identification within the index record.
2.3 The system as recited in claim 2.02, wherein every uniquely identified expression has a defined relationship with the information indexed by the index record.
2. 3.1 The system as recited in claim 2.03, wherein at least one of the uniquely identified index expressions is assigned to indicate the nature of the content in the indexed information.
2.3.1.1 The system as recited in claim 2.03.01 , wherein nature of the content is a standardized category to indicate the purpose of the indexed information.
2. 3.2 The system as recited in claim 2.03, wherein at least one uniquely identified index expression is assigned to store the name of the object of the information.
2. 3.3 The system as recited in claim 2.03, wherein at least one of the uniquely identified index expressions is assigned to describe function of the object of the indexed information corresponding to the index record.
2. 3.4 The system as recited in claim 2.03, wherein at least one of the uniquely identified index expression is assigned to identify downstream process of the object of the indexed information corresponding to the index record.
2. 3.5 The system as recited in claim 2.03, wherein at least one of the uniquely identified index expressions is assigned to indicate the category of the process corresponding to the index record. 2.3.5.1 The system as recited in claim 2.03.5, wherein category of the process is one of the standardized categories.
2. 3.6 The system as recited in claim 2.03, wherein at least one of the uniquely identified index expressions is assigned to indicate the relationship between object of the information and the process corresponding to the index record.
2.3.6.1 The system as recited in claim 2.03.6, wherein the relationship between object of the information and the process is one of the standardized categories.
2. 3.7 The system as recited in claim 2.03, wherein at least one of the uniquely identified index expression is assigned to describe the output from the process corresponding to the index record.
2. 3.8 The system as recited in claim 2.03, wherein at least one of the uniquely identified index expression is assigned to indicate sector to which the output from the process belongs, corresponding to the index record.
2.3.8.1 The system as recited in claim 2.03.8, wherein sector is one of the standardized categories.
2. 3.9 The system as recited in claim 2.03, wherein at least one of the uniquely identified index expression is assigned to indicate dominant branch of knowledge corresponding to the index record, for the indexed information.
2.3.9.1 The system as recited in claim 2.03.9, wherein branch of knowledge is one of the standardized categories.
2. 3.10 The system as recited in claim 2.03, wherein at least one of the uniquely identified index expression is assigned to indicate sector of the object of information.
2.3.10.1 The system as recited in claim 2.03.10, wherein sector is one of the standardized categories.
2. 3.11 The system as recited in claim 2.03, wherein at least one of the uniquely identified index expression is assigned to indicate the category of the process from which the object of the information has originated.
2.3.11.1 The system as recited in claim 2.03.11 , wherein category of the process is one of the standardized categories.
3. The system as recited in claim 2.01 , wherein the expression describes trade and/ or technical name/s of the object, function, process and process output, along with denominations of group, subgroups to which it belongs, in a sequence, and with mathematical operators, indicating the group, subgroup & synonym relationships, wherever appropriate.
4. The system as recited in Claim 1 , wherein the user interface is provided to accept the indexing expressions as required for claims 2.03.1 to 2.03.11.
5. The system as recited in Claim 1 , wherein the search system comprises of a query module and a user interface. 5.1 The system as recited in claim 5, wherein the query module is capable to receive and to process a query.
5.2 The system as recited in claim 5.01, wherein every query has multiple uniquely identified expressions and where some of the expressions may be null.
5.3 The system as recited in claim 5.02, wherein every uniquely identified expression has a defined relationship with the desired information where identification of expression and corresponding relationship being the same as that in the index being queried.
5. 3.1 The system as recited in claim 5.03, wherein at least one of the uniquely identified query expression is assigned to indicate the nature of the content in the desired document.
5.3.1.1 The system as recited in claim 5.03.01 , wherein nature of the content is a standardized category to indicate the purpose of the desired information.
5. 3.2 The system as recited in claim 5.03, wherein at least one uniquely identified index expression is assigned to indicate the name of the object of the desired information. 5. 3.3 The system as recited in claim 5.03, wherein at least one of the uniquely identified index expressions is assigned to describe function of the object of the desired information.
5. 3.4 The system as recited in claim 5.03, wherein at least one of the uniquely identified index expression is assigned to identify downstream process of the object of the desired information.
5. 3.5 The system as recited in claim 5.03, wherein at least one of the uniquely identified index expressions is assigned to indicate the category of the process for which the information is required.
5.3.5.1 The system as recited in claim 5.03.5, wherein category of the process is one of the standardized categories.
5. 3.6 The system as recited in claim 5.03, wherein at least one of the uniquely identified index expressions is assigned to indicate the relationship between object of the information and the process corresponding to the desired information.
5.3.6.1 The system as recited in claim 5.03.6, wherein the relationship between object of the information and the process is one of the standardized categories.
5. 3.7 The system as recited in claim 5.03, wherein at least one of the uniquely identified index expression is assigned to describe the output from the process corresponding to the desired information.
5. 3.8 The system as recited in claim 5.03, wherein at least one of the uniquely identified index expression is assigned to indicate sector to which the output from the process belongs, corresponding to the desired information.
5.3.8.1 The system as recited in claim 5.03.8, wherein sector is one of the standardized categories.
5. 3.9 The system as recited in claim 5.03, wherein at least one of the uniquely identified index expression is assigned to indicate dominant branch of knowledge corresponding to the desired information.
5.3.9.1 The system as recited in claim 5.03.9, wherein branch of knowledge is one of the standardized categories.
5. 3.10 The system as recited in claim 5.03, wherein at least one of the uniquely identified index expression is assigned to indicate sector of the object of the desired information.
5.3.10.1 The system as recited in claim 5.03.10, wherein sector is one of the standardized categories.
5. 3.11 The system as recited in claim 5.03, wherein at least one of the uniquely identified index expression is assigned to indicate the category of the process from which the object of the desired information has originated.
5.3.11.1 The system as recited in claim 5.03.11 , wherein category of the process is one of the standardized categories.
PCT/IN2000/000101 1999-10-15 2000-10-13 Method of categorization and indexing of information WO2001027713A2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU27027/01A AU2702701A (en) 1999-10-15 2000-10-13 Method of categorization and indexing of information

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
IN708BO1999 1999-10-15
IN708/BOM/99 1999-10-15

Publications (2)

Publication Number Publication Date
WO2001027713A2 true WO2001027713A2 (en) 2001-04-19
WO2001027713A3 WO2001027713A3 (en) 2001-12-27

Family

ID=11080264

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IN2000/000101 WO2001027713A2 (en) 1999-10-15 2000-10-13 Method of categorization and indexing of information

Country Status (2)

Country Link
AU (1) AU2702701A (en)
WO (1) WO2001027713A2 (en)

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
NL1025561C2 (en) * 2004-02-24 2005-08-29 Stichting Bouwradius Onderwijs Data retrieval system for e.g. construction projects, organises information, text and media into classes grouped according to user inputted criteria
US7464090B2 (en) * 2006-01-27 2008-12-09 Google Inc. Object categorization for information extraction
US7769579B2 (en) 2005-05-31 2010-08-03 Google Inc. Learning facts from semi-structured text
US7831545B1 (en) 2005-05-31 2010-11-09 Google Inc. Identifying the unifying subject of a set of facts
US7966291B1 (en) 2007-06-26 2011-06-21 Google Inc. Fact-based object merging
US7991797B2 (en) 2006-02-17 2011-08-02 Google Inc. ID persistence through normalization
US8244689B2 (en) 2006-02-17 2012-08-14 Google Inc. Attribute entropy as a signal in object normalization
US8650175B2 (en) 2005-03-31 2014-02-11 Google Inc. User interface for facts query engine with snippets from information sources that include query terms and answer terms
US8682891B2 (en) 2006-02-17 2014-03-25 Google Inc. Automatic object reference identification and linking in a browseable fact repository
US8700568B2 (en) 2006-02-17 2014-04-15 Google Inc. Entity normalization via name normalization
US8738643B1 (en) 2007-08-02 2014-05-27 Google Inc. Learning synonymous object names from anchor texts
US8751498B2 (en) 2006-10-20 2014-06-10 Google Inc. Finding and disambiguating references to entities on web pages
US8996470B1 (en) 2005-05-31 2015-03-31 Google Inc. System for ensuring the internal consistency of a fact repository
US9892132B2 (en) 2007-03-14 2018-02-13 Google Llc Determining geographic locations for place names in a fact repository

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9208229B2 (en) 2005-03-31 2015-12-08 Google Inc. Anchor text summarization for corroboration

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5678046A (en) * 1994-11-18 1997-10-14 The Chase Manhattan Bank, N.A. Method and apparatus for distributing files on a file storage device
US5974396A (en) * 1993-02-23 1999-10-26 Moore Business Forms, Inc. Method and system for gathering and analyzing consumer purchasing information based on product and consumer clustering relationships

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5974396A (en) * 1993-02-23 1999-10-26 Moore Business Forms, Inc. Method and system for gathering and analyzing consumer purchasing information based on product and consumer clustering relationships
US5678046A (en) * 1994-11-18 1997-10-14 The Chase Manhattan Bank, N.A. Method and apparatus for distributing files on a file storage device

Cited By (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
NL1025561C2 (en) * 2004-02-24 2005-08-29 Stichting Bouwradius Onderwijs Data retrieval system for e.g. construction projects, organises information, text and media into classes grouped according to user inputted criteria
US8650175B2 (en) 2005-03-31 2014-02-11 Google Inc. User interface for facts query engine with snippets from information sources that include query terms and answer terms
US7769579B2 (en) 2005-05-31 2010-08-03 Google Inc. Learning facts from semi-structured text
US7831545B1 (en) 2005-05-31 2010-11-09 Google Inc. Identifying the unifying subject of a set of facts
US9558186B2 (en) 2005-05-31 2017-01-31 Google Inc. Unsupervised extraction of facts
US8996470B1 (en) 2005-05-31 2015-03-31 Google Inc. System for ensuring the internal consistency of a fact repository
US7464090B2 (en) * 2006-01-27 2008-12-09 Google Inc. Object categorization for information extraction
US9092495B2 (en) 2006-01-27 2015-07-28 Google Inc. Automatic object reference identification and linking in a browseable fact repository
US8682891B2 (en) 2006-02-17 2014-03-25 Google Inc. Automatic object reference identification and linking in a browseable fact repository
US8700568B2 (en) 2006-02-17 2014-04-15 Google Inc. Entity normalization via name normalization
US8244689B2 (en) 2006-02-17 2012-08-14 Google Inc. Attribute entropy as a signal in object normalization
US7991797B2 (en) 2006-02-17 2011-08-02 Google Inc. ID persistence through normalization
US9710549B2 (en) 2006-02-17 2017-07-18 Google Inc. Entity normalization via name normalization
US10223406B2 (en) 2006-02-17 2019-03-05 Google Llc Entity normalization via name normalization
US8751498B2 (en) 2006-10-20 2014-06-10 Google Inc. Finding and disambiguating references to entities on web pages
US9760570B2 (en) 2006-10-20 2017-09-12 Google Inc. Finding and disambiguating references to entities on web pages
US9892132B2 (en) 2007-03-14 2018-02-13 Google Llc Determining geographic locations for place names in a fact repository
US7966291B1 (en) 2007-06-26 2011-06-21 Google Inc. Fact-based object merging
US8738643B1 (en) 2007-08-02 2014-05-27 Google Inc. Learning synonymous object names from anchor texts

Also Published As

Publication number Publication date
AU2702701A (en) 2001-04-23
WO2001027713A3 (en) 2001-12-27

Similar Documents

Publication Publication Date Title
US7058661B2 (en) System and method for electronically managing discovery pleading information
EP0979470B1 (en) Method and apparatus for searching a database of records
CN100375090C (en) Retrieving matching documents by queries in any national language
US6694331B2 (en) Apparatus for and method of searching and organizing intellectual property information utilizing a classification system
US20060129538A1 (en) Text search quality by exploiting organizational information
WO2001027713A2 (en) Method of categorization and indexing of information
US20060161545A1 (en) Method and apparatus for ordering items within datasets
US20080259084A1 (en) Method and apparatus for organizing data sources
US20070271228A1 (en) Documentary search procedure in a distributed system
CN102955844B (en) Search Results is presented based on theme version
US20020186240A1 (en) System and method for providing data for decision support
WO2002056206A1 (en) System for searching collections of linked objects
CA2459182C (en) A method for automatically indexing documents
US20080147588A1 (en) Method for discovering data artifacts in an on-line data object
AU2002331728A1 (en) A method for automatically indexing documents
US8463763B2 (en) Method and tool for searching in several data sources for a selected community of users
US20020103794A1 (en) System and method for processing database queries
Wu et al. Collective taxonomizing: A collaborative approach to organizing document repositories
CN110928978A (en) Standard literature classification retrieval method
CN107291951B (en) Data processing method, device, storage medium and processor
WO1998049632A1 (en) System and method for entity-based data retrieval
WO2003034283A1 (en) Process and system for matching products and markets
CN101609461A (en) A kind of space querying system of personal core data and method based on user characteristics
Gupta Evaluation of next generation online public access catalogue (OPAC) features in library management system
US20190026370A1 (en) System and Method for Categorizing Web Search Results

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
AK Designated states

Kind code of ref document: A3

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A3

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

122 Ep: pct application non-entry in european phase
NENP Non-entry into the national phase in:

Ref country code: JP