US20050050037A1 - Intranet mediator - Google Patents
Intranet mediator Download PDFInfo
- Publication number
- US20050050037A1 US20050050037A1 US10/960,804 US96080404A US2005050037A1 US 20050050037 A1 US20050050037 A1 US 20050050037A1 US 96080404 A US96080404 A US 96080404A US 2005050037 A1 US2005050037 A1 US 2005050037A1
- Authority
- US
- United States
- Prior art keywords
- data
- reconciled
- answer
- query
- search
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 238000000034 method Methods 0.000 claims description 27
- 238000010586 diagram Methods 0.000 description 4
- 238000013500 data storage Methods 0.000 description 3
- 230000010354 integration Effects 0.000 description 3
- 238000000605 extraction Methods 0.000 description 2
- 238000009472 formulation Methods 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 230000003044 adaptive effect Effects 0.000 description 1
- 238000004220 aggregation Methods 0.000 description 1
- 230000002776 aggregation Effects 0.000 description 1
- 238000004140 cleaning Methods 0.000 description 1
- 238000004883 computer application Methods 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 230000003111 delayed effect Effects 0.000 description 1
- 230000014759 maintenance of location Effects 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/903—Querying
- G06F16/9032—Query formulation
- G06F16/90332—Natural language query formulation or dialogue systems
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99933—Query processing, i.e. searching
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99939—Privileged access
Definitions
- the present invention relates to a system whereby a direct answer may be given to a specific natural language query through a convenient search of reconciled structured data repositories and unstructured data sources.
- One type of digital data gathering commonly in use is concerned with the retrieval of information from unstructured data sources, such as text documents, where each element of the data is not individually defined.
- the user will enter “search terms” as a data query and the unstructured data will merely be searched for occurrence of these terms.
- Results of, i.e., the response from, such a search may return the text, i.e., the data, or may, e.g., in a World Wide Web search, only return the location, or site, of the data.
- the user would then need to read the text or go to each site and locate the occurrence of the search term, which may, or may not, be relevant to an actual question which the user wants answered. This time consuming practice is commonly known as “surfing”.
- the second type of digital data gathering commonly in use is the structured data source search, where structured data within one or more specific structured data sources or reconciled structured data repositories, usually privately owned and accessed, are searched to return a specific answer.
- “Reconciled” as used herein refers to reconciling data to a common schema of a structured data storage entity.
- a reconciled structured data repository may contain unstructured data, e.g., pages of text, video clips, etc., and such unstructured data are reconciled by known techniques, including those listed above, to provide a structured means by which to search, access, and potentially retrieve them within the reconciled structured data repositories.
- a “data warehouse” can be at least one, or a collection of, reconciled structured data repositories of data.
- a “physical data warehouse” is also a repository of integrated, proprietarily loaded and controlled data sources assumed to be under the control of a single entity.
- the “physical data warehouse” may be constructed according to extraction/transform/load (ETL) techniques where the selected data are first extracted, i.e., selected for retention and cleaned up to retain those portions necessary for the common schema, then transformed, i.e., integrated within the structure of the reconciled structured data repository or physical data warehouse format by reconciling the data to the common schema of the structured data repository, and then loaded under the common schema into the reconciled structured data repository.
- ETL extraction/transform/load
- a reconciled structured data repository may also store unstructured data that have been reconciled to the common schema of the reconciled structured data repository by having or pointer to identifying metadata for the unstructured data.
- the structured and unstructured data within the reconciled structured data repository can be retrieved by structured search schemes. While some art references predating the current nomenclature and usage have labeled a mere aggregation or collection of unstructured databases searched by information retrieval means and terms as “data warehouses,” such unstructured databases do not contain reconciled structured data repositories, do not access their data in a structured manner, and are not “data warehouses” as the term is now known to those in the art because the data therein is not transformed to a common schema.
- U.S. Pat. No. 6,078,924 to Ainsbury et al. illustrates a technique of digital data gathering. According to this patent, the user is allowed to aggregate data found in the user's previous searches on a specific topic into a central file. This central file can then be controlled from a commercial desktop computer application to facilitate searching of the data.
- What is needed in the art is a system whereby the user can take advantage of both information retrieval and reconciled structured data repository data types of digital data gathering concurrently to provide a direct answer to a specific question, and preferably provide further context for that answer.
- the query be accepted in a natural language format whereby the user needs no special skills in query formulation.
- the query be intelligently parsed so as to weight the relevant parts of the query and that synonyms of the natural language query be provided to give a more thorough search and accurate answer.
- the answers, and any related information be limited in number to only that required or most relevant to the query.
- Query refers herein to any form of searchable subject matter, and may include query tokens, or elements of a total query, whether aggregate or separate, unless otherwise limited or defined by the context of the disclosure.
- Data refers herein to any form of digitally stored information, unless otherwise limited or defined by the context of the disclosure.
- Direct answer and “most likely answer” are used interchangeably herein and refer to the best available answer, whether factually based, referencing additional data, or refusal to answer, based upon the results of the data retrieved by the searches of the intranet mediator.
- a direct answer is to be distinguished from an information retrieval type “response” which merely indicates the presence of a document containing the queried term.
- “Common schema” means an organizational definition of structured data shared by multiple data sources.
- Data source means a logically, independently operating data storage, search, retrieval, and manipulation system.
- the system may store data of any digital form.
- the intranet mediator of the present invention obtains for the user a direct, or most likely, answer to a natural language question.
- the intranet mediator provides the user with a search of both structured data sources and a repository of unstructured data sources.
- the reconciled structured data repository data sources will preferably be integrated to provide for ease of searching a rapid attainment of search results.
- the reconciled structured data repository is previously constructed, e.g., by the corporate owner thereof, to integrate and abstract its collections of data sources with common schema, and providing meta-data within the reconciled structured data repository. The meta-data will give a global overview of the reconciled structured data repositories thus making the repositories easily searchable.
- the unstructured data sources may be data such as internal video, audio, or document files stored within the private databases of the owner, or they may be public sources, e.g., collections of documents available via the worldwide web or the like, or both.
- the unstructured data sources may be provided with a meta-data repository also.
- the intranet mediator allows the user to input, and obtain an answer to, a natural language question without having to read text or surf the data sources in which the answer might be contained, or without being limited to one specific factual item return.
- the intranet mediator operates in part on the supposition that most answers to a businessperson's questions are contained within privately owned data repositories of the business that were likely to be already integrated into a reconciled structured data repository. Selection of the most relevant data sources prior to searching is therefore possible.
- the direct answer selection to the user's question is accordingly weighted to the search answer from these reconciled structured data repositories.
- Searching of the unstructured data sources may be performed automatically, or upon determination that answers are not likely from the structured data sources, or if additional context surrounding either the query or the answer is justified or desired. A direct, or most likely, answer is then given to the user in answer to the input question.
- the intranet mediator may also display a list of related data or data sources where additional information relevant to the user's question may be found.
- the intranet mediator desirably includes the logical functionality of a natural language question input module; a parser module for assembling a search query from the natural language question; a query expander module; an unstructured data source manager operably connected to at least one unstructured data source; a data source selection module operably connected to a meta-data repository; a dispatcher module interfacing between the data source selection module and both a structured data source manager and the unstructured data source manager, the structured data source manager being operably connected to a reconciled structured data repository; a results manager module operably connected to both of the data source managers; and an answer output module.
- FIG. 1 is a block diagram representing the construction of a reconciled structured data repository such as might be used according to one embodiment of the current invention.
- FIG. 2 is a block diagram representing the architecture of an intranet mediator according to one embodiment of the current invention.
- raw data intended for placement into a reconciled structured data repository 13 may placed in structured form as at box 3 , or may be left unstructured as at line 5 . Both structured and unstructured data are then reconciled, as at box 7 , to the common schema of the reconciled structured data repository 13 and then loaded, i.e., stored, as at line 9 into a reconciled structured data repository 13 for future search, accessing, and retrieval, such as discussed in conjunction with the present invention below.
- an intranet mediator 11 comprises means for the searching of both structured data sources, such as a multiplicity of databases integrated into a reconciled structured data repository 13 , e.g., a data warehouse, and unstructured data sources, collectively 15 , whether in the owner's proprietary databases or within a public collection such as the Internet, to arrive at a direct answer to a natural language question input by the user.
- structured data sources such as a multiplicity of databases integrated into a reconciled structured data repository 13 , e.g., a data warehouse, and unstructured data sources, collectively 15 , whether in the owner's proprietary databases or within a public collection such as the Internet, to arrive at a direct answer to a natural language question input by the user.
- a user interface 17 such as a known graphical user interface, accepts input of a user question in natural language format and may allow the user to manually select data sources if desired. As indicated by the two boxes in FIG. 2 labeled 17 , the user interface 17 is preferably a part of an input/output user interface that is also tasked with displaying an answer to the user question.
- a parser module 19 then accepts the natural language question input by the user, as at line 20 , parsing and assembling the relevant concepts of the natural language question into a query, or queries, of weighted search tokens, hereinafter referred to simply as a query, and also eliminating the irrelevant, or non-indicative, words of the natural language question from use as search tokens. For example, in the query: “What are the three best Sushi restaurants in Chicago?,” words irrelevant to data retrieval such as “are” and “the” may be eliminated from use as tokens by the parser module.
- Various techniques such as grammar matching, lattices, partial lattices, etc. for accomplishing the tasks of the parser module 19 are available and considered within the skill of the person having ordinary skill in the art.
- a query expander module 21 is operably connected to the parser module 19 , as at line 22 , for receiving the parsed query tokens and obtaining additional terms synonymous or analogous to the tokens and expanding the query with additional tokens if desirable to expand the chance of a return of information relevant to the query.
- Various techniques such as thesauri, dictionaries, term expansion, stemming, phrase generation, and the like are available to accomplish this query, or token, expansion and are considered within the skill of the person having ordinary skill in the art.
- the query expander module 21 then passes the tokens, preferably including the expanded tokens, to an unstructured data source manager 23 , as at line 24 , for a limited use specialty search to acquire additional synonyms, or terms analogous to, the query tokens and expanded query tokens, if any.
- the unstructured data source manager 23 initiates a search, as at lines 26 , of one or more repositories of unstructured data sources 15 , hereinafter referred to in the singular for ease of explanation, and obtains the search results therefrom, as at lines 28 , with any acceptable method of information retrieval.
- Unstructured data sources may include private or public data repositories, e.g., a private video archive and the World Wide Web, respectively.
- the parser module 19 will then pass the expanded query to the data source selection module 25 , as at line 32 .
- the data source selection module 25 determines the most likely data source or sources to contain an answer for each token of the expanded query; e.g., adaptive, heuristic, hard-coded, user-directed, standard, handwritten, or data-mined inquiry techniques through its connection, as at line 34 , to a meta-data repository 27 abstracting the contents of a data warehouse 13 of structured data sources. Meta-data for the unstructured data repository 15 may further be contained in the meta-data repository 27 in certain embodiments of the present invention.
- the meta-data repository, or an operable link to such a repository may also be considered as a part of the intranet mediator according to certain embodiments of the present invention.
- the data source selection module 25 Upon selection of the appropriately likely data sources to return an answer for each token of the expanded query, the data source selection module 25 will pass the data source selections and accompanying search tokens onto the dispatcher module 31 , as at line 36 . The dispatcher module 31 will then route each token to be searched to the appropriate data source manager 33 , 23 . Routing will commonly occur via an intranet although other networks including the Internet or simply direct routing are also within the scope of this invention, as at line 38 to the structured data source manager 33 or to the unstructured data source manager 23 via line 40 . The structured data source manager 33 and the unstructured data source manager 23 will then interface most efficiently with the reconciled structured data repository 13 and unstructured data sources 15 , via lines 42 and 26 , respectively, according to the individual requirements of the data sources.
- the reconciled structured data repository 13 may be a physical data warehouse of integrated and structured private data sources, and may be constructed according to known techniques such as extraction/transform/load (ETL) techniques.
- ETL extraction/transform/load
- One of ordinary skill in the art will recognize that other reconciled structured data repository development techniques are likewise within the scope of this invention.
- the reconciled structured data repository 13 or an operable link to such a reconciled structured data repository, may also be considered a part of the intranet mediator 11 according to some embodiments of the present invention.
- Search results are returned to the structured and unstructured data source managers, 33 , 23 respectively, via lines 44 , 28 , respectively, for additional processing if any, and then forwarded via lines 46 , 48 respectively to the results manager module 51 .
- the results manager module 51 desirably accepts and consolidates the results of the structured and unstructured data source searches for each search token and integrates the results of the searches. Duplicate results are also eliminated as appropriate.
- the results manager module 51 then weights or ranks the results and at least one most likely answer is selected. In the presently preferred embodiment, if an answer is returned from the structured data source, for example, the data warehouse, it is published as the most likely answer. It will be understood that selection of the direct answer from the unstructured data source is within the scope of this invention.
- the associated data links, extracted text, or the like, from the unstructured data source search may be further selected and ranked.
- An answer module portion 53 of the user interface 17 may then receive answer data via line 50 and may then perform any formatting necessary for the display via the user interface 17 , of the most likely answer and, if desired, the associated data or data links available as additional context for the direct answer.
- the user interface 17 of the described embodiment will wait for some specified time interval or specified number of results to accumulate and then display the currently ranked results.
- the user interface 17 will further continue receiving results and ranking them in conjunction with the total accumulated results to capture answers which may have been delayed for various reasons.
- the user is then preferably presented with an option to activate a “get next” display command, and the now possibly revised rankings or additionally aggregated results are displayed.
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Mathematical Physics (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Artificial Intelligence (AREA)
- Computational Linguistics (AREA)
- Data Mining & Analysis (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
An intranet mediator for obtaining direct answers to natural language questions allowing users to search both reconciled structured data repositories and unstructured data sources. The intranet mediator allows the user to obtain an answer to a natural language question without having to surf the data sources in which the answer might be contained, or without being limited to one specific factual item return. The intranet mediator operates on the supposition that most answers to business queries are contained within data sources that have been integrated into the reconciled structured data repository thereby having common schema and known contents. Preselection of the most relevant data source(s) is thus possible before query output. Searches of unstructured data can also be performed for additional context surrounding either the question or the answer. A direct answer is given in answer to the question. If desired, the intranet mediator may also display a list of data sources where additional relevant information may be found.
Description
- 1. Field of the Invention
- The present invention relates to a system whereby a direct answer may be given to a specific natural language query through a convenient search of reconciled structured data repositories and unstructured data sources.
- 2. Discussion of the Related Art
- There are two types of digital data gathering commonly in use. One, information retrieval, is concerned with the retrieval of information from unstructured data sources, such as text documents, where each element of the data is not individually defined. The user will enter “search terms” as a data query and the unstructured data will merely be searched for occurrence of these terms. Results of, i.e., the response from, such a search may return the text, i.e., the data, or may, e.g., in a World Wide Web search, only return the location, or site, of the data. The user would then need to read the text or go to each site and locate the occurrence of the search term, which may, or may not, be relevant to an actual question which the user wants answered. This time consuming practice is commonly known as “surfing”.
- Information retrieval is thus not geared to efficiently provide a specific answer to a specific question. Attempts to alleviate this problem were the subject of U.S. Pat. No. 6,167,370 to Tsourikov et al., which suggests giving a summary of text findings as a response to a user query. But, for example, when a user wants to know “What are the three best Sushi restaurants in Chicago?” the user does not necessarily care to browse through text summaries, or restaurant guide web sites, which are the likely search results of a known information retrieval search. The surfing in this context may be particularly tedious if the query is submitted to the data sources as an equally weighted string of tokens. For example, where “Chicago” is equally weighted with “Sushi” when figured into the search results, a user may wade through scores of restaurant web sites having nothing to do with Sushi eateries. Avoidance of this problem may require the user to know Boolean logic or other specific search strategy formats, and individually structure each search. The user would most often prefer just a list of three Sushi restaurants in Chicago in answer to this natural language question.
- The second type of digital data gathering commonly in use is the structured data source search, where structured data within one or more specific structured data sources or reconciled structured data repositories, usually privately owned and accessed, are searched to return a specific answer. “Reconciled” as used herein refers to reconciling data to a common schema of a structured data storage entity. Techniques such as extracting, cleaning, transforming, and loading the data are used to place the data into a data repository with a common schema, here the “reconciled structured data repository.” As is known, a reconciled structured data repository may contain unstructured data, e.g., pages of text, video clips, etc., and such unstructured data are reconciled by known techniques, including those listed above, to provide a structured means by which to search, access, and potentially retrieve them within the reconciled structured data repositories.
- One known structured data storage technique for use with reconciled structured data repositories is a “data warehouse,” an example of which is described in Hellerstein et al., Independent, Open Enterprise Data Integration, Bulletin of the IEEE Computer Society Technical Committee on Data Engineering, 1999. As will be known to those in the art, a “physical data warehouse” can be at least one, or a collection of, reconciled structured data repositories of data. Thus a “physical data warehouse” is also a repository of integrated, proprietarily loaded and controlled data sources assumed to be under the control of a single entity. As per the Hellerstein article, the “physical data warehouse” may be constructed according to extraction/transform/load (ETL) techniques where the selected data are first extracted, i.e., selected for retention and cleaned up to retain those portions necessary for the common schema, then transformed, i.e., integrated within the structure of the reconciled structured data repository or physical data warehouse format by reconciling the data to the common schema of the structured data repository, and then loaded under the common schema into the reconciled structured data repository. As will be known to those of ordinary skill in the art, a reconciled structured data repository may also store unstructured data that have been reconciled to the common schema of the reconciled structured data repository by having or pointer to identifying metadata for the unstructured data. Thus, the structured and unstructured data within the reconciled structured data repository can be retrieved by structured search schemes. While some art references predating the current nomenclature and usage have labeled a mere aggregation or collection of unstructured databases searched by information retrieval means and terms as “data warehouses,” such unstructured databases do not contain reconciled structured data repositories, do not access their data in a structured manner, and are not “data warehouses” as the term is now known to those in the art because the data therein is not transformed to a common schema.
- In the past, the nonintegrated data sources were required to be searched one data source at a time. Integration of their individual data sources is generally performed by private business to enable answers to queries whose answers require more than one factual component. This integration is expensive and can remain underutilized for reasons such as an arcane nature of query formulation or because extensive data source knowledge may be required of the user to make a rational search selection. That is, the user may need to know where to look and how to look to expect a relevant answer. Concurrent searching of nonintegrated structured data sources, and merging of their results, to solve some of these problems, was the subject of U.S. Pat. No. 5,995,961 to Levy, et al.
- Further, additional information, beyond the specific factual components of a query, cannot be provided from the results of a data source search. For example, assume the data source user, or searcher, wishes to know the building on the I.I.T. campus with the largest number of rooms. The user cannot expect a picture of the building, or a link to a picture of the building, returned with the search results, even though the user might wish to see such a picture.
- U.S. Pat. No. 6,078,924 to Ainsbury et al. illustrates a technique of digital data gathering. According to this patent, the user is allowed to aggregate data found in the user's previous searches on a specific topic into a central file. This central file can then be controlled from a commercial desktop computer application to facilitate searching of the data.
- What is needed in the art is a system whereby the user can take advantage of both information retrieval and reconciled structured data repository data types of digital data gathering concurrently to provide a direct answer to a specific question, and preferably provide further context for that answer. It is also desirable that the query be accepted in a natural language format whereby the user needs no special skills in query formulation. It is further desirable that the query be intelligently parsed so as to weight the relevant parts of the query and that synonyms of the natural language query be provided to give a more thorough search and accurate answer. It is further desirable that the answers, and any related information, be limited in number to only that required or most relevant to the query.
- “Query” refers herein to any form of searchable subject matter, and may include query tokens, or elements of a total query, whether aggregate or separate, unless otherwise limited or defined by the context of the disclosure.
- “Data” refers herein to any form of digitally stored information, unless otherwise limited or defined by the context of the disclosure.
- “Concurrent” means within the time frame between question output and answer display and does not necessarily imply that searches happen simultaneously.
- “Direct answer” and “most likely answer” are used interchangeably herein and refer to the best available answer, whether factually based, referencing additional data, or refusal to answer, based upon the results of the data retrieved by the searches of the intranet mediator. A direct answer is to be distinguished from an information retrieval type “response” which merely indicates the presence of a document containing the queried term.
- “Common schema” means an organizational definition of structured data shared by multiple data sources.
- “Data source” means a logically, independently operating data storage, search, retrieval, and manipulation system. The system may store data of any digital form.
- The intranet mediator of the present invention obtains for the user a direct, or most likely, answer to a natural language question. The intranet mediator provides the user with a search of both structured data sources and a repository of unstructured data sources. The reconciled structured data repository data sources will preferably be integrated to provide for ease of searching a rapid attainment of search results. The reconciled structured data repository is previously constructed, e.g., by the corporate owner thereof, to integrate and abstract its collections of data sources with common schema, and providing meta-data within the reconciled structured data repository. The meta-data will give a global overview of the reconciled structured data repositories thus making the repositories easily searchable. The unstructured data sources may be data such as internal video, audio, or document files stored within the private databases of the owner, or they may be public sources, e.g., collections of documents available via the worldwide web or the like, or both. The unstructured data sources may be provided with a meta-data repository also.
- The intranet mediator allows the user to input, and obtain an answer to, a natural language question without having to read text or surf the data sources in which the answer might be contained, or without being limited to one specific factual item return. The intranet mediator operates in part on the supposition that most answers to a businessperson's questions are contained within privately owned data repositories of the business that were likely to be already integrated into a reconciled structured data repository. Selection of the most relevant data sources prior to searching is therefore possible. The direct answer selection to the user's question is accordingly weighted to the search answer from these reconciled structured data repositories. Searching of the unstructured data sources may be performed automatically, or upon determination that answers are not likely from the structured data sources, or if additional context surrounding either the query or the answer is justified or desired. A direct, or most likely, answer is then given to the user in answer to the input question. If desired, the intranet mediator may also display a list of related data or data sources where additional information relevant to the user's question may be found.
- The intranet mediator desirably includes the logical functionality of a natural language question input module; a parser module for assembling a search query from the natural language question; a query expander module; an unstructured data source manager operably connected to at least one unstructured data source; a data source selection module operably connected to a meta-data repository; a dispatcher module interfacing between the data source selection module and both a structured data source manager and the unstructured data source manager, the structured data source manager being operably connected to a reconciled structured data repository; a results manager module operably connected to both of the data source managers; and an answer output module.
- Discussion of the modules will be given herein with respect to specific functional tasks or task groupings that are in some cases arbitrarily assigned to the specific modules for explanatory purposes. It will be appreciated by the person having ordinary skill in the art that an intranet mediator according to the present invention may be arranged in a variety of ways, or that functional tasks may be grouped according to other nomenclature or architecture than is used herein without doing violence to the spirit of the present invention.
-
FIG. 1 is a block diagram representing the construction of a reconciled structured data repository such as might be used according to one embodiment of the current invention. -
FIG. 2 is a block diagram representing the architecture of an intranet mediator according to one embodiment of the current invention. - Referencing the block diagram of
FIG. 1 , as is known, raw data intended for placement into a reconciledstructured data repository 13 may placed in structured form as atbox 3, or may be left unstructured as atline 5. Both structured and unstructured data are then reconciled, as atbox 7, to the common schema of the reconciledstructured data repository 13 and then loaded, i.e., stored, as at line 9 into a reconciledstructured data repository 13 for future search, accessing, and retrieval, such as discussed in conjunction with the present invention below. - Referencing the block diagram of
FIG. 2 , the preferred embodiment of anintranet mediator 11 according to the present invention comprises means for the searching of both structured data sources, such as a multiplicity of databases integrated into a reconciledstructured data repository 13, e.g., a data warehouse, and unstructured data sources, collectively 15, whether in the owner's proprietary databases or within a public collection such as the Internet, to arrive at a direct answer to a natural language question input by the user. - A
user interface 17, such as a known graphical user interface, accepts input of a user question in natural language format and may allow the user to manually select data sources if desired. As indicated by the two boxes inFIG. 2 labeled 17, theuser interface 17 is preferably a part of an input/output user interface that is also tasked with displaying an answer to the user question. - A
parser module 19 then accepts the natural language question input by the user, as atline 20, parsing and assembling the relevant concepts of the natural language question into a query, or queries, of weighted search tokens, hereinafter referred to simply as a query, and also eliminating the irrelevant, or non-indicative, words of the natural language question from use as search tokens. For example, in the query: “What are the three best Sushi restaurants in Chicago?,” words irrelevant to data retrieval such as “are” and “the” may be eliminated from use as tokens by the parser module. Various techniques such as grammar matching, lattices, partial lattices, etc. for accomplishing the tasks of theparser module 19 are available and considered within the skill of the person having ordinary skill in the art. - A
query expander module 21 is operably connected to theparser module 19, as atline 22, for receiving the parsed query tokens and obtaining additional terms synonymous or analogous to the tokens and expanding the query with additional tokens if desirable to expand the chance of a return of information relevant to the query. Various techniques such as thesauri, dictionaries, term expansion, stemming, phrase generation, and the like are available to accomplish this query, or token, expansion and are considered within the skill of the person having ordinary skill in the art. - The
query expander module 21 then passes the tokens, preferably including the expanded tokens, to an unstructureddata source manager 23, as atline 24, for a limited use specialty search to acquire additional synonyms, or terms analogous to, the query tokens and expanded query tokens, if any. The unstructureddata source manager 23 initiates a search, as atlines 26, of one or more repositories ofunstructured data sources 15, hereinafter referred to in the singular for ease of explanation, and obtains the search results therefrom, as atlines 28, with any acceptable method of information retrieval. Additional tokens, if any, may be filtered from the results of the unstructured data source search returned to thequery expander module 21 from unstructureddata source manager 23 vialine 46 and added to the expanded query for return to theparser module 19 through thequery expander module 21, as atline 30. Theunstructured data repository 15, or an operable link to such a repository, may also be considered as a part of theintranet mediator 11 according to certain embodiments of the present invention. Unstructured data sources may include private or public data repositories, e.g., a private video archive and the World Wide Web, respectively. - The
parser module 19 will then pass the expanded query to the datasource selection module 25, as atline 32. The datasource selection module 25 determines the most likely data source or sources to contain an answer for each token of the expanded query; e.g., adaptive, heuristic, hard-coded, user-directed, standard, handwritten, or data-mined inquiry techniques through its connection, as atline 34, to a meta-data repository 27 abstracting the contents of adata warehouse 13 of structured data sources. Meta-data for theunstructured data repository 15 may further be contained in the meta-data repository 27 in certain embodiments of the present invention. The meta-data repository, or an operable link to such a repository, may also be considered as a part of the intranet mediator according to certain embodiments of the present invention. - Upon selection of the appropriately likely data sources to return an answer for each token of the expanded query, the data
source selection module 25 will pass the data source selections and accompanying search tokens onto thedispatcher module 31, as atline 36. Thedispatcher module 31 will then route each token to be searched to the appropriatedata source manager line 38 to the structureddata source manager 33 or to the unstructureddata source manager 23 vialine 40. The structureddata source manager 33 and the unstructureddata source manager 23 will then interface most efficiently with the reconciledstructured data repository 13 andunstructured data sources 15, vialines structured data repository 13, e.g., may be a physical data warehouse of integrated and structured private data sources, and may be constructed according to known techniques such as extraction/transform/load (ETL) techniques. One of ordinary skill in the art will recognize that other reconciled structured data repository development techniques are likewise within the scope of this invention. The reconciledstructured data repository 13, or an operable link to such a reconciled structured data repository, may also be considered a part of theintranet mediator 11 according to some embodiments of the present invention. - Search results are returned to the structured and unstructured data source managers, 33, 23 respectively, via
lines lines results manager module 51. - The
results manager module 51 desirably accepts and consolidates the results of the structured and unstructured data source searches for each search token and integrates the results of the searches. Duplicate results are also eliminated as appropriate. Theresults manager module 51 then weights or ranks the results and at least one most likely answer is selected. In the presently preferred embodiment, if an answer is returned from the structured data source, for example, the data warehouse, it is published as the most likely answer. It will be understood that selection of the direct answer from the unstructured data source is within the scope of this invention. In the described embodiment, the associated data links, extracted text, or the like, from the unstructured data source search may be further selected and ranked. Ananswer module portion 53 of theuser interface 17 may then receive answer data vialine 50 and may then perform any formatting necessary for the display via theuser interface 17, of the most likely answer and, if desired, the associated data or data links available as additional context for the direct answer. - The
user interface 17 of the described embodiment will wait for some specified time interval or specified number of results to accumulate and then display the currently ranked results. Theuser interface 17 will further continue receiving results and ranking them in conjunction with the total accumulated results to capture answers which may have been delayed for various reasons. The user is then preferably presented with an option to activate a “get next” display command, and the now possibly revised rankings or additionally aggregated results are displayed. - Having thus described an intranet mediator for searching both structured and unstructured data sources for arriving at a most likely answer to a natural language question input by the user; it will be appreciated that many variations thereon will occur to the artisan upon an understanding of the present invention, which is therefore to be limited only by the appended claims.
Claims (15)
1. A method for digital data gathering in answer to a query, comprising: conducting concurrent searching of a reconciled structured data repository and unstructured data sources, and preselecting at least one data source from the reconciled structured data repository and the unstructured data sources most likely to contain a direct answer to the query before submitting the query to the data sources.
2. The method for digital data gathering in answer to a query, according to claim 1 , further comprising: combining results from said reconciled structured data repository and unstructured data source searches.
3. The method for digital data gathering in answer to a query, according to claim 1 , further comprising: performing a search of the query in the preselected data sources, combining results from the preselected data source searches, sorting the results and providing a direct answer to the query.
4. The method for digital data gathering in answer to a query, according to claim 1 , further comprising: combining a plurality of structured data sources into a reconciled structured data repository with a meta-data repository.
5. The method for digital data gathering in answer to a query, according to claim 4 , further comprising: performing a search of the query in the preselected data sources, combining results from the preselected data source searches, sorting the results and providing a direct answer to the query.
6. A method of digital data gathering for providing a direct answer to a natural language question, comprising:
a) accepting input of a natural language question;
b) identifying the relevant concepts of the natural language question;
c) assembling the relevant concepts of the natural language question into a query;
d) identifying a reconciled structured data repository likely to contain an answer to the query;
e) performing a first search of the query in the reconciled structured data repository;
f) performing a second search of the query in an unstructured data source;
g) integrating the results of the first and second searches and selecting a direct answer to the natural language question; and
h) displaying the direct answer to the natural language question.
7. The method of digital data gathering for providing a direct answer to a natural language question, according to claim 6 , further comprising: combining a plurality of structured data sources into the reconciled structured data repository.
8. The method of digital data gathering for providing a direct answer to a natural language question, according to claim 6 , further comprising: combining a plurality of unstructured data sources into the reconciled structured data repository.
9. The method of digital data gathering for providing a direct answer to a natural language question, according to claim 6 , further comprising: combining a plurality of structured and unstructured data sources into the reconciled structured data repository.
10. A method for digital data gathering in answer to a query, comprising:
conducting concurrent searching of a reconciled structured data repository and unstructured data sources,
preselecting individual structured data sources and unstructured data sources most likely to contain a direct answer to the query before submitting the query to the data sources; and
sorting results of the data source searches and providing a direct answer to the query.
11. A method for digital data gathering in answer to a query, comprising:
combining data sources into a reconciled structured data repository with a meta-data repository,
conducting a search of the data within the reconciled structured data repository,
conducting a search of unstructured data sources outside of the reconciled structured data repository if a direct answer is not selected from the reconciled structured data repository search, and
sorting results of the search and providing a direct answer to the query.
12. The method according to claim 1 further including conducting a search of unstructured data sources outside of the reconciled structured data repository if a direct answer is not selected from the reconciled structured data repositories search.
13. The method according to claim 1 further including conducting a search of an unstructured data source outside of the reconciled structured data repository and combining and sorting the results of the unstructured data source search outside of the reconciled structured data repository with the results of the reconciled structured data repository search.
14. The method according to claim 12 wherein the selection of the direct answer is weighted to the search results from the reconciled structured data repository.
15. The method according to claim 13 wherein the selection of the direct answer is weighted to the search results from the reconciled structured data repository.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US10/960,804 US20050050037A1 (en) | 2001-04-18 | 2004-10-07 | Intranet mediator |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/837,436 US6904428B2 (en) | 2001-04-18 | 2001-04-18 | Intranet mediator |
US10/960,804 US20050050037A1 (en) | 2001-04-18 | 2004-10-07 | Intranet mediator |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US09/837,436 Continuation US6904428B2 (en) | 2001-04-18 | 2001-04-18 | Intranet mediator |
Publications (1)
Publication Number | Publication Date |
---|---|
US20050050037A1 true US20050050037A1 (en) | 2005-03-03 |
Family
ID=25274432
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US09/837,436 Expired - Lifetime US6904428B2 (en) | 2001-04-18 | 2001-04-18 | Intranet mediator |
US10/960,804 Abandoned US20050050037A1 (en) | 2001-04-18 | 2004-10-07 | Intranet mediator |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US09/837,436 Expired - Lifetime US6904428B2 (en) | 2001-04-18 | 2001-04-18 | Intranet mediator |
Country Status (1)
Country | Link |
---|---|
US (2) | US6904428B2 (en) |
Cited By (29)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20040163043A1 (en) * | 2003-02-10 | 2004-08-19 | Kaidara S.A. | System method and computer program product for obtaining structured data from text |
US20060057560A1 (en) * | 2004-03-05 | 2006-03-16 | Hansen Medical, Inc. | System and method for denaturing and fixing collagenous tissue |
US20070016570A1 (en) * | 2005-07-14 | 2007-01-18 | Nokia Corporation | Method, apparatus and computer program product providing an application integrated mobile device search solution using context information |
US20070208714A1 (en) * | 2006-03-01 | 2007-09-06 | Oracle International Corporation | Method for Suggesting Web Links and Alternate Terms for Matching Search Queries |
US20070208744A1 (en) * | 2006-03-01 | 2007-09-06 | Oracle International Corporation | Flexible Authentication Framework |
US20070208746A1 (en) * | 2006-03-01 | 2007-09-06 | Oracle International Corporation | Secure Search Performance Improvement |
US20070208734A1 (en) * | 2006-03-01 | 2007-09-06 | Oracle International Corporation | Link Analysis for Enterprise Environment |
US20070208713A1 (en) * | 2006-03-01 | 2007-09-06 | Oracle International Corporation | Auto Generation of Suggested Links in a Search System |
US20070208755A1 (en) * | 2006-03-01 | 2007-09-06 | Oracle International Corporation | Suggested Content with Attribute Parameterization |
US20070208745A1 (en) * | 2006-03-01 | 2007-09-06 | Oracle International Corporation | Self-Service Sources for Secure Search |
US20070209080A1 (en) * | 2006-03-01 | 2007-09-06 | Oracle International Corporation | Search Hit URL Modification for Secure Application Integration |
US20070214129A1 (en) * | 2006-03-01 | 2007-09-13 | Oracle International Corporation | Flexible Authorization Model for Secure Search |
US20070220268A1 (en) * | 2006-03-01 | 2007-09-20 | Oracle International Corporation | Propagating User Identities In A Secure Federated Search System |
US20070283425A1 (en) * | 2006-03-01 | 2007-12-06 | Oracle International Corporation | Minimum Lifespan Credentials for Crawling Data Repositories |
US20080104542A1 (en) * | 2006-10-27 | 2008-05-01 | Information Builders, Inc. | Apparatus and Method for Conducting Searches with a Search Engine for Unstructured Data to Retrieve Records Enriched with Structured Data and Generate Reports Based Thereon |
US20080120323A1 (en) * | 2006-11-17 | 2008-05-22 | Lehman Brothers Inc. | System and method for generating customized reports |
US20090006356A1 (en) * | 2007-06-27 | 2009-01-01 | Oracle International Corporation | Changing ranking algorithms based on customer settings |
US20090006359A1 (en) * | 2007-06-28 | 2009-01-01 | Oracle International Corporation | Automatically finding acronyms and synonyms in a corpus |
US20090287678A1 (en) * | 2008-05-14 | 2009-11-19 | International Business Machines Corporation | System and method for providing answers to questions |
US20090327230A1 (en) * | 2008-06-27 | 2009-12-31 | Microsoft Corporation | Structured and unstructured data models |
US7849048B2 (en) | 2005-07-05 | 2010-12-07 | Clarabridge, Inc. | System and method of making unstructured data available to structured data analysis tools |
US7849049B2 (en) | 2005-07-05 | 2010-12-07 | Clarabridge, Inc. | Schema and ETL tools for structured and unstructured data |
US20100332499A1 (en) * | 2009-06-26 | 2010-12-30 | Iac Search & Media, Inc. | Method and system for determining confidence in answer for search |
US7974681B2 (en) | 2004-03-05 | 2011-07-05 | Hansen Medical, Inc. | Robotic catheter system |
US20140195552A1 (en) * | 2013-01-10 | 2014-07-10 | Pvelocity Inc. | Method and system for collating and presenting information |
US9477749B2 (en) | 2012-03-02 | 2016-10-25 | Clarabridge, Inc. | Apparatus for identifying root cause using unstructured data |
US20180307735A1 (en) * | 2017-04-19 | 2018-10-25 | Ca, Inc. | Integrating relational and non-relational databases |
US10614725B2 (en) | 2012-09-11 | 2020-04-07 | International Business Machines Corporation | Generating secondary questions in an introspective question answering system |
US11630833B2 (en) * | 2020-10-29 | 2023-04-18 | International Business Machines Corporation | Extract-transform-load script generation |
Families Citing this family (91)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP3921523B2 (en) * | 2001-12-27 | 2007-05-30 | 独立行政法人情報通信研究機構 | Text generation method and text generation apparatus |
US9418204B2 (en) * | 2002-01-28 | 2016-08-16 | Samsung Electronics Co., Ltd | Bioinformatics system architecture with data and process integration |
US7302383B2 (en) * | 2002-09-12 | 2007-11-27 | Luis Calixto Valles | Apparatus and methods for developing conversational applications |
US20050050454A1 (en) * | 2003-08-29 | 2005-03-03 | International Business Machines Corporation | Controlling the look and feel of a web |
US8677377B2 (en) | 2005-09-08 | 2014-03-18 | Apple Inc. | Method and apparatus for building an intelligent automated assistant |
US8412514B1 (en) * | 2005-10-27 | 2013-04-02 | At&T Intellectual Property Ii, L.P. | Method and apparatus for compiling and querying a QA database |
US8117196B2 (en) * | 2006-01-23 | 2012-02-14 | Chacha Search, Inc. | Search tool providing optional use of human search guides |
US8719005B1 (en) * | 2006-02-10 | 2014-05-06 | Rusty Shawn Lee | Method and apparatus for using directed reasoning to respond to natural language queries |
AU2007253724A1 (en) * | 2006-05-19 | 2007-11-29 | Jorn Lyseggen | Source search engine |
US7890440B1 (en) | 2006-08-29 | 2011-02-15 | Science Applications International Corporation | Systems and methods for evaluating a group detection tool |
US9318108B2 (en) | 2010-01-18 | 2016-04-19 | Apple Inc. | Intelligent automated assistant |
US7774301B2 (en) * | 2006-12-21 | 2010-08-10 | International Business Machines Corporation | Use of federation services and transformation services to perform extract, transform, and load (ETL) of unstructured information and associated metadata |
US20080243823A1 (en) * | 2007-03-28 | 2008-10-02 | Elumindata, Inc. | System and method for automatically generating information within an eletronic document |
US20080263006A1 (en) * | 2007-04-20 | 2008-10-23 | Sap Ag | Concurrent searching of structured and unstructured data |
US9189478B2 (en) | 2008-04-03 | 2015-11-17 | Elumindata, Inc. | System and method for collecting data from an electronic document and storing the data in a dynamically organized data structure |
US8996376B2 (en) | 2008-04-05 | 2015-03-31 | Apple Inc. | Intelligent text-to-speech conversion |
US20100023501A1 (en) * | 2008-07-22 | 2010-01-28 | Elumindata, Inc. | System and method for automatically selecting a data source for providing data related to a query |
US8041712B2 (en) * | 2008-07-22 | 2011-10-18 | Elumindata Inc. | System and method for automatically selecting a data source for providing data related to a query |
US8037062B2 (en) | 2008-07-22 | 2011-10-11 | Elumindata, Inc. | System and method for automatically selecting a data source for providing data related to a query |
US8176042B2 (en) | 2008-07-22 | 2012-05-08 | Elumindata, Inc. | System and method for automatically linking data sources for providing data related to a query |
US8615707B2 (en) | 2009-01-16 | 2013-12-24 | Google Inc. | Adding new attributes to a structured presentation |
US8452791B2 (en) | 2009-01-16 | 2013-05-28 | Google Inc. | Adding new instances to a structured presentation |
US8412749B2 (en) * | 2009-01-16 | 2013-04-02 | Google Inc. | Populating a structured presentation with new values |
US8977645B2 (en) * | 2009-01-16 | 2015-03-10 | Google Inc. | Accessing a search interface in a structured presentation |
US10241752B2 (en) | 2011-09-30 | 2019-03-26 | Apple Inc. | Interface for a virtual digital assistant |
US10241644B2 (en) | 2011-06-03 | 2019-03-26 | Apple Inc. | Actionable reminder entries |
US9431006B2 (en) | 2009-07-02 | 2016-08-30 | Apple Inc. | Methods and apparatuses for automatic speech recognition |
US8682667B2 (en) | 2010-02-25 | 2014-03-25 | Apple Inc. | User profiling for selecting user specific voice input processing information |
US8442982B2 (en) * | 2010-11-05 | 2013-05-14 | Apple Inc. | Extended database search |
US8682647B1 (en) * | 2012-01-13 | 2014-03-25 | Google Inc. | Systems and methods for answering complex questions with mined natural language search results |
US9721563B2 (en) | 2012-06-08 | 2017-08-01 | Apple Inc. | Name recognition system |
US9305140B2 (en) | 2012-07-16 | 2016-04-05 | Georgetown University | System and method of applying state of being to health care delivery |
US9547647B2 (en) | 2012-09-19 | 2017-01-17 | Apple Inc. | Voice-based media searching |
US8914419B2 (en) | 2012-10-30 | 2014-12-16 | International Business Machines Corporation | Extracting semantic relationships from table structures in electronic documents |
US10289653B2 (en) | 2013-03-15 | 2019-05-14 | International Business Machines Corporation | Adapting tabular data for narration |
US9146987B2 (en) | 2013-06-04 | 2015-09-29 | International Business Machines Corporation | Clustering based question set generation for training and testing of a question and answer system |
US9230009B2 (en) | 2013-06-04 | 2016-01-05 | International Business Machines Corporation | Routing of questions to appropriately trained question and answer system pipelines using clustering |
WO2014197334A2 (en) | 2013-06-07 | 2014-12-11 | Apple Inc. | System and method for user-specified pronunciation of words for speech synthesis and recognition |
US9164977B2 (en) | 2013-06-24 | 2015-10-20 | International Business Machines Corporation | Error correction in tables using discovered functional dependencies |
US9600461B2 (en) | 2013-07-01 | 2017-03-21 | International Business Machines Corporation | Discovering relationships in tabular data |
US9275115B2 (en) | 2013-07-16 | 2016-03-01 | International Business Machines Corporation | Correlating corpus/corpora value from answered questions |
US9607039B2 (en) | 2013-07-18 | 2017-03-28 | International Business Machines Corporation | Subject-matter analysis of tabular data |
US9830314B2 (en) | 2013-11-18 | 2017-11-28 | International Business Machines Corporation | Error correction in tables using a question and answer system |
US9348900B2 (en) | 2013-12-11 | 2016-05-24 | International Business Machines Corporation | Generating an answer from multiple pipelines using clustering |
US9286290B2 (en) | 2014-04-25 | 2016-03-15 | International Business Machines Corporation | Producing insight information from tables using natural language processing |
US9430463B2 (en) | 2014-05-30 | 2016-08-30 | Apple Inc. | Exemplar-based natural language processing |
US9338493B2 (en) | 2014-06-30 | 2016-05-10 | Apple Inc. | Intelligent automated assistant for TV user interactions |
US9754207B2 (en) | 2014-07-28 | 2017-09-05 | International Business Machines Corporation | Corpus quality analysis |
US9668121B2 (en) | 2014-09-30 | 2017-05-30 | Apple Inc. | Social reminders |
US10303798B2 (en) * | 2014-12-18 | 2019-05-28 | Nuance Communications, Inc. | Question answering from structured and unstructured data sources |
US10783159B2 (en) | 2014-12-18 | 2020-09-22 | Nuance Communications, Inc. | Question answering with entailment analysis |
US10567477B2 (en) | 2015-03-08 | 2020-02-18 | Apple Inc. | Virtual assistant continuity |
CN104915377A (en) * | 2015-05-07 | 2015-09-16 | 亿赞普(北京)科技有限公司 | Method and device for adding foreign language business object category labels |
US9578173B2 (en) | 2015-06-05 | 2017-02-21 | Apple Inc. | Virtual assistant aided communication with 3rd party service in a communication session |
US20160357861A1 (en) * | 2015-06-07 | 2016-12-08 | Apple Inc. | Natural language event detection |
US10095740B2 (en) | 2015-08-25 | 2018-10-09 | International Business Machines Corporation | Selective fact generation from table data in a cognitive system |
US10747498B2 (en) | 2015-09-08 | 2020-08-18 | Apple Inc. | Zero latency digital assistant |
US10671428B2 (en) | 2015-09-08 | 2020-06-02 | Apple Inc. | Distributed personal assistant |
US10366158B2 (en) | 2015-09-29 | 2019-07-30 | Apple Inc. | Efficient word encoding for recurrent neural network language models |
US11010550B2 (en) | 2015-09-29 | 2021-05-18 | Apple Inc. | Unified language modeling framework for word prediction, auto-completion and auto-correction |
US10691473B2 (en) | 2015-11-06 | 2020-06-23 | Apple Inc. | Intelligent automated assistant in a messaging environment |
US10049668B2 (en) | 2015-12-02 | 2018-08-14 | Apple Inc. | Applying neural network language models to weighted finite state transducers for automatic speech recognition |
US10223066B2 (en) | 2015-12-23 | 2019-03-05 | Apple Inc. | Proactive assistance based on dialog communication between devices |
US10446143B2 (en) | 2016-03-14 | 2019-10-15 | Apple Inc. | Identification of voice inputs providing credentials |
US9934775B2 (en) | 2016-05-26 | 2018-04-03 | Apple Inc. | Unit-selection text-to-speech synthesis based on predicted concatenation parameters |
US9972304B2 (en) | 2016-06-03 | 2018-05-15 | Apple Inc. | Privacy preserving distributed evaluation framework for embedded personalized systems |
US10592607B2 (en) * | 2016-06-03 | 2020-03-17 | Microsoft Technology Licensing, Llc | Iterative alternating neural attention for machine reading |
US10249300B2 (en) | 2016-06-06 | 2019-04-02 | Apple Inc. | Intelligent list reading |
US10049663B2 (en) | 2016-06-08 | 2018-08-14 | Apple, Inc. | Intelligent automated assistant for media exploration |
DK179309B1 (en) | 2016-06-09 | 2018-04-23 | Apple Inc | Intelligent automated assistant in a home environment |
US10509862B2 (en) | 2016-06-10 | 2019-12-17 | Apple Inc. | Dynamic phrase expansion of language input |
US10192552B2 (en) | 2016-06-10 | 2019-01-29 | Apple Inc. | Digital assistant providing whispered speech |
US10490187B2 (en) | 2016-06-10 | 2019-11-26 | Apple Inc. | Digital assistant providing automated status report |
US10586535B2 (en) | 2016-06-10 | 2020-03-10 | Apple Inc. | Intelligent digital assistant in a multi-tasking environment |
US10067938B2 (en) | 2016-06-10 | 2018-09-04 | Apple Inc. | Multilingual word prediction |
DK201670540A1 (en) | 2016-06-11 | 2018-01-08 | Apple Inc | Application integration with a digital assistant |
DK179049B1 (en) | 2016-06-11 | 2017-09-18 | Apple Inc | Data driven natural language event detection and classification |
DK179343B1 (en) | 2016-06-11 | 2018-05-14 | Apple Inc | Intelligent task discovery |
DK179415B1 (en) | 2016-06-11 | 2018-06-14 | Apple Inc | Intelligent device arbitration and control |
US10043516B2 (en) | 2016-09-23 | 2018-08-07 | Apple Inc. | Intelligent automated assistant |
US10593346B2 (en) | 2016-12-22 | 2020-03-17 | Apple Inc. | Rank-reduced token representation for automatic speech recognition |
DK201770439A1 (en) | 2017-05-11 | 2018-12-13 | Apple Inc. | Offline personal assistant |
DK179496B1 (en) | 2017-05-12 | 2019-01-15 | Apple Inc. | USER-SPECIFIC Acoustic Models |
DK179745B1 (en) | 2017-05-12 | 2019-05-01 | Apple Inc. | SYNCHRONIZATION AND TASK DELEGATION OF A DIGITAL ASSISTANT |
DK201770432A1 (en) | 2017-05-15 | 2018-12-21 | Apple Inc. | Hierarchical belief states for digital assistants |
DK201770431A1 (en) | 2017-05-15 | 2018-12-20 | Apple Inc. | Optimizing dialogue policy decisions for digital assistants using implicit feedback |
DK179560B1 (en) | 2017-05-16 | 2019-02-18 | Apple Inc. | Far-field extension for digital assistant services |
CN110019445B (en) * | 2017-09-08 | 2021-07-06 | 北京京东尚科信息技术有限公司 | Data synchronization method and device, computing equipment and storage medium |
JP7046592B2 (en) * | 2017-12-21 | 2022-04-04 | 株式会社日立製作所 | Search support system, search support method, and search support program |
US10984461B2 (en) * | 2018-12-26 | 2021-04-20 | Paypal, Inc. | System and method for making content-based recommendations using a user profile likelihood model |
US10902955B1 (en) | 2020-05-01 | 2021-01-26 | Georgetown University | Detecting COVID-19 using surrogates |
Citations (37)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5335345A (en) * | 1990-04-11 | 1994-08-02 | Bell Communications Research, Inc. | Dynamic query optimization using partial information |
US5495605A (en) * | 1992-01-12 | 1996-02-27 | Bull S.A. | Method to help in optimizing a query from a relational data base management system, and resultant method of syntactical analysis |
US5546576A (en) * | 1995-02-17 | 1996-08-13 | International Business Machines Corporation | Query optimizer system that detects and prevents mutating table violations of database integrity in a query before execution plan generation |
US5574900A (en) * | 1994-02-25 | 1996-11-12 | International Business Machines Corporation | System and method for optimizing parallel processing of database queries |
US5600831A (en) * | 1994-02-28 | 1997-02-04 | Lucent Technologies Inc. | Apparatus and methods for retrieving information by modifying query plan based on description of information sources |
US5802536A (en) * | 1992-05-29 | 1998-09-01 | Sony Corporation | Information search and display apparatus |
US5901287A (en) * | 1996-04-01 | 1999-05-04 | The Sabre Group Inc. | Information aggregation and synthesization system |
US5920856A (en) * | 1997-06-09 | 1999-07-06 | Xerox Corporation | System for selecting multimedia databases over networks |
US5933822A (en) * | 1997-07-22 | 1999-08-03 | Microsoft Corporation | Apparatus and methods for an information retrieval system that employs natural language processing of search results to improve overall precision |
US5960422A (en) * | 1997-11-26 | 1999-09-28 | International Business Machines Corporation | System and method for optimized source selection in an information retrieval system |
US5963940A (en) * | 1995-08-16 | 1999-10-05 | Syracuse University | Natural language information retrieval system and method |
US5964024A (en) * | 1997-06-25 | 1999-10-12 | Norkol/Fibercore, Inc. | Roll cutter |
US5970490A (en) * | 1996-11-05 | 1999-10-19 | Xerox Corporation | Integration platform for heterogeneous databases |
US5983214A (en) * | 1996-04-04 | 1999-11-09 | Lycos, Inc. | System and method employing individual user content-based data and user collaborative feedback data to evaluate the content of an information entity in a large information communication network |
US5995961A (en) * | 1995-11-07 | 1999-11-30 | Lucent Technologies Inc. | Information manifold for query processing |
US6006221A (en) * | 1995-08-16 | 1999-12-21 | Syracuse University | Multilingual document retrieval system and method using semantic vector matching |
US6067552A (en) * | 1995-08-21 | 2000-05-23 | Cnet, Inc. | User interface system and method for browsing a hypertext database |
US6070134A (en) * | 1997-07-31 | 2000-05-30 | Microsoft Corporation | Identifying salient semantic relation paths between two words |
US6078914A (en) * | 1996-12-09 | 2000-06-20 | Open Text Corporation | Natural language meta-search system and method |
US6078924A (en) * | 1998-01-30 | 2000-06-20 | Aeneid Corporation | Method and apparatus for performing data collection, interpretation and analysis, in an information platform |
US6105022A (en) * | 1997-02-26 | 2000-08-15 | Hitachi, Ltd. | Structured-text cataloging method, structured-text searching method, and portable medium used in the methods |
US6134559A (en) * | 1998-04-27 | 2000-10-17 | Oracle Corporation | Uniform object model having methods and additional features for integrating objects defined by different foreign object type systems into a single type system |
US6145003A (en) * | 1997-12-17 | 2000-11-07 | Microsoft Corporation | Method of web crawling utilizing address mapping |
US6151604A (en) * | 1995-03-28 | 2000-11-21 | Dex Information Systems, Inc. | Method and apparatus for improved information storage and retrieval system |
US6161084A (en) * | 1997-03-07 | 2000-12-12 | Microsoft Corporation | Information retrieval utilizing semantic representation of text by identifying hypernyms and indexing multiple tokenized semantic structures to a same passage of text |
US6167370A (en) * | 1998-09-09 | 2000-12-26 | Invention Machine Corporation | Document semantic analysis/selection with knowledge creativity capability utilizing subject-action-object (SAO) structures |
US6173279B1 (en) * | 1998-04-09 | 2001-01-09 | At&T Corp. | Method of using a natural language interface to retrieve information from one or more data resources |
US6182065B1 (en) * | 1996-11-06 | 2001-01-30 | International Business Machines Corp. | Method and system for weighting the search results of a database search engine |
US6263342B1 (en) * | 1998-04-01 | 2001-07-17 | International Business Machines Corp. | Federated searching of heterogeneous datastores using a federated datastore object |
US6269368B1 (en) * | 1997-10-17 | 2001-07-31 | Textwise Llc | Information retrieval using dynamic evidence combination |
US6304864B1 (en) * | 1999-04-20 | 2001-10-16 | Textwise Llc | System for retrieving multimedia information from the internet using multiple evolving intelligent agents |
US20020069194A1 (en) * | 2000-12-06 | 2002-06-06 | Robbins Benjamin Jon | Client based online content meta search |
US6460043B1 (en) * | 1998-02-04 | 2002-10-01 | Microsoft Corporation | Method and apparatus for operating on data with a conceptual data manipulation language |
US6502088B1 (en) * | 1999-07-08 | 2002-12-31 | International Business Machines Corporation | Method and system for improved access to non-relational databases |
US6523028B1 (en) * | 1998-12-03 | 2003-02-18 | Lockhead Martin Corporation | Method and system for universal querying of distributed databases |
US6567812B1 (en) * | 2000-09-27 | 2003-05-20 | Siemens Aktiengesellschaft | Management of query result complexity using weighted criteria for hierarchical data structuring |
US6910003B1 (en) * | 1999-09-17 | 2005-06-21 | Discern Communications, Inc. | System, method and article of manufacture for concept based information searching |
Family Cites Families (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6625617B2 (en) * | 1996-01-02 | 2003-09-23 | Timeline, Inc. | Modularized data retrieval method and apparatus with multiple source capability |
US6076088A (en) * | 1996-02-09 | 2000-06-13 | Paik; Woojin | Information extraction system and method using concept relation concept (CRC) triples |
US5987454A (en) * | 1997-06-09 | 1999-11-16 | Hobbs; Allen | Method and apparatus for selectively augmenting retrieved text, numbers, maps, charts, still pictures and/or graphics, moving pictures and/or graphics and audio information from a network resource |
US6523022B1 (en) * | 1997-06-09 | 2003-02-18 | Allen Hobbs | Method and apparatus for selectively augmenting retrieved information from a network resource |
US6301584B1 (en) * | 1997-08-21 | 2001-10-09 | Home Information Services, Inc. | System and method for retrieving entities and integrating data |
WO2001037134A1 (en) * | 1999-11-16 | 2001-05-25 | Searchcraft Corporation | Method for searching from a plurality of data sources |
US20010037328A1 (en) * | 2000-03-23 | 2001-11-01 | Pustejovsky James D. | Method and system for interfacing to a knowledge acquisition system |
US6633867B1 (en) * | 2000-04-05 | 2003-10-14 | International Business Machines Corporation | System and method for providing a session query within the context of a dynamic search result set |
US7962326B2 (en) * | 2000-04-20 | 2011-06-14 | Invention Machine Corporation | Semantic answering system and method |
US20020111934A1 (en) * | 2000-10-17 | 2002-08-15 | Shankar Narayan | Question associated information storage and retrieval architecture using internet gidgets |
-
2001
- 2001-04-18 US US09/837,436 patent/US6904428B2/en not_active Expired - Lifetime
-
2004
- 2004-10-07 US US10/960,804 patent/US20050050037A1/en not_active Abandoned
Patent Citations (38)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5335345A (en) * | 1990-04-11 | 1994-08-02 | Bell Communications Research, Inc. | Dynamic query optimization using partial information |
US5495605A (en) * | 1992-01-12 | 1996-02-27 | Bull S.A. | Method to help in optimizing a query from a relational data base management system, and resultant method of syntactical analysis |
US5802536A (en) * | 1992-05-29 | 1998-09-01 | Sony Corporation | Information search and display apparatus |
US5574900A (en) * | 1994-02-25 | 1996-11-12 | International Business Machines Corporation | System and method for optimizing parallel processing of database queries |
US5600831A (en) * | 1994-02-28 | 1997-02-04 | Lucent Technologies Inc. | Apparatus and methods for retrieving information by modifying query plan based on description of information sources |
US5768578A (en) * | 1994-02-28 | 1998-06-16 | Lucent Technologies Inc. | User interface for information retrieval system |
US5546576A (en) * | 1995-02-17 | 1996-08-13 | International Business Machines Corporation | Query optimizer system that detects and prevents mutating table violations of database integrity in a query before execution plan generation |
US6151604A (en) * | 1995-03-28 | 2000-11-21 | Dex Information Systems, Inc. | Method and apparatus for improved information storage and retrieval system |
US5963940A (en) * | 1995-08-16 | 1999-10-05 | Syracuse University | Natural language information retrieval system and method |
US6006221A (en) * | 1995-08-16 | 1999-12-21 | Syracuse University | Multilingual document retrieval system and method using semantic vector matching |
US6067552A (en) * | 1995-08-21 | 2000-05-23 | Cnet, Inc. | User interface system and method for browsing a hypertext database |
US5995961A (en) * | 1995-11-07 | 1999-11-30 | Lucent Technologies Inc. | Information manifold for query processing |
US5901287A (en) * | 1996-04-01 | 1999-05-04 | The Sabre Group Inc. | Information aggregation and synthesization system |
US5983214A (en) * | 1996-04-04 | 1999-11-09 | Lycos, Inc. | System and method employing individual user content-based data and user collaborative feedback data to evaluate the content of an information entity in a large information communication network |
US5970490A (en) * | 1996-11-05 | 1999-10-19 | Xerox Corporation | Integration platform for heterogeneous databases |
US6182065B1 (en) * | 1996-11-06 | 2001-01-30 | International Business Machines Corp. | Method and system for weighting the search results of a database search engine |
US6078914A (en) * | 1996-12-09 | 2000-06-20 | Open Text Corporation | Natural language meta-search system and method |
US6105022A (en) * | 1997-02-26 | 2000-08-15 | Hitachi, Ltd. | Structured-text cataloging method, structured-text searching method, and portable medium used in the methods |
US6161084A (en) * | 1997-03-07 | 2000-12-12 | Microsoft Corporation | Information retrieval utilizing semantic representation of text by identifying hypernyms and indexing multiple tokenized semantic structures to a same passage of text |
US5920856A (en) * | 1997-06-09 | 1999-07-06 | Xerox Corporation | System for selecting multimedia databases over networks |
US5964024A (en) * | 1997-06-25 | 1999-10-12 | Norkol/Fibercore, Inc. | Roll cutter |
US5933822A (en) * | 1997-07-22 | 1999-08-03 | Microsoft Corporation | Apparatus and methods for an information retrieval system that employs natural language processing of search results to improve overall precision |
US6070134A (en) * | 1997-07-31 | 2000-05-30 | Microsoft Corporation | Identifying salient semantic relation paths between two words |
US6269368B1 (en) * | 1997-10-17 | 2001-07-31 | Textwise Llc | Information retrieval using dynamic evidence combination |
US5960422A (en) * | 1997-11-26 | 1999-09-28 | International Business Machines Corporation | System and method for optimized source selection in an information retrieval system |
US6145003A (en) * | 1997-12-17 | 2000-11-07 | Microsoft Corporation | Method of web crawling utilizing address mapping |
US6078924A (en) * | 1998-01-30 | 2000-06-20 | Aeneid Corporation | Method and apparatus for performing data collection, interpretation and analysis, in an information platform |
US6460043B1 (en) * | 1998-02-04 | 2002-10-01 | Microsoft Corporation | Method and apparatus for operating on data with a conceptual data manipulation language |
US6263342B1 (en) * | 1998-04-01 | 2001-07-17 | International Business Machines Corp. | Federated searching of heterogeneous datastores using a federated datastore object |
US6173279B1 (en) * | 1998-04-09 | 2001-01-09 | At&T Corp. | Method of using a natural language interface to retrieve information from one or more data resources |
US6134559A (en) * | 1998-04-27 | 2000-10-17 | Oracle Corporation | Uniform object model having methods and additional features for integrating objects defined by different foreign object type systems into a single type system |
US6167370A (en) * | 1998-09-09 | 2000-12-26 | Invention Machine Corporation | Document semantic analysis/selection with knowledge creativity capability utilizing subject-action-object (SAO) structures |
US6523028B1 (en) * | 1998-12-03 | 2003-02-18 | Lockhead Martin Corporation | Method and system for universal querying of distributed databases |
US6304864B1 (en) * | 1999-04-20 | 2001-10-16 | Textwise Llc | System for retrieving multimedia information from the internet using multiple evolving intelligent agents |
US6502088B1 (en) * | 1999-07-08 | 2002-12-31 | International Business Machines Corporation | Method and system for improved access to non-relational databases |
US6910003B1 (en) * | 1999-09-17 | 2005-06-21 | Discern Communications, Inc. | System, method and article of manufacture for concept based information searching |
US6567812B1 (en) * | 2000-09-27 | 2003-05-20 | Siemens Aktiengesellschaft | Management of query result complexity using weighted criteria for hierarchical data structuring |
US20020069194A1 (en) * | 2000-12-06 | 2002-06-06 | Robbins Benjamin Jon | Client based online content meta search |
Cited By (68)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7174507B2 (en) * | 2003-02-10 | 2007-02-06 | Kaidara S.A. | System method and computer program product for obtaining structured data from text |
US20040163043A1 (en) * | 2003-02-10 | 2004-08-19 | Kaidara S.A. | System method and computer program product for obtaining structured data from text |
US20060057560A1 (en) * | 2004-03-05 | 2006-03-16 | Hansen Medical, Inc. | System and method for denaturing and fixing collagenous tissue |
US7976539B2 (en) | 2004-03-05 | 2011-07-12 | Hansen Medical, Inc. | System and method for denaturing and fixing collagenous tissue |
US7974681B2 (en) | 2004-03-05 | 2011-07-05 | Hansen Medical, Inc. | Robotic catheter system |
US7849048B2 (en) | 2005-07-05 | 2010-12-07 | Clarabridge, Inc. | System and method of making unstructured data available to structured data analysis tools |
US7849049B2 (en) | 2005-07-05 | 2010-12-07 | Clarabridge, Inc. | Schema and ETL tools for structured and unstructured data |
US20070016570A1 (en) * | 2005-07-14 | 2007-01-18 | Nokia Corporation | Method, apparatus and computer program product providing an application integrated mobile device search solution using context information |
US10769215B2 (en) * | 2005-07-14 | 2020-09-08 | Conversant Wireless Licensing S.A R.L. | Method, apparatus and computer program product providing an application integrated mobile device search solution using context information |
US9177124B2 (en) | 2006-03-01 | 2015-11-03 | Oracle International Corporation | Flexible authentication framework |
US8352475B2 (en) | 2006-03-01 | 2013-01-08 | Oracle International Corporation | Suggested content with attribute parameterization |
US20070209080A1 (en) * | 2006-03-01 | 2007-09-06 | Oracle International Corporation | Search Hit URL Modification for Secure Application Integration |
US20070214129A1 (en) * | 2006-03-01 | 2007-09-13 | Oracle International Corporation | Flexible Authorization Model for Secure Search |
US20070220268A1 (en) * | 2006-03-01 | 2007-09-20 | Oracle International Corporation | Propagating User Identities In A Secure Federated Search System |
US20070250486A1 (en) * | 2006-03-01 | 2007-10-25 | Oracle International Corporation | Document date as a ranking factor for crawling |
US20070283425A1 (en) * | 2006-03-01 | 2007-12-06 | Oracle International Corporation | Minimum Lifespan Credentials for Crawling Data Repositories |
US11038867B2 (en) | 2006-03-01 | 2021-06-15 | Oracle International Corporation | Flexible framework for secure search |
US20070208714A1 (en) * | 2006-03-01 | 2007-09-06 | Oracle International Corporation | Method for Suggesting Web Links and Alternate Terms for Matching Search Queries |
US10382421B2 (en) | 2006-03-01 | 2019-08-13 | Oracle International Corporation | Flexible framework for secure search |
US9853962B2 (en) | 2006-03-01 | 2017-12-26 | Oracle International Corporation | Flexible authentication framework |
US9479494B2 (en) | 2006-03-01 | 2016-10-25 | Oracle International Corporation | Flexible authentication framework |
US9467437B2 (en) | 2006-03-01 | 2016-10-11 | Oracle International Corporation | Flexible authentication framework |
US7725465B2 (en) | 2006-03-01 | 2010-05-25 | Oracle International Corporation | Document date as a ranking factor for crawling |
US20100185611A1 (en) * | 2006-03-01 | 2010-07-22 | Oracle International Corporation | Re-ranking search results from an enterprise system |
US20070208755A1 (en) * | 2006-03-01 | 2007-09-06 | Oracle International Corporation | Suggested Content with Attribute Parameterization |
US20070208713A1 (en) * | 2006-03-01 | 2007-09-06 | Oracle International Corporation | Auto Generation of Suggested Links in a Search System |
US9251364B2 (en) | 2006-03-01 | 2016-02-02 | Oracle International Corporation | Search hit URL modification for secure application integration |
US7941419B2 (en) | 2006-03-01 | 2011-05-10 | Oracle International Corporation | Suggested content with attribute parameterization |
US20070208734A1 (en) * | 2006-03-01 | 2007-09-06 | Oracle International Corporation | Link Analysis for Enterprise Environment |
US20070208746A1 (en) * | 2006-03-01 | 2007-09-06 | Oracle International Corporation | Secure Search Performance Improvement |
US20070208744A1 (en) * | 2006-03-01 | 2007-09-06 | Oracle International Corporation | Flexible Authentication Framework |
US8005816B2 (en) | 2006-03-01 | 2011-08-23 | Oracle International Corporation | Auto generation of suggested links in a search system |
US8027982B2 (en) | 2006-03-01 | 2011-09-27 | Oracle International Corporation | Self-service sources for secure search |
US8214394B2 (en) | 2006-03-01 | 2012-07-03 | Oracle International Corporation | Propagating user identities in a secure federated search system |
US8239414B2 (en) | 2006-03-01 | 2012-08-07 | Oracle International Corporation | Re-ranking search results from an enterprise system |
US9081816B2 (en) | 2006-03-01 | 2015-07-14 | Oracle International Corporation | Propagating user identities in a secure federated search system |
US8875249B2 (en) | 2006-03-01 | 2014-10-28 | Oracle International Corporation | Minimum lifespan credentials for crawling data repositories |
US8332430B2 (en) | 2006-03-01 | 2012-12-11 | Oracle International Corporation | Secure search performance improvement |
US20070208745A1 (en) * | 2006-03-01 | 2007-09-06 | Oracle International Corporation | Self-Service Sources for Secure Search |
US8868540B2 (en) * | 2006-03-01 | 2014-10-21 | Oracle International Corporation | Method for suggesting web links and alternate terms for matching search queries |
US8433712B2 (en) | 2006-03-01 | 2013-04-30 | Oracle International Corporation | Link analysis for enterprise environment |
US8595255B2 (en) | 2006-03-01 | 2013-11-26 | Oracle International Corporation | Propagating user identities in a secure federated search system |
US8601028B2 (en) | 2006-03-01 | 2013-12-03 | Oracle International Corporation | Crawling secure data sources |
US8626794B2 (en) | 2006-03-01 | 2014-01-07 | Oracle International Corporation | Indexing secure enterprise documents using generic references |
US8707451B2 (en) | 2006-03-01 | 2014-04-22 | Oracle International Corporation | Search hit URL modification for secure application integration |
US8725770B2 (en) | 2006-03-01 | 2014-05-13 | Oracle International Corporation | Secure search performance improvement |
US20080104542A1 (en) * | 2006-10-27 | 2008-05-01 | Information Builders, Inc. | Apparatus and Method for Conducting Searches with a Search Engine for Unstructured Data to Retrieve Records Enriched with Structured Data and Generate Reports Based Thereon |
US20080120323A1 (en) * | 2006-11-17 | 2008-05-22 | Lehman Brothers Inc. | System and method for generating customized reports |
US20090006356A1 (en) * | 2007-06-27 | 2009-01-01 | Oracle International Corporation | Changing ranking algorithms based on customer settings |
US8412717B2 (en) | 2007-06-27 | 2013-04-02 | Oracle International Corporation | Changing ranking algorithms based on customer settings |
US7996392B2 (en) | 2007-06-27 | 2011-08-09 | Oracle International Corporation | Changing ranking algorithms based on customer settings |
US20090006359A1 (en) * | 2007-06-28 | 2009-01-01 | Oracle International Corporation | Automatically finding acronyms and synonyms in a corpus |
US8316007B2 (en) | 2007-06-28 | 2012-11-20 | Oracle International Corporation | Automatically finding acronyms and synonyms in a corpus |
US8768925B2 (en) | 2008-05-14 | 2014-07-01 | International Business Machines Corporation | System and method for providing answers to questions |
US8275803B2 (en) * | 2008-05-14 | 2012-09-25 | International Business Machines Corporation | System and method for providing answers to questions |
US20090287678A1 (en) * | 2008-05-14 | 2009-11-19 | International Business Machines Corporation | System and method for providing answers to questions |
US9703861B2 (en) | 2008-05-14 | 2017-07-11 | International Business Machines Corporation | System and method for providing answers to questions |
US20090327230A1 (en) * | 2008-06-27 | 2009-12-31 | Microsoft Corporation | Structured and unstructured data models |
US20100332499A1 (en) * | 2009-06-26 | 2010-12-30 | Iac Search & Media, Inc. | Method and system for determining confidence in answer for search |
US9239879B2 (en) * | 2009-06-26 | 2016-01-19 | Iac Search & Media, Inc. | Method and system for determining confidence in answer for search |
US10372741B2 (en) | 2012-03-02 | 2019-08-06 | Clarabridge, Inc. | Apparatus for automatic theme detection from unstructured data |
US9477749B2 (en) | 2012-03-02 | 2016-10-25 | Clarabridge, Inc. | Apparatus for identifying root cause using unstructured data |
US10614725B2 (en) | 2012-09-11 | 2020-04-07 | International Business Machines Corporation | Generating secondary questions in an introspective question answering system |
US10621880B2 (en) | 2012-09-11 | 2020-04-14 | International Business Machines Corporation | Generating secondary questions in an introspective question answering system |
US10489415B2 (en) * | 2013-01-10 | 2019-11-26 | Pvelocity Inc. | Method and system for collating and presenting information |
US20140195552A1 (en) * | 2013-01-10 | 2014-07-10 | Pvelocity Inc. | Method and system for collating and presenting information |
US20180307735A1 (en) * | 2017-04-19 | 2018-10-25 | Ca, Inc. | Integrating relational and non-relational databases |
US11630833B2 (en) * | 2020-10-29 | 2023-04-18 | International Business Machines Corporation | Extract-transform-load script generation |
Also Published As
Publication number | Publication date |
---|---|
US6904428B2 (en) | 2005-06-07 |
US20020156771A1 (en) | 2002-10-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US6904428B2 (en) | Intranet mediator | |
US6073130A (en) | Method for improving the results of a search in a structured database | |
US7136851B2 (en) | Method and system for indexing and searching databases | |
Lin et al. | Extracting classification knowledge of Internet documents with mining term associations: a semantic approach | |
US5926808A (en) | Displaying portions of text from multiple documents over multiple databases related to a search query in a computer network | |
US8156125B2 (en) | Method and apparatus for query and analysis | |
US20160246796A1 (en) | Methods of and systems for searching by incorporating user-entered information | |
US20050027694A1 (en) | User-friendly search results display system, method, and computer program product | |
US8661019B2 (en) | Join algorithms over full text indexes | |
US20090125504A1 (en) | Systems and methods for visualizing web page query results | |
US9009133B2 (en) | Methods and systems for formulating and executing concept-structured queries of unorganized data | |
KR20060048765A (en) | Dispersing search engine results by using page category information | |
Tibbo et al. | Finding finding aids on the world wide web | |
Dobie et al. | A flexible architecture for content and concept based multimedia information exploration | |
JP2003316824A (en) | Document file retrieval system, document file retrieval program and document file retrieval method | |
JPH09223150A (en) | Information classification processing method | |
JPH05324436A (en) | Intelligent data retrieving system | |
US8745035B1 (en) | Multistage pipeline for feeding joined tables to a search system | |
KR100493399B1 (en) | Information search management system and method tereof | |
JP3425784B2 (en) | Document information retrieval device | |
Stern | New search and navigation techniques in the digital library | |
Bourbakis et al. | Retrieval of multimedia web documents and removal of redundant information | |
JPH09231233A (en) | Network retrieval device | |
Bates | Human, database, and domain factors in content indexing and access to digital libraries and the Internet | |
JPH041853A (en) | Document retrieving device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
STCB | Information on status: application discontinuation |
Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION |
|
AS | Assignment |
Owner name: BENT LIGHT TECHNOLOGY, LLC, CALIFORNIA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:ILLINOIS INSTITUTE OF TECHNOLOGY;REEL/FRAME:028075/0711 Effective date: 20120403 |
|
AS | Assignment |
Owner name: INTERNATIONAL BUSINESS MACHINES CORPORATION, NEW Y Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:BENT LIGHT TECHNOLOGY, LLC;REEL/FRAME:028641/0022 Effective date: 20120524 |