JP2005509952A - 知識ベースに対して情報抽出および品質管理を実施する方法およびシステム - Google Patents
知識ベースに対して情報抽出および品質管理を実施する方法およびシステム Download PDFInfo
- Publication number
- JP2005509952A JP2005509952A JP2003544634A JP2003544634A JP2005509952A JP 2005509952 A JP2005509952 A JP 2005509952A JP 2003544634 A JP2003544634 A JP 2003544634A JP 2003544634 A JP2003544634 A JP 2003544634A JP 2005509952 A JP2005509952 A JP 2005509952A
- Authority
- JP
- Japan
- Prior art keywords
- information
- article
- server
- articles
- extracted
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/36—Creation of semantic tools, e.g. ontology or thesauri
- G06F16/367—Ontology
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/35—Clustering; Classification
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/912—Applications of a database
- Y10S707/918—Location
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99933—Query processing, i.e. searching
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99933—Query processing, i.e. searching
- Y10S707/99935—Query augmenting and refining, e.g. inexact access
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99933—Query processing, i.e. searching
- Y10S707/99936—Pattern matching access
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99941—Database schema or data structure
- Y10S707/99942—Manipulating data structure, e.g. compression, compaction, compilation
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Animal Behavior & Ethology (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Information Transfer Between Computers (AREA)
- Data Exchanges In Wide-Area Networks (AREA)
Abstract
Description
本発明文書の開示の一部は著作権保護の対象となる資料を含んでいる。著作権所有者は誰でも米国特許庁特許ファイルまたは記録に載っているままの形で特許文書や特許開示をゼロックスで再生することに異議を唱えないが、そうでないものについては全て著作権を保持するものとする。
(関連出願の相互参照)
本出願は本出願の譲受人Ingenuity Systems, Inc.が予め譲り受けている2000年12月8日に出願した出願中の米国特許出願第09/733,495号“Techniques For Facilitating Information Acquisition and Storage”の一部継続出願である。前に出願した出願中の特許出願の全体が本開示の一部としてここに組み入れられている。
(発明の背景)
本発明は情報抽出および格納の分野に関するものであり、特に、分散情報取得および情報格納プロセスの管理技術に関する。
本発明は複数の記事から情報を抽出して情報記憶部内に格納する技術について検討する。一実施例では、本発明は情報が抽出される複数の記事を識別する。また、本発明は複数の記事から情報を抽出する複数の情報エクストラクタ(information extractors)を識別する。複数の記事および複数の情報エクストラクタに関連する情報を格納するデータベースも提供される。この実施例では、本発明は情報抽出のために複数の記事を複数の情報エクストラクタに割り当てる。本発明は情報エクストラクタに割り当てられた記事から情報エクストラクタにより抽出された情報を受信する。次に、抽出された情報は情報記憶部内に格納される。
本発明は複数の記事から情報または知識を分散式に抽出して情報消費者がアクセスまたは照会することができる体系化されたフォーマットで格納する技術を提供する。情報抽出および格納処理を管理する技術が検討される。図1は本発明の実施例を取り入れることができる分散コンピュータ網10の単純化されたブロック図である。コンピュータ網10は複数の通信リンク18を介して通信網16に接続されたいくつかのコンピュータシステム12,14−1,14−2,および14−3を含んでいる。コンピュータシステムは複数のクライアントコンピュータシステム14−1,14−2,および14−3とサーバコンピュータシステム12を含んでいる。クライアントシステム14は典型的にはサーバコンピュータシステムから情報を要求し、それはクライアント要求に応答して処理を実施して要求された情報をクライアントシステムに提供する。そのために、サーバは典型的にはクライアントシステムよりも大きい計算および格納能力を有する。しかしながら、特定のコンピュータシステムはそれが情報を要求しているかまたは提供しているかに応じてクライアントまたはサーバとして動作することができる。
“...GST-bax binds to bcl2...”
上に示す事実は2つの離散オブジェクト、すなわち“GST-bax”および“bcl2”を含んでいる。事実に対するメタデータは“CHO細胞およびGSTプルダウンアッセイ内のヒューマン・バックスおよびバッド(bax and bad)の再結合GST融合から表現され精製されたヒューマンbcl2により実験が実施された”を示すこともできる。事実に関連付けられた付加情報も情報エクストラクタにより入力することができる。本発明の実施例に従って情報エクストラクタにより入力することができる情報のタイプに関する詳細については付録Aを参照されたい。本発明は事実ベース情報抽出モデルに制限されないことは明白である。本発明に従って、いくつかの他のタイプの情報抽出モデルも使用することができる。
ここで、
FE=事実データエラー数を測定する。これらは記事に対して情報エクストラクタにより入力される事実データ内のエラーである。
FM=欠落事実データエラーを測定する。これらは情報エクストラクタが記事に対して必要な事実情報の入力に失敗する時の手落ちエラーである。
ME=メタデータエラー数を測定する。これらは記事に対して情報エクストラクタにより入力されるメタデータ内のエラーである。
MM=欠落メタデータデータエラーを測定する。これらは記事に対して情報エクストラクタにより入力されるメタデータ情報内の欠陥のエラーである。
MF=記事に対して情報エクストラクタにより入力される情報内の欠落事実数を測定する。
EF=記事に対して情報エクストラクタにより情報入力される外部事実数である。外部事実は一般的に情報エクストラクタにより入力されるが情報抽出プロトコルに従って資格を与えられない事実である。
総事実=品質管理プロセス後に決定された記事に対する事実の総数である。
前記公式に従って、低いQC得点は高品質を示す(理想的にはエラーがなければ、QC=0)。本発明の別の実施例では、さまざまな他の公式および変数を使用できることは明らかである。
Claims (8)
- 記事から情報を抽出し該抽出した情報をフレームベース知識表現で格納するシステムであって、
情報が抽出される記事を選択して優先順位付けする記事選択ユニットと、
該記事選択ユニットに接続されると共に通信を行い、前記記事選択ユニットから選択された記事を受信し予め定められた情報抽出プロトコルに従って前記選択された記事から情報を抽出する情報抽出ユニットと、
該情報抽出ユニットに接続されると共に通信を行い、前記抽出された情報が適切に抽出されかつフレームベース知識表現で格納するためにフォーマットされているかを確認する知識表現管理ユニットと、
該知識表現管理ユニットに接続されると共に通信を行い、適切に抽出されかつフォーマットされていれば前記情報をその表現で格納しかつ格納された表現に関する問い合わせに応答する情報格納ユニットと、
該情報格納ユニットに接続されると共に通信を行い、前記情報格納ユニット内に格納された情報に対するユーザの問い合わせに応答し、該問い合わせに応答して情報格納ユニットから情報を検索し該検索した情報を表示する照会管理および情報表示ユニットと、
を含む前記システム。 - 請求項1に記載のシステムであって、前記情報抽出ユニットと前記知識表現管理ユニットとは結合される前記システム。
- 請求項1に記載のシステムであって、少なくとも前記情報抽出ユニットと前記知識表現管理ユニットとは地理的に広範に離されており、前記各ユニットはその機能を最低コストで実施できるところならばどこでも配置される前記システム。
- フレームベース知識表現の構成方法であって、
前記知識表現に対する情報源として使える記事を選択するステップと、
前記選択された記事から情報を抽出しそれを知識表現で格納するためにフォーマットするステップと、
前記選択された記事から抽出された情報は正しいことおよび正しいフォーマットで配置されていることを検証するステップと、
フォーマットされた情報を前記知識表現で格納するステップと、
を含む前記方法。 - 請求項4に記載の方法であって、前記情報抽出ステップは知識抽出者によって実施され、前記検証ステップは品質管理者によって実施される前記方法。
- 請求項5に記載の方法であって、前記抽出ステップおよび前記検証ステップは共に同じ者によって実施され、その者は予め定められた手順により両方のステップを同時に実施する資格を与えられている者である前記方法。
- 請求項4に記載の方法であって、少なくとも前記抽出と前記検証ステップとは地理的に離れた場所で行われる前記方法。
- 請求項7に記載の方法であって、前記地理的に離れた場所は前記抽出および前記検証の各ステップを実施するコストに基づいて選択され、各ステップに対する最低コストとなる場所が選択される方法。
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US10/038,197 US6741986B2 (en) | 2000-12-08 | 2001-11-09 | Method and system for performing information extraction and quality control for a knowledgebase |
PCT/US2002/035650 WO2003042872A1 (en) | 2001-11-09 | 2002-11-07 | Method and system for performing information extraction and quality control for a knowledge base |
Publications (1)
Publication Number | Publication Date |
---|---|
JP2005509952A true JP2005509952A (ja) | 2005-04-14 |
Family
ID=21898592
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP2003544634A Pending JP2005509952A (ja) | 2001-11-09 | 2002-11-07 | 知識ベースに対して情報抽出および品質管理を実施する方法およびシステム |
Country Status (6)
Country | Link |
---|---|
US (3) | US6741986B2 (ja) |
EP (2) | EP2549392A3 (ja) |
JP (1) | JP2005509952A (ja) |
AU (1) | AU2002340393B2 (ja) |
CA (1) | CA2465592C (ja) |
WO (1) | WO2003042872A1 (ja) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2008041090A (ja) * | 2006-08-04 | 2008-02-21 | Xerox Corp | 文書カタログシステム |
WO2008146807A1 (ja) * | 2007-05-31 | 2008-12-04 | Nec Corporation | オントロジ処理装置、オントロジ処理方法、及びオントロジ処理プログラム |
Families Citing this family (93)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6741986B2 (en) * | 2000-12-08 | 2004-05-25 | Ingenuity Systems, Inc. | Method and system for performing information extraction and quality control for a knowledgebase |
US6772160B2 (en) * | 2000-06-08 | 2004-08-03 | Ingenuity Systems, Inc. | Techniques for facilitating information acquisition and storage |
US7577683B2 (en) * | 2000-06-08 | 2009-08-18 | Ingenuity Systems, Inc. | Methods for the construction and maintenance of a knowledge representation system |
EP1308863A1 (de) * | 2001-11-06 | 2003-05-07 | ALSTOM (Switzerland) Ltd | Verfahren zur nichtlinearen Bewertung von Dokumenten |
US7024624B2 (en) * | 2002-01-07 | 2006-04-04 | Kenneth James Hintz | Lexicon-based new idea detector |
EP3633680A1 (en) * | 2002-02-04 | 2020-04-08 | QIAGEN Redwood City, Inc. | Drug discovery methods |
US8793073B2 (en) * | 2002-02-04 | 2014-07-29 | Ingenuity Systems, Inc. | Drug discovery methods |
JP3809863B2 (ja) * | 2002-02-28 | 2006-08-16 | インターナショナル・ビジネス・マシーンズ・コーポレーション | サーバ |
JP2003316807A (ja) * | 2002-04-23 | 2003-11-07 | Communication Research Laboratory | 情報検索装置及び情報検索ソフトウェアを格納した記憶媒体 |
US7865534B2 (en) * | 2002-09-30 | 2011-01-04 | Genstruct, Inc. | System, method and apparatus for assembling and mining life science data |
US7395536B2 (en) * | 2002-11-14 | 2008-07-01 | Sun Microsystems, Inc. | System and method for submitting and performing computational tasks in a distributed heterogeneous networked environment |
US20040249620A1 (en) * | 2002-11-20 | 2004-12-09 | Genstruct, Inc. | Epistemic engine |
GB2399665A (en) * | 2003-03-18 | 2004-09-22 | British Telecomm | Access control to shared resources |
US20040193591A1 (en) * | 2003-03-27 | 2004-09-30 | Winter Robert William | Searching content information based on standardized categories and selectable categorizers |
US8005709B2 (en) | 2003-06-17 | 2011-08-23 | Oracle International Corporation | Continuous audit process control objectives |
US7899693B2 (en) * | 2003-06-17 | 2011-03-01 | Oracle International Corporation | Audit management workbench |
US7941353B2 (en) * | 2003-06-17 | 2011-05-10 | Oracle International Corporation | Impacted financial statements |
US8296167B2 (en) * | 2003-06-17 | 2012-10-23 | Nigel King | Process certification management |
US20050055312A1 (en) * | 2003-08-18 | 2005-03-10 | Wilson Kelce Steven | Software control flow watermarking |
US8661559B2 (en) * | 2003-08-18 | 2014-02-25 | Riverside Research Institute | Software control flow watermarking |
EP1690212A2 (en) * | 2003-11-26 | 2006-08-16 | Genstruct, Inc. | System, method and apparatus for causal implication analysis in biological networks |
US20050154535A1 (en) * | 2004-01-09 | 2005-07-14 | Genstruct, Inc. | Method, system and apparatus for assembling and using biological knowledge |
US7584221B2 (en) * | 2004-03-18 | 2009-09-01 | Microsoft Corporation | Field weighting in text searching |
JP2007537515A (ja) * | 2004-05-13 | 2007-12-20 | ロジャーズ,ロバート,ジョン | 情報を取り出すためのシステムと方法および情報を保存するためのシステムと方法 |
US7464110B2 (en) * | 2004-06-30 | 2008-12-09 | Nokia Corporation | Automated grouping of image and other user data |
US20060004698A1 (en) * | 2004-06-30 | 2006-01-05 | Nokia Corporation | Automated prioritization of user data files |
US7606793B2 (en) * | 2004-09-27 | 2009-10-20 | Microsoft Corporation | System and method for scoping searches using index keys |
US7761448B2 (en) * | 2004-09-30 | 2010-07-20 | Microsoft Corporation | System and method for ranking search results using click distance |
US7739277B2 (en) * | 2004-09-30 | 2010-06-15 | Microsoft Corporation | System and method for incorporating anchor text into ranking search results |
US7827181B2 (en) * | 2004-09-30 | 2010-11-02 | Microsoft Corporation | Click distance determination |
US20060089861A1 (en) * | 2004-10-22 | 2006-04-27 | Oracle International Corporation | Survey based risk assessment for processes, entities and enterprise |
US20060140860A1 (en) * | 2004-12-08 | 2006-06-29 | Genstruct, Inc. | Computational knowledge model to discover molecular causes and treatment of diabetes mellitus |
US7716198B2 (en) * | 2004-12-21 | 2010-05-11 | Microsoft Corporation | Ranking search results using feature extraction |
US20060200460A1 (en) * | 2005-03-03 | 2006-09-07 | Microsoft Corporation | System and method for ranking search results using file types |
US7792833B2 (en) * | 2005-03-03 | 2010-09-07 | Microsoft Corporation | Ranking search results using language types |
US20070016580A1 (en) * | 2005-07-15 | 2007-01-18 | International Business Machines Corporation | Extracting information about references to entities rom a plurality of electronic documents |
US7599917B2 (en) * | 2005-08-15 | 2009-10-06 | Microsoft Corporation | Ranking search results using biased click distance |
US8095565B2 (en) * | 2005-12-05 | 2012-01-10 | Microsoft Corporation | Metadata driven user interface |
US7885841B2 (en) * | 2006-01-05 | 2011-02-08 | Oracle International Corporation | Audit planning |
US8005873B2 (en) * | 2006-01-25 | 2011-08-23 | Microsoft Corporation | Filtering and sorting information |
US20070225956A1 (en) * | 2006-03-27 | 2007-09-27 | Dexter Roydon Pratt | Causal analysis in complex biological systems |
CA2658991A1 (en) * | 2006-07-28 | 2008-01-31 | Ingenuity Systems, Inc. | Genomics based targeted advertising |
US7668791B2 (en) * | 2006-07-31 | 2010-02-23 | Microsoft Corporation | Distinguishing facts from opinions using a multi-stage approach |
US10453029B2 (en) | 2006-08-03 | 2019-10-22 | Oracle International Corporation | Business process for ultra transactions |
JP2008083806A (ja) * | 2006-09-26 | 2008-04-10 | Hitachi Software Eng Co Ltd | 研究開発財産管理システム |
US9495358B2 (en) | 2006-10-10 | 2016-11-15 | Abbyy Infopoisk Llc | Cross-language text clustering |
US7958103B1 (en) * | 2007-03-30 | 2011-06-07 | Emc Corporation | Incorporated web page content |
US8082109B2 (en) * | 2007-08-29 | 2011-12-20 | Selventa, Inc. | Computer-aided discovery of biomarker profiles in complex biological systems |
EP2212815A1 (en) * | 2007-09-26 | 2010-08-04 | Genstruct, Inc. | Software assisted methods for probing the biochemical basis of biological states |
US7840569B2 (en) * | 2007-10-18 | 2010-11-23 | Microsoft Corporation | Enterprise relevancy ranking using a neural network |
US9348912B2 (en) * | 2007-10-18 | 2016-05-24 | Microsoft Technology Licensing, Llc | Document length as a static relevance feature for ranking search results |
US9146985B2 (en) * | 2008-01-07 | 2015-09-29 | Novell, Inc. | Techniques for evaluating patent impacts |
US8812493B2 (en) * | 2008-04-11 | 2014-08-19 | Microsoft Corporation | Search results ranking using editing distance and document information |
US20090327229A1 (en) * | 2008-06-27 | 2009-12-31 | Microsoft Corporation | Automatic knowledge-based geographical organization of digital media |
US8126899B2 (en) | 2008-08-27 | 2012-02-28 | Cambridgesoft Corporation | Information management system |
US9223770B1 (en) * | 2009-07-29 | 2015-12-29 | Open Invention Network, Llc | Method and apparatus of creating electronic forms to include internet list data |
US20110083084A1 (en) * | 2009-10-05 | 2011-04-07 | Hans-Diedrich Kreft | Method and device for employing editors to compoile data for competence functions |
US8793208B2 (en) * | 2009-12-17 | 2014-07-29 | International Business Machines Corporation | Identifying common data objects representing solutions to a problem in different disciplines |
EP2530605A4 (en) * | 2010-01-29 | 2013-12-25 | Panasonic Corp | DATA PROCESSING UNIT |
US9760634B1 (en) | 2010-03-23 | 2017-09-12 | Firstrain, Inc. | Models for classifying documents |
US10643227B1 (en) | 2010-03-23 | 2020-05-05 | Aurea Software, Inc. | Business lines |
US10546311B1 (en) | 2010-03-23 | 2020-01-28 | Aurea Software, Inc. | Identifying competitors of companies |
US8463789B1 (en) | 2010-03-23 | 2013-06-11 | Firstrain, Inc. | Event detection |
EP2567338B1 (en) * | 2010-05-03 | 2020-04-08 | Perkinelmer Informatics, Inc. | Method and apparatus for processing documents to identify chemical structures |
US8738635B2 (en) | 2010-06-01 | 2014-05-27 | Microsoft Corporation | Detection of junk in search result ranking |
US8793706B2 (en) | 2010-12-16 | 2014-07-29 | Microsoft Corporation | Metadata-based eventing supporting operations on data |
US8782042B1 (en) | 2011-10-14 | 2014-07-15 | Firstrain, Inc. | Method and system for identifying entities |
US8612990B1 (en) | 2011-10-25 | 2013-12-17 | Google Inc. | Prioritized rate scheduler for a storage system |
EP2776962A4 (en) | 2011-11-07 | 2015-12-02 | Ingenuity Systems Inc | METHODS AND SYSTEMS FOR IDENTIFICATION OF CAUSAL GENOMIC VARIANTS |
US9495462B2 (en) | 2012-01-27 | 2016-11-15 | Microsoft Technology Licensing, Llc | Re-ranking search results |
US20130218914A1 (en) * | 2012-02-20 | 2013-08-22 | Xerox Corporation | System and method for providing recommendations based on information extracted from reviewers' comments |
US9977876B2 (en) | 2012-02-24 | 2018-05-22 | Perkinelmer Informatics, Inc. | Systems, methods, and apparatus for drawing chemical structures using touch and gestures |
US8747115B2 (en) | 2012-03-28 | 2014-06-10 | International Business Machines Corporation | Building an ontology by transforming complex triples |
US9600625B2 (en) | 2012-04-23 | 2017-03-21 | Bina Technologies, Inc. | Systems and methods for processing nucleic acid sequence data |
US9002702B2 (en) * | 2012-05-03 | 2015-04-07 | International Business Machines Corporation | Confidence level assignment to information from audio transcriptions |
WO2014019126A1 (en) * | 2012-07-31 | 2014-02-06 | Hewlett-Packard Development Company, L. P. | Context-aware category ranking for wikipedia concepts |
US8539001B1 (en) | 2012-08-20 | 2013-09-17 | International Business Machines Corporation | Determining the value of an association between ontologies |
US20140149846A1 (en) * | 2012-09-06 | 2014-05-29 | Locu, Inc. | Method for collecting offline data |
US20140089328A1 (en) * | 2012-09-27 | 2014-03-27 | International Business Machines Corporation | Association of data to a biological sequence |
US9535583B2 (en) | 2012-12-13 | 2017-01-03 | Perkinelmer Informatics, Inc. | Draw-ahead feature for chemical structure drawing applications |
US10592480B1 (en) * | 2012-12-30 | 2020-03-17 | Aurea Software, Inc. | Affinity scoring |
US10412131B2 (en) | 2013-03-13 | 2019-09-10 | Perkinelmer Informatics, Inc. | Systems and methods for gesture-based sharing of data between separate electronic devices |
US8854361B1 (en) | 2013-03-13 | 2014-10-07 | Cambridgesoft Corporation | Visually augmenting a graphical rendering of a chemical structure representation or biological sequence representation with multi-dimensional information |
CN105264555A (zh) | 2013-04-12 | 2016-01-20 | 培生教育公司 | 评估控制 |
US9430127B2 (en) | 2013-05-08 | 2016-08-30 | Cambridgesoft Corporation | Systems and methods for providing feedback cues for touch screen interface interaction with chemical and biological structure drawing applications |
US9751294B2 (en) | 2013-05-09 | 2017-09-05 | Perkinelmer Informatics, Inc. | Systems and methods for translating three dimensional graphic molecular models to computer aided design format |
US10460830B2 (en) | 2013-08-22 | 2019-10-29 | Genomoncology, Llc | Computer-based systems and methods for analyzing genomes based on discrete data structures corresponding to genetic variants therein |
US10162852B2 (en) | 2013-12-16 | 2018-12-25 | International Business Machines Corporation | Constructing concepts from a task specification |
RU2586577C2 (ru) | 2014-01-15 | 2016-06-10 | Общество с ограниченной ответственностью "Аби ИнфоПоиск" | Фильтрация дуг в синтаксическом графе |
US9626358B2 (en) | 2014-11-26 | 2017-04-18 | Abbyy Infopoisk Llc | Creating ontologies by analyzing natural language texts |
US10917304B2 (en) * | 2015-12-30 | 2021-02-09 | Paypal, Inc. | Task monitoring system |
US10706113B2 (en) | 2017-01-06 | 2020-07-07 | Microsoft Technology Licensing, Llc | Domain review system for identifying entity relationships and corresponding insights |
CA3055172C (en) | 2017-03-03 | 2022-03-01 | Perkinelmer Informatics, Inc. | Systems and methods for searching and indexing documents comprising chemical information |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH11259498A (ja) * | 1998-03-10 | 1999-09-24 | Fujitsu Ltd | 文書処理装置および記録媒体 |
JP2001134600A (ja) * | 1999-11-08 | 2001-05-18 | Nec Corp | 情報抽出システム、情報抽出方法および情報抽出用プログラムを記録した記録媒体 |
Family Cites Families (48)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5317507A (en) * | 1990-11-07 | 1994-05-31 | Gallant Stephen I | Method for document retrieval and for word sense disambiguation using neural networks |
US5371807A (en) | 1992-03-20 | 1994-12-06 | Digital Equipment Corporation | Method and apparatus for text classification |
US5418971A (en) * | 1992-04-20 | 1995-05-23 | International Business Machines Corporation | System and method for ordering commands in an automatic volume placement library |
US5377103A (en) | 1992-05-15 | 1994-12-27 | International Business Machines Corporation | Constrained natural language interface for a computer that employs a browse function |
DE69331456T2 (de) | 1992-10-09 | 2002-11-07 | Matsushita Electric Ind Co Ltd | Überprüfbare optische Zeichenerkennung |
US5794050A (en) | 1995-01-04 | 1998-08-11 | Intelligent Text Processing, Inc. | Natural language understanding system |
US6061675A (en) * | 1995-05-31 | 2000-05-09 | Oracle Corporation | Methods and apparatus for classifying terminology utilizing a knowledge catalog |
US5963966A (en) | 1995-11-08 | 1999-10-05 | Cybernet Systems Corporation | Automated capture of technical documents for electronic review and distribution |
JP3612125B2 (ja) | 1995-12-14 | 2005-01-19 | 株式会社東芝 | 情報フィルタリング方法および情報フィルタリング装置 |
US6076088A (en) * | 1996-02-09 | 2000-06-13 | Paik; Woojin | Information extraction system and method using concept relation concept (CRC) triples |
JPH1049549A (ja) | 1996-05-29 | 1998-02-20 | Matsushita Electric Ind Co Ltd | 文書検索装置 |
US6101488A (en) | 1996-09-04 | 2000-08-08 | Fujitsu Limited | Intelligent information program generation and retrieval system |
US6023659A (en) | 1996-10-10 | 2000-02-08 | Incyte Pharmaceuticals, Inc. | Database system employing protein function hierarchies for viewing biomolecular sequence data |
JPH10207939A (ja) * | 1997-01-17 | 1998-08-07 | Nec Corp | ワークフローシステム |
US6038560A (en) | 1997-05-21 | 2000-03-14 | Oracle Corporation | Concept knowledge base search and retrieval system |
US6345235B1 (en) * | 1997-05-30 | 2002-02-05 | Queen's University At Kingston | Method and apparatus for determining multi-dimensional structure |
DE69823206T2 (de) | 1997-07-25 | 2004-08-19 | Affymetrix, Inc. (a Delaware Corp.), Santa Clara | Verfahren zur herstellung einer bio-informatik-datenbank |
US6554705B1 (en) * | 1997-08-22 | 2003-04-29 | Blake Cumbers | Passive biometric customer identification and tracking system |
US5976842A (en) * | 1997-10-30 | 1999-11-02 | Clontech Laboratories, Inc. | Methods and compositions for use in high fidelity polymerase chain reaction |
US6055538A (en) * | 1997-12-22 | 2000-04-25 | Hewlett Packard Company | Methods and system for using web browser to search large collections of documents |
US6226377B1 (en) * | 1998-03-06 | 2001-05-01 | Avaya Technology Corp. | Prioritized transaction server allocation |
US6236987B1 (en) | 1998-04-03 | 2001-05-22 | Damon Horowitz | Dynamic content organization in information retrieval systems |
US6424980B1 (en) | 1998-06-10 | 2002-07-23 | Nippon Telegraph And Telephone Corporation | Integrated retrieval scheme for retrieving semi-structured documents |
US6067548A (en) | 1998-07-16 | 2000-05-23 | E Guanxi, Inc. | Dynamic organization model and management computing system and method therefor |
US6370542B1 (en) * | 1998-10-23 | 2002-04-09 | Qwest Communications International, Inc. | Method and apparatus for knowledge acquisition and management |
US6498795B1 (en) | 1998-11-18 | 2002-12-24 | Nec Usa Inc. | Method and apparatus for active information discovery and retrieval |
US6442566B1 (en) * | 1998-12-15 | 2002-08-27 | Board Of Trustees Of The Leland Stanford Junior University | Frame-based knowledge representation system and methods |
US6904423B1 (en) * | 1999-02-19 | 2005-06-07 | Bioreason, Inc. | Method and system for artificial intelligence directed lead discovery through multi-domain clustering |
US6292796B1 (en) | 1999-02-23 | 2001-09-18 | Clinical Focus, Inc. | Method and apparatus for improving access to literature |
US6581038B1 (en) | 1999-03-15 | 2003-06-17 | Nexcura, Inc. | Automated profiler system for providing medical information to patients |
US6741976B1 (en) * | 1999-07-01 | 2004-05-25 | Alexander Tuzhilin | Method and system for the creation, application and processing of logical rules in connection with biological, medical or biochemical data |
WO2001013105A1 (en) | 1999-07-30 | 2001-02-22 | Agy Therapeutics, Inc. | Techniques for facilitating identification of candidate genes |
US6598043B1 (en) * | 1999-10-04 | 2003-07-22 | Jarg Corporation | Classification of information sources using graph structures |
US7022905B1 (en) * | 1999-10-18 | 2006-04-04 | Microsoft Corporation | Classification of information and use of classifications in searching and retrieval of information |
GB2363874B (en) * | 1999-11-06 | 2004-08-04 | Dennis Sunga Fernandez | Bioinformatic transaction scheme |
US20010049671A1 (en) * | 2000-06-05 | 2001-12-06 | Joerg Werner B. | e-Stract: a process for knowledge-based retrieval of electronic information |
US6772160B2 (en) * | 2000-06-08 | 2004-08-03 | Ingenuity Systems, Inc. | Techniques for facilitating information acquisition and storage |
US7577683B2 (en) * | 2000-06-08 | 2009-08-18 | Ingenuity Systems, Inc. | Methods for the construction and maintenance of a knowledge representation system |
US6741986B2 (en) * | 2000-12-08 | 2004-05-25 | Ingenuity Systems, Inc. | Method and system for performing information extraction and quality control for a knowledgebase |
US20020194201A1 (en) * | 2001-06-05 | 2002-12-19 | Wilbanks John Thompson | Systems, methods and computer program products for integrating biological/chemical databases to create an ontology network |
US20030018522A1 (en) * | 2001-07-20 | 2003-01-23 | Psc Scanning, Inc. | Biometric system and method for identifying a customer upon entering a retail establishment |
US8793073B2 (en) * | 2002-02-04 | 2014-07-29 | Ingenuity Systems, Inc. | Drug discovery methods |
EP3633680A1 (en) * | 2002-02-04 | 2020-04-08 | QIAGEN Redwood City, Inc. | Drug discovery methods |
US20040249620A1 (en) * | 2002-11-20 | 2004-12-09 | Genstruct, Inc. | Epistemic engine |
US7914468B2 (en) * | 2004-09-22 | 2011-03-29 | Svip 4 Llc | Systems and methods for monitoring and modifying behavior |
US20060143082A1 (en) * | 2004-12-24 | 2006-06-29 | Peter Ebert | Advertisement system and method |
US20070282632A1 (en) * | 2006-05-30 | 2007-12-06 | Eric Sachs | Method and apparatus for serving advertisements in an electronic medical record system |
CA2658991A1 (en) * | 2006-07-28 | 2008-01-31 | Ingenuity Systems, Inc. | Genomics based targeted advertising |
-
2001
- 2001-11-09 US US10/038,197 patent/US6741986B2/en not_active Expired - Lifetime
-
2002
- 2002-11-07 AU AU2002340393A patent/AU2002340393B2/en not_active Expired
- 2002-11-07 CA CA2465592A patent/CA2465592C/en not_active Expired - Lifetime
- 2002-11-07 WO PCT/US2002/035650 patent/WO2003042872A1/en active Application Filing
- 2002-11-07 JP JP2003544634A patent/JP2005509952A/ja active Pending
- 2002-11-07 EP EP12005074.5A patent/EP2549392A3/en not_active Withdrawn
- 2002-11-07 EP EP02778752A patent/EP1454264A4/en not_active Ceased
-
2004
- 2004-03-16 US US10/802,615 patent/US20050055347A9/en not_active Abandoned
-
2011
- 2011-02-16 US US13/029,089 patent/US20110191286A1/en not_active Abandoned
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH11259498A (ja) * | 1998-03-10 | 1999-09-24 | Fujitsu Ltd | 文書処理装置および記録媒体 |
JP2001134600A (ja) * | 1999-11-08 | 2001-05-18 | Nec Corp | 情報抽出システム、情報抽出方法および情報抽出用プログラムを記録した記録媒体 |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2008041090A (ja) * | 2006-08-04 | 2008-02-21 | Xerox Corp | 文書カタログシステム |
WO2008146807A1 (ja) * | 2007-05-31 | 2008-12-04 | Nec Corporation | オントロジ処理装置、オントロジ処理方法、及びオントロジ処理プログラム |
US8244769B2 (en) | 2007-05-31 | 2012-08-14 | Nec Corporation | System and method for judging properties of an ontology and updating same |
Also Published As
Publication number | Publication date |
---|---|
US20050055347A9 (en) | 2005-03-10 |
CA2465592A1 (en) | 2003-05-22 |
US20110191286A1 (en) | 2011-08-04 |
AU2002340393B2 (en) | 2007-01-18 |
US20030074516A1 (en) | 2003-04-17 |
EP1454264A4 (en) | 2007-10-24 |
CA2465592C (en) | 2013-05-21 |
EP1454264A1 (en) | 2004-09-08 |
US6741986B2 (en) | 2004-05-25 |
EP2549392A2 (en) | 2013-01-23 |
EP2549392A3 (en) | 2014-02-12 |
WO2003042872A1 (en) | 2003-05-22 |
US20040236740A1 (en) | 2004-11-25 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
AU2002340393B2 (en) | Method and system for performing information extraction and quality control for a knowledge base | |
US6772160B2 (en) | Techniques for facilitating information acquisition and storage | |
AU2002340393A1 (en) | Method and system for performing information extraction and quality control for a knowledge base | |
KR100996131B1 (ko) | 리스팅 관리 시스템 및 방법 | |
US7702621B2 (en) | System and method for providing profile matching within an unstructured document | |
US6694331B2 (en) | Apparatus for and method of searching and organizing intellectual property information utilizing a classification system | |
US8024333B1 (en) | System and method for providing information navigation and filtration | |
US20140143269A1 (en) | Simultaneous Intellectual Property Search and Valuation System and Methodology (SIPS-VSM) | |
US8103678B1 (en) | System and method for establishing relevance of objects in an enterprise system | |
US20130046782A1 (en) | Method and system to provide subsequent history field for intellectual property document | |
CN1650295A (zh) | 用于数据库查询和信息提交的方法和系统 | |
JP2013503400A (ja) | 公的セクタの雇用と私的セクタの雇用の間における労働力移行を管理するためのシステムおよび方法 | |
AU2014318392A1 (en) | Systems, methods, and software for manuscript recommendations and submissions | |
JP2008537811A (ja) | リスティングを管理するためのシステム及び方法 | |
US20090112850A1 (en) | Bioitem Searcher, Bioitem Search Terminal, Bioitem Search Method, and Program | |
JP5266975B2 (ja) | 個人検索システム、情報処理装置、個人検索方法、プログラムおよび記録媒体 | |
Moore | Performance Measures for Knowledge | |
US20020147596A1 (en) | On-line laboratory services brokerage system | |
Gáspári et al. | Efficient recognition of folds in protein 3D structures by the improved PRIDE algorithm | |
US8250024B2 (en) | Search relevance in business intelligence systems through networked ranking | |
AU2006201478B2 (en) | Method and system for performing information extraction and quality control for a knowledge base | |
Weinzierl et al. | Epidemic Question Answering: question generation and entailment for Answer Nugget discovery | |
Oliver et al. | Here, there and everywhere: an analysis of reference services in academic archives | |
CN117668242A (zh) | 一种数据分析方法、系统及相关设备 | |
Kochen | Quality Control in the Publishing Process and |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
A621 | Written request for application examination |
Free format text: JAPANESE INTERMEDIATE CODE: A621 Effective date: 20051028 |
|
A131 | Notification of reasons for refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A131 Effective date: 20080919 |
|
A601 | Written request for extension of time |
Free format text: JAPANESE INTERMEDIATE CODE: A601 Effective date: 20081120 |
|
A602 | Written permission of extension of time |
Free format text: JAPANESE INTERMEDIATE CODE: A602 Effective date: 20081128 |
|
A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20090209 |
|
A02 | Decision of refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A02 Effective date: 20090313 |
|
A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20090611 |
|
A911 | Transfer to examiner for re-examination before appeal (zenchi) |
Free format text: JAPANESE INTERMEDIATE CODE: A911 Effective date: 20090724 |
|
A912 | Re-examination (zenchi) completed and case transferred to appeal board |
Free format text: JAPANESE INTERMEDIATE CODE: A912 Effective date: 20091009 |
|
A601 | Written request for extension of time |
Free format text: JAPANESE INTERMEDIATE CODE: A601 Effective date: 20110722 |
|
A602 | Written permission of extension of time |
Free format text: JAPANESE INTERMEDIATE CODE: A602 Effective date: 20110727 |
|
A601 | Written request for extension of time |
Free format text: JAPANESE INTERMEDIATE CODE: A601 Effective date: 20110822 |
|
A602 | Written permission of extension of time |
Free format text: JAPANESE INTERMEDIATE CODE: A602 Effective date: 20110825 |
|
A601 | Written request for extension of time |
Free format text: JAPANESE INTERMEDIATE CODE: A601 Effective date: 20110922 |
|
A602 | Written permission of extension of time |
Free format text: JAPANESE INTERMEDIATE CODE: A602 Effective date: 20110929 |