JP5995409B2 - Graphical models for representing text documents for computer analysis - Google Patents
Graphical models for representing text documents for computer analysis
- Publication number
- JP5995409B2 (application JP2011096300A)
- Authority
- JP
- Japan
- Prior art keywords
- document
- words
- edge
- data structure
- graph
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/12—Use of codes for handling textual entities
- G06F40/151—Transformation
- G06F40/154—Tree transformation for tree-structured or markup documents, e.g. XSLT, XSL-FO or stylesheets
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/31—Indexing; Data structures therefor; Storage structures
- G06F16/316—Indexing structures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/12—Use of codes for handling textual entities
- G06F40/137—Hierarchical processing, e.g. outlines
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/12—Use of codes for handling textual entities
- G06F40/14—Tree-structured documents
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/166—Editing, e.g. inserting or deleting
- G06F40/177—Editing, e.g. inserting or deleting of tables; using ruled lines
- G06F40/18—Editing, e.g. inserting or deleting of tables; using ruled lines of spreadsheets
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/205—Parsing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
- G06F40/284—Lexical analysis, e.g. tokenisation or collocates
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Software Systems (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Machine Translation (AREA)
Description
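The graph is defined as
G(C, D, k) = (N(C), A(D, k)),
where N(C) is the set of nodes defined specifically for the corpus C, and A(D, k) is the set of edges in the document. These sets are defined as follows.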
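As a minimal sketch of how a graph of this kind might be built (the set definitions themselves are not reproduced in this text), the Python below follows the wording of the claims: stop words are removed, each distinct remaining word becomes a node, and each edge carries the number of times one word is followed by another within k positions. The function name `distance_graph`, the stop-word list, the sample sentence, and the choice to count distance 0 as a self-loop are assumptions made for illustration, not the patent's own definitions.

```python
from collections import Counter

# Hypothetical stop-word list; the source does not specify one.
STOP_WORDS = frozenset({"a", "an", "the", "on", "of", "and"})


def distance_graph(text, k, directed=True, stop_words=STOP_WORDS):
    """Return (nodes, edges) for a distance graph of order k.

    nodes: set of distinct words left after stop-word removal.
    edges: Counter mapping (u, v) to the number of times v occurs
           at most k positions after u (distance 0 gives self-loops).
    """
    words = [w for w in text.lower().split() if w not in stop_words]
    edges = Counter()
    for i, u in enumerate(words):
        # words[i:i + k + 1] covers distances 0..k from position i.
        for v in words[i:i + k + 1]:
            # Undirected edges ignore word order, so normalize the pair.
            key = (u, v) if directed else tuple(sorted((u, v)))
            edges[key] += 1
    return set(words), edges


nodes, edges = distance_graph("the cat sat on the mat the cat ran", k=1)
print(sorted(nodes))          # ['cat', 'mat', 'ran', 'sat']
print(edges[("cat", "cat")])  # 2 (self-loop: "cat" occurs twice)
print(edges[("mat", "cat")])  # 1 ("cat" directly follows "mat" once)
```

Passing `directed=False` merges the two directions of each pair into a single undirected edge, which corresponds to the undirected variant named in the claims.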
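130: Remaining words (pruned text representation)
140a-140f, 150b, 150c: Self-loops
160a-160f, 170a-170j: Directed edges
201: Distance graph
202: Data structure
301: Undirected distance graph
370a-370i: Undirected edges
401: Apparatus (computer)
402: Memory
403: Disk
404: Central processing unit (CPU)
405: Information repository
410: Request
420: Response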
Claims (8)
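- A computer-implemented method, comprising:
receiving a document containing a plurality of ordered words;
after receiving the document and before generating a graph data structure, removing stop words from the document;
generating, for the document, a graph data structure in which each node represents a distinct word in the document and each edge is associated with the number of times two nodes are adjacent to each other or appear within a predetermined number of words of each other;
storing the graph data structure in an information repository;
receiving a search request as a request to perform text analysis on the document; and
performing text analysis on the graph data structure by generating a graph data structure for a text fragment requested in the search request, searching the information repository, and determining whether the graph data structure of the text fragment is present in any of the graph data structures in the information repository, and providing a result responsive to the request.
- The method of claim 1, wherein the predetermined number of words gives the order value of the graph data structure.
- The method of claim 1, wherein the edges are directed edges or undirected edges.
- The method of claim 1, wherein the graph data structure is generated from a pruned document.
- A method performed by a computer including a processor and a memory, comprising:
the processor receiving a document containing a plurality of words;
the processor, using the memory, removing stop words from the document;
the processor, using the memory, generating, for the document with the stop words removed, a graph data structure in which each node represents a distinct word in the document and each edge is associated with the number of times two nodes are adjacent to each other or appear within a predetermined number of words of each other;
the processor, using the memory, constructing a vector-space representation of the document by assigning, to each edge in the graph data structure, a pseudo-word whose frequency equals the frequency of that edge, namely the number of times the two nodes appear within the predetermined number of words of each other; and
the processor outputting the vector-space representation.
- The method of claim 5, further comprising: the processor, using the memory, receiving, as a request to perform text analysis on the document, a request to compute the similarity of two documents; and performing text analysis on the vector-space representation by examining, based on the pseudo-words, the number of common edges between the graph data structures generated for the two documents, to obtain a result responsive to the request.
- A computer program for causing a computer to execute each step included in the method of any one of claims 1 to 6.
- An apparatus for analyzing a text document, comprising:
means for receiving a document containing a plurality of words;
means for removing stop words from the document;
means for generating, for the document with the stop words removed, a graph data structure in which each node represents a distinct word in the document and each edge is associated with the number of times two nodes are adjacent to each other or appear within a predetermined number of words of each other;
means for constructing a vector-space representation of the document by assigning, to each edge in the graph data structure, a pseudo-word whose frequency equals the frequency of that edge, namely the number of times the two nodes appear within the predetermined number of words of each other; and
means for outputting the vector-space representation.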
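The analysis steps recited in claims 1, 5 and 6 can be sketched in Python as follows. This is an illustration under stated assumptions, not the claimed implementation: the helper names, the `"u->v"` pseudo-word spelling, and the containment rule (every edge of the fragment graph must appear in the stored graph with at least the same count) are all hypothetical choices. Input word lists are assumed to have had stop words removed already.

```python
from collections import Counter


def edge_counts(words, k):
    """Count, for each ordered pair (u, v), how often v occurs
    at most k positions after u (distance 0 gives self-loops)."""
    edges = Counter()
    for i, u in enumerate(words):
        for v in words[i:i + k + 1]:
            edges[(u, v)] += 1
    return edges


def to_pseudo_words(edges):
    """Vector-space form: one pseudo-word per edge, with the
    edge count as its term frequency (naming is an assumption)."""
    return Counter({f"{u}->{v}": n for (u, v), n in edges.items()})


def common_edges(edges_a, edges_b):
    """Number of edges present in both graphs."""
    return len(edges_a.keys() & edges_b.keys())


def contains_fragment(doc_edges, frag_edges):
    """Assumed containment test: each fragment edge occurs in the
    document graph at least as often as in the fragment graph."""
    return all(doc_edges[e] >= n for e, n in frag_edges.items())


doc1 = edge_counts("cat sat mat cat ran".split(), k=1)
doc2 = edge_counts("dog sat mat dog ran".split(), k=1)

print(to_pseudo_words(doc1)["cat->cat"])  # 2
print(common_edges(doc1, doc2))           # 4

repository = {"doc1": doc1, "doc2": doc2}
query = edge_counts("mat cat".split(), k=1)
hits = [name for name, g in repository.items() if contains_fragment(g, query)]
print(hits)                               # ['doc1']
```

Because the pseudo-words are ordinary term counts, they could be fed to any bag-of-words tool. Note that the containment test is only a filter: a document graph can contain every edge of a fragment without the fragment appearing verbatim in the text.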
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US12/796266 | 2010-06-08 | ||
US12/796,266 US8375061B2 (en) | 2010-06-08 | 2010-06-08 | Graphical models for representing text documents for computer analysis |
Publications (2)
Publication Number | Publication Date |
---|---|
JP2011258184A (ja) | 2011-12-22 |
JP5995409B2 (ja) | 2016-09-21 |
Family
ID=45065290
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2011096300A Active JP5995409B2 (ja) | 2010-06-08 | 2011-04-22 | Graphical models for representing text documents for computer analysis |
Country Status (3)
Country | Link |
---|---|
US (1) | US8375061B2 (ja) |
JP (1) | JP5995409B2 (ja) |
KR (1) | KR101790793B1 (ja) |
- 2010-06-08: US application US12/796,266, patent US8375061B2 (Active)
- 2011-04-22: JP application JP2011096300A, patent JP5995409B2 (Active)
- 2011-06-07: KR application KR1020110054634A, patent KR101790793B1 (IP Right Grant)
Also Published As
Publication number | Publication date |
---|---|
KR20110134314A (ko) | 2011-12-14 |
JP2011258184A (ja) | 2011-12-22 |
KR101790793B1 (ko) | 2017-10-26 |
US8375061B2 (en) | 2013-02-12 |
US20110302168A1 (en) | 2011-12-08 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
2014-01-10 | A621 | Written request for application examination | Free format text: JAPANESE INTERMEDIATE CODE: A621 |
2014-05-14 | A977 | Report on retrieval | Free format text: JAPANESE INTERMEDIATE CODE: A971007 |
2014-06-03 | A131 | Notification of reasons for refusal | Free format text: JAPANESE INTERMEDIATE CODE: A131 |
2014-06-03 | RD12 | Notification of acceptance of power of sub attorney | Free format text: JAPANESE INTERMEDIATE CODE: A7432 |
2014-06-04 | A521 | Written amendment | Free format text: JAPANESE INTERMEDIATE CODE: A821 |
2014-08-21 | A521 | Written amendment | Free format text: JAPANESE INTERMEDIATE CODE: A523 |
2015-03-03 | A02 | Decision of refusal | Free format text: JAPANESE INTERMEDIATE CODE: A02 |
2015-06-12 | A521 | Written amendment | Free format text: JAPANESE INTERMEDIATE CODE: A523 |
2015-06-19 | A911 | Transfer to examiner for re-examination before appeal (zenchi) | Free format text: JAPANESE INTERMEDIATE CODE: A911 |
2015-08-21 | A912 | Re-examination (zenchi) completed and case transferred to appeal board | Free format text: JAPANESE INTERMEDIATE CODE: A912 |
2016-04-26 | A521 | Written amendment | Free format text: JAPANESE INTERMEDIATE CODE: A523 |
2016-08-02 | RD14 | Notification of resignation of power of sub attorney | Free format text: JAPANESE INTERMEDIATE CODE: A7434 |
2016-08-23 | A61 | First payment of annual fees (during grant procedure) | Free format text: JAPANESE INTERMEDIATE CODE: A61 |
 | R150 | Certificate of patent or registration of utility model | Ref document number: 5995409; Country of ref document: JP; Free format text: JAPANESE INTERMEDIATE CODE: R150 |