JP5795743B2 - 適応的重み付けを用いた様々な文書間類似度計算方法に基づいた文書比較方法および文書比較システム - Google Patents
適応的重み付けを用いた様々な文書間類似度計算方法に基づいた文書比較方法および文書比較システム Download PDFInfo
- Publication number
- JP5795743B2 JP5795743B2 JP2012045250A JP2012045250A JP5795743B2 JP 5795743 B2 JP5795743 B2 JP 5795743B2 JP 2012045250 A JP2012045250 A JP 2012045250A JP 2012045250 A JP2012045250 A JP 2012045250A JP 5795743 B2 JP5795743 B2 JP 5795743B2
- Authority
- JP
- Japan
- Prior art keywords
- document
- inter
- weight
- similarity calculation
- documents
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/3331—Query processing
- G06F16/334—Query execution
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Computational Linguistics (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US13/073,836 US8612457B2 (en) | 2011-03-28 | 2011-03-28 | Method and system for comparing documents based on different document-similarity calculation methods using adaptive weighting |
| US13/073,836 | 2011-03-28 |
Publications (3)
| Publication Number | Publication Date |
|---|---|
| JP2012208924A JP2012208924A (ja) | 2012-10-25 |
| JP2012208924A5 JP2012208924A5 (enExample) | 2015-04-16 |
| JP5795743B2 true JP5795743B2 (ja) | 2015-10-14 |
Family
ID=45928682
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP2012045250A Expired - Fee Related JP5795743B2 (ja) | 2011-03-28 | 2012-03-01 | 適応的重み付けを用いた様々な文書間類似度計算方法に基づいた文書比較方法および文書比較システム |
Country Status (4)
| Country | Link |
|---|---|
| US (1) | US8612457B2 (enExample) |
| EP (1) | EP2506167A1 (enExample) |
| JP (1) | JP5795743B2 (enExample) |
| KR (1) | KR101935765B1 (enExample) |
Families Citing this family (11)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US8880530B2 (en) * | 2011-04-18 | 2014-11-04 | Palo Alto Research Center Incorporated | Method for searching related documents based on and guided by meaningful entities |
| US11308037B2 (en) | 2012-10-30 | 2022-04-19 | Google Llc | Automatic collaboration |
| US9721019B2 (en) * | 2012-12-10 | 2017-08-01 | Aol Inc. | Systems and methods for providing personalized recommendations for electronic content |
| EP3215944B1 (en) | 2014-11-03 | 2021-07-07 | Vectra AI, Inc. | A system for implementing threat detection using daily network traffic community outliers |
| WO2016073383A1 (en) | 2014-11-03 | 2016-05-12 | Vectra Networks, Inc. | A system for implementing threat detection using threat and risk assessment of asset-actor interactions |
| JP2017142640A (ja) * | 2016-02-10 | 2017-08-17 | 日本電信電話株式会社 | 類似文書推薦システム、類似文書推薦方法および類似文書推薦プログラム |
| KR101866411B1 (ko) * | 2016-09-06 | 2018-06-19 | 한양대학교 산학협력단 | 문서 추천 정보를 제공하는 방법 및 이를 이용하는 문서 추천 정보 제공 장치 |
| CN108182598A (zh) * | 2017-12-27 | 2018-06-19 | 东软集团股份有限公司 | 用户价值分类方法和装置 |
| US11410130B2 (en) * | 2017-12-27 | 2022-08-09 | International Business Machines Corporation | Creating and using triplet representations to assess similarity between job description documents |
| US12072897B2 (en) * | 2021-02-23 | 2024-08-27 | Sae International | Similarity searching across digital standards |
| TWI837682B (zh) * | 2022-05-27 | 2024-04-01 | 宜鼎國際股份有限公司 | 專案管理系統、專案管理方法和電子裝置的儲存媒體 |
Family Cites Families (21)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JPH09212509A (ja) * | 1996-02-05 | 1997-08-15 | Oki Electric Ind Co Ltd | 単文類似度計算装置 |
| US6289342B1 (en) | 1998-01-05 | 2001-09-11 | Nec Research Institute, Inc. | Autonomous citation indexing and literature browsing using citation context |
| JP3503506B2 (ja) * | 1999-01-06 | 2004-03-08 | 日本電信電話株式会社 | 情報検索方法、情報検索装置及び情報検索プログラムを記録した記録媒体 |
| US6922699B2 (en) * | 1999-01-26 | 2005-07-26 | Xerox Corporation | System and method for quantitatively representing data objects in vector space |
| US6941321B2 (en) * | 1999-01-26 | 2005-09-06 | Xerox Corporation | System and method for identifying similarities among objects in a collection |
| JP3690216B2 (ja) * | 1999-11-26 | 2005-08-31 | 日本電気株式会社 | 文書間類似度計算方法及びシステムと装置ならびに類似度計算用プログラムを記録した記録媒体 |
| US7080073B1 (en) * | 2000-08-18 | 2006-07-18 | Firstrain, Inc. | Method and apparatus for focused crawling |
| US7120868B2 (en) | 2002-05-30 | 2006-10-10 | Microsoft Corp. | System and method for adaptive document layout via manifold content |
| JP3918531B2 (ja) | 2001-11-29 | 2007-05-23 | 株式会社日立製作所 | 類似文書検索方法およびシステム |
| US20040181527A1 (en) * | 2003-03-11 | 2004-09-16 | Lockheed Martin Corporation | Robust system for interactively learning a string similarity measurement |
| US7533094B2 (en) | 2004-11-23 | 2009-05-12 | Microsoft Corporation | Method and system for determining similarity of items based on similarity objects and their features |
| US20070005588A1 (en) * | 2005-07-01 | 2007-01-04 | Microsoft Corporation | Determining relevance using queries as surrogate content |
| US7801392B2 (en) * | 2005-07-21 | 2010-09-21 | Fuji Xerox Co., Ltd. | Image search system, image search method, and storage medium |
| JP2007172077A (ja) * | 2005-12-19 | 2007-07-05 | Fuji Xerox Co Ltd | 画像検索システム及び方法及びプログラム |
| US7472131B2 (en) | 2005-12-12 | 2008-12-30 | Justsystems Evans Research, Inc. | Method and apparatus for constructing a compact similarity structure and for using the same in analyzing document relevance |
| US7689559B2 (en) * | 2006-02-08 | 2010-03-30 | Telenor Asa | Document similarity scoring and ranking method, device and computer program product |
| US7562088B2 (en) * | 2006-12-27 | 2009-07-14 | Sap Ag | Structure extraction from unstructured documents |
| US20080162455A1 (en) * | 2006-12-27 | 2008-07-03 | Rakshit Daga | Determination of document similarity |
| US20080215571A1 (en) * | 2007-03-01 | 2008-09-04 | Microsoft Corporation | Product review search |
| US7996390B2 (en) | 2008-02-15 | 2011-08-09 | The University Of Utah Research Foundation | Method and system for clustering identified forms |
| US9268851B2 (en) * | 2010-04-29 | 2016-02-23 | International Business Machines Corporation | Ranking information content based on performance data of prior users of the information content |
-
2011
- 2011-03-28 US US13/073,836 patent/US8612457B2/en active Active
-
2012
- 2012-03-01 JP JP2012045250A patent/JP5795743B2/ja not_active Expired - Fee Related
- 2012-03-21 EP EP12160482A patent/EP2506167A1/en not_active Ceased
- 2012-03-26 KR KR1020120030377A patent/KR101935765B1/ko not_active Expired - Fee Related
Also Published As
| Publication number | Publication date |
|---|---|
| KR101935765B1 (ko) | 2019-01-08 |
| EP2506167A1 (en) | 2012-10-03 |
| US20120254165A1 (en) | 2012-10-04 |
| KR20120110035A (ko) | 2012-10-09 |
| JP2012208924A (ja) | 2012-10-25 |
| US8612457B2 (en) | 2013-12-17 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| JP5795743B2 (ja) | 適応的重み付けを用いた様々な文書間類似度計算方法に基づいた文書比較方法および文書比較システム | |
| US11836576B2 (en) | Distributed machine learning at edge nodes | |
| US11004012B2 (en) | Assessment of machine learning performance with limited test data | |
| GB2532542A (en) | Risk quantification for policy deployment | |
| CN109214436A (zh) | 一种针对目标场景的预测模型训练方法及装置 | |
| US11676060B2 (en) | Digital content interaction prediction and training that addresses imbalanced classes | |
| US20190138899A1 (en) | Processing apparatus, processing method, and nonvolatile recording medium | |
| US20160148246A1 (en) | Automated System for Safe Policy Improvement | |
| CN108122168A (zh) | 社交活动网络中种子节点筛选方法和装置 | |
| WO2019220653A1 (ja) | 因果関係推定装置、因果関係推定方法および因果関係推定プログラム | |
| CN111079944A (zh) | 迁移学习模型解释实现方法及装置、电子设备、存储介质 | |
| JP7573548B2 (ja) | 特徴ベクトル実現可能性推定 | |
| JP2019016324A (ja) | 予測装置、予測方法および予測プログラム | |
| US11574181B2 (en) | Fusion of neural networks | |
| US20170213236A1 (en) | Estimation of Causal Impact of Digital Marketing Content | |
| CN111400512B (zh) | 一种筛选多媒体资源的方法及装置 | |
| CN113361719A (zh) | 基于图像处理模型的增量学习方法和图像处理方法 | |
| JP6203313B2 (ja) | 特徴選択装置、特徴選択方法およびプログラム | |
| US9654365B2 (en) | Selection of message passing collectives in presence of system noise | |
| JP2016207136A (ja) | モデル推定システム、モデル推定方法およびモデル推定プログラム | |
| US10360509B2 (en) | Apparatus and method for generating an optimal set of choices | |
| US20210241346A1 (en) | Systems for Generating Recommendations | |
| US20210216374A1 (en) | Cluster update accelerator circuit | |
| CN114116151A (zh) | 一种基于先验知识的大数据框架配置参数优化方法 | |
| JP7687518B2 (ja) | 推定装置 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| RD04 | Notification of resignation of power of attorney |
Free format text: JAPANESE INTERMEDIATE CODE: A7424 Effective date: 20131011 |
|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20150226 |
|
| A621 | Written request for application examination |
Free format text: JAPANESE INTERMEDIATE CODE: A621 Effective date: 20150226 |
|
| A871 | Explanation of circumstances concerning accelerated examination |
Free format text: JAPANESE INTERMEDIATE CODE: A871 Effective date: 20150226 |
|
| A975 | Report on accelerated examination |
Free format text: JAPANESE INTERMEDIATE CODE: A971005 Effective date: 20150416 |
|
| A131 | Notification of reasons for refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A131 Effective date: 20150519 |
|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20150624 |
|
| TRDD | Decision of grant or rejection written | ||
| A01 | Written decision to grant a patent or to grant a registration (utility model) |
Free format text: JAPANESE INTERMEDIATE CODE: A01 Effective date: 20150721 |
|
| A61 | First payment of annual fees (during grant procedure) |
Free format text: JAPANESE INTERMEDIATE CODE: A61 Effective date: 20150814 |
|
| R150 | Certificate of patent or registration of utility model |
Ref document number: 5795743 Country of ref document: JP Free format text: JAPANESE INTERMEDIATE CODE: R150 |
|
| R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
| R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
| R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
| R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
| R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
| LAPS | Cancellation because of no payment of annual fees |