JP5753473B2 - Method for detecting duplicate document content using two-dimensional visual fingerprints - Google Patents

Method for detecting duplicate document content using two-dimensional visual fingerprints

Info

Publication number
JP5753473B2
JP5753473B2 JP2011227416A JP2011227416A JP5753473B2 JP 5753473 B2 JP5753473 B2 JP 5753473B2 JP 2011227416 A JP2011227416 A JP 2011227416A JP 2011227416 A JP2011227416 A JP 2011227416A JP 5753473 B2 JP5753473 B2 JP 5753473B2
Authority
JP
Japan
Prior art keywords
fingerprint
document
page
image
keypoint
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
JP2011227416A
Other languages
English (en)
Japanese (ja)
Other versions
JP2012089132A (ja)
JP2012089132A5
Inventor
ドロン・クレッター
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Palo Alto Research Center Inc
Original Assignee
Palo Alto Research Center Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Palo Alto Research Center Inc
Publication of JP2012089132A
Publication of JP2012089132A5
Application granted
Publication of JP5753473B2
Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00 Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/40 Document-oriented image-based pattern recognition
    • G06V30/41 Analysis of document content
    • G06V30/418 Document matching, e.g. of document images
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00 Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/40 Document-oriented image-based pattern recognition
    • G06V30/41 Analysis of document content
    • G06V30/414 Extracting the geometrical structure, e.g. layout tree; Block segmentation, e.g. bounding boxes for graphics or text

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • General Physics & Mathematics (AREA)
  • Geometry (AREA)
  • Computer Graphics (AREA)
  • Editing Of Facsimile Originals (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Document Processing Apparatus (AREA)
  • Image Analysis (AREA)
  • Image Processing (AREA)
JP2011227416A 2010-10-19 2011-10-14 Method for detecting duplicate document content using two-dimensional visual fingerprints Active JP5753473B2 (ja)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US12/907,226 2010-10-19
US12/907,226 US8750624B2 (en) 2010-10-19 2010-10-19 Detection of duplicate document content using two-dimensional visual fingerprinting

Publications (3)

Publication Number Publication Date
JP2012089132A (ja) 2012-05-10
JP2012089132A5 2014-11-27
JP5753473B2 (ja) 2015-07-22

Family

ID=44785568

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2011227416A Active JP5753473B2 (ja) 2010-10-19 2011-10-14 Method for detecting duplicate document content using two-dimensional visual fingerprints

Country Status (3)

Country Link
US (1) US8750624B2
EP (1) EP2444920B1
JP (1) JP5753473B2

Families Citing this family (29)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8943144B2 (en) * 2009-12-10 2015-01-27 International Business Machines Corporation Consolidating duplicate messages for a single destination on a computer network
US8554021B2 (en) 2010-10-19 2013-10-08 Palo Alto Research Center Incorporated Finding similar content in a mixed collection of presentation and rich document content using two-dimensional visual fingerprints
US8527516B1 (en) * 2011-02-25 2013-09-03 Google Inc. Identifying similar digital text volumes
US10685234B2 (en) 2012-03-31 2020-06-16 Xerox Corporation Automatic and semi-automatic metadata generation via inheritance in homogeneous and heterogeneous environments
US8838657B1 (en) 2012-09-07 2014-09-16 Amazon Technologies, Inc. Document fingerprints using block encoding of text
US9773039B2 (en) * 2012-09-14 2017-09-26 Fti Consulting, Inc. Computer-implemented system and method for identifying near duplicate documents
TWI512642B (zh) * 2013-01-25 2015-12-11 Delta Electronics Inc Fast graphic comparison method
US8958651B2 (en) * 2013-05-30 2015-02-17 Seiko Epson Corporation Tree-model-based stereo matching
US10303682B2 (en) 2013-09-21 2019-05-28 Oracle International Corporation Automatic verification and triage of query results
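JP6232940B2 (ja) * 2013-11-01 2017-11-22 Fuji Xerox Co., Ltd. Image information processing apparatus and program
CN103714345B (zh) * 2013-12-27 2018-04-06 TCL Group Co., Ltd. Method and system for detecting the spatial position of a fingertip using binocular stereo vision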
US9886422B2 (en) * 2014-08-06 2018-02-06 International Business Machines Corporation Dynamic highlighting of repetitions in electronic documents
US9805099B2 (en) * 2014-10-30 2017-10-31 The Johns Hopkins University Apparatus and method for efficient identification of code similarity
CN105787415B (zh) * 2014-12-18 2020-04-07 Fujitsu Limited Document image processing apparatus, method, and scanner
US9836435B2 (en) 2015-03-19 2017-12-05 International Business Machines Corporation Embedded content suitability scoring
RU2613848C2 (ru) * 2015-09-16 2017-03-21 ABBYY Development LLC Detecting "fuzzy" image duplicates using triples of adjacent evaluated features
WO2017210618A1 (en) 2016-06-02 2017-12-07 Fti Consulting, Inc. Analyzing clusters of coded documents
US10229315B2 (en) * 2016-07-27 2019-03-12 Intuit, Inc. Identification of duplicate copies of a form in a document
US20180101540A1 (en) * 2016-10-10 2018-04-12 Facebook, Inc. Diversifying Media Search Results on Online Social Networks
US10083353B2 (en) 2016-10-28 2018-09-25 Intuit Inc. Identifying document forms using digital fingerprints
US10372813B2 (en) 2017-01-17 2019-08-06 International Business Machines Corporation Selective content dissemination
US10318563B2 (en) 2017-08-23 2019-06-11 Lead Technologies, Inc. Apparatus, method, and computer-readable medium for recognition of a digital document
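CN109697231A (zh) * 2017-10-24 2019-04-30 Beijing Gridsum Technology Co., Ltd. Display method, system, storage medium, and processor for case documents
CN108664900B (zh) * 2018-04-20 2022-05-27 Shanghai Zhangmen Science and Technology Co., Ltd. Method and device for identifying similarities and differences between written works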
US20200019583A1 (en) * 2018-07-11 2020-01-16 University Of Southern California Systems and methods for automated repair of webpages
CN109471921A (zh) * 2018-11-23 2019-03-15 Shenzhen Launch Technology Co., Ltd. Text duplicate-checking method, apparatus, and device
DE112020006703T5 (de) * 2020-02-10 2022-12-15 Mitsubishi Electric Corporation Display image data editing program, display image data editing device, and display image data editing method
JP7400543B2 (ja) * 2020-02-28 2023-12-19 FUJIFILM Business Innovation Corp. Information processing apparatus and program
CN113297888B (zh) * 2020-09-18 2024-06-07 Alibaba Group Holding Limited Method and device for verifying image content detection results

Family Cites Families (35)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0589190A (ja) * 1991-09-27 1993-04-09 Meidensha Corp Drawing information check method
CA2077274C (en) 1991-11-19 1997-07-15 M. Margaret Withgott Method and apparatus for summarizing a document without document image decoding
US5465303A (en) 1993-11-12 1995-11-07 Aeroflex Systems Corporation Automated fingerprint classification/identification system and method
US5465353A (en) 1994-04-01 1995-11-07 Ricoh Company, Ltd. Image matching and retrieval by multi-access redundant hashing
US5613014A (en) 1994-10-12 1997-03-18 Martin Marietta Corp. Fingerprint matching system
US5987171A (en) * 1994-11-10 1999-11-16 Canon Kabushiki Kaisha Page analysis system
JP3647075B2 (ja) * 1995-01-26 2005-05-11 Canon Inc. Image retrieval method and apparatus therefor
US5850476A (en) 1995-12-14 1998-12-15 Xerox Corporation Automatic method of identifying drop words in a document image without performing character recognition
US5893908A (en) 1996-11-21 1999-04-13 Ricoh Company Limited Document management system
US6041133A (en) 1996-12-13 2000-03-21 International Business Machines Corporation Method and apparatus for fingerprint matching using transformation parameter clustering based on local feature correspondences
US7844594B1 (en) 1999-06-18 2010-11-30 Surfwax, Inc. Information search, retrieval and distillation into knowledge objects
US20060259524A1 (en) 2003-03-17 2006-11-16 Horton D T Systems and methods for document project management, conversion, and filing
US7359532B2 (en) 2003-12-11 2008-04-15 Intel Corporation Fingerprint minutiae matching using scoring techniques
US7702673B2 (en) 2004-10-01 2010-04-20 Ricoh Co., Ltd. System and methods for creation and use of a mixed media environment
JP4517822B2 (ja) * 2004-11-05 2010-08-04 Fuji Xerox Co., Ltd. Image processing apparatus and program
US20060104484A1 (en) 2004-11-16 2006-05-18 Bolle Rudolf M Fingerprint biometric machine representations based on triangles
US20060120686A1 (en) * 2004-12-03 2006-06-08 Frank Liebenow Method, apparatus and system for storage and retrieval of images
JP4533187B2 (ja) * 2005-03-01 2010-09-01 Canon Inc. Image processing apparatus and control method therefor
US7519200B2 (en) 2005-05-09 2009-04-14 Like.Com System and method for enabling the use of captured images through recognition
US7403932B2 (en) 2005-07-01 2008-07-22 The Boeing Company Text differentiation methods, systems, and computer program products for content analysis
US7801392B2 (en) 2005-07-21 2010-09-21 Fuji Xerox Co., Ltd. Image search system, image search method, and storage medium
US8103050B2 (en) 2006-01-16 2012-01-24 Thomson Licensing Method for computing a fingerprint of a video sequence
DE602006014803D1 (de) 2006-04-28 2010-07-22 Eidgenoess Tech Hochschule Robust detector and descriptor for an interest point
US8244036B2 (en) 2007-01-24 2012-08-14 Bluebeam Software, Inc. Method for emphasizing differences in graphical appearance between an original document and a modified document with annotations
US8055079B2 (en) 2007-03-06 2011-11-08 Sharp Kabushiki Kaisha Image processing method, image processing apparatus, and image forming apparatus
US8972299B2 (en) 2008-01-07 2015-03-03 Bally Gaming, Inc. Methods for biometrically identifying a player
US8233722B2 (en) 2008-06-27 2012-07-31 Palo Alto Research Center Incorporated Method and system for finding a document image in a document collection using localized two-dimensional visual fingerprints
US8233716B2 (en) 2008-06-27 2012-07-31 Palo Alto Research Center Incorporated System and method for finding stable keypoints in a picture image using localized scale space properties
US8144947B2 (en) 2008-06-27 2012-03-27 Palo Alto Research Center Incorporated System and method for finding a picture image in an image collection using localized two-dimensional visual fingerprints
US8520941B2 (en) * 2008-12-09 2013-08-27 Xerox Corporation Method and system for document image classification
US8548193B2 (en) 2009-09-03 2013-10-01 Palo Alto Research Center Incorporated Method and apparatus for navigating an electronic magnifier over a target document
US8380588B2 (en) 2010-01-14 2013-02-19 Oracle International Corporation Side-by-side comparison of associations for multi-level bills of material
US9514103B2 (en) 2010-02-05 2016-12-06 Palo Alto Research Center Incorporated Effective system and method for visual document comparison using localized two-dimensional visual fingerprints
US8086039B2 (en) 2010-02-05 2011-12-27 Palo Alto Research Center Incorporated Fine-grained visual document fingerprinting for accurate document comparison and retrieval
US8554021B2 (en) 2010-10-19 2013-10-08 Palo Alto Research Center Incorporated Finding similar content in a mixed collection of presentation and rich document content using two-dimensional visual fingerprints

Also Published As

Publication number Publication date
EP2444920A2 (en) 2012-04-25
US8750624B2 (en) 2014-06-10
JP2012089132A (ja) 2012-05-10
US20120093421A1 (en) 2012-04-19
EP2444920A3 (en) 2013-04-03
EP2444920B1 (en) 2019-08-21

Similar Documents

Publication Publication Date Title
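JP5753473B2 (ja) Method for detecting duplicate document content using two-dimensional visual fingerprints
JP5183578B2 (ja) Method and system for finding a document image in a document collection using localized two-dimensional visual fingerprints
JP5662917B2 (ja) Method for finding similar content in a mixed collection of presentation and rich document content using two-dimensional visual fingerprints
JP5180156B2 (ja) System and method for finding a picture image in an image collection using localized two-dimensional visual fingerprints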
US8825682B2 (en) Architecture for mixed media reality retrieval of locations and registration of images
US8510283B2 (en) Automatic adaption of an image recognition system to image capture devices
US8856108B2 (en) Combining results of image retrieval processes
US8868555B2 (en) Computation of a recognizability score (quality predictor) for image retrieval
US8086039B2 (en) Fine-grained visual document fingerprinting for accurate document comparison and retrieval
US8498487B2 (en) Content-based matching of videos using local spatio-temporal fingerprints
US9020966B2 (en) Client device for interacting with a mixed media reality recognition system
US20130346431A1 (en) Monitoring and Analyzing Creation and Usage of Visual Content
US9171203B2 (en) Scanbox
US8139860B2 (en) Retrieving and sharing electronic documents using paper
Prathima et al. A novel framework for handling duplicate images using hashing techniques

Legal Events

Date Code Title Description
RD04 Notification of resignation of power of attorney

Free format text: JAPANESE INTERMEDIATE CODE: A7424

Effective date: 20130621

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20141009

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20141009

A871 Explanation of circumstances concerning accelerated examination

Free format text: JAPANESE INTERMEDIATE CODE: A871

Effective date: 20141009

A975 Report on accelerated examination

Free format text: JAPANESE INTERMEDIATE CODE: A971005

Effective date: 20141205

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20150113

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20150408

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20150428

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20150522

R150 Certificate of patent or registration of utility model

Ref document number: 5753473

Country of ref document: JP

Free format text: JAPANESE INTERMEDIATE CODE: R150

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250