JP5364578B2 - Methods and systems for transductive data classification, and data classification methods using machine learning techniques - Google Patents

Methods and systems for transductive data classification, and data classification methods using machine learning techniques

Info

Publication number
JP5364578B2
JP5364578B2 JP2009519439A JP2009519439A JP5364578B2 JP 5364578 B2 JP5364578 B2 JP 5364578B2 JP 2009519439 A JP2009519439 A JP 2009519439A JP 2009519439 A JP2009519439 A JP 2009519439A JP 5364578 B2 JP5364578 B2 JP 5364578B2
Authority
JP
Japan
Prior art keywords
unlabeled
data
labeled
label
document
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
JP2009519439A
Other languages
English (en)
Japanese (ja)
Other versions
JP2009543254A (ja)
Inventor
Mauritius A. R. Schmidtler
Christopher K. Harris
Roland Borrey
Anthony Sarah
Nicola Caruso
Original Assignee
Kofax, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US11/752,691 (US20080086432A1)
Priority claimed from US11/752,673 (US7958067B2)
Priority claimed from US11/752,634 (US7761391B2)
Priority claimed from US11/752,719 (US7937345B2)
Application filed by Kofax, Inc.
Publication of JP2009543254A
Application granted
Publication of JP5364578B2
Current legal status: Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00 Machine learning
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/21 Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00 Machine learning
    • G06N20/10 Machine learning using kernel methods, e.g. support vector machines [SVM]
    • Y GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02 TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02A TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
    • Y02A90/00 Technologies having an indirect contribution to adaptation to climate change
    • Y02A90/10 Information and communication technologies [ICT] supporting adaptation to climate change, e.g. for weather forecasting or climate simulation

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Software Systems (AREA)
  • Data Mining & Analysis (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Computation (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Medical Informatics (AREA)
  • Mathematical Physics (AREA)
  • Computing Systems (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Evolutionary Biology (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Image Analysis (AREA)
  • Sorting Of Articles (AREA)
JP2009519439A 2006-07-12 2007-06-07 Methods and systems for transductive data classification, and data classification methods using machine learning techniques Active JP5364578B2 (ja)

Applications Claiming Priority (11)

Application Number Priority Date Filing Date Title
US83031106P 2006-07-12 2006-07-12
US60/830,311 2006-07-12
US11/752,691 US20080086432A1 (en) 2006-07-12 2007-05-23 Data classification methods using machine learning techniques
US11/752,691 2007-05-23
US11/752,634 2007-05-23
US11/752,673 US7958067B2 (en) 2006-07-12 2007-05-23 Data classification methods using machine learning techniques
US11/752,634 US7761391B2 (en) 2006-07-12 2007-05-23 Methods and systems for improved transductive maximum entropy discrimination classification
US11/752,719 US7937345B2 (en) 2006-07-12 2007-05-23 Data classification methods using machine learning techniques
US11/752,719 2007-05-23
US11/752,673 2007-05-23
PCT/US2007/013484 WO2008008142A2 (en) 2006-07-12 2007-06-07 Machine learning techniques and transductive data classification

Publications (2)

Publication Number Publication Date
JP2009543254A (ja) 2009-12-03
JP5364578B2 (ja) 2013-12-11

Family

ID=38923733

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2009519439A Active JP5364578B2 (ja) 2006-07-12 2007-06-07 Methods and systems for transductive data classification, and data classification methods using machine learning techniques

Country Status (3)

Country Link
EP (1) EP1924926A4 (de)
JP (1) JP5364578B2 (de)
WO (1) WO2008008142A2 (de)

Families Citing this family (42)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9769354B2 (en) 2005-03-24 2017-09-19 Kofax, Inc. Systems and methods of processing scanned data
US9137417B2 (en) 2005-03-24 2015-09-15 Kofax, Inc. Systems and methods for processing video data
US7937345B2 (en) 2006-07-12 2011-05-03 Kofax, Inc. Data classification methods using machine learning techniques
US7958067B2 (en) 2006-07-12 2011-06-07 Kofax, Inc. Data classification methods using machine learning techniques
US8190868B2 (en) 2006-08-07 2012-05-29 Webroot Inc. Malware management through kernel detection
CN102160066A (zh) * 2008-06-24 2011-08-17 Sharon Belenzon Search engine and method particularly suited to patent documents
US9349046B2 (en) 2009-02-10 2016-05-24 Kofax, Inc. Smart optical input/output (I/O) extension for context-dependent workflows
US9767354B2 (en) 2009-02-10 2017-09-19 Kofax, Inc. Global geographic information retrieval, validation, and normalization
US9576272B2 (en) 2009-02-10 2017-02-21 Kofax, Inc. Systems, methods and computer program products for determining document validity
US8774516B2 (en) 2009-02-10 2014-07-08 Kofax, Inc. Systems, methods and computer program products for determining document validity
US8958605B2 (en) 2009-02-10 2015-02-17 Kofax, Inc. Systems, methods and computer program products for determining document validity
US8438386B2 (en) * 2009-04-21 2013-05-07 Webroot Inc. System and method for developing a risk profile for an internet service
US11489857B2 (en) 2009-04-21 2022-11-01 Webroot Inc. System and method for developing a risk profile for an internet resource
US10146795B2 (en) 2012-01-12 2018-12-04 Kofax, Inc. Systems and methods for mobile image capture and processing
US9058580B1 (en) 2012-01-12 2015-06-16 Kofax, Inc. Systems and methods for identification document processing and business workflow integration
US9058515B1 (en) 2012-01-12 2015-06-16 Kofax, Inc. Systems and methods for identification document processing and business workflow integration
US9165188B2 (en) 2012-01-12 2015-10-20 Kofax, Inc. Systems and methods for mobile image capture and processing
US9483794B2 (en) 2012-01-12 2016-11-01 Kofax, Inc. Systems and methods for identification document processing and business workflow integration
US9208536B2 (en) 2013-09-27 2015-12-08 Kofax, Inc. Systems and methods for three dimensional geometric reconstruction of captured image data
US9355312B2 (en) 2013-03-13 2016-05-31 Kofax, Inc. Systems and methods for classifying objects in digital images captured using mobile devices
EP2973226A4 (de) 2013-03-13 2016-06-29 Kofax Inc Classifying objects in digital images captured using mobile devices
US20140316841A1 (en) 2013-04-23 2014-10-23 Kofax, Inc. Location-based workflows and services
EP2992481A4 (de) 2013-05-03 2017-02-22 Kofax, Inc. Systems and methods for detecting and classifying objects in video captured using mobile devices
WO2015031449A1 (en) * 2013-08-30 2015-03-05 3M Innovative Properties Company Method of classifying medical documents
US9386235B2 (en) 2013-11-15 2016-07-05 Kofax, Inc. Systems and methods for generating composite images of long documents using mobile video data
US9760788B2 (en) 2014-10-30 2017-09-12 Kofax, Inc. Mobile document detection and orientation based on reference object characteristics
CN104700099B (zh) * 2015-03-31 2017-08-11 Baidu Online Network Technology (Beijing) Co., Ltd. Method and device for recognizing traffic signs
US10242285B2 (en) 2015-07-20 2019-03-26 Kofax, Inc. Iterative recognition-guided thresholding and data extraction
US11550688B2 (en) 2015-10-29 2023-01-10 Micro Focus Llc User interaction logic classification
US10339193B1 (en) * 2015-11-24 2019-07-02 Google Llc Business change detection from street level imagery
US9779296B1 (en) 2016-04-01 2017-10-03 Kofax, Inc. Content-based detection and three dimensional geometric reconstruction of objects in image and video data
JP6973733B2 (ja) * 2017-11-07 2021-12-01 株式会社アイ・アール・ディー Patent information processing device, patent information processing method, and program
US11062176B2 (en) 2017-11-30 2021-07-13 Kofax, Inc. Object detection and image cropping using a multi-detector approach
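JP7024515B2 (ja) 2018-03-09 2022-02-24 Fujitsu Limited Learning program, learning method, and learning device
JP7079483B2 (ja) * 2018-06-18 2022-06-02 National Institute of Advanced Industrial Science and Technology Information processing method, system, and program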
WO2020065611A1 (en) * 2018-09-28 2020-04-02 Element Ai Inc. Recommendation method and system and method and system for improving a machine learning system
US11880396B2 (en) 2018-10-08 2024-01-23 Arctic Alliance Europe Oy Method and system to perform text-based search among plurality of documents
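KR102033136B1 (ko) * 2019-04-03 2019-10-16 Lunit Inc. Machine learning method based on semi-supervised learning, and apparatus therefor
WO2020231188A1 (ko) * 2019-05-13 2020-11-19 Samsung Electronics Co., Ltd. Method for verifying classification results using a verification neural network, method for learning classification results, and computing device performing the methods
CN113240025B (zh) * 2021-05-19 2022-08-12 University of Electronic Science and Technology of China Image classification method based on Bayesian neural network weight constraints
JP2023144562A (ja) 2022-03-28 2023-10-11 Fujitsu Limited Machine learning program, data processing program, information processing device, machine learning method, and data processing method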
WO2024113266A1 (en) * 2022-11-30 2024-06-06 Paypal, Inc. Use of a training framework of a multi-class model to train a multi-label model

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7376635B1 (en) * 2000-07-21 2008-05-20 Ford Global Technologies, Llc Theme-based system and method for classifying documents
AU2002305652A1 (en) * 2001-05-18 2002-12-03 Biowulf Technologies, Llc Methods for feature selection in a learning machine
US7702526B2 (en) 2002-01-24 2010-04-20 George Mason Intellectual Properties, Inc. Assessment of episodes of illness
US7184929B2 (en) * 2004-01-28 2007-02-27 Microsoft Corporation Exponential priors for maximum entropy models
US7492943B2 (en) * 2004-10-29 2009-02-17 George Mason Intellectual Properties, Inc. Open set recognition using transduction

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
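KR20160066926A (ko) * 2014-12-03 2016-06-13 Samsung Electronics Co., Ltd. Method and apparatus for data classification, and method and apparatus for region-of-interest segmentation
KR102315574B1 (ko) 2014-12-03 2021-10-20 Samsung Electronics Co., Ltd. Method and apparatus for data classification, and method and apparatus for region-of-interest segmentation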

Also Published As

Publication number Publication date
WO2008008142A2 (en) 2008-01-17
JP2009543254A (ja) 2009-12-03
WO2008008142A3 (en) 2008-12-04
EP1924926A4 (de) 2016-08-17
EP1924926A2 (de) 2008-05-28

Similar Documents

Publication Publication Date Title
JP5364578B2 (ja) Methods and systems for transductive data classification, and data classification methods using machine learning techniques
US7937345B2 (en) Data classification methods using machine learning techniques
US7958067B2 (en) Data classification methods using machine learning techniques
US7761391B2 (en) Methods and systems for improved transductive maximum entropy discrimination classification
US20080086432A1 (en) Data classification methods using machine learning techniques
US11704552B2 (en) Task detection in communications using domain adaptation
US11330009B2 (en) Systems and methods for machine learning-based digital content clustering, digital content threat detection, and digital content threat remediation in machine learning task-oriented digital threat mitigation platform
US20190147369A1 (en) Rule Determination for Black-Box Machine-Learning Models
Goldstein et al. A scaling approach to record linkage
Heredia et al. Improving detection of untrustworthy online reviews using ensemble learners combined with feature selection
CN108304568B (zh) Method and system for big-data processing of public expectations about real estate
Chemchem et al. Deep learning and data mining classification through the intelligent agent reasoning
US11748561B1 (en) Apparatus and methods for employment application assessment
Qiu et al. Deep active learning with crowdsourcing data for privacy policy classification
Viktoriia et al. Machine learning methods in medicine diagnostics problem
Desai An Exploration of the Effectiveness of Machine Learning Algorithms for Text Classification
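CN116932832B (zh) Data asset catalog generation method, device, and computer-readable storage medium
CN118093881B (zh) Knowledge-graph-based method and system for audit object profile modeling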
An et al. Interaction Identification and Clique Screening for Classification with Ultra-high Dimensional Discrete Features
US20240169195A1 (en) Machine learning-based systems and methods for identifying and resolving content anomalies in a target digital artifact
Zhou et al. AMDnet: An Academic Misconduct Detection Method for Authors’ Behaviors.
Bauder Machine Learning Algorithms with Big Medicare Fraud Data
Mena et al. Collective annotation patterns in learning from crowds
Liu et al. Deep Tree-based Retrieval for Efficient Recommendation: Theory and Method
Kozniewski Self-Confidence Measures of a Decision Support System Based on Bayesian Networks

Legal Events

Date Code Title Description
A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20100604

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20110523

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20120907

A601 Written request for extension of time

Free format text: JAPANESE INTERMEDIATE CODE: A601

Effective date: 20121107

A602 Written permission of extension of time

Free format text: JAPANESE INTERMEDIATE CODE: A602

Effective date: 20121114

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20130227

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20130812

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20130909

R150 Certificate of patent or registration of utility model

Ref document number: 5364578

Country of ref document: JP

Free format text: JAPANESE INTERMEDIATE CODE: R150

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250