JP7052016B2 - Method, system, and computer program for updating training data - Google Patents

Method, system, and computer program for updating training data

Info

Publication number
JP7052016B2
Authority
JP
Japan
Prior art keywords
training data
question
classifier
answer
class
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
JP2020513922A
Other languages
English (en)
Japanese (ja)
Other versions
JP2020533692A (ja)
JP2020533692A5
Inventor
琢省 柳川
宏秋 小峯
かおり 丸山
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
International Business Machines Corp
Original Assignee
International Business Machines Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by International Business Machines Corp
Publication of JP2020533692A
Publication of JP2020533692A5
Application granted
Publication of JP7052016B2
Status: Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/30 Semantic analysis
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00 Machine learning
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/903 Querying
    • G06F16/9032 Query formulation
    • G06F16/90332 Natural language query formulation or dialogue systems
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/205 Parsing
    • G06F40/216 Parsing using statistical methods
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/268 Morphological analysis
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00 Computing arrangements using knowledge-based models
    • G06N5/04 Inference or reasoning models
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00 Computing arrangements using knowledge-based models
    • G06N5/04 Inference or reasoning models
    • G06N5/048 Fuzzy inferencing

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Computational Linguistics (AREA)
  • Software Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Computing Systems (AREA)
  • Evolutionary Computation (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Medical Informatics (AREA)
  • Probability & Statistics with Applications (AREA)
  • Automation & Control Theory (AREA)
  • Fuzzy Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Electrically Operated Instructional Devices (AREA)
JP2020513922A 2017-09-15 2018-09-13 Method, system, and computer program for updating training data Active JP7052016B2 (ja)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US15/705,596 2017-09-15
US15/705,596 US10387572B2 (en) 2017-09-15 2017-09-15 Training data update
US15/845,031 US10372826B2 (en) 2017-09-15 2017-12-18 Training data update
US15/845,031 2017-12-18
PCT/IB2018/057011 WO2019053629A1 (en) 2017-09-15 2018-09-13 UPDATE OF LEARNING DATA

Publications (3)

Publication Number Publication Date
JP2020533692A JP2020533692A (ja) 2020-11-19
JP2020533692A5 JP2020533692A5 2021-01-07
JP7052016B2 JP7052016B2 (ja) 2022-04-11

Family

ID=65720294

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2020513922A Active JP7052016B2 (ja) 2017-09-15 2018-09-13 Method, system, and computer program for updating training data

Country Status (6)

Country Link
US (4) US10387572B2
JP (1) JP7052016B2
CN (1) CN111095234A
DE (1) DE112018005167T5
GB (1) GB2580805A
WO (1) WO2019053629A1

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20200134093A (ko) * 2019-05-21 2020-12-01 엘지전자 주식회사 Indoor unit of air conditioner

Families Citing this family (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10387572B2 (en) 2017-09-15 2019-08-20 International Business Machines Corporation Training data update
US10467640B2 (en) * 2017-11-29 2019-11-05 Qualtrics, Llc Collecting and analyzing electronic survey responses including user-composed text
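CN109919317B (zh) * 2018-01-11 2024-06-04 华为技术有限公司 Machine learning model training method and apparatus
CN110949458B (zh) * 2019-11-27 2021-11-12 交控科技股份有限公司 Rail transit operation and maintenance management system based on microservice architecture
CN111858875B (zh) * 2020-05-09 2024-06-07 北京嘀嘀无限科技发展有限公司 Intelligent interaction method, apparatus, device, and storage medium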
US12165082B1 (en) * 2020-06-29 2024-12-10 Amazon Technologies, Inc. Hyperparameter optimization with operational constraints
US11756663B2 (en) * 2020-07-27 2023-09-12 Kpn Innovations, Llc. Method of and system for determining a prioritized instruction set for a user
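CN111967581B (zh) * 2020-08-06 2023-10-31 平安科技(深圳)有限公司 Method, apparatus, computer device, and storage medium for interpreting a clustering model
CN111949769B (zh) * 2020-08-23 2024-03-12 云知声智能科技股份有限公司 Method and apparatus for enhancing the robustness of a reading comprehension system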
US11080484B1 (en) * 2020-10-08 2021-08-03 Omniscient Neurotechnology Pty Limited Natural language processing of electronic records
CN112541109B (zh) * 2020-12-22 2023-10-24 北京百度网讯科技有限公司 Answer summary extraction method and apparatus, electronic device, readable medium, and product
US20220215034A1 (en) * 2021-01-05 2022-07-07 Samsung Electronics Co., Ltd. Electronic apparatus and controlling method thereof
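CN112733932A (zh) * 2021-01-08 2021-04-30 北京匠数科技有限公司 Method and apparatus for accelerated model training based on training data similarity aggregation
CN112948560A (zh) * 2021-03-23 2021-06-11 平安科技(深圳)有限公司 Buddhist studies question-and-answer data generation method, apparatus, computer device, and storage medium
CN114238598A (zh) * 2021-12-07 2022-03-25 北京妙医佳健康科技集团有限公司 Question answering system and method for its annotation, review, and model training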

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080262985A1 (en) 2006-11-15 2008-10-23 Cretu Gabriela Systems, methods, and media for generating sanitized data, sanitizing anomaly detection models, and/or generating sanitized anomaly detection models
WO2015190203A1 (ja) 2014-06-10 2015-12-17 株式会社東芝 Detection device, correction system, detection method, and program

Family Cites Families (31)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7925492B2 (en) * 2004-01-06 2011-04-12 Neuric Technologies, L.L.C. Method for determining relationships through use of an ordered list between processing nodes in an emulated human brain
US6289513B1 (en) * 1999-06-01 2001-09-11 Isaac Bentwich Interactive application generation and text processing
US7269545B2 (en) * 2001-03-30 2007-09-11 Nec Laboratories America, Inc. Method for retrieving answers from an information retrieval system
US7657935B2 (en) * 2001-08-16 2010-02-02 The Trustees Of Columbia University In The City Of New York System and methods for detecting malicious email transmission
US7489812B2 (en) * 2002-06-07 2009-02-10 Dynamic Digital Depth Research Pty Ltd. Conversion and encoding techniques
US7734554B2 (en) * 2005-10-27 2010-06-08 Hewlett-Packard Development Company, L.P. Deploying a document classification system
US8010410B2 (en) 2006-12-29 2011-08-30 Ebay Inc. Method and system for listing categorization
US9342588B2 (en) 2007-06-18 2016-05-17 International Business Machines Corporation Reclassification of training data to improve classifier accuracy
EP2230555B1 (en) 2007-12-27 2017-02-22 Canon Kabushiki Kaisha Toner and two-component developer
JP5206044B2 (ja) * 2008-03-17 2013-06-12 株式会社リコー Method and apparatus for producing energy-saving small-particle-size toner
US10482114B2 (en) * 2008-03-27 2019-11-19 Oath Inc. System and method for maintenance of questions and answers through collaborative and community editing
WO2009152154A1 (en) * 2008-06-09 2009-12-17 J.D. Power And Associates Automatic sentiment analysis of surveys
CN102903008B (zh) * 2011-07-29 2016-05-18 国际商业机器公司 Method and system for computer question answering
US9213686B2 (en) * 2011-10-04 2015-12-15 Wfh Properties Llc System and method for managing a form completion process
JP5825676B2 (ja) * 2012-02-23 2015-12-02 国立研究開発法人情報通信研究機構 Non-factoid question answering system and computer program
US20140006012A1 (en) * 2012-07-02 2014-01-02 Microsoft Corporation Learning-Based Processing of Natural Language Questions
US20140067816A1 (en) * 2012-08-29 2014-03-06 Microsoft Corporation Surfacing entity attributes with search results
US9471559B2 (en) 2012-12-10 2016-10-18 International Business Machines Corporation Deep analysis of natural language questions for question answering system
US9390378B2 (en) 2013-03-28 2016-07-12 Wal-Mart Stores, Inc. System and method for high accuracy product classification with limited supervision
JP6328463B2 (ja) 2013-11-01 2018-05-23 ローランド株式会社 Keyboard device
US9286910B1 (en) 2014-03-13 2016-03-15 Amazon Technologies, Inc. System for resolving ambiguous queries based on user context
US20150278264A1 (en) * 2014-03-31 2015-10-01 International Business Machines Corporation Dynamic update of corpus indices for question answering system
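CN104166643A (zh) 2014-08-19 2014-11-26 南京金娃娃软件科技有限公司 Dialogue act analysis method in an intelligent question answering system
CN105447031A (zh) * 2014-08-28 2016-03-30 百度在线网络技术(北京)有限公司 Method and apparatus for labeling training samples
CN104182767B (zh) * 2014-09-05 2018-03-13 西安电子科技大学 Hyperspectral image classification method combining active learning and neighborhood information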
US9720963B2 (en) * 2014-11-05 2017-08-01 International Business Machines Corporation Answer category data classifying using dynamic thresholds
US9792549B2 (en) 2014-11-21 2017-10-17 International Business Machines Corporation Extraction of semantic relations using distributional relation detection
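KR102415503B1 (ko) * 2015-08-21 2022-07-01 삼성전자주식회사 Classifier training method and object detection method
CN106778796B (zh) * 2016-10-20 2020-04-21 江苏大学 Human action recognition method and system based on hybrid co-training
CN106844530A (zh) * 2016-12-29 2017-06-13 北京奇虎科技有限公司 Method and apparatus for training a question-answer pair classification model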
US10387572B2 (en) 2017-09-15 2019-08-20 International Business Machines Corporation Training data update

Also Published As

Publication number Publication date
JP2020533692A (ja) 2020-11-19
US10372826B2 (en) 2019-08-06
US10614269B2 (en) 2020-04-07
GB202004051D0 (en) 2020-05-06
CN111095234A (zh) 2020-05-01
US20190317998A1 (en) 2019-10-17
GB2580805A (en) 2020-07-29
US10621284B2 (en) 2020-04-14
US20190087408A1 (en) 2019-03-21
WO2019053629A1 (en) 2019-03-21
US20190317997A1 (en) 2019-10-17
US20190087411A1 (en) 2019-03-21
DE112018005167T5 (de) 2020-06-25
US10387572B2 (en) 2019-08-20

Similar Documents

Publication Publication Date Title
JP7052016B2 (ja) Method, system, and computer program for updating training data
US11593642B2 (en) Combined data pre-process and architecture search for deep learning models
US10621074B2 (en) Intelligent device selection for mobile application testing
JP2022537912A (ja) Low-resource entity resolution using transfer learning
JP2020532012A (ja) Text data representation learning using random document embeddings
US11048564B2 (en) API evolution and adaptation based on cognitive selection and unsupervised feature learning
US11501111B2 (en) Learning models for entity resolution using active learning
US11302096B2 (en) Determining model-related bias associated with training data
US11501115B2 (en) Explaining cross domain model predictions
US11934891B2 (en) APIA configuration using auto-rationalization and modeling
US20230021563A1 (en) Federated data standardization using data privacy techniques
JP2022552140A (ja) Rare topic detection using hierarchical clustering
US20240112229A1 (en) Facilitating responding to multiple product or service reviews associated with multiple sources
US11520757B2 (en) Explanative analysis for records with missing values
US11681501B2 (en) Artificial intelligence enabled open source project enabler and recommendation platform
US12197846B2 (en) Mathematical function defined natural language annotation
US11556558B2 (en) Insight expansion in smart data retention systems
US10776411B2 (en) Systematic browsing of automated conversation exchange program knowledge bases
US11868167B2 (en) Automatically provisioned tag schema for hybrid multicloud cost and chargeback analysis
US20220076079A1 (en) Distributed machine learning scoring

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20201117

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20210222

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20211227

A977 Report on retrieval

Free format text: JAPANESE INTERMEDIATE CODE: A971007

Effective date: 20211228

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20220304

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20220322

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20220330

R150 Certificate of patent or registration of utility model

Ref document number: 7052016

Country of ref document: JP

Free format text: JAPANESE INTERMEDIATE CODE: R150