EP4026071A4 - Generating training data for machine-learning models - Google Patents

Generating training data for machine-learning models Download PDF

Info

Publication number
EP4026071A4
EP4026071A4 EP20860844.8A EP20860844A EP4026071A4 EP 4026071 A4 EP4026071 A4 EP 4026071A4 EP 20860844 A EP20860844 A EP 20860844A EP 4026071 A4 EP4026071 A4 EP 4026071A4
Authority
EP
European Patent Office
Prior art keywords
machine
training data
learning models
generating training
generating
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
EP20860844.8A
Other languages
German (de)
French (fr)
Other versions
EP4026071A1 (en
Inventor
Soham Banerjee
Jayatu Sen Chaudhury
Prodip Hore
Rohit Joshi
Snehansu Sekhar SAHU
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
American Express Travel Related Services Co Inc
Original Assignee
American Express Travel Related Services Co Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by American Express Travel Related Services Co Inc filed Critical American Express Travel Related Services Co Inc
Publication of EP4026071A1 publication Critical patent/EP4026071A1/en
Publication of EP4026071A4 publication Critical patent/EP4026071A4/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • G06N20/20Ensemble learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10Complex mathematical operations
    • G06F17/18Complex mathematical operations for evaluating statistical data, e.g. average values, frequency distributions, probability functions, regression analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/047Probabilistic or stochastic networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/0475Generative networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/094Adversarial learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N7/00Computing arrangements based on specific mathematical models
    • G06N7/01Probabilistic graphical models, e.g. probabilistic networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/084Backpropagation, e.g. using gradient descent

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • General Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Mathematical Physics (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Computing Systems (AREA)
  • Computational Linguistics (AREA)
  • Molecular Biology (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Computational Mathematics (AREA)
  • Mathematical Analysis (AREA)
  • Pure & Applied Mathematics (AREA)
  • Mathematical Optimization (AREA)
  • Probability & Statistics with Applications (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Medical Informatics (AREA)
  • Algebra (AREA)
  • Operations Research (AREA)
  • Databases & Information Systems (AREA)
  • Debugging And Monitoring (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Machine Translation (AREA)
EP20860844.8A 2019-09-06 2020-09-04 Generating training data for machine-learning models Pending EP4026071A4 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US16/562,972 US20210073669A1 (en) 2019-09-06 2019-09-06 Generating training data for machine-learning models
PCT/US2020/049337 WO2021046306A1 (en) 2019-09-06 2020-09-04 Generating training data for machine-learning models

Publications (2)

Publication Number Publication Date
EP4026071A1 EP4026071A1 (en) 2022-07-13
EP4026071A4 true EP4026071A4 (en) 2023-08-09

Family

ID=74851051

Family Applications (1)

Application Number Title Priority Date Filing Date
EP20860844.8A Pending EP4026071A4 (en) 2019-09-06 2020-09-04 Generating training data for machine-learning models

Country Status (6)

Country Link
US (1) US20210073669A1 (en)
EP (1) EP4026071A4 (en)
JP (1) JP7391190B2 (en)
KR (1) KR20220064966A (en)
CN (1) CN114556360A (en)
WO (1) WO2021046306A1 (en)

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11158090B2 (en) * 2019-11-22 2021-10-26 Adobe Inc. Enhanced video shot matching using generative adversarial networks
KR20210071130A (en) * 2019-12-05 2021-06-16 삼성전자주식회사 Computing device, operating method of computing device, and storage medium
KR20220019894A (en) * 2020-08-10 2022-02-18 삼성전자주식회사 Simulation method for semiconductor manufacturing process and method for manufacturing semiconductor device
US20230083443A1 (en) * 2021-09-16 2023-03-16 Evgeny Saveliev Detecting anomalies in physical access event streams by computing probability density functions and cumulative probability density functions for current and future events using plurality of small scale machine learning models and historical context of events obtained from stored event stream history via transformations of the history into a time series of event counts or via augmenting the event stream records with delay/lag information
WO2023219371A1 (en) * 2022-05-09 2023-11-16 삼성전자주식회사 Electronic device for augmenting training data, and control method therefor
KR20240052394A (en) 2022-10-14 2024-04-23 고려대학교 산학협력단 Device and method for generating korean commonsense reasoning dataset
US12111797B1 (en) 2023-09-22 2024-10-08 Storytellers.ai LLC Schema inference system
US11961005B1 (en) * 2023-12-18 2024-04-16 Storytellers.ai LLC System for automated data preparation, training, and tuning of machine learning models

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2015176175A (en) 2014-03-13 2015-10-05 日本電気株式会社 Information processing apparatus, information processing method and program
WO2016061283A1 (en) * 2014-10-14 2016-04-21 Skytree, Inc. Configurable machine learning method selection and parameter optimization system and method
US20160132787A1 (en) * 2014-11-11 2016-05-12 Massachusetts Institute Of Technology Distributed, multi-model, self-learning platform for machine learning
US10332028B2 (en) * 2015-08-25 2019-06-25 Qualcomm Incorporated Method for improving performance of a trained machine learning model
GB201517462D0 (en) * 2015-10-02 2015-11-18 Tractable Ltd Semi-automatic labelling of datasets
JP6647632B2 (en) 2017-09-04 2020-02-14 株式会社Soat Generating training data for machine learning
US10592779B2 (en) 2017-12-21 2020-03-17 International Business Machines Corporation Generative adversarial network medical image generation for training of a classifier
US10388002B2 (en) 2017-12-27 2019-08-20 Facebook, Inc. Automatic image correction using machine learning
KR101990326B1 (en) * 2018-11-28 2019-06-18 한국인터넷진흥원 Discount factor auto adjusting type reinforcement learning method

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
KRISTOFER SCHLACHTER ET AL: "Beyond Photo Realism for Domain Adaptation from Synthetic Data", ARXIV.ORG, CORNELL UNIVERSITY LIBRARY, 201 OLIN LIBRARY CORNELL UNIVERSITY ITHACA, NY 14853, 4 September 2019 (2019-09-04), XP081473643 *
See also references of WO2021046306A1 *
SETHIA AKHIL ET AL: "Data Augmentation using Generative models for Credit Card Fraud Detection", 2018 4TH INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION AND AUTOMATION (ICCCA), IEEE, 14 December 2018 (2018-12-14), pages 1 - 6, XP033584478, DOI: 10.1109/CCAA.2018.8777628 *

Also Published As

Publication number Publication date
JP2022546571A (en) 2022-11-04
EP4026071A1 (en) 2022-07-13
WO2021046306A1 (en) 2021-03-11
US20210073669A1 (en) 2021-03-11
KR20220064966A (en) 2022-05-19
CN114556360A (en) 2022-05-27
JP7391190B2 (en) 2023-12-04

Similar Documents

Publication Publication Date Title
EP4026071A4 (en) Generating training data for machine-learning models
EP3833455A4 (en) Interactive exercise machine data architecture
EP3776387A4 (en) Evolved machine learning models
EP3895082A4 (en) Distributed training of machine learning models for personalization
EP3411634A4 (en) Data learning server and method for generating and using learning model thereof
EP3899799A4 (en) Data denoising based on machine learning
EP3785199A4 (en) Decentralized data verification
EP4025311A4 (en) System for generating simulated animal data and models
EP3657428A4 (en) Data learning server, and method for generating and using learning model thereof
EP3703812A4 (en) A subject-tailored continuously developing randomization based method for improving organ function
GB201818237D0 (en) A dialogue system, a dialogue method, a method of generating data for training a dialogue system, a system for generating data for training a dialogue system
EP3497302A4 (en) Machine learning training set generation
EP3980946A4 (en) Executing machine-learning models
EP4184940A4 (en) Sound generating device
EP3711216A4 (en) Codebook feedback for data retransmissions
EP4042339A4 (en) Developing machine-learning models
EP4051549A4 (en) Generating environmental data
EP4006818A4 (en) Simulator
EP3694619A4 (en) Flexible computer gaming based on machine learning
EP3862749A4 (en) Training data generation device and training data generation program
EP3970024A4 (en) Systems and methods for generating datasets for model retraining
EP3936224A4 (en) Data generation device, data generation method, learning device, and learning method
GB201718895D0 (en) A method of generating training data
EP4046084A4 (en) Interactive machine learning
EP3665537A4 (en) Generating geo-fence data

Legal Events

Date Code Title Description
STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE INTERNATIONAL PUBLICATION HAS BEEN MADE

PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: REQUEST FOR EXAMINATION WAS MADE

17P Request for examination filed

Effective date: 20220308

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR

DAV Request for validation of the european patent (deleted)
DAX Request for extension of the european patent (deleted)
P01 Opt-out of the competence of the unified patent court (upc) registered

Effective date: 20230509

REG Reference to a national code

Ref country code: DE

Ref legal event code: R079

Free format text: PREVIOUS MAIN CLASS: G06N0020200000

Ipc: G06N0003047500

A4 Supplementary search report drawn up and despatched

Effective date: 20230712

RIC1 Information provided on ipc code assigned before grant

Ipc: G06N 3/084 20230101ALN20230706BHEP

Ipc: G06N 20/20 20190101ALI20230706BHEP

Ipc: G06N 3/094 20230101ALI20230706BHEP

Ipc: G06N 3/045 20230101ALI20230706BHEP

Ipc: G06N 3/047 20230101ALI20230706BHEP

Ipc: G06N 3/0475 20230101AFI20230706BHEP