SG11202011918WA - Detecting suitability of machine learning models for datasets - Google Patents

Detecting suitability of machine learning models for datasets

Info

Publication number
SG11202011918WA
SG11202011918WA SG11202011918WA SG11202011918WA SG11202011918WA SG 11202011918W A SG11202011918W A SG 11202011918WA SG 11202011918W A SG11202011918W A SG 11202011918WA SG 11202011918W A SG11202011918W A SG 11202011918WA SG 11202011918W A SG11202011918W A SG 11202011918WA
Authority
SG
Singapore
Prior art keywords
datasets
machine learning
learning models
detecting suitability
suitability
Prior art date
Application number
SG11202011918WA
Inventor
Sindhu Ghanta
Drew Roselli
Nisha Talagala
Vinay Sridhar
Swaminathan Sundararaman
Lior Amar
Lior Khermosh
Bharath Ramsundar
Sriram Subramanian
Original Assignee
Datarobot Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Datarobot Inc filed Critical Datarobot Inc
Publication of SG11202011918WA publication Critical patent/SG11202011918WA/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/217Validation; Performance evaluation; Active pattern learning techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10Complex mathematical operations
    • G06F17/18Complex mathematical operations for evaluating statistical data, e.g. average values, frequency distributions, probability functions, regression analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • G06F18/2155Generating training patterns; Bootstrap methods, e.g. bagging or boosting characterised by the incorporation of unlabelled data, e.g. multiple instance learning [MIL], semi-supervised techniques using expectation-maximisation [EM] or naïve labelling
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/217Validation; Performance evaluation; Active pattern learning techniques
    • G06F18/2193Validation; Performance evaluation; Active pattern learning techniques based on specific statistical tests
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/04Inference or reasoning models

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Computational Mathematics (AREA)
  • Mathematical Optimization (AREA)
  • Pure & Applied Mathematics (AREA)
  • Mathematical Analysis (AREA)
  • Computing Systems (AREA)
  • Probability & Statistics with Applications (AREA)
  • Operations Research (AREA)
  • Algebra (AREA)
  • Databases & Information Systems (AREA)
  • Computational Linguistics (AREA)
  • Medical Informatics (AREA)
  • Image Analysis (AREA)
  • Debugging And Monitoring (AREA)
SG11202011918WA 2018-06-06 2019-06-06 Detecting suitability of machine learning models for datasets SG11202011918WA (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US16/001,904 US20190377984A1 (en) 2018-06-06 2018-06-06 Detecting suitability of machine learning models for datasets
PCT/US2019/035853 WO2019236894A1 (en) 2018-06-06 2019-06-06 Detecting suitability of machine learning models for datasets

Publications (1)

Publication Number Publication Date
SG11202011918WA true SG11202011918WA (en) 2020-12-30

Family

ID=67060489

Family Applications (1)

Application Number Title Priority Date Filing Date
SG11202011918WA SG11202011918WA (en) 2018-06-06 2019-06-06 Detecting suitability of machine learning models for datasets

Country Status (7)

Country Link
US (2) US20190377984A1 (en)
EP (1) EP3803697A1 (en)
JP (1) JP2021527288A (en)
KR (1) KR20210035164A (en)
AU (1) AU2019280855A1 (en)
SG (1) SG11202011918WA (en)
WO (1) WO2019236894A1 (en)

Families Citing this family (35)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10558934B1 (en) * 2018-08-23 2020-02-11 SigOpt, Inc. Systems and methods for implementing an intelligent machine learning optimization platform for multiple tuning criteria
WO2020070916A1 (en) * 2018-10-02 2020-04-09 日本電信電話株式会社 Calculation device, calculation method, and calculation program
US10997277B1 (en) * 2019-03-26 2021-05-04 Amazon Technologies, Inc. Multinomial distribution on an integrated circuit
US11475002B1 (en) * 2019-06-19 2022-10-18 Amazon Technologies, Inc. Machine learning system for dynamic generation of computer-implemented policies
US11354579B2 (en) * 2019-07-15 2022-06-07 Microsoft Technology Licensing, Llc Dynamic multi-layer execution for artificial intelligence modeling
US11436019B2 (en) 2019-07-15 2022-09-06 Microsoft Technology Licensing, Llc Data parallelism in distributed training of artificial intelligence models
US11520592B2 (en) * 2019-07-15 2022-12-06 Microsoft Technology Licensing, Llc Executing large artificial intelligence models on memory-constrained devices
US11210471B2 (en) * 2019-07-30 2021-12-28 Accenture Global Solutions Limited Machine learning based quantification of performance impact of data veracity
US20210034924A1 (en) * 2019-07-31 2021-02-04 SigOpt, Inc. Systems and methods for implementing an intelligent tuning of multiple hyperparameter criteria of a model constrained with metric thresholds
US11651275B2 (en) * 2019-08-19 2023-05-16 International Business Machines Corporation Tree-based associative data augmentation
US11556862B2 (en) 2019-09-14 2023-01-17 Oracle International Corporation Techniques for adaptive and context-aware automated service composition for machine learning (ML)
US11663523B2 (en) 2019-09-14 2023-05-30 Oracle International Corporation Machine learning (ML) infrastructure techniques
US11562267B2 (en) 2019-09-14 2023-01-24 Oracle International Corporation Chatbot for defining a machine learning (ML) solution
US11605021B1 (en) * 2019-09-30 2023-03-14 Amazon Technologies, Inc. Iterative model training and deployment for automated learning systems
US11468365B2 (en) 2019-09-30 2022-10-11 Amazon Technologies, Inc. GPU code injection to summarize machine learning training data
US11449798B2 (en) * 2019-09-30 2022-09-20 Amazon Technologies, Inc. Automated problem detection for machine learning models
US20210133629A1 (en) * 2019-10-25 2021-05-06 Mote Marine Laboratory Coastal Aquatic Conditions Reporting System Using A Learning Engine
US20210166080A1 (en) * 2019-12-02 2021-06-03 Accenture Global Solutions Limited Utilizing object oriented programming to validate machine learning classifiers and word embeddings
US11586916B2 (en) * 2020-01-30 2023-02-21 EMC IP Holding Company LLC Automated ML microservice and function generation for cloud native platforms
US20230088561A1 (en) * 2020-03-02 2023-03-23 Telefonaktiebolaget Lm Ericsson (Publ) Synthetic data generation in federated learning systems
US11156969B1 (en) 2020-04-24 2021-10-26 MakinaRocks Co., Ltd. Environment factor control device and training method thereof
KR102472920B1 (en) * 2020-04-24 2022-12-01 주식회사 마키나락스 Environment factor control device and training method thereof
US20210342736A1 (en) * 2020-04-30 2021-11-04 UiPath, Inc. Machine learning model retraining pipeline for robotic process automation
US11620582B2 (en) * 2020-07-29 2023-04-04 International Business Machines Corporation Automated machine learning pipeline generation
US11688111B2 (en) 2020-07-29 2023-06-27 International Business Machines Corporation Visualization of a model selection process in an automated model selection system
US20220044117A1 (en) * 2020-08-06 2022-02-10 Nec Laboratories America, Inc. Federated learning for anomaly detection
US11941541B2 (en) * 2020-08-10 2024-03-26 International Business Machines Corporation Automated machine learning using nearest neighbor recommender systems
US11763084B2 (en) 2020-08-10 2023-09-19 International Business Machines Corporation Automatic formulation of data science problem statements
CN112508199A (en) * 2020-11-30 2021-03-16 同盾控股有限公司 Feature selection method, device and related equipment for cross-feature federated learning
KR20220095167A (en) * 2020-12-29 2022-07-06 (주)심플랫폼 Artificial Intelligence Verification System and Method
KR102513839B1 (en) * 2021-01-05 2023-03-27 한국조선해양 주식회사 Fault diagnosis system for construction equipment and fault diagnosis method
US20220237467A1 (en) * 2021-01-22 2022-07-28 GE Precision Healthcare LLC Model suitability coefficients based on generative adversarial networks and activation maps
EP4084010A1 (en) * 2021-04-30 2022-11-02 Siemens Healthcare GmbH Method for operating an evaluation system for medical image data sets, evaluation system, computer program and electronically readable storage medium
CN113283495B (en) * 2021-05-21 2024-02-13 长安大学 Aggregate particle grading method and device
WO2023083468A1 (en) * 2021-11-15 2023-05-19 Huawei Technologies Co., Ltd. Method and apparatus for eligibility evaluation of a machine learning system

Family Cites Families (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7509259B2 (en) * 2004-12-21 2009-03-24 Motorola, Inc. Method of refining statistical pattern recognition models and statistical pattern recognizers
US9445769B2 (en) * 2013-12-06 2016-09-20 President And Fellows Of Harvard College Method and apparatus for detecting disease regression through network-based gait analysis
EP3134823A4 (en) * 2014-06-03 2017-10-25 Excalibur IP, LLC Determining traffic quality using event-based traffic scoring
EP3079116A1 (en) * 2015-04-10 2016-10-12 Tata Consultancy Services Limited System and method for generating recommendations
US10353905B2 (en) * 2015-04-24 2019-07-16 Salesforce.Com, Inc. Identifying entities in semi-structured content
US10235629B2 (en) * 2015-06-05 2019-03-19 Southwest Research Institute Sensor data confidence estimation based on statistical analysis
US10594710B2 (en) * 2015-11-20 2020-03-17 Webroot Inc. Statistical analysis of network behavior using event vectors to identify behavioral anomalies using a composite score
US10600000B2 (en) * 2015-12-04 2020-03-24 Google Llc Regularization of machine learning models
WO2018005489A1 (en) * 2016-06-27 2018-01-04 Purepredictive, Inc. Data quality detection and compensation for machine learning
US11164119B2 (en) * 2016-12-28 2021-11-02 Motorola Solutions, Inc. Systems and methods for assigning roles to user profiles for an incident
US9882999B1 (en) * 2017-06-28 2018-01-30 Facebook, Inc. Analyzing tracking requests generated by client devices interacting with a website
JP6973197B2 (en) * 2018-03-09 2021-11-24 オムロン株式会社 Dataset validation device, dataset validation method, and dataset validation program
JP7033262B6 (en) * 2018-03-19 2022-04-18 日本電気株式会社 Information processing equipment, information processing methods and programs

Also Published As

Publication number Publication date
KR20210035164A (en) 2021-03-31
US20190377984A1 (en) 2019-12-12
WO2019236894A1 (en) 2019-12-12
US20230161843A1 (en) 2023-05-25
AU2019280855A1 (en) 2021-01-07
EP3803697A1 (en) 2021-04-14
JP2021527288A (en) 2021-10-11

Similar Documents

Publication Publication Date Title
SG11202011918WA (en) Detecting suitability of machine learning models for datasets
SG11202100975PA (en) Determining suitability of machine learning models for datasets
SG11202009597XA (en) Evolved machine learning models
EP3602420A4 (en) Embedded predictive machine learning models
FI4099168T3 (en) Compute optimizations for low precision machine learning operations
EP3510363C0 (en) Method for distributed acoustic sensing
IL283463A (en) Automated generation of machine learning models
EP3721408A4 (en) Inspection of reticles using machine learning
IL250221A0 (en) Biomarkers for predicting response of dlbcl to treatment with a btk inhibitor
EP3137302A4 (en) Determining a time instant for an impedance measurement
GB201407840D0 (en) A method of testing an optical sensor
HUE042517T2 (en) Method for measuring a spectral sample response
GB202216797D0 (en) Stem cell-based lung-on-chip models
GB201604480D0 (en) A method of redatuming geophysical data
GB2567076B (en) A method of detecting humidity
GB2539147B (en) Common beam path for determining particle-information by a direct image evaluation and by difference image analysis
IL291166A (en) Method for detecting biomarkers
DE112014006850A5 (en) Probe for a coordinate measuring machine
GB2571147B (en) Apparatus for sensing
BR112017000179A2 (en) analysis of displacement parts for selection of candidate lines for seismic surveys.
SG11201607941PA (en) Method for replacing a process measurement instrument
EP3618759C0 (en) Method for determining data for the production of a toothprosthesis
NO20151370A1 (en) Sensor node for point measurement on the seabed during seismic surveys
GB201821131D0 (en) Liveliness detection using features for machine learning
EP3635377C0 (en) Method for determining physical properties of a sample