CA3097036A1 - Selecting a neural network architecture for a supervised machine learning problem - Google Patents

Selecting a neural network architecture for a supervised machine learning problem Download PDF

Info

Publication number
CA3097036A1
CA3097036A1 CA3097036A CA3097036A CA3097036A1 CA 3097036 A1 CA3097036 A1 CA 3097036A1 CA 3097036 A CA3097036 A CA 3097036A CA 3097036 A CA3097036 A CA 3097036A CA 3097036 A1 CA3097036 A1 CA 3097036A1
Authority
CA
Canada
Prior art keywords
neural network
machine learning
learning problem
measure
candidate neural
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CA3097036A
Other languages
English (en)
French (fr)
Inventor
Saeed AMIZADEH
Ge Yang
Nicolo Fusi
Francesco Paolo CASALE
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Microsoft Technology Licensing LLC
Original Assignee
Microsoft Technology Licensing LLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Technology Licensing LLC filed Critical Microsoft Technology Licensing LLC
Publication of CA3097036A1 publication Critical patent/CA3097036A1/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/09Supervised learning
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/0464Convolutional networks [CNN, ConvNet]
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/044Recurrent networks, e.g. Hopfield networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/047Probabilistic or stochastic networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/0499Feedforward networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/082Learning methods modifying the architecture, e.g. adding, deleting or silencing nodes or connections
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/084Backpropagation, e.g. using gradient descent
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/0985Hyperparameter optimisation; Meta-learning; Learning-to-learn
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/04Inference or reasoning models
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • G06N20/10Machine learning using kernel methods, e.g. support vector machines [SVM]
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/01Dynamic search techniques; Heuristics; Dynamic trees; Branch-and-bound

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Software Systems (AREA)
  • Computing Systems (AREA)
  • Artificial Intelligence (AREA)
  • Mathematical Physics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Biophysics (AREA)
  • Molecular Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Probability & Statistics with Applications (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Complex Calculations (AREA)
  • Image Analysis (AREA)
  • Debugging And Monitoring (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Medical Informatics (AREA)
CA3097036A 2018-05-10 2019-04-27 Selecting a neural network architecture for a supervised machine learning problem Pending CA3097036A1 (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US15/976,514 2018-05-10
US15/976,514 US11995538B2 (en) 2018-05-10 2018-05-10 Selecting a neural network architecture for a supervised machine learning problem
PCT/US2019/029532 WO2019217113A1 (en) 2018-05-10 2019-04-27 Selecting a neural network architecture for a supervised machine learning problem

Publications (1)

Publication Number Publication Date
CA3097036A1 true CA3097036A1 (en) 2019-11-14

Family

ID=66429706

Family Applications (1)

Application Number Title Priority Date Filing Date
CA3097036A Pending CA3097036A1 (en) 2018-05-10 2019-04-27 Selecting a neural network architecture for a supervised machine learning problem

Country Status (6)

Country Link
US (3) US11995538B2 (enExample)
EP (1) EP3791326A1 (enExample)
JP (1) JP7344900B2 (enExample)
CN (2) CN120297334A (enExample)
CA (1) CA3097036A1 (enExample)
WO (1) WO2019217113A1 (enExample)

Families Citing this family (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11196623B2 (en) 2016-12-30 2021-12-07 Intel Corporation Data packaging protocols for communications between IoT devices
CN111819580B (zh) * 2018-05-29 2025-01-14 谷歌有限责任公司 用于密集图像预测任务的神经架构搜索
US11537846B2 (en) * 2018-08-21 2022-12-27 Wisconsin Alumni Research Foundation Neural network architecture with concurrent uncertainty output
KR102200212B1 (ko) * 2018-12-07 2021-01-08 서울대학교 산학협력단 불확실성 예측을 위한 샘플링 모델 생성 장치 및 방법, 불확실성 예측 장치
US10616257B1 (en) * 2019-02-19 2020-04-07 Verizon Patent And Licensing Inc. Method and system for anomaly detection and network deployment based on quantitative assessment
US11240340B2 (en) * 2020-05-12 2022-02-01 International Business Machines Corporation Optimized deployment of analytic models in an edge topology
CN113807376A (zh) * 2020-06-15 2021-12-17 富泰华工业(深圳)有限公司 网络模型优化方法、装置、电子设备及存储介质
CN112134876A (zh) * 2020-09-18 2020-12-25 中移(杭州)信息技术有限公司 流量识别系统及方法、服务器
KR102535007B1 (ko) * 2020-11-13 2023-05-19 숭실대학교 산학협력단 Snn 모델 파라미터를 기반으로 모델 수행을 위한 뉴로모픽 아키텍처 동적 선택 방법, 이를 수행하기 위한 기록 매체 및 장치
US20220164646A1 (en) * 2020-11-24 2022-05-26 EMC IP Holding Company LLC Hydratable neural networks for devices
CN113204916B (zh) * 2021-04-15 2021-11-19 特斯联科技集团有限公司 基于强化学习的智能决策方法及系统
US20220027792A1 (en) * 2021-10-08 2022-01-27 Intel Corporation Deep neural network model design enhanced by real-time proxy evaluation feedback
US12367249B2 (en) * 2021-10-19 2025-07-22 Intel Corporation Framework for optimization of machine learning architectures
US12367248B2 (en) * 2021-10-19 2025-07-22 Intel Corporation Hardware-aware machine learning model search mechanisms
US12417260B2 (en) 2021-10-20 2025-09-16 Intel Corporation Machine learning model scaling system with energy efficient network data transfer for power aware hardware
CN114037058B (zh) * 2021-11-05 2024-05-17 北京百度网讯科技有限公司 预训练模型的生成方法、装置、电子设备以及存储介质
CN114188022A (zh) * 2021-12-13 2022-03-15 浙江大学 一种基于TextCNN模型的临床儿童咳嗽智能预诊断系统

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH05314090A (ja) 1992-05-14 1993-11-26 Hitachi Ltd ニューラルネットを用いたパターン認識方法およびその装置
JP2008533615A (ja) 2005-03-14 2008-08-21 エル ターラー、ステフエン ニューラルネットワーク開発およびデータ解析ツール
EP2909803A1 (en) 2012-10-19 2015-08-26 Apixio, Inc. Systems and methods for medical information analysis with deidentification and reidentification
JP6444494B2 (ja) 2014-05-23 2018-12-26 データロボット, インコーポレイテッド 予測データ分析のためのシステムおよび技術
WO2017058489A1 (en) 2015-09-30 2017-04-06 Apple Inc. Methods for color and texture control of metallic glasses by the combination of blasting and oxidization
US9659248B1 (en) * 2016-01-19 2017-05-23 International Business Machines Corporation Machine learning and training a computer-implemented neural network to retrieve semantically equivalent questions using hybrid in-memory representations
WO2018075995A1 (en) 2016-10-21 2018-04-26 DataRobot, Inc. Systems for predictive data analytics, and related methods and apparatus

Also Published As

Publication number Publication date
US20190347548A1 (en) 2019-11-14
EP3791326A1 (en) 2021-03-17
CN120297334A (zh) 2025-07-11
US20250390745A1 (en) 2025-12-25
US20240273370A1 (en) 2024-08-15
CN112470171B (zh) 2025-04-18
US11995538B2 (en) 2024-05-28
CN112470171A (zh) 2021-03-09
JP2021523430A (ja) 2021-09-02
KR20210008480A (ko) 2021-01-22
JP7344900B2 (ja) 2023-09-14
WO2019217113A1 (en) 2019-11-14
US12423581B2 (en) 2025-09-23

Similar Documents

Publication Publication Date Title
US12423581B2 (en) Selecting a neural network architecture for a supervised machine learning problem
US11392859B2 (en) Large-scale automated hyperparameter tuning
US11361225B2 (en) Neural network architecture for attention based efficient model adaptation
EP3912056B1 (en) Leveraging query executions to improve index recommendations
JP7157154B2 (ja) 性能予測ニューラルネットワークを使用したニューラルアーキテクチャ探索
US10853739B2 (en) Machine learning models for evaluating entities in a high-volume computer network
US20190385073A1 (en) Visual recognition via light weight neural network
WO2022068623A1 (zh) 一种模型训练方法及相关设备
US20190019108A1 (en) Systems and methods for a validation tree
US12530332B1 (en) Search engine optimization by selective indexing
US12361288B2 (en) Method, device and medium for diagnosing and optimizing data analysis system
CN113869496A (zh) 一种神经网络的获取方法、数据处理方法以及相关设备
WO2019173032A1 (en) Automatic lot classification
US12361287B2 (en) Data retrieval using reinforced co-learning for semi-supervised ranking
KR102956712B1 (ko) 지도 머신 학습 문제를 위한 신경망 아키텍처 선택
US12519817B2 (en) Generative artificial intelligence penetration testing automation
WO2026006004A1 (en) Determining phenomic relationships between compounds and cell perturbations utilizing machine learning models

Legal Events

Date Code Title Description
EEER Examination request

Effective date: 20240424

D15 Examination report completed

Free format text: ST27 STATUS EVENT CODE: A-2-2-D10-D15-D126 (AS PROVIDED BY THE NATIONAL OFFICE); EVENT TEXT: EXAMINER'S REPORT

Effective date: 20250425

B12 Application deemed to be withdrawn, abandoned or lapsed

Free format text: ST27 STATUS EVENT CODE: N-6-6-B10-B12-B303 (AS PROVIDED BY THE NATIONAL OFFICE); EVENT TEXT: DEEMED ABANDONED - FAILURE TO RESPOND TO AN EXAMINER'S REQUISITION

Effective date: 20250825

W00 Other event occurred

Free format text: ST27 STATUS EVENT CODE: A-2-2-W10-W00-W100 (AS PROVIDED BY THE NATIONAL OFFICE); EVENT TEXT: LETTER SENT

Effective date: 20251014

U13 Renewal or maintenance fee not paid

Free format text: ST27 STATUS EVENT CODE: N-2-6-U10-U13-U300 (AS PROVIDED BY THE NATIONAL OFFICE); EVENT TEXT: DEEMED ABANDONED - FAILURE TO RESPOND TO MAINTENANCE FEE NOTICE

Effective date: 20251215