CN120297334A - 选择用于有监督机器学习问题的神经网络架构 - Google Patents

选择用于有监督机器学习问题的神经网络架构 Download PDF

Info

Publication number
CN120297334A
CN120297334A CN202510368001.XA CN202510368001A CN120297334A CN 120297334 A CN120297334 A CN 120297334A CN 202510368001 A CN202510368001 A CN 202510368001A CN 120297334 A CN120297334 A CN 120297334A
Authority
CN
China
Prior art keywords
architecture
machine learning
neural network
learning problem
machine
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202510368001.XA
Other languages
English (en)
Chinese (zh)
Inventor
S·阿米扎德
杨格
N·富西
F·P·卡萨莱
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Microsoft Technology Licensing LLC
Original Assignee
Microsoft Technology Licensing LLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Technology Licensing LLC filed Critical Microsoft Technology Licensing LLC
Publication of CN120297334A publication Critical patent/CN120297334A/zh
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/0464Convolutional networks [CNN, ConvNet]
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/09Supervised learning
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/044Recurrent networks, e.g. Hopfield networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/047Probabilistic or stochastic networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/0499Feedforward networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/082Learning methods modifying the architecture, e.g. adding, deleting or silencing nodes or connections
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/084Backpropagation, e.g. using gradient descent
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/0985Hyperparameter optimisation; Meta-learning; Learning-to-learn
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/01Dynamic search techniques; Heuristics; Dynamic trees; Branch-and-bound
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/04Inference or reasoning models
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • G06N20/10Machine learning using kernel methods, e.g. support vector machines [SVM]

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Software Systems (AREA)
  • Computing Systems (AREA)
  • Artificial Intelligence (AREA)
  • Mathematical Physics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Biophysics (AREA)
  • Molecular Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Probability & Statistics with Applications (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Complex Calculations (AREA)
  • Image Analysis (AREA)
  • Debugging And Monitoring (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Medical Informatics (AREA)
CN202510368001.XA 2018-05-10 2019-04-27 选择用于有监督机器学习问题的神经网络架构 Pending CN120297334A (zh)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US15/976,514 2018-05-10
US15/976,514 US11995538B2 (en) 2018-05-10 2018-05-10 Selecting a neural network architecture for a supervised machine learning problem
CN201980031270.XA CN112470171B (zh) 2018-05-10 2019-04-27 选择用于有监督机器学习问题的神经网络架构
PCT/US2019/029532 WO2019217113A1 (en) 2018-05-10 2019-04-27 Selecting a neural network architecture for a supervised machine learning problem

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
CN201980031270.XA Division CN112470171B (zh) 2018-05-10 2019-04-27 选择用于有监督机器学习问题的神经网络架构

Publications (1)

Publication Number Publication Date
CN120297334A true CN120297334A (zh) 2025-07-11

Family

ID=66429706

Family Applications (2)

Application Number Title Priority Date Filing Date
CN202510368001.XA Pending CN120297334A (zh) 2018-05-10 2019-04-27 选择用于有监督机器学习问题的神经网络架构
CN201980031270.XA Active CN112470171B (zh) 2018-05-10 2019-04-27 选择用于有监督机器学习问题的神经网络架构

Family Applications After (1)

Application Number Title Priority Date Filing Date
CN201980031270.XA Active CN112470171B (zh) 2018-05-10 2019-04-27 选择用于有监督机器学习问题的神经网络架构

Country Status (6)

Country Link
US (3) US11995538B2 (https=)
EP (1) EP3791326A1 (https=)
JP (1) JP7344900B2 (https=)
CN (2) CN120297334A (https=)
CA (1) CA3097036A1 (https=)
WO (1) WO2019217113A1 (https=)

Families Citing this family (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11196623B2 (en) 2016-12-30 2021-12-07 Intel Corporation Data packaging protocols for communications between IoT devices
CN111819580B (zh) * 2018-05-29 2025-01-14 谷歌有限责任公司 用于密集图像预测任务的神经架构搜索
US11537846B2 (en) * 2018-08-21 2022-12-27 Wisconsin Alumni Research Foundation Neural network architecture with concurrent uncertainty output
KR102200212B1 (ko) * 2018-12-07 2021-01-08 서울대학교 산학협력단 불확실성 예측을 위한 샘플링 모델 생성 장치 및 방법, 불확실성 예측 장치
US10616257B1 (en) * 2019-02-19 2020-04-07 Verizon Patent And Licensing Inc. Method and system for anomaly detection and network deployment based on quantitative assessment
US11240340B2 (en) * 2020-05-12 2022-02-01 International Business Machines Corporation Optimized deployment of analytic models in an edge topology
CN113807376A (zh) * 2020-06-15 2021-12-17 富泰华工业(深圳)有限公司 网络模型优化方法、装置、电子设备及存储介质
CN112134876A (zh) * 2020-09-18 2020-12-25 中移(杭州)信息技术有限公司 流量识别系统及方法、服务器
KR102535007B1 (ko) * 2020-11-13 2023-05-19 숭실대학교 산학협력단 Snn 모델 파라미터를 기반으로 모델 수행을 위한 뉴로모픽 아키텍처 동적 선택 방법, 이를 수행하기 위한 기록 매체 및 장치
US20220164646A1 (en) * 2020-11-24 2022-05-26 EMC IP Holding Company LLC Hydratable neural networks for devices
CN113204916B (zh) * 2021-04-15 2021-11-19 特斯联科技集团有限公司 基于强化学习的智能决策方法及系统
US20220027792A1 (en) * 2021-10-08 2022-01-27 Intel Corporation Deep neural network model design enhanced by real-time proxy evaluation feedback
US12367249B2 (en) * 2021-10-19 2025-07-22 Intel Corporation Framework for optimization of machine learning architectures
US12367248B2 (en) * 2021-10-19 2025-07-22 Intel Corporation Hardware-aware machine learning model search mechanisms
US12417260B2 (en) 2021-10-20 2025-09-16 Intel Corporation Machine learning model scaling system with energy efficient network data transfer for power aware hardware
CN114037058B (zh) * 2021-11-05 2024-05-17 北京百度网讯科技有限公司 预训练模型的生成方法、装置、电子设备以及存储介质
CN114188022A (zh) * 2021-12-13 2022-03-15 浙江大学 一种基于TextCNN模型的临床儿童咳嗽智能预诊断系统

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH05314090A (ja) 1992-05-14 1993-11-26 Hitachi Ltd ニューラルネットを用いたパターン認識方法およびその装置
JP2008533615A (ja) 2005-03-14 2008-08-21 エル ターラー、ステフエン ニューラルネットワーク開発およびデータ解析ツール
EP2909803A1 (en) 2012-10-19 2015-08-26 Apixio, Inc. Systems and methods for medical information analysis with deidentification and reidentification
JP6444494B2 (ja) 2014-05-23 2018-12-26 データロボット, インコーポレイテッド 予測データ分析のためのシステムおよび技術
WO2017058489A1 (en) 2015-09-30 2017-04-06 Apple Inc. Methods for color and texture control of metallic glasses by the combination of blasting and oxidization
US9659248B1 (en) * 2016-01-19 2017-05-23 International Business Machines Corporation Machine learning and training a computer-implemented neural network to retrieve semantically equivalent questions using hybrid in-memory representations
WO2018075995A1 (en) 2016-10-21 2018-04-26 DataRobot, Inc. Systems for predictive data analytics, and related methods and apparatus

Also Published As

Publication number Publication date
US20190347548A1 (en) 2019-11-14
EP3791326A1 (en) 2021-03-17
US20250390745A1 (en) 2025-12-25
US20240273370A1 (en) 2024-08-15
CN112470171B (zh) 2025-04-18
US11995538B2 (en) 2024-05-28
CN112470171A (zh) 2021-03-09
CA3097036A1 (en) 2019-11-14
JP2021523430A (ja) 2021-09-02
KR20210008480A (ko) 2021-01-22
JP7344900B2 (ja) 2023-09-14
WO2019217113A1 (en) 2019-11-14
US12423581B2 (en) 2025-09-23

Similar Documents

Publication Publication Date Title
CN112470171B (zh) 选择用于有监督机器学习问题的神经网络架构
US11361225B2 (en) Neural network architecture for attention based efficient model adaptation
US11392859B2 (en) Large-scale automated hyperparameter tuning
US10853739B2 (en) Machine learning models for evaluating entities in a high-volume computer network
WO2020171920A1 (en) Leveraging query executions to improve index recommendations
US20190019108A1 (en) Systems and methods for a validation tree
US11861295B2 (en) Encoding a job posting as an embedding using a graph neural network
JP2017538195A (ja) 階層深層畳み込みニューラルネットワーク
US11295237B2 (en) Smart copy optimization in customer acquisition and customer management platforms
US20190385073A1 (en) Visual recognition via light weight neural network
US12530332B1 (en) Search engine optimization by selective indexing
CN113869496B (zh) 一种神经网络的获取方法、数据处理方法以及相关设备
US20190279097A1 (en) Systems and methods for decision tree ensembles for selecting actions
WO2025093915A1 (en) Reducing carbon footprint of machine learning models
KR102956712B1 (ko) 지도 머신 학습 문제를 위한 신경망 아키텍처 선택
US20260012762A1 (en) Trigger-Based Data Ingestion for Machine Learning Using Edge Device
US20250363417A1 (en) Computer architecture for predicting energy consumption of machine learning inference
US20250247416A1 (en) Generative artificial intelligence penetration testing automation
WO2025245298A1 (en) Computer architecture for predicting energy consumption of machine learning inference
KR20240121444A (ko) 잡음제거 오토인코더를 이용한 rssi 핑거프린팅 기반의 사용자 위치 측정 방법 및 이를 지원하는 장치

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination