JP7697824B2 - メタ学習データ拡張フレームワーク - Google Patents

メタ学習データ拡張フレームワーク Download PDF

Info

Publication number
JP7697824B2
JP7697824B2 JP2021096983A JP2021096983A JP7697824B2 JP 7697824 B2 JP7697824 B2 JP 7697824B2 JP 2021096983 A JP2021096983 A JP 2021096983A JP 2021096983 A JP2021096983 A JP 2021096983A JP 7697824 B2 JP7697824 B2 JP 7697824B2
Authority
JP
Japan
Prior art keywords
data
sequence
token
machine learning
operator
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
JP2021096983A
Other languages
English (en)
Japanese (ja)
Other versions
JP2022171502A5 (https=
JP2022171502A (ja
Inventor
リ,ユーリャン
ワン,シャオラン
ミャオ,ジェンジー
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Recruit Co Ltd
Original Assignee
Recruit Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Recruit Co Ltd filed Critical Recruit Co Ltd
Publication of JP2022171502A publication Critical patent/JP2022171502A/ja
Publication of JP2022171502A5 publication Critical patent/JP2022171502A5/ja
Application granted granted Critical
Publication of JP7697824B2 publication Critical patent/JP7697824B2/ja
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases
    • G06F16/217Database tuning
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/284Lexical analysis, e.g. tokenisation or collocates
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/40Processing or translation of natural language
    • G06F40/55Rule-based translation
    • G06F40/56Natural language generation
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/084Backpropagation, e.g. using gradient descent
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/0895Weakly supervised learning, e.g. semi-supervised or self-supervised learning
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/0985Hyperparameter optimisation; Meta-learning; Learning-to-learn
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/02Knowledge representation; Symbolic representation
    • G06N5/027Frames

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Computational Linguistics (AREA)
  • Software Systems (AREA)
  • Data Mining & Analysis (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Evolutionary Computation (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Molecular Biology (AREA)
  • Databases & Information Systems (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Medical Informatics (AREA)
  • Machine Translation (AREA)
JP2021096983A 2021-04-30 2021-06-10 メタ学習データ拡張フレームワーク Active JP7697824B2 (ja)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US17/246,354 2021-04-30
US17/246,354 US20220351071A1 (en) 2021-04-30 2021-04-30 Meta-learning data augmentation framework

Publications (3)

Publication Number Publication Date
JP2022171502A JP2022171502A (ja) 2022-11-11
JP2022171502A5 JP2022171502A5 (https=) 2024-04-04
JP7697824B2 true JP7697824B2 (ja) 2025-06-24

Family

ID=83807684

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2021096983A Active JP7697824B2 (ja) 2021-04-30 2021-06-10 メタ学習データ拡張フレームワーク

Country Status (3)

Country Link
US (1) US20220351071A1 (https=)
JP (1) JP7697824B2 (https=)
WO (1) WO2022230226A1 (https=)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2022098219A (ja) * 2020-12-21 2022-07-01 富士通株式会社 学習プログラム、学習方法、および学習装置
US12423614B2 (en) 2021-05-31 2025-09-23 International Business Machines Corporation Faithful and efficient sample-based model explanations
US20220383096A1 (en) * 2021-05-31 2022-12-01 International Business Machines Corporation Explaining Neural Models by Interpretable Sample-Based Explanations
JP2024098791A (ja) * 2023-01-11 2024-07-24 株式会社東芝 情報処理装置、情報処理方法及び情報処理プログラム
CN116166789B (zh) * 2023-03-23 2025-07-25 中国科学院软件研究所 一种方法命名精准推荐和审查方法

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2019222734A1 (en) 2018-05-18 2019-11-21 Google Llc Learning data augmentation policies
JP2020187734A (ja) 2019-05-10 2020-11-19 富士通株式会社 遺伝モデルに基づきディープニューラルネットワーク(dnn)を訓練することにおけるデータ拡張

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10769491B2 (en) * 2017-09-01 2020-09-08 Sri International Machine learning system for generating classification data and part localization data for objects depicted in images
US11875120B2 (en) * 2021-02-22 2024-01-16 Robert Bosch Gmbh Augmenting textual data for sentence classification using weakly-supervised multi-reward reinforcement learning

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2019222734A1 (en) 2018-05-18 2019-11-21 Google Llc Learning data augmentation policies
JP2020187734A (ja) 2019-05-10 2020-11-19 富士通株式会社 遺伝モデルに基づきディープニューラルネットワーク(dnn)を訓練することにおけるデータ拡張

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
Chaitra Hegde, 外1名,"Unsupervised Paraphase Generation using Pre-trained Language Models",[online], [text],2020年06月09日,[取得日 2025.01.31], 取得先<https://arxiv.org/pdf/2006.05477>
Chetanya Rastogi, 外1名,"Can We Achieve More with Less? Exploring Data Augmentation for Toxic Comment Classification",[online], [text],2020年07月02日,[取得日 2025.01.31], 取得先<https://arxiv.org/pdf/2007.00875>
Yuliang Li, 外3名,"Deep Entity Matching with Pre-Trained Language Models",[online], [text],2020年09月02日,[取得日 2025.01.31], 取得先<https://arxiv.org/pdf/2004.00584>
Zhengjie Miao, 外3名,"Snippext: Semi-supervised Opinion Mining with Augmented Data",[online], [text],2020年02月07日,[取得日 2025.01.31], 取得先<https://arxiv.org/pdf/2002.03049>

Also Published As

Publication number Publication date
WO2022230226A1 (en) 2022-11-03
US20220351071A1 (en) 2022-11-03
JP2022171502A (ja) 2022-11-11

Similar Documents

Publication Publication Date Title
JP7697824B2 (ja) メタ学習データ拡張フレームワーク
JP7621805B2 (ja) テキスト分類情報の半教師あり抽出のためのシステム及び方法
US11657231B2 (en) Capturing rich response relationships with small-data neural networks
US11604956B2 (en) Sequence-to-sequence prediction using a neural network model
US20220067284A1 (en) Systems and methods for controllable text summarization
US10664744B2 (en) End-to-end memory networks
US12124487B2 (en) Search platform for unstructured interaction summaries
US12014276B2 (en) Deterministic training of machine learning models
JP7820630B2 (ja) ニューラルネットワークを使用したタスク記述からのコンピュータコード生成
US20230281390A1 (en) Systems and methods for enhanced review comprehension using domain-specific knowledgebases
US11790229B2 (en) Systems and methods for synthetic data generation using a classifier
CN114239589A (zh) 语义理解模型的鲁棒性评估方法、装置及计算机设备
US20230394236A1 (en) Extracting content from freeform text samples into custom fields in a software application
US20250265306A1 (en) Masked reference solutions for mathematical reasoning using language models
CN117743551A (zh) 问答信息的处理方法、装置、计算机可读介质及电子设备
US12412046B2 (en) Systems and methods for unsupervised paraphrase mining
US11875141B2 (en) System and method for training a neural machine translation model
US12189590B1 (en) Systems and methods for querying large data repositories
US20250181325A1 (en) Computer code generation from task descriptions using neural networks
US20260004075A1 (en) Text classification with weighted embeddings
Erd Data augmentation for named entity recognition in the German legal domain
WO2026094025A1 (en) Improvements in data processing
CN118963828A (zh) 基于Transformer神经网络架构的虚拟化代码还原方法、设备及介质
HK40048280B (zh) 样本集的获取方法、装置、计算机设备和存储介质
CN116910338A (zh) 文本补全方法、装置、电子设备和计算机存储介质

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20240327

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20240327

A977 Report on retrieval

Free format text: JAPANESE INTERMEDIATE CODE: A971007

Effective date: 20250122

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20250207

A601 Written request for extension of time

Free format text: JAPANESE INTERMEDIATE CODE: A601

Effective date: 20250408

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20250530

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20250606

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20250612

R150 Certificate of patent or registration of utility model

Ref document number: 7697824

Country of ref document: JP

Free format text: JAPANESE INTERMEDIATE CODE: R150