JP2024528393A - Identifying a classification hierarchy using a trained machine learning pipeline - Google Patents

Identifying a classification hierarchy using a trained machine learning pipeline

Info

Publication number
JP2024528393A
Authority
JP
Japan
Prior art keywords
machine learning
classification
trained
model
learning model
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP2023575783A
Other languages
English (en)
Japanese (ja)
Other versions
JP2024528393A5
JPWO2022261233A5
Inventor
ポッレーリ,アルベルト
クマール,ラジブ
ブロン,マーク・ミシェル
チェン,グオドン
アグラワル,シェカール
ブーフハイム,リチャード・スティーブン
Original Assignee
Oracle International Corporation
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Oracle International Corporation
Publication of JP2024528393A
Publication of JP2024528393A5
Publication of JPWO2022261233A5

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35 Clustering; Classification
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33 Querying
    • G06F16/3331 Query processing
    • G06F16/3332 Query translation
    • G06F16/3335 Syntactic pre-processing, e.g. stopword elimination, stemming
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33 Querying
    • G06F16/3331 Query processing
    • G06F16/334 Query execution
    • G06F16/3347 Query execution using vector based model
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/21 Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/217 Validation; Performance evaluation; Active pattern learning techniques
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/22 Matching criteria, e.g. proximity measures
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/23 Clustering techniques
    • G06F18/231 Hierarchical techniques, i.e. dividing or merging pattern sets so as to obtain a dendrogram
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/24 Classification techniques
    • G06F18/241 Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00 Machine learning
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G06N3/0499 Feedforward networks
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • G06N3/09 Supervised learning
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/70 Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/74 Image or video pattern matching; Proximity measures in feature spaces
    • G06V10/75 Organisation of the matching processes, e.g. simultaneous or sequential comparisons of image or video features; Coarse-fine approaches, e.g. multi-scale approaches; using context analysis; Selection of dictionaries
    • G06V10/751 Comparing pixel values or logical combinations thereof, or feature values having positional relevance, e.g. template matching

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Software Systems (AREA)
  • Computational Linguistics (AREA)
  • Computing Systems (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Mathematical Physics (AREA)
  • Databases & Information Systems (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Evolutionary Biology (AREA)
  • Molecular Biology (AREA)
  • Biophysics (AREA)
  • Biomedical Technology (AREA)
  • Medical Informatics (AREA)
  • Multimedia (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
JP2023575783A 2021-06-10 2022-06-08 Identifying a classification hierarchy using a trained machine learning pipeline Pending JP2024528393A (ja)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US17/303,918 US20220398445A1 (en) 2021-06-10 2021-06-10 Identifying a classification hierarchy using a trained machine learning pipeline
US17/303,918 2021-06-10
PCT/US2022/032705 WO2022261233A1 (en) 2021-06-10 2022-06-08 Identifying a classification hierarchy using a trained machine learning pipeline

Publications (3)

Publication Number Publication Date
JP2024528393A (ja) 2024-07-30
JP2024528393A5 2025-05-01
JPWO2022261233A5 2025-05-01

Family

ID=82482578

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2023575783A Pending JP2024528393A (ja) Identifying a classification hierarchy using a trained machine learning pipeline 2021-06-10 2022-06-08

Country Status (5)

Country Link
US (1) US20220398445A1
EP (1) EP4352655A1
JP (1) JP2024528393A
CN (1) CN117677959A
WO (1) WO2022261233A1

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US12093864B2 (en) * 2021-05-18 2024-09-17 Ebay Inc. Inventory item prediction and listing recommendation
US20220415524A1 (en) * 2021-06-29 2022-12-29 International Business Machines Corporation Machine learning-based adjustment of epidemiological model projections with flexible prediction horizon
WO2024015964A1 (en) * 2022-07-14 2024-01-18 SucceedSmart, Inc. Systems and methods for candidate database querying
US11841851B1 (en) * 2022-07-24 2023-12-12 SAS, Inc. Systems, methods, and graphical user interfaces for taxonomy-based classification of unlabeled structured datasets
US12056214B1 (en) * 2022-09-29 2024-08-06 Amazon Technologies, Inc. Systems for automatically correcting categories of items
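CN115859128B (zh) * 2023-02-23 2023-05-09 成都瑞安信信息安全技术有限公司 Analysis method and system based on interaction similarity of archive data
CN117271674A (zh) * 2023-08-28 2023-12-22 杭州数梦工场科技有限公司 Field type identification method and apparatus, electronic device, and storage medium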
WO2026044241A1 (en) * 2024-08-23 2026-02-26 D. E. Shaw Research, Llc Modeling molecules with transformer machine-learned models

Citations (3)

Publication number Priority date Publication date Assignee Title
US20180032599A1 (en) * 2016-07-29 2018-02-01 Blue Coat Systems, Inc. Grouped categorization of internet content
JP2019125340A (ja) * 2018-01-15 2019-07-25 TATA Consultancy Services Limited System and method for automatically inferring changes in spatio-temporal images
US20200380964A1 (en) * 2019-05-31 2020-12-03 Clinc, Inc. Systems and methods for automactically categorizing unstructured data and improving a machine learning-based dialogue system

Family Cites Families (8)

Publication number Priority date Publication date Assignee Title
US9483768B2 (en) * 2014-08-11 2016-11-01 24/7 Customer, Inc. Methods and apparatuses for modeling customer interaction experiences
US9928448B1 (en) * 2016-09-23 2018-03-27 International Business Machines Corporation Image classification utilizing semantic relationships in a classification hierarchy
US20190354850A1 (en) * 2018-05-17 2019-11-21 International Business Machines Corporation Identifying transfer models for machine learning tasks
US20190294999A1 (en) * 2018-06-16 2019-09-26 Moshe Guttmann Selecting hyper parameters for machine learning algorithms based on past training results
US11693910B2 (en) * 2018-12-13 2023-07-04 Microsoft Technology Licensing, Llc Personalized search result rankings
US11494559B2 (en) * 2019-11-27 2022-11-08 Oracle International Corporation Hybrid in-domain and out-of-domain document processing for non-vocabulary tokens of electronic documents
US11481448B2 (en) * 2020-03-31 2022-10-25 Microsoft Technology Licensing, Llc Semantic matching and retrieval of standardized entities
US11687812B2 (en) * 2020-08-18 2023-06-27 Accenture Global Solutions Limited Autoclassification of products using artificial intelligence

Also Published As

Publication number Publication date
EP4352655A1 (en) 2024-04-17
US20220398445A1 (en) 2022-12-15
WO2022261233A1 (en) 2022-12-15
CN117677959A (zh) 2024-03-08

Similar Documents

Publication Publication Date Title
JP2024528393A (ja) Identifying a classification hierarchy using a trained machine learning pipeline
US10725836B2 (en) Intent-based organisation of APIs
US11836120B2 (en) Machine learning techniques for schema mapping
US12072878B2 (en) Search architecture for hierarchical data using metadata defined relationships
US11494559B2 (en) Hybrid in-domain and out-of-domain document processing for non-vocabulary tokens of electronic documents
US11507747B2 (en) Hybrid in-domain and out-of-domain document processing for non-vocabulary tokens of electronic documents
US11481554B2 (en) Systems and methods for training and evaluating machine learning models using generalized vocabulary tokens for document processing
US20250322312A1 (en) Automated Data Hierarchy Extraction And Prediction Using A Machine Learning Model
US12499158B2 (en) Large language machine learning model query management
US12524611B2 (en) Training graph neural network to identify key-value pairs in documents
ALBayari et al. Cyberbullying classification methods for Arabic: A systematic review
US20230351176A1 (en) Machine-learning-guided issue resolution in data objects
US20250028511A1 (en) Source code conversion from an original computer programming language to a target programming language
CN111886596A (zh) Machine translation locking using sequence-based lock/unlock classification
US12518106B2 (en) Semantically classifying sets of data elements
US20240338233A1 (en) Form Field Recommendation Management
US12585679B2 (en) Executing unsupervised pre-training tasks with a machine learning model to predict document graph attributes
US12417239B2 (en) System, apparatus, and method for structuring documentary data for improved topic extraction and modeling
CN119452364A (zh) Guided augmentation of datasets for machine learning models
US12468716B2 (en) Content-based operation selection
US20250004735A1 (en) Source code validation based on converting the source code to a non-programming language
US20260064746A1 (en) Multimodal Data Ingestion And Retrieval For Agent Systems

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20250422

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20250422

A977 Report on retrieval

Free format text: JAPANESE INTERMEDIATE CODE: A971007

Effective date: 20260128

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20260303