JP2024528393A - 訓練済み機械学習パイプラインを用いた分類階層の識別 - Google Patents
訓練済み機械学習パイプラインを用いた分類階層の識別 Download PDFInfo
- Publication number
- JP2024528393A JP2024528393A JP2023575783A JP2023575783A JP2024528393A JP 2024528393 A JP2024528393 A JP 2024528393A JP 2023575783 A JP2023575783 A JP 2023575783A JP 2023575783 A JP2023575783 A JP 2023575783A JP 2024528393 A JP2024528393 A JP 2024528393A
- Authority
- JP
- Japan
- Prior art keywords
- machine learning
- classification
- trained
- model
- learning model
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/35—Clustering; Classification
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/3331—Query processing
- G06F16/3332—Query translation
- G06F16/3335—Syntactic pre-processing, e.g. stopword elimination, stemming
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/3331—Query processing
- G06F16/334—Query execution
- G06F16/3347—Query execution using vector based model
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/217—Validation; Performance evaluation; Active pattern learning techniques
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/22—Matching criteria, e.g. proximity measures
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/23—Clustering techniques
- G06F18/231—Hierarchical techniques, i.e. dividing or merging pattern sets so as to obtain a dendrogram
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N20/00—Machine learning
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/0499—Feedforward networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/09—Supervised learning
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/74—Image or video pattern matching; Proximity measures in feature spaces
- G06V10/75—Organisation of the matching processes, e.g. simultaneous or sequential comparisons of image or video features; Coarse-fine approaches, e.g. multi-scale approaches; using context analysis; Selection of dictionaries
- G06V10/751—Comparing pixel values or logical combinations thereof, or feature values having positional relevance, e.g. template matching
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Computation (AREA)
- Life Sciences & Earth Sciences (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Software Systems (AREA)
- Computational Linguistics (AREA)
- Computing Systems (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- Mathematical Physics (AREA)
- Databases & Information Systems (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Evolutionary Biology (AREA)
- Molecular Biology (AREA)
- Biophysics (AREA)
- Biomedical Technology (AREA)
- Medical Informatics (AREA)
- Multimedia (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US17/303,918 US20220398445A1 (en) | 2021-06-10 | 2021-06-10 | Identifying a classification hierarchy using a trained machine learning pipeline |
| US17/303,918 | 2021-06-10 | ||
| PCT/US2022/032705 WO2022261233A1 (en) | 2021-06-10 | 2022-06-08 | Identifying a classification hierarchy using a trained machine learning pipeline |
Publications (3)
| Publication Number | Publication Date |
|---|---|
| JP2024528393A true JP2024528393A (ja) | 2024-07-30 |
| JP2024528393A5 JP2024528393A5 (https=) | 2025-05-01 |
| JPWO2022261233A5 JPWO2022261233A5 (https=) | 2025-05-01 |
Family
ID=82482578
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP2023575783A Pending JP2024528393A (ja) | 2021-06-10 | 2022-06-08 | 訓練済み機械学習パイプラインを用いた分類階層の識別 |
Country Status (5)
| Country | Link |
|---|---|
| US (1) | US20220398445A1 (https=) |
| EP (1) | EP4352655A1 (https=) |
| JP (1) | JP2024528393A (https=) |
| CN (1) | CN117677959A (https=) |
| WO (1) | WO2022261233A1 (https=) |
Families Citing this family (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US12093864B2 (en) * | 2021-05-18 | 2024-09-17 | Ebay Inc. | Inventory item prediction and listing recommendation |
| US20220415524A1 (en) * | 2021-06-29 | 2022-12-29 | International Business Machines Corporation | Machine learning-based adjustment of epidemiological model projections with flexible prediction horizon |
| WO2024015964A1 (en) * | 2022-07-14 | 2024-01-18 | SucceedSmart, Inc. | Systems and methods for candidate database querying |
| US11841851B1 (en) * | 2022-07-24 | 2023-12-12 | SAS, Inc. | Systems, methods, and graphical user interfaces for taxonomy-based classification of unlabeled structured datasets |
| US12056214B1 (en) * | 2022-09-29 | 2024-08-06 | Amazon Technologies, Inc. | Systems for automatically correcting categories of items |
| CN115859128B (zh) * | 2023-02-23 | 2023-05-09 | 成都瑞安信信息安全技术有限公司 | 一种基于档案数据交互相似度的分析方法和系统 |
| CN117271674A (zh) * | 2023-08-28 | 2023-12-22 | 杭州数梦工场科技有限公司 | 字段类型识别方法、装置、电子设备及存储介质 |
| WO2026044241A1 (en) * | 2024-08-23 | 2026-02-26 | D. E. Shaw Research, Llc | Modeling molecules with transformer machine-learned models |
Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20180032599A1 (en) * | 2016-07-29 | 2018-02-01 | Blue Coat Systems, Inc. | Grouped categorization of internet content |
| JP2019125340A (ja) * | 2018-01-15 | 2019-07-25 | タタ コンサルタンシー サービシズ リミテッドTATA Consultancy Services Limited | 時空間画像の変化を自動推論するためのシステムおよび方法 |
| US20200380964A1 (en) * | 2019-05-31 | 2020-12-03 | Clinc, Inc. | Systems and methods for automactically categorizing unstructured data and improving a machine learning-based dialogue system |
Family Cites Families (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US9483768B2 (en) * | 2014-08-11 | 2016-11-01 | 24/7 Customer, Inc. | Methods and apparatuses for modeling customer interaction experiences |
| US9928448B1 (en) * | 2016-09-23 | 2018-03-27 | International Business Machines Corporation | Image classification utilizing semantic relationships in a classification hierarchy |
| US20190354850A1 (en) * | 2018-05-17 | 2019-11-21 | International Business Machines Corporation | Identifying transfer models for machine learning tasks |
| US20190294999A1 (en) * | 2018-06-16 | 2019-09-26 | Moshe Guttmann | Selecting hyper parameters for machine learning algorithms based on past training results |
| US11693910B2 (en) * | 2018-12-13 | 2023-07-04 | Microsoft Technology Licensing, Llc | Personalized search result rankings |
| US11494559B2 (en) * | 2019-11-27 | 2022-11-08 | Oracle International Corporation | Hybrid in-domain and out-of-domain document processing for non-vocabulary tokens of electronic documents |
| US11481448B2 (en) * | 2020-03-31 | 2022-10-25 | Microsoft Technology Licensing, Llc | Semantic matching and retrieval of standardized entities |
| US11687812B2 (en) * | 2020-08-18 | 2023-06-27 | Accenture Global Solutions Limited | Autoclassification of products using artificial intelligence |
-
2021
- 2021-06-10 US US17/303,918 patent/US20220398445A1/en active Pending
-
2022
- 2022-06-08 EP EP22740694.9A patent/EP4352655A1/en not_active Withdrawn
- 2022-06-08 JP JP2023575783A patent/JP2024528393A/ja active Pending
- 2022-06-08 WO PCT/US2022/032705 patent/WO2022261233A1/en not_active Ceased
- 2022-06-08 CN CN202280049145.3A patent/CN117677959A/zh active Pending
Patent Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20180032599A1 (en) * | 2016-07-29 | 2018-02-01 | Blue Coat Systems, Inc. | Grouped categorization of internet content |
| JP2019125340A (ja) * | 2018-01-15 | 2019-07-25 | タタ コンサルタンシー サービシズ リミテッドTATA Consultancy Services Limited | 時空間画像の変化を自動推論するためのシステムおよび方法 |
| US20200380964A1 (en) * | 2019-05-31 | 2020-12-03 | Clinc, Inc. | Systems and methods for automactically categorizing unstructured data and improving a machine learning-based dialogue system |
Also Published As
| Publication number | Publication date |
|---|---|
| EP4352655A1 (en) | 2024-04-17 |
| US20220398445A1 (en) | 2022-12-15 |
| WO2022261233A1 (en) | 2022-12-15 |
| CN117677959A (zh) | 2024-03-08 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| JP2024528393A (ja) | 訓練済み機械学習パイプラインを用いた分類階層の識別 | |
| US10725836B2 (en) | Intent-based organisation of APIs | |
| US11836120B2 (en) | Machine learning techniques for schema mapping | |
| US12072878B2 (en) | Search architecture for hierarchical data using metadata defined relationships | |
| US11494559B2 (en) | Hybrid in-domain and out-of-domain document processing for non-vocabulary tokens of electronic documents | |
| US11507747B2 (en) | Hybrid in-domain and out-of-domain document processing for non-vocabulary tokens of electronic documents | |
| US11481554B2 (en) | Systems and methods for training and evaluating machine learning models using generalized vocabulary tokens for document processing | |
| US20250322312A1 (en) | Automated Data Hierarchy Extraction And Prediction Using A Machine Learning Model | |
| US12499158B2 (en) | Large language machine learning model query management | |
| US12524611B2 (en) | Training graph neural network to identify key-value pairs in documents | |
| ALBayari et al. | Cyberbullying classification methods for Arabic: A systematic review | |
| US20230351176A1 (en) | Machine-learning-guided issue resolution in data objects | |
| US20250028511A1 (en) | Source code conversion from an original computer programming language to a target programming language | |
| CN111886596A (zh) | 使用基于序列的锁定/解锁分类进行机器翻译锁定 | |
| US12518106B2 (en) | Semantically classifying sets of data elements | |
| US20240338233A1 (en) | Form Field Recommendation Management | |
| US12585679B2 (en) | Executing unsupervised pre-training tasks with a machine learning model to predict document graph attributes | |
| US12417239B2 (en) | System, apparatus, and method for structuring documentary data for improved topic extraction and modeling | |
| CN119452364A (zh) | 机器学习模型的数据集的引导增强 | |
| US12468716B2 (en) | Content-based operation selection | |
| US20250004735A1 (en) | Source code validation based on converting the source code to a non-programming language | |
| US20260064746A1 (en) | Multimodal Data Ingestion And Retrieval For Agent Systems |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20250422 |
|
| A621 | Written request for application examination |
Free format text: JAPANESE INTERMEDIATE CODE: A621 Effective date: 20250422 |
|
| A977 | Report on retrieval |
Free format text: JAPANESE INTERMEDIATE CODE: A971007 Effective date: 20260128 |
|
| A131 | Notification of reasons for refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A131 Effective date: 20260303 |