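CN114175010B - Method, system, computer-readable medium, and program product for discovering the semantic meaning of data fields from profile data of the data fields - Google Patents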

Method, system, computer-readable medium, and program product for discovering the semantic meaning of data fields from profile data of the data fields

Info

Publication number
CN114175010B
Authority
CN
China
Prior art keywords
data
field
tag
label
tests
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202080039989.0A
Other languages
English (en)
Chinese (zh)
Other versions
CN114175010A (zh)
Inventor
C. T. Butler
T. S. Bush
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ab Initio Technology LLC
Original Assignee
Ab Initio Technology LLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2019-05-31
Filing date
2020-05-29
Publication date
2025-03-28
Application filed by Ab Initio Technology LLC
Publication of CN114175010A
Application granted
Publication of CN114175010B
Legal status: Active (current)
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21 Design, administration or maintenance of databases
    • G06F16/211 Schema design and management
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/907 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/908 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/93 Document management systems
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/30 Semantic analysis
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00 Machine learning

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Library & Information Science (AREA)
  • Artificial Intelligence (AREA)
  • Software Systems (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Computation (AREA)
  • Medical Informatics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Business, Economics & Management (AREA)
  • General Business, Economics & Management (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Machine Translation (AREA)
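CN202080039989.0A 2019-05-31 2020-05-29 Method, system, computer-readable medium, and program product for discovering the semantic meaning of data fields from profile data of the data fields Active CN114175010B (zh)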

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US201962855233P 2019-05-31 2019-05-31
US62/855,233 2019-05-31
US16/794,361 US11704494B2 (en) 2019-05-31 2020-02-19 Discovering a semantic meaning of data fields from profile data of the data fields
US16/794,361 2020-02-19
PCT/US2020/035226 WO2020243499A1 (en) 2019-05-31 2020-05-29 Discovering a semantic meaning of data fields from profile data of the data fields

Publications (2)

Publication Number Publication Date
CN114175010A (zh) 2022-03-11
CN114175010B (zh) 2025-03-28

Family

ID=70977338

Family Applications (1)

Application Number Title Priority Date Filing Date
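CN202080039989.0A Active CN114175010B (zh) 2019-05-31 2020-05-29 Method, system, computer-readable medium, and program product for discovering the semantic meaning of data fields from profile data of the data fields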

Country Status (10)

Country Link
US (2) US11704494B2
EP (1) EP3745276A1
JP (1) JP7590350B2
CN (1) CN114175010B
AU (1) AU2020282778B2
BR (1) BR112021023712A2
CA (1) CA3142252A1
DE (1) DE112020002600T5
SG (1) SG11202112388XA
WO (1) WO2020243499A1

Families Citing this family (53)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2021144656A1 (en) 2020-01-15 2021-07-22 Monday.Com Digital processing systems and methods for graphical dynamic table gauges in collaborative work systems
US11410129B2 (en) 2010-05-01 2022-08-09 Monday.com Ltd. Digital processing systems and methods for two-way syncing with third party applications in collaborative work systems
WO2021161104A1 (en) 2020-02-12 2021-08-19 Monday.Com Enhanced display features in collaborative network systems, methods, and devices
US11436359B2 (en) 2018-07-04 2022-09-06 Monday.com Ltd. System and method for managing permissions of users for a single data type column-oriented data structure
US11698890B2 (en) 2018-07-04 2023-07-11 Monday.com Ltd. System and method for generating a column-oriented data structure repository for columns of single data types
US12353419B2 (en) 2018-07-23 2025-07-08 Monday.com Ltd. System and method for generating a tagged column-oriented data structure
US11704494B2 (en) * 2019-05-31 2023-07-18 Ab Initio Technology Llc Discovering a semantic meaning of data fields from profile data of the data fields
US11249964B2 (en) * 2019-11-11 2022-02-15 Microsoft Technology Licensing, Llc Generating estimated database schema and analytics model
US11507738B2 (en) 2019-11-18 2022-11-22 Monday.Com Digital processing systems and methods for automatic updates in collaborative work systems
EP4062313A1 (en) 2019-11-18 2022-09-28 Monday.com Ltd. Collaborative networking systems, methods, and devices
US11886399B2 (en) * 2020-02-26 2024-01-30 Ab Initio Technology Llc Generating rules for data processing values of data fields from semantic labels of the data fields
US11977603B1 (en) * 2020-03-06 2024-05-07 Wells Fargo Bank, N.A. Iteratively trained machine learning models for evaluations of internal consistency
US12326864B2 (en) * 2020-03-15 2025-06-10 International Business Machines Corporation Method and system for operation objects discovery from operation data
US20210326385A1 (en) * 2020-04-19 2021-10-21 International Business Machines Corporation Computerized data classification by statistics and neighbors.
US11829953B1 (en) 2020-05-01 2023-11-28 Monday.com Ltd. Digital processing systems and methods for managing sprints using linked electronic boards
EP4143732A1 (en) 2020-05-01 2023-03-08 Monday.com Ltd. Digital processing systems and methods for enhanced collaborative workflow and networking systems, methods, and devices
US11277361B2 (en) 2020-05-03 2022-03-15 Monday.com Ltd. Digital processing systems and methods for variable hang-time for social layer messages in collaborative work systems
US11645525B2 (en) * 2020-05-27 2023-05-09 International Business Machines Corporation Natural language explanation for classifier predictions
US12124805B2 (en) * 2020-06-22 2024-10-22 Accenture Global Solutions Limited Data ingestion using artificial intelligence and machine learning
DE102020120456A1 (de) * 2020-08-03 2022-02-03 Endress+Hauser Conducta Gmbh+Co. Kg Measured value processing system and measured value processing method
US11841925B1 (en) * 2020-12-10 2023-12-12 Amazon Technologies, Inc. Enabling automatic classification for multi-label classification problems with label completion guarantees
CN112818000B (zh) * 2021-01-06 2023-06-27 佰聆数据股份有限公司 Tag library management and application method, system, and computer device based on multi-tag subjects
US11782582B2 (en) 2021-01-14 2023-10-10 Monday.com Ltd. Digital processing systems and methods for detectable codes in presentation enabling targeted feedback in collaborative work systems
CN113095064B (zh) * 2021-03-18 2025-02-25 杭州数梦工场科技有限公司 Code field identification method, apparatus, electronic device, and storage medium
EP4102424A1 (en) * 2021-06-09 2022-12-14 ABB Schweiz AG Broker entity to bridge semantic gaps for information produced in industrial plants
US12056664B2 (en) 2021-08-17 2024-08-06 Monday.com Ltd. Digital processing systems and methods for external events trigger automatic text-based document alterations in collaborative work systems
CN113918599A (zh) * 2021-08-31 2022-01-11 度小满科技(北京)有限公司 Method and apparatus for sorting based on multiple tag fields
US12431129B2 (en) * 2021-09-28 2025-09-30 Cerence Voice assistant error detection system
US12517717B2 (en) 2021-10-08 2026-01-06 Ab Initio Technology Llc Automated modification of computer programs
JP2024538609A (ja) 2021-10-08 2024-10-23 アビニシオ テクノロジー エルエルシー Automated modification of computer programs
CN113642030B (zh) * 2021-10-14 2022-02-15 广东鸿数科技有限公司 Multi-layer identification method for sensitive data
US12105948B2 (en) 2021-10-29 2024-10-01 Monday.com Ltd. Digital processing systems and methods for display navigation mini maps
US11704371B1 (en) * 2022-02-07 2023-07-18 Microsoft Technology Licensing, Llc User centric topics for topic suggestions
CN115186023B (zh) * 2022-09-07 2022-12-06 杭州安恒信息技术股份有限公司 Data set generation method, apparatus, device, and medium
WO2024064705A1 (en) * 2022-09-20 2024-03-28 Ab Initio Technology Llc Techniques for discovering and updating semantic meaning of data fields
US11741071B1 (en) 2022-12-28 2023-08-29 Monday.com Ltd. Digital processing systems and methods for navigating and viewing displayed content
US11886683B1 (en) 2022-12-30 2024-01-30 Monday.com Ltd Digital processing systems and methods for presenting board graphics
WO2024158920A1 (en) 2023-01-25 2024-08-02 Ab Initio Technology Llc On-demand retrieval of structured data in aggregating data across distinct sources
US11893381B1 (en) 2023-02-21 2024-02-06 Monday.com Ltd Digital processing systems and methods for reducing file bundle sizes
JP2026511562A (ja) 2023-03-23 2026-04-14 アビニシオ テクノロジー エルエルシー Logical access for previewing datasets of expanded views
WO2024205582A1 (en) * 2023-03-29 2024-10-03 Pricewaterhousecoopers Llp Ai-augmented composable and configurable microservices for record linkage and reconciliation
US12265553B2 (en) 2023-03-29 2025-04-01 PwC Product Sales LLC AI-augmented composable and configurable microservices for record linkage and reconciliation
US12346288B2 (en) 2023-05-11 2025-07-01 Ab Initio Technology Llc Migration of datasets among federated database systems
WO2024233801A1 (en) 2023-05-11 2024-11-14 Ab Initio Technology Llc Migration of datasets among federated database systems
US20240411922A1 (en) * 2023-06-06 2024-12-12 Acante, Inc. Data classification and tracking sensitive data accesses
WO2024257014A1 (en) 2023-06-13 2024-12-19 Monday.com Ltd. Digital processing systems and methods for enhanced data representation
WO2025029579A1 (en) 2023-07-28 2025-02-06 Ab Initio Technology Llc Machine learning techniques for discovering keys in relational datasets
WO2025114749A1 (en) 2023-11-28 2025-06-05 Monday.com Ltd. Digital processing systems and methods for facilitating the development and implementation of applications in conjunction with a serverless environment
WO2025114750A1 (en) 2023-11-28 2025-06-05 Monday.com Ltd. Digital processing systems and methods for managing workflows
WO2025137522A1 (en) 2023-12-21 2025-06-26 Ab Initio Technology Llc A development environment for automatically generating code using a multi-tiered metadata model
US12524413B2 (en) 2024-04-26 2026-01-13 Ab Initio Technology Llc Metadata change triggers
WO2025226952A1 (en) 2024-04-26 2025-10-30 Ab Initio Technology Llc Metadata change triggers
CN121098946B (zh) * 2025-11-11 2026-01-27 上海宽域工业网络设备有限公司 Data transmission method, system, device, and medium for a switch

Citations (2)

Publication number Priority date Publication date Assignee Title
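CN101802776A (zh) * 2008-07-29 2010-08-11 特克斯特怀茨有限责任公司 Method and apparatus for relating datasets by using semantic vectors and keyword analyses
CN106528874A (zh) * 2016-12-08 2017-03-22 重庆邮电大学 CLR multi-label data classification method based on the Spark in-memory computing big data platform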

Family Cites Families (76)

Publication number Priority date Publication date Assignee Title
US361A (en) 1837-08-18 Jonathan keedy
US7426520B2 (en) * 2003-09-10 2008-09-16 Exeros, Inc. Method and apparatus for semantic discovery and mapping between data sources
US8868580B2 (en) * 2003-09-15 2014-10-21 Ab Initio Technology Llc Data profiling
US7912904B2 (en) 2004-03-31 2011-03-22 Google Inc. Email system with conversation-centric user interface
US9792351B2 (en) 2005-06-10 2017-10-17 International Business Machines Corporation Tolerant and extensible discovery of relationships in data using structural information and data analysis
AU2006260795A1 (en) 2005-06-20 2006-12-28 Future Route Limited Analytical system for discovery and generation of rules to predict and detect anomalies in data and financial fraud
US7587636B2 (en) 2005-08-04 2009-09-08 Microsoft Corporation Unit test generalization
US20080201319A1 (en) * 2006-04-25 2008-08-21 Mcnamar Richard Timothy Method, system and computer software for using an XBRL medical record for diagnosis, treatment, and insurance coverage
US7783564B2 (en) * 2006-07-25 2010-08-24 Visa U.S.A. Inc. Compliance control in a card based program
KR20100102607A (ko) 2007-11-12 2010-09-24 바이파 사이언스 인코포레이티드 Method of treating uterine cancer and ovarian cancer using a PARP inhibitor alone or in combination with an antitumor agent
US20090276426A1 (en) * 2008-05-02 2009-11-05 Researchanalytics Corporation Semantic Analytical Search and Database
US8140531B2 (en) 2008-05-02 2012-03-20 International Business Machines Corporation Process and method for classifying structured data
US8176072B2 (en) 2009-07-28 2012-05-08 Vulcan Technologies Llc Method and system for tag suggestion in a tag-associated data-object storage system
US8379939B1 (en) * 2009-09-08 2013-02-19 Adobe Systems Incorporated Efficient and scalable face recognition in photo albums
AU2010295547B2 (en) 2009-09-16 2015-05-07 Ab Initio Technology Llc Mapping dataset elements
US20120254333A1 (en) * 2010-01-07 2012-10-04 Rajarathnam Chandramouli Automated detection of deception in short and multilingual electronic messages
US8751218B2 (en) * 2010-02-09 2014-06-10 Siemens Aktiengesellschaft Indexing content at semantic level
US8719207B2 (en) * 2010-07-27 2014-05-06 Oracle International Corporation Method and system for providing decision making based on sense and respond
US8380493B2 (en) * 2010-10-01 2013-02-19 Microsoft Corporation Association of semantic meaning with data elements using data definition tags
US10664862B1 (en) * 2011-06-21 2020-05-26 Contextlogic, Inc. Topic inference based contextual content
US20130006914A1 (en) 2011-06-28 2013-01-03 Microsoft Corporation Exposing search history by category
US8666919B2 (en) 2011-07-29 2014-03-04 Accenture Global Services Limited Data quality management for profiling, linking, cleansing and migrating data
US10248672B2 (en) 2011-09-19 2019-04-02 Citigroup Technology, Inc. Methods and systems for assessing data quality
US20130166515A1 (en) 2011-12-22 2013-06-27 David Kung Generating validation rules for a data report based on profiling the data report in a data processing tool
CN107451225B (zh) 2011-12-23 2021-02-05 亚马逊科技公司 Scalable analysis platform for semi-structured data
US9461876B2 (en) * 2012-08-29 2016-10-04 Loci System and method for fuzzy concept mapping, voting ontology crowd sourcing, and technology prediction
CA2886603A1 (en) * 2012-09-28 2014-04-03 Alkis Papadopoullos A method and system for monitoring social media and analyzing text to automate classification of user posts using a facet based relevance assessment model
US9613125B2 (en) * 2012-10-11 2017-04-04 Nuance Communications, Inc. Data store organizing data using semantic classification
US10489360B2 (en) * 2012-10-17 2019-11-26 Ab Initio Technology Llc Specifying and applying rules to data
KR102113366B1 (ko) 2012-10-22 2020-05-20 아브 이니티오 테크놀로지 엘엘시 Characterizing data sources in a data storage system
US9239889B2 (en) * 2013-03-15 2016-01-19 Sugarcrm Inc. Adaptive search and navigation through semantically aware searching
US9836986B2 (en) * 2013-05-21 2017-12-05 Pearson Education, Inc. Dynamic response entry
US20150032609A1 (en) 2013-07-29 2015-01-29 International Business Machines Corporation Correlation of data sets using determined data types
US10528718B2 (en) 2013-09-27 2020-01-07 Paypal, Inc. Method and apparatus for a data confidence index
WO2015084408A1 (en) 2013-12-06 2015-06-11 Hewlett-Packard Development Company, L.P. Flexible schema table
GB201322057D0 (en) 2013-12-13 2014-01-29 Qatar Foundation Descriptive and prescriptive data cleaning
GB2521198A (en) 2013-12-13 2015-06-17 Ibm Refactoring of databases to include soft type information
US10198460B2 (en) * 2014-06-04 2019-02-05 Waterline Data Science, Inc. Systems and methods for management of data platforms
US10346358B2 (en) * 2014-06-04 2019-07-09 Waterline Data Science, Inc. Systems and methods for management of data platforms
US10169642B2 (en) * 2014-08-06 2019-01-01 Facebook, Inc. Systems and methods for face alert
US20160055427A1 (en) * 2014-10-15 2016-02-25 Brighterion, Inc. Method for providing data science, artificial intelligence and machine learning as-a-service
US10417247B2 (en) * 2014-09-25 2019-09-17 Oracle International Corporation Techniques for semantic searching
US10210246B2 (en) * 2014-09-26 2019-02-19 Oracle International Corporation Techniques for similarity analysis and data enrichment using knowledge sources
AU2015360437A1 (en) * 2014-12-10 2017-06-29 Kyndi, Inc. Technical and semantic signal processing in large, unstructured data fields
US10409802B2 (en) * 2015-06-12 2019-09-10 Ab Initio Technology Llc Data quality analysis
US9910842B2 (en) * 2015-08-12 2018-03-06 Captricity, Inc. Interactively predicting fields in a form
US20170068891A1 (en) * 2015-09-04 2017-03-09 Infotech Soft, Inc. System for rapid ingestion, semantic modeling and semantic querying over computer clusters
US10067972B2 (en) * 2015-11-17 2018-09-04 International Business Machines Corporation Semantic database driven form validation
US20170177712A1 (en) * 2015-12-21 2017-06-22 Ebay Inc. Single step cross-linguistic search using semantic meaning vectors
WO2018025706A1 (ja) 2016-08-05 2018-02-08 日本電気株式会社 Table meaning estimation system, method, and program
US10552443B1 (en) * 2016-08-11 2020-02-04 MuleSoft, Inc. Schemaless to relational representation conversion
US20180060404A1 (en) * 2016-08-29 2018-03-01 Linkedin Corporation Schema abstraction in data ecosystems
US11151100B2 (en) * 2016-10-17 2021-10-19 Sap Se Performing data quality functions using annotations
US10565177B2 (en) * 2016-11-14 2020-02-18 At&T Intellectual Property I, L.P. Software defined entities for digital service transactions
CN106897424A (zh) 2017-02-24 2017-06-27 北京时间股份有限公司 Information annotation system and method
JP7235269B2 (ja) 2017-03-13 2023-03-08 日本電気株式会社 Data item name estimation device, data item name estimation program, and data item name estimation method
US10409820B2 (en) * 2017-09-19 2019-09-10 Adobe Inc. Semantic mapping of form fields
US11967974B2 (en) * 2017-10-30 2024-04-23 AtomBeam Technologies Inc. System and method for data compression with protocol adaptation
CN111771364B (zh) * 2018-01-10 2022-08-23 爱维士软件有限责任公司 Cloud-based anomalous traffic detection and protection in a remote network via DNS attributes
US20200110736A1 (en) * 2018-03-29 2020-04-09 Robert Paul Bauman Natural language, flat field, record management and file system that defines, integrates and operates records comprising best practices and establishes collaborative peer networks to evolve new best practice records
IL258689A (en) * 2018-04-12 2018-05-31 Browarnik Abel A system and method for computerized semantic indexing and searching
US11636376B2 (en) * 2018-06-03 2023-04-25 International Business Machines Corporation Active learning for concept disambiguation
US11474978B2 (en) * 2018-07-06 2022-10-18 Capital One Services, Llc Systems and methods for a data search engine based on data profiles
US12455778B2 (en) * 2018-07-06 2025-10-28 Capital One Services, Llc Systems and methods for data stream simulation
US12001800B2 (en) * 2018-09-13 2024-06-04 Feedzai— Consultadoria e Inovação Tecnológica, S.A. Semantic-aware feature engineering
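JP7137007B2 (ja) * 2018-11-07 2022-09-13 ロレアル Device for packaging and dispensing a product, having improved sealing between two containers
CN109191030B (zh) 2018-11-21 2022-05-06 深圳越海全球供应链股份有限公司 Method for improving data quality and method for improving warehouse operation efficiency
CN109635288B (zh) * 2018-11-29 2023-05-23 东莞理工学院 Résumé extraction method based on a deep neural network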
US11328004B2 (en) * 2019-03-22 2022-05-10 Microsoft Technology Licensing, Llc Method and system for intelligently suggesting tags for documents
US11704494B2 (en) 2019-05-31 2023-07-18 Ab Initio Technology Llc Discovering a semantic meaning of data fields from profile data of the data fields
CN110442568A (zh) 2019-07-30 2019-11-12 北京明略软件系统有限公司 Method and apparatus for obtaining field labels, storage medium, and electronic device
CA3160259A1 (en) 2019-12-19 2021-06-24 Ryan Michael McKay Self-optimizing labeling platform
US11886399B2 (en) 2020-02-26 2024-01-30 Ab Initio Technology Llc Generating rules for data processing values of data fields from semantic labels of the data fields
MX2022015706A (es) * 2020-06-09 2023-01-24 Pfizer Melanocortin 4 receptor antagonists and uses thereof.
US11630853B2 (en) 2021-01-29 2023-04-18 Snowflake Inc. Metadata classification
WO2024064705A1 (en) * 2022-09-20 2024-03-28 Ab Initio Technology Llc Techniques for discovering and updating semantic meaning of data fields

Also Published As

Publication number Publication date
US20200380212A1 (en) 2020-12-03
JP7590350B2 (ja) 2024-11-26
US12456016B2 (en) 2025-10-28
CA3142252A1 (en) 2020-12-03
US11704494B2 (en) 2023-07-18
BR112021023712A2 (pt) 2022-01-04
US20230409835A1 (en) 2023-12-21
AU2020282778A1 (en) 2021-12-02
EP3745276A1 (en) 2020-12-02
CN114175010A (zh) 2022-03-11
WO2020243499A1 (en) 2020-12-03
SG11202112388XA (en) 2021-12-30
AU2020282778B2 (en) 2025-12-11
DE112020002600T5 (de) 2022-02-24
JP2022535792A (ja) 2022-08-10

Similar Documents

Publication Publication Date Title
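CN114175010B (zh) Method, system, computer-readable medium, and program product for discovering the semantic meaning of data fields from profile data of the data fields
JP7806137B2 (ja) Generating rules for data processing values of data fields from semantic labels of the data fields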
US7043492B1 (en) Automated classification of items using classification mappings
US8972336B2 (en) System and method for mapping source columns to target columns
US6697799B1 (en) Automated classification of items using cascade searches
CN102197406B (zh) Fuzzy data operations
JP7796298B2 (ja) Techniques for discovering and updating semantic meaning of data fields
US20040107205A1 (en) Boolean rule-based system for clustering similar records
US20040107189A1 (en) System for identifying similarities in record fields
US20030177118A1 (en) System and method for classification of documents
US20060047617A1 (en) Method and apparatus for analysis and decomposition of classifier data anomalies
WO2019113124A1 (en) Mtransaction processing improvements
Bogatu et al. Towards automatic data format transformations: data wrangling at scale
US12271355B2 (en) Semantic classification for data management
US7395254B2 (en) Method for dynamic knowledge capturing in production printing workflow domain
CA3268252C (en) Techniques for discovering and updating semantic meaning of data fields
JP2000293537A (ja) Data analysis support method and apparatus
HK40039381A (en) Discovering a semantic meaning of data fields from profile data of the data fields
US20210224482A1 (en) Adaptive recognition of entities
Grishin et al. Possibility of obtaining functional dependences from database structure

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant