TWI563400B - Method, computer program product and system for extracting semantic relationships from table structures in electronic documents - Google Patents

Method, computer program product and system for extracting semantic relationships from table structures in electronic documents

Info

Publication number
TWI563400B
TWI563400B TW102129888A TW102129888A TWI563400B TW I563400 B TWI563400 B TW I563400B TW 102129888 A TW102129888 A TW 102129888A TW 102129888 A TW102129888 A TW 102129888A TW I563400 B TWI563400 B TW I563400B
Authority
TW
Taiwan
Prior art keywords
computer program
program product
electronic documents
semantic relationships
table structures
Prior art date
Application number
TW102129888A
Other languages
English (en)
Other versions
TW201419014A (zh
Inventor
Scott N Gerard
Mark G Megerian
Original Assignee
Ibm
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ibm filed Critical Ibm
Publication of TW201419014A publication Critical patent/TW201419014A/zh
Application granted granted Critical
Publication of TWI563400B publication Critical patent/TWI563400B/zh

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/10Office automation; Time management
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2452Query translation
    • G06F16/24522Translation of natural language queries to structured queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/166Editing, e.g. inserting or deleting
    • G06F40/177Editing, e.g. inserting or deleting of tables; using ruled lines
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/166Editing, e.g. inserting or deleting
    • G06F40/186Templates

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Data Mining & Analysis (AREA)
  • Artificial Intelligence (AREA)
  • Computational Linguistics (AREA)
  • Strategic Management (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Human Resources & Organizations (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Databases & Information Systems (AREA)
  • Economics (AREA)
  • Marketing (AREA)
  • Operations Research (AREA)
  • Quality & Reliability (AREA)
  • Tourism & Hospitality (AREA)
  • General Business, Economics & Management (AREA)
  • Machine Translation (AREA)
TW102129888A 2012-10-30 2013-08-20 Method, computer program product and system for extracting semantic relationships from table structures in electronic documents TWI563400B (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US13/664,144 US8914419B2 (en) 2012-10-30 2012-10-30 Extracting semantic relationships from table structures in electronic documents

Publications (2)

Publication Number Publication Date
TW201419014A TW201419014A (zh) 2014-05-16
TWI563400B true TWI563400B (en) 2016-12-21

Family

ID=50548409

Family Applications (1)

Application Number Title Priority Date Filing Date
TW102129888A TWI563400B (en) 2012-10-30 2013-08-20 Method, computer program product and system for extracting semantic relationships from table structures in electronic documents

Country Status (3)

Country Link
US (1) US8914419B2 (zh)
CN (1) CN103793372A (zh)
TW (1) TWI563400B (zh)

Families Citing this family (63)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2021099839A1 (en) * 2019-11-18 2021-05-27 Roy Mann Collaborative networking systems, methods, and devices
US10289653B2 (en) 2013-03-15 2019-05-14 International Business Machines Corporation Adapting tabular data for narration
US9164977B2 (en) 2013-06-24 2015-10-20 International Business Machines Corporation Error correction in tables using discovered functional dependencies
US9600461B2 (en) 2013-07-01 2017-03-21 International Business Machines Corporation Discovering relationships in tabular data
US9495347B2 (en) * 2013-07-16 2016-11-15 Recommind, Inc. Systems and methods for extracting table information from documents
US9607039B2 (en) * 2013-07-18 2017-03-28 International Business Machines Corporation Subject-matter analysis of tabular data
US9830314B2 (en) 2013-11-18 2017-11-28 International Business Machines Corporation Error correction in tables using a question and answer system
US9542928B2 (en) * 2014-03-25 2017-01-10 Microsoft Technology Licensing, Llc Generating natural language outputs
US10318625B2 (en) 2014-05-13 2019-06-11 International Business Machines Corporation Table narration using narration templates
US9977780B2 (en) 2014-06-13 2018-05-22 International Business Machines Corporation Generating language sections from tabular data
US9514118B2 (en) * 2014-06-18 2016-12-06 Yokogawa Electric Corporation Method, system and computer program for generating electronic checklists
US20160019192A1 (en) * 2014-07-21 2016-01-21 General Electric Company System and method to extract structured semantic model from document
US9720962B2 (en) 2014-08-19 2017-08-01 International Business Machines Corporation Answering superlative questions with a question and answer system
US10191946B2 (en) 2015-03-11 2019-01-29 International Business Machines Corporation Answering natural language table queries through semantic table representation
JP6578685B2 (ja) * 2015-03-16 2019-09-25 富士通株式会社 関係推定方法、関係推定プログラムおよび情報処理装置
US10452661B2 (en) * 2015-06-18 2019-10-22 Microsoft Technology Licensing, Llc Automated database schema annotation
CN104933162B (zh) * 2015-06-26 2018-03-09 河海大学 一种从元数据标注的csv数据到rdf数据的转换方法
US10685173B2 (en) * 2015-08-17 2020-06-16 International Business Machines Corporation Formatting tables with complex patterns
US10095740B2 (en) 2015-08-25 2018-10-09 International Business Machines Corporation Selective fact generation from table data in a cognitive system
US10078629B2 (en) 2015-10-22 2018-09-18 International Business Machines Corporation Tabular data compilation
US20170116194A1 (en) 2015-10-23 2017-04-27 International Business Machines Corporation Ingestion planning for complex tables
US10650050B2 (en) 2016-12-06 2020-05-12 Microsoft Technology Licensing, Llc Synthesizing mapping relationships using table corpus
US10002146B1 (en) * 2017-02-13 2018-06-19 Sas Institute Inc. Distributed data set indexing
US11475488B2 (en) 2017-09-11 2022-10-18 Accenture Global Solutions Limited Dynamic scripts for tele-agents
US10997228B2 (en) 2017-10-26 2021-05-04 International Business Machines Corporation Comparing tables with semantic vectors
US11853930B2 (en) 2017-12-15 2023-12-26 Accenture Global Solutions Limited Dynamic lead generation
US11650970B2 (en) 2018-03-09 2023-05-16 International Business Machines Corporation Extracting structure and semantics from tabular data
US10762142B2 (en) 2018-03-16 2020-09-01 Open Text Holdings, Inc. User-defined automated document feature extraction and optimization
US11048762B2 (en) 2018-03-16 2021-06-29 Open Text Holdings, Inc. User-defined automated document feature modeling, extraction and optimization
US10878195B2 (en) * 2018-05-03 2020-12-29 Microsoft Technology Licensing, Llc Automated extraction of unstructured tables and semantic information from arbitrary documents
US11200413B2 (en) 2018-07-31 2021-12-14 International Business Machines Corporation Table recognition in portable document format documents
CN109117479B (zh) * 2018-08-13 2022-07-22 数据地平线(广州)科技有限公司 一种金融文档智能核查方法、装置及存储介质
CN109241514A (zh) * 2018-09-10 2019-01-18 深圳市泰洲科技有限公司 数据自动导入方法、装置、计算机设备及存储介质
US11443106B2 (en) 2018-09-20 2022-09-13 International Business Machines Corporation Intelligent normalization and de-normalization of tables for multiple processing scenarios
US10831798B2 (en) 2018-09-20 2020-11-10 International Business Machines Corporation System for extracting header labels for header cells in tables having complex header structures
US10776573B2 (en) * 2018-09-20 2020-09-15 International Business Machines Corporation System for associating data cells with headers in tables having complex header structures
US11514258B2 (en) 2018-09-20 2022-11-29 International Business Machines Corporation Table header detection using global machine learning features from orthogonal rows and columns
US11762890B2 (en) 2018-09-28 2023-09-19 International Business Machines Corporation Framework for analyzing table data by question answering systems
US11468882B2 (en) * 2018-10-09 2022-10-11 Accenture Global Solutions Limited Semantic call notes
US10923114B2 (en) 2018-10-10 2021-02-16 N3, Llc Semantic jargon
US12001972B2 (en) 2018-10-31 2024-06-04 Accenture Global Solutions Limited Semantic inferencing in customer relationship management
US11132695B2 (en) 2018-11-07 2021-09-28 N3, Llc Semantic CRM mobile communications sessions
US10742813B2 (en) 2018-11-08 2020-08-11 N3, Llc Semantic artificial intelligence agent
US10972608B2 (en) 2018-11-08 2021-04-06 N3, Llc Asynchronous multi-dimensional platform for customer and tele-agent communications
US11610277B2 (en) 2019-01-25 2023-03-21 Open Text Holdings, Inc. Seamless electronic discovery system with an enterprise data portal
US10546135B1 (en) 2019-03-06 2020-01-28 SecurityScorecard, Inc. Inquiry response mapping for determining a cybersecurity risk level of an entity
US11797756B2 (en) 2019-04-30 2023-10-24 Microsoft Technology Licensing, Llc Document auto-completion
CN110134957B (zh) * 2019-05-14 2023-06-13 云南电网有限责任公司电力科学研究院 一种基于语义分析的科技成果入库方法及系统
EP4010813A4 (en) * 2019-08-08 2023-03-08 Microsoft Technology Licensing, LLC SPREAD TABLE TRANSFORMATION
US11380116B2 (en) 2019-10-22 2022-07-05 International Business Machines Corporation Automatic delineation and extraction of tabular data using machine learning
US11443264B2 (en) 2020-01-29 2022-09-13 Accenture Global Solutions Limited Agnostic augmentation of a customer relationship management application
US11481785B2 (en) 2020-04-24 2022-10-25 Accenture Global Solutions Limited Agnostic customer relationship management with browser overlay and campaign management portal
US11392960B2 (en) 2020-04-24 2022-07-19 Accenture Global Solutions Limited Agnostic customer relationship management with agent hub and browser overlay
JP2022035594A (ja) * 2020-08-21 2022-03-04 株式会社日立製作所 表構造認識装置及び表構造認識方法
US11507903B2 (en) 2020-10-01 2022-11-22 Accenture Global Solutions Limited Dynamic formation of inside sales team or expert support team
CN112270165A (zh) * 2020-10-19 2021-01-26 中译语通科技股份有限公司 一种表格的处理方法、装置、介质和电子设备
US11688193B2 (en) 2020-11-13 2023-06-27 International Business Machines Corporation Interactive structure annotation with artificial intelligence
US12093642B2 (en) * 2020-12-01 2024-09-17 Jpmorgan Chase Bank, N.A. Method and system for conditioned generation of descriptive commentary for quantitative data
US11797586B2 (en) 2021-01-19 2023-10-24 Accenture Global Solutions Limited Product presentation for customer relationship management
US11816677B2 (en) 2021-05-03 2023-11-14 Accenture Global Solutions Limited Call preparation engine for customer relationship management
CN113656592B (zh) * 2021-07-22 2022-09-27 北京百度网讯科技有限公司 基于知识图谱的数据处理方法、装置、电子设备和介质
CN113868260A (zh) * 2021-10-09 2021-12-31 北京字跳网络技术有限公司 数据处理方法、装置和电子设备
US12026525B2 (en) 2021-11-05 2024-07-02 Accenture Global Solutions Limited Dynamic dashboard administration

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020078406A1 (en) * 2000-10-24 2002-06-20 Goh Kondoh Structure recovery system, parsing system, conversion system, computer system, parsing method, storage medium, and program transmission apparatus
WO2003012661A1 (en) * 2001-07-31 2003-02-13 Invention Machine Corporation Computer based summarization of natural language documents
US20090171999A1 (en) * 2007-12-27 2009-07-02 Cloudscale Inc. System and Methodology for Parallel Stream Processing
US8255789B2 (en) * 2008-09-30 2012-08-28 Apple Inc. Providing spreadsheet features

Family Cites Families (39)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4688195A (en) 1983-01-28 1987-08-18 Texas Instruments Incorporated Natural-language interface generating system
JP2926066B2 (ja) 1992-05-29 1999-07-28 富士ゼロックス株式会社 表認識装置
WO1999027556A2 (en) 1997-11-20 1999-06-03 Xacct Technologies, Inc. Network accounting and billing system and method
US6161103A (en) * 1998-05-06 2000-12-12 Epiphany, Inc. Method and apparatus for creating aggregates for use in a datamart
US6769096B1 (en) 1998-06-24 2004-07-27 Microsoft Corporation System and method for updating a table of contents in a frameset
US6452467B1 (en) 1999-04-01 2002-09-17 Mcewan Technologies, Llc Material level sensor having a wire-horn launcher
US7620665B1 (en) 2000-11-21 2009-11-17 International Business Machines Corporation Method and system for a generic metadata-based mechanism to migrate relational data between databases
US7054871B2 (en) * 2000-12-11 2006-05-30 Lucent Technologies Inc. Method for identifying and using table structures
US6904428B2 (en) 2001-04-18 2005-06-07 Illinois Institute Of Technology Intranet mediator
JP2004062446A (ja) 2002-07-26 2004-02-26 Ibm Japan Ltd 情報収集システム、アプリケーションサーバ、情報収集方法、およびプログラム
US20040064447A1 (en) 2002-09-27 2004-04-01 Simske Steven J. System and method for management of synonymic searching
US7143026B2 (en) 2002-12-12 2006-11-28 International Business Machines Corporation Generating rules to convert HTML tables to prose
US7792829B2 (en) 2005-01-28 2010-09-07 Microsoft Corporation Table querying
US20070011183A1 (en) 2005-07-05 2007-01-11 Justin Langseth Analysis and transformation tools for structured and unstructured data
US7672831B2 (en) 2005-10-24 2010-03-02 Invention Machine Corporation System and method for cross-language knowledge searching
US8024653B2 (en) 2005-11-14 2011-09-20 Make Sence, Inc. Techniques for creating computer generated notes
US8036889B2 (en) 2006-02-27 2011-10-11 Nuance Communications, Inc. Systems and methods for filtering dictated and non-dictated sections of documents
US7788580B1 (en) 2006-03-28 2010-08-31 Amazon Technologies, Inc. Processing digital images including headers and footers into reflow content
KR100918847B1 (ko) 2007-10-15 2009-09-28 한국전자통신연구원 온톨로지 인스턴스 자동 생성 장치 및 방법
JP4476318B2 (ja) 2007-10-31 2010-06-09 富士通株式会社 論理構造認識プログラム、論理構造認識装置、および論理構造認識方法
US7792823B2 (en) 2008-01-15 2010-09-07 International Business Machines Corporation Maintained symbol table only index
US8275803B2 (en) 2008-05-14 2012-09-25 International Business Machines Corporation System and method for providing answers to questions
WO2009155281A1 (en) * 2008-06-17 2009-12-23 The Trustees Of Columbia University In The City Of New York System and method for dynamically and interactively searching media data
US8959481B2 (en) 2009-04-30 2015-02-17 International Business Machines Corporation Determining system level dependencies
CN101887413B (zh) * 2009-05-14 2012-07-04 北大方正集团有限公司 版式表格的结构处理方法和系统
CN101556606B (zh) * 2009-05-20 2010-12-01 同方知网(北京)技术有限公司 一种基于Web数值表格抽取的数据挖掘方法
US8037108B1 (en) 2009-07-22 2011-10-11 Adobe Systems Incorporated Conversion of relational databases into triplestores
US8280838B2 (en) 2009-09-17 2012-10-02 International Business Machines Corporation Evidence evaluation system and method based on question answering
US20110125734A1 (en) 2009-11-23 2011-05-26 International Business Machines Corporation Questions and answers generation
US20110126197A1 (en) 2009-11-25 2011-05-26 Novell, Inc. System and method for controlling cloud and virtualized data centers in an intelligent workload management system
WO2012006509A1 (en) 2010-07-09 2012-01-12 Google Inc. Table search using recovered semantic information
EP2622592A4 (en) 2010-09-28 2017-04-05 International Business Machines Corporation Providing answers to questions using multiple models to score candidate answers
WO2012047557A1 (en) 2010-09-28 2012-04-12 International Business Machines Corporation Evidence diffusion among candidate answers during question answering
US9788834B2 (en) 2010-09-30 2017-10-17 Ethicon Llc Layer comprising deployable attachment members
US8442988B2 (en) * 2010-11-04 2013-05-14 International Business Machines Corporation Adaptive cell-specific dictionaries for frequency-partitioned multi-dimensional data
US8630989B2 (en) 2011-05-27 2014-01-14 International Business Machines Corporation Systems and methods for information extraction using contextual pattern discovery
US8874540B2 (en) 2011-09-07 2014-10-28 Xerox Corporation Method for semantic classification of numeric data sets
US8601030B2 (en) 2011-09-09 2013-12-03 International Business Machines Corporation Method for a natural language question-answering system to complement decision-support in a real-time command center
US8909648B2 (en) 2012-01-18 2014-12-09 Technion Research & Development Foundation Limited Methods and systems of supervised learning of semantic relatedness

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020078406A1 (en) * 2000-10-24 2002-06-20 Goh Kondoh Structure recovery system, parsing system, conversion system, computer system, parsing method, storage medium, and program transmission apparatus
WO2003012661A1 (en) * 2001-07-31 2003-02-13 Invention Machine Corporation Computer based summarization of natural language documents
US20090171999A1 (en) * 2007-12-27 2009-07-02 Cloudscale Inc. System and Methodology for Parallel Stream Processing
US8255789B2 (en) * 2008-09-30 2012-08-28 Apple Inc. Providing spreadsheet features

Also Published As

Publication number Publication date
US8914419B2 (en) 2014-12-16
US20140122535A1 (en) 2014-05-01
TW201419014A (zh) 2014-05-16
CN103793372A (zh) 2014-05-14

Similar Documents

Publication Publication Date Title
TWI563400B (en) Method, computer program product and system for extracting semantic relationships from table structures in electronic documents
HK1214670A1 (zh) 電子錢包設備、方法及計算機程序產品
EP2864933A4 (en) METHOD, APPARATUS AND COMPUTER PROGRAM PRODUCT FOR EXTRACTING HUMAN FACE CHARACTERISTICS
EP2850555A4 (en) METHOD IN A PROCESSOR, DEVICE AND COMPUTER PROGRAM PRODUCT
HK1211560A1 (zh) 呼叫給出設備、電梯系統、方法和計算機程序產品
EP2732366A4 (en) INFORMATION PROCESSING DEVICE, INFORMATION PROCESSING METHOD AND COMPUTER PROGRAM PRODUCT
EP2740040A4 (en) INFORMATION PROCESSING DEVICE, INFORMATION PROCESSING METHOD AND COMPUTER PROGRAM PRODUCT
EP2804372A4 (en) DATA PROCESSING DEVICE, METHOD FOR DATA PROCESSING, AND COMPUTER PROGRAM
ZA201600360B (en) Key device and associated method, computer program and computer program product
EP2720176A4 (en) INFORMATION PROCESSING DEVICE, INFORMATION PROCESSING METHOD, AND COMPUTER PROGRAM
EP2695386A4 (en) PICTURE PROCESSING DEVICE, PICTURE PROCESSING METHOD AND PICTURE PROCESSING COMPUTER PROGRAM
SG2013082268A (en) Method, call-giving device, elevator system and computer program product
EP2920743A4 (en) Method and system for the spotting of arbitrary words in handwritten documents
EP2828741A4 (en) METHOD IN PROCESSOR, APPARATUS AND COMPUTER PROGRAM PRODUCT
EP2936412A4 (en) PAY-PER-SALE SYSTEM, PROCESS AND COMPUTER PROGRAM PRODUCT
EP2734950A4 (en) INFORMATION PROCESSING DEVICE, INFORMATION PROCESSING METHOD AND COMPUTER PROGRAM
EP2718797A4 (en) INFORMATION PROCESSING DEVICE, INFORMATION PROCESSING METHOD AND COMPUTER PROGRAM PRODUCT
GB201313827D0 (en) Method,apparatus and computer program product for activity recognition
EP2669778A4 (en) ENTRY DEVICE, ENTRY METHOD, AND COMPUTER PROGRAM
GB201409109D0 (en) Device, program and method for clustering documents
EP2764424A4 (en) INFORMATION PROCESSING DEVICE, INFORMATION PROCESSING METHOD AND COMPUTER PROGRAM
EP2842297A4 (en) METHOD, APPARATUS AND COMPUTER PROGRAM PRODUCT FOR PROGRAMMING FILE TELEVERSEMENTS
EP2828748A4 (en) METHOD IN A PROCESSOR, DEVICE AND COMPUTER PROGRAM PRODUCT
GB201317574D0 (en) Method apparatus and computer program for product for similarity determination in multimedia content
EP2857975A4 (en) PARALLEL DATA PROCESSING SYSTEM, COMPUTER, AND METHOD FOR PARALLEL DATA PROCESSING

Legal Events

Date Code Title Description
MM4A Annulment or lapse of patent due to non-payment of fees