CN102915321B - 用于处理数据的系统和方法 - Google Patents

用于处理数据的系统和方法 Download PDF

Info

Publication number
CN102915321B
CN102915321B CN201210227570.5A CN201210227570A CN102915321B CN 102915321 B CN102915321 B CN 102915321B CN 201210227570 A CN201210227570 A CN 201210227570A CN 102915321 B CN102915321 B CN 102915321B
Authority
CN
China
Prior art keywords
data
text
unstructured
section
unstructured data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201210227570.5A
Other languages
English (en)
Chinese (zh)
Other versions
CN102915321A (zh
Inventor
L·J·夸特西
K·M·纳卡摩德
B·沃恩
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Boeing Co
Original Assignee
Boeing Co
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Boeing Co filed Critical Boeing Co
Publication of CN102915321A publication Critical patent/CN102915321A/zh
Application granted granted Critical
Publication of CN102915321B publication Critical patent/CN102915321B/zh
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/103Formatting, i.e. changing of presentation of documents
    • G06F40/117Tagging; Marking up; Designating a block; Setting of attributes
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/289Phrasal analysis, e.g. finite state techniques or chunking

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Document Processing Apparatus (AREA)
  • Machine Translation (AREA)
CN201210227570.5A 2011-06-30 2012-07-02 用于处理数据的系统和方法 Expired - Fee Related CN102915321B (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US13/173,028 US9501455B2 (en) 2011-06-30 2011-06-30 Systems and methods for processing data
US13/173,028 2011-06-30

Publications (2)

Publication Number Publication Date
CN102915321A CN102915321A (zh) 2013-02-06
CN102915321B true CN102915321B (zh) 2018-04-27

Family

ID=46717696

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210227570.5A Expired - Fee Related CN102915321B (zh) 2011-06-30 2012-07-02 用于处理数据的系统和方法

Country Status (5)

Country Link
US (1) US9501455B2 (enExample)
EP (1) EP2541434A3 (enExample)
JP (1) JP6022239B2 (enExample)
CN (1) CN102915321B (enExample)
CA (1) CA2775879C (enExample)

Families Citing this family (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8818978B2 (en) * 2008-08-15 2014-08-26 Ebay Inc. Sharing item images using a similarity score
US8521769B2 (en) 2011-07-25 2013-08-27 The Boeing Company Locating ambiguities in data
US8527695B2 (en) 2011-07-29 2013-09-03 The Boeing Company System for updating an associative memory
US9111014B1 (en) 2012-01-06 2015-08-18 Amazon Technologies, Inc. Rule builder for data processing
US9031967B2 (en) * 2012-02-27 2015-05-12 Truecar, Inc. Natural language processing system, method and computer program product useful for automotive data mapping
US9336187B2 (en) 2012-05-14 2016-05-10 The Boeing Company Mediation computing device and associated method for generating semantic tags
US10380246B2 (en) * 2014-12-18 2019-08-13 International Business Machines Corporation Validating topical data of unstructured text in electronic forms to control a graphical user interface based on the unstructured text relating to a question included in the electronic form
CN106375233B (zh) * 2015-11-09 2019-11-15 北京智谷技术服务有限公司 数据传输方法及数据传输装置
CN108369661B (zh) * 2015-11-12 2022-03-11 谷歌有限责任公司 神经网络编程器
US10360501B2 (en) * 2015-12-31 2019-07-23 International Business Machines Corporation Real-time capture and translation of human thoughts and ideas into structured patterns
GB2547887A (en) * 2016-01-29 2017-09-06 Waazon (Holdings) Ltd Method and apparatus for generating amended marked-up text
US10592749B2 (en) 2016-11-14 2020-03-17 General Electric Company Systems and methods for analyzing turns at an airport
EP3574627A1 (en) * 2017-01-16 2019-12-04 Turfan, Ercan Knowledge-based structured communication system
US10834336B2 (en) 2018-01-29 2020-11-10 Ge Aviation Systems Llc Thermal imaging of aircraft
WO2021026428A1 (en) * 2019-08-07 2021-02-11 Zinatt Technologies, Inc. Data entry feature for information tracking system
JP7412307B2 (ja) * 2020-08-28 2024-01-12 株式会社日立製作所 作成支援装置、作成支援方法、および作成支援プログラム

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1571968A (zh) * 2001-08-17 2005-01-26 通用商业矩阵有限责任公司 向数据添加元数据的方法

Family Cites Families (42)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5365430A (en) 1991-06-25 1994-11-15 At&T Bell Laboratories Method for parsing images
JP2001290801A (ja) 2000-02-04 2001-10-19 Fujitsu Ltd 構造文書化システム,構造文書化プログラム,及び、コンピュータ可読格納媒体
US7027974B1 (en) * 2000-10-27 2006-04-11 Science Applications International Corporation Ontology-based parser for natural language processing
US7194483B1 (en) * 2001-05-07 2007-03-20 Intelligenxia, Inc. Method, system, and computer program product for concept-based multi-dimensional analysis of unstructured information
US20030154071A1 (en) * 2002-02-11 2003-08-14 Shreve Gregory M. Process for the document management and computer-assisted translation of documents utilizing document corpora constructed by intelligent agents
US20040024760A1 (en) * 2002-07-31 2004-02-05 Phonetic Research Ltd. System, method and computer program product for matching textual strings using language-biased normalisation, phonetic representation and correlation functions
US7769628B2 (en) 2003-06-03 2010-08-03 The Boeing Company Systems, methods and computer program products for modeling uncertain future demand, supply and associated profitability of a good
US20050278362A1 (en) * 2003-08-12 2005-12-15 Maren Alianna J Knowledge discovery system
US20090204507A1 (en) * 2004-02-26 2009-08-13 Change Research Incorporated Method and system for discovering and generating an insight via a network
US8041713B2 (en) * 2004-03-31 2011-10-18 Google Inc. Systems and methods for analyzing boilerplate
US20060224682A1 (en) * 2005-04-04 2006-10-05 Inmon Data Systems, Inc. System and method of screening unstructured messages and communications
US7653633B2 (en) * 2005-11-12 2010-01-26 Logrhythm, Inc. Log collection, structuring and processing
US7747605B2 (en) * 2006-04-17 2010-06-29 Perry J. Narancic Organizational data analysis and management
US8015162B2 (en) * 2006-08-04 2011-09-06 Google Inc. Detecting duplicate and near-duplicate files
US20090300482A1 (en) * 2006-08-30 2009-12-03 Compsci Resources, Llc Interactive User Interface for Converting Unstructured Documents
US8326819B2 (en) * 2006-11-13 2012-12-04 Exegy Incorporated Method and system for high performance data metatagging and data indexing using coprocessors
US20140188919A1 (en) * 2007-01-26 2014-07-03 Google Inc. Duplicate document detection
US8161045B2 (en) 2007-02-01 2012-04-17 The Boeing Company Use of associate memory learning agent technology to identify interchangeable parts in parts catalogs
US20080313143A1 (en) 2007-06-14 2008-12-18 Boeing Company Apparatus and method for evaluating activities of a hostile force
US8811596B2 (en) 2007-06-25 2014-08-19 The Boeing Company Apparatus including associative memory for evaluating audio communications
WO2009061399A1 (en) * 2007-11-05 2009-05-14 Nagaraju Bandaru Method for crawling, mapping and extracting information associated with a business using heuristic and semantic analysis
US8086592B2 (en) * 2007-11-30 2011-12-27 SAP France S.A. Apparatus and method for associating unstructured text with structured data
US8000956B2 (en) * 2008-02-08 2011-08-16 Xerox Corporation Semantic compatibility checking for automatic correction and discovery of named entities
US20090204610A1 (en) * 2008-02-11 2009-08-13 Hellstrom Benjamin J Deep web miner
JP5364296B2 (ja) 2008-06-05 2013-12-11 株式会社東芝 文書構造化処理装置、及び方法
US8266148B2 (en) * 2008-10-07 2012-09-11 Aumni Data, Inc. Method and system for business intelligence analytics on unstructured data
US9542436B2 (en) 2009-02-09 2017-01-10 The Boeing Company Employing associative memory for enhanced lifecycle management
US10410146B2 (en) 2009-02-09 2019-09-10 The Boeing Company Associative memory learning agent for analysis of manufacturing non-conformance applications
US9053159B2 (en) 2009-02-09 2015-06-09 The Boeing Company Non-conformance analysis using an associative memory learning agent
US8335754B2 (en) * 2009-03-06 2012-12-18 Tagged, Inc. Representing a document using a semantic structure
US8838490B2 (en) 2009-04-07 2014-09-16 The Boeing Company Associate memory learning for analyzing financial transactions
US20100268673A1 (en) 2009-04-16 2010-10-21 The Boeing Company Associate memory learning agent technology for travel optimization and monitoring
US20100306249A1 (en) * 2009-05-27 2010-12-02 James Hill Social network systems and methods
US8577829B2 (en) * 2009-09-11 2013-11-05 Hewlett-Packard Development Company, L.P. Extracting information from unstructured data and mapping the information to a structured schema using the naïve bayesian probability model
EP2325762A1 (en) * 2009-10-27 2011-05-25 Exalead Method and system for processing information of a stream of information
US8417709B2 (en) * 2010-05-27 2013-04-09 International Business Machines Corporation Automatic refinement of information extraction rules
US9082140B2 (en) * 2010-06-09 2015-07-14 Ebay Inc. Systems and methods to extract and utilize textual semantics
US8239349B2 (en) * 2010-10-07 2012-08-07 Hewlett-Packard Development Company, L.P. Extracting data
US20120101860A1 (en) * 2010-10-25 2012-04-26 Ezzat Ahmed K Providing business intelligence
US8484245B2 (en) * 2011-02-08 2013-07-09 Xerox Corporation Large scale unsupervised hierarchical document categorization using ontological guidance
US8239425B1 (en) * 2011-02-28 2012-08-07 Battelle Memorial Institute Isolating desired content, metadata, or both from social media
US20120278336A1 (en) * 2011-04-29 2012-11-01 Malik Hassan H Representing information from documents

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1571968A (zh) * 2001-08-17 2005-01-26 通用商业矩阵有限责任公司 向数据添加元数据的方法

Also Published As

Publication number Publication date
JP2013016172A (ja) 2013-01-24
CA2775879C (en) 2016-08-30
CN102915321A (zh) 2013-02-06
US20130006610A1 (en) 2013-01-03
US9501455B2 (en) 2016-11-22
EP2541434A3 (en) 2017-11-29
CA2775879A1 (en) 2012-12-30
JP6022239B2 (ja) 2016-11-09
EP2541434A2 (en) 2013-01-02

Similar Documents

Publication Publication Date Title
CN102915321B (zh) 用于处理数据的系统和方法
CN114846461B (zh) 用于模式注释文件的自动创建的方法和系统
US10204168B2 (en) Systems and methods for automatically identifying and linking names in digital resources
Chen et al. BigGorilla: An open-source ecosystem for data preparation and integration.
US9286290B2 (en) Producing insight information from tables using natural language processing
CN114616572A (zh) 跨文档智能写作和处理助手
Kiyavitskaya et al. Cerno: Light-weight tool support for semantic annotation of textual documents
JP2011118526A (ja) 単語意味関係抽出装置
US12141211B2 (en) System, method, and computer program product for tokenizing document citations
Sannier et al. Legal markup generation in the large: An experience report
Hamann et al. Detailed mark‐up of semi‐monographic legacy taxonomic works using FlorML
Souza et al. ARCTIC: metadata extraction from scientific papers in pdf using two-layer CRF
Rauf et al. Logical structure extraction from software requirements documents
CN116204618B (zh) 一种智能问答生成方法、装置、电子设备及存储介质
Mirrezaei et al. The triplex approach for recognizing semantic relations from noun phrases, appositions, and adjectives
Patrick et al. Developing SNOMED CT subsets from clinical notes for intensive care service
CN116304347A (zh) 一种基于群智知识的Git命令推荐方法
Baral et al. An exploration of datalog applications to language documentation and reclamation
Bradley et al. SynFinTabs: a dataset of synthetic financial tables for information and table extraction
Labský et al. Information extraction with presentation ontologies
Berlanga et al. Faeton: form analysis and extraction tool for ontology construction
Kamal et al. Improve Academic Query Resolution through BERT-based Question Extraction from Images
Gurita Enriching Retrieval-AugmentedGeneration with Non-TextualInformation to Support ScientificWriting
Milošević A multi-layered approach to information extraction from tables in biomedical documents
Er Turkish factoid question answering using answer pattern matching

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20180427