KR102865616B1 - 문서 간 지능형 저작 및 처리 보조기 - Google Patents
문서 간 지능형 저작 및 처리 보조기Info
- Publication number
- KR102865616B1 KR102865616B1 KR1020247028082A KR20247028082A KR102865616B1 KR 102865616 B1 KR102865616 B1 KR 102865616B1 KR 1020247028082 A KR1020247028082 A KR 1020247028082A KR 20247028082 A KR20247028082 A KR 20247028082A KR 102865616 B1 KR102865616 B1 KR 102865616B1
- Authority
- KR
- South Korea
- Prior art keywords
- chunks
- documents
- document
- document set
- identifying
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/12—Use of codes for handling textual entities
- G06F40/131—Fragmentation of text files, e.g. creating reusable text-blocks; Linking to fragments, e.g. using XInclude; Namespaces
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/245—Query processing
- G06F16/2457—Query processing with adaptation to user needs
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/248—Presentation of query results
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/93—Document management systems
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/23—Clustering techniques
- G06F18/232—Non-hierarchical techniques
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/103—Formatting, i.e. changing of presentation of documents
- G06F40/106—Display of layout of documents; Previewing
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/103—Formatting, i.e. changing of presentation of documents
- G06F40/117—Tagging; Marking up; Designating a block; Setting of attributes
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/166—Editing, e.g. inserting or deleting
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/166—Editing, e.g. inserting or deleting
- G06F40/169—Annotation, e.g. comment data or footnotes
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
- G06F40/289—Phrasal analysis, e.g. finite state techniques or chunking
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
- G06F40/289—Phrasal analysis, e.g. finite state techniques or chunking
- G06F40/295—Named entity recognition
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/30—Semantic analysis
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N20/00—Machine learning
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
- G06N3/0455—Auto-encoder networks; Encoder-decoder networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/0464—Convolutional networks [CNN, ConvNet]
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/082—Learning methods modifying the architecture, e.g. adding, deleting or silencing nodes or connections
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/088—Non-supervised learning, e.g. competitive learning
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/0895—Weakly supervised learning, e.g. semi-supervised or self-supervised learning
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/09—Supervised learning
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/091—Active learning
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/096—Transfer learning
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/40—Document-oriented image-based pattern recognition
- G06V30/41—Analysis of document content
- G06V30/414—Extracting the geometrical structure, e.g. layout tree; Block segmentation, e.g. bounding boxes for graphics or text
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/40—Document-oriented image-based pattern recognition
- G06V30/41—Analysis of document content
- G06V30/416—Extracting the logical structure, e.g. chapters, sections or page numbers; Identifying elements of the document, e.g. authors
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/40—Document-oriented image-based pattern recognition
- G06V30/41—Analysis of document content
- G06V30/418—Document matching, e.g. of document images
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/166—Editing, e.g. inserting or deleting
- G06F40/186—Templates
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/205—Parsing
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Artificial Intelligence (AREA)
- General Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Data Mining & Analysis (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Software Systems (AREA)
- Evolutionary Computation (AREA)
- Mathematical Physics (AREA)
- Computing Systems (AREA)
- Life Sciences & Earth Sciences (AREA)
- Molecular Biology (AREA)
- Biophysics (AREA)
- Biomedical Technology (AREA)
- Databases & Information Systems (AREA)
- Multimedia (AREA)
- Business, Economics & Management (AREA)
- General Business, Economics & Management (AREA)
- Medical Informatics (AREA)
- Computer Graphics (AREA)
- Geometry (AREA)
- Machine Translation (AREA)
- Document Processing Apparatus (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Evolutionary Biology (AREA)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| KR1020257031882A KR20250143131A (ko) | 2019-09-16 | 2020-07-24 | 문서 간 지능형 저작 및 처리 보조기 |
Applications Claiming Priority (4)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US201962900793P | 2019-09-16 | 2019-09-16 | |
| US62/900,793 | 2019-09-16 | ||
| KR1020227011501A KR102699233B1 (ko) | 2019-09-16 | 2020-07-24 | 문서 간 지능형 저작 및 처리 보조기 |
| PCT/US2020/043606 WO2021055102A1 (en) | 2019-09-16 | 2020-07-24 | Cross-document intelligent authoring and processing assistant |
Related Parent Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| KR1020227011501A Division KR102699233B1 (ko) | 2019-09-16 | 2020-07-24 | 문서 간 지능형 저작 및 처리 보조기 |
Related Child Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| KR1020257031882A Division KR20250143131A (ko) | 2019-09-16 | 2020-07-24 | 문서 간 지능형 저작 및 처리 보조기 |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| KR20240129242A KR20240129242A (ko) | 2024-08-27 |
| KR102865616B1 true KR102865616B1 (ko) | 2025-09-30 |
Family
ID=74867926
Family Applications (3)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| KR1020247028082A Active KR102865616B1 (ko) | 2019-09-16 | 2020-07-24 | 문서 간 지능형 저작 및 처리 보조기 |
| KR1020257031882A Pending KR20250143131A (ko) | 2019-09-16 | 2020-07-24 | 문서 간 지능형 저작 및 처리 보조기 |
| KR1020227011501A Active KR102699233B1 (ko) | 2019-09-16 | 2020-07-24 | 문서 간 지능형 저작 및 처리 보조기 |
Family Applications After (2)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| KR1020257031882A Pending KR20250143131A (ko) | 2019-09-16 | 2020-07-24 | 문서 간 지능형 저작 및 처리 보조기 |
| KR1020227011501A Active KR102699233B1 (ko) | 2019-09-16 | 2020-07-24 | 문서 간 지능형 저작 및 처리 보조기 |
Country Status (6)
| Country | Link |
|---|---|
| US (7) | US11816428B2 (https=) |
| EP (1) | EP4028961A4 (https=) |
| JP (3) | JP7664262B2 (https=) |
| KR (3) | KR102865616B1 (https=) |
| CN (2) | CN121683697A (https=) |
| CA (1) | CA3150535A1 (https=) |
Families Citing this family (62)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP3460685A1 (en) * | 2017-09-12 | 2019-03-27 | Bricsys NV | Improved semantic classification of an entity in a building information model |
| KR102865616B1 (ko) * | 2019-09-16 | 2025-09-30 | 도큐가미, 인크. | 문서 간 지능형 저작 및 처리 보조기 |
| US11875778B1 (en) * | 2019-11-15 | 2024-01-16 | Yahoo Assets Llc | Systems and methods for voice rendering of machine-generated electronic messages |
| US11763071B2 (en) * | 2020-01-06 | 2023-09-19 | Catachi Co. | Methods and systems for facilitating unifying of multiple regulatory documents |
| US12596924B1 (en) * | 2020-03-16 | 2026-04-07 | Eightfold AI Inc. | System and method for machine-readable electronic document |
| JP2023532669A (ja) * | 2020-06-25 | 2023-07-31 | プリオン インコーポレーテッド | 文書処理および応答生成システム |
| US20220092097A1 (en) * | 2020-09-18 | 2022-03-24 | Anurag Gupta | Method for Extracting and Organizing Information from a Document |
| US11899921B2 (en) * | 2020-09-29 | 2024-02-13 | Google Llc | Scroller interface for transcription navigation |
| US12229208B2 (en) * | 2020-09-30 | 2025-02-18 | Home Depot Product Authority, Llc | Responsive category prediction for user queries |
| US20220156489A1 (en) * | 2020-11-18 | 2022-05-19 | Adobe Inc. | Machine learning techniques for identifying logical sections in unstructured data |
| CN112435651B (zh) * | 2020-11-20 | 2023-05-02 | 昆明学院 | 一种语音数据自动标注的质量评估方法 |
| US11748555B2 (en) * | 2021-01-22 | 2023-09-05 | Bao Tran | Systems and methods for machine content generation |
| US12493744B2 (en) * | 2021-02-09 | 2025-12-09 | Ancestry.Com Operations Inc. | Context-based keyphrase extraction from input text |
| EP4295266A1 (en) | 2021-02-17 | 2023-12-27 | Applica Sp. z.o.o. | Text-image-layout transformer (tilt) |
| US11594054B2 (en) * | 2021-02-19 | 2023-02-28 | Capital One Services, Llc | Document lineage management system |
| US11790568B2 (en) * | 2021-03-29 | 2023-10-17 | Kyndryl, Inc | Image entity extraction and granular interactivity articulation |
| US11521639B1 (en) * | 2021-04-02 | 2022-12-06 | Asapp, Inc. | Speech sentiment analysis using a speech sentiment classifier pretrained with pseudo sentiment labels |
| US12174913B2 (en) * | 2021-04-29 | 2024-12-24 | International Business Machines Corporation | Parameterized neighborhood memory adaptation |
| US12277389B2 (en) * | 2021-05-10 | 2025-04-15 | International Business Machines Corporation | Text mining based on document structure information extraction |
| US11755839B2 (en) * | 2021-05-19 | 2023-09-12 | International Business Machines Corporation | Low resource named entity recognition for sensitive personal information |
| WO2023287952A1 (en) * | 2021-07-14 | 2023-01-19 | Kpmg Llp | System and method for implementing a medical records analytics platform |
| US11763803B1 (en) | 2021-07-28 | 2023-09-19 | Asapp, Inc. | System, method, and computer program for extracting utterances corresponding to a user problem statement in a conversation between a human agent and a user |
| CN113505201A (zh) * | 2021-07-29 | 2021-10-15 | 宁波薄言信息技术有限公司 | 一种基于SegaBert预训练模型的合同抽取方法 |
| CN113722555A (zh) * | 2021-07-29 | 2021-11-30 | 武汉光庭信息技术股份有限公司 | 一种数据标注项质检方法及系统 |
| US20230074189A1 (en) * | 2021-08-19 | 2023-03-09 | Fmr Llc | Methods and systems for intelligent text classification with limited or no training data |
| US11941147B2 (en) | 2021-08-31 | 2024-03-26 | Box, Inc. | Detection of personally identifiable information |
| US12072935B2 (en) | 2021-09-08 | 2024-08-27 | Microsoft Technology Licensing, Llc | Machine-learning of document portion layout |
| US20230102198A1 (en) * | 2021-09-30 | 2023-03-30 | Intuit Inc. | Artificial intelligence based compliance document processing |
| US11657078B2 (en) | 2021-10-14 | 2023-05-23 | Fmr Llc | Automatic identification of document sections to generate a searchable data structure |
| US11361151B1 (en) | 2021-10-18 | 2022-06-14 | BriefCatch LLC | Methods and systems for intelligent editing of legal documents |
| US12153880B2 (en) | 2021-10-18 | 2024-11-26 | BriefCatch LLC | Methods and systems for intelligent editing of legal documents |
| CN116186000A (zh) * | 2021-11-26 | 2023-05-30 | 华为云计算技术有限公司 | 数据治理的方法、装置及存储介质 |
| US12067363B1 (en) | 2022-02-24 | 2024-08-20 | Asapp, Inc. | System, method, and computer program for text sanitization |
| WO2024072483A2 (en) * | 2022-04-12 | 2024-04-04 | The Trustees Of Dartmouth College | Processing architecture for fundamental symbolic logic operations and method for employing the same |
| US12282503B2 (en) * | 2022-04-19 | 2025-04-22 | Microsoft Technology Licensing, Llc | Inline search based on intent-detection |
| US11907643B2 (en) * | 2022-04-29 | 2024-02-20 | Adobe Inc. | Dynamic persona-based document navigation |
| US20230350954A1 (en) * | 2022-05-02 | 2023-11-02 | SparkCognition, Inc. | Systems and methods of filtering topics using parts of speech tagging |
| JP2023166252A (ja) * | 2022-05-09 | 2023-11-21 | キヤノン株式会社 | 情報処理装置、情報処理方法及びプログラム |
| US12333244B2 (en) * | 2022-05-12 | 2025-06-17 | Dell Products L.P. | Automated address data determinations using artificial intelligence techniques |
| US12141208B2 (en) | 2022-05-23 | 2024-11-12 | International Business Machines Corporation | Multi-chunk relationship extraction and maximization of query answer coherence |
| US11853335B1 (en) | 2022-06-13 | 2023-12-26 | International Business Machines Corporation | Cooperative build and content annotation for conversational design of virtual assistants |
| US12205393B2 (en) * | 2022-07-12 | 2025-01-21 | Dell Products L.P. | Automating text and graphics coverage analysis of a website page |
| CN115495580B (zh) * | 2022-09-26 | 2026-02-10 | 中南大学 | 基于量子启发式算法的文本情感分类方法 |
| US12056175B2 (en) * | 2022-09-28 | 2024-08-06 | Atlassian Pty Ltd. | Label management system for an electronic document management service |
| US12079912B2 (en) * | 2022-11-10 | 2024-09-03 | International Business Machines Corporation | Enhancing images in text documents |
| US12026458B2 (en) | 2022-11-11 | 2024-07-02 | State Farm Mutual Automobile Insurance Company | Systems and methods for generating document templates from a mixed set of document types |
| US12124794B2 (en) * | 2022-11-22 | 2024-10-22 | Adobe Inc. | Stylizing digital content |
| EP4589464A4 (en) | 2022-12-02 | 2025-12-24 | Samsung Electronics Co Ltd | METHOD, ELECTRONIC DEVICE AND RECORDING MEDIUM FOR ADJUSTING A DOCUMENT STYLE |
| US20240296187A1 (en) * | 2023-03-02 | 2024-09-05 | Truist Bank | Automated classification of datasets using semantic type indentification |
| US12315051B2 (en) | 2023-03-14 | 2025-05-27 | Adobe Inc. | Reference based digital content stylization |
| US12525047B2 (en) | 2023-04-20 | 2026-01-13 | L&T Technology Services Limited | Method and system of classifying text data in a document |
| WO2025034755A2 (en) * | 2023-08-07 | 2025-02-13 | Trunk Tools, Inc. | Methods and systems for generative question answering for construction project data |
| US20250322167A1 (en) * | 2023-08-09 | 2025-10-16 | Instabase, Inc. | Systems and methods to extract semantic information from documents |
| US12405970B2 (en) | 2023-10-06 | 2025-09-02 | International Business Machines Corporation | Multi-layer approach to improving generation of field extraction models |
| US20250307286A1 (en) * | 2024-03-29 | 2025-10-02 | Microsoft Technology Licensing, Llc | Chunk synthesis for retrieval augmented generation assistants |
| JP2025182482A (ja) * | 2024-06-03 | 2025-12-15 | 株式会社東芝 | 文書処理プログラム、文書処理装置および文書処理方法 |
| CN119005137B (zh) * | 2024-06-28 | 2025-04-04 | 北京安锐卓越信息技术股份有限公司 | 基于icontent架构的一键修改文档错误内容的方法 |
| US20260010574A1 (en) * | 2024-07-03 | 2026-01-08 | Sas Institute Inc. | System and method for compressing prompts to language models for document processing |
| CN118504533B (zh) * | 2024-07-19 | 2024-11-08 | 青岛理工大学 | 一种基于大语言模型的在线文档智能操作系统及操作方法 |
| CN119473084A (zh) * | 2024-10-30 | 2025-02-18 | 北京字跳网络技术有限公司 | 用于模板创建的方法、装置、设备和存储介质 |
| US12602421B1 (en) | 2025-01-23 | 2026-04-14 | Dell Products L.P. | Classifying retrieved context data for a relativistic response |
| US12511925B1 (en) * | 2025-04-01 | 2025-12-30 | Qpiai India Private Limited | System and method for semi-automated dataset annotation using similarity based clustering and in-context learning for segmentation |
Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20070136221A1 (en) | 2005-03-30 | 2007-06-14 | Peter Sweeney | System, Method and Computer Program for Facet Analysis |
| US20120005686A1 (en) | 2010-07-01 | 2012-01-05 | Suju Rajan | Annotating HTML Segments With Functional Labels |
Family Cites Families (105)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| DE69616093D1 (de) | 1996-07-03 | 2001-11-22 | Sopheon N V | System zum unterstützen der produktion von dokumenten |
| US6076051A (en) | 1997-03-07 | 2000-06-13 | Microsoft Corporation | Information retrieval utilizing semantic representation of text |
| US7287219B1 (en) | 1999-03-11 | 2007-10-23 | Abode Systems Incorporated | Method of constructing a document type definition from a set of structured electronic documents |
| US6924828B1 (en) | 1999-04-27 | 2005-08-02 | Surfnotes | Method and apparatus for improved information representation |
| US20020002481A1 (en) | 2000-05-16 | 2002-01-03 | Hirokazu Uchio | Information processing apparatus for management of documents relevant to patent application |
| WO2002017128A1 (en) | 2000-08-24 | 2002-02-28 | Science Applications International Corporation | Word sense disambiguation |
| WO2003012661A1 (en) | 2001-07-31 | 2003-02-13 | Invention Machine Corporation | Computer based summarization of natural language documents |
| US9009590B2 (en) * | 2001-07-31 | 2015-04-14 | Invention Machines Corporation | Semantic processor for recognition of cause-effect relations in natural language documents |
| US20040001099A1 (en) * | 2002-06-27 | 2004-01-01 | Microsoft Corporation | Method and system for associating actions with semantic labels in electronic documents |
| US7523394B2 (en) | 2002-06-28 | 2009-04-21 | Microsoft Corporation | Word-processing document stored in a single XML file that may be manipulated by applications that understand XML |
| US20050027664A1 (en) * | 2003-07-31 | 2005-02-03 | Johnson David E. | Interactive machine learning system for automated annotation of information in text |
| US20050060643A1 (en) * | 2003-08-25 | 2005-03-17 | Miavia, Inc. | Document similarity detection and classification system |
| US20050060140A1 (en) * | 2003-09-15 | 2005-03-17 | Maddox Paul Christopher | Using semantic feature structures for document comparisons |
| US20050108630A1 (en) * | 2003-11-19 | 2005-05-19 | Wasson Mark D. | Extraction of facts from text |
| JP4113145B2 (ja) * | 2004-03-16 | 2008-07-09 | 株式会社東芝 | 文書処理装置及び文書処理方法 |
| US7742911B2 (en) * | 2004-10-12 | 2010-06-22 | At&T Intellectual Property Ii, L.P. | Apparatus and method for spoken language understanding by using semantic role labeling |
| US8719700B2 (en) * | 2010-05-04 | 2014-05-06 | Xerox Corporation | Matching a page layout for each page of a document to a page template candidate from a list of page layout candidates |
| US20060235870A1 (en) | 2005-01-31 | 2006-10-19 | Musgrove Technology Enterprises, Llc | System and method for generating an interlinked taxonomy structure |
| US8249344B2 (en) | 2005-07-01 | 2012-08-21 | Microsoft Corporation | Grammatical parsing of document visual structures |
| JP4521343B2 (ja) * | 2005-09-29 | 2010-08-11 | 株式会社東芝 | 文書処理装置及び文書処理方法 |
| US8176004B2 (en) | 2005-10-24 | 2012-05-08 | Capsilon Corporation | Systems and methods for intelligent paperless document management |
| US20070150802A1 (en) * | 2005-12-12 | 2007-06-28 | Canon Information Systems Research Australia Pty. Ltd. | Document annotation and interface |
| US7788579B2 (en) | 2006-03-06 | 2010-08-31 | Ricoh Co., Ltd. | Automated document layout design |
| US20080008391A1 (en) * | 2006-07-10 | 2008-01-10 | Amir Geva | Method and System for Document Form Recognition |
| US9495358B2 (en) | 2006-10-10 | 2016-11-15 | Abbyy Infopoisk Llc | Cross-language text clustering |
| US8738359B2 (en) | 2006-10-18 | 2014-05-27 | Honda Motor Co., Ltd. | Scalable knowledge extraction |
| US7734623B2 (en) * | 2006-11-07 | 2010-06-08 | Cycorp, Inc. | Semantics-based method and apparatus for document analysis |
| US8671341B1 (en) * | 2007-01-05 | 2014-03-11 | Linguastat, Inc. | Systems and methods for identifying claims associated with electronic text |
| US7778953B2 (en) * | 2007-02-19 | 2010-08-17 | Kabushiki Kaisha Toshiba | Document management apparatus and document management method |
| US8180633B2 (en) | 2007-03-08 | 2012-05-15 | Nec Laboratories America, Inc. | Fast semantic extraction using a neural network architecture |
| US8209278B1 (en) * | 2007-03-23 | 2012-06-26 | Jay Bradley Straus | Computer editing system for common textual patterns in legal documents |
| WO2008132706A1 (en) * | 2007-04-26 | 2008-11-06 | Markport Limited | A web browsing method and system |
| US8527262B2 (en) | 2007-06-22 | 2013-09-03 | International Business Machines Corporation | Systems and methods for automatic semantic role labeling of high morphological text for natural language processing applications |
| US9342551B2 (en) | 2007-08-14 | 2016-05-17 | John Nicholas and Kristin Gross Trust | User based document verifier and method |
| WO2009029923A2 (en) * | 2007-08-31 | 2009-03-05 | Powerset, Inc. | Emphasizing search results according to conceptual meaning |
| US8229730B2 (en) | 2007-08-31 | 2012-07-24 | Microsoft Corporation | Indexing role hierarchies for words in a search index |
| US8280885B2 (en) | 2007-10-29 | 2012-10-02 | Cornell University | System and method for automatically summarizing fine-grained opinions in digital text |
| US8392436B2 (en) | 2008-02-07 | 2013-03-05 | Nec Laboratories America, Inc. | Semantic search via role labeling |
| US8145632B2 (en) | 2008-02-22 | 2012-03-27 | Tigerlogic Corporation | Systems and methods of identifying chunks within multiple documents |
| US8196030B1 (en) * | 2008-06-02 | 2012-06-05 | Pricewaterhousecoopers Llp | System and method for comparing and reviewing documents |
| US8286132B2 (en) | 2008-09-25 | 2012-10-09 | International Business Machines Corporation | Comparing and merging structured documents syntactically and semantically |
| US8214734B2 (en) | 2008-10-09 | 2012-07-03 | International Business Machines Corporation | Credibility of text analysis engine performance evaluation by rating reference content |
| US20100153318A1 (en) * | 2008-11-19 | 2010-06-17 | Massachusetts Institute Of Technology | Methods and systems for automatically summarizing semantic properties from documents with freeform textual annotations |
| US8473467B2 (en) | 2009-01-02 | 2013-06-25 | Apple Inc. | Content profiling to dynamically configure content processing |
| US9262395B1 (en) * | 2009-02-11 | 2016-02-16 | Guangsheng Zhang | System, methods, and data structure for quantitative assessment of symbolic associations |
| US8335754B2 (en) * | 2009-03-06 | 2012-12-18 | Tagged, Inc. | Representing a document using a semantic structure |
| WO2010120925A2 (en) * | 2009-04-15 | 2010-10-21 | Evri Inc. | Search and search optimization using a pattern of a location identifier |
| JP5340847B2 (ja) * | 2009-07-27 | 2013-11-13 | 株式会社日立ソリューションズ | 文書データ処理装置 |
| JP5477635B2 (ja) | 2010-02-15 | 2014-04-23 | ソニー株式会社 | 情報処理装置および方法、並びにプログラム |
| US9760634B1 (en) * | 2010-03-23 | 2017-09-12 | Firstrain, Inc. | Models for classifying documents |
| US9129300B2 (en) * | 2010-04-21 | 2015-09-08 | Yahoo! Inc. | Using external sources for sponsored search AD selection |
| US20150112664A1 (en) * | 2010-12-09 | 2015-04-23 | Rage Frameworks, Inc. | System and method for generating a tractable semantic network for a concept |
| US8818932B2 (en) | 2011-02-14 | 2014-08-26 | Decisive Analytics Corporation | Method and apparatus for creating a predictive model |
| US10303999B2 (en) | 2011-02-22 | 2019-05-28 | Refinitiv Us Organization Llc | Machine learning-based relationship association and related discovery and search engines |
| US8543577B1 (en) * | 2011-03-02 | 2013-09-24 | Google Inc. | Cross-channel clusters of information |
| US8719692B2 (en) | 2011-03-11 | 2014-05-06 | Microsoft Corporation | Validation, rejection, and modification of automatically generated document annotations |
| US20120296637A1 (en) | 2011-05-20 | 2012-11-22 | Smiley Edwin Lee | Method and apparatus for calculating topical categorization of electronic documents in a collection |
| US8606780B2 (en) | 2011-07-08 | 2013-12-10 | Microsoft Corporation | Image re-rank based on image annotations |
| US8488916B2 (en) * | 2011-07-22 | 2013-07-16 | David S Terman | Knowledge acquisition nexus for facilitating concept capture and promoting time on task |
| US9280525B2 (en) * | 2011-09-06 | 2016-03-08 | Go Daddy Operating Company, LLC | Method and apparatus for forming a structured document from unstructured information |
| PL2639749T3 (pl) | 2012-03-15 | 2017-05-31 | Cortical.Io Gmbh | Sposoby, urządzenia i produkty do przetwarzania semantycznego tekstu |
| US9008443B2 (en) * | 2012-06-22 | 2015-04-14 | Xerox Corporation | System and method for identifying regular geometric structures in document pages |
| US20150100877A1 (en) * | 2012-06-29 | 2015-04-09 | Yahoo! Inc. | Method or system for automated extraction of hyper-local events from one or more web pages |
| US9280520B2 (en) | 2012-08-02 | 2016-03-08 | American Express Travel Related Services Company, Inc. | Systems and methods for semantic information retrieval |
| US9582494B2 (en) | 2013-02-22 | 2017-02-28 | Altilia S.R.L. | Object extraction from presentation-oriented documents using a semantic and spatial approach |
| US20140324808A1 (en) | 2013-03-15 | 2014-10-30 | Sumeet Sandhu | Semantic Segmentation and Tagging and Advanced User Interface to Improve Patent Search and Analysis |
| US9922102B2 (en) * | 2013-07-31 | 2018-03-20 | Splunk Inc. | Templates for defining fields in machine data |
| GB2517976A (en) * | 2013-09-09 | 2015-03-11 | Ibm | Business rule management system |
| US9058374B2 (en) | 2013-09-26 | 2015-06-16 | International Business Machines Corporation | Concept driven automatic section identification |
| WO2015048275A2 (en) | 2013-09-26 | 2015-04-02 | Polis Technology Inc. | System and methods for real-time formation of groups and decentralized decision making |
| WO2015070093A1 (en) * | 2013-11-08 | 2015-05-14 | Thomas Fennell | System and method for translating texts |
| US9396763B2 (en) * | 2013-11-15 | 2016-07-19 | Clipmine, Inc. | Computer-assisted collaborative tagging of video content for indexing and table of contents generation |
| US10424016B2 (en) * | 2013-12-19 | 2019-09-24 | International Business Machines Corporation | Modeling asset transfer flow relationships discovered in unstructured data |
| US10055402B2 (en) * | 2014-03-17 | 2018-08-21 | Accenture Global Services Limited | Generating a semantic network based on semantic connections between subject-verb-object units |
| US10140578B1 (en) * | 2014-03-17 | 2018-11-27 | Intuit Inc. | System and method for managing social-based questions and answers |
| US9477654B2 (en) | 2014-04-01 | 2016-10-25 | Microsoft Corporation | Convolutional latent semantic models and their applications |
| US9760626B2 (en) | 2014-09-05 | 2017-09-12 | International Business Machines Corporation | Optimizing parsing outcomes of documents |
| US10325511B2 (en) | 2015-01-30 | 2019-06-18 | Conduent Business Services, Llc | Method and system to attribute metadata to preexisting documents |
| US10733256B2 (en) * | 2015-02-10 | 2020-08-04 | Researchgate Gmbh | Online publication system and method |
| US20160267165A1 (en) * | 2015-03-14 | 2016-09-15 | Hui Wang | Automated Key Words (Phrases) Discovery In Document Stacks And Its Application To Document Classification, Aggregation, and Summarization |
| JP2017004074A (ja) * | 2015-06-05 | 2017-01-05 | 日本電気株式会社 | 関係検出システム、関係検出方法、及び、関係検出プログラム |
| US9940681B2 (en) * | 2015-09-01 | 2018-04-10 | International Business Machines Corporation | Predictive approach to contract management |
| US10504010B2 (en) * | 2015-10-02 | 2019-12-10 | Baidu Usa Llc | Systems and methods for fast novel visual concept learning from sentence descriptions of images |
| US9760556B1 (en) | 2015-12-11 | 2017-09-12 | Palantir Technologies Inc. | Systems and methods for annotating and linking electronic documents |
| WO2018020462A1 (en) | 2016-07-27 | 2018-02-01 | Wix.Com Ltd. | System and method for implementing containers which extract and apply semantic page knowledge |
| US10755804B2 (en) | 2016-08-10 | 2020-08-25 | Talix, Inc. | Health information system for searching, analyzing and annotating patient data |
| CN106295706B (zh) | 2016-08-17 | 2019-04-19 | 山东大学 | 一种基于形状视觉知识库的图像自动分割和语义注释方法 |
| JP2018045664A (ja) | 2016-09-16 | 2018-03-22 | 株式会社リコー | 利用量管理装置、利用量管理方法、利用量管理プログラム、及び、利用量管理システム |
| US20180150768A1 (en) * | 2016-11-30 | 2018-05-31 | Gluru Limited | Automated generation of natural language task/expectation descriptions |
| US10380228B2 (en) * | 2017-02-10 | 2019-08-13 | Microsoft Technology Licensing, Llc | Output generation based on semantic expressions |
| US11416956B2 (en) * | 2017-03-15 | 2022-08-16 | Coupa Software Incorporated | Machine evaluation of contract terms |
| US20180300315A1 (en) * | 2017-04-14 | 2018-10-18 | Novabase Business Solutions, S.A. | Systems and methods for document processing using machine learning |
| US10540440B2 (en) * | 2017-06-05 | 2020-01-21 | International Business Machines Corporation | Relation extraction using Q and A |
| JP7187545B2 (ja) * | 2017-09-28 | 2022-12-12 | オラクル・インターナショナル・コーポレイション | 名前付きエンティティの構文解析および識別に基づくクロスドキュメントの修辞的つながりの判断 |
| EP3462331B1 (en) | 2017-09-29 | 2021-08-04 | Tata Consultancy Services Limited | Automated cognitive processing of source agnostic data |
| US20190102697A1 (en) * | 2017-10-02 | 2019-04-04 | International Business Machines Corporation | Creating machine learning models from structured intelligence databases |
| US10838996B2 (en) | 2018-03-15 | 2020-11-17 | International Business Machines Corporation | Document revision change summarization |
| US10650186B2 (en) | 2018-06-08 | 2020-05-12 | Handycontract, LLC | Device, system and method for displaying sectioned documents |
| US10891316B2 (en) | 2018-07-02 | 2021-01-12 | Salesforce.Com, Inc. | Identifying homogenous clusters |
| CN109582949B (zh) | 2018-09-14 | 2022-11-22 | 创新先进技术有限公司 | 事件元素抽取方法、装置、计算设备及存储介质 |
| US11232132B2 (en) * | 2018-11-30 | 2022-01-25 | Wipro Limited | Method, device, and system for clustering document objects based on information content |
| US20200311123A1 (en) | 2019-03-28 | 2020-10-01 | Wipro Limited | Method and a system for multimodal search key based multimedia content extraction |
| US10614345B1 (en) | 2019-04-12 | 2020-04-07 | Ernst & Young U.S. Llp | Machine learning based extraction of partition objects from electronic documents |
| WO2021055102A1 (en) | 2019-09-16 | 2021-03-25 | Docugami, Inc. | Cross-document intelligent authoring and processing assistant |
| KR102865616B1 (ko) | 2019-09-16 | 2025-09-30 | 도큐가미, 인크. | 문서 간 지능형 저작 및 처리 보조기 |
-
2020
- 2020-07-24 KR KR1020247028082A patent/KR102865616B1/ko active Active
- 2020-07-24 EP EP20864772.7A patent/EP4028961A4/en active Pending
- 2020-07-24 CA CA3150535A patent/CA3150535A1/en active Pending
- 2020-07-24 JP JP2022542307A patent/JP7664262B2/ja active Active
- 2020-07-24 KR KR1020257031882A patent/KR20250143131A/ko active Pending
- 2020-07-24 CN CN202511836092.1A patent/CN121683697A/zh active Pending
- 2020-07-24 CN CN202080064610.1A patent/CN114616572B/zh active Active
- 2020-07-24 KR KR1020227011501A patent/KR102699233B1/ko active Active
- 2020-08-05 US US16/986,139 patent/US11816428B2/en active Active
- 2020-08-05 US US16/986,136 patent/US11392763B2/en active Active
- 2020-08-05 US US16/986,146 patent/US11507740B2/en active Active
- 2020-08-05 US US16/986,142 patent/US11514238B2/en active Active
- 2020-08-05 US US16/986,151 patent/US11822880B2/en active Active
-
2022
- 2022-04-20 US US17/724,934 patent/US11960832B2/en active Active
-
2024
- 2024-03-19 US US18/609,740 patent/US20240232518A1/en active Pending
- 2024-12-02 JP JP2024209611A patent/JP7758836B2/ja active Active
-
2025
- 2025-10-09 JP JP2025171167A patent/JP7842294B2/ja active Active
Patent Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20070136221A1 (en) | 2005-03-30 | 2007-06-14 | Peter Sweeney | System, Method and Computer Program for Facet Analysis |
| US20120005686A1 (en) | 2010-07-01 | 2012-01-05 | Suju Rajan | Annotating HTML Segments With Functional Labels |
Also Published As
| Publication number | Publication date |
|---|---|
| CN114616572A (zh) | 2022-06-10 |
| JP2025023185A (ja) | 2025-02-14 |
| US20210081602A1 (en) | 2021-03-18 |
| US20210081608A1 (en) | 2021-03-18 |
| JP2022547750A (ja) | 2022-11-15 |
| US11822880B2 (en) | 2023-11-21 |
| US11507740B2 (en) | 2022-11-22 |
| US20210081601A1 (en) | 2021-03-18 |
| EP4028961A4 (en) | 2023-10-18 |
| US11960832B2 (en) | 2024-04-16 |
| KR102699233B1 (ko) | 2024-08-27 |
| KR20240129242A (ko) | 2024-08-27 |
| US11816428B2 (en) | 2023-11-14 |
| KR20220059526A (ko) | 2022-05-10 |
| JP2025188195A (ja) | 2025-12-25 |
| JP7664262B2 (ja) | 2025-04-17 |
| US11514238B2 (en) | 2022-11-29 |
| CN114616572B (zh) | 2026-01-02 |
| CN121683697A (zh) | 2026-03-17 |
| KR20250143131A (ko) | 2025-09-30 |
| US20210081613A1 (en) | 2021-03-18 |
| JP7758836B2 (ja) | 2025-10-22 |
| US20210081411A1 (en) | 2021-03-18 |
| CA3150535A1 (en) | 2021-03-25 |
| US11392763B2 (en) | 2022-07-19 |
| US20220245335A1 (en) | 2022-08-04 |
| US20240232518A1 (en) | 2024-07-11 |
| EP4028961A1 (en) | 2022-07-20 |
| JP7842294B2 (ja) | 2026-04-07 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| KR102865616B1 (ko) | 문서 간 지능형 저작 및 처리 보조기 | |
| WO2021055102A1 (en) | Cross-document intelligent authoring and processing assistant | |
| US12210839B1 (en) | Multilevel data analysis | |
| US20190006027A1 (en) | Automatic identification and extraction of medical conditions and evidences from electronic health records | |
| WO2014093935A1 (en) | Vital text analytics system for the enhancement of requirements engineering documents and other documents | |
| US20230342383A1 (en) | Method and system for managing workflows for authoring data documents | |
| Essa et al. | Enhanced model for abstractive Arabic text summarization using natural language generation and named entity recognition | |
| RU61442U1 (ru) | Система автоматизированного упорядочения неструктурированного информационного потока входных данных | |
| Dubey et al. | Smart patient records using NLP and blockchain | |
| Gessler et al. | Midas loop: A prioritized human-in-the-loop annotation for large scale multilayer data | |
| Dahlberg et al. | Character Navigator: Automated Summarization of Characters in E-Books | |
| Hanafi | Human-in-the-loop Tools for Constructing and Debugging Data Extraction Pipelines | |
| Hao et al. | A user-oriented semantic annotation approach to knowledge acquisition and conversion | |
| Tan | Significant Revision Identification between Revised Texts in a Multi-Author Environment. | |
| Milosevic et al. | Table mining and data curation from biomedical literature | |
| CN120596093A (zh) | 数据可视化配置方法、装置、设备及介质 | |
| CN118708697A (zh) | 药品数据处理方法、装置、计算机设备和可读存储介质 | |
| Dawson et al. | The Role of Unstructured Data in Healthcare Analytics | |
| Moser | Linguistic aspects of software documentation corpora |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| A107 | Divisional application of patent | ||
| A302 | Request for accelerated examination | ||
| PA0104 | Divisional application for international application |
St.27 status event code: A-0-1-A10-A18-div-PA0104 St.27 status event code: A-0-1-A10-A16-div-PA0104 |
|
| PA0201 | Request for examination |
St.27 status event code: A-1-2-D10-D11-exm-PA0201 |
|
| PA0302 | Request for accelerated examination |
St.27 status event code: A-1-2-D10-D17-exm-PA0302 St.27 status event code: A-1-2-D10-D16-exm-PA0302 |
|
| PG1501 | Laying open of application |
St.27 status event code: A-1-1-Q10-Q12-nap-PG1501 |
|
| E902 | Notification of reason for refusal | ||
| PE0902 | Notice of grounds for rejection |
St.27 status event code: A-1-2-D10-D21-exm-PE0902 |
|
| P11-X000 | Amendment of application requested |
St.27 status event code: A-2-2-P10-P11-nap-X000 |
|
| E701 | Decision to grant or registration of patent right | ||
| PE0701 | Decision of registration |
St.27 status event code: A-1-2-D10-D22-exm-PE0701 |
|
| A16 | Divisional, continuation or continuation in part application filed |
Free format text: ST27 STATUS EVENT CODE: A-0-1-A10-A16-DIV-PA0104 (AS PROVIDED BY THE NATIONAL OFFICE) |
|
| A18 | Application divided or continuation or continuation in part accepted |
Free format text: ST27 STATUS EVENT CODE: A-0-1-A10-A18-DIV-PA0104 (AS PROVIDED BY THE NATIONAL OFFICE) |
|
| PA0104 | Divisional application for international application |
St.27 status event code: A-0-1-A10-A18-div-PA0104 St.27 status event code: A-0-1-A10-A16-div-PA0104 |
|
| F11 | Ip right granted following substantive examination |
Free format text: ST27 STATUS EVENT CODE: A-2-4-F10-F11-EXM-PR0701 (AS PROVIDED BY THE NATIONAL OFFICE) |
|
| PR0701 | Registration of establishment |
St.27 status event code: A-2-4-F10-F11-exm-PR0701 |
|
| PR1002 | Payment of registration fee |
St.27 status event code: A-2-2-U10-U12-oth-PR1002 Fee payment year number: 1 |
|
| U12 | Designation fee paid |
Free format text: ST27 STATUS EVENT CODE: A-2-2-U10-U12-OTH-PR1002 (AS PROVIDED BY THE NATIONAL OFFICE) Year of fee payment: 1 |
|
| PG1601 | Publication of registration |
St.27 status event code: A-4-4-Q10-Q13-nap-PG1601 |
|
| Q13 | Ip right document published |
Free format text: ST27 STATUS EVENT CODE: A-4-4-Q10-Q13-NAP-PG1601 (AS PROVIDED BY THE NATIONAL OFFICE) |