WO2022104458A1 - Procédé et système de gestion de contenu dans un document et publication de celui-ci - Google Patents

Procédé et système de gestion de contenu dans un document et publication de celui-ci Download PDF

Info

Publication number
WO2022104458A1
WO2022104458A1 PCT/CA2021/051623 CA2021051623W WO2022104458A1 WO 2022104458 A1 WO2022104458 A1 WO 2022104458A1 CA 2021051623 W CA2021051623 W CA 2021051623W WO 2022104458 A1 WO2022104458 A1 WO 2022104458A1
Authority
WO
WIPO (PCT)
Prior art keywords
document
content
computer
areas
corresponds
Prior art date
Application number
PCT/CA2021/051623
Other languages
English (en)
Inventor
Michael Zhou
Simon Tang
Zhexin JIANG
Yingchao JIANG
Original Assignee
Writeway Ltd.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Writeway Ltd. filed Critical Writeway Ltd.
Publication of WO2022104458A1 publication Critical patent/WO2022104458A1/fr

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/958Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/93Document management systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/103Formatting, i.e. changing of presentation of documents
    • G06F40/106Display of layout of documents; Previewing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/103Formatting, i.e. changing of presentation of documents
    • G06F40/117Tagging; Marking up; Designating a block; Setting of attributes
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/12Use of codes for handling textual entities
    • G06F40/14Tree-structured documents
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis

Definitions

  • Fig. 4 illustrates how a particular Card may be provided with one or more Card Areas, into which relevant content may be placed/organized.
  • the functions described may be implemented in hardware, software, firmware, or any combination thereof. If implemented in software, the functions may be stored as one or more instructions or code on a computer-readable medium. The steps of a method or algorithm disclosed herein may be embodied in a processor-executable software module which may reside on a tangible, non-transitory computer-readable storage medium. Tangible, non-transitory computer-readable storage media may be any available media that may be accessed by a computer.
  • Image Object consists of an image resource (e.g. a JPG or a link to a URL location where the JPG is stored) and other information such as a caption. Captions are made up of Paragraph Objects.
  • the Conclusion Card for a research paper document might contain a Paragraph Object specifying what the conclusion of that research paper is.
  • the disclosed system is able to determine that such content is a conclusion. Then, depending on various considerations such as what document format the document is being output as, what type of device it is being published to, what digital platform it is being published on, what type of publication it is intended to be, etc., the appropriate style, layout or format can be applied to such conclusion content.
  • the semantic structure applied to a document may mean that the word processing and document publishing system is not entirely oblivious to the nature of the content making up a document; in some cases, it may be able to infer some understanding of the nature of some sections of the document, which information can be utilized in other processing of the document (such as during publication).
  • Project This feature will allow users to create a specific project of their work. This is captured in the Project Object Model (POM). Each project consists of a document and a corresponding Idea Board containing various Objects. Each document consists of a header, footer, and body. One or more Cards can be placed into the body.
  • POM Project Object Model
  • Output (Rendering): A brief discussion on rendering (referred to herein as the “Output” function) is warranted.
  • Output is the process of generating an output of a document to the desired medium. In the present system, it is contemplated that the overall Output process would comprise the following phases, namely: Pre-rendering; Rendering; Post-rendering; and Export. These are illustrated in Fig. 10.
  • POM Project Object Model
  • Pre-rendering 106 is the process that replaces any placeholder elements in the POM with actual data. For example, generating a table of contents or references. This phase relies on the plugin definitions to execute the appropriate code to perform the processing.
  • This in turn is the input into the Export phase 112.
  • the system converts the HTML+ and CSS into the desired output format, such as a PDF.
  • the output medium at this stage could be a file, (such as a PDF) or integrated with online platforms.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Data Mining & Analysis (AREA)
  • Business, Economics & Management (AREA)
  • General Business, Economics & Management (AREA)
  • Document Processing Apparatus (AREA)

Abstract

La présente invention concerne un procédé et un système mis en œuvre par ordinateur permettant de gérer un contenu pour un document électronique, déconstruit en un document structuré. Le document structuré est décomposé en zones de document, une pluralité de cartes attribuées à des zones de document et chaque carte a également un ou plusieurs objets d'un type d'objet pris en charge correspondant à un type de contenu, dans lequel le contenu constitutif est désigné pour chacun des éléments de la pluralité de cartes et de la pluralité d'objets, ce qui permet d'appliquer une structure sémantique au contenu constitutif du document électronique, ce qui permet au système de déduire un certain degré de compréhension du contenu constitutif du document, qui peut être utilisé par le système pendant la publication du document.
PCT/CA2021/051623 2020-11-17 2021-11-16 Procédé et système de gestion de contenu dans un document et publication de celui-ci WO2022104458A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US202063114682P 2020-11-17 2020-11-17
US63/114,682 2020-11-17

Publications (1)

Publication Number Publication Date
WO2022104458A1 true WO2022104458A1 (fr) 2022-05-27

Family

ID=81707895

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CA2021/051623 WO2022104458A1 (fr) 2020-11-17 2021-11-16 Procédé et système de gestion de contenu dans un document et publication de celui-ci

Country Status (1)

Country Link
WO (1) WO2022104458A1 (fr)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5553216A (en) * 1993-02-26 1996-09-03 Fujitsu Limited Structured database system together with structure definition frame storing document body data
US20010018697A1 (en) * 2000-01-25 2001-08-30 Fuji Xerox Co., Ltd. Structured document processing system and structured document processing method
US20060075337A1 (en) * 2004-09-30 2006-04-06 Microsoft Corporation Method, system, and computer-readable medium for creating, inserting, and reusing document parts in an electronic document
US20160357711A1 (en) * 2015-06-07 2016-12-08 Apple Inc. Article Authoring, Distribution & Rendering Architecture

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5553216A (en) * 1993-02-26 1996-09-03 Fujitsu Limited Structured database system together with structure definition frame storing document body data
US20010018697A1 (en) * 2000-01-25 2001-08-30 Fuji Xerox Co., Ltd. Structured document processing system and structured document processing method
US20060075337A1 (en) * 2004-09-30 2006-04-06 Microsoft Corporation Method, system, and computer-readable medium for creating, inserting, and reusing document parts in an electronic document
US20160357711A1 (en) * 2015-06-07 2016-12-08 Apple Inc. Article Authoring, Distribution & Rendering Architecture

Similar Documents

Publication Publication Date Title
Edhlund et al. NVivo 12 essentials
US10372810B2 (en) Smarter copy/paste
KR101150132B1 (ko) 시작 템플릿과 목표 템플릿 사이의 콘텐츠 맵핑을 위한방법 및 시스템
US11216253B2 (en) Application prototyping tool
Edhlund et al. Nvivo 11 essentials
US9870484B2 (en) Document redaction
Turco et al. Edition visualization technology: A simple tool to visualize TEI-based digital editions
US20130191728A1 (en) Systems, methods, and media for generating electronic books
Haaf et al. The dta “base format”: A tei subset for the compilation of a large reference corpus of printed text from multiple sources
US20060136811A1 (en) Method and computer-readable medium for generating a multiple column layout
US11238215B2 (en) Systems and methods for generating social assets from electronic publications
KR20150095663A (ko) E-리더에서의 플랫북에서 리치북으로의 변환 기법
Kottwitz LaTeX Beginner's Guide: Create visually appealing texts, articles, and books for business and science using LaTeX
US11775733B2 (en) Device dependent rendering of PDF content including multiple articles and a table of contents
KR101498533B1 (ko) 컴포넌트 분리 표시 기반의 전자 문서 출력 장치 및 방법
US9594737B2 (en) Natural language-aided hypertext document authoring
WO2022104458A1 (fr) Procédé et système de gestion de contenu dans un document et publication de celui-ci
CN106033348A (zh) 一种制作网页的方法、装置及电子设备
Strobbe et al. An accessibility checker for libreoffice and openoffice. org writer
US11842141B2 (en) Device dependent rendering of PDF content
Lambert et al. MOS 2016 Study Guide for Microsoft Word
Mrva-Montoya Editing skills in the era of digital [r] evolution
Schmidt Graphical editor for manuscripts
Lambert MOS Study Guide for Microsoft Word Exam MO-100
Leporini et al. Using InDesign Tool to Develop an Accessible Interactive EPUB 3: A Case Study.

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21893171

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 21893171

Country of ref document: EP

Kind code of ref document: A1