WO2011069903A1 - Procede et systeme d'edition en direct de documents electroniques - Google Patents

Procede et systeme d'edition en direct de documents electroniques Download PDF

Info

Publication number
WO2011069903A1
WO2011069903A1 PCT/EP2010/068825 EP2010068825W WO2011069903A1 WO 2011069903 A1 WO2011069903 A1 WO 2011069903A1 EP 2010068825 W EP2010068825 W EP 2010068825W WO 2011069903 A1 WO2011069903 A1 WO 2011069903A1
Authority
WO
WIPO (PCT)
Prior art keywords
electronic document
application
format
file format
actions
Prior art date
Application number
PCT/EP2010/068825
Other languages
English (en)
Inventor
Zhi Bo Zuo
Xin Rang Wang
Original Assignee
International Business Machines Corporation
Ibm United Kingdom Limited
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by International Business Machines Corporation, Ibm United Kingdom Limited filed Critical International Business Machines Corporation
Publication of WO2011069903A1 publication Critical patent/WO2011069903A1/fr

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/11File system administration, e.g. details of archiving or snapshots
    • G06F16/116Details of conversion of file system types or formats
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/166Editing, e.g. inserting or deleting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/197Version control

Definitions

  • the present invention relates to the field of electronic document processing.
  • the present invention relates to a method and system for on-line editing electronic documents.
  • Electronic document editing software may be, for example, Microsoft® Office®, Open Office®, Lotus® Symphony®, Lotus® Notes®, etc., and support varieties of formats.
  • Such editing software enables creation and saving of an electronic document as well as editing operations such as addition, deletion, and modification to the content of the electronic document. These editing operations are typically called local edition to the electronic document.
  • an electronic document may be edited by various kinds of browsers, which, for example, may be Internet Explorer®, Mozilla® Firefox®, Apple® Safari®, etc.
  • document format conversion is first performed to the electronic document with a format converter, i.e., converting an electronic document created by electronic document editing software from its corresponding file format, for example, doc, odf, etc., into a file format identifiable to a browser, for example, html, xhtml, and xml, etc.
  • a user uses a browser to create and save the electronic document, and directly perform editing operations such as addition, deletion, and modification to the document on the browser.
  • editing operations to an electronic document with a browser are generally editing operations to the electronic document in a web manner over the Internet, they are also called electronic document on-line editing.
  • document format conversion is performed to the electronic document again with the format converter, i.e., converting the electronic document edited through a browser from a browser identifiable file format, for example, html, xhtml, xml, etc., into a rich document format, for example, doc, odf, etc.
  • a method for on-line editing an electronic document comprising: converting the electronic document from a first application file format into a file format identifiable to a second application so as to form a converted electronic document, wherein format information of the first application file format unidentifiable to the second application in the electronic document is retained;
  • converting an electronic document from a first application file format into a file format identifiable to a second application further comprises retaining format information of the first application file format unidentifiable to the second application in the electronic document.
  • the first application performs a sequence of actions to update the changes to the electronic document to the original electronic document, wherein the first application uses the retained format information to update the changes to the electronic document to the original electronic document.
  • a system for on-line editing an electronic document comprising: a format converter configured to convert the electronic document from a first application file format into a file format identifiable to a second application so as to form a converted electronic document, wherein format information of the first application file format unidentifiable to the second application in the electronic document is retained, and to send the converted electronic document to the second application; a document change comparator configured to receive an electronic document edited by the second application, to compare the electronic document edited by the second application with the converted electronic document, and to determine changes to the electronic document based on the above comparison; and an action sequence generator configured to generate a sequence of actions identifiable to the first application based on the changes to the electronic document.
  • converting an electronic document from a first application file format into a file format identifiable to a second application comprises retaining format information of the first application file format unidentifiable to the second application in the electronic document.
  • the first application performs a series of actions to update the changes to the electronic document to the original electronic document, wherein the first application uses the retained format information to update the changes to the electronic document to the original electronic document.
  • the format information of the on-line edited electronic document may be retained and updated, thereby solving the problem of format information loss in the prior art method.
  • Fig. 1 shows a method 100 for on-line editing an electronic apparatus according to an embodiment of the present invention
  • Figs. 2a-2c schematically show display results of an electronic document in electronic document editing software and in a browser
  • Figs. 3a and 3b schematically show DOM trees of a converted electronic document and of an on-line edited electronic document
  • Fig. 4 shows a system 400 for on-line editing an electronic document according to another embodiment of the present invention. DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS
  • a preferred embodiment of the present invention uses operation actions as a bridge for communicating the two file formats, and as long as the operation actions are identical, formats will not be lost.
  • format information unidentifiable to the browser is retained during the format conversion process, changes to the electronic document made by the on-line edition are recorded using the operation actions supported by the electronic document editing software, and then these operation actions are performed by the electronic document editing software. Thus, format information of the on-line edited electronic document will not be lost.
  • Fig. 1 shows a method 100 for on-line editing an electronic apparatus according to an embodiment of the present invention.
  • the method 100 of Fig. 1 starts from step 102. Then, the method proceeds to step 104, where the electronic document is converted from a first application file format into a file format identifiable to a second application so as to form a converted electronic document; next, the method proceeds to step 106, where the converted electronic document is sent to the second application; and next, the method proceeds to step 108, where the electronic document edited by the second application is received; and then, the method proceeds to step 110, where the electronic document edited by the second application is compared with the converted electronic document; afterwards, at step 112, changes to the electronic document are determined based on the above comparison; and then, at step 114, a sequence of actions identifiable to the first application is generated based on the changes to the electronic document. Afterwards, the method proceeds to ending at step 116.
  • converting an electronic document from a first application file format into a file format identifiable to a second application comprises retaining format information of the first application file format unidentifiable to the second application in the electronic document.
  • the generated sequence of actions, format information of the first application file format unidentifiable to the second application in the electronic document, and the electronic document edited by the second application are stored together; or the generated sequence of actions, format information of the first application file format unidentifiable to the second application in the electronic document, and the converted electronic document are stored together.
  • the first application performs a series of actions to update the changes to the electronic document to the original electronic document, wherein the first application uses the retained format information to update the changes to the electronic document to the original electronic document.
  • embodiments of the present invention can employ a sequence of actions identifiable to the electronic document editing software to record changes to the electronic document by the browser, execution of the sequence of actions by the electronic document editing software enables the changes to the electronic document by the browser to be updated to the electronic document of the file format corresponding to the electronic document editing software.
  • the format information of the file format corresponding to the electronic document editing software which is unidentifiable to the browser in the original electronic document is retained, by performing the sequence of actions, the format information of the file format corresponding to the electronic document editing software unidentifiable to the browser in the original electronic document is also updated while the changes to the electronic document by the browser may be updated to the electronic document of the file format corresponding to the electronic document editing software. Therefore, the format information of the file format corresponding to the electronic document editing software unidentifiable to the browser in an on-line edited electronic document will not be lost but may be updated.
  • User I uses Microsoft® Word® to create an electronic document A.
  • User II who has no electronic document editing software wishes to use a browser to perform on-line edition to the electronic document A.
  • User II uses a browser to open the electronic document A.
  • the following operations are performed in the background of User II: performing format conversion by a format converter to the electronic document A, i.e., converting the electronic document A from a doc format into, for example, an html format, such that the electronic document A is enabled to be identified by the browser of User II and to be on-line edited.
  • the electronic document A may be divided into two parts: Part I, wherein the format information in this part of the electronic document may be identified by the doc format and html format and may be freely switched between those two file formats; Part II, wherein the format information in this part of the electronic document can only be identified by the doc format, instead of the html format.
  • Part I wherein the format information in this part of the electronic document may be identified by the doc format and html format and may be freely switched between those two file formats
  • Part II wherein the format information in this part of the electronic document can only be identified by the doc format, instead of the html format.
  • the format converter converts the electronic document A from a doc format into an html format
  • the format information in the doc format which cannot be identified by the browser will be lost, i.e., the format information in Part II will be lost, for example, such information is word flickering, box shadow, etc.
  • the format information unidentifiable to the browser in Part II will be lost.
  • the format converter will convert the electronic document A from a doc format into an html format, and at this point, only Part I of the electronic document A is converted, and the obtained converted electronic document is recorded as electronic document B.
  • the electronic document B comprises Part I only.
  • the format information of the doc format unidentifiable to the html format in the electronic document A is retained, i.e., the format information in Part II.
  • User II performs on-line edition to the electronic document B with a browser, which edition may comprise operations such as addition, deletion or change to the content of the electronic document B and to the format information of the electronic document B.
  • the on-line edited electronic document B is recorded as an electronic document B 1.
  • the changes to the electronic document B by User II are recorded as Part III.
  • the electronic document edited by the browser is received, i.e., electric document Bl, and the electronic document Bl edited by the browser is compared with the converted electronic document B, i.e., determining changes to the electronic document by comparison, i.e., Part III.
  • changes to the electronic document are determined by comparing the DOM tree Dl of the electronic document Bl with the DOM tree D of the electronic document B. According to the sequence of DOM tree, starting from the root node, all contents and attributes of respective nodes are compared to determine the differences between Dl and D2, and the determined differences are changes to the electronic documents, i.e., Part III.
  • the determined differences may be addition or deletion of nodes or changes to node attributes; or may be addition, deletion or change of node content.
  • Part III is generated through a series of operations by User II on the browser, and if same operations are performed on the electronic document editing software (Microsoft® Word® here), a completely identical Part III will be generated.
  • Part III is converted into a series of actions (i.e., a sequence of actions) identifiable to the electronic document editing software, and then, the sequence of actions is performed by the electronic document editing software, so Part III will be generated on the electronic document editing software.
  • the above sequence of actions is macro, which is implemented by VBA (Visual Basic® for Applications) in Microsoft® Word®.
  • changes to the electronic documents are updated to the electronic document A of the file format corresponding to the electronic document editing software.
  • the above processed electronic document A will include Part I, Part II, and Part III. Namely, the format information in the original electronic document A will not be lost.
  • a document of a rich document format the electronic document having a table with one row and one column, the table having characters "one cell,” wherein the characters "one cell” have character flickering effect.
  • This electronic document is stored using a certain document format, for example, doc format or odf format. Display of this electronic document is shown in Fig. 2a (the word flickering effect is indicated by the content in parentheses).
  • the electronic document is converted into the html format by a format converter as follows:
  • the above electronic document B comprises a table, which table only comprises a table cell with characters "one cell" only.
  • User II on-line edits the converted electronic document B with a browser.
  • User II performs the following edition to the electronic document B:
  • An electronic document Bl is formed from the electronic document B on-line edited by User II, display of which in the browser is shown in Fig. 2c.
  • the electronic document B and the electronic document Bl are compared to determine changes to the electronic documents. This comparison may be performed by, for example, generating DOM trees.
  • the DOM trees of the electronic document B and electronic document Bl are shown in Figs. 3a and 3b.
  • the DOM tree of the electronic document B comprises 3 nodes, wherein the root node indicates table 1, i.e., tbl 1, which has a leaf node indicating a first column "col 1" of the table 1, the leaf node further comprising a leaf node indicating a table cell "cell 1" of the first column "col 1".
  • the table cell "cell 1" has attributes such as content, font, color, etc., which are not shown here.
  • the DOM tree of the electronic document Bl comprises 6 nodes.
  • the root node comprises two leaf nodes, respectively indicating a first column “col 1" and a second column “col 2" of the table 1, the second column “col 2” further comprising a leaf node indicating a table cell “cell 2" of the second column “col 2.”
  • the table cell “cell 2” also has attributes such as content, font, color, etc., which are not shown here.
  • deep grey color is used to indicate changes in DOM trees, wherein "col 2" and “cell 2" are added nodes, content is added to "cell 1," and font is changed (not shown), and content is added to "cell 2" (not shown).
  • the generated sequence of actions, the format information of the doc format unidentifiable to the browser in the electronic document, and the electric document edited by the browser are stored together, or the generated sequence of actions, the format information of the doc format unidentifiable to the browser in the electronic document, and the converted electronic document are stored together.
  • the electronic document editing software performs the sequence of actions to update changes to the electronic document to the original electronic document, wherein the electronic document editing software updates the changes to the electronic document to the original electronic document with the retained format information.
  • the system 400 for on-line editing an electronic document comprises: a format converter 402 configured to convert the electronic document from a first application file format into a file format identifiable to a second application so as to form a converted electronic document, wherein format information of the first application file format unidentifiable to the second application in the electronic document is retained, and to send the converted electronic document to the second application; a document change comparator 404 configured to receive an electronic document edited by the second application, to compare the electronic document edited by the second application with the converted electronic document, and to determine changes to the electronic document based on the above comparison; and an action sequence generator 406 configured to generate a sequence of actions identifiable to the first application based on the changes to the electronic document.
  • sequence of actions generated by an action generator, format information of the first application file format unidentifiable to the second application in the electronic document, and the electronic document edited by the second application are stored together, or the generated sequence of actions, format information of the first application file format unidentifiable to the second application in the electronic document, and the converted electronic document are stored together.
  • the first application performs a series of actions to update the changes to the electronic document to the original electronic document, wherein the first application uses the retained format information to update the changes to the electronic document to the original electronic document.
  • the second application is a browser
  • the file format identifiable to the second application is a markup language format, where comparing by the document change comparator the electronic document edited by the second application with the converted electronic document is implemented by comparing the structures of DOM trees.
  • the action sequence generator generates a sequence of actions identifiable to the first application based on changes to the DOM tree structures.
  • the objective of the present invention may also be implemented by running a program or a group of programs on any computing apparatus.
  • the computing apparatus may be a known, universal apparatus.
  • the objective of the present invention may be implemented merely by a program product including program code implementing the method or apparatus.
  • a program product also constitutes the present invention
  • a storage medium storing such a program product also constitutes the present invention.
  • the storage medium may be any known storage medium or any storage medium developed in future.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Document Processing Apparatus (AREA)

Abstract

L'invention concerne un procédé d'édition en direct d'un document électronique, le procédé consistant: à convertir le document électronique d'un format de fichier d'une première application en un format de fichier identifiable d'une seconde application pour former un document électronique converti; à envoyer le document électronique converti vers la seconde application; à recevoir le document électronique édité par la seconde application; à comparer le document électronique édité par la seconde application avec le document électronique converti; à déterminer les changements apportés au document électronique sur la base de la comparaison; et à générer une suite d'actions identifiables à la première application sur la base des changements apportés au document électronique. En enregistrant les changements apportés au document électronique avec les actions, on peut retenir et actualiser les informations du format édité comprenant les informations de format riche, ce qui permet de résoudre le problème de perte d'informations de format lié au procédé actuel.
PCT/EP2010/068825 2009-12-10 2010-12-03 Procede et systeme d'edition en direct de documents electroniques WO2011069903A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN2009102583390A CN102096661A (zh) 2009-12-10 2009-12-10 用于对电子文档进行在线编辑的方法和系统
CN200910258339.0 2009-12-10

Publications (1)

Publication Number Publication Date
WO2011069903A1 true WO2011069903A1 (fr) 2011-06-16

Family

ID=43733923

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/EP2010/068825 WO2011069903A1 (fr) 2009-12-10 2010-12-03 Procede et systeme d'edition en direct de documents electroniques

Country Status (2)

Country Link
CN (1) CN102096661A (fr)
WO (1) WO2011069903A1 (fr)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111967226A (zh) * 2020-06-29 2020-11-20 山东浪潮通软信息科技有限公司 一种文档格式转换方法、设备及介质
CN112541322A (zh) * 2020-12-16 2021-03-23 山东师范大学 一种基于Word文档的伪代码自动格式化方法及系统

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104737158B (zh) * 2013-03-13 2018-03-13 华为技术有限公司 网络数据处理方法及装置、终端
JP6308169B2 (ja) * 2015-05-20 2018-04-11 コニカミノルタ株式会社 文書変換プログラム及び文書変換方法
CN110018984A (zh) * 2017-10-31 2019-07-16 北京国双科技有限公司 一种文件格式的转换方法及装置
CN109634588B (zh) * 2018-12-06 2022-04-08 安徽科大国创软件科技有限公司 一种兼容多浏览器的Office文档在线编辑方法和系统
CN110287462B (zh) * 2019-07-01 2023-03-28 上海一者信息科技有限公司 一种基于浏览器的跟踪修订方法及系统
CN110674082B (zh) * 2019-09-24 2023-03-14 北京字节跳动网络技术有限公司 移除在线文档的方法、装置、电子设备及计算机可读介质
CN113361239B (zh) * 2021-06-01 2024-06-25 北京百度网讯科技有限公司 文档测试方法及其装置、电子设备以及存储介质
CN114492334A (zh) * 2022-01-28 2022-05-13 北京索为系统技术股份有限公司 基于文档编辑器的需求条目在线编辑方法及装置

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030172113A1 (en) * 2002-03-05 2003-09-11 Cameron Brian A. Synchronization of documents between a server and small devices
US20040230892A1 (en) * 2003-03-17 2004-11-18 Libac Corporation Systems and methods for document project management
US20040268235A1 (en) * 2003-06-26 2004-12-30 International Business Machines Corporation Rich text handling for a web application

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030172113A1 (en) * 2002-03-05 2003-09-11 Cameron Brian A. Synchronization of documents between a server and small devices
US20040230892A1 (en) * 2003-03-17 2004-11-18 Libac Corporation Systems and methods for document project management
US20040268235A1 (en) * 2003-06-26 2004-12-30 International Business Machines Corporation Rich text handling for a web application

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
JACOB J ET AL: "CX-DIFF: a change detection algorithm for XML content and change visualization for WebVigiL", DATA & KNOWLEDGE ENGINEERING, ELSEVIER BV, NL, vol. 52, no. 2, 1 February 2005 (2005-02-01), pages 209 - 230, XP025296541, ISSN: 0169-023X, [retrieved on 20050201], DOI: DOI:10.1016/S0169-023X(04)00102-8 *
YUAN WANG ET AL: "X-Diff: an effective change detection algorithm for XML documents", PROCEEDINGS 19TH. INTERNATIONAL CONFERENCE ON DATA ENGINEERING. (ICDE'2003). BANGALORE, INDIA, MARCH 5 - 8, 2003; [INTERNATIONAL CONFERENCE ON DATA ENGINEERING. (ICDE)], NEW YORK, NY : IEEE, US, vol. CONF. 19, 5 March 2003 (2003-03-05), pages 519 - 530, XP010678765, ISBN: 978-0-7803-7665-6 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111967226A (zh) * 2020-06-29 2020-11-20 山东浪潮通软信息科技有限公司 一种文档格式转换方法、设备及介质
CN112541322A (zh) * 2020-12-16 2021-03-23 山东师范大学 一种基于Word文档的伪代码自动格式化方法及系统

Also Published As

Publication number Publication date
CN102096661A (zh) 2011-06-15

Similar Documents

Publication Publication Date Title
WO2011069903A1 (fr) Procede et systeme d'edition en direct de documents electroniques
US10534830B2 (en) Dynamically updating a running page
US20130227397A1 (en) Forming an instrumented text source document for generating a live web page
US7197510B2 (en) Method, system and program for generating structure pattern candidates
CN104461484B (zh) 前端模板的实现方法和装置
US10540416B2 (en) Linking source code to running element
KR101774257B1 (ko) 객체의 서식을 유지하는 문서 편집 장치 및 그의 동작 방법
CN101763255B (zh) 一种专用界面工具的格式转换方法及装置
CN110333863A (zh) 一种生成、显示小程序页面的方法及装置
CN108647025A (zh) 文档对象模型中节点的处理方法及装置、电子和存储设备
US7720814B2 (en) Repopulating a database with document content
CN106294493A (zh) 实现文档格式转换的方法及装置
US20020035580A1 (en) Computer readable medium containing HTML document generation program
US20110078165A1 (en) Document-fragment transclusion
CN111158687A (zh) Java插件的界面生成方法、装置、计算机设备和存储介质
CN118245050B (zh) 前端框架组件自动转换方法、系统、电子设备及存储介质
CN107423271B (zh) 文档生成方法和装置
US20040221228A1 (en) Method and apparatus for domain specialization in a document type definition
US8726152B2 (en) Automated detection and implementation of state and object modifications
CN114527979A (zh) 一种生成多端表单交互页面的方法及系统
CN112650479B (zh) 网页css结构自动生成方法及存储介质
JP2004348341A (ja) 構造化文書処理システム、構造化文書処理方法及びプログラム
JP2013218627A (ja) 構造化文書からの情報抽出方法、装置、及びプログラム
Rönnau et al. Versioning XML-based office documents: An efficient, format-independent, merge-capable approach
US11210454B2 (en) Method for preparing documents written in markup languages while implementing a user interface for dealing with data of an information system

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 10794926

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 10794926

Country of ref document: EP

Kind code of ref document: A1