EP2732379A1 - Document processing system and method - Google Patents

Document processing system and method

Info

Publication number
EP2732379A1
Authority
EP
European Patent Office
Prior art keywords
document
indicator
hierarchical structure
computer
hierarchical
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
EP12810545.9A
Other languages
German (de)
English (en)
Other versions
EP2732379A4 (fr)
Inventor
Nathan Alexander WHETSELL
Benjamin Stuart WHETSELL
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Paper Software LLC
Original Assignee
Paper Software LLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Paper Software LLC
Priority claimed from PCT/US2012/046297 (WO2013009904A1)
Publication of EP2732379A1
Publication of EP2732379A4
Withdrawn

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/10 Text processing
    • G06F40/12 Use of codes for handling textual entities
    • G06F40/137 Hierarchical processing, e.g. outlines
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/10 Text processing
    • G06F40/166 Editing, e.g. inserting or deleting
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/258 Heading extraction; Automatic titling; Numbering

Definitions

  • This disclosure relates to document tools and, more particularly, to tools for processing and interacting with a document.
  • a method may include identifying a plurality of indicators of hierarchy within a document. The method may also include determining a hierarchical level associated with each of the plurality of indicators. The method may also include associating one or more portions of the document with a respective hierarchical level associated with each of the plurality of indicators.
  • Determining a hierarchical level associated with each of the plurality of indicators may include determining if a current indicator follows a preceding indicator. The method may also include determining if the current indicator follows an ancestor indicator in response to determining that the current indicator does not follow the preceding indicator. The method may also include determining an alternative interpretation of the preceding indicator. The method may also include setting the preceding indicator to the alternative interpretation, and setting the current indicator to follow the preceding indicator.
  • a computer program product may reside on a non-transitory computer readable medium having a plurality of instructions stored on it. When executed by a processor the instructions may cause the processor to perform operations including identifying a plurality of indicators of hierarchy within a document. Instructions may also be included for determining a hierarchical level associated with each of the plurality of indicators. Instructions may further be included for associating one or more portions of the document with a respective hierarchical level associated with each of the plurality of indicators.
  • Each of the plurality of indicators may include an itemizer having one or more of a prefix, a stem, and a suffix.
  • the instructions for determining the hierarchical level associated with each indicator may include instructions for interpreting at least a portion of each of the plurality of indicators to determine a numbering style associated with each indicator and a numerical value associated with each indicator.
  • the instructions for determining a hierarchical level associated with each of the plurality of indicators may include instructions for determining if a current indicator follows a preceding indicator. Instructions may also be included for determining if the current indicator follows an ancestor indicator in response to determining that the current indicator does not follow the preceding indicator. Instructions may also be included for determining an alternative interpretation of the preceding indicator. Instructions may also be included for setting the preceding indicator to the alternative interpretation, and setting the current indicator to follow the preceding indicator.
  • a computing system may include one or more processors configured to identify a plurality of indicators of hierarchy within a document.
  • the one or more processors may also be configured to determine a hierarchical level associated with each of the plurality of indicators.
  • the one or more processors may further be configured to associate one or more portions of the document with a respective hierarchical level associated with each of the plurality of indicators.
  • Each of the plurality of indicators may include an itemizer having one or more of a prefix, a stem, and a suffix.
  • the one or more processors configured to determine the hierarchical level associated with each indicator may be further configured to interpret at least a portion of each of the plurality of indicators to determine a numbering style associated with each indicator and a numerical value associated with each indicator.
  • the one or more processors configured to determine a hierarchical level associated with each of the plurality of indicators may be further configured to determine if a current indicator follows a preceding indicator.
  • the one or more processors may be further configured to determine if the current indicator follows an ancestor indicator in response to determining that the current indicator does not follow the preceding indicator.
  • the one or more processors may be further configured to determine an alternative interpretation of the preceding indicator.
  • the one or more processors may be further configured to set the preceding indicator to the alternative interpretation, and to set the current indicator to follow the preceding indicator.
  • a computer-implemented method may include processing a document to associate a plurality of items within the document with a group.
  • the method may also include modifying, on the processor, a first item included within the group.
  • the method may further include modifying, on the processor, all other items in the group based on the modified first item.
  • the document may include a legal agreement.
  • the group may include defined terms within the legal agreement.
  • the method may also include receiving a request to modify the first item included within the group. Modifying the first item included within the group may be based on, at least in part, the received request. Modifying the first item may include changing a formatting characteristic associated with the first item.
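The group-modification method summarized above can be illustrated with a minimal sketch. The `Item` class, its `formatting` dictionary, and the `modify_group` function are hypothetical names introduced only for illustration; the disclosure does not specify a data model.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Item:
    """Hypothetical document item that belongs to a named group."""
    text: str
    group: str
    formatting: Dict[str, object] = field(default_factory=dict)


def modify_group(items: List[Item], first: Item, **changes: object) -> None:
    """Apply a formatting change to `first`, then to every other item in its group."""
    first.formatting.update(changes)
    for item in items:
        if item is not first and item.group == first.group:
            # Other group members are modified based on the modified first item.
            item.formatting.update(changes)


items = [
    Item("Agreement", "defined_term"),
    Item("Buyer", "defined_term"),
    Item("Article I", "heading"),
]
modify_group(items, items[0], bold=True)
```

After the call, both defined terms carry `bold=True`, while the heading is left unchanged because it belongs to a different group.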
  • A computer program product may include a non-transitory computer readable medium having a plurality of instructions stored on it. When executed by a processor, the instructions may cause the processor to perform operations including processing a document to associate a plurality of items within the document with a group. Instructions may also be included for modifying a first item included within the group. Instructions may further be included for modifying all other items in the group based on the modified first item.
  • the group may include a hierarchical group.
  • the instructions for processing the document may include instructions for processing the document to associate each of the plurality of items with one of a plurality of hierarchical groups.
  • the document may include a legal agreement.
  • the group may include defined terms within the legal agreement.
  • Instructions may be included for receiving a request to modify the first item included within the group. Modifying the first item included within the group may be based on, at least in part, the received request.
  • the instructions for modifying the first item may include instructions for changing a formatting characteristic associated with the first item.
  • a computing system may include at least one processor that may be configured to process a document to associate a plurality of items within the document with a group.
  • The one or more processors may also be configured to modify a first item included within the group.
  • the one or more processors may also be configured to modify all other items in the group based on the modified first item.
  • the group may include a hierarchical group.
  • the one or more processors configured to process the document may be configured to process the document to associate each of the plurality of items with one of a plurality of hierarchical groups.
  • the document may include a legal agreement.
  • the group may include defined terms within the legal agreement.
  • the one or more processors may be further configured to receive a request to modify the first item included within the group.
  • Modifying the first item included within the group may be based on, at least in part, the received request.
  • The one or more processors configured to modify the first item may be configured to change a formatting characteristic associated with the first item.
  • a computer-implemented method may include determining a hierarchical structure associated with a document. The method may also include associating the hierarchical structure with a plurality of elements of the document. The method may further include generating a map of the document. The map may include a plurality of links to one or more of the elements of the document.
  • the document may include a legal agreement.
  • the plurality of elements of the document may include one or more provisions of the legal agreement.
  • Determining the hierarchical structure associated with the document may include processing the document to determine the hierarchical structure.
  • Determining the hierarchical structure associated with the document may include accessing structure data associated with the document.
  • the plurality of links may include hyperlinks configured to navigate the document.
  • the method may further include generating a visual representation of the map.
  • the visual representation may include an arrangement of the plurality of links based on, at least in part, the hierarchical structure of the document.
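As a rough illustration of the map described above, the following sketch turns a list of (hierarchical level, heading) pairs into indented hyperlinks. The `build_map` name, the `element-N` anchor scheme, and the use of HTML are assumptions made for this example and are not taken from the disclosure.

```python
from html import escape
from typing import List, Tuple


def build_map(elements: List[Tuple[int, str]]) -> List[str]:
    """Return one indented hyperlink per document element.

    Each element is (level, heading), with level 1 being the top of the
    hierarchy. The indentation mirrors the hierarchical structure.
    """
    lines = []
    for index, (level, heading) in enumerate(elements):
        anchor = f"element-{index}"  # hypothetical anchor naming scheme
        indent = "  " * (level - 1)
        lines.append(f'{indent}<a href="#{anchor}">{escape(heading)}</a>')
    return lines


document_map = build_map([(1, "Article I"), (2, "Section 1.01"), (2, "Section 1.02")])
```

Each link targets an anchor that a renderer would need to place on the corresponding provision, so selecting a link navigates to that part of the document.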
  • a computer program product resides on a non-transitory computer readable medium having a plurality of instructions stored on it.
  • When executed by a processor, the instructions may cause the processor to perform operations including determining a hierarchical structure associated with a document. Instructions may also be included for associating the hierarchical structure with a plurality of elements of the document. Instructions may further be included for generating a map of the document. The map may include a plurality of links to one or more of the elements of the document.
  • the document may include a legal agreement.
  • the plurality of elements of the document may include one or more provisions of the legal agreement.
  • the instructions for determining the hierarchical structure associated with the document may include instructions for processing the document to determine the hierarchical structure.
  • the instructions for determining the hierarchical structure associated with the document may include instructions for accessing structure data associated with the document.
  • the plurality of links may include hyperlinks configured to navigate the document. Instructions may further be included for generating a visual representation of the map.
  • the visual representation may include an arrangement of the plurality of links based on, at least in part, the hierarchical structure of the document.
  • a computing system includes one or more processors.
  • the one or more processors may be configured to determine a hierarchical structure associated with a document.
  • The one or more processors may also be configured to associate the hierarchical structure with a plurality of elements of the document.
  • the one or more processors may further be configured to generate a map of the document.
  • the map may include a plurality of links to one or more of the elements of the document.
  • One or more of the following features may be included.
  • the document may include a legal agreement.
  • the plurality of elements of the document may include one or more provisions of the legal agreement.
  • the one or more processors configured to determine the hierarchical structure associated with the document may be configured to process the document to determine the hierarchical structure.
  • the one or more processors configured to determine the hierarchical structure associated with the document may be configured to access structure data associated with the document.
  • the plurality of links may include hyperlinks configured to navigate the document.
  • the one or more processors may be further configured to generate a visual representation of the map.
  • the visual representation may include an arrangement of the plurality of links based on, at least in part, the hierarchical structure of the document.
  • FIG. 1 is a diagrammatic view of a distributed computing network including a computing device that executes a document process according to an embodiment of the present disclosure
  • FIG. 2 is a flowchart of an embodiment of the document process of FIG. 1 according to an embodiment of the present disclosure
  • FIG. 3 is a diagrammatic representation of a portion of a document that may be processed by the document process of FIG. 1 according to an embodiment
  • FIG. 4 is a flow chart of another embodiment of the document process of FIG. 1 according to an embodiment
  • FIG. 5 is a diagrammatic representation of a user interface including a portion of a document that may be processed by the document process of FIG. 1 according to an embodiment
  • FIG. 6 is a diagrammatic representation of the user interface of FIG. 5 including a reformatted portion of the document of FIG. 5 according to an embodiment
  • FIG. 7 is a flow chart of yet another embodiment of the document process of FIG. 1 according to an embodiment
  • FIG. 8 is a diagrammatic view of a user interface including a linkable map of a document that may be generated by the document process of FIG. 1 according to an embodiment
  • FIG. 9 is a diagrammatic view of a user interface including a portion of a document that may be navigated by the document process of FIG. 1 according to an embodiment.
  • FIG. 10 is a diagrammatic view of the computing device of FIG. 1 according to an embodiment of the present disclosure.
  • document process 10 may be implemented in a variety of ways.
  • document process 10 may be implemented as a server-side process, a client-side process, or a server-side / client-side process.
  • Document process 10 may be implemented as a purely server-side process via document process 10s.
  • Document process 10 may be implemented as a purely client-side process via one or more of client-side application 10c1, client-side application 10c2, client-side application 10c3, and client-side application 10c4.
  • Document process 10 may be implemented as a server-side / client-side process via document process 10s in combination with one or more of client-side application 10c1, client-side application 10c2, client-side application 10c3, and client-side application 10c4.
  • Document process 10 may include any combination of document process 10s, client-side application 10c1, client-side application 10c2, client-side application 10c3, and client-side application 10c4.
  • document process 10 may provide various functionality.
  • document process 10 may generally process a document to determine a hierarchical structure associated with one or more portions of the document.
  • document process 10 may generally process a document to generate a linkable map of the document based on, at least in part, a hierarchical structure of the document.
  • the linkable map of the document may be utilized to, e.g., navigate the document.
  • document process 10 may allow for groups or classes of items within a document to be modified based on, at least in part, the modification of one item included within a group or class.
  • Various additional and/or alternative embodiments and/or features may also be implemented. Further, various features and embodiments may be implemented alone and/or in combination with one another.
  • Document process 10s may be a server application and may reside on and may be executed by computing device 12, which may be connected to network 14 (e.g., the Internet or a local area network).
  • Examples of computing device 12 may include, but are not limited to: a personal computer, a server computer, a series of server computers, a mini computer, a mainframe computer, or a dedicated network device.
  • The instruction sets and subroutines of document process 10s, which may be stored on storage device 16 coupled to computing device 12, may be executed by one or more processors (not shown) and one or more memory architectures (not shown) included within computing device 12.
  • Examples of storage device 16 may include but are not limited to non-transitory computer readable mediums, such as a hard disk drive; a tape drive; an optical drive; a RAID device; a NAS device; a Storage Area Network; a random access memory (RAM); a read-only memory (ROM); and all forms of flash memory storage devices.
  • Network 14 may be connected to one or more secondary networks (e.g., network 18), examples of which may include but are not limited to: a local area network; a wide area network; or an intranet, for example.
  • Client-side applications 10c1, 10c2, 10c3, 10c4 may include but are not limited to a web browser, a game console user interface, a television user interface, or a specialized application (e.g., including an application running on a mobile platform).
  • The instruction sets and subroutines of client-side applications 10c1, 10c2, 10c3, 10c4, which may be stored on storage devices 20, 22, 24, 26 (respectively) coupled to client electronic devices 28, 30, 32, 34 (respectively), may be executed by one or more processors (not shown) and one or more memory architectures (not shown) incorporated into client electronic devices 28, 30, 32, 34 (respectively).
  • Examples of storage devices 20, 22, 24, 26 may include but are not limited to: hard disk drives; tape drives; optical drives; RAID devices; random access memories (RAM); read-only memories (ROM), and all forms of flash memory storage devices.
  • Examples of client electronic devices 28, 30, 32, 34 may include, but are not limited to, data-enabled, cellular telephone 28, laptop computer 30, personal digital assistant 32, personal computer 34, a notebook computer (not shown), a server computer (not shown), a gaming console (not shown), a data-enabled television console (not shown), and a dedicated network device (not shown).
  • Client electronic devices 28, 30, 32, 34 may each execute an operating system.
  • Users 36, 38, 40, 42 may access document process 10 directly through network 14 or through secondary network 18. Further, document process 10 may be accessed through secondary network 18 via link line 44.
  • the various client electronic devices may be directly or indirectly coupled to network 14 (or network 18).
  • Data-enabled cellular telephone 28 and laptop computer 30 are shown wirelessly coupled to network 14 via wireless communication channels 46, 48 (respectively) established between data-enabled cellular telephone 28, laptop computer 30 (respectively) and cellular network / bridge 50, which is shown directly coupled to network 14.
  • personal digital assistant 32 is shown wirelessly coupled to network 14 via wireless communication channel 52 established between personal digital assistant 32 and wireless access point (i.e., WAP) 54, which is shown directly coupled to network 14.
  • WAP 54 may be, for example, an IEEE 802.11a, 802.11b, 802.11g, 802.11n, Wi-Fi, and/or Bluetooth device that is capable of establishing wireless communication channel 52 between personal digital assistant 32 and WAP 54. Additionally, personal computer 34 is shown directly coupled to network 18 via a hardwired network connection.
  • document process 10 may identify 100 a plurality of indicators of hierarchy within a document. Document process 10 may also determine 102 a hierarchical level associated with each of the plurality of indicators. Further, document process 10 may associate 104 one or more portions of the document with a respective hierarchical level associated with each of the plurality of indicators.
  • document process 10 may identify 100 a plurality of indicators of hierarchy within a document.
  • the document may include a document that does not include meta-data or the like indicating a hierarchical structure of the document. Rather, the hierarchy of the document may be indicated by textual content and/or textual structure of the document.
  • the plurality of indicators of hierarchy within the document may include textual features, such as numbers, lettered captions, and the like; and/or textual structure of the document, such as formatting attributes, and the like.
  • An example of a document that may include indicators of hierarchy may include legal agreements.
  • the document may include any text document, such as a document created with a word processing application, a text editor application, or any other suitable application that may be utilized for the creation of a text document.
  • each of the plurality of indicators may include an itemizer.
  • an itemizer may include a sequence of characters that may identify a hierarchical position or level (i.e., may identify position within a hierarchical structure).
  • Each itemizer may include one or more of a prefix, a stem, and a suffix.
  • a first example of an itemizer may include "Article I", wherein the prefix may include the characters "Article " (including the space character).
  • the stem of the first example itemizer may include the character "I" and a suffix may not be included in the first example itemizer.
  • a second example of an itemizer may include "Section 1.01.".
  • The prefix of the second example itemizer may include "Section 1.0" (including the space character).
  • The stem of the second example itemizer may include the character "1", and the suffix of the second example itemizer may include the character ".".
  • a third example itemizer may include "c)", which may not include a prefix and which may include the character "c" as the stem and the character ")" as the suffix.
  • a fourth example itemizer may include "(i)". In this fourth example itemizer the characters "(" and ")" may respectively be the prefix and suffix, and the character "i" may be the stem. It will be appreciated that various additional and/or alternative itemizers may also be utilized, wherein the itemizers may include a sequence of characters that may provide an indicator of hierarchical position or level.
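The four example itemizers above can be split into prefix, stem, and suffix with a short sketch. The regular expression and the `parse_itemizer` name are assumptions chosen so that the results match the examples in the text (in particular, a leading zero is treated as part of the prefix); the disclosure does not prescribe a parsing technique.

```python
import re
from typing import NamedTuple, Optional


class Itemizer(NamedTuple):
    prefix: str
    stem: str
    suffix: str


# Lazy prefix, then a stem (digits without a leading zero, or letters),
# then a suffix made only of non-alphanumeric characters.
_ITEMIZER = re.compile(
    r"^(?P<prefix>.*?)(?P<stem>[1-9][0-9]*|[A-Za-z]+)(?P<suffix>[^0-9A-Za-z]*)$"
)


def parse_itemizer(text: str) -> Optional[Itemizer]:
    """Split an itemizer into (prefix, stem, suffix), or return None if no stem is found."""
    match = _ITEMIZER.match(text)
    if match is None:
        return None
    return Itemizer(match["prefix"], match["stem"], match["suffix"])


for example in ("Article I", "Section 1.01.", "c)", "(i)"):
    print(repr(example), "->", parse_itemizer(example))
```

Because the prefix is matched lazily and the suffix may contain no letters or digits, the stem is the last alphanumeric run in the itemizer, which is how "Article I" yields the stem "I" rather than "Article".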
  • Document process 10 may also determine 102 a hierarchical level associated with each of the plurality of indicators.
  • each of the plurality of indicators may represent and/or indicate a respective position within a hierarchical structure.
  • Document process 10 may determine 102 the hierarchical level associated with each of the indicators, and thereby determine a respective position associated with each of the plurality of indicators within a hierarchical structure.
  • determining 102 the hierarchical level associated with each indicator may include interpreting 106 at least a portion of each of the plurality of indicators to determine a numbering style associated with each indicator and a numerical value associated with each indicator.
  • document process 10 may interpret 106 at least the stem of an itemizer to determine a numbering style associated with the itemizer and to determine a numerical value associated with the itemizer.
  • document process 10 may interpret 106 the stem of the first example itemizer ("I") to determine a numbering style and a numerical value.
  • document process 10 may interpret 106 the stem "I", as an upper case Roman numeral having a numerical value of one.
  • Alternatively, document process 10 may interpret 106 the stem "I" as an upper case Latin letter having a numerical value of nine (e.g., based on "I" being the ninth letter of the Latin alphabet).
  • Document process 10 may interpret 106 the stem "1" of the second example itemizer as an Arabic numeral having a numerical value of one. Further, document process 10 may interpret 106 the stem "c" of the third example itemizer as a lower case Latin letter having a numerical value of three. Alternatively, document process 10 may interpret 106 the stem "c" as a lower case Roman numeral having a numerical value of 100. Document process 10 may interpret 106 the stem "i" of the fourth example itemizer in a similar manner as the stem of the first example itemizer (i.e., either as a lower case Roman numeral having a numerical value of one or as a lower case Latin letter having a numerical value of nine).
  • When an indicator has more than one possible interpretation, document process 10 may flag the indicator as having multiple possible interpretations and may determine an interpretation in a manner described in detail below. Additional and/or alternative numbering styles and associated numerical values may be utilized in connection with one or more of the plurality of indicators of hierarchy included within the document.
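The interpretation step can be sketched as a function that returns every plausible (numbering style, numerical value) pair for a stem; a result with more than one entry corresponds to an indicator flagged as ambiguous. The function names and the style labels such as `lower_roman` are hypothetical, and only Arabic, Latin-letter, and Roman-numeral styles are covered.

```python
from typing import List, Optional, Tuple

_ROMAN = {"i": 1, "v": 5, "x": 10, "l": 50, "c": 100, "d": 500, "m": 1000}
_NUMERALS = [(1000, "m"), (900, "cm"), (500, "d"), (400, "cd"), (100, "c"),
             (90, "xc"), (50, "l"), (40, "xl"), (10, "x"), (9, "ix"),
             (5, "v"), (4, "iv"), (1, "i")]


def to_roman(number: int) -> str:
    """Render a positive integer as a lower case Roman numeral."""
    parts = []
    for value, symbol in _NUMERALS:
        while number >= value:
            parts.append(symbol)
            number -= value
    return "".join(parts)


def roman_value(text: str) -> Optional[int]:
    """Value of a lower case Roman numeral, or None if it is not well formed."""
    if not text or any(ch not in _ROMAN for ch in text):
        return None
    total = 0
    for i, ch in enumerate(text):
        value = _ROMAN[ch]
        # A smaller symbol before a larger one is subtracted (e.g. "iv").
        if i + 1 < len(text) and _ROMAN[text[i + 1]] > value:
            total -= value
        else:
            total += value
    # Round-trip check rejects forms such as "iiii" or "vx".
    return total if to_roman(total) == text else None


def interpret_stem(stem: str) -> List[Tuple[str, int]]:
    """Return all candidate (numbering style, numerical value) interpretations."""
    if stem.isascii() and stem.isdigit():
        return [("arabic", int(stem))]
    if not (stem.isascii() and stem.isalpha() and (stem.islower() or stem.isupper())):
        return []
    case = "lower" if stem.islower() else "upper"
    lowered = stem.lower()
    results = []
    if len(lowered) == 1:
        results.append((f"{case}_latin", ord(lowered) - ord("a") + 1))
    value = roman_value(lowered)
    if value is not None:
        results.append((f"{case}_roman", value))
    return results


print(interpret_stem("I"))  # [('upper_latin', 9), ('upper_roman', 1)]
print(interpret_stem("c"))  # [('lower_latin', 3), ('lower_roman', 100)]
```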
  • document process 10 may also interpret one or more of the prefix and the suffix, e.g., to determine a relative hierarchical position of the itemizer based on, at least in part, the prefix and/or the suffix.
  • Relative hierarchical position indicated by the prefix and/or suffix may be based on, for example, one or more rules and/or preferences that may be established, and/or based on, at least in part, an indication received from a user.
  • A rule may be established wherein an agreement may be divided into a plurality of articles, and each of the articles may be divided into a plurality of sections. Sections may be further divided into, e.g., subsections, paragraphs, clauses, etc. Various additional and/or alternative embodiments may be implemented.
  • determining 102 a hierarchical level associated with each of the plurality of indicators may include determining 108 if a current indicator follows a preceding indicator.
  • a preceding indicator may include an immediately preceding indicator (i.e., not including an intervening indicator in the document).
  • A current indicator may follow a preceding indicator if, for example, document process 10 interprets 106 the current indicator as having (1) the same prefix and suffix as the preceding indicator, (2) the same numbering style as the preceding indicator, and (3) a numerical value that is one greater than that of the preceding indicator.
  • a prefix and suffix of a current indicator may be the same as the prefix and suffix of a preceding indicator if the prefix and suffix exactly match or approximately match based on, for example, a string edit distance (e.g., the Levenshtein distance).
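The three-part "follows" test, including approximate prefix and suffix matching by Levenshtein distance, might look like the sketch below. The `Indicator` tuple, the `follows` signature, and the `max_distance` threshold are assumptions for illustration; the disclosure does not state how close an approximate match must be.

```python
from typing import NamedTuple


class Indicator(NamedTuple):
    prefix: str
    suffix: str
    style: str   # interpreted numbering style, e.g. "lower_latin"
    value: int   # interpreted numerical value


def levenshtein(a: str, b: str) -> int:
    """Classic dynamic-programming edit distance (insert, delete, substitute)."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            current.append(min(previous[j] + 1,                        # deletion
                               current[j - 1] + 1,                     # insertion
                               previous[j - 1] + (char_a != char_b)))  # substitution
        previous = current
    return previous[-1]


def follows(current: Indicator, preceding: Indicator, max_distance: int = 0) -> bool:
    """True if `current` continues the list that `preceding` belongs to.

    max_distance=0 requires exact prefix/suffix matches; a larger value
    (an assumed tuning knob) tolerates small typos such as "Secton ".
    """
    affixes_match = (
        levenshtein(current.prefix, preceding.prefix) <= max_distance
        and levenshtein(current.suffix, preceding.suffix) <= max_distance
    )
    return (affixes_match
            and current.style == preceding.style
            and current.value == preceding.value + 1)


a = Indicator("(", ")", "lower_latin", 1)
b = Indicator("(", ")", "lower_latin", 2)
print(follows(b, a))  # True
```

A threshold that is large relative to the length of the prefix or suffix would make unrelated itemizers such as "b)" and "(a)" appear to match, so any tolerance would likely need to scale with string length.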
  • agreement 150 may generally include a plurality of provisions (e.g., provisions 152-174) of Article I.
  • Each of example provisions 152-174 may include text of agreement 150 and may be identified by a textual indicator (e.g., respective indicators "(a)", "(b)", "(c)", "(d)", "(e)", "(f)", "(g)", "(h)", "(i)", "(ii)", "(i)", "(j)"), which may represent a hierarchical level of each provision within agreement 150.
  • Document process 10 may determine that the indicator "Article I" may define a hierarchical root, e.g., which may be a primary, or top, hierarchical level, e.g., based on one or more defined rules and/or preferences. Further, document process 10 may determine a hierarchical level associated with indicator "(a)" including interpreting 106 indicator "(a)" as including respective prefix and suffix "(" and ")" and stem "a". Further, document process 10 may interpret 106 the stem "a" as having a lower case Latin alphabetical numbering style and a numerical value of one. Further, document process 10 may determine 108 that current indicator "(a)" (i.e., document process 10 may be currently determining 102 the hierarchical level of indicator "(a)") may begin a new list within a hierarchical level.
  • Document process 10 may determine 102 the hierarchical level associated with indicator "(b)" associated with document provision 154. Determining 102 the hierarchical level associated with indicator "(b)" may include interpreting the stem "b" as having a lower case Latin alphabetical numbering style and a numerical value of two. Further, document process 10 may determine 108 that current indicator "(b)" (i.e., the indicator for which document process 10 may currently be determining a hierarchical level) may follow a preceding indicator. Consistent with the illustrated example, the preceding indicator may include indicator "(a)" (e.g., indicator "(a)" may be the immediately preceding identified 100 indicator).
  • Indicator "(b)" may include a prefix and a suffix which may match the prefix and suffix of indicator "(a)". Further, indicator "(b)" may have a lower case Latin alphabetical numbering style, which may be the same as indicator "(a)". Further, indicator "(b)" may have a numerical value of two, which may be one greater than the numerical value of indicator "(a)". Accordingly, document process 10 may determine that indicator "(b)" may follow indicator "(a)".
  • Document process 10 may determine 108 that indicator "(c)", associated with provision 156, may follow indicator "(b)" associated with provision 154. Further, document process 10 may similarly determine that indicators "(d)", "(e)", "(f)", "(g)", "(h)", and "(i)", respectively associated with provisions 158-168, may follow one another in an ordered sequence residing in a common hierarchical level beneath root hierarchical level "Article I". Continuing with the illustrative embodiment depicted in FIG. 3, document process 10 may interpret 106 indicator "(ii)", associated with provision 170, as having a lower case Roman numeral numbering style and a numerical value of two.
  • Document process 10 may determine that indicator "(ii)" does not follow the preceding indicator (i.e., indicator "(i)" associated with provision 168, being the immediately preceding indicator in agreement 150). For example, document process 10 may have interpreted 106 indicator "(i)" associated with provision 168 as having a lower case Latin alphabetical numbering style and a numerical value of nine, whereby indicator "(i)" associated with provision 168 may follow indicator "(h)", which may be interpreted 106 as having a lower case Latin alphabetical numbering style and a numerical value of eight.
  • the numbering style of indicator "(ii)" may not match the interpretation of the numbering style of indicator "(i)" associated with provision 168. Further, the numerical value of indicator "(ii)" (e.g., two) may not be one greater than the interpreted 106 numerical value of indicator "(i)" associated with provision 168 (e.g., which may have a numerical value of nine).
  • document process 10 may determine 110 if the current indicator follows an ancestor indicator.
  • An ancestor indicator may include any previously identified 100 indicator included under the root hierarchical level (e.g., associated with the indicator "Article I" in the illustrated example).
  • an ancestor indicator may include an identified 100 previous indicator at a higher hierarchical level than the preceding indicator (e.g., at a hierarchical level closer to the root level than the preceding indicator).
  • document process 10 may determine 110 that current indicator "(ii)" does not follow an ancestor indicator (e.g., no previous indicator includes a lower case Roman numeral numbering style, a numerical value one less than indicator "(ii)", and respective prefix and suffix "(" and ")”).
  • document process 10 may determine 112 an alternative interpretation of the preceding indicator (e.g., indicator "(i)" associated with provision 168).
  • the preceding indicator may include more than one interpretation.
  • document process 10 may determine 112 that in addition to including a lower case Latin alphabetical numbering style and having a numerical value of nine, indicator "(i)" associated with provision 168 may include an interpretation as having a lower case Roman numeral numbering style and a numerical value of one.
  • document process 10 may determine 112 the alternative interpretation of indicator "(i)” in response to determining that indicator "(ii)” does not follow the preceding indicator and does not follow any ancestor indicator. In some embodiments, document process 10 may determine 112 the alternative interpretation of indicator "(i)” when interpreting 106 indicator "(i)". In such an embodiment document process 10 may flag indicator "(i)” as including multiple interpretations (e.g., by associating metadata with indicator "(i)", or via another suitable mechanism).
  • determining 112 the alternative interpretation of the preceding indicator may include determining if the alternative interpretation of the preceding indicator may include an interpretation as being a first item within a sequence, and/or an interpretation that the current indicator (e.g., indicator "(ii)") may follow.
  • document process 10 may set 114 the preceding indicator to the alternative interpretation, and may set 116 the current indicator to follow the preceding indicator.
  • indicator "(i)" associated with provision 168 may include the alternative interpretation of having a lower case Roman numeral numbering style and a numerical value of one.
  • Document process 10 may set 114 the interpretation of indicator "(i)” to such an alternative interpretation. Further, document process 10 may set 116 the current indicator "(ii)", to follow indicator "(i)” associated with provision 168, as document process 10 may interpret 106 indicator "(ii)” as having a numbering style that matches that of indicator "(i)” associated with provision 168 (i.e., lower case Roman numerals), as having a numerical value one greater (e.g., two) than that of indicator "(i)” associated with provision 168, and as having a prefix and suffix that match those of indicator "(i)” associated with provision 168.
  • document process 10 may interpret 106 indicator "(i)" associated with provision 172, as having either a lower case Roman numeral numbering style and a numerical value of one, or as having a lower case Latin alphabetical numbering style and a numerical value of nine. Document process 10 may further determine that indicator "(i)" associated with provision 172 does not follow the preceding indicator (i.e., indicator "(ii)”) based on either interpretation. Further, document process 10 may determine 110 that indicator "(i)" associated with provision 172 may follow an ancestor indicator.
  • document process 10 may determine 110 that an interpretation 106 of indicator "(i)" associated with provision 172 as having a lower case Latin alphabetical numbering style and a numerical value of nine (and respective prefix and suffix "(" and ")") may follow ancestor indicator "(h)", which may include an interpretation 106 as also having a lower case Latin alphabetical numbering style and a numerical value of eight (and respective prefix and suffix "(" and ")").
  • determining 110 that indicator "(i)" associated with provision 172 follows an ancestor indicator may include determining 112 an alternative interpretation of indicator "(i)" associated with provision 172 (e.g., if an initial interpretation of indicator "(i)” was as having a lower case Roman numeral numbering style and a numerical value of one).
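The sequence of determinations described above for agreement 150 (follow the preceding indicator, else follow an ancestor indicator, else adopt an alternative interpretation of the preceding indicator) may be illustrated with the following minimal sketch. It is offered under stated assumptions rather than as the disclosed implementation: all function and field names are hypothetical, every indicator is assumed to use the "(" and ")" prefix and suffix, and only lower case Latin and lower case Roman styles are considered.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class Interp:
    style: str
    value: int


ROMAN = re.compile(r"m{0,3}(cm|cd|d?c{0,3})(xc|xl|l?x{0,3})(ix|iv|v?i{0,3})")
DIGITS = {"i": 1, "v": 5, "x": 10, "l": 50, "c": 100, "d": 500, "m": 1000}


def candidates(label):
    """Return every plausible interpretation of a label such as "(i)"."""
    stem = label.strip("()")
    found = []
    if re.fullmatch(r"[a-z]", stem):
        found.append(Interp("lower_latin", ord(stem) - ord("a") + 1))
    if stem and ROMAN.fullmatch(stem):
        total = 0
        for ch, nxt in zip(stem, stem[1:] + " "):
            v = DIGITS[ch]
            total += -v if DIGITS.get(nxt, 0) > v else v  # subtractive pairs
        found.append(Interp("lower_roman", total))
    return found


def follows(cur, prev):
    return cur.style == prev.style and cur.value == prev.value + 1


def assign_levels(labels):
    items = []  # one dict per indicator: label, interp, level
    stack = []  # stack[d]: indices of siblings seen so far at level d + 1
    for label in labels:
        cands = candidates(label)
        if not cands:
            raise ValueError(f"unrecognized indicator: {label!r}")
        depth, chosen = None, None
        # Try the preceding indicator first, then each ancestor, nearest first.
        for d in range(len(stack) - 1, -1, -1):
            prev = items[stack[d][-1]]["interp"]
            chosen = next((c for c in cands if follows(c, prev)), None)
            if chosen is not None:
                depth = d
                break
        # Otherwise look for an alternative reading of the preceding indicator.
        if chosen is None and stack:
            top = items[stack[-1][-1]]
            for alt in candidates(top["label"]):
                hit = next((c for c in cands if follows(c, alt)), None)
                if alt != top["interp"] and alt.value == 1 and hit is not None:
                    top["interp"] = alt
                    if len(stack[-1]) > 1:  # nest it beneath its older sibling
                        stack.append([stack[-1].pop()])
                        top["level"] = len(stack)
                    depth, chosen = len(stack) - 1, hit
                    break
        # Fallback: the indicator opens a new, deeper level.
        if chosen is None:
            chosen = next((c for c in cands if c.value == 1), cands[0])
            stack.append([])
            depth = len(stack) - 1
        del stack[depth + 1:]
        stack[depth].append(len(items))
        items.append({"label": label, "interp": chosen, "level": depth + 1})
    return items
```

Applied to the indicators "(a)" through "(h)", "(i)", "(ii)", "(i)", "(j)" in document order, this sketch places the first "(i)" and "(ii)" one level beneath "(h)" as lower case Roman numerals, and places the second "(i)" and "(j)" back at the level of "(h)" as lower case Latin letters, consistent with the example above.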
  • Document process 10 may associate 104 one or more portions of the document with a respective hierarchical level associated with each of the plurality of indicators.
  • indicator “(a)” may be associated with provision 152
  • indicator “(b)” may be associated with provision 154
  • indicator “(c)” may be associated with provision 156
  • indicator “(d)” may be associated with provision 158
  • indicator “(e)” may be associated with provision 160
  • indicator “(f)” may be associated with provision 162
  • indicator “(g)” may be associated with provision 164
  • indicator “(h)” may be associated with provision 166
  • indicator "(i)” may be associated with provision 168
  • indicator “(ii)” may be associated with provision 170
  • indicator “(i)” may be associated with provision 172
  • indicator “(j)” may be associated with provision 174.
  • document process 10 may associate 104 the provision associated with each respective indicator with the hierarchical level associated with each indicator. Accordingly, each provision 152-174 may include a respective hierarchical level. In some embodiments, document process 10 may associate metadata including an indicator of hierarchical level with each provision. Other mechanisms for associating 104 one or more portions of the document with a respective hierarchical level associated with each of the plurality of indicators may similarly be utilized.
  • a respective hierarchical level may be associated with each of the one or more portions of the document.
  • the hierarchical level may be included as an intrinsic property of each of the one or more portions of the document.
  • the hierarchical level may be associated with each of the one or more portions of the document, e.g., as metadata associated with each of the one or more portions of the document.
  • the hierarchical level associated with each of the one or more portions of the document may include, for example, metadata indicating a relative hierarchical level of each of the portions of the document.
  • the respective hierarchical level associated with each of the one or more portions of the document may be independent of any particular indicators of hierarchy or numbering style or system.
  • the relative hierarchical position of the one or more portions of the document may be dynamically adjusted in response to one or more portions of the document being added, deleted, and/or moved. As such, the determined hierarchical structure of the document may remain intact.
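As a hedged illustration of a hierarchical level that is stored independently of any numbering style, the following sketch keeps only a relative level with each portion and regenerates the visible indicators on demand. The Portion class, the render function, and the per-level style table are assumptions made for this example; the Latin style is only valid for values up to 26 and the Roman helper for small values.

```python
from dataclasses import dataclass


@dataclass
class Portion:
    text: str
    level: int  # relative hierarchical level kept as metadata (1 = top level)


def to_latin(n):
    return chr(ord("a") + n - 1)  # assumes 1 <= n <= 26


def to_roman(n):
    out = ""
    for value, digits in [(10, "x"), (9, "ix"), (5, "v"), (4, "iv"), (1, "i")]:
        while n >= value:
            out += digits
            n -= value
    return out


# Hypothetical style table: level 1 uses letters, level 2 uses Roman numerals.
STYLES = [to_latin, to_roman]


def render(portions):
    """Derive indicators from the stored levels alone."""
    counters = []
    lines = []
    for p in portions:
        del counters[p.level:]          # leaving a deeper level resets it
        while len(counters) < p.level:
            counters.append(0)
        counters[p.level - 1] += 1
        style = STYLES[(p.level - 1) % len(STYLES)]
        label = style(counters[p.level - 1])
        lines.append("  " * (p.level - 1) + f"({label}) {p.text}")
    return lines
```

Because the indicators are derived rather than stored, deleting or moving a portion and calling render again yields a consistently renumbered document without altering the stored levels of the remaining portions.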
  • document process 10 may process 200 a document to associate a plurality of items within the document with a group.
  • Document process 10 may also modify 202 a first item included within the group.
  • Document process 10 may further modify 204 all other items in the group based on the modified first item.
  • the document may include a legal agreement and/or another document type that may include a hierarchical structure.
  • document process 10 may process 200 a document to associate a plurality of items within the document with a group in which the group may include a hierarchical group.
  • processing 200 the document may include processing the document to associate 206 each of the plurality of items with one of a plurality of hierarchical groups.
  • document process 10 may generally process the document to group the plurality of items included within the document into one of a plurality of hierarchical groups.
  • document process 10 may process 200 the document to associate 206 the various provisions of the document with one of a plurality of hierarchical groups.
  • document process 10 may process 200 the document to associate 206 provisions 252, 262, and 272 with a first level hierarchical group.
  • document process 10 may associate 206 provision 252 with a first level hierarchical group under a root hierarchical group Article I.
  • document process 10 may associate 206 provisions 262 and 272 with a first level hierarchical group under a root hierarchical group Article II.
  • document process 10 may associate 206 provisions 254-256 and 264-266 with a second level hierarchical group (e.g., with provisions 254-256 being in a second level hierarchical group under provision 252, and with provisions 264-266 being in a second level hierarchical group under provision 262).
  • document process 10 may associate 206 provisions 268-270 with a third level hierarchical group (e.g., a third level hierarchical group under provision 266).
  • Processing 200 the document and/or associating 206 each of the plurality of items within the document with one of a plurality of hierarchical groups may include determining a hierarchical structure of the document, e.g., in a manner as generally described above (e.g., with reference to FIGS. 2 and 3).
  • the document may include structural information (e.g., in the form of metadata associated with the various provisions, etc.) that document process 10 may utilize for processing 200 the document.
  • the group may include defined terms within the legal agreement.
  • the plurality of defined terms (e.g., a plurality of items within the document) may be identified and associated with the group including defined terms based on, for example, metadata associated with the defined terms (e.g., which may identify the items as being defined terms) and/or other common attributes of the items (e.g., formatting, such as the terms being in bold type, underlined, set in upper case letters, in title case, etc.).
  • Document process 10 may associate the identified items with a group.
  • Various items may be associated with various different groups.
  • Document process 10 may also modify 202 a first item included within the group. For example, formatting or other characteristics of the first item may be modified.
  • document process 10 may receive 208 a request to modify the first item included within the group. For example, from within user interface 250, a user (e.g., user 36 operating personal computer 28) may select provision 258 (e.g., using onscreen pointer 274, which may be controlled by a pointing device, such as a mouse; not shown) by clicking (and/or otherwise selecting) provision 258.
  • document process 10 may receive 208 a request to modify an item within a group that includes the third hierarchical level of the document.
  • document process 10 may provide pop-up window 276.
  • pop-up window 276 may include formatting options, e.g., which may include options to modify a numbering style 278 and/or an indentation style 280 associated with provision 258. While pop-up window 276 is shown to include options for modifying formatting characteristics of provision 258, options may be included for modifying other characteristics of provision 258. Further, while pop-up window 276 is shown to include options for modifying numbering style and indentation of provision 258, options may be included for modifying other formatting characteristics (e.g., typeface, font size, capitalization, etc.). Accordingly, the illustrated embodiment should not be understood to limit the scope of the present disclosure.
  • user 38 may choose, via onscreen pointer 274, the numbering style "a, b, c ..." from dropdown menu 282.
  • document process 10 may receive 208 a request to modify the first item (e.g., provision 258) included within the group (e.g., the group including the third hierarchical level of the document) by changing the number of provision 258 to lowercase alphabetical numbering format. Accordingly, modifying 202 the first item may be based on, at least in part, the received 208 request.
  • first item is intended to indicate any one of the items, without constraint on the order in which the item occurs in the group and/or the order in which the item was added to the group.
  • Document process 10 may further modify 204 all other items in the group based on the modified first item.
  • document process 10 may modify 204 all of the other items within the group (e.g., provisions 260, 268, 270 included within the third level hierarchical group) to include a lower case alphabetical numbering style based on, at least in part, the modification of provision 258.
  • all items in a third level hierarchical group throughout the document may be modified 204 to have the same numbering style. Even in a situation, as shown in FIG.
  • document process 10 may modify the numbering of all items included within the group based on the modification of the first item.
  • modifying all of the other items included within the group may include changing formatting characteristics associated with all of the items within the group based on and/or to match the formatting characteristics of the first item. As discussed above, characteristics or attributes other than formatting characteristics may also be modified.
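The group-wide modification described above may be sketched, purely for illustration, as follows. The Item class, the modify_group function, and the style names are hypothetical; the reference numerals are used only as identifiers, and the group of each item is assumed to be its hierarchical level.

```python
from dataclasses import dataclass


@dataclass
class Item:
    ref: int        # reference numeral of the provision, used here as an id
    group: int      # group the item belongs to (here, its hierarchical level)
    numbering: str  # name of the numbering style currently applied


def modify_group(items, first_ref, numbering):
    """Modify one item, then bring every other item in its group into line."""
    first = next(item for item in items if item.ref == first_ref)
    first.numbering = numbering  # modify the first item as requested
    for other in items:          # modify all other items in the same group
        if other is not first and other.group == first.group:
            other.numbering = first.numbering
    return items
```

For example, calling modify_group(items, 258, "lower_latin") on a list in which provisions 258, 260, 268, and 270 share group 3 would change the numbering style of all four, while items in other groups would be left unchanged.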
  • document process 10 may determine 300 a hierarchical structure associated with a document. Document process 10 may also associate 302 the hierarchical structure with a plurality of elements of the document. Document process 10 may further generate 304 a map of the document. The map may include a plurality of links to one or more of the elements of the document.
  • the document may include any type of document including a hierarchical structure that may be indicated by textual content and/or textual structure of the document.
  • An example of a document that may include a hierarchical structure may include a legal agreement.
  • the present disclosure may similarly be applicable to other documents having a hierarchical structure.
  • document process 10 may determine 300 a hierarchical structure associated with a document.
  • determining 300 the hierarchical structure associated with the document may include processing 306 the document to determine the hierarchical structure.
  • a document including a hierarchical structure indicated by the textual content and/or textual structure of the document may be processed 306 to determine the hierarchical structure in a manner as generally described above with reference to FIGS. 2 and 3.
  • document process 10 may process 306 the document to identify one or more indicators of hierarchy within the document, and may determine a hierarchical level associated with each of the indicators.
  • determining 300 the hierarchical structure associated with the document may include accessing 308 structure data associated with the document.
  • a document may have previously been processed, e.g., in a manner as described above.
  • hierarchical structure data may be associated with the document, e.g., in the form of metadata, or otherwise associated with the document.
  • the document may otherwise include structure data associated with the document.
  • document process 10 may access 308 the structure data associated with the document to determine the hierarchical structure associated with the document.
  • Document process 10 may also associate 302 the hierarchical structure with a plurality of elements of the document. For example, document process 10 may associate each of the plurality of elements of the document with a specific hierarchical level. Further, in an embodiment in which the document may include a legal agreement, the plurality of elements of the document may include one or more provisions of the legal agreement. In such an embodiment, document process 10 may associate a specific hierarchical level with each of the provisions of the legal agreement.
  • Document process 10 may further generate 304 a map of the document.
  • the map may include a plurality of links to one or more of the elements of the document having an associated hierarchical level. Additionally, document process 10 may generate a visual representation of the map.
  • document process 10 (alone and/or in conjunction with one or more additional applications) may generate user interface 350.
  • User interface 350 may include a visual representation of the map of the document, which may include an arrangement of the plurality of links based on, at least in part, the hierarchical structure of the document.
  • the map shown in user interface 350 may include a map of a legal agreement including a plurality of provisions.
  • the visual representation of the map may include a link associated with each of the provisions (e.g., which may each be associated with a hierarchical level or position within the hierarchical structure of the document).
  • the links associated with each of the provisions of the document may be arranged in the visual representation in a manner that may be based on the hierarchical structure of the document.
  • the arrangement of the links within the visual representation may provide a visual indicator of the relative hierarchical position of each of the elements of the document associated with the links. While the illustrated arrangement of the links within the visual representation included in user interface 350 utilizes indentations to indicate relative hierarchical level associated with the various links, it will be appreciated that such an arrangement is only intended for the purpose of illustration, as other implementations may be utilized.
  • the plurality of links may include hyperlinks configured to navigate the document.
  • each of the section numbers and associated titles may include a hyperlink.
  • a user (e.g., user 38 accessing document process 10c1 via personal computer 28) may select a link within user interface 350 (e.g., link "2.1 Dolor Sit Amet") using onscreen pointer 274.
  • document process 10 may navigate to a portion of the document associated with the hyperlink.
  • document process 10 may display user interface 352 which may include the portion of the document associated with the hyperlink (namely section 2.1 of the agreement).
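A minimal sketch of generating such a map and navigating by a selected link is given below, under stated assumptions. The Element class, the generate_map, render_map, and navigate functions, and the "#sec-" target format are hypothetical, and every section title other than "2.1 Dolor Sit Amet" is placeholder text.

```python
from dataclasses import dataclass


@dataclass
class Element:
    number: str  # section number, e.g. "2.1"
    title: str
    level: int   # hierarchical level associated with the element (1 = top)


def generate_map(elements):
    """Build one link per element, carrying its relative hierarchical depth."""
    return [
        {
            "label": f"{e.number} {e.title}",
            "target": f"#sec-{e.number}",  # hypothetical anchor format
            "indent": e.level - 1,
        }
        for e in elements
    ]


def render_map(links):
    """Arrange the links with indentation reflecting the hierarchy."""
    return "\n".join("    " * link["indent"] + link["label"] for link in links)


def navigate(links, label):
    """Return the target of the link the user selected."""
    return next(link["target"] for link in links if link["label"] == label)
```

In this sketch indentation is the visual indicator of hierarchical position, matching the illustrated arrangement, although any other arrangement could be substituted in render_map without changing how the links themselves are generated.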
  • Referring to FIG. 10, there is shown a diagrammatic view of computing system 12. While computing system 12 is shown in this figure, this is for illustrative purposes only and is not intended to be a limitation of this disclosure, as other configurations are possible. For example, any computing device capable of executing, in whole or in part, document process 10 may be substituted for computing device 12 within FIG. 10, examples of which may include but are not limited to client electronic devices 28, 30, 32, 34.
  • Computing system 12 may include microprocessor 400 configured to, e.g., process data and execute instructions/code for document process 10.
  • Microprocessor 400 may be coupled to storage device 16.
  • Examples of storage device 16 may include but are not limited to: a hard disk drive; a tape drive; an optical drive; a RAID device; a NAS device; a Storage Area Network; a random access memory (RAM); a read-only memory (ROM); and all forms of flash memory storage devices.
  • IO controller 402 may be configured to couple microprocessor 400 with various devices, such as keyboard 406, mouse 408, USB ports (not shown), and printer ports (not shown).
  • Display adaptor 410 may be configured to couple display 412 (e.g., a CRT or LCD monitor) with microprocessor 400, while network adapter 414 (e.g., an Ethernet adapter) may be configured to couple microprocessor 400 to network 14 (e.g., the Internet or a local area network).
  • the present disclosure may be embodied as a method (e.g., executing in whole or in part on computing device 12), a system (e.g., computing device 12), or a computer program product (e.g., encoded within storage device 16).
  • the present disclosure may take the form of an entirely hardware embodiment, an entirely software embodiment (including firmware, resident software, micro-code, etc.) or an embodiment combining software and hardware aspects that may all generally be referred to herein as a "circuit,” “module” or “system.”
  • the present disclosure may take the form of a computer program product on a computer-usable storage medium (e.g., storage device 16) having computer-usable program code embodied in the medium.
  • Any suitable computer usable or computer readable medium (e.g., storage device 16) may be utilized.
  • the computer-usable or computer-readable medium may be, for example but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, device, or propagation medium.
  • the computer-readable medium may include the following: an electrical connection having one or more wires, a portable computer diskette, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or Flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a transmission media such as those supporting the Internet or an intranet, or a magnetic storage device.
  • the computer-usable or computer-readable medium may also be paper or another suitable medium upon which the program is printed, as the program can be electronically captured, via, for instance, optical scanning of the paper or other medium, then compiled, interpreted, or otherwise processed in a suitable manner, if necessary, and then stored in a computer memory.
  • a computer-usable or computer-readable medium may be any medium that can contain, store, communicate, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, or device.
  • the computer-usable medium may include a propagated data signal with the computer-usable program code embodied therewith, either in baseband or as part of a carrier wave.
  • the computer usable program code may be transmitted using any appropriate medium, including but not limited to the Internet, wireline, optical fiber cable, RF, etc.
  • Computer program code for carrying out operations of the present disclosure may be written in an object oriented programming language such as Java, Smalltalk, C++ or the like. However, the computer program code for carrying out operations of the present disclosure may also be written in conventional procedural programming languages, such as the "C" programming language or similar programming languages.
  • the program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In the latter scenario, the remote computer may be connected to the user's computer through a local area network / a wide area network / the Internet (e.g., network 14).
  • These computer program instructions may be provided to a processor (e.g., processor 400) of a general purpose computer / special purpose computer / other programmable data processing apparatus (e.g., computing device 12), such that the instructions, which execute via the processor (e.g., processor 400) of the computer or other programmable data processing apparatus, create means for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks.
  • These computer program instructions may also be stored in a computer-readable memory (e.g., storage device 16) that may direct a computer (e.g., computing device 12) or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function/act specified in the flowchart and/or block diagram block or blocks.
  • the computer program instructions may also be loaded onto a computer (e.g., computing device 12) or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks.
  • each block in the flowchart or block diagrams may represent a module, segment, or portion of code, which comprises one or more executable instructions for implementing the specified logical function(s).
  • the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

Disclosed are a method and a computer system for determining a hierarchical structure associated with a document. The hierarchical structure may be associated with a plurality of elements of the document. A map of the document may be generated. The map may include a plurality of links to one or more of the elements of the document.
EP12810545.9A 2011-07-11 2012-07-11 System and method for processing document Withdrawn EP2732379A4 (fr)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US201161506408P 2011-07-11 2011-07-11
US201161534515P 2011-09-14 2011-09-14
PCT/US2012/046297 WO2013009904A1 (fr) 2011-07-11 2012-07-11 System and method for processing document

Publications (2)

Publication Number Publication Date
EP2732379A1 (fr) 2014-05-21
EP2732379A4 (fr) 2015-10-21

Family

ID=50482861

Family Applications (3)

Application Number Title Priority Date Filing Date
EP12810666.3A Withdrawn EP2732367A4 (fr) 2011-07-11 2012-07-11 System and method for processing document
EP12811496.4A Ceased EP2732382A4 (fr) 2011-07-11 2012-07-11 System and method for processing document
EP12810545.9A Withdrawn EP2732379A4 (fr) 2011-07-11 2012-07-11 System and method for processing document

Family Applications Before (2)

Application Number Title Priority Date Filing Date
EP12810666.3A Withdrawn EP2732367A4 (fr) 2011-07-11 2012-07-11 System and method for processing document
EP12811496.4A Ceased EP2732382A4 (fr) 2011-07-11 2012-07-11 System and method for processing document

Country Status (1)

Country Link
EP (3) EP2732367A4 (fr)

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7124358B2 (en) * 2002-01-02 2006-10-17 International Business Machines Corporation Method for dynamically generating reference identifiers in structured information
US20050102612A1 (en) * 2003-11-06 2005-05-12 International Business Machines Corporation Web-enabled XML editor
CA2686900C (fr) * 2007-05-09 2017-05-23 Lexisnexis Group Systems and methods for analyzing documents
US8606796B2 (en) * 2008-09-15 2013-12-10 Kilac, LLC Method and system for creating a data profile engine, tool creation engines and product interfaces for identifying and analyzing files and sections of files

Also Published As

Publication number Publication date
EP2732367A1 (fr) 2014-05-21
EP2732382A1 (fr) 2014-05-21
EP2732382A4 (fr) 2015-10-21
EP2732367A4 (fr) 2015-10-21
EP2732379A4 (fr) 2015-10-21

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 20140210

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR

DAX Request for extension of the european patent (deleted)
REG Reference to a national code

Ref country code: HK

Ref legal event code: DE

Ref document number: 1198218

Country of ref document: HK

RA4 Supplementary search report drawn up and despatched (corrected)

Effective date: 20150922

RIC1 Information provided on ipc code assigned before grant

Ipc: G06F 17/22 20060101AFI20150916BHEP

Ipc: G06F 17/27 20060101ALI20150916BHEP

Ipc: G06F 17/24 20060101ALI20150916BHEP

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: EXAMINATION IS IN PROGRESS

17Q First examination report despatched

Effective date: 20190613

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: EXAMINATION IS IN PROGRESS

REG Reference to a national code

Ref country code: HK

Ref legal event code: WD

Ref document number: 1198218

Country of ref document: HK

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN

18D Application deemed to be withdrawn

Effective date: 20210202