GB2480703A - Automated report generation system - Google Patents

Automated report generation system Download PDF

Info

Publication number
GB2480703A
GB2480703A GB1010390A GB201010390A GB2480703A GB 2480703 A GB2480703 A GB 2480703A GB 1010390 A GB1010390 A GB 1010390A GB 201010390 A GB201010390 A GB 201010390A GB 2480703 A GB2480703 A GB 2480703A
Authority
GB
United Kingdom
Prior art keywords
client
format
report
interface
server computer
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
GB1010390A
Other versions
GB201010390D0 (en
Inventor
Daniel Lord
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Content Catalyst Ltd
Original Assignee
Content Catalyst Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US12/787,519 external-priority patent/US8769392B2/en
Application filed by Content Catalyst Ltd filed Critical Content Catalyst Ltd
Priority to GB1010390A priority Critical patent/GB2480703A/en
Publication of GB201010390D0 publication Critical patent/GB201010390D0/en
Publication of GB2480703A publication Critical patent/GB2480703A/en
Withdrawn legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/34Browsing; Visualisation therefor
    • G06F16/345Summarisation for human users
    • G06F17/30719
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/12Use of codes for handling textual entities
    • G06F40/131Fragmentation of text files, e.g. creating reusable text-blocks; Linking to fragments, e.g. using XInclude; Namespaces
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/12Use of codes for handling textual entities
    • G06F40/151Transformation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/166Editing, e.g. inserting or deleting

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Document Processing Apparatus (AREA)

Abstract

A computer implemented system in a client-server environment, is provided for document research, aggregation, and compilation. The system includes a server having a search module configured to generate a search interface displayable to a client communicably coupled to the server, and responsive to instructions received via the search interface, to search and sort selections of interest within a plurality of source documents. The system further includes a customized report module configured to generate a custom report interface displayable to the client, and responsive to user selection instructions received via the custom report interface, to extract and aggregate the selections of interest into a customized report. The system further includes a plurality of format specific, preferably at least Adobe PDF (RTM), Microsoft Excel (RTM) and Microsoft PowerPoint (RTM), modules, respectively configured to enable source documents in the specific formats to be searched, sorted, and aggregated by the search module and customized report module.

Description

Automated Report Generation System and Method
BACKGROUND
Technical Field
This invention relates to web-enabled transactions, and more particularly to an automated system for document research, aggregation, and report generation.
Background Information
Existing publishing systems provide mechanisms for searching and navigating through a library of documents. Such systems, however, are limited in their ability enable users to repurpose content from source documents into their own project reports. For example, business consulting firms typically spend considerable staff time assembling relevant findings from various sources into a report that is specifically tailored to a particular client's project.
This task often requires considerable human operator time to: identify and assemble relevant content from a range of source material types; convert each to a common file format; maintain a record of the source documents, including author, publisher, publication dates, terms of use, etc.; provide output of aggregated content in desired output file formats; facilitate peer review and provide reference to the original sources; and retain selection criteria and other records in order to regenerate the report at a later date based on updated source documents.
Moreover, conventional approaches used to automate portions of this process have faltered in their attempts to convert the source material from their various native formats into a common format to facilitate manipulation. These approaches have either been unable to convert some formats (e.g., from image formats into text), or have done so in a manner that has been prone to errors including loss of original content and/or loss of original formatting including pagination, line numbering, and visual integrity errors, etc. Moreover, because such errors are generated early in the process, on the source material, they tend to be propagated to all users of that material, and may also be compounded during subsequent reformatting into the desired output format.
Thus, a need exists for a system and method that addresses the foregoing limitations.
SUMMARY
One aspect of the invention includes a computer implemented system in a client-server environment, for document research, aggregation, and compilation. The system includes a server having a search module configured to generate a search interface displayable to a client communicably coupled to the server, and responsive to instructions received via the search interface, to search and sort selections of interest within a plurality of source documents. A customized report module is configured to generate a custom report interface displayable to the client, and responsive to user selection instructions received via the custom report interface, to extract and aggregate the selections of interest into a customized report. PDF, PPT, and XLS modules are configured to respectively enable source documents in the PDF, PPT, and XLS formats to be searched, sorted, and aggregated by the search module and customized report module.
Another aspect of the invention includes a method for automated document research, aggregation, and compilation, in a client-server environment. The method includes generating, with a server computer, a search interface displayable to a client communicably coupled to the server. The method further includes searching and sorting, with the server computer in response to instructions received via the search interface, selections of interest within a plurality of source documents. A custom report interface displayable to the client is generated with the server computer. The selections of interest are extracted and aggregated into a custom report, in response to user selection instructions received via the custom report interface. In response to user selection instructions, the server computer enables documents in the PDF, PPT, and XLS formats to be searched, sorted, extracted and aggregated into the customized report.
Yet another aspect of the invention includes an article of manufacture for automated document research, aggregation, and compilation, in a client-server environment. The article of manufacture includes a computer usable medium having a computer readable program code embodied therein for generating a search interface displayable to a client communicably coupled to the server. Computer readable program code is also provided for searching and sorting, in response to instructions received via the search interface, selections of interest within a plurality of source documents. The program code is also configured for generating a custom report interface displayable to the client, and extracting and aggregating, in response to user selection instructions received via the custom report interface, the selections of interest into a customized report. Program code is also provided for enabling, with the server computer in response to user selection instructions, documents in the PDF, PPT, and XLS formats to be searched, sorted, extracted and aggregated into the customized report.
The features and advantages described herein are not all-inclusive and, in particular, many additional features and advantages will be apparent to one of ordinary skill in the art in view of the drawings, specification, and claims. Moreover, it should be noted that the language used in the specification has been principally selected for readability and instructional purposes, and not to limit the scope of the inventive subject matter.
BRIEF DESCRIPTION OF THE DRAWINGS
The present invention is illustrated by way of example and not limitation in the figures of the accompanying drawings, in which like references indicate similar elements and in which: Fig. 1 is a block diagram of one embodiment of a network-based transaction facility of the present invention; Fig. 2 is a block diagram of one embodiment of a database maintained by a database engine server of the embodiment of the present invention; Fig. 3 is a diagrammatic representation of one embodiment of a user table within the database; Figs. 4A-4G are diagrammatic representations of an embodiment of various data tables within the database; Fig. 5 is a simplified block diagram of one embodiment of a system of the present invention including a transaction facility; Fig. 6 is a functional block diagram of objects within an embodiment of a transaction facility of the present invention; Fig. 7A is a flow chart of one embodiment of a method of present invention; Fig. 7B is a flow chart of optional aspects of the embodiment of Fig. 7A; Figs. 8-15 are exemplary representations of various interfaces provided by the objects of Fig. 6; and Fig. 16 is a block diagram of an exemplary embodiment of a computer system usable in connection with the foregoing embodiments.
DETAILED DESCRIPTION
In the following detailed description, reference is made to the accompanying drawings that form a part hereof, and in which is shown by way of illustration, specific embodiments in which the invention may be practiced. These embodiments are described in sufficient detail to enable is those skilled in the art to practice the invention, and it is to be understood that other embodiments may be utilized. It is also to be understood that structural, procedural and system changes may be made without departing from the spirit and scope of the present invention. In addition, well-known structures, circuits and techniques have not been shown in detail in order not to obscure the understanding of this description. The following detailed description is, therefore, not to be taken in a limiting sense, and the scope of the present invention is defined by the appended claims and their equivalents.
General Overview A method and apparatus for providing automated document research and report generation in a network-based transaction facility are described. In the following description, for purposes of explanation, numerous specific details are set forth in order to provide a thorough understanding of the present invention. It will be evident, however, to one skilled in the art that the present invention may be practiced without these specific details.
Embodiments of the present invention effectively automate the report generation process by: enabling users to automatically search and select sections of relevant reports based on the user's search criteria; enabling selected sections of distinct file formats (including.doc, ppt, .xls and.pdf files, images, etc.), to be stored and indexed in their native formats; assembling the sections into a single, customized publication (report) that tracks the source of the sections for proper attribution, terms of use, etc.; and enables the customized publication to be viewed online by colleagues for comment, etc. In particular, while some of these aspects may be at least partially provided by conventional systems, the present invention has overcome a significant problem in the prior art, namely that of enabling file portions of various mutually distinct formats to be conveniently aggregated, while maintaining accurate records of the source(s) of those file io portions (sections), including their location(s) within the source documents. Embodiments of the invention provide this capability by operating contrary to the accepted practice of converting source material of various native formats into a common format prior to use.
Instead of attempting such conversions, these embodiments generate and store a definition of a user's custom report in the form of a sequence of references to the source material portions in their native formats (e.g., .doc, .ppt, .xls, .pdf, etc.). Each selected source document portion is thus parsed/tracked using units/parameters that are appropriate to those formats, such as slide numbers (for PowerPoint), cell coordinates (for.xls), paragraph numbers (for.doc), or rectangles having particular position coordinates on a page (for images, .pdf, etc.).
The system then uses the selected source document portions in their native formats (effectively inserting them as-is' into the custom report/publication), while tracking the particular units/parameters in a data table linked to the custom report. This approach thus conveniently and accurately permits source documents of substantially any native format to be conveniently incorporated into the custom report without generating format conversion errors.
Moreover, in the event the user wishes to save the custom report into a particular output format, such as WordTM (Microsoft Corp., Redmond, Wash.), PDF, or other format to facilitate webpage display, such a conversion occurs only after the custom report has been generated, so that each source document portion is re-formatted at most only once. This approach thus tends to minimize reformatting errors by effectively preventing errors from being compounded by multiple format conversions such as may otherwise occur in the event a source document were converted to a common file format prior to report generation, and then re-formatted into a user-selected output format.
In a non-limiting, simplified example, an embodiment of the present invention may be used to build a report by selecting a portion of a page from a Word (DOC) text file (which is identified as ID:3456), and a portion of a page of a PDF file (which is identified by page number and coordinates of the clipped section as PDF:p45{20,20:400,800}). The user may also select a table from an Excel® (Microsoft) file (in the XLS format, identified by worksheet and cell coordinates as XL:Sheetl{1,1:20,5} ). This custom report may thus be stored as a sequence of these identifiers/references to the source material in its native format.
The user may then view the custom report online, and/or save it to a project (output) file, such as a Word doe. In this latter instance, any source data not natively in the target (e.g., 10.doc) format, would be reformatted into.doc. Thus, in this example, the first selected portion would then be simply inserted as Word text, image or table; the PDF section may be converted to an image and inserted as an image; and the Excel® excerpt would be inserted as a table in the final Word doc. Each of these sections may include a reference link in the final Word doc to permit the user to link to the original source content in its native format.
In addition, the foregoing architecture permits particular embodiments of the invention to conveniently provide for enhanced version control. Publishers often need to provide continuous updates to their reports, and would like their users to sign up for subscriptions to receive these updates, rather than simply make one-off purchases of reports. There are also many other applications, such as legal guidelines, etc., which would benefit from being able to compare different dates/versions of the same report, to see what's new, or to see which rules were in effect when a particular issue arose. The embodiments described herein enable reports to evolve with a virtually unlimited number of revisions. For example, when a new revision is uploaded, these embodiments do not simply save the full report, but rather, add the new sections. The old sections are also retained, with every section stored with a unique ID code, date and version. This allows the system to reconstruct the report as it was at any date, or to compare the versions at particular dates.
In particular embodiments, this version control may be used in connection with a user's subscription expiration, such as to allow users to continue to access the latest version of the custom report at the point their subscription expired. The publisher may then, for example, encourage those users to renew their subscriptions by providing them with a customized summary of the updates they would be able to access if they choose to renew.
Terminology For the purposes of the present specification, the term transaction" shall be taken to include any communications between two or more entities and shall be construed to include, but not be limited to, commercial transactions including data searching, aggregation, and presentation, including sale and purchase transactions. The terms "computer" and "end-user device" are meant to encompass a workstation, personal computer, personal digital assistant (PDA), wireless telephone, or any other suitable computing device including a processor, a computer readable medium upon which computer readable program code (including instructions and/or data) may be disposed, and a user interface. Terms such as "server", "application", "engine" and the like are intended to refer to a computer-related component, including hardware, software, and/or software in execution. For example, an engine may be, but is not limited to being, a process running on a processor, a processor including an object, an executable, a thread of execution, a program, and a computer. Moreover, the various components may be localized on one computer and/or distributed between two or more computers. The terms "real-time" and "on-demand" refer to sensing and responding to external events nearly simultaneously (e.g., within milliseconds or microseconds) with their occurrence, or without intentional delay, given the processing limitations of the system and the time required to accurately respond to the inputs. Terms such as "component," "module", and the like are intended to refer to a computer-related entity, either hardware, a combination of hardware and software, software, or software in execution. For example, a component may be, but is not limited to being, a process running on a processor, a processor, an object, an executable, a thread of execution, a program, and a computer. By way of illustration, both an application running on a server and the server (or control related devices) can be components.
One or more components may reside within a process and/or thread of execution and a component may be localized on one computer and/or distributed between two or more computers or control devices.
Programming Languages The system and method embodying the present invention can be programmed in any suitable language and technology, such as, but not limited to: C++; Visual Basic; Java; VBScript; Jscript; BCMAscript; DHTM1; XML and CGI. Alternative versions may be developed using other programming languages including, Hypertext Markup Language (HTML), Active ServerPages (ASP) and Javascript. Any suitable database technology can be employed, such as, but not limited to, Microsoft SQL Server or IBM AS 400.
Transaction Facility Fig. 1 is a block diagram illustrating an exemplary network-based transaction facility in the form of an Internet-based report generation facility 10. The report generation facility 10 includes one or more of a number of types of front-end servers, namely page servers 12 that deliver web pages (e.g., markup language reports), picture servers 14 that dynamically deliver images to be displayed within Web pages, CGI (Common Gateway Interface) servers 18 that provide an intelligent interface to the back-end of facility 10, and search servers 20 that handle search requests to the facility 10. E-mail servers 21 provide, jilter alia, automated e-mail communications to users of the facility 10. The back-end servers may include a database engine server 22, a search index server 24 and a payment (e.g., credit card and/or subscription) database server 26, each of which may maintain and facilitate access to a respective database.
Facility 10 may also include an administrative application server 28 configured to provide various administrative functions.
The network-based report generation facility 10 may be accessed by a client program 30, such as a browser (e.g., the Internet Explorer distributed by Microsoft) that executes on a client machine 32 and accesses the facility 10 via a network such as, for example, the Internet 34. Other examples of networks that a client may utilize to access the report generation facility 10 include a wide area network (WAN), a local area network (LAN), a wireless network (e.g., a cellular network), or the Plain Old Telephone Service (POTS) network.
Database Structure Fig. 2 is a database diagram illustrating an exemplary database 23, maintained by and accessed via the database engine server 22, which at least partially implements and supports the report generation facility 10. The database 23 may, in one embodiment, be implemented as a relational database, and includes a number of tables having entries, or records, that are linked by indices and keys. In an alternative embodiment, the database 23 may be implemented as collection of objects in an object-oriented database.
As shown, central to the database 23 is a user table 40, which contains a record for each user of the report generation facility 10. The database 23 also includes tables 42 that may be linked to the user table 40. Specifically, the tables 42 may include source material (e.g., source document) content table 44, archive table 45, translation table 46, custom report tables 50 and 51, and version table 52. A user record in the user table 40 may be linked to multiple reports that are being, or have been, generated via the facility 10 and for which records exist within the report tables 42. The database 23 also includes a comment table 48 populatable with comment records that may be linked to one or more report records within the report tables 42 and/or to one or more user records within the user table 40, Each comment record is within the table 48 may include, jitter alia, a comment, description, history or other information pertaining to a report being generated via the report generation facility 10. A number of other tables may also be linked to the user table 40, such as an accounts table 56, an account balances table 58 and a transaction record table 60.
Fig. 3 is a diagrammatic representation of an exemplary embodiment of the user table 40 that is populated with records, or entries, for each user of the report generation facility 10.
As shown, table 40 includes a user identifier column 61 that stores a unique identifier for each user. A name column 63 may store a first name, a middle initial and a last name for each user.
An address column 65 may store full address information and/or other contact information for each user, e.g. a street name and number, city, zip code, state, email address, etc. A phone number column 67 stores a home phone number for each user. A subscription status column 69 may store, for each user, a value identifying the user's subscription status. That is, different values may be assigned to indicate whether a user has a currently valid subscription, has an expired subscription (and which provides limited access to facility 10), and/or is accessing the facility on a pay-as-you-go basis such as via credit card, etc. It will be appreciated that any information other than that described above may populate the user table 40 without loss of generality.
Figs. 4A-4F are diagrammatic representations of exemplary embodiments of tables 44, 45, 46, 48, 50, 51 and 52 which are populated with document/report records during use of the report generation facility 10. Referring now to Fig. 4A, source material (document) table 44 is configured to store the content of source documents. This table includes a report ID column 60 to identify the particular source document, a Version ID column 62 to track versions, an ID column 64 to provide a unique reference to each section of the document, a Type column 66 which provides the section type, such as text, heading, table, figure etc. for each section. For extended file types, such as PDF, etc., this column may be used to provide the file type.
Content column 68 is populated with the actual content (e.g., XML) of the particular section.
Any text capable of being extracted from particular file types such as PDF / PPT / XLS, i.e., from the particular page / slide / worksheet, respectively, is maintained in this column 68 for use by the search and translate functions, etc., as will be described in greater detail hereinbelow. This Content column 68 may also include variables to further define the source of the section content, including author, publisher, publication dates, copyright notices and terms of use.
Turning now to Fig. 4B, content archive table 45 enables the system to track multiple versions of the same document. This table 45 is substantially similar to content table 44, except that a ReferencedlD column 70 is added, and content column 68 may be populated as NULL, in which case the ReferencedlD column 70 maps to a previous entry with a non-null (e.g., XML) content. So in the event the (XML) content is unchanged relative to a previous version, column 70 is populated with the ID (from column 64) of the previous version of the particular section. The content archive table can be used to restore prior versions of a report, by retrieving all the XML content for that ReportlD and VersionlD. Where the XML content is NULL, this means it's unchanged from previous versions, and the system 10 uses the ReferencediD to identify the original text in an older version.
As shown in Fig. 4C, the translation table 46 is used to store the content that has been translated into foreign languages. This table includes an ID column 64, a language column 72 to identify the particular language of the translated section, and a translation source column 74. Column 74 is used to track the translator, such as for quality control and/or to facilitate the upgrade of machine translated text to human-translated text. A translation column 76 is populated with the translated content, and the retranslate column 78 may be populated by a user to flag text that may have been poorly translated, for re-translation by a professional.
Referring to Fig. 4D, the comments table 48, as mentioned above, may be used to retain users' comments on particular sections of documents. Table 48 includes a path column which, instead of using an ID 64 to link to the content table 44, is populated with a path of heading titles (e.g., filenames) to point to the correct section. Although IDs 64 may be used to point the comments, use of the heading title paths effectively sticks the comments to sections even in the event IDs change in a revised document. (In this regard, IDs 64 may be created automatically by the database as new rows are inserted in the various tables. If a document is updated then original content may be removed after the new content has been inserted and the io differences tracked, as discussed hereinabove. Thus, the IDs 64 for material used in a particular section may change, which illustrates the advantage of using heading title paths to link comments to a particular section.) UserID column 82 is used to identify the user who inserted a particular comment. Privacy column 84 is used to identify the scope of the comment, e.g., for private use, company-wide use, or public use. Comment column 86 is used to store the text of the actual comment. Position column 88 may be used to facilitate use in connection with extended file formats such as PDF, PPT, XLS, etc. This column 88 may thus be used to retain the co-ordinates of where the user inserted the comment on the particular page, slide, worksheet, etc. Custom report tables 50 and 51, as shown in Fig. 4E and 4F, allow users to rearrange document content into their own customized reports. Table 50 includes a CustomlD column which is populated with the ID of the particular user-generated custom report. UserID and Privacy columns 82 and 84 are provided as described above. Title column 92 identifies the
title of the custom report.
Custom report table 51 is a custom detail table which includes a Path column 94 used to provide a pointer to each particular section (e.g., to the particular location in the table storing the selected content) used in the custom report. Note that in the example shown, Path column 94 stores a pointer in the form of a file path to the particular section of interest.
However, any suitable pointer may be used, including for example, the section ID 64 of table 44. A Range column 96 is used to facilitate use of content in extended file formats such as PDF, PPT, XLS, etc., to retain the co-ordinates of the selected content on the particular page, slide, worksheet, etc. As shown, this is usually a range defining the top left and the bottom right coordinates of a rectangle bordering the selection, As shown in Fig. 4G, version table 52 is used to map version numbers to particular dates in time. Table 52 includes a ReportlD column 60, a VersionNumber column 97, a Date column 98, and an UploadedByUserlD column 99.
Report Generation Process As mentioned hereinabove, embodiments of the present invention provide an automated system and method for generating reports from source documents of various types/formats, including text (e.g., DOC), image, PDF, PPT, XLS, and other formats. These embodiments enable such report generation in a real-time, web-based, client-server environment, such as on a subscription or pay-as-you-go basis. While the present invention is discussed within the environment of the exemplary report generation facility 10, it will readily be appreciated that the present invention may be used in any number of environments including network and on-line based transaction facilities in business-to-business, business-to-consumer and consumer-to-consumer applications.
Fig. 5 is a simplified block diagram of a system 90 for generating reports in accordance with an exemplary embodiment of the present invention. In this embodiment, a client computer 92 is coupled to a transaction computer 98 via a communications network (e.g. a wide area network) 94. The client computer 92 represents a device that allows a user to interact with the report generation facility 10 or any other transaction facility 98. In one embodiment, the client computer 92 presents to the user a report generation interface for searching and aggregating content within one or more libraries of source documents associated with the transaction computer 98.
The transaction computer 98, which supports a report generation facility such as shown at 10 of Fig. 1, handles transactions between various participants of the facility 10 including the user of the client computer 92. In one embodiment, the transaction computer 98 may initially receive the personal information of the participant from the client computer 92, and generate a subscription result which determines whether, and to what extent, the user is granted access to the facility 10. The transaction computer then facilitates the generation of custom reports in accordance with various user interfaces presented by the computer 98, via the client computer 92, to the user.
Fig. 6 shows a series 100 of interfaces/modules, such as may take the form of a series of objects (or methods), that may be implemented by the report generation facility 10, e.g., in combination with the various tables of database 23, for the purposes of generating one or more custom reports. The series 100 of interface modules shown in Fig. 6 will be described with reference to exemplary representations of the various interfaces as shown in Figs. 8-11.
As shown, series 100 includes a login module 102, configured to generate a login interface through which a user of the facility 10 provides at least a user identifier and associated password. The user may also be requested to pay a fee for the subscription process.
A source document module 103 is configured to generate a source document interface, such as shown at 200 in Fig. 8. Module 103 interacts with the various tables of database 23 (Fig. 1) to enable interface 200 to display various source documents in an interactive format.
In particular, a collapsible document tree 202 is provided to enable a user to open sections, or to highlight the location of search matches. Function tabs 204 are provided to search within the document, extract the figures, tables, compare sections with split screen, translate or cut your own' personal report. A toolbox 206, generated by a tool handler 104, is provided for operations involving the currently displayed content -e.g. extract to WordTM, PowerPoint® (Microsoft), Excel®, prepare a printer friendly version, or email it to a colleague. Within the displayed document, interface 200 provides function buttons 208, also via tool handler 104, to copy individual sections (e.g., to the user's personal library) or to add comments that can be shared with other users. In particular embodiments, source document module 103 and interface 200 are configured to handle source documents in text file formats, such as DOC or ASCII, etc., while source documents in other formats, such as images (figures), XLS, PPT, PDF, etc., may be handled in conjunction with other modules, such as handlers 105, 112, 114 and 116, as discussed hereinbelow.
Returning to Fig. 6, figure handler 105 is configured to generate a figure interface such as shown at 210 of Fig. 9. This exemplary figure interface 210 illustrates how the storage format of the various tables facilitates flexible reproduction of the original document. In the example shown, the user is only interested in figures, so the system, via interface 210 has identified all the figures within a particular source document, e.g., shown in thumbnail form at 221, with the highlighted thumbnail shown enlarged in the center of the interface 210. As with all of the user interfaces shown and described herein, toolbox 206 and buttons 208 are provided, via tool handler 104, to enable the user to comment on figures, or save to Office or the user's own online personal library etc. Translation handler 106 is configured to generate a translation interface such as shown at 212 of Fig. 10. Translation handler 106 operates in conjunction with the aforementioned translation table 46 to display foreign translations, e.g., side-by-side with the corresponding sections of the source document, as shown. It is noted that handler 106 checks for a copy of the translated section in the database 23. If the translation is not found, then the system requests the section be translated from an external translation service. In particular embodiments, this happens in real time and the translation interface 212 displays a please wait icon to the user. The translation for each section may be displayed as soon as it's received from the external system. The translated section may then be saved in the database 23, so that the translation will be shown immediately if the same translation is requested by another user in the future.
User search module 108 is configured to generate a search interface, such as shown at 214 in Fig. 11. The document storage format embodied in the various tables described above, enables the search module 108 to implement a relatively precise search, based on keywords entered via this interface 214. As shown, search results are displayed, including relevant sections within a document. The interface 214 may then directly access the relevant sections by highlighting and using toolbox 206.
The custoni report module 110 is configured to generate a custom report interface such as shown at 216 of Fig. 12. As shown, the sections selected via the various interfaces shown and described herein, may be displayed by this custom report interface 216. This interface thus displays the user's custom report, which includes the various sections copied from the aforementioned source documents. In the particular embodiment shown, the personal report is presented with page tear' graphics to differentiate the sections obtained from different source documents. The source information for the original source document and/or section, as tracked in the aforementioned tables, is also displayed. In addition to tracking the source information for the source document / Section, custom report module 110 also includes a version control module 111. The version control table 111 uses the aforementioned version tables to enable a user, e.g., via interface 216, to display particular versions of a custom report.
It is noted that this custom report interface 216 also enables the user to perform the same functions with this custom report, as they can perform with the various source documents, such as add comments, translate, copy sections and share with colleagues, etc. PDF Handler 112 is configured to generate a PDF Handler interface, such as shown at 220 in Fig. 13. The PDF Handler 112 splits PDF documents into pages 222, and saves each separately as: (i) a high resolution image; (ii) a single page PDF; and through the use of conventional text extraction and/or optical character recognition tools, (iii) text in the aforementioned database tables. This enables the PDF content to be viewed and used in similar ways to the other documents on the system. In particular, the search module 108 may use the text in the database to present matches that link to the relevant page within a PDF document. The page text stored in the database tables may also be accessed by the toolbox 206 when the user seeks to extract the text to other (e.g., Microsoft OfficeTl) formats and/or translation. Other capabilities include enabling the user to comment on individual pages as described above. In addition, the high resolution image may be resized and inserted into the users' online personal reports and Office documents. The individual PDF pages may also be assembled when the user wants to save multiple pages to a new PDF file.
It is noted that the foregoing functionality, while providing for such capability as text extraction and manipulation, still provides for maintaining and using PDF files in their native format. In this regard, PDF Handler 112 is configured to permit the user, via interface 220, to draw a rectangle 224 to select page sections they would like to use (e.g., operate extraction functions on). This rectangle may then be manipulated (e.g., extracted to Microsoft OfficeTM files), using toolbox 206 as shown. In this regard, PDF Handler 112 will capture and populate the coordinates of the rectangle(s), along with the filename of the image (or PDF file), to the table(s) of database 23. Thus, when saving to the user's personal library, only the co-ordinates and the image filename need to be stored in the tables.
PPT Handler 114 is configured to generate a PPT Handler interface, such as shown at 226 in Fig. 14. As shown, PowerPoint® (Microsoft) files are integrated in much the same way as PDFs, except that each slide 228 of the PPT files (rather than each page of the PDF files) are saved as individual PPT files (rather than as PDF files).
Excel® (XL) Handler 116 is configured to generate an Excel® (XL) Handler interface, such as shown at 230 of Fig. 15. As shown, handler 116 provides a mechanism, via interface 230, in combination with the tables (e.g., table 51) of database 23, for saving XL files and integrating these with the substantially the same search, personal library, translation and save to Office' functions as described above for documents in other formats. The interface 230 also provides mechanisms to filter the table content and save these views to the personal library, etc. A method for automated document research, aggregation, and compilation, in a client-server environment, using a network-based transaction facility, such as the report generation facility 10, will now be described as illustrated by the flow chart of Figs. 7A and 7B. As shown in Fig. 7A, the method 700 commences with communicating 708 user interface information to a user of the transaction facility at client 32 (Fig. 1). More specifically, the user interface information may provide a login interface via login module 102, described above with reference to FIG. 6.
Subsequent to the login by the user, at 710 the user is provided with a search interface, such as shown and described with respect to Fig. 8. At 712, in response to instructions received via the search interface, the facility 10 searches and sorts selections of interest within a plurality of source documents. At 714, the facility generates a custom report interface displayable to the client. At 716, in response to user selection instructions received via the custom report interface, the facility extracts and aggregates the selections of interest into a customized report. As may be required in response to the user selection instructions, at 718, the facility enables documents in the PDF format to be searched, sorted, extracted and aggregated into the customized report. At 720 and 722, the facility provides similar functionality for documents in the PPT and XLS formats, respectively. At 724, the facility 10 optionally tracks and outputs various versions of the custom report.
Turning now to Fig. 7B, various optional steps shown as method 702 include retaining 730 the user selection instructions in order to regenerate the desired version of the custom report. At 732, the facility stores and indexes a plurality of source documents in a plurality of native formats including text (e.g., DOC or XML), PDF, PPT, and XLS. (It is noted that this step 732 may be effected independently of any particular user selection instructions, and typically occurs prior thereto, for an entire library of source documents.) At 734, the facility stores each of the plurality of source documents in a plurality of sub-document portions. At 736, source information is stored for each of the plurality of sub-document portions. At 738, the stored source information may include the author, publisher, publication date, copyright notice, and terms of use of the sub-document portions. At 740, the facility stores the plurality of source documents in one or more tables populated with the sub-document portions, source information, and with coordinate information for location of the sub-document portions within its source document. Figures are stored as sub-document portions of the source document at 742. At 744, 746 and 748, the facility respectively generates PDF, PPT, and XLS interfaces, for capturing user selection instructions associated with searching and sorting source document selections of interest in these formats. At 750, the facility outputs the custom report in any of a plurality of formats, such as XML, DOC, PDF, PPT, XLS, and substantially any io other desired format. In this regard, the various embodiments of the invention are not limited to the particular formats shown and described herein, but rather, the embodiments shown and described herein may be used with substantially any file formats without departing from the scope of the invention. The facility generates, at 752, a tool interface displayable to the client for capturing user comments to be linked to individual sub-document portions. At 754, the tool interface permits a plurality of users to add comments to the custom report.
In summary, it will be appreciated that the above described interfaces, and underlying technologies, provide a convenient vehicle for document research, aggregation, and compilation, in a real-time, multi-user collaborative environment using a seamlessly integrated transaction facility.
Fig. 16 shows a diagrammatic representation of a machine in the exemplary form of a computer system 300 within which a set of instructions, for causing the machine to perform any one of the methodologies discussed above, may be executed. In alternative embodiments, the machine may include a network router, a network switch, a network bridge, Personal Digital Assistant (PDA), a cellular telephone, a web appliance or any machine capable of executing a sequence of instructions that specify actions to be taken by that machine.
The computer system 300 includes a processor 302, a main memory 304 and a static memory 306, which communicate with each other via a bus 308. The computer system 300 may further include a video display unit 310 (e.g., a liquid crystal display (LCD), plasma, cathode ray tube (CRT), etc.). The computer system 300 may also include an alpha-numeric input device 312 (e.g., a keyboard or touchscreen), a cursor control device 314 (e.g.,a mouse), a drive (e.g., disk, flash memory, etc.,) unit 316, a signal generation device 320 (e.g., a speaker) and a network interface device 322.
The drive unit 316 includes a computer-readable medium 324 on which is stored a set of instructions (i.e., software) 326 embodying any one, or all, of the methodologies described above. The software 326 is also shown to reside, completely or at least partially, within the main memory 304 and/or within the processor 302. The software 326 may further be transmitted or received via the network interface device 322. For the purposes of this specification, the term "computer-readable medium" shall be taken to include any medium that is capable of storing or encoding a sequence of instructions for execution by the computer and that cause the computer to perform any one of the methodologies of the present invention.
The term "computer-readable medium" shall accordingly be taken to include, but not be limited to, solid-state memories, optical and magnetic disks, and carrier wave signals.
Thus, a method and apparatus for generating custom reports in a network-based transaction facility have been described. Although the present invention has been described with reference to specific exemplary embodiments, it will be evident that various modifications and changes may be made to these embodiments without departing from the broader spirit and scope of the invention. Accordingly, the specification and drawings are to be regarded in an illustrative rather than a restrictive sense.
Furthermore, embodiments of the present invention include a computer program code-based product, which includes a computer readable storage medium having program code stored therein which can be used to instruct a computer to perform any of the functions, methods and/or modules associated with the present invention. The computer storage medium includes any of, but not limited to, the following: CD-ROM, DVD, magnetic tape, optical disc, hard drive, floppy disk, ferroelectric memory, flash memory, ferromagnetic memory, optical storage, charge coupled devices, magnetic or optical cards, smart cards, EEPROM, EPROM, RAM, ROM, DRAM, SRAM, SDRAM, and/or any other appropriate static or dynamic memory or data storage devices.
It should be noted that the various modules and other components of the embodiments discussed hereinabove may be configured as hardware, as computer readable code stored in any suitable computer usable medium, such as ROM, RAM, flash memory, phase-change memory, magnetic disks, etc., and/or as combinations thereof, without departing from the scope of the present invention.
It should be further understood that any of the features described with respect to one of the embodiments described herein may be similarly applied to any of the other embodiments described herein without departing from the scope of the present invention.
In the preceding specification, the invention has been described with reference to specific exemplary embodiments for the purposes of illustration and description. It is not intended to be exhaustive or to limit the invention to the precise form disclosed. Many modifications and variations are possible in light of this disclosure. It is intended that the scope of the invention be limited not by this detailed description, but rather by the claims appended hereto.
The above systems are implemented in various computing environments. For example, the present invention may be implemented on a conventional IBM PC or equivalent, multi-nodal system (e.g., LAN) or networking system (e.g., Internet, WWW, wireless web). All programming and data related thereto are stored in computer memory, static or dynamic or non-volatile, and may be retrieved by the user in any of: conventional computer storage, display (e.g., CRT, flat panel LCD, plasma, etc.) and/or hardcopy (i.e., printed) formats. The programming of the present invention may be implemented by one skilled in the art of computer systems and/or software design.

Claims (52)

  1. CLAIMSA computer implemented system in a client-server environment, for document research, aggregation, and compilation, the system comprising: a server including: a search module configured to generate a search interface displayable to a client communicably coupled to the server, and responsive to instructions received via the search interface, to search and sort selections of interest within a plurality of source documents; a customized report module configured to generate a custom report interface displayable to the client, and responsive to user selection instructions received via the custom report interface, to extract and aggregate the selections of interest into a customized report; a PDF module configured to enable source documents in the PDF format to be searched, sorted, and aggregated by the search module and customized report module; a PPT module configured to enable source documents in the PPT format to be searched, sorted, and aggregated by the search module and customized report module; and an XLS module configured to enable source documents in the XLS format to be searched, sorted, and aggregated by the search module and customized report module.
  2. 2. The system of claim 1, wherein the customized report module is configured to extract and aggregate selections of interest disposed in one or more text formats.
  3. 3. The system of claim 1, wherein the search, customized report, PDF, PPT, and XLS modules are all disposed on computer readable media.
  4. 4. The system of claim 1, comprising a version control module in operative engagement with the customized report module, the version control module configured to track the version of the custom report generated by the customized report module, and, responsive to instructions received from the client, output a desired version of the custom report to the client.
  5. 5. The system of claim 4, wherein the version control module is configured to retain the user selection instructions in order to regenerate the desired version of the custom report to the client.
  6. 6. The system of claim 4, comprising a source document module configured to store and index a plurality of source documents in a plurality of native formats including DOC, PDF, PPT, and XLS.
  7. 7. The system of claim 6, wherein the customized report module is configured to extract and aggregate the selections of interest into a customized report while maintaining the original native formats of the selections of interest.
  8. 8. The system of claim 7, wherein the source document module is configured to store each of the plurality of source documents in a plurality of sub-document portions.
  9. 9. The system of claim 8, wherein the source document module is configured to store source information for each of the plurality of sub-document portions.
  10. 10. The system of claim 9, wherein the source information is selected from the group consisting of author, publisher, publication date, copyright notice, and terms of use.
  11. 11. The system of claim 9, wherein the source document module is configured to store the plurality of source documents in one or more tables populated with the sub-document portions, source information, and with coordinate information for location of the sub-document portions within its source document.
  12. 12. The system of claim 11, comprising a figure handler configured to generate a figure interface displayable to the client, and responsive to instructions received via the figure interface to search and sort selections of interest in the form of figures within the plurality of source documents.
  13. 13. The system of claim 11, comprising a PDF handler configured to generate a PDF interface displayable to the client, and responsive to instructions received via the PDF interface to search and Sort selections of interest in the PDF format within the plurality of source documents.
  14. 14. The system of claim 11, comprising a PPT handler configured to generate a PPT interface displayable to the client, and responsive to instructions received via the PPT interface to search and sort selections of interest in the PPT format within the plurality of source documents.
  15. 15. The system of claim 11, comprising an XIS handler configured to generate an XLS interface displayable to the client, and responsive to instructions received via the XIS interface to search and sort selections of interest in the XIS format within the plurality of source documents.
  16. 16. The system of claim 11, comprising a translation handler configured to generate a translation interface displayable to the client, and responsive to instructions received via the translation interface to provide language translation for selections of interest within the plurality of source documents.
  17. 17. The system of claim ii, wherein the customized report module is configured to output the custom report in any of a plurality of formats.
  18. 18. The system of claim 17, wherein the plurality of formats is selected from the group consisting of XML, DOC, PDF, PPT, XLS, and combinations thereof.
  19. 19. The system of claim 11, comprising a tool module configured to generate a tool interface displayable to the client, and responsive to instructions received from the client, link user comments to individual sub-document portions.
  20. 20. The system of claim 19, wherein the tool module is configured to permit the custom report to be displayed to, and comments to be added by, a plurality of clients.
  21. 21. A method for automated document research, aggregation, and compilation, in a client-server environment, the method comprising: (a) generating, with a server computer, a search interface displayable to a client communicably coupled to the server; (b) searching and sorting, with the server computer in response to instructions received via the search interface, selections of interest within a plurality of source documents; (c) generating, with the server computer, a custom report interface displayable to the client; (d) extracting and aggregating, with the server computer in response to user selection instructions received via the custom report interface, the selections of interest into a customized report; (e) enabling, with the server computer in response to user selection instructions, documents in the PDF format to be searched, sorted, extracted and aggregated into the customized report; (I) enabling, with the server computer in response to user selection instructions, documents in the PPT format to be searched, sorted, extracted and aggregated into the customized report; and (g) enabling, with the server computer in response to user selection instructions, documents in the XIS format to be searched, sorted, extracted and aggregated into the customized report.
  22. 22. The method of claim 21, wherein said extracting and aggregating (d) comprises extracting and aggregating selections of interest disposed in one or more text formats.
  23. 23. The method of claim 21, comprising (h) tracking, with the server computer, versions of the custom report, and outputting, in response to instructions received from the client, a desired version of the custom report.
  24. 24. The method of claim 23, wherein said tracking (h) comprises retaining the user selection instructions in order to regenerate the desired version of the custom report.
  25. 25. The method of claim 23, comprising storing and indexing, with the server computer, a plurality of source documents in a plurality of native formats including text, PDF, PPT, and XLS.
  26. 26. The system of claim 25, wherein said extracting and aggregating (d) is effected while maintaining the original native formats of the selections of interest.
  27. 27. The method of claim 26, comprising storing, with the server computer, each of the plurality of source documents in a plurality of sub-document portions.
  28. 28. The method of claim 27, comprising translating, with the server computer, in response to instructions received from the client, a plurality of the sub-document portions.
  29. 29. The method of claim 27, comprising storing, with the server computer, source information for each of the plurality of sub-document portions.
  30. 30. The method of claim 29, comprising storing, with the server computer, source information selected from the group consisting of author, publisher, publication date, copyright notice, and terms of use.
  31. 31. The method of claim 29, comprising storing, with the server computer, the plurality of source documents in one or more tables populated with the sub-document portions, source information, and with coordinate information for location of the sub-document portions within its source document.
  32. 32. The method of claim 31, wherein said storing comprises storing figures as sub-document portions of the source document.
  33. 33. The method of claim 31, comprising generating, with the server computer, a PDF interface displayable to the client for capturing user selection instructions associated with searching and sorting selections of interest in the PDF format within the plurality of source documents.
  34. 34. The method of claim 31, comprising generating, with the server computer, a PPT interface displayable to the client for capturing user selection instructions associated with searching and sorting selections of interest in the PPT format within the plurality of source documents.
  35. 35. The method of claim 31, comprising generating, with the server computer, an XLS interface displayable to the client for capturing user selection instructions associated with searching and sorting selections of interest in the XLS format within the plurality of source documents.
  36. 36. The method of claim 31, comprising generating, with the server computer, a translation interface displayable to the client for capturing user selection instructions associated with language translation of selections of interest within the plurality of source documents.
  37. 37. The method of claim 31, comprising outputting, with the server computer, the custom report in any of a plurality of formats.
  38. 38. The method of claim 37, comprising outputting, with the server computer, the custom report in a plurality of formats selected from the group consisting of XML, DOC, PDF, PPT, XLS, and combinations thereof.
  39. 39. The method of claim 31, comprising generating, with the server computer, a tool interface displayable to the client for capturing user comments to be linked to individual sub-document portions.
  40. 40. The method of claim 39, comprising permitting, with the server computer via the tool interface, a plurality of users to add comments to the custom report.
  41. 41. An article of manufacture for automated document research, aggregation, and compilation, in a client-server environment, said article of manufacture comprising: a computer usable medium having a computer readable program code embodied therein, said computer usable medium having computer readable program code for: generating, with a server computer, a search interface displayable to a client communicably coupled to the server; searching and sorting, with the server computer in response to instructions received via the search interface, selections of interest within a plurality of source documents; generating, with the server computer, a custom report interface displayable to the client; extracting and aggregating, with the server computer in response to user selection instructions received via the custom report interface, the selections of interest into a customized report; enabling, with the server computer in response to user selection instructions, documents in the PDF format to be searched, sorted, extracted and aggregated into the customized report; enabling, with the server computer in response to user selection instructions, documents in the PPT format to be searched, sorted, extracted and aggregated into the customized report; and enabling, with the server computer in response to user selection instructions, documents in the XLS format to be searched, sorted, extracted and aggregated into the customized report.
  42. 42. A computer implemented system in a client-server environment, for document research, aggregation, and compilation, the system comprising: a server including: a search module configured to generate a search interface displayable to a client communicably coupled to the server, and responsive to instructions received via the search interface, to search and Sort selections of interest within a plurality of source documents; a customized report module configured to generate a custom report interface displayable to the client, and responsive to user selection instructions received via the custom report interface, to extract and aggregate the selections of interest into a customized report; and a plurality of format modules, wherein: each of the format modules is configured to enable corresponding source documents to be searched, sorted, and aggregated using the search module and customized report module; and a format of the source documents corresponding to a particular format module is different to a format of the source documents corresponding to another of the format modules.
  43. 43. A system according to claim 42, wherein the number of format modules in the plurality of format modules is three.
  44. 44. A system according to claim 42 or 43, wherein the format of the source documents that correspond to one of the format modules is selected from the following list of formats: XML, DOC, PDF, PPT, XLS.
  45. 45. A method for automated document research, aggregation, and compilation, in a client-server environment, the method comprising: (a) generating, with a server computer, a search interface displayable to a client communicably coupled to the server; (b) searching and sorting, with the server computer in response to instructions received via the search interface, selections of interest within a plurality of source documents; (c) generating, with the server computer, a custom report interface displayable to the client; (d) extracting and aggregating, with the server computer in response to user selection instructions received via the custom report interface, the selections of interest into a customized report; (e) enabling, with the server computer in response to user selection instructions, documents to be searched, sorted, extracted and aggregated into the customized report using a plurality of format modules, wherein: each of the format modules is configured to enable corresponding source documents to be searched, sorted, and aggregated using the search module and customized report module; and a format of the source documents corresponding to a particular format module is different to a format of the source documents corresponding to another of the format modules.
  46. 46. A method according to claim 45, wherein the number of format modules in the plurality of format modules is three.
  47. 47. A method according to claim 45 or 46, wherein the format of the source documents that correspond to one of the format modules is selected from the following list of formats: XML, DOC, PDF, PPT, XLS.
  48. 48. A program or plurality of programs arranged such that when executed by a computer system or one or more processors it/they cause the computer system or the one or more processors to operate in accordance with the method of any of claims 45 to 47.
  49. 49. A machine readable storage medium storing a program or at least one of the plurality of programs according to claim 48.
  50. 50. A computer implemented system in a client-server environment, for document research, aggregation, and compilation, as described herein with reference to the accompanying figures.
  51. 51. A method for automated document research, aggregation, and compilation, in a client-server environment, as described herein with reference to the accompanying figures.
  52. 52. An article of manufacture for automated document research, aggregation, and compilation, in a client-server environment, as described herein with reference to the accompanying figures.
GB1010390A 2010-05-26 2010-06-22 Automated report generation system Withdrawn GB2480703A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
GB1010390A GB2480703A (en) 2010-05-26 2010-06-22 Automated report generation system

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US12/787,519 US8769392B2 (en) 2010-05-26 2010-05-26 Searching and selecting content from multiple source documents having a plurality of native formats, indexing and aggregating the selected content into customized reports
GB1010390A GB2480703A (en) 2010-05-26 2010-06-22 Automated report generation system

Publications (2)

Publication Number Publication Date
GB201010390D0 GB201010390D0 (en) 2010-08-04
GB2480703A true GB2480703A (en) 2011-11-30

Family

ID=42582738

Family Applications (1)

Application Number Title Priority Date Filing Date
GB1010390A Withdrawn GB2480703A (en) 2010-05-26 2010-06-22 Automated report generation system

Country Status (1)

Country Link
GB (1) GB2480703A (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114968455B (en) * 2022-04-29 2024-02-27 北京百度网讯科技有限公司 Report generation method and device of application interface and electronic equipment

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2001097085A1 (en) * 2000-06-09 2001-12-20 Thanh Ngoc Nguyen Method and apparatus for data collection and knowledge management
US20070043689A1 (en) * 2005-08-22 2007-02-22 International Business Machines Corporation Lightweight generic report generation tool
US20090234818A1 (en) * 2008-03-12 2009-09-17 Web Access Inc. Systems and Methods for Extracting Data from a Document in an Electronic Format

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2001097085A1 (en) * 2000-06-09 2001-12-20 Thanh Ngoc Nguyen Method and apparatus for data collection and knowledge management
US20070043689A1 (en) * 2005-08-22 2007-02-22 International Business Machines Corporation Lightweight generic report generation tool
US20090234818A1 (en) * 2008-03-12 2009-09-17 Web Access Inc. Systems and Methods for Extracting Data from a Document in an Electronic Format

Also Published As

Publication number Publication date
GB201010390D0 (en) 2010-08-04

Similar Documents

Publication Publication Date Title
US8769392B2 (en) Searching and selecting content from multiple source documents having a plurality of native formats, indexing and aggregating the selected content into customized reports
US9430470B2 (en) Automated report service tracking system and method
US8843814B2 (en) Automated report service tracking system and method
Stonebraker et al. Content integration for e-business
US20180060410A1 (en) System and method of applying globally unique identifiers to relate distributed data sources
US8914414B2 (en) Integrated repository of structured and unstructured data
US9009201B2 (en) Extended database search
Myllymaki Effective Web data extraction with standard XML technologies
Denoue et al. An annotation tool for Web browsers and its applications to information retrieval.
CA2587462C (en) System and method for an immutable identification scheme in a large-scale computer system
US7624114B2 (en) Automatically generating web forms from database schema
US8612431B2 (en) Multi-part record searches
US7509303B1 (en) Information retrieval system using attribute normalization
Berendsohn et al. OpenUp! Creating a cross-domain pipeline for natural history data
WO2011159660A2 (en) Concept-based data processing
GB2480703A (en) Automated report generation system
US20130024761A1 (en) Semantic tagging of user-generated content
Apps et al. Prototyping digital library technologies in zetoc
Wakimoto Scope of the library catalog in times of transition
Wyman Bringing US index standards into the modern age
US7349918B2 (en) Method and system for searching binary files
US20070250495A1 (en) Method and System For Accessing Referenced Information
de Boer Libris Canadiana: a review
Jin Accelerating queries for structured and unstructured data
Zhu Understanding OpenURL standard and electronic resources: effective use of available resources

Legal Events

Date Code Title Description
WAP Application withdrawn, taken to be withdrawn or refused ** after publication under section 16(1)