WO2018078614A1 - System and method for on-the-fly conversion of non-accessible online documents to accessible documents - Google Patents

System and method for on-the-fly conversion of non-accessible online documents to accessible documents

Info

Publication number
WO2018078614A1
Authority
WO
WIPO (PCT)
Prior art keywords
document
accessible
documents
script
conversion
Prior art date
Application number
PCT/IL2017/051147
Other languages
French (fr)
Inventor
David ELIAV
David ADI
Original Assignee
Doubledu Ltd
Priority date
Filing date
Publication date
Application filed by Doubledu Ltd
Priority to CA3041224A (CA3041224A1)
Priority to ES17864830T (ES2912650T3)
Priority to US16/345,656 (US11256776B2)
Priority to EP17864830.9A (EP3532956B8)
Priority to DK17864830.9T (DK3532956T3)
Publication of WO2018078614A1

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/958 Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking
    • G06F16/972 Access to data in other repository systems, e.g. legacy data or dynamic Web page generation
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/955 Retrieval from the web using information identifiers, e.g. uniform resource locators [URL]
    • G06F16/9558 Details of hyperlinks; Management of linked annotations
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/957 Browsing optimisation, e.g. caching or content distillation
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/10 Text processing
    • G06F40/103 Formatting, i.e. changing of presentation of documents
    • G06F40/106 Display of layout of documents; Previewing
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/10 Text processing
    • G06F40/12 Use of codes for handling textual entities
    • G06F40/14 Tree-structured documents
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/10 Text processing
    • G06F40/12 Use of codes for handling textual entities
    • G06F40/14 Tree-structured documents
    • G06F40/143 Markup, e.g. Standard Generalized Markup Language [SGML] or Document Type Definition [DTD]
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/10 Text processing
    • G06F40/12 Use of codes for handling textual entities
    • G06F40/151 Transformation

Definitions

  • the present invention relates to the field of Internet accessibility. More particularly, the invention relates to a system and method for making non-accessible online documents accessible to people with various disabilities.
  • the accessibility of web content is intended to allow access and use of web contents to people with disabilities, such as those suffering from visual and physical limitations.
  • a set of accessibility guidelines which is accepted worldwide is the Web Content Accessibility Guidelines (WCAG) 2.0, which in fact has become a standard for accessibility.
  • WCAG Web Content Accessibility Guidelines
  • the term "accessibility", when used throughout this document, relates to the act of making documents viewable or understandable by those suffering from disabilities that prevent them from reading these documents in their original form.
  • a website typically includes a main content which is provided in HTML format, external documents (such as PDF documents, WORD documents, excel documents, etc.) that are stored in their original format, and links to HTML pages that are located within the same website or within external websites.
  • external documents refers to documents that are stored in their original form within a website, while access to open said external documents becomes possible by means of suitable links or icons appearing within web pages of the website.
  • all said types of contents that are available within the website namely, both the HTML documents and the "external documents" must be accessible to both healthy people and to those people with disabilities.
  • a disability menu-ruler is provided within the homepage or each webpage of the website to enable a disabled user to adapt the display to his limitations.
  • the ruler includes tools that when activated enable the user: (a) to activate a reader sound that vocally reads the content, (b) to increase the size of the text letters; (c) to change the font or background color; (d) to vary the size of images; and more.
  • These tools are typically adapted to operate on the HTML pages.
  • said external documents i.e., those PDF, WORD, EXCEL, etc. documents
  • performance of a significant off-line manual preparation work is required on each of such documents to adapt it to react to the accessibility tools.
  • the following manual conversion operations must be performed on a regular PDF document in order to adapt it to react to an accessibility ruler:
  • the invention relates to a method for an automatic and on-the-fly conversion of website's non-accessible documents to respective documents in an accessible format, comprising: (a) inserting a first script within each web page of said website that contains one or more non accessible documents; (b) upon loading of a web page from said website to a user's browser, executing said first script, which in turn identifies all original links within said web page to non-accessible documents, said script also substitutes a respective alternative link for each of said original links respectively, each of said alternative links leads to an alternative address, respectively, within a conversion server; (c) upon clicking by a user of one of said alternative links, extracting by said conversion server the respective non-accessible document, and transferring the respective non accessible document to said conversion server; (d) converting said non- accessible document to a respective document in an HTML format; (e) adding to said HTML format document at least an accessibility ruler script, thereby creating an accessible document; (f) optionally adding one or more additional scripts to said accessible document; and (g
  • one of said additional scripts is a script which is common to all the converted documents, and which improves the layout of the document beyond the layout which is provided by said conversion to HTML format.
  • one of said additional scripts is a script which is specific to each converted document, and which resolves problems that are specific to the converted document.
  • said conversion server further comprises an internal database for storing each previously converted accessible document in HTML format, and wherein said accessible document is displayed at the user's browser without need for conversion.
  • the method further comprising searching said database to verify whether the database comprises said previously converted document.
  • said conversion to HTML format is performed within the conversion server.
  • said conversion to HTML format is performed within a third party server which is accessible by said conversion server.
  • each of said alternative links is provided in addition to the respective original link.
  • said non accessible documents are documents in a format selected from PDF, WORD, or Excel.
  • the invention also relates to a system for an automatic on the fly conversion of a non-accessible document at a website to an accessible document, which comprises: (A) a conversion server which in turn comprises: (a) means for inserting a first script within each web page of said website that contains one or more non accessible documents, which upon execution, identifies all original links within said web page to non- accessible documents, said script also provides a respective alternative link to each of said original links respectively, each said alternative links leads to an alternative address, respectively, within the conversion server; (b) means for extracting a respective non-accessible document from said website, to within said conversion server; a converter for converting each non-accessible document to a respective document in an HTML format; (c) means for adding to said HTML format document at least an accessibility ruler script, thereby creating an accessible document; (d) means for optionally adding one or more additional scripts to said accessible document; and (f) means for displaying said accessible document at the user's browser, while simultaneously executing said accessibility ruler script and said one or more additional scripts
  • said converter is located at a third party location.
  • Fig. 1 illustrates a flowchart describing the process for an automatic conversion of online documents from non-accessible form to accessible, according to an embodiment of the invention.
  • Fig. 2a shows a document before conversion
  • Fig. 2b shows the document of Fig. 2a after conversion by the invention to an accessible form, this accessible document comprises an accessibility ruler;
  • FIG. 3 schematically illustrates a general structure of the system of the invention.
  • Fig. 4 illustrates a structure of the system, according to an embodiment of the invention.
  • the present invention introduces a system and method for the automatic and on-the-fly conversion of external documents within a website, from their non-accessible form to their accessible form, and following this conversion, providing an alternative link within the website to the accessible document (in HTML format).
  • Fig. 1 illustrates in a flow-diagram form a process for converting each online document from its non-accessible form to an accessible form, according to an embodiment of the invention.
  • step 101, which is generally performed off-line (but could also be performed on-line)
  • a script containing several lines of code is inserted into the source code of a web page.
  • the script can be inserted at the head or body section of the web page's source code.
  • the script is automatically executed without requiring any further action by the user.
  • the script identifies all the links within the web page that direct to external documents, for example, PDF, WORD, EXCEL documents, etc.
  • each of said identified links is replaced to direct to a temporary alternative address, which is different from the original link. More specifically, the alternative address leads to a location requesting conversion of the file, rather than displaying the original (non-accessible) file to the user.
  • the process proceeds in step 104, where one of said links is clicked by the user.
  • a conversion request comprising the link to the original document is sent to an accessibility remote server associated with the present invention.
  • the remote server includes at least software for performing automatic conversion of documents from non-accessible format to an HTML accessible format, and a link database containing records of previously converted documents and their links.
  • step 106 and upon receipt of the request for conversion as sent in step 105, the remote server searches within its link-database for determining whether the link to the non-accessible document is already stored therein.
  • because the link database stores only links for external documents that have already been converted, the existence of the present link within said database indicates that a respective document in an accessible format already exists within the accessibility server. If, on the other hand, it is found in step 106 that the document has not been previously converted, in step 107 the original document is downloaded to the accessibility server and converted to an HTML format. In this manner, a respective accessible document which contains all the content of the non-accessible document is automatically created.
  • the accessible document is stored within the remote server, and the new address to the accessible file is stored within the link database, together with the original link for future reference.
  • an accessibility ruler script is embedded within the converted HTML document.
  • additional optional scripts are embedded within the document.
  • said additional scripts may include: (a) an optional generic script for correcting, in each document, issues that the converter to HTML has not resolved.
  • for example, the generic script may improve the "readability" of some of the document sections for a document sound reader utility; and (b) an optional unique script, which is specific to each document and enables correction of issues that are specific to this document only.
  • step 110 the document is now displayed within the user's browser.
  • Said scripts that are embodied within the displayed HTML document are activated, including the accessibility ruler (or tool-bar) with all its tools, the document sound reader, etc.
  • the accessibility ruler is well known in the art, and it enables the user to apply on the HTML document various accessibility features that meet the accessibility standards.
  • if step 106 finds that the document was previously converted, the previously converted document is extracted from a documents database within the remote server, and steps 108-110 are performed as described before. More specifically, once a specific document is converted to an accessible form, there is no necessity for an additional conversion, and the accessible document is simply displayed to the user in step 110. It should be noted that if the previously converted document includes the embedded scripts, there is no necessity to perform steps 108-109, and the document is immediately displayed in step 110. If, however, the previously converted document has been stored without the scripts, step 108 and, optionally, step 109 must be performed.
  • all the links are extracted, after which the links are sent to a remote server.
  • a link database is searched for documents that have previously been converted, according to the links received at the server.
  • all the documents that have been identified as not previously converted are downloaded to the server, and are converted from a non-accessible form to an accessible form.
  • each of the documents is assigned with a unique web address.
  • the links in the original web page are replaced to direct to a corresponding accessible document, as now existing within the accessibility server.
  • the conversion of a document from a non-accessible form to an accessible form is performed by a conversion-to-HTML tool known in the art, such as "PDF Converter Ultimate" - www.micropdf.com/, or "PDF online" - www.pdfonline.com.
  • the conversion-to-HTML tool may reside locally within the accessibility server, or remotely.
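The batch variant described in the preceding items (extract all links, check the link database, convert only what is missing, assign each document a unique address, then replace the links) can be sketched as follows. This is a minimal illustration under stated assumptions: `ACCESSIBLE_BASE`, `unique_address`, `preconvert`, and the `convert_and_store` callback are hypothetical names, and deriving the address from a hash of the link is one possible scheme that the source does not prescribe.

```python
import hashlib
from typing import Callable, Dict, Iterable

# Hypothetical base address of accessible documents on the accessibility server.
ACCESSIBLE_BASE = "https://accessibility.example.com/docs/"


def unique_address(original_link: str) -> str:
    """Derive a stable, unique web address from the original link (assumed scheme)."""
    digest = hashlib.sha256(original_link.encode("utf-8")).hexdigest()[:16]
    return f"{ACCESSIBLE_BASE}{digest}.html"


def preconvert(
    links: Iterable[str],
    link_db: Dict[str, str],
    convert_and_store: Callable[[str, str], None],
) -> Dict[str, str]:
    """Return a map from each original link to its accessible address.

    Only links missing from link_db are converted; link_db is updated in place.
    """
    replacements: Dict[str, str] = {}
    for link in links:
        if link not in link_db:
            address = unique_address(link)
            # Hypothetical callback: download, convert to HTML, store at address.
            convert_and_store(link, address)
            link_db[link] = address
        replacements[link] = link_db[link]
    return replacements
```

The returned mapping is what a page-side script would use to swap each original link for its accessible counterpart.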
  • the result of the conversion is an HTML file, which allows a display of the original content along with an accessibility ruler. Consequently, in addition to presenting a user-friendly document, this result may also allow a person with disabilities to use assistive technology, such as a screen reader or a braille display, to access an online document which previously existed within the website only in a non-accessible form.
  • the accessible document which is provided within the user's browser in fact comprises two sections, a layout section, and a dynamic section.
  • the layout comprises, among other things, the scripts 108 and 109 of Fig. 1.
  • the dynamic section contains the content, which is in fact identical to the content of the original non-accessible document, however in HTML format.
  • This section may include text, images, tables, or any other content presented within the document.
  • the accessibility of a document according to the invention includes various features, either visual or hidden, that conform to the guidelines defined in WCAG 2.0.
  • An example of such a feature is the ability to adjust the size of fonts within a document.
  • Another example is the ability to adjust the colors of a document for a color blind user, the background of the document, etc. All said features are activated by the ruler or the toolbar.
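One WCAG 2.0 criterion relevant to the color adjustments mentioned above is the contrast ratio between text and background, for which level AA requires at least 4.5:1 for normal-size text. The sketch below computes that ratio using the relative-luminance formula from WCAG 2.0. The source does not state how, or whether, the ruler checks contrast, so this is only an illustration of the guideline itself, and the function names are hypothetical.

```python
from typing import Tuple

RGB = Tuple[int, int, int]


def _linearize(channel: int) -> float:
    """Convert an 8-bit sRGB channel to linear light, per the WCAG 2.0 definition."""
    c = channel / 255
    return c / 12.92 if c <= 0.03928 else ((c + 0.055) / 1.055) ** 2.4


def relative_luminance(rgb: RGB) -> float:
    """Relative luminance of an sRGB color (0.0 for black, 1.0 for white)."""
    r, g, b = (_linearize(v) for v in rgb)
    return 0.2126 * r + 0.7152 * g + 0.0722 * b


def contrast_ratio(foreground: RGB, background: RGB) -> float:
    """WCAG contrast ratio, ranging from 1:1 (identical colors) to 21:1."""
    lighter, darker = sorted(
        (relative_luminance(foreground), relative_luminance(background)),
        reverse=True,
    )
    return (lighter + 0.05) / (darker + 0.05)


def meets_aa_normal_text(foreground: RGB, background: RGB) -> bool:
    """True when the pair satisfies the 4.5:1 minimum for normal-size text."""
    return contrast_ratio(foreground, background) >= 4.5
```

A high-contrast mode could, for instance, switch to a color pair for which `meets_aa_normal_text` returns true.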
  • Fig. 2a shows a typical document 20 that exists within a website in PDF format.
  • Fig. 2b shows a document 220, which is in fact the document 20 of Fig. 2a, after conversion into an accessible HTML format, as presented to the user.
  • an accessibility ruler 201 is displayed beside the accessible document 220. This ruler allows applying accessibility features to document 220.
  • Tab 202 allows enlargement of the font size of text in the document.
  • Tab 203 allows one to decrease the size of the font.
  • Tab 204 allows restoring the font to its original size.
  • Tab 205 allows adapting the document to visually impaired users by changing the colors of the text and the background to increase the contrast between the two.
  • Tab 206 allows adapting the document to color blind users by changing the colors of the document solely to black and white.
  • Tab 207 allows restoring the original features of the document.
  • Tab 208 allows sending an individual request to convert a document from non- accessible to accessible.
  • Tab 209 allows printing the original non-accessible document.
  • Tab 210 allows presenting an accessibility declaration.
  • Tab 211 allows a user to change the language of the ruler.
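The display-related tabs listed above (202 to 207) can be modeled as operations on a small piece of state that is rendered to CSS. The following is a sketch under assumptions: the class name, the 10% step, the size limits, and the specific CSS rules are all hypothetical, and grayscale filtering is used here only as an approximation of the "black and white" mode of tab 206.

```python
from dataclasses import dataclass

FONT_STEP = 10      # assumed step, in percent, per click on tab 202 or 203
MIN_FONT = 50       # assumed lower bound, in percent
MAX_FONT = 300      # assumed upper bound, in percent


@dataclass
class RulerState:
    font_percent: int = 100
    color_mode: str = "original"  # "original", "high_contrast", or "monochrome"

    def enlarge(self) -> None:          # tab 202
        self.font_percent = min(MAX_FONT, self.font_percent + FONT_STEP)

    def shrink(self) -> None:           # tab 203
        self.font_percent = max(MIN_FONT, self.font_percent - FONT_STEP)

    def reset_font(self) -> None:       # tab 204
        self.font_percent = 100

    def high_contrast(self) -> None:    # tab 205
        self.color_mode = "high_contrast"

    def monochrome(self) -> None:       # tab 206
        self.color_mode = "monochrome"

    def restore_all(self) -> None:      # tab 207
        self.font_percent = 100
        self.color_mode = "original"

    def to_css(self) -> str:
        """Render the current state as a single CSS rule for the document body."""
        rules = [f"font-size: {self.font_percent}%;"]
        if self.color_mode == "high_contrast":
            rules.append("color: #ffffff; background-color: #000000;")
        elif self.color_mode == "monochrome":
            rules.append("filter: grayscale(100%);")
        return "body { " + " ".join(rules) + " }"
```

Keeping the font size as an integer percentage avoids floating-point drift after repeated enlarge and shrink clicks.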
  • Fig. 3 schematically illustrates in general terms a system adapted to convert online documents from non-accessible form to an accessible form according to an embodiment of the invention.
  • User 301 is connected to a web page which contains a link, denoted in Fig. 3 by numeral 304, to a document which is hosted at server 302.
  • Link 304 is swapped with a link 305 that directs to an accessible version of the original document, namely a document in HTML format that is either hosted on accessibility server 303, or is converted on the fly by the accessibility server 303 into an accessible form.
  • Server 303 includes at least software for performing conversion of documents from non-accessible form to an accessible form, and a database containing records of previously converted documents and their parameters.
  • Fig. 4 schematically illustrates a block diagram of an accessibility server 303, for performing on-the-fly conversion of documents from a non-accessible form to an accessible form, according to an embodiment of the invention.
  • Accessibility server 303 comprises an input 401 for receiving links to documents for conversion; a link database 402 comprising a record of links to documents that were previously converted; a comparator 403 for comparing a link at input 401 to links in database 402; a document database 404 comprising documents that were previously converted from a non-accessible form to an accessible form; a document retriever 405 for downloading documents from their respective location (not shown) within the website; a document converter 406 for converting documents from a non-accessible form to an accessible form; a script adder 414 for adding the scripts 108 and 109 of Fig.
  • the accessibility server further includes internet communication means 408 and 409, for receiving links from a web page and for transmitting converted documents to a user.
  • the link is compared by comparator 403 to each of the links in link database 402. If the link is identical to a link from database 402, the corresponding HTML document is extracted from previously converted documents database 404. If the link is not identical to any of the links in database 402, document retriever 405 downloads the non-accessible document from its location within the website, and document converter 406 converts the document to HTML format. The converted document is sent (413) to document database 404 for future reference. Then, the script adder 414 adds the scripts 108 and 109 to the converted document (either from database 404 or optionally from document converter 406, as shown by arrow 417).
  • MUX 407 selects the accessible document either from input 419 or from input 421, and outputs the selected accessible document 410 to be displayed at the user's browser.
  • the website may present to the user two separate links for the same document: a first "not accessible" link for "regular" users who do not need accessibility features.
  • users who click on this link will open the regular document without activating the process of the present invention.
  • the other link will relate to an icon for the "accessible" document; upon clicking on it, the procedure of the present invention will be activated.
  • the present invention enables an automatic on the fly conversion of non-accessible documents to an accessible format. It has been found that a non-accessible document can be automatically converted into an accessible form within very few seconds, typically within less than 15 seconds. Moreover, the entire conversion process of the invention is fully automatic, in contrast to the conversion process of the prior art which requires a manual handling of each and every document.
  • the accessibility solution of the present invention is suitable for a wide range of devices, such as computers, mobile smartphones, electronic notebooks and other electronic devices that a person with disabilities may wish to use in order to display an electronic document. Additionally the solution of the present invention allows a person with disabilities to open any online document while being assured the document will be accessible, and compatible with other accessibility software, such as screen readers and braille displays.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Data Mining & Analysis (AREA)
  • Information Transfer Between Computers (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

The invention relates to a method for an automatic and on-the-fly conversion of a website's non-accessible documents to respective documents in an accessible format, comprising: (a) inserting a first script within each web page of said website that contains one or more non-accessible documents; (b) upon loading of a web page from said website to a user's browser, executing said first script, which in turn identifies all original links within said web page to non-accessible documents, said script also substituting a respective alternative link for each of said original links, each of said alternative links leading to a respective alternative address within a conversion server; (c) upon clicking by a user of one of said alternative links, extracting by said conversion server the respective non-accessible document, and transferring the respective non-accessible document to said conversion server; (d) converting said non-accessible document to a respective document in an HTML format; (e) adding to said HTML format document at least an accessibility ruler script, thereby creating an accessible document; (f) optionally adding one or more additional scripts to said accessible document; and (g) displaying said accessible document at the user's browser, while simultaneously executing said accessibility ruler script and said additional scripts, if any.

Description

Field of the Invention
The present invention relates to the field of Internet accessibility. More particularly, the invention relates to a system and method for making non-accessible online documents accessible to people with various disabilities.
Background of the Invention
The accessibility of web content is intended to allow people with disabilities, such as those suffering from visual and physical limitations, to access and use web content. A set of accessibility guidelines which is accepted worldwide is the Web Content Accessibility Guidelines (WCAG) 2.0, which in fact has become a standard for accessibility. The term "accessibility", when used throughout this document, relates to the act of making documents viewable or understandable by those suffering from disabilities that prevent them from reading these documents in their original form.
Typically, a website includes a main content which is provided in HTML format, external documents (such as PDF documents, WORD documents, excel documents, etc.) that are stored in their original format, and links to HTML pages that are located within the same website or within external websites. The term "external documents", as used herein, refers to documents that are stored in their original form within a website, while access to open said external documents becomes possible by means of suitable links or icons appearing within web pages of the website. In order to meet the accessibility standard, all said types of contents that are available within the website, namely, both the HTML documents and the "external documents" must be accessible to both healthy people and to those people with disabilities.
Various prior art solutions have been provided so far for adapting websites to meet said accessibility standards. For example, a disability menu-ruler is provided within the homepage or each webpage of the website to enable a disabled user to adapt the display to his limitations. The ruler, for example, includes tools that, when activated, enable the user: (a) to activate a sound reader that vocally reads the content; (b) to increase the size of the text letters; (c) to change the font or background color; (d) to vary the size of images; and more. These tools are typically adapted to operate on the HTML pages. In order to allow the ruler to operate on said external documents (i.e., those PDF, WORD, EXCEL, etc. documents), significant off-line manual preparation work is required on each such document to adapt it to react to the accessibility tools. For example, the following manual conversion operations must be performed on a regular PDF document in order to adapt it to react to an accessibility ruler:
a. Classification of titles within the non-accessible document;
b. Provision of alternative text for photographs, if any exist within the document;
c. Replacement of a font or font color, if necessary;
d. Insertion of definitions for accessibility screen readers;
e. Storing of the accessible document.
As is apparent, the prior art solutions for providing accessibility to external documents require significant manual work on each external document before the converted document can be uploaded to the website. Therefore, an owner of a website that contains many links to external documents is required to spend very significant time and resources in order to provide accessibility to all the documents included within his website.
In addition, in case of a manual conversion of documents (to become accessible), some documents within a frequently updated website may be overlooked, and may remain non-accessible even after the performance of the process to make the website accessible.
Furthermore, according to the prior art, each time a web page owner adds to the web page a link to a new external document, there is a need to ensure that the document becomes accessible, and this involves significant off-line manual work. Therefore, according to the prior art, the manual maintenance needed to keep the documents of a dynamically changing website accessible in fact never ends.
It is therefore an object of the present invention to eliminate the spending of a significant amount of manual work on each external document within a website in order to make it accessible.
It is another object of the present invention to provide a method and system for on-the-fly fully automatic conversion of each external document within the website, therefore to make all the documents, whether already existing within the website, or those newly introduced, to be immediately available in a form meeting the accessibility standards.
Other objects and advantages of this invention will become apparent as the description proceeds.
Summary of the Invention
The invention relates to a method for an automatic and on-the-fly conversion of a website's non-accessible documents to respective documents in an accessible format, comprising: (a) inserting a first script within each web page of said website that contains one or more non-accessible documents; (b) upon loading of a web page from said website to a user's browser, executing said first script, which in turn identifies all original links within said web page to non-accessible documents, said script also substituting a respective alternative link for each of said original links, each of said alternative links leading to a respective alternative address within a conversion server; (c) upon clicking by a user of one of said alternative links, extracting by said conversion server the respective non-accessible document, and transferring the respective non-accessible document to said conversion server; (d) converting said non-accessible document to a respective document in an HTML format; (e) adding to said HTML format document at least an accessibility ruler script, thereby creating an accessible document; (f) optionally adding one or more additional scripts to said accessible document; and (g) displaying said accessible document at the user's browser, while simultaneously executing said accessibility ruler script and said additional scripts, if any.
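Step (b) of the method, identifying links to non-accessible documents and substituting alternative links that lead to the conversion server, might look roughly like the sketch below. In the invention this logic runs as a script in the user's browser; Python is used here only to illustrate the logic. The endpoint URL, the `src` query parameter, the extension list, and all function names are assumptions that do not appear in the source.

```python
from html.parser import HTMLParser
from urllib.parse import quote, urljoin, urlparse

# Hypothetical conversion-server endpoint; the source does not name one.
CONVERSION_ENDPOINT = "https://accessibility.example.com/convert"
# Assumed file extensions for the PDF, WORD, and Excel formats named in the text.
DOCUMENT_EXTENSIONS = (".pdf", ".doc", ".docx", ".xls", ".xlsx")


class LinkCollector(HTMLParser):
    """Collects the href value of every <a> tag in a page."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self.hrefs.extend(v for k, v in attrs if k == "href" and v)


def is_external_document(url: str) -> bool:
    """True when the URL path ends with one of the assumed document extensions."""
    return urlparse(url).path.lower().endswith(DOCUMENT_EXTENSIONS)


def alternative_link(original_url: str) -> str:
    """Build an alternative address that carries the original link as a parameter."""
    return f"{CONVERSION_ENDPOINT}?src={quote(original_url, safe='')}"


def rewrite_links(page_html: str, page_url: str) -> str:
    """Replace each link to an external document with its alternative link."""
    collector = LinkCollector()
    collector.feed(page_html)
    for href in collector.hrefs:
        absolute = urljoin(page_url, href)
        if is_external_document(absolute):
            # Assumes double-quoted href attributes, which keeps the sketch short.
            page_html = page_html.replace(
                f'href="{href}"', f'href="{alternative_link(absolute)}"'
            )
    return page_html
```

Links to ordinary HTML pages are left untouched, so only requests for external documents reach the conversion server.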
In an embodiment of the invention, one of said additional scripts is a script which is common to all the converted documents, and which improves the layout of the document beyond the layout which is provided by said conversion to HTML format.
In an embodiment of the invention, one of said additional scripts is a script which is specific to each converted document, and which resolves problems that are specific to the converted document.
In an embodiment of the invention, said conversion server further comprises an internal database for storing each previously converted accessible document in HTML format, and said accessible document is displayed at the user's browser without need for conversion. In an embodiment of the invention, the method further comprises searching said database to verify whether the database comprises said previously converted document.
In an embodiment of the invention, said conversion to HTML format is performed within the conversion server.
In an embodiment of the invention, said conversion to HTML format is performed within a third party server which is accessible by said conversion server.
In an embodiment of the invention, each of said alternative links is provided in addition to the respective original link.
In an embodiment of the invention, said non-accessible documents are documents in a format selected from PDF, WORD, or Excel.
The invention also relates to a system for an automatic on-the-fly conversion of a non-accessible document at a website to an accessible document, which comprises: (A) a conversion server which in turn comprises: (a) means for inserting a first script within each web page of said website that contains one or more non-accessible documents, which upon execution, identifies all original links within said web page to non-accessible documents, said script also provides a respective alternative link to each of said original links respectively, each of said alternative links leads to an alternative address, respectively, within the conversion server; (b) means for extracting a respective non-accessible document from said website, to within said conversion server; (c) a converter for converting each non-accessible document to a respective document in an HTML format; (d) means for adding to said HTML format document at least an accessibility ruler script, thereby creating an accessible document; (e) means for optionally adding one or more additional scripts to said accessible document; and (f) means for displaying said accessible document at the user's browser, while simultaneously executing said accessibility ruler script and said one or more additional scripts, if any.
In an embodiment of the invention, said converter is located at a third party location.
Brief Description of the Drawings
In the drawings:
Fig. 1 illustrates a flowchart describing the process for an automatic conversion of online documents from non-accessible form to accessible, according to an embodiment of the invention.
Fig. 2a shows a document before conversion;
Fig. 2b shows the document of Fig. 2a after conversion by the invention to an accessible form, this accessible document comprises an accessibility ruler;
Fig. 3 schematically illustrates a general structure of the system of the invention; and
Fig. 4 illustrates a structure of the system, according to an embodiment of the invention.
Detailed Description of the Invention
The present invention introduces a system and method for the automatic and on-the-fly conversion of external documents within a website, from their non-accessible form to their accessible form, and following this conversion, providing an alternative link within the website to the accessible document (in HTML format).
Fig. 1 illustrates, in flow-diagram form, a process for converting each online document from its non-accessible form to an accessible form, according to an embodiment of the invention. In step 101, which is generally performed off-line (but could also be performed on-line), a script containing several lines of code is inserted into the source code of a web-page. The script can be inserted at the head or body section of the web-page's source code. As a result of this insertion, when the web-page is accessed by a user and loaded into his browser, the script is automatically executed without requiring any further action by the user. In step 102 the script identifies all the links within the web page that direct to external documents, for example, PDF, WORD, EXCEL documents, etc. In step 103, each of said identified links is replaced by a link that directs to a temporary alternative address, which is different from the original link. More specifically, the alternative address leads to a location requesting conversion of the file, rather than displaying the original (non-accessible) file to the user. The process proceeds to step 104, where one of said links is clicked by the user. Next, following said user's click, in step 105 a conversion request comprising the link to the original document is sent to a remote accessibility server associated with the present invention. The remote server includes at least software for performing automatic conversion of documents from a non-accessible format to an accessible HTML format, and a link database containing records of previously converted documents and their links.
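Steps 102 and 103 can be illustrated by a minimal sketch. In the described process the script runs inside the user's browser; the Python code below merely models the same link substitution on an HTML string. The endpoint address, the `src` query parameter, the list of file extensions, and all function names are assumptions made for illustration only and do not appear in the description.

```python
import re
from urllib.parse import quote, urljoin

# Hypothetical conversion-server endpoint; the description does not name one.
CONVERSION_ENDPOINT = "https://accessibility.example.com/convert"

# Assumed file extensions for the PDF, WORD and EXCEL documents of step 102.
DOCUMENT_EXTENSIONS = (".pdf", ".doc", ".docx", ".xls", ".xlsx")

# Matches double-quoted href attributes only; a simplification for the sketch.
HREF_PATTERN = re.compile(r'href="([^"]+)"', re.IGNORECASE)


def is_document_link(url: str) -> bool:
    """Step 102: decide whether a link directs to an external document."""
    path = url.split("?", 1)[0].split("#", 1)[0]
    return path.lower().endswith(DOCUMENT_EXTENSIONS)


def alternative_link(original_url: str, page_url: str) -> str:
    """Step 103: build an alternative address that requests conversion."""
    absolute = urljoin(page_url, original_url)
    return f"{CONVERSION_ENDPOINT}?src={quote(absolute, safe='')}"


def rewrite_links(html: str, page_url: str) -> str:
    """Replace each document link in the page with its alternative link."""
    def swap(match: re.Match) -> str:
        url = match.group(1)
        if not is_document_link(url):
            return match.group(0)  # leave ordinary links untouched
        return f'href="{alternative_link(url, page_url)}"'

    return HREF_PATTERN.sub(swap, html)
```

In this sketch the original link travels to the server as a query parameter, so that clicking the alternative link (step 104) already carries the conversion request of step 105. Other encodings of the request are equally possible.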
In step 106, upon receipt of the conversion request sent in step 105, the remote server searches its link database to determine whether the link to the non-accessible document is already stored therein. As the link database stores only links to external documents that have already been converted, the existence of the present link within said database indicates that a respective document in an accessible format already exists within the accessibility server. If, on the other hand, it is found in step 106 that the document has not been previously converted, in step 107 the original document is downloaded to the accessibility server and converted to an HTML format. In this manner, a respective accessible document which contains all the content of the non-accessible document is automatically created. Once converted, the accessible document is stored within the remote server, and the new address of the accessible file is stored within the link database, together with the original link, for future reference. In step 108 an accessibility ruler script is embedded within the converted HTML document. In step 109, additional optional scripts are embedded within the document. For example, said additional scripts may include: (a) an optional generic script for correcting, in each document, issues that the converter to HTML has not resolved (for example, the generic script may improve the "readability" of some of the document sections by a document sound reader utility); and (b) an optional unique script, which is specific to each document and which enables correction of issues that are specific to this document only. As the document is now accessible and comprises all the required scripts embedded within it, in step 110 the document is displayed within the user's browser.
Said scripts that are embedded within the displayed HTML document are activated, including the accessibility ruler (or tool-bar) with all its tools, the document sound reader, etc. The accessibility ruler (or tool-bar) is well known in the art, and it enables the user to apply to the HTML document various accessibility features that meet the accessibility standards.
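The embedding of steps 108 and 109 can be sketched as follows. The sketch assumes that each script is delivered as an external file referenced by a `<script>` tag placed just before the closing `</body>` tag; the description does not specify how the scripts are embedded, and the script addresses and the function name below are hypothetical.

```python
import re
from typing import Iterable

# Hypothetical script addresses; none are given in the description.
RULER_SCRIPT_URL = "https://accessibility.example.com/ruler.js"
GENERIC_SCRIPT_URL = "https://accessibility.example.com/generic-fixes.js"

BODY_CLOSE = re.compile(r"</body\s*>", re.IGNORECASE)


def embed_scripts(html: str, script_urls: Iterable[str]) -> str:
    """Insert one <script> tag per address before the last </body> tag.

    If the converted document has no </body> tag, the tags are appended
    at the end of the document instead.
    """
    tags = "".join(f'<script src="{url}"></script>' for url in script_urls)
    matches = list(BODY_CLOSE.finditer(html))
    if not matches:
        return html + tags
    position = matches[-1].start()
    return html[:position] + tags + html[position:]
```

For example, `embed_scripts(converted_html, [RULER_SCRIPT_URL, GENERIC_SCRIPT_URL])` would add the ruler script (step 108) followed by one optional generic script (step 109).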
If, however, it is determined in step 106 that the document has previously been converted, the previously converted document is extracted from a documents database within the remote server, and steps 108-110 are performed as described before. More specifically, once a specific document has been converted to an accessible form, there is no need for an additional conversion, and the accessible document is simply displayed to the user in step 110. It should be noted that if the previously converted document already includes the embedded scripts, there is no need to perform steps 108-109, and the document is immediately displayed in step 110. If, however, the previously converted document has been stored without the scripts, step 108 must be performed, together with the optional step 109.
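The decision logic of steps 106-110 can be summarized in a short sketch. The class, its field names, and the document-identifier scheme are hypothetical; the downloading, converting, and script-adding operations are passed in as plain functions so that the sketch does not assume any particular conversion tool.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict


@dataclass
class AccessibilityServer:
    """Illustrative model of the remote server; all names are assumptions."""
    download: Callable[[str], bytes]         # retrieves the original document
    convert_to_html: Callable[[bytes], str]  # step 107 conversion
    add_scripts: Callable[[str], str]        # steps 108-109
    store_with_scripts: bool = True          # how converted documents are kept
    link_db: Dict[str, str] = field(default_factory=dict)      # link -> id
    document_db: Dict[str, str] = field(default_factory=dict)  # id -> HTML

    def handle_request(self, original_link: str) -> str:
        """Return the accessible HTML to be displayed in step 110."""
        doc_id = self.link_db.get(original_link)  # step 106: search link database
        if doc_id is not None:
            stored = self.document_db[doc_id]
            # Scripts are added again only if the document was stored without them.
            return stored if self.store_with_scripts else self.add_scripts(stored)

        # Step 107: not converted before, so download and convert.
        html = self.convert_to_html(self.download(original_link))
        accessible = self.add_scripts(html)
        doc_id = f"doc-{len(self.link_db) + 1}"
        self.document_db[doc_id] = accessible if self.store_with_scripts else html
        self.link_db[original_link] = doc_id
        return accessible
```

Under these assumptions a second request for the same link is answered from the document database without a new download or conversion, which is the behaviour described above for previously converted documents.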
According to another embodiment of the invention, after identifying all the links to external documents on a web page, i.e. after step 102, all the links are extracted and sent to a remote server. A link database is searched for documents that have previously been converted, according to the links received at the server. Next, all the documents that have been identified as not previously converted are downloaded to the server and converted from a non-accessible form to an accessible form. Next, each of the documents is assigned a unique web address. Finally, each link in the original web-page is replaced by a link that directs to the corresponding accessible document, as now existing within the accessibility server.
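This alternative embodiment, in which all the documents of a page are converted ahead of any click, can be sketched as follows. Deriving the unique web address from a hash of the original link is an assumption of the sketch, as are the base address and the function names; the description only requires that each document receive a unique address.

```python
import hashlib
from typing import Callable, Dict, Iterable

# Hypothetical base address for accessible documents on the server.
ACCESSIBLE_BASE = "https://accessibility.example.com/docs/"


def unique_address(original_link: str) -> str:
    """Assign a web address derived from the original link (an assumption)."""
    digest = hashlib.sha256(original_link.encode("utf-8")).hexdigest()[:16]
    return f"{ACCESSIBLE_BASE}{digest}.html"


def preconvert_links(
    links: Iterable[str],
    link_db: Dict[str, str],
    convert_and_store: Callable[[str, str], None],
) -> Dict[str, str]:
    """Convert every document not yet in the link database.

    Returns a mapping from each original link to the address of its
    accessible version, to be used for replacing the links in the page.
    """
    replacements: Dict[str, str] = {}
    for link in links:
        if link not in link_db:
            address = unique_address(link)
            convert_and_store(link, address)  # download, convert, save
            link_db[link] = address
        replacements[link] = link_db[link]
    return replacements
```

The returned mapping corresponds to the final step of this embodiment, in which the links in the original web-page are replaced.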
According to an embodiment of the invention, the conversion of a document from a non-accessible form to an accessible form is performed by a conversion-to-HTML tool known in the art, such as "PDF Converter Ultimate" - www.micropdf.com/, or "PDF online" - www.pdfonline.com. The conversion-to-HTML tool may reside locally within the accessibility server, or remotely. As noted, the result of the conversion is an HTML file, which allows a display of the original content along with an accessibility ruler. Consequently, in addition to presenting a user-friendly document, this result may also allow a person with disabilities to use assistive technology, such as a screen reader or a braille display, in order to receive, in an accessible form, an online document which previously existed within the website in a non-accessible form.
The accessible document which is provided within the user's browser in fact comprises two sections: a layout section and a dynamic section. The layout section comprises, among others, the scripts of steps 108 and 109 of Fig. 1.
The dynamic section contains the content, which is in fact identical to the content of the original non-accessible document, however in HTML format. This section may include text, images, tables, or any other content presented within the document.
According to the invention, the accessibility of a document includes various features, either visible or hidden, that conform to the guidelines defined in WCAG 2.0. An example of such a feature is the ability to adjust the size of fonts within a document. Another example is the ability to adjust the colors of a document for a color-blind user, the background of the document, etc. All said features are activated by the ruler or the toolbar.
Fig. 2a shows a typical document 20 that exists within a website in PDF format. Fig. 2b shows a document 220, which is in fact the document 20 of Fig. 2a after conversion into an accessible HTML format, as presented to the user. More specifically, an accessibility ruler 201 is displayed beside the accessible document 220. This ruler allows applying accessibility features to document 220. Tab 202 allows enlarging the font size of text in the document. Tab 203 allows decreasing the size of the font. Tab 204 allows restoring the font to its original size. Tab 205 allows adapting the document to visually impaired users by changing the colors of the text and the background to increase the contrast between the two. Tab 206 allows adapting the document to color-blind users by changing the colors of the document solely to black and white. Tab 207 allows restoring the original features of the document. Tab 208 allows sending an individual request to convert a document from non-accessible to accessible. Tab 209 allows printing the original non-accessible document. Tab 210 allows presenting an accessibility declaration. Tab 211 allows a user to change the language of the ruler.
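The display-related tabs 202-207 can be modelled as operations on a small state record. The sketch below is illustrative only: the step size, the lower bound on the font size, the toggle behaviour of tabs 205 and 206, and all names are assumptions, and an actual ruler would apply the state to the displayed HTML document inside the browser.

```python
from dataclasses import dataclass

FONT_STEP = 10   # assumed change per click, in percent of the original size
MIN_FONT = 50    # assumed lower bound, in percent


@dataclass
class RulerState:
    """Display settings controlled by tabs 202-207 of the ruler."""
    font_percent: int = 100
    high_contrast: bool = False
    black_and_white: bool = False

    def enlarge_font(self) -> None:            # tab 202
        self.font_percent += FONT_STEP

    def decrease_font(self) -> None:           # tab 203
        self.font_percent = max(MIN_FONT, self.font_percent - FONT_STEP)

    def restore_font(self) -> None:            # tab 204
        self.font_percent = 100

    def toggle_high_contrast(self) -> None:    # tab 205
        self.high_contrast = not self.high_contrast

    def toggle_black_and_white(self) -> None:  # tab 206
        self.black_and_white = not self.black_and_white

    def restore_all(self) -> None:             # tab 207
        self.font_percent = 100
        self.high_contrast = False
        self.black_and_white = False
```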
Fig. 3 schematically illustrates in general terms a system adapted to convert online documents from non-accessible form to an accessible form according to an embodiment of the invention. User 301 is connected to a web page which contains a link, denoted in Fig. 3 by numeric 304, to a document which is hosted at server 302. Link 304 is swapped with a link 305 that directs to an accessible version of the original document, namely a document in HTML format that is either hosted on accessibility server 303, or is converted on the fly by the accessibility server 303 into an accessible form. Server 303 includes at least software for performing conversion of documents from non-accessible form to an accessible form, and a database containing records of previously converted documents and their parameters.
Fig. 4 schematically illustrates a block diagram of an accessibility server 303 for performing on-the-fly conversion of documents from a non-accessible form to an accessible form, according to an embodiment of the invention. Accessibility server 303 comprises an input 401 for receiving links to documents for conversion; a link database 402 comprising a record of links to documents that were previously converted; a comparator 403 for comparing a link at input 401 to links in database 402; a document database 404 comprising documents that were previously converted from a non-accessible form to an accessible form; a document retriever 405 for downloading documents from their respective locations (not shown) within the website; a document converter 406 for converting documents from a non-accessible form to an accessible form; a script adder 414 for adding the scripts of steps 108 and 109 of Fig. 1 into the converted document (depending on the structure of the documents stored within the document database 404, the script adder may also add the scripts to previously converted documents that are stored within database 404); an output 410 for transmitting a converted HTML document to the user's browser; and a multiplexer (MUX) 407 for determining the origin of the HTML document which is to be presented at output 410. The accessibility server further includes internet communication means 408 and 409, for receiving links from a web-page and for transmitting converted documents to a user.
When a conversion request comprising a link arrives at input 401, the link is compared by comparator 403 to each of the links in link database 402. If the link is identical to a link in database 402, the corresponding HTML document is extracted from the previously converted documents database 404. If the link is not identical to any of the links in database 402, document retriever 405 downloads the non-accessible document from its location within the website, and document converter 406 converts the document to HTML format. The converted document is sent (413) to document database 404 for future reference. Then, script adder 414 adds the scripts of steps 108 and 109 to the converted document (received either from database 404 or, optionally, from document converter 406, as shown by arrow 417). MUX 407 selects the accessible document either from input 419 or from input 421, and outputs the selected accessible document at output 410 to be displayed at the user's browser.
It should be noted that the website may present to the user two separate links for the same document. The first is a "not accessible" link for "regular" users who do not need accessibility features; clicking this link opens the regular document without activating the process of the present invention. The second is an icon for the "accessible" document; clicking it activates the procedure of the present invention.
As shown, the present invention enables an automatic on-the-fly conversion of non-accessible documents to an accessible format. It has been found that a non-accessible document can be automatically converted into an accessible form within a few seconds, typically in less than 15 seconds. Moreover, the entire conversion process of the invention is fully automatic, in contrast to the conversion process of the prior art, which requires manual handling of each and every document.
The accessibility solution of the present invention is suitable for a wide range of devices, such as computers, mobile smartphones, electronic notebooks and other electronic devices that a person with disabilities may wish to use in order to display an electronic document. Additionally, the solution of the present invention allows a person with disabilities to open any online document while being assured that the document will be accessible and compatible with other accessibility software, such as screen readers and braille displays.
As various embodiments have been described and illustrated, it should be understood that variations will be apparent to one skilled in the art without departing from the principles herein. Accordingly, the invention is not to be limited to the specific embodiments described and illustrated in the drawings.

Claims

1. A method for an automatic and on-the-fly conversion of website's non-accessible documents to respective documents in an accessible format, comprising:
a. inserting a first script within each web page of said website that contains one or more non accessible documents;
b. upon loading of a web page from said website to a user's browser, executing said first script, which in turn identifies all original links within said web page to non-accessible documents, said script also substitutes a respective alternative link for each of said original links respectively, each of said alternative links leads to an alternative address, respectively, within a conversion server;
c. upon clicking by a user of one of said alternative links, extracting by said conversion server the respective non-accessible document, and transferring the respective non-accessible document to said conversion server;
d. converting said non-accessible document to a respective document in an HTML format;
e. adding to said HTML format document at least an accessibility ruler script, thereby creating an accessible document;
f. optionally adding one or more additional scripts to said accessible document; and
g. displaying said accessible document at the user's browser, while simultaneously executing said accessibility ruler script and said additional scripts, if exist.
2. The method of claim 1, wherein one of said additional scripts is a script which is common to all the converted documents, and which improves the layout of the document beyond the layout which is provided by said conversion to HTML format.
3. The method of claim 1, wherein one of said additional scripts is a script which is specific to each converted document, and which resolves problems that are specific to the converted document.
4. The method of claim 1, wherein said conversion server further comprises an internal database for storing each previously converted accessible document in HTML format, and wherein said accessible document is displayed at the user's browser without need for conversion.
5. The method of claim 4, further comprising searching said database to verify whether the database comprises said previously converted document.
6. A method according to claim 1, wherein said conversion to HTML format is performed within the conversion server.
7. A method according to claim 1, wherein said conversion to HTML format is performed within a third party server which is accessible by said conversion server.
8. A method according to claim 1, wherein each of said alternative links is provided in addition to the respective original link.
9. A method according to claim 1, wherein said non accessible documents are documents in a format selected from PDF, WORD, or Excel.
10. A system for an automatic on the fly conversion of a non accessible document at a website to an accessible document, which comprises:
A. a conversion server which in turn comprises:
a. means for inserting a first script within each web page of said website that contains one or more non accessible documents, which upon execution, identifies all original links within said web page to non-accessible documents, said script also provides a respective alternative link to each of said original links respectively, each said alternative links leads to an alternative address, respectively, within the conversion server;
b. means for extracting a respective non-accessible document from said website, to within said conversion server;
c. a converter for converting each non-accessible document to a respective document in an HTML format;
d. means for adding to said HTML format document at least an accessibility ruler script, thereby creating an accessible document;
e. means for optionally adding one or more additional scripts to said accessible document; and
f. means for displaying said accessible document at the user's browser, while simultaneously executing said accessibility ruler script and said one or more additional scripts, if exist.
11. System according to claim 10, wherein said converter is located at a third party location.
PCT/IL2017/051147 2016-10-31 2017-10-18 System and method for on-the-fly conversion of non-accessible online documents to accessible documents WO2018078614A1 (en)

Priority Applications (5)

Application Number Priority Date Filing Date Title
CA3041224A CA3041224A1 (en) 2016-10-31 2017-10-18 System and method for on-the-fly conversion of non-accessible online documents to accessible documents
ES17864830T ES2912650T3 (en) 2016-10-31 2017-10-18 System and method for on-the-fly conversion of non-accessible online documents to accessible documents
US16/345,656 US11256776B2 (en) 2016-10-31 2017-10-18 System and method for on-the-fly conversion of non-accessible online documents to accessible documents
EP17864830.9A EP3532956B8 (en) 2016-10-31 2017-10-18 System and method for on-the-fly conversion of non-accessible online documents to accessible documents
DK17864830.9T DK3532956T3 (en) 2016-10-31 2017-10-18 SYSTEM AND METHOD FOR FLYING CONVERSION OF NOT AVAILABLE ONLINE DOCUMENTS TO AVAILABLE DOCUMENTS

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
IL248651 2016-10-31
IL248651A IL248651B2 (en) 2016-10-31 2016-10-31 System and method for on-the-fly conversion of non-accessible online documents to accessible documents

Publications (1)

Publication Number Publication Date
WO2018078614A1 (en) 2018-05-03

Family

ID=62023183

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IL2017/051147 WO2018078614A1 (en) 2016-10-31 2017-10-18 System and method for on-the-fly conversion of non-accessible online documents to accessible documents

Country Status (7)

Country Link
US (1) US11256776B2 (en)
EP (1) EP3532956B8 (en)
CA (1) CA3041224A1 (en)
DK (1) DK3532956T3 (en)
ES (1) ES2912650T3 (en)
IL (1) IL248651B2 (en)
WO (1) WO2018078614A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20200334411A1 (en) * 2019-04-22 2020-10-22 INNsight.com, Inc. Computer implemented accessibility systems and methods

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7010581B2 (en) * 2001-09-24 2006-03-07 International Business Machines Corporation Method and system for providing browser functions on a web page for client-specific accessibility
US20070211071A1 (en) * 2005-12-20 2007-09-13 Benjamin Slotznick Method and apparatus for interacting with a visually displayed document on a screen reader
US20080065649A1 (en) * 2006-09-08 2008-03-13 Barry Smiler Method of associating independently-provided content with webpages
US20110249284A1 (en) * 2010-04-09 2011-10-13 Actuate Corporation Automated assistive technology for the visually impaired
US20140180846A1 (en) * 2011-08-04 2014-06-26 Userfirst Automatic website accessibility and compatibility

Family Cites Families (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7194411B2 (en) * 2001-02-26 2007-03-20 Benjamin Slotznick Method of displaying web pages to enable user access to text information that the user has difficulty reading
US20030093565A1 (en) * 2001-07-03 2003-05-15 Berger Adam L. System and method for converting an attachment in an e-mail for delivery to a device of limited rendering capability
US7370269B1 (en) * 2001-08-31 2008-05-06 Oracle International Corporation System and method for real-time annotation of a co-browsed document
US9137324B2 (en) * 2002-04-10 2015-09-15 International Business Machines Corporation Capacity on-demand in distributed computing environments
US7398464B1 (en) * 2002-05-31 2008-07-08 Oracle International Corporation System and method for converting an electronically stored document
US20070255792A1 (en) * 2006-04-26 2007-11-01 Momail, Ab Method and apparatus for an email gateway
US7752575B2 (en) * 2007-02-06 2010-07-06 International Business Machines Corporation Attachment activation in screen captures
US20090144158A1 (en) * 2007-12-03 2009-06-04 Matzelle Brent R System And Method For Enabling Viewing Of Documents Not In HTML Format
US9176953B2 (en) * 2008-06-04 2015-11-03 Tianjin Sursen Investment Co., Ltd. Method and system of web-based document service
US20110258535A1 (en) * 2010-04-20 2011-10-20 Scribd, Inc. Integrated document viewer with automatic sharing of reading-related activities across external social networks
US8407314B2 (en) * 2011-04-04 2013-03-26 Damaka, Inc. System and method for sharing unsupported document types between communication devices
US9268753B2 (en) * 2011-10-24 2016-02-23 Apollo Education Group, Inc. Automated addition of accessiblity features to documents
US20150193389A1 (en) * 2012-03-06 2015-07-09 Google Inc. Presenting updated hyperlink information on a webpage

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP3532956A4 *

Also Published As

Publication number Publication date
EP3532956B1 (en) 2022-03-16
IL248651A0 (en) 2017-02-28
US11256776B2 (en) 2022-02-22
DK3532956T3 (en) 2022-05-02
EP3532956B8 (en) 2022-04-20
IL248651B2 (en) 2024-08-01
EP3532956A1 (en) 2019-09-04
ES2912650T3 (en) 2022-05-26
US20190278825A1 (en) 2019-09-12
IL248651B1 (en) 2024-04-01
EP3532956A4 (en) 2020-07-29
CA3041224A1 (en) 2018-05-03

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17864830

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 3041224

Country of ref document: CA

NENP Non-entry into the national phase

Ref country code: DE

ENP Entry into the national phase

Ref document number: 2017864830

Country of ref document: EP

Effective date: 20190531