WO2019005100A1 - Method and system to display content from a pdf document on a small screen - Google Patents

Method and system to display content from a pdf document on a small screen

Info

Publication number
WO2019005100A1
Authority
WO
WIPO (PCT)
Prior art keywords
text
viewer application
document
screen
content
Prior art date
Application number
PCT/US2017/040264
Other languages
French (fr)
Inventor
Søren D. THOMSEN
Anders H. MADSEN
Søren Vind
Mads Sejersen
Peter Assentorp
Original Assignee
Issuu, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Issuu, Inc.
Priority to PCT/US2017/040264
Publication of WO2019005100A1

Classifications

    • G PHYSICS
    • G09 EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
    • G09G ARRANGEMENTS OR CIRCUITS FOR CONTROL OF INDICATING DEVICES USING STATIC MEANS TO PRESENT VARIABLE INFORMATION
    • G09G5/00 Control arrangements or circuits for visual indicators common to cathode-ray tube indicators and other visual indicators
    • G09G5/36 Control arrangements or circuits for visual indicators common to cathode-ray tube indicators and other visual indicators characterised by the display of a graphic pattern, e.g. using an all-points-addressable [APA] memory
    • G09G5/37 Details of the operation on graphic patterns
    • G09G5/373 Details of the operation on graphic patterns for modifying the size of the graphic pattern
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01 Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048 Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0481 Interaction techniques based on graphical user interfaces [GUI] based on specific properties of the displayed interaction object or a metaphor-based environment, e.g. interaction with desktop elements like windows or icons, or assisted by a cursor's changing behaviour or appearance
    • G06F3/0483 Interaction with page-structured environments, e.g. book metaphor
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/10 Text processing
    • G06F40/103 Formatting, i.e. changing of presentation of documents
    • G06F40/106 Display of layout of documents; Previewing
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F2203/00 Indexing scheme relating to G06F3/00 - G06F3/048
    • G06F2203/048 Indexing scheme relating to G06F3/048
    • G06F2203/04806 Zoom, i.e. interaction techniques or interactors for controlling the zooming operation
    • G PHYSICS
    • G09 EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
    • G09G ARRANGEMENTS OR CIRCUITS FOR CONTROL OF INDICATING DEVICES USING STATIC MEANS TO PRESENT VARIABLE INFORMATION
    • G09G2340/00 Aspects of display data processing
    • G09G2340/04 Changes in size, position or resolution of an image
    • G09G2340/0407 Resolution change, inclusive of the use of different resolutions for different screen areas
    • G PHYSICS
    • G09 EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
    • G09G ARRANGEMENTS OR CIRCUITS FOR CONTROL OF INDICATING DEVICES USING STATIC MEANS TO PRESENT VARIABLE INFORMATION
    • G09G2340/00 Aspects of display data processing
    • G09G2340/04 Changes in size, position or resolution of an image
    • G09G2340/045 Zooming at least part of an image, i.e. enlarging it or shrinking it
    • G PHYSICS
    • G09 EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
    • G09G ARRANGEMENTS OR CIRCUITS FOR CONTROL OF INDICATING DEVICES USING STATIC MEANS TO PRESENT VARIABLE INFORMATION
    • G09G2340/00 Aspects of display data processing
    • G09G2340/14 Solving problems related to the presentation of information to be displayed
    • G09G2340/145 Solving problems related to the presentation of information to be displayed related to small screens
    • G PHYSICS
    • G09 EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
    • G09G ARRANGEMENTS OR CIRCUITS FOR CONTROL OF INDICATING DEVICES USING STATIC MEANS TO PRESENT VARIABLE INFORMATION
    • G09G2354/00 Aspects of interface with display user

Abstract

Roughly described, a viewer application is provided for viewing a PDF document on a screen of a device such as a mobile phone or tablet. The viewer application may operate in page mode or in text mode. In page mode the original layout is maintained, and navigation assistance is provided by use of a navigation pane indicating the contents of the screen with a superimposed frame. Display of the navigation pane is controllable by the user. In page mode a selected text column is scrolled and zoomed to optimize reading. In text mode, text is extracted from the document and reformatted in text view to be continuous and complete in correct reading order, and images and advertising may be excluded. The user may toggle between page mode and text mode. The viewer application is implemented in software to be executed by a processor on the device.

Description

METHOD AND SYSTEM TO DISPLAY CONTENT FROM A PDF DOCUMENT ON A SMALL SCREEN
CROSS REFERENCE TO APPLICATION
[0001] This application claims priority under 35 U.S.C. § 119(e) to U.S. Provisional Application No. 62/287,130, entitled "METHOD AND SYSTEM TO DISPLAY CONTENT FROM A PDF DOCUMENT ON A SMALL SCREEN", filed on 26 January 2016, by Søren D. Thomsen, Anders H. Madsen, Søren Vind, Mads Sejersen, Peter Assentorp, which application is incorporated herein by reference in its entirety.
BACKGROUND
[0002] With the rise of electronic publishing, more and more readers are viewing published documents, such as popular magazines, newspapers, trade and scientific journals and the like on electronic devices. These documents are generally made available to electronic publishers as portable document format (PDF) files and are formatted for print, rather than for electronic viewing. Navigation in an electronic document described in a PDF file can be cumbersome, particularly on an electronic device having a small screen, such as a mobile phone or a small tablet.
SUMMARY
[0003] The technology disclosed herein relates to a system and method to view PDF documents on screens of smaller devices, such as mobile phones and small tablets, specifically when the size of the screen renders reading difficult or impossible when an entire page is displayed.
[0004] A viewer application for viewing content from a document defined in a PDF file on a small screen is described herein. The document has an original layout and comprises at least one content area and at least one content unit, wherein the viewer application executes the following steps: Analyzing the document to: (a) identify content areas of the document of the body text type, (b) correlate each body text type content area with a content unit, (c) identify a correct reading order for the body text type content areas of each content unit; displaying a current page of the document in its original layout; and in response to reader selection of a body text type content area of one of the at least one content units, providing navigation means allowing the user to read the content unit in correct reading order.
[0005] Analyzing the document may comprise analyzing the font size or case of the initial letters or word of the content area; or detecting and deciphering continuation guidance within a content area. For example, during the analyzing step, a content area is determined not to be of the body text type if it contains ten words or fewer, or if it is identical to a text entry in a table of contents.
[0006] A viewer application for viewing a document on a screen of a device is described herein. In some embodiments, the document includes a current page, the current page having an original layout, wherein the viewer application may select between at least two modes, the modes comprising: page mode, wherein the original layout of the current page is preserved; and text mode, wherein body text is extracted from the document and reformatted in a text view, wherein the document is described in a PDF file, and wherein the viewer application is implemented as software code portions. A user of the viewer application can toggle between page mode and text mode. The original layout may include multiple content areas, for example including at least two types of content areas from the group consisting of titles, subtitles, captions, body text, and images.
[0007] The device may be a mobile phone or a tablet. In some embodiments, in text mode, the body text is extracted from a content unit and is reformatted to be continuous and complete in correct reading order. The content unit may be a magazine article or newspaper article. In some embodiments, in text mode, no images are included in the text view. In some embodiments, in page mode, a navigation pane displays the current page with a superimposed frame indicating current contents of the screen, and the superimposed frame can be moved by a user to change the current contents of the screen.
[0008] In embodiments, the screen has a width and a top, and, in page mode, in response to a user tapping within an area of a first text column having a first line, a width of the first text column is zoomed to the width of the screen, and the first text column is positioned with its first line at the top of the screen. The first text column may include a first segment of a content unit, wherein a next icon appears at the bottom of the first text column, and wherein, in response to the user selecting the next icon, a second text column including a next segment of the content unit is displayed on the screen.
[0009] A viewer application for viewing a document on a screen of a device is described, wherein the document includes a current page, wherein the current page has an original layout comprising multiple content areas, wherein the multiple content areas include at least two types of content areas from the group consisting of titles, subtitles, captions, body text, pull quotes, images, and graphics, wherein, responsive to a text mode command from a user, body text is extracted from the document and reformatted to be continuous, complete and in correct reading order, and wherein the document is described in a PDF file. The body text may be extracted from a content unit, such as a magazine article or newspaper article. The text mode command from the user may be issued by a tap or double-tap on the screen. In some embodiments, the document may comprise one or more additional pages, each additional page having a respective original layout, and, responsive to a page mode command from the user, the viewer application may display at least a portion of the original layout of the current page or of an additional page. Correct reading order is determined by information in the PDF file, and may be determined based on rules. The rules may be compiled by a process of machine learning.
[0010] A viewer application for viewing a document on a screen of a device is described. The document includes a current page, wherein the current page has an original layout comprising multiple content areas, wherein the multiple content areas include at least two types of content areas from the group consisting of titles, subtitles, captions, body text, pull quotes, images, and graphics, wherein the current page is displayed in its original layout and a navigation pane displays the current page with a superimposed frame indicating current contents of the screen, and wherein the document is described in a PDF file. The device may be a mobile phone or a tablet. In some embodiments, the superimposed frame can be moved by a user to change the current contents of the screen, and a user can toggle display of the navigation pane off and on. In some embodiments, the screen has a width and a top and, in page mode, in response to a user tapping within an area of a first text column having a first line, the first text column is zoomed to the width of the screen and positioned with its first line at the top of the screen.
[0011] In some embodiments, the first text column includes a first segment of a content unit, wherein a next icon appears at the bottom of the first text column, and, in response to the user selecting the next icon, a second text column including a next segment of the content unit is displayed on the screen.
BRIEF DESCRIPTION OF THE DRAWINGS
[0012] Fig. 1a illustrates a full page of a document displayed on a large screen.
[0013] Fig. 1b illustrates the same full page displayed on a device having a small screen.
[0014] Fig. 2a illustrates the same page zoomed to readable size on a small screen.
[0015] Fig. 2b shows the area of the full page displayed on the screen in Fig. 2a.
[0016] Fig. 3 shows an example PDF document.
[0017] Fig. 4a shows the document of Fig. 3 displayed on a small screen in an embodiment of the present invention.
[0018] Fig. 4b shows the document of Fig. 3 displayed on a small screen, a text column zoomed and aligned to the screen according to an embodiment of the present invention.
[0019] Fig. 5 is a simplified block diagram of a computer system 110 that can be used to implement software incorporating aspects of the present invention.
DETAILED DESCRIPTION
[0020] To standardize the appearance of printed documents across different devices and operating systems, PDF was developed in the 1990s, and has become the standard in print publishing.
[0021] For online publishing, documents such as magazines and newspapers are generally provided in PDF documents. Using its PDF definition, a document's online appearance can be the same as its print version, which is typically optimized for US letter format. On a screen of suitable size and resolution, a user can display an entire page or even two facing pages of a document, and can interact with the online version just as he or she would the print version. A screen is of suitable size and resolution for an electronic document if, when the page is displayed in its entirety on the screen, all text is comfortably readable to a user. Generally, for example, the screen of a conventional desktop computer is of suitable size and resolution to view a standard-size magazine page.
[0022] Difficulties arise, however, when viewing a document on a small screen, such as the screen of a mobile phone, small tablet, small laptop or palmtop computer. Fig. 1a shows a full page of document 10 displayed on a desktop screen 20. Fig. 1b shows a full page of the same document 10 displayed, in its original layout, on small screen 30. The body text of document 10 on small screen 30 is too small to be readable. Text is considered too small to be readable when a person having normal eyesight, without any visual aid, cannot readily discern text characters when using the device in a typical manner while held at a typical reading distance from the eyes. For the purposes of this discussion a "small screen" is a screen too small for body text to be readable when an entire page of a document is displayed on the screen.
[0023] In order for the text of document 10 to be readable, document 10 may be zoomed so that only a portion of a page of document 10 is visible on small screen 30. Fig. 2a shows the appearance of small screen 30, and in Fig. 2b, frame 40 indicates the portion of document 10 shown in small screen 30. The user then becomes responsible for selecting a suitable level of zoom and for scrolling to the correct location to display text, in this case first text column 12. Once the user has finished reading the contents of text column 12, the user must scroll to find the following text, in this case at the top of text column 14. Such navigation can be cumbersome.
[0024] This example has described the difficulties that arise when viewing a document like a standard-size magazine on a small screen like a mobile phone. Similar difficulties could arise when viewing a very large document on a desktop or large tablet or laptop screen.
[0025] Aspects of the present invention provide tools to assist the user in navigating in a document when the document is viewed on a device having a screen too small to allow at least some text in a document to be read when an entire page is displayed on the screen. Aspects of the invention include a viewer, implemented in software configured to be run by a processor on the device.
[0026] Turning to Fig. 3, consider a document 50. Only a first page of document 50 is shown in Fig. 3, though there may be multiple pages. Document 50 is described in a PDF file. The PDF file provides the size and the location of multiple content areas, and the text or image associated with each. A content area is a contiguous area on a single page, generally rectangular, enclosing text or an image. Generally in a content area enclosing text, the text is all or nearly all of the same font, including style and size. The first page of document 50, displayed in Fig. 3, includes content areas 52, 54, 56, 58, 60, 62, 64, and 66. Standard PDF does not, however, identify what the content areas are, or their relationship to each other.
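As an illustration of the information described above, a content area can be modelled as a rectangle on a page with optional text and a type that is unknown until analysis assigns one. This is a minimal sketch only: the class name, field names, units and coordinate convention are assumptions made for illustration and are not taken from the PDF format or from the viewer.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ContentArea:
    """Hypothetical model of one rectangular content area on a page."""
    area_id: int
    page: int
    x: float        # left edge, in page units
    y: float        # top edge, measured downward from the top of the page
    width: float
    height: float
    text: Optional[str] = None   # None when the area encloses an image
    kind: Optional[str] = None   # e.g. "title" or "body"; None until analysed

    @property
    def is_image(self) -> bool:
        return self.text is None


# Two areas as they might look before analysis: location and content are
# known, but their types and relationships are not.
areas = [
    ContentArea(52, 1, 40, 30, 520, 60, text="A Headline"),
    ContentArea(64, 1, 40, 400, 250, 180),  # image area: no text
]
untyped = [a.area_id for a in areas if a.kind is None]
```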
[0027] For example, it's likely that content area 52 is a title. Content area 54 may be a subtitle. Content areas 56, 58 and 60 may be related or independent body text. (This discussion will use the term "body text" to refer to the text making up the bulk of the content of an article, as opposed to the headline or caption.) Content area 62 may be a title for content area 58, or content areas 56 and 58 may be consecutive, and content area 62 may be a pull quote from that article. Content area 64 contains an image, suggesting that content area 66 may be a caption. A typical page will include two or more among the several types, including titles, subtitles, captions, pull-quotes, body text, and images.
[0028] The software viewer of the present invention performs analysis on the PDF that describes document 50 in order to determine the types of the content areas making up the pages of document 50, and how they relate to each other. This analysis will be discussed later.
[0029] The viewer of the present invention may operate in two modes, which will be called page mode and text mode. A displayed document may initially appear in page mode.
[0030] In page mode, the document retains its original layout, with tools provided to assist the user in navigating within that layout. Turning to Fig. 4a, on screen 30 in some embodiments a navigation pane 34 is displayed, for example at the bottom of the screen. The screen 30 may be at any zoom level, and thus may display only a portion of document 50. Navigation pane 34, however, shows the entire page, and includes a superimposed frame 32 indicating the location of screen 30 within the current page of document 50. In some embodiments the user may drag superimposed frame 32 to scroll within the current page of document 50.
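The relationship between the screen contents and the superimposed frame can be sketched as a simple scaling between page coordinates and navigation pane coordinates. The function names, the Rect type and the assumption that the pane shows the page at a uniform scale are illustrative only.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rect:
    x: float
    y: float
    width: float
    height: float


def frame_in_pane(viewport: Rect, page_width: float, pane_width: float) -> Rect:
    """Map the visible region of the page (page units) into pane coordinates."""
    scale = pane_width / page_width  # assumes the pane keeps the page aspect ratio
    return Rect(viewport.x * scale, viewport.y * scale,
                viewport.width * scale, viewport.height * scale)


def viewport_from_frame(frame: Rect, page_width: float, pane_width: float) -> Rect:
    """Inverse mapping: where the screen should scroll after the frame is dragged."""
    scale = page_width / pane_width
    return Rect(frame.x * scale, frame.y * scale,
                frame.width * scale, frame.height * scale)


# A 600-unit-wide page shown in a 150-unit-wide pane (scale 0.25).
visible = Rect(100, 200, 300, 400)
frame = frame_in_pane(visible, page_width=600, pane_width=150)
```

Dragging the frame then amounts to changing its x and y in pane coordinates and calling viewport_from_frame to obtain the new scroll position.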
[0031] After viewing the current page and reading headlines, a user may choose to read body text on the current page, for example starting with body text in content area 56 (shown in Fig. 3). The user may scroll so that any portion of content area 56 is displayed on screen 30, for example by dragging superimposed frame 32. The user then selects content area 56, for example by tapping or double-tapping anywhere within its area. Through analysis, the viewer has identified that content area 56 contains body text. A rectangular content area containing body text may be referred to as a text column. In response to selection of text column 56 by the user, for example by a tap or double-tap within its area, the current page of document 50 is zoomed and scrolled to optimize reading of text column 56. The page is scrolled so that the first line of text column 56 is at the top of screen 30, and a zoom level is selected so that the width of text column 56 is about the width of screen 30, as shown in Fig. 4b.
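The zoom and scroll described in this paragraph reduce to a small calculation: the zoom factor is the screen width divided by the column width, and the scroll offset places the column's top-left corner at the top-left of the screen. The sketch below assumes page coordinates measured from the top-left corner; the names are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Column:
    x: float      # left edge of the text column, in page units
    y: float      # top edge (first line) of the column, measured downward
    width: float


def fit_column(column: Column, screen_width: float) -> tuple[float, float, float]:
    """Return (zoom, scroll_x, scroll_y), with scroll offsets in screen pixels."""
    zoom = screen_width / column.width
    # After zooming, the column's top-left corner sits at (x * zoom, y * zoom);
    # scrolling by that amount puts its first line at the top of the screen.
    return zoom, column.x * zoom, column.y * zoom


zoom, scroll_x, scroll_y = fit_column(Column(x=40, y=120, width=160), screen_width=320)
```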
[0032] Navigation pane 34 may be automatically toggled off (as in Fig. 4b) when a text column is zoomed and scrolled for reading. In addition, a user may toggle navigation pane 34 off or on at any time.
[0033] If, in the original layout of the document, the width of a text column is so great that zooming it to fit the width of screen 30 renders text too small to be readable, in some embodiments, the viewer may automatically switch into text mode, described below.
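One way to express this fallback is to compare the font size after zooming against a minimum readable size. The threshold value and the function name below are assumptions; the source does not state how readability is measured.

```python
def choose_mode(column_width: float, font_size: float, screen_width: float,
                min_readable_size: float = 9.0) -> str:
    """Return "page" if the zoomed column stays readable, otherwise "text".

    Sizes are assumed to share one unit, with one screen unit per page unit
    at zoom 1.0. The default threshold is an illustrative guess.
    """
    zoom = screen_width / column_width
    return "page" if font_size * zoom >= min_readable_size else "text"


wide_column = choose_mode(column_width=600, font_size=10, screen_width=300)
narrow_column = choose_mode(column_width=150, font_size=10, screen_width=300)
```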
[0034] Once the user has finished reading text column 56, he may continue reading the following text in the article, which, in this example, may be in content area 58. Content area 58 is also a text column. The user may select text column 58, for example by tapping it. In response, the page is scrolled so that the first line of text column 58 is at the top of screen 30, and a zoom level is selected so that the width of text column 58 is about the width of screen 30.
[0035] A next icon (not shown) may be displayed at the bottom of text column 58. Referring to Fig. 3, suppose the article continues in text column 60. To continue reading in text column 60, the reader may select the next icon. The page is scrolled so that the first line of text column 60 is at the top of screen 30, and a zoom level selected so that the width of text column 60 is about the width of screen 30. The user may continue to read the rest of the article in this manner, continuing on subsequent pages, to the end of the article. In short, when a content unit is broken into segments, selecting the next icon will move the reader from the current segment to the next segment in correct reading order. In some embodiments, a previous icon (not shown) allows the user to move backward, in correct reading order, through segments of the content unit.
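The next and previous icons can be thought of as moving a cursor along the ordered list of segments of a content unit. The class below is a hypothetical sketch that assumes the reading order has already been determined by the analysis.

```python
from typing import Optional, Sequence


class SegmentNavigator:
    """Cursor over the segments of one content unit, in reading order."""

    def __init__(self, segments: Sequence[int]) -> None:
        if not segments:
            raise ValueError("a content unit needs at least one segment")
        self._segments = list(segments)
        self._index = 0

    @property
    def current(self) -> int:
        return self._segments[self._index]

    def next(self) -> Optional[int]:
        """Advance to the next segment; return None at the end of the unit."""
        if self._index + 1 >= len(self._segments):
            return None
        self._index += 1
        return self.current

    def previous(self) -> Optional[int]:
        """Step back one segment; return None at the start of the unit."""
        if self._index == 0:
            return None
        self._index -= 1
        return self.current


# Text columns 56, 58 and 60 of Fig. 3, assumed to form one article.
nav = SegmentNavigator([56, 58, 60])
first = nav.current
```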
[0036] When viewing a text column, the user will also have the option of selecting a text icon 38. Selecting text icon 38 toggles from page mode to text mode.
[0037] When the user selects text mode, the original layout is replaced with a text view. The body text is extracted from the content unit being read, and is reformatted in the text view to be continuous and complete in correct reading order. The term "content unit" is used herein to refer to text readable by the user. A content unit may be, for example, a newspaper article, a magazine article, a scholarly article, a textbook article or chapter, etc.
[0038] In text view, the text is optimized for reading. Graphics and images, such as illustrations or advertising, may be excluded from the text view. A font is selected to be readily readable on screen 30. The font may be the same as that used in article 30, or may be different. In some embodiments, the user is able to read the entire content unit by scrolling to the end with no additional navigation required. Alternatively, the content unit may be broken into two or more consecutive pages. The user may toggle from text mode back to page mode after reading the content unit, or at any time.
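Building the text view can be sketched as joining the body text areas of a content unit in reading order while skipping everything else. The dictionary keys and the "body" label are assumptions for illustration, and the sketch does not attempt to repair words hyphenated across columns.

```python
def build_text_view(areas: list[dict], reading_order: list[int]) -> str:
    """Join body text areas in reading order into one continuous string."""
    by_id = {area["id"]: area for area in areas}
    parts = []
    for area_id in reading_order:
        area = by_id[area_id]
        if area["kind"] != "body":
            continue  # titles, captions and images are left out of the text view
        # Collapse the line breaks of the original column layout.
        parts.append(" ".join(area["text"].split()))
    return " ".join(parts)


areas = [
    {"id": 52, "kind": "title", "text": "A Headline"},
    {"id": 56, "kind": "body", "text": "The first column ends\nmid-sentence and"},
    {"id": 58, "kind": "body", "text": "continues here."},
    {"id": 64, "kind": "image", "text": None},
    {"id": 60, "kind": "body", "text": "Then it concludes."},
]
text_view = build_text_view(areas, reading_order=[52, 56, 58, 64, 60])
```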
[0039] As noted earlier, in order to extract all of the body text of a content unit and display it in text view in correct reading order in text mode, or in order to advance from one text column to the next in page mode, analysis is performed of the PDF file to determine the type of the various content areas (for example areas 52-66 of document 50 in Fig. 3), and their relationship. Three approaches used to perform this analysis will be discussed. One, two, or all three approaches may be used, and when more than one is used, they may be used separately or together, in any order.
[0040] A first approach uses information from authoring tools. A PDF file can be created using authoring tools such as, for example, InDesign, from Adobe Systems. Using InDesign, a document designer can specify a text column on a page layout in which an article will start. If the text is too large to fit in the starting text column, it automatically flows into subsequent columns. When the document is exported to PDF, the authoring tool allows the user to include information about the links between these subsequent columns in the PDF file. If this information is present in the PDF, the viewer described in the present application uses this information to determine the correct reading order for a content unit.
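Assuming the link information has already been read from the PDF into a mapping from each column to the column that follows it, the reading order can be recovered by following the links from the starting column. How the links are stored in the file is not covered here; the mapping and function name are hypothetical.

```python
from typing import Mapping


def reading_order(start: int, next_link: Mapping[int, int]) -> list[int]:
    """Follow column-to-column links from the starting column."""
    order: list[int] = []
    seen: set[int] = set()
    current = start
    # Stop at the last column, or if malformed links loop back on themselves.
    while current is not None and current not in seen:
        order.append(current)
        seen.add(current)
        current = next_link.get(current)
    return order


# Columns 56 -> 58 -> 60, as in the example of Fig. 3.
order = reading_order(56, {56: 58, 58: 60})
```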
[0041] A second approach uses a set of defined rules. A human reader, when faced with a page in a magazine, newspaper, textbook, etc., typically has no trouble understanding how to read it: Based on explicit and implicit guidance in the text and layout, and based on a reader's experience reading print documents, she can easily determine what headlines relate to what articles, where an article starts, continues, and finishes, when two text columns on a page are two consecutive columns of the same article, two articles, an article and a sidebar, an article and an advertisement, etc. For the human reader, this determination is largely intuitive.
[0042] For the viewer of the present invention, the process can be codified in a set of rules. For example, for western languages, the following rules might apply for text boxes:
- If a text block starts with an oversize letter, it is assumed to be the start of an article.
- If a text block starts with a lowercase letter it is assumed to be a continuation of an article.
- If the content of a text block is identical to a text entry in a table of contents, it is assumed to be a headline.
- If a text block contains fewer than ten words it is assumed to be a headline or a caption rather than body text.
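The four text block rules above can be sketched as a small classifier. The order in which the rules are tried, the 1.5x ratio used to decide that an initial letter is oversize, and all names are assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TextBlock:
    text: str
    first_letter_size: float  # font size of the first character
    body_size: float          # dominant font size of the rest of the block


def classify(block: TextBlock, toc_entries: set[str],
             oversize_ratio: float = 1.5) -> str:
    text = block.text.strip()
    if text in toc_entries:
        return "headline"               # identical to a table of contents entry
    if len(text.split()) < 10:
        return "headline_or_caption"    # too short to be body text
    if block.first_letter_size >= oversize_ratio * block.body_size:
        return "article_start"          # oversize initial letter
    if text[0].islower():
        return "continuation"           # starts with a lowercase letter
    return "body"


toc = {"Summer in Copenhagen"}
labels = [
    classify(TextBlock("Summer in Copenhagen", 24, 24), toc),
    classify(TextBlock("Photo: the harbour at dusk", 8, 8), toc),
    classify(TextBlock("Once upon a time there was a city that never "
                       "quite slept at night", 30, 10), toc),
    classify(TextBlock("and so the story went on for many more pages "
                       "than anyone had expected", 10, 10), toc),
]
```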
[0043] The following rules might apply for interpreting page layout:
- If a text box is below a detected headline, it is assumed to be the start of an article.
- If a normal text block is below a detected text block and has no headline above it, it is assumed to be a continuation of the same article.
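The two layout rules can be sketched for a single column whose blocks are already typed and sorted from top to bottom. The tuple format is hypothetical, and treating a text block with no headline above it as an article without a headline is one possible reading of the rules.

```python
def group_into_articles(blocks: list[tuple[str, str]]) -> list[dict]:
    """Group (kind, text) blocks, sorted top to bottom, into articles."""
    articles: list[dict] = []
    current = None
    for kind, text in blocks:
        if kind == "headline":
            # A text block below this headline starts a new article.
            current = {"headline": text, "body": []}
            articles.append(current)
        elif kind == "text":
            if current is None:
                # No headline above: assumed to continue an article from elsewhere.
                current = {"headline": None, "body": []}
                articles.append(current)
            # Below another text block with no headline between: same article.
            current["body"].append(text)
    return articles


column = [
    ("headline", "City Council Votes"),
    ("text", "First paragraph."),
    ("text", "Second paragraph."),
    ("headline", "Weather"),
    ("text", "Rain expected."),
]
articles = group_into_articles(column)
```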
[0044] In addition, continuation guidance may be provided in the text. Continuation guidance may include words indicating continuation at the bottom of a text column, such as "See page 23" or "Please turn to pg. 5," and such guidance can be detected and deciphered. A sidebar often has a box around it, or uses a different font or a different color background. Reading order will be different depending on the language of the document, for example for English, Hebrew, Japanese, etc. The result of this approach is a list of articles containing headlines and the text belonging to each headline.
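Continuation guidance of the kind quoted above can be detected with a regular expression. The pattern below is a sketch that covers only a few English phrasings; the set of trigger words is an assumption, and real documents would need a broader, language-specific list.

```python
import re
from typing import Optional

# Matches phrases such as "See page 23", "turn to pg. 5", "continued on p. 12".
_CONTINUATION = re.compile(
    r"\b(?:see|turn\s+to|continued\s+on)\s+(?:page|pg\.?|p\.)\s*(\d+)",
    re.IGNORECASE,
)


def continuation_page(column_text: str) -> Optional[int]:
    """Return the page number named by the last continuation phrase, if any."""
    matches = _CONTINUATION.findall(column_text)
    return int(matches[-1]) if matches else None


target = continuation_page("...the mayor declined to comment. See page 23")
```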
[0045] A third approach employs machine learning. The purpose is to model the layout of a PDF by capturing the structure and calculating a latent representation which can be used as a similarity measure between PDFs.
[0046] The process includes a training stage to build the model and a similarity stage used in production. The steps include:
- Decompose the PDF into elements, such as text, images, location of text/image boxes, font sizes, font types etc.
- Extract features from these elements, such as number of fonts per page, distance between text and image boxes, and so on.
- Train a layout model, which learns from these features. This could be a deep neural network model, for instance.
- The trained model can now produce a latent representation of a given PDF. In practice each representation is a list of real numbers, e.g. [0.356, 0.01043, 0.023425...]
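Two of the steps above lend themselves to a short sketch: extracting a feature such as the number of fonts per page, and comparing two latent representations. No model is trained here, and cosine similarity is an assumed choice of similarity measure, since the source does not name one. The element format is hypothetical.

```python
import math
from collections import defaultdict


def fonts_per_page(elements: list[dict]) -> dict[int, int]:
    """Count the distinct fonts used by text elements on each page."""
    fonts: dict[int, set] = defaultdict(set)
    for element in elements:
        if element["type"] == "text":
            fonts[element["page"]].add(element["font"])
    return {page: len(names) for page, names in fonts.items()}


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Similarity of two latent representations of equal length."""
    if len(a) != len(b):
        raise ValueError("representations must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


elements = [
    {"type": "text", "page": 1, "font": "Garamond"},
    {"type": "text", "page": 1, "font": "Helvetica"},
    {"type": "image", "page": 1, "font": None},
    {"type": "text", "page": 2, "font": "Garamond"},
]
font_counts = fonts_per_page(elements)
```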
[0047] To evaluate the results from this automatic process, a large number of PDF documents, for example ten thousand documents (or more or fewer) may be marked manually. In each the beginning and ending of each content unit is identified, and headlines, subheads, pull quotes, captions, etc., are all identified and correctly associated with a content unit. Advertising, images, figures, etc. are identified as well. This manual marking is then compared to the automatic process, and the success of the automatic process can be awarded a score.
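A score of this kind could be as simple as the fraction of manually marked content areas for which the automatic process produced the same type and content unit. The source does not specify the scoring method, so the metric and the data layout below are assumptions.

```python
def agreement_score(manual: dict[int, tuple[str, int]],
                    automatic: dict[int, tuple[str, int]]) -> float:
    """Fraction of manually marked areas that the automatic process matched.

    Each mapping goes from a content area id to (type, content unit id).
    """
    if not manual:
        raise ValueError("manual marking is empty")
    correct = sum(1 for area_id, label in manual.items()
                  if automatic.get(area_id) == label)
    return correct / len(manual)


manual = {52: ("headline", 1), 56: ("body", 1), 58: ("body", 1), 66: ("caption", 1)}
automatic = {52: ("headline", 1), 56: ("body", 1), 58: ("body", 2), 66: ("caption", 1)}
score = agreement_score(manual, automatic)
```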
[0048] Fig. 5 is a simplified block diagram of a computer system 110 that can be used to implement software incorporating aspects of the present invention. While the foregoing description indicates that the viewer carries out specified operations, it will be appreciated that the viewer is in fact implemented as software code portions which cause computer system 110 to operate in the specified manner.
[0049] Computer system 110 typically includes a processor subsystem 114 which communicates with a number of peripheral devices via bus subsystem 112. These peripheral devices may include a storage subsystem 124, comprising a memory subsystem 126 and a file storage subsystem 128, user interface input devices 122, user interface output devices 120, and a network interface subsystem 116. The input and output devices allow user interaction with computer system 110. Network interface subsystem 116 provides an interface to outside networks, including an interface to communication network 118, and is coupled via communication network 118 to corresponding interface devices in other computer systems. Communication network 118 may comprise many interconnected computer systems and communication links. These communication links may be wireline links, optical links, wireless links, or any other mechanisms for communication of information. While in one embodiment, communication network 118 is the Internet, in other embodiments, communication network 118 may be any suitable computer network.
[0050] The physical hardware components of network interfaces are sometimes referred to as network interface cards (NICs), although they need not be in the form of cards: for instance they could be in the form of integrated circuits (ICs) and connectors fitted directly onto a motherboard, or in the form of macrocells fabricated on a single integrated circuit chip with other components of the computer system.
[0051] As indicated earlier, when used to display a very large document, the viewer of the present application could operate on any computer, such as a standard desktop computer, but will more commonly be useful on devices having small screens, such as mobile phones, small tablets or laptops, palmtop computers, etc. User interface input devices 122 may include a keyboard, pointing devices such as a mouse, trackball, touchpad, or graphics tablet, a scanner, a touch screen incorporated into the display, audio input devices such as voice recognition systems, microphones, and other types of input devices. In general, use of the term "input device" is intended to include all possible types of devices and ways to input information into computer system 110 or onto computer network 118.
[0052] User interface output devices 120 may include a display subsystem, a printer, a fax machine, or non-visual displays such as audio output devices. The display subsystem may include a cathode ray tube (CRT), a flat-panel device such as a liquid crystal display (LCD), a projection device, or some other mechanism for creating a visible image. The display subsystem may also provide non-visual display such as via audio output devices. In general, use of the term "output device" is intended to include all possible types of devices and ways to output information from computer system 110 to the user or to another machine or computer system.
[0053] Storage subsystem 124 stores the basic programming and data constructs that provide the functionality of certain embodiments of the present invention. For example, the various modules implementing the functionality of certain embodiments of the invention may be stored in storage subsystem 124. These software modules are generally executed by processor subsystem 114.
[0054] Memory subsystem 126 typically includes a number of memories including a main random access memory (RAM) 130 for storage of instructions and data during program execution and a read only memory (ROM) 132 in which fixed instructions are stored. File storage subsystem 128 provides persistent storage for program and data files, and may include a hard disk drive, a floppy disk drive along with associated removable media, a CD ROM drive, an optical drive, or removable media cartridges. The databases and modules implementing the functionality of certain embodiments of the invention may have been provided on a computer readable medium such as one or more CD-ROMs, and may be stored by file storage subsystem 128. The host memory 126 contains, among other things, computer instructions which, when executed by the processor subsystem 114, cause the computer system to operate or perform functions as described herein. As used herein, processes and software that are said to run in or on "the host" or "the computer" execute on the processor subsystem 114 in response to computer instructions and data in the host memory subsystem 126, including any other local or remote storage for such instructions and data.
[0055] Bus subsystem 112 provides a mechanism for letting the various components and subsystems of computer system 110 communicate with each other as intended. Although bus subsystem 112 is shown schematically as a single bus, alternative embodiments of the bus subsystem may use multiple busses.
[0056] Computer system 110 itself can be of varying types including a personal computer, a portable computer, a workstation, a computer terminal, a network computer, a television, a mainframe, a server farm, or any other data processing system or user device. Due to the ever-changing nature of computers and networks, the description of computer system 110 depicted in Fig. 5 is intended only as a specific example for purposes of illustrating the preferred embodiments of the present invention. Many other configurations of computer system 110 are possible having more or fewer components than the computer system depicted in Fig. 5.
[0057] In particular and without limitation, though many of the inventive aspects are described individually herein, it will be appreciated that many can be combined or used together with each other. All such combinations are intended to be included in the scope of this document.
[0058] The foregoing description of preferred embodiments of the present invention has been provided for the purposes of illustration and description. It is not intended to be exhaustive or to limit the invention to the precise forms disclosed. Obviously, many modifications and variations will be apparent to practitioners skilled in this art. In particular, and without limitation, any and all variations described, suggested or incorporated by reference herein with respect to any one embodiment are also to be considered taught with respect to all other embodiments. The embodiments described herein were chosen and described in order to best explain the principles of the invention and its practical application, thereby enabling others skilled in the art to understand the invention for various embodiments and with various modifications as are suited to the particular use contemplated.

Claims

1. A viewer application for viewing content from a document defined in a PDF file on a small screen, the document having an original layout, the document comprising at least one content area and at least one content unit, wherein the viewer application executes the following steps:
analyzing the document to:
(a) identify content areas of the document of the body text type,
(b) correlate each body text type content area with a content unit,
(c) identify a correct reading order for the body text type content areas of each content unit;
displaying a current page of the document in its original layout; and
in response to reader selection of a body text type content area of one of the at least one content units, providing navigation means allowing the user to read the content unit in correct reading order.
2. The viewer application of claim 1 wherein the step of analyzing the document comprises analyzing the font size or case of the initial letters or word of the content area.
3. The viewer application of claim 1 wherein the step of analyzing the document comprises detecting and deciphering continuation guidance within a content area.
4. The viewer application of claim 1 wherein, during the analyzing step, a content area is determined not to be of the body text type if it contains ten words or fewer.
5. The viewer application of claim 1 wherein, during the analyzing step, a content area is determined not to be of the body text type if it is identical to a text entry in a table of contents.
6. A viewer application for viewing a document on a screen of a device, the document including a current page, the current page having an original layout, wherein the viewer application may select between at least two modes, the modes comprising:
page mode, wherein the original layout of the current page is preserved; and
text mode, wherein body text is extracted from the document and reformatted in a text view,
wherein the document is described in a PDF file, and wherein the viewer application is implemented as software code portions.
7. The viewer application of claim 6 wherein a user of the viewer application can toggle between page mode and text mode.
8. The viewer application of claim 6 wherein the original layout includes multiple content areas.
9. The viewer application of claim 8 wherein the multiple content areas include at least two types of content areas from the group consisting of titles, subtitles, captions, body text, and images.
10. The viewer application of claim 6 wherein the device is a mobile phone or a tablet.
11. The viewer application of claim 6 wherein, in page mode, when the entire current page is displayed on the screen in the original layout, at least some text on the current page is too small to be readable by a user.
12. The viewer application of claim 6 wherein, in text mode, the body text is extracted from a content unit and is reformatted to be continuous and complete in correct reading order.
13. The viewer application of claim 12 wherein the content unit is a magazine article or newspaper article.
14. The viewer application of claim 12 wherein, in text mode, no images are included in the text view.
15. The viewer application of claim 6 wherein, in page mode, a navigation pane displays the current page with a superimposed frame indicating current contents of the screen.
16. The viewer application of claim 15 wherein the superimposed frame can be moved by a user to change the current contents of the screen.
17. The viewer application of claim 15 wherein a user can toggle display of the navigation pane off and on.
18. The viewer application of claim 6 wherein the screen has a width and a top and wherein, in page mode, in response to a user tapping within an area of a first text column having a first line, a width of the first text column is zoomed to the width of the screen, and the first text column is positioned with its first line at the top of the screen.
19. The viewer application of claim 18 wherein the first text column includes a first segment of a content unit, wherein a next icon appears at the bottom of the first text column, and wherein, in response to the user selecting the next icon, a second text column including a next segment of the content unit is displayed on the screen.
20. A viewer application for viewing a document on a screen of a device,
wherein the document includes a current page,
wherein the current page has an original layout comprising multiple content areas, wherein the multiple content areas include at least two types of content areas from the group consisting of titles, subtitles, captions, body text, pull quotes, images, and graphics,
wherein, responsive to a text mode command from a user, body text is extracted from the document and reformatted to be continuous, complete and in correct reading order, and
wherein the document is described in a PDF file.
21. The viewer application of claim 20, wherein the body text is extracted from a content unit.
22. The viewer application of claim 21 wherein the content unit is a magazine article or newspaper article.
23. The viewer application of claim 20 wherein the text mode command from the user is issued by a tap or double-tap on the screen.
24. The viewer application of claim 20 wherein the device is a mobile phone or a tablet.
25. The viewer application of claim 20 wherein the document may comprise one or more additional pages, each additional page having a respective original layout, and wherein, responsive to a page mode command from the user, the viewer application displays at least a portion of the original layout of the current page or of an additional page.
26. The viewer application of claim 20 wherein correct reading order is determined by information in the PDF file.
27. The viewer application of claim 20 wherein correct reading order is determined based on rules.
28. The viewer application of claim 27 wherein the rules are compiled by a process of machine learning.
29. A viewer application for viewing a document on a screen of a device,
wherein the document includes a current page,
wherein the current page has an original layout comprising multiple content areas, wherein the multiple content areas include at least two types of content areas from the group consisting of titles, subtitles, captions, body text, pull quotes, images, and graphics,
wherein the current page is displayed in its original layout and a navigation pane displays the current page with a superimposed frame indicating current contents of the screen, and
wherein the document is described in a PDF file.
30. The viewer application of claim 29 wherein the device is a mobile phone or a tablet.
31. The viewer application of claim 29 wherein the superimposed frame can be moved by a user to change the current contents of the screen.
32. The viewer application of claim 29 wherein a user can toggle display of the navigation pane off and on.
33. The viewer application of claim 29 wherein the screen has a width and a top and wherein, in page mode, in response to a user tapping within an area of a first text column having a first line, the first text column is zoomed to the width of the screen and positioned with its first line at the top of the screen.
34. The viewer application of claim 33 wherein the first text column includes a first segment of a content unit, wherein a next icon appears at the bottom of the first text column, and wherein, in response to the user selecting the next icon, a second text column including a next segment of the content unit is displayed on the screen.
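As a rough illustration only, the exclusion rules of claims 4 and 5 and the tap-to-zoom behavior of claims 18 and 33 might be sketched as follows in Python. The `ContentArea` type, the function names, and the coordinate convention (page units, origin at the top-left corner) are assumptions made for this sketch and do not appear in the application, which does not specify an implementation.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass
class ContentArea:
    """Hypothetical model of a rectangular content area on a page."""
    text: str
    x: float       # left edge, in page units
    y: float       # top edge (first line), in page units
    width: float
    height: float


def is_body_text(area: ContentArea, toc_entries: set[str]) -> bool:
    """Apply only the two exclusion rules stated in claims 4 and 5."""
    # Claim 4: an area containing ten words or fewer is not body text.
    if len(area.text.split()) <= 10:
        return False
    # Claim 5: an area identical to a table-of-contents entry is not body text.
    if area.text.strip() in toc_entries:
        return False
    return True


def zoom_to_column(column: ContentArea, screen_width: float) -> tuple[float, float, float]:
    """Return (scale, offset_x, offset_y) for a tapped text column.

    A page point (px, py) maps to the screen point
    (px * scale + offset_x, py * scale + offset_y), so the column fills
    the screen width and its first line sits at the top of the screen.
    """
    scale = screen_width / column.width
    return scale, -column.x * scale, -column.y * scale
```

For a column at x=100, y=200 with a width of 150 shown on a screen 300 units wide, this sketch gives a scale of 2.0 and offsets of (-200.0, -400.0). A real viewer would combine these rules with the other analysis steps recited in claims 1 to 3.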
PCT/US2017/040264 2017-06-30 2017-06-30 Method and system to display content from a pdf document on a small screen WO2019005100A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/US2017/040264 WO2019005100A1 (en) 2017-06-30 2017-06-30 Method and system to display content from a pdf document on a small screen

Publications (1)

Publication Number Publication Date
WO2019005100A1 (en) 2019-01-03

Family

ID=64742932

Country Status (1)

Country Link
WO (1) WO2019005100A1 (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130014007A1 (en) * 2011-07-07 2013-01-10 Aquafadas Method for creating an enrichment file associated with a page of an electronic document
US20140006982A1 (en) * 2009-10-05 2014-01-02 Daniel Wabyick Paginated viewport navigation over a fixed document layout
US20140250372A1 (en) * 2008-12-01 2014-09-04 Adobe Systems Incorporated Methods and systems for page navigation of dynamically laid-out content
US20140250371A1 (en) * 2008-12-01 2014-09-04 Adobe Systems Incorporated Methods and Systems for Page Layout Using a Virtual Art Director

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17915631

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 17915631

Country of ref document: EP

Kind code of ref document: A1