WO2006036785A1 - Image distortion for content security - Google Patents

Image distortion for content security Download PDF

Info

Publication number
WO2006036785A1
WO2006036785A1 PCT/US2005/034141 US2005034141W WO2006036785A1 WO 2006036785 A1 WO2006036785 A1 WO 2006036785A1 US 2005034141 W US2005034141 W US 2005034141W WO 2006036785 A1 WO2006036785 A1 WO 2006036785A1
Authority
WO
WIPO (PCT)
Prior art keywords
image
document
image portion
generating
software
Prior art date
Application number
PCT/US2005/034141
Other languages
French (fr)
Inventor
Joseph K. O'sullivan
Original Assignee
Google, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Family has litigation
First worldwide family litigation filed litigation Critical https://patents.darts-ip.com/?family=35463958&utm_source=google_patent&utm_medium=platform_link&utm_campaign=public_patent_search&patent=WO2006036785(A1) "Global patent litigation dataset” by Darts-ip is licensed under a Creative Commons Attribution 4.0 International License.
Application filed by Google, Inc. filed Critical Google, Inc.
Priority to BRPI0515479-0A priority Critical patent/BRPI0515479A2/en
Priority to JP2007532687A priority patent/JP2008513898A/en
Priority to CN2005800372319A priority patent/CN101049007B/en
Priority to CA2581366A priority patent/CA2581366C/en
Priority to EP05798318A priority patent/EP1792476A1/en
Priority to AU2005289725A priority patent/AU2005289725C1/en
Publication of WO2006036785A1 publication Critical patent/WO2006036785A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N1/00Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
    • H04N1/44Secrecy systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N1/00Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
    • H04N1/00838Preventing unauthorised reproduction
    • H04N1/00856Preventive measures
    • H04N1/00864Modifying the reproduction, e.g. outputting a modified copy of a scanned original
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N1/00Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
    • H04N1/00838Preventing unauthorised reproduction
    • H04N1/00856Preventive measures
    • H04N1/00864Modifying the reproduction, e.g. outputting a modified copy of a scanned original
    • H04N1/00872Modifying the reproduction, e.g. outputting a modified copy of a scanned original by image quality reduction, e.g. distortion or blacking out
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N1/00Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
    • H04N1/387Composing, repositioning or otherwise geometrically modifying originals
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99931Database or file accessing
    • Y10S707/99933Query processing, i.e. searching

Definitions

  • the present invention relates to document security and, more particularly, to preventing a user from obtaining a complete copy of a document. Description of the Background Art
  • a method for generating an image is presented, wherein the image displays a document, and the document is relevant to a search query.
  • the method comprises generating a first image portion, the first image portion containing a region of interest, the region of interest being a portion of the document that is relevant to the search query; generating a second image portion, the second image portion comprising a second portion of the document that contains the region of interest, the second image portion being distorted; and generating an image comprising the first image portion and the second image portion.
  • FIG. IA illustrates an undistorted image of a document.
  • FIG. IB illustrates a distorted image of the same document as that shown in FIG. IA, according to one embodiment of the invention.
  • FIG. 2A illustrates an image of the same document as that shown in
  • FIG. IA according to one embodiment of the invention.
  • FIG. 2B illustrates an image of the same document as that shown in
  • FIG. IA according to another embodiment of the invention.
  • FIG. 3 illustrates a block diagram of a general-purpose computing device for implementing the invention according to one embodiment.
  • FIG. 4 illustrates a block diagram of a software architecture for a system according to one embodiment of the invention.
  • FIG. 5 illustrates a flowchart of a method performed by a main program, according to one embodiment of the invention.
  • FIG. 6A illustrates an image similar to the image shown in FIG. 2A where a search term is underlined, according to one embodiment of the invention.
  • FIG. 6B illustrates an image similar to the image shown in FIG. 2B where a search term is underlined, according to one embodiment of the invention.
  • Search engine results typically comprise a list of links to electronic documents that satisfy a search query.
  • a "document” is understood to include any textual, graphical, visual, multimedia, or other type of work for which a visual representation can be derived and presented to a user.
  • the user views the document. This is generally performed by clicking on the link associated with the document, which causes the document to be displayed.
  • a document's relevancy can frequently be determined based on a portion of the document that is relevant to the search terms (a "region of interest").
  • a ROI can be, for example, a word, a sentence, a paragraph, a table, a graphic, or any other textual, graphical, visual, multimedia, or video element or the like, depending on the type of content involved. While the user does not need to see the entire document in order to determine whether it is relevant, it is useful to know the context of the ROI within the document.
  • FIG. IA illustrates an undistorted image of a document.
  • Image IOOA is a single page of a lengthy document, and is exemplary of images shown by conventional imaging tools that are used to display electronic documents. If so inclined, a user can copy the entirety of the text (or image) portions shown and use these copied portions without permission from, or payment to, the owner of the document.
  • Image IOOA is derived from a document that can be in, for example, text format, image format, a markup language, a page description language, or other format.
  • FIG. IB illustrates a distorted image of the same document as that shown in FIG. IA, according to one embodiment of the invention.
  • Image IOOB may be created directly from the underlying document, or it may be created from an undistorted image of the underlying document, such as image 10OA.
  • image IOOB is created by distorting the undistorted image 10OA.
  • the image IOOA is distorted by using pixelation and also by decreasing the brightness level of portions of the image that are outside of a region of interest of the user.
  • the user is not shown a complete, undistorted image of the document and thus is prevented from making a copy of the undistorted document.
  • FIG. 2A illustrates an image of the same document as that shown in FIG. IA, according to one embodiment of the invention.
  • FIG. 2B illustrates an image of the same document as that shown in FIG. IA, according to another embodiment of the invention.
  • an image 200 enables a user to determine the relevance of the underlying document by displaying an undistorted image portion 210 of a first portion of the document and a distorted image portion 220 of a second portion of the document.
  • the second portion of the document is one page of the document (for example, if the document is a multi-page document).
  • the second portion of the document is an area of the document (for example, if the document is graphical).
  • image 200A comprises image portions
  • image 200B comprises image portions 210B and 220B.
  • the first portions 210A, 220A of the documents comprise three partial lines of text, with the first partial line being "This is sample text.”
  • the second portions 210B, 220B comprise the remaining contents of the page represented by image 200.
  • the first portion of the document is the user's ROI (i.e., a portion of the document that is relevant to the user's search terms).
  • the contents of image portion 210 which displays the user's ROI, should be readable by a typical user so that the user can determine whether the ROI is relevant.
  • image portion 210 is undistorted, similar to image 10OA.
  • image portion 210 is modified to help the user determine the relevance of the document.
  • image portion 210 may indicate the presence of search terms by displaying these terms with underlining, or outlining, or highlighting.
  • FIG. 6A illustrates an image similar to the image shown in FIG. 2A where a search term is underlined, according to one embodiment of the invention.
  • FIG. 6B illustrates an image similar to the image shown in FIG. 2B where a search term is underlined, according to one embodiment of the invention.
  • the second portion of the document is that which corresponds to the page that is represented by image 200.
  • Image portion 220 which displays the second portion of the document, should be distorted so that its contents are unreadable by a typical user or otherwise degraded to devalue or impair a user's use or copying of them.
  • an image portion 220 can be pixilated, blurred, tinted, or converted to a lower resolution.
  • image 200A shows undistorted image portion 210A being located "on top of" distorted image portion 220A at a similar place to where the ROI would be located within the page of the document that is being displayed.
  • image 200B shows undistorted image portion 210B being located next to distorted image portion 220B and also shows a "callout" 230 from distorted image portion 220B to undistorted image portion 210B.
  • FIG. 3 illustrates a block diagram of a general-purpose computing device for implementing the invention according to one embodiment.
  • the computing device 300 preferably includes a processor 310, a main memory 320, a data storage device 330, and a network controller 380, all of which are communicatively coupled to a system bus 340.
  • Computing device 300 may be, for example, a workstation, a desktop computer, a laptop computer, a tablet computer, a personal digital assistant (PDA), or any other type of computing device.
  • PDA personal digital assistant
  • Processor 310 processes data signals and comprises various computing architectures including a complex instruction set computer (CISC) architecture, a reduced instruction set computer (RISC) architecture, or an architecture implementing a combination of instruction sets. Although only a single processor is shown in FIG. 3, multiple processors may be included.
  • CISC complex instruction set computer
  • RISC reduced instruction set computer
  • Main memory 320 stores instructions and/ or data that are executed by processor 310.
  • the instructions and/ or data comprise code for performing any and/ or all of the techniques described herein.
  • Main memory 320 is preferably a dynamic random access memory (DRAM) device, a static random access memory
  • SRAM static random access memory
  • Data storage device 330 stores data and instructions for processor 310 and comprises one or more devices including a hard disk drive, a floppy disk drive, a CD-ROM device, a DVD-ROM device, a DVD-RAM device, a DVD-RW device, a flash memory device, or some other mass storage device known in the art.
  • Network controller 380 links the computing device 300 to a network
  • System bus 340 represents a shared bus for communicating information and data throughout the computing device 300.
  • System bus 340 represents one or more buses including an industry standard architecture (ISA) bus, a peripheral component interconnect (PCI) bus, a universal serial bus (USB), or some other bus known in the art to provide similar functionality.
  • ISA industry standard architecture
  • PCI peripheral component interconnect
  • USB universal serial bus
  • Display device 350 represents any device equipped to display electronic images and data to a local user or maintainer.
  • Display device 350 is a cathode ray tube (CRT), a liquid crystal display (LCD), or any other similarly equipped display device, screen, or monitor.
  • Keyboard 360 represents an alphanumeric input device coupled to computing device 300 to communicate information and command selections to processor 310.
  • Cursor control device 370 represents a user input device equipped to communicate positional data as well as command selections to processor 310.
  • Cursor control device 370 includes a mouse, a trackball, a stylus, a pen, cursor direction keys, or other mechanisms to cause movement of a cursor.
  • computing device 300 includes more or fewer components than those shown in FIG. 3 without departing from the spirit and scope of the present invention.
  • computing device 300 may include additional memory, such as, for example, a first or second level cache or one or more application specific integrated circuits (ASICs).
  • ASICs application specific integrated circuits
  • computing device 300 may be comprised solely of ASICs.
  • components may be coupled computing device 300 including, for example, image scanning devices, digital still or video cameras, or other devices that may or may not be equipped to capture and/ or download electronic data to/ from computing device 300.
  • FIG. 4 illustrates a block diagram of a software architecture for a system according to one embodiment of the invention.
  • code modules and memory storage areas are stored in the memory 320 for generating an image that represents a portion of a document and conveys the context of that portion within the document.
  • the code modules and memory storage areas include a main program module 400, a document-to-image conversion module 410, an image distortion/ modification module 420, an image generation module 430, and a document and image repository module 440.
  • Code modules 400, 410, 420, and 430 and memory storage area 440 are communicatively coupled to each other.
  • Main program module 400 transmits instructions and data to as well as receives data from each code module and memory.
  • Document-to-image conversion module 410 generates, given an electronic document, an image of at least one page of that document. In a typical embodiment, document-to-image conversion module 410 generates a separate image for each page of the document that contains one or more of the search terms (or conceptually related terms) of the user's query.
  • document-to-image conversion module 410 generates undistorted image 10OA.
  • Undistorted image IOOA may be cropped to display only the user's ROI and then used as undistorted image portion 210.
  • undistorted image IOOA may be distorted using image distortion/ modification module 420 and then used as distorted image portion 220.
  • undistorted image IOOA is stored using document and image repository module 440 so that undistorted image IOOA does not have to be generated again.
  • document-to-image conversion module
  • Distorted image IOOB may be used as distorted image portion 220.
  • distorted image IOOB is stored using document and image repository module 440 so that distorted image IOOB does not have to be generated again.
  • Many distortion methods may be used. These methods include, for example, pixelation, change of brightness, change of contrast, blurring, and image filtering.
  • Document-to-image conversion module 410 may use one or more of these methods to generate distorted image IOOB.
  • Document-to-image conversion module 410 may also generate an image that has been modified based on the user's search terms (e.g., by highlighting the search terms within the image).
  • This modified image could be either undistorted or distorted. If the modified image is undistorted, it could be cropped to display only the user's ROI and then used as undistorted image portion 210. If the modified image is distorted, it could be used as distorted image portion 220. In one embodiment, a modified image would not be saved because its use is limited to a query containing the same search terms.
  • Document-to-image conversion module 410 can generate an image in several ways. If the electronic version of the original document is a PDF document, for example, document-to-image conversion module 410 can use the capabilities of PDF software to output the document's contents as an image. If it is a word processing file, document-to-image conversion module 410 can print the document's contents to a file (rather than to a printer) as an image. If it is an image (e.g., a physical document that has been scanned), document-to-image conversion module 410 can further process the image as necessary. For example, document-to-image conversion module 410 can divide the image into several parts and/ or reduce the resolution of the image by downsampling. Another possibility is for document-to- image conversion module 410 to use a software conversion program that converts a specific type of electronic file to an image.
  • Image distortion/ modification module 420 generates, given an image, a different version of that image.
  • image distortion/ modification module 420 generates a distorted version of the image 10OB.
  • distorted image IOOB may then be stored and/ or used as distorted image portion 220.
  • Many distortion methods may be used. These methods include, for example, pixelation, change of brightness, change of contrast, blurring, and image filtering.
  • Image distortion/ modification module 420 may use one or more of these methods to generate distorted image IOOB.
  • image distortion/ modification module 420 generates an image that has been modified based on the user's search terms (e.g., by highlighting the search terms within the image).
  • This modified image could be either distorted or undistorted. As discussed above with reference to document-to-image conversion module 410, this modified image could be used as distorted image portion 220 or cropped and then used as undistorted image portion 210. In one embodiment, the modified image would not be saved.
  • Image generation module 430 generates an image 200 that 1) represents a portion of a document (such as a ROI) and 2) conveys the context of that portion within the document.
  • image 200 comprises image portions 210 and 220.
  • Image portion 210 is used to represent the ROI, while image portions 210 and 220 are used to convey the context of the ROI by indicating the location of the ROI within the document.
  • Many types of images 200 can be used to indicate the context of the
  • Image 200A is a composite image comprising image portions 210A and 220A such that the combination of image portions 210A and 220A appears to be a single document.
  • image portion 210A is overlaid on image portion 220A such that image portion 210A covers the portion of image portion 220A that contains the ROI.
  • image portion 210A has a similar appearance to image portion 220A except that image portion 220A is distorted and image portion 210A is not.
  • image portion 210A has a different appearance from image portion 220A, besides the fact that image portion 220A is distorted and image portion 210A is not. This difference in appearance helps distinguish image portion 210A from the rest of image 200A and thereby makes it easier for the user to find image portion 210A within image 200A.
  • the font and/ or background color of image portion 210A may differ from the font and/ or background color of image portion 220A.
  • image portion 210A may be outlined, forming a bounding box (e.g., a rectangle) that extends a minimum distance (e.g., 0.5") outside of the contents of image portion 210A.
  • a bounding box e.g., a rectangle
  • a minimum distance e.g., 0.5
  • FIG. 2B Another example of an image that can be used to indicate the context of the ROI is shown in FIG. 2B.
  • Image 200B similarly comprises image portions 210B and 220B, but image 200B does not overlay image portion 210B onto image portion 220B. Instead, image 200B places image portion 210B outside of image 220B and uses a "callout" 230 from image portion 210B to the location of the ROI within distorted image 220B.
  • image generation module 430 generates a location map of the displayed document page showing the location of the ROI. Image generation module 430 then uses this map to generate image 200 such that image 200 indicates the context of the ROI. In one embodiment, image generation module 430 determines the location of the ROI based on the locations of words within the ROI. The locations of these words are obtained by querying document and image repository module 440.
  • Document and image repository module 440 stores documents and/ or images. These images may include, for example, undistorted images IOOA of a document and distorted images IOOB of a document. If a document exists in electronic format, the electronic format is stored in document and image repository module 450. If no electronic format exists, then the document is digitized by, for example, scanning the document and/ or performing Optical Character Recognition (OCR) on it. The results are then stored in document and image repository module 450.
  • OCR Optical Character Recognition
  • Document and image repository module 440 also stores positions of words within documents and/ or images. For example, document and image repository module 440 stores, for each word in an image or document, the dimensions of the smallest box that can enclose the word (the word's "bounding box") and the location of the box in the image or document (e.g., in x,y coordinates). Given a file that contains text, determining a word's bounding box is known to those of ordinary skill in the art. In one embodiment, if the file is an image file, the image is converted to text by OCR' ing it. As a by-product of the OCR process, the dimensions and locations of bounding boxes can be determined.
  • FIG. 5 illustrates a flowchart of a method performed by a main program, according to one embodiment of the invention. This method may be used, for example, in conjunction with a search engine. Before the method of FIG. 5 begins, a user enters a query into a search engine. The query may contain various search terms and expressions.
  • the search engine then generates a set of results, typically a list of documents.
  • Each result represents a reference to a document that is relevant to the query.
  • a document can be relevant to a query because, for example, its contents directly "match" the query terms (e.g., using a textual match).
  • a document can be relevant because its contents are conceptually, semantically, or topically related to the query terms.
  • a document can be relevant because meta-information associated with the document (e.g., the document's author or publication date) satisfy the query.
  • the particular way in which the search engine determines relevant documents is not material to the invention, which may be used with any type of search engine.
  • the search engine determines a portion of the document that is relevant to the query (a ROI). The search engine also determines where query terms appear in the document, if at all. This process is known to those of ordinary skill in the art. Main program module 400 then begins 500.
  • Steps 510 and 520 may occur in any order, including simultaneously.
  • Main program module 400 generates 510 distorted image portion 220.
  • Distorted image portion 220 is, for example, a page of the selected document that contains the user's ROI. In one embodiment, distorted image portion 220 is not modified based on the user's query. In this embodiment, main program module 400 uses a distorted image of the selected page IOOB as distorted image portion 220. There are several ways to obtain distorted image IOOB. A few of these ways are described below. [0055] In one embodiment, main program module 400 retrieves distorted image IOOB from document and image repository module 440 if image IOOB exists.
  • main program module 400 retrieves an undistorted image of the selected page IOOA from document and image repository module 440 if image IOOA exists. If image IOOA does exist, main program module 400 distorts image IOOA using image distortion/ modification module 420, thereby generating image IOOB. In one embodiment, main program module 400 also stores image IOOB in document and image repository module 440 for later use.
  • main program module 400 retrieves the selected document from document and image repository module 440. Main program module 400 then generates an image from the document using document-to-image conversion module 410. In one embodiment, main program module 400 uses document-to-image conversion module 410 to generate distorted image IOOB. In one embodiment, main program module 400 also stores image IOOB in document and image repository module 440 for later use. [0058] In another embodiment, main program module 400 uses document-to- image conversion module 410 to generate undistorted image 10OA. In one embodiment, main program module 400 stores image IOOA in document and image repository module 440 for later use. Main program module 400 then distorts image IOOA using image distortion/ modification module 420, thereby generating image 10OB. In one embodiment, main program module 400 also stores image IOOB in document and image repository module 440 for later use.
  • distorted image portion 220 is modified based on the user's query.
  • main program module 400 obtains distorted image IOOB as described above. Then, main program module 400 uses image distortion/ modification module 420 to modify image IOOB based on the user's query. This modified image is then used as distorted image portion 220.
  • Main program module 400 generates 520 undistorted image portion
  • Undistorted image portion 210 is, for example, the user's ROI. In one embodiment, undistorted image portion 210 is not modified based on the user's query. In this embodiment, main program module 400 obtains an undistorted image of the selected page IOOA and then crops this image to show the user's ROI. The cropped image is then used as undistorted image portion 210. There are several ways to obtain undistorted image IOOA. A few of these ways are described below. [0061] In one embodiment, main program module 400 retrieves undistorted image IOOA from document and image repository module 440 if image IOOA exists. [0062] In another embodiment, if image IOOA does not exist, main program module 400 retrieves the selected document from document and image repository module 440. Main program module 400 then uses document-to-image conversion module 410 to generate, from the document, undistorted image IOOA. In one embodiment, main program module 400 also stores image IOOA in document and image repository module 440 for later use.
  • undistorted image portion 210 is modified based on the user's query.
  • main program module 400 obtains undistorted image IOOA as described above. Then, main program module 400 uses image distortion/ modification module 420 to modify image IOOA based on the user's query. This modified image is then cropped and used as undistorted image portion 210.
  • main program module 400 uses image generation module 430 to generate combined image 200 using undistorted image portion 210 and distorted image portion 220. Main program module 400 then ends 540, and combined image
  • more than one computing device 300 is used, such as in a client-server setting.
  • a user may input a query into a search engine using a first computing device 300A (the "client").
  • the first computing device 300A the "client"
  • 300 A will then use the network controller 380A to send the query to a second computing device 300B (the "server").
  • the second computing device 300B will perform the search and then send the search results to the first computing device
  • the user will then select a document to display, and the first computing device 300A will send the user's selection to either the second computing device 300B or a third computing device 300C (another "server").
  • the first computing device 300A will send the user's selection to either the second computing device 300B or a third computing device 300C (another "server").
  • second computing device 300B or third computing device 300C will then generate combined image 200 by performing the method of FIG. 5 and send combined image 200 to the first computing device 300A.
  • First computing device 300A then displays combined image 200 to the user using display 350.
  • the first computing device 300A never contains a complete copy of either the underlying electronic document or an undistorted image of the underlying electronic document.
  • second computing device 300B or third computing device 300C sends to the first computing device 300A the requested electronic document, an undistorted image 100 A of the electronic document, and/ or a distorted image IOOB of the electronic document.
  • First computing device 300A then generates combined image 200 by performing the method of FIG. 5 and displays combined image 200 to the user using display 350.
  • these quantities take the form of electrical or magnetic signals capable of being stored, transferred, combined, compared, and otherwise manipulated. It has proven convenient at times, principally for reasons of common usage, to refer to these signals as bits, values, elements, symbols, characters, terms, numbers, or the like.
  • the present invention also relates to an apparatus for performing the operations herein.
  • This apparatus may be specially constructed for the required purposes, or it may comprise a general-purpose computer selectively activated or reconfigured by a computer program stored in the computer.
  • a computer program may be stored in a computer readable storage medium, such as, but is not limited to, any type of disk including floppy disks, optical disks, CD-ROMs, and magnetic-optical disks, read-only memories (ROMs), random access memories (RAMs), EPROMs, EEPROMs, magnetic or optical cards, or any type of media suitable for storing electronic instructions, and each coupled to a computer system bus.
  • the present invention provides various mechanisms for automatically presenting an analysis report for a prospective trade or other transaction, with a minimum of user effort.
  • One skilled in the art will recognize that the particular examples described herein are merely illustrative of representative embodiments of the invention, and that other arrangements, methods, architectures, and configurations may be implemented without departing from the essential characteristics of the invention. Accordingly, the disclosure of the present invention is intended to be illustrative, but not limiting, of the scope of the invention, which is set forth in the following claims.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Computer Security & Cryptography (AREA)
  • Processing Or Creating Images (AREA)
  • Image Processing (AREA)
  • Editing Of Facsimile Originals (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

A software module is presented that enables a person to determine the relevance of an electronic document while preventing the person from making a complete copy of the document. In one embodiment, this is accomplished by displaying an image that represents a region of interest and conveys the context of the region of interest within the document while distorting other portions of the document. In one embodiment, the software module is used in conjunction with a search engine to generate an image of a search result document.

Description

IMAGE DISTORTION FOR CONTENT SECURITY
Inventor: Joseph K. O'Sullivan
BACKGROUND OF THE INVENTION Field of the Invention
[0001] The present invention relates to document security and, more particularly, to preventing a user from obtaining a complete copy of a document. Description of the Background Art
[0002] It is easier to make a complete copy of information in electronic form than it is to make a complete copy of information in physical form. This fact makes content owners wary of making their electronic information accessible by the public. However, content owners desire to provide their content to users, often for a fee, and would benefit by having this information be searchable, in order to assist users in finding content that is relevant to their interests and needs. Users of search engines in particular expect to be able to view the relevant portions of a document or other content prior to purchasing the content. However, providing users access to the relevant portions typically results in giving users access to the entire document in a way that allows the user to make a complete copy of the content without paying for it.
[0003] Alternatively, it is possible to prohibit users' access to the relevant portions of a document until payment is received. However, in that situation, users are unable to see the relevant portions of the document and thus cannot best judge whether the document satisfies their interests or needs and, as a result, are less likely to purchase the content. Various other technologies have been developed with the goal of allowing a user to view a document while preventing the user from making a copy of it. These technologies include, for example, modifying the user's browser to disable printing and specifying that an image, if printed, should be blank. While many technologies exist, each of them can be circumvented.
[0004] What is needed is a way to allow a user to view an electronic document while preventing the user from making a copy of it. Summary Of the Invention
[0005] A method for generating an image is presented, wherein the image displays a document, and the document is relevant to a search query. The method comprises generating a first image portion, the first image portion containing a region of interest, the region of interest being a portion of the document that is relevant to the search query; generating a second image portion, the second image portion comprising a second portion of the document that contains the region of interest, the second image portion being distorted; and generating an image comprising the first image portion and the second image portion.
Brief Description of the Drawings
[0006] FIG. IA illustrates an undistorted image of a document.
[0007] FIG. IB illustrates a distorted image of the same document as that shown in FIG. IA, according to one embodiment of the invention.
[0008] FIG. 2A illustrates an image of the same document as that shown in
FIG. IA, according to one embodiment of the invention.
[0009] FIG. 2B illustrates an image of the same document as that shown in
FIG. IA, according to another embodiment of the invention.
[0010] FIG. 3 illustrates a block diagram of a general-purpose computing device for implementing the invention according to one embodiment.
[0011] FIG. 4 illustrates a block diagram of a software architecture for a system according to one embodiment of the invention.
[0012] FIG. 5 illustrates a flowchart of a method performed by a main program, according to one embodiment of the invention.
[0013] FIG. 6A illustrates an image similar to the image shown in FIG. 2A where a search term is underlined, according to one embodiment of the invention.
[0014] FIG. 6B illustrates an image similar to the image shown in FIG. 2B where a search term is underlined, according to one embodiment of the invention.
[0015] The figures depict a preferred embodiment of the present invention for purposes of illustration only. One skilled in the art will readily recognize from the following discussion that alternative embodiments of the structures and methods illustrated herein may be employed without departing from the principles of the invention described herein.
Detailed Description of the Preferred Embodiments
[0016] Search engine results typically comprise a list of links to electronic documents that satisfy a search query. In this disclosure, a "document" is understood to include any textual, graphical, visual, multimedia, or other type of work for which a visual representation can be derived and presented to a user. In order to determine whether a particular electronic document is relevant to a user's interests or needs, the user views the document. This is generally performed by clicking on the link associated with the document, which causes the document to be displayed.
[0017] Although the entire document is usually displayed, a document's relevancy can frequently be determined based on a portion of the document that is relevant to the search terms (a "region of interest"). A ROI can be, for example, a word, a sentence, a paragraph, a table, a graphic, or any other textual, graphical, visual, multimedia, or video element or the like, depending on the type of content involved. While the user does not need to see the entire document in order to determine whether it is relevant, it is useful to know the context of the ROI within the document.
[0018] One embodiment of the invention enables a person to determine the relevance of an electronic document while preventing the person from making a complete copy of the document. In one embodiment, this is accomplished by displaying an image that represents a ROI and conveys the context of the ROI within the document while distorting other portions of the document. [0019] FIG. IA illustrates an undistorted image of a document. Image IOOA is a single page of a lengthy document, and is exemplary of images shown by conventional imaging tools that are used to display electronic documents. If so inclined, a user can copy the entirety of the text (or image) portions shown and use these copied portions without permission from, or payment to, the owner of the document. Image IOOA is derived from a document that can be in, for example, text format, image format, a markup language, a page description language, or other format.
[0020] FIG. IB illustrates a distorted image of the same document as that shown in FIG. IA, according to one embodiment of the invention. Image IOOB may be created directly from the underlying document, or it may be created from an undistorted image of the underlying document, such as image 10OA. Using the second option, image IOOB is created by distorting the undistorted image 10OA. While there are many ways to distort an image, in one embodiment, the image IOOA is distorted by using pixelation and also by decreasing the brightness level of portions of the image that are outside of a region of interest of the user. [0021] In one embodiment, the user is not shown a complete, undistorted image of the document and thus is prevented from making a copy of the undistorted document. However, the user is still able to determine the relevance of the document to the user's needs, and thus, for example, whether the user should purchase the document or not. FIG. 2A illustrates an image of the same document as that shown in FIG. IA, according to one embodiment of the invention. FIG. 2B illustrates an image of the same document as that shown in FIG. IA, according to another embodiment of the invention.
[0022] In one embodiment, an image 200 enables a user to determine the relevance of the underlying document by displaying an undistorted image portion 210 of a first portion of the document and a distorted image portion 220 of a second portion of the document. In one embodiment, the second portion of the document is one page of the document (for example, if the document is a multi-page document). In another embodiment, the second portion of the document is an area of the document (for example, if the document is graphical).
[0023] In the illustrated embodiments, image 200A comprises image portions
210A and 220A, and image 200B comprises image portions 210B and 220B. As illustrated in FIGS. 2A and 2B, the first portions 210A, 220A of the documents comprise three partial lines of text, with the first partial line being "This is sample text." The second portions 210B, 220B comprise the remaining contents of the page represented by image 200. [0024] In a preferred embodiment, the first portion of the document is the user's ROI (i.e., a portion of the document that is relevant to the user's search terms). The contents of image portion 210, which displays the user's ROI, should be readable by a typical user so that the user can determine whether the ROI is relevant. In one embodiment, image portion 210 is undistorted, similar to image 10OA. In another embodiment, image portion 210 is modified to help the user determine the relevance of the document. For example, image portion 210 may indicate the presence of search terms by displaying these terms with underlining, or outlining, or highlighting. FIG. 6A illustrates an image similar to the image shown in FIG. 2A where a search term is underlined, according to one embodiment of the invention. FIG. 6B illustrates an image similar to the image shown in FIG. 2B where a search term is underlined, according to one embodiment of the invention. [0025] In a preferred embodiment, the second portion of the document is that which corresponds to the page that is represented by image 200. Image portion 220, which displays the second portion of the document, should be distorted so that its contents are unreadable by a typical user or otherwise degraded to devalue or impair a user's use or copying of them. For example, an image portion 220 can be pixilated, blurred, tinted, or converted to a lower resolution.
[0026] In one embodiment, the relative locations of undistorted image portion
210 and distorted image portion 220 within image 200 convey the context of the ROI within the page of the document that is being displayed. In FIG. 2A, f dr example, image 200A shows undistorted image portion 210A being located "on top of" distorted image portion 220A at a similar place to where the ROI would be located within the page of the document that is being displayed. In contrast, in FIG. 2B, image 200B shows undistorted image portion 210B being located next to distorted image portion 220B and also shows a "callout" 230 from distorted image portion 220B to undistorted image portion 210B.
[0027] Embodiments of the invention will now be further described below with reference to FIGS. 3-5. FIG. 3 illustrates a block diagram of a general-purpose computing device for implementing the invention according to one embodiment. The computing device 300 preferably includes a processor 310, a main memory 320, a data storage device 330, and a network controller 380, all of which are communicatively coupled to a system bus 340. Computing device 300 may be, for example, a workstation, a desktop computer, a laptop computer, a tablet computer, a personal digital assistant (PDA), or any other type of computing device.
[0028] Processor 310 processes data signals and comprises various computing architectures including a complex instruction set computer (CISC) architecture, a reduced instruction set computer (RISC) architecture, or an architecture implementing a combination of instruction sets. Although only a single processor is shown in FIG. 3, multiple processors may be included.
[0029] Main memory 320 stores instructions and/ or data that are executed by processor 310. The instructions and/ or data comprise code for performing any and/ or all of the techniques described herein. Main memory 320 is preferably a dynamic random access memory (DRAM) device, a static random access memory
(SRAM) device, or some other memory device known in the art.
[0030] Data storage device 330 stores data and instructions for processor 310 and comprises one or more devices including a hard disk drive, a floppy disk drive, a CD-ROM device, a DVD-ROM device, a DVD-RAM device, a DVD-RW device, a flash memory device, or some other mass storage device known in the art.
[0031] Network controller 380 links the computing device 300 to a network
(not shown).
[0032] System bus 340 represents a shared bus for communicating information and data throughout the computing device 300. System bus 340 represents one or more buses including an industry standard architecture (ISA) bus, a peripheral component interconnect (PCI) bus, a universal serial bus (USB), or some other bus known in the art to provide similar functionality.
[0033] Additional components that may be coupled to the computing device
300 through system bus 340 include a display device 350, a keyboard 360, and a cursor control device 370. Display device 350 represents any device equipped to display electronic images and data to a local user or maintainer. Display device 350 is a cathode ray tube (CRT), a liquid crystal display (LCD), or any other similarly equipped display device, screen, or monitor. Keyboard 360 represents an alphanumeric input device coupled to computing device 300 to communicate information and command selections to processor 310. Cursor control device 370 represents a user input device equipped to communicate positional data as well as command selections to processor 310. Cursor control device 370 includes a mouse, a trackball, a stylus, a pen, cursor direction keys, or other mechanisms to cause movement of a cursor.
[0034] It should be apparent to one skilled in the art that computing device
300 includes more or fewer components than those shown in FIG. 3 without departing from the spirit and scope of the present invention. For example, computing device 300 may include additional memory, such as, for example, a first or second level cache or one or more application specific integrated circuits (ASICs). As noted above, computing device 300 may be comprised solely of ASICs. In addition, components may be coupled computing device 300 including, for example, image scanning devices, digital still or video cameras, or other devices that may or may not be equipped to capture and/ or download electronic data to/ from computing device 300.
[0035] FIG. 4 illustrates a block diagram of a software architecture for a system according to one embodiment of the invention. Generally, several code modules and memory storage areas are stored in the memory 320 for generating an image that represents a portion of a document and conveys the context of that portion within the document. Specifically, the code modules and memory storage areas include a main program module 400, a document-to-image conversion module 410, an image distortion/ modification module 420, an image generation module 430, and a document and image repository module 440. Code modules 400, 410, 420, and 430 and memory storage area 440 are communicatively coupled to each other. [0036] Main program module 400 transmits instructions and data to as well as receives data from each code module and memory.
[0037] Document-to-image conversion module 410 generates, given an electronic document, an image of at least one page of that document. In a typical embodiment, document-to-image conversion module 410 generates a separate image for each page of the document that contains one or more of the search terms (or conceptually related terms) of the user's query.
[0038] In one embodiment, document-to-image conversion module 410 generates undistorted image 10OA. Undistorted image IOOA may be cropped to display only the user's ROI and then used as undistorted image portion 210. Alternatively, undistorted image IOOA may be distorted using image distortion/ modification module 420 and then used as distorted image portion 220. In one embodiment, after document-to-image conversion module 410 has generated undistorted image IOOA, undistorted image IOOA is stored using document and image repository module 440 so that undistorted image IOOA does not have to be generated again.
[0039] In an alternative embodiment, document-to-image conversion module
410 generates distorted image 10OB. Distorted image IOOB may be used as distorted image portion 220. In one embodiment, after document-to-image conversion module 410 has generated distorted image IOOB, distorted image IOOB is stored using document and image repository module 440 so that distorted image IOOB does not have to be generated again. Many distortion methods may be used. These methods include, for example, pixelation, change of brightness, change of contrast, blurring, and image filtering. Document-to-image conversion module 410 may use one or more of these methods to generate distorted image IOOB. [0040] Document-to-image conversion module 410 may also generate an image that has been modified based on the user's search terms (e.g., by highlighting the search terms within the image). This modified image could be either undistorted or distorted. If the modified image is undistorted, it could be cropped to display only the user's ROI and then used as undistorted image portion 210. If the modified image is distorted, it could be used as distorted image portion 220. In one embodiment, a modified image would not be saved because its use is limited to a query containing the same search terms.
[0041] Document-to-image conversion module 410 can generate an image in several ways. If the electronic version of the original document is a PDF document, for example, document-to-image conversion module 410 can use the capabilities of PDF software to output the document's contents as an image. If it is a word processing file, document-to-image conversion module 410 can print the document's contents to a file (rather than to a printer) as an image. If it is an image (e.g., a physical document that has been scanned), document-to-image conversion module 410 can further process the image as necessary. For example, document-to-image conversion module 410 can divide the image into several parts and/ or reduce the resolution of the image by downsampling. Another possibility is for document-to- image conversion module 410 to use a software conversion program that converts a specific type of electronic file to an image.
[0042] Image distortion/ modification module 420 generates, given an image, a different version of that image. In one embodiment, image distortion/ modification module 420 generates a distorted version of the image 10OB. As discussed above with reference to document-to-image conversion module 410, distorted image IOOB may then be stored and/ or used as distorted image portion 220. Many distortion methods may be used. These methods include, for example, pixelation, change of brightness, change of contrast, blurring, and image filtering. Image distortion/ modification module 420 may use one or more of these methods to generate distorted image IOOB.
[0043] In another embodiment, image distortion/ modification module 420 generates an image that has been modified based on the user's search terms (e.g., by highlighting the search terms within the image). This modified image could be either distorted or undistorted. As discussed above with reference to document-to-image conversion module 410, this modified image could be used as distorted image portion 220 or cropped and then used as undistorted image portion 210. In one embodiment, the modified image would not be saved.
[0044] Image generation module 430 generates an image 200 that 1) represents a portion of a document (such as a ROI) and 2) conveys the context of that portion within the document. In one embodiment, image 200 comprises image portions 210 and 220. Image portion 210 is used to represent the ROI, while image portions 210 and 220 are used to convey the context of the ROI by indicating the location of the ROI within the document. [0045] Many types of images 200 can be used to indicate the context of the
ROI. One simple example is shown in FIG. 2A. Image 200A is a composite image comprising image portions 210A and 220A such that the combination of image portions 210A and 220A appears to be a single document. In one embodiment, image portion 210A is overlaid on image portion 220A such that image portion 210A covers the portion of image portion 220A that contains the ROI.
[0046] In one embodiment, image portion 210A has a similar appearance to image portion 220A except that image portion 220A is distorted and image portion 210A is not. In another embodiment, image portion 210A has a different appearance from image portion 220A, besides the fact that image portion 220A is distorted and image portion 210A is not. This difference in appearance helps distinguish image portion 210A from the rest of image 200A and thereby makes it easier for the user to find image portion 210A within image 200A. For example, the font and/ or background color of image portion 210A may differ from the font and/ or background color of image portion 220A. Similarly, image portion 210A may be outlined, forming a bounding box (e.g., a rectangle) that extends a minimum distance (e.g., 0.5") outside of the contents of image portion 210A. [0047] Another example of an image that can be used to indicate the context of the ROI is shown in FIG. 2B. Image 200B similarly comprises image portions 210B and 220B, but image 200B does not overlay image portion 210B onto image portion 220B. Instead, image 200B places image portion 210B outside of image 220B and uses a "callout" 230 from image portion 210B to the location of the ROI within distorted image 220B.
[0048] In one embodiment, image generation module 430 generates a location map of the displayed document page showing the location of the ROI. Image generation module 430 then uses this map to generate image 200 such that image 200 indicates the context of the ROI. In one embodiment, image generation module 430 determines the location of the ROI based on the locations of words within the ROI. The locations of these words are obtained by querying document and image repository module 440. [0049] Document and image repository module 440 stores documents and/ or images. These images may include, for example, undistorted images IOOA of a document and distorted images IOOB of a document. If a document exists in electronic format, the electronic format is stored in document and image repository module 450. If no electronic format exists, then the document is digitized by, for example, scanning the document and/ or performing Optical Character Recognition (OCR) on it. The results are then stored in document and image repository module 450.
[0050] Document and image repository module 440 also stores positions of words within documents and/ or images. For example, document and image repository module 440 stores, for each word in an image or document, the dimensions of the smallest box that can enclose the word (the word's "bounding box") and the location of the box in the image or document (e.g., in x,y coordinates). Given a file that contains text, determining a word's bounding box is known to those of ordinary skill in the art. In one embodiment, if the file is an image file, the image is converted to text by OCR' ing it. As a by-product of the OCR process, the dimensions and locations of bounding boxes can be determined. User Scenario
[0051] FIG. 5 illustrates a flowchart of a method performed by a main program, according to one embodiment of the invention. This method may be used, for example, in conjunction with a search engine. Before the method of FIG. 5 begins, a user enters a query into a search engine. The query may contain various search terms and expressions.
[0052] The search engine then generates a set of results, typically a list of documents. Each result represents a reference to a document that is relevant to the query. A document can be relevant to a query because, for example, its contents directly "match" the query terms (e.g., using a textual match). Alternatively, a document can be relevant because its contents are conceptually, semantically, or topically related to the query terms. Similarly, a document can be relevant because meta-information associated with the document (e.g., the document's author or publication date) satisfy the query. The particular way in which the search engine determines relevant documents is not material to the invention, which may be used with any type of search engine.
[0053] When a user selects one of the search results (e.g., by clicking on a link of the document's name), the search engine determines a portion of the document that is relevant to the query (a ROI). The search engine also determines where query terms appear in the document, if at all. This process is known to those of ordinary skill in the art. Main program module 400 then begins 500.
[0054] Steps 510 and 520 may occur in any order, including simultaneously.
Main program module 400 generates 510 distorted image portion 220. Distorted image portion 220 is, for example, a page of the selected document that contains the user's ROI. In one embodiment, distorted image portion 220 is not modified based on the user's query. In this embodiment, main program module 400 uses a distorted image of the selected page IOOB as distorted image portion 220. There are several ways to obtain distorted image IOOB. A few of these ways are described below. [0055] In one embodiment, main program module 400 retrieves distorted image IOOB from document and image repository module 440 if image IOOB exists. [0056] In another embodiment, if image IOOB does not exist, main program module 400 retrieves an undistorted image of the selected page IOOA from document and image repository module 440 if image IOOA exists. If image IOOA does exist, main program module 400 distorts image IOOA using image distortion/ modification module 420, thereby generating image IOOB. In one embodiment, main program module 400 also stores image IOOB in document and image repository module 440 for later use.
[0057] In yet another embodiment, if image IOOA does not exist, main program module 400 retrieves the selected document from document and image repository module 440. Main program module 400 then generates an image from the document using document-to-image conversion module 410. In one embodiment, main program module 400 uses document-to-image conversion module 410 to generate distorted image IOOB. In one embodiment, main program module 400 also stores image IOOB in document and image repository module 440 for later use. [0058] In another embodiment, main program module 400 uses document-to- image conversion module 410 to generate undistorted image 10OA. In one embodiment, main program module 400 stores image IOOA in document and image repository module 440 for later use. Main program module 400 then distorts image IOOA using image distortion/ modification module 420, thereby generating image 10OB. In one embodiment, main program module 400 also stores image IOOB in document and image repository module 440 for later use.
[0059] In another embodiment, distorted image portion 220 is modified based on the user's query. In this embodiment, main program module 400 obtains distorted image IOOB as described above. Then, main program module 400 uses image distortion/ modification module 420 to modify image IOOB based on the user's query. This modified image is then used as distorted image portion 220. [0060] Main program module 400 generates 520 undistorted image portion
210. Undistorted image portion 210 is, for example, the user's ROI. In one embodiment, undistorted image portion 210 is not modified based on the user's query. In this embodiment, main program module 400 obtains an undistorted image of the selected page IOOA and then crops this image to show the user's ROI. The cropped image is then used as undistorted image portion 210. There are several ways to obtain undistorted image IOOA. A few of these ways are described below. [0061] In one embodiment, main program module 400 retrieves undistorted image IOOA from document and image repository module 440 if image IOOA exists. [0062] In another embodiment, if image IOOA does not exist, main program module 400 retrieves the selected document from document and image repository module 440. Main program module 400 then uses document-to-image conversion module 410 to generate, from the document, undistorted image IOOA. In one embodiment, main program module 400 also stores image IOOA in document and image repository module 440 for later use.
[0063] In another embodiment, undistorted image portion 210 is modified based on the user's query. In this embodiment, main program module 400 obtains undistorted image IOOA as described above. Then, main program module 400 uses image distortion/ modification module 420 to modify image IOOA based on the user's query. This modified image is then cropped and used as undistorted image portion 210.
[0064] Finally, main program module 400 uses image generation module 430 to generate combined image 200 using undistorted image portion 210 and distorted image portion 220. Main program module 400 then ends 540, and combined image
200 is displayed to the user.
Additional Embodiments
[0065] In one embodiment, more than one computing device 300 is used, such as in a client-server setting. For example, a user may input a query into a search engine using a first computing device 300A (the "client"). The first computing device
300 A will then use the network controller 380A to send the query to a second computing device 300B (the "server"). The second computing device 300B will perform the search and then send the search results to the first computing device
300A using the network controller 380B.
[0066] The user will then select a document to display, and the first computing device 300A will send the user's selection to either the second computing device 300B or a third computing device 300C (another "server").
[0067] In a preferred embodiment, second computing device 300B or third computing device 300C will then generate combined image 200 by performing the method of FIG. 5 and send combined image 200 to the first computing device 300A.
First computing device 300A then displays combined image 200 to the user using display 350. In this embodiment, the first computing device 300A never contains a complete copy of either the underlying electronic document or an undistorted image of the underlying electronic document.
[0068] In an alternate embodiment, second computing device 300B or third computing device 300C sends to the first computing device 300A the requested electronic document, an undistorted image 100 A of the electronic document, and/ or a distorted image IOOB of the electronic document. First computing device 300A then generates combined image 200 by performing the method of FIG. 5 and displays combined image 200 to the user using display 350. [0069] In the above description, for purposes of explanation, numerous specific details are set forth in order to provide a thorough understanding of the invention. It will be apparent, however, to one skilled in the art that the invention can be practiced without these specific details. In other instances, structures and devices are shown in block diagram form in order to avoid obscuring the invention. [0070] Reference in the specification to "one embodiment" or "an embodiment" means that a particular feature, structure, or characteristic described in connection with the embodiment is included in at least one embodiment of the invention. The appearances of the phrase "in one embodiment" in various places in the specification are not necessarily all referring to the same embodiment. [0071] Some portions of the detailed description are presented in terms of algorithms and symbolic representations of operations on data bits within a computer memory. These algorithmic descriptions and representations are the means used by those skilled in the data processing arts to most effectively convey the substance of their work to others skilled in the art. An algorithm is here, and generally, conceived to be a self -consistent sequence of steps leading to a desired result. The steps are those requiring physical manipulations of physical quantities. Usually, though not necessarily, these quantities take the form of electrical or magnetic signals capable of being stored, transferred, combined, compared, and otherwise manipulated. It has proven convenient at times, principally for reasons of common usage, to refer to these signals as bits, values, elements, symbols, characters, terms, numbers, or the like.
[0072] It should be borne in mind, however, that all of these and similar terms are to be associated with the appropriate physical quantities and are merely convenient labels applied to these quantities. Unless specifically stated otherwise as apparent from the discussion, it is appreciated that throughout the description, discussions utilizing terms such as "processing" or "computing" or "calculating" or "determining" or "displaying" or the like, refer to the action and processes of a computer system, or similar electronic computing device, that manipulates and transforms data represented as physical (electronic) quantities within the computer system's registers and memories into other data similarly represented as physical quantities within the computer system memories or registers or other such information storage, transmission or display devices.
[0073] The present invention also relates to an apparatus for performing the operations herein. This apparatus may be specially constructed for the required purposes, or it may comprise a general-purpose computer selectively activated or reconfigured by a computer program stored in the computer. Such a computer program may be stored in a computer readable storage medium, such as, but is not limited to, any type of disk including floppy disks, optical disks, CD-ROMs, and magnetic-optical disks, read-only memories (ROMs), random access memories (RAMs), EPROMs, EEPROMs, magnetic or optical cards, or any type of media suitable for storing electronic instructions, and each coupled to a computer system bus.
[0074] The algorithms and displays presented herein are not inherently related to any particular computer or other apparatus. Various general-purpose systems may be used with programs in accordance with the teachings herein, or it may prove convenient to construct more specialized apparatuses to perform the required method steps. The required structure for a variety of these systems appears from the description. In addition, the present invention is not described with reference to any particular programming language. It will be appreciated that a variety of programming languages may be used to implement the teachings of the invention as described herein.
[0075] The present invention provides various mechanisms for automatically presenting an analysis report for a prospective trade or other transaction, with a minimum of user effort. One skilled in the art will recognize that the particular examples described herein are merely illustrative of representative embodiments of the invention, and that other arrangements, methods, architectures, and configurations may be implemented without departing from the essential characteristics of the invention. Accordingly, the disclosure of the present invention is intended to be illustrative, but not limiting, of the scope of the invention, which is set forth in the following claims.

Claims

ClaimsWhat is claimed is:
1. A computer-implemented method for generating an image, wherein the image displays a document, and wherein the document is relevant to a search query, the method comprising: generating a first image portion, the first image portion comprising a region of interest, the region of interest comprising a first portion of the document that is relevant to the search query; generating a second image portion, the second image portion comprising a second portion of the document that contains the region of interest, the second image portion being distorted; and generating an image comprising the first image portion and the second image portion.
2. The method of claim 1, wherein the second portion of the document comprises a page of the document.
3. The method of claim 1, wherein the second portion of the document comprises an area of the document.
4. The method of claim 1, wherein generating the first image portion comprises: generating an undistorted image of the second portion of the document; and cropping the undistorted image.
5. The method of claim 4, wherein generating the undistorted image of the second portion of the document comprises obtaining the undistorted image from a document repository.
6. The method of claim 4, wherein generating the undistorted image of the second portion of the document comprises generating the undistorted image from an electronic document.
7. The method of claim 1, wherein generating the second image portion comprises obtaining the second image portion from a document repository.
8. The method of claim 1, wherein generating the second image portion comprises: generating an undistorted image of the second portion of the document; and distorting the undistorted image.
9. The method of claim 1, further comprising modifying, responsive to the search query, one of the first image portion and the second image portion.
10. The method of claim 9, wherein modifying, responsive to the search query, one of the first image portion and the second image portion comprises one of underlining, outlining, and highlighting a search term in one of the first image portion and the second image portion.
11. The method of claim 1, wherein generating the image comprising the first image portion and the second image portion comprises generating a composite image of the first image portion overlaid on the second image portion.
12. The method of claim 11, wherein generating the composite image of the first image portion overlaid on the second image portion comprises outlining the first image portion.
13. The method of claim 11, wherein generating the composite image of the first image portion overlaid on the second image portion comprises modifying one of a font color and a background color of the first image portion.
14. The method of claim 1, wherein generating the image comprising the first image portion and the second image portion comprises generating an image, the image comprising the first image portion, the second image portion, and a callout indicating the first image portion and the second image portion.
15. A system for generating an image, wherein the image displays a document, and wherein the document is relevant to a search query, the system comprising: a software portion configured to generate a first image portion, the first image portion comprising a region of interest, the region of interest comprising a first portion of the document that is relevant to the search query; a software portion configured to generate a second image portion, the second image portion comprising a second portion of the document that contains the region of interest, the second image portion being distorted; and a software portion configured to generate an image comprising the first image portion and the second image portion.
16. The system of claim 15, wherein the second portion of the document comprises a page of the document.
17. The system of claim 15, wherein the second portion of the document comprises an area of the document.
18. The system of claim 15, wherein the software portion configured to generate the first image portion comprises: a software portion configured to generate an undistorted image of the second portion of the document; and a software portion configured to crop the undistorted image.
19. The system of claim 18, wherein the software portion configured to generate the undistorted image of the second portion of the document comprises a software portion configured to obtain the undistorted image from a document repository.
20. The system of claim 18, wherein the software portion configured to generate the undistorted image of the second portion of the document comprises a software portion configured to generate the undistorted image from an electronic document.
21. The system of claim 15, wherein the software portion configured to generate the second image portion comprises a software portion configured to obtain the second image portion from a document repository.
22. The system of claim 15, wherein the software portion configured to generate the second image portion comprises: a software portion configured to generate an undistorted image of the second portion of the document; and a software portion configured to distort the undistorted image.
23. The system of claim 15, further comprising a software portion configured to modify, responsive to the search query, one of the first image portion and the second image portion.
24. The system of claim 23, wherein the software portion configured to modify, responsive to the search query, one of the first image portion and the second image portion comprises a software portion configured to perform one of underlining, outlining, and highlighting a search term in one of the first image portion and the second image portion.
25. The system of claim 15, wherein the software portion configured to generate the image comprising the first image portion and the second image portion comprises a software portion configured to generate a composite image of the first image portion overlaid on the second image portion.
26. The system of claim 25, wherein the software portion configured to generate the composite image of the first image portion overlaid on the second image portion comprises a software portion configured to outline the first image portion.
27. The system of claim 25, wherein the software portion configured to generate the composite image of the first image portion overlaid on the second image portion comprises a software portion configured to modify one of a font color and a background color of the first image portion.
28. The system of claim 15, wherein the software portion configured to generate the image comprising the first image portion and the second image portion comprises a software portion configured to generate an image, the image comprising the first image portion, the second image portion, and a callout indicating the first image portion and the second image portion.
29. A computer readable medium containing a computer program product for generating an image, wherein the image displays a document, and wherein the document is relevant to a search query, the computer program product comprising program code for: generating a first image portion, the first image portion comprising a region of interest, the region of interest comprising a first portion of the document that is relevant to the search query; generating a second image portion, the second image portion comprising a second portion of the document that contains the region of interest, the second image portion being distorted; and generating an image comprising the first image portion and the second image portion.
PCT/US2005/034141 2004-09-22 2005-09-21 Image distortion for content security WO2006036785A1 (en)

Priority Applications (6)

Application Number Priority Date Filing Date Title
BRPI0515479-0A BRPI0515479A2 (en) 2004-09-22 2005-09-21 computer imaging system and its computer-readable method
JP2007532687A JP2008513898A (en) 2004-09-22 2005-09-21 Image transformation for content security
CN2005800372319A CN101049007B (en) 2004-09-22 2005-09-21 Image distortion for content security
CA2581366A CA2581366C (en) 2004-09-22 2005-09-21 Image distortion for content security
EP05798318A EP1792476A1 (en) 2004-09-22 2005-09-21 Image distortion for content security
AU2005289725A AU2005289725C1 (en) 2004-09-22 2005-09-21 Image distortion for content security

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US10/948,734 2004-09-22
US10/948,734 US7561755B2 (en) 2004-09-22 2004-09-22 Image distortion for content security

Publications (1)

Publication Number Publication Date
WO2006036785A1 true WO2006036785A1 (en) 2006-04-06

Family

ID=35463958

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2005/034141 WO2006036785A1 (en) 2004-09-22 2005-09-21 Image distortion for content security

Country Status (9)

Country Link
US (1) US7561755B2 (en)
EP (1) EP1792476A1 (en)
JP (1) JP2008513898A (en)
KR (1) KR100948319B1 (en)
CN (1) CN101049007B (en)
AU (1) AU2005289725C1 (en)
BR (1) BRPI0515479A2 (en)
CA (1) CA2581366C (en)
WO (1) WO2006036785A1 (en)

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7664751B2 (en) 2004-09-30 2010-02-16 Google Inc. Variable user interface based on document access privileges
US7603355B2 (en) * 2004-10-01 2009-10-13 Google Inc. Variably controlling access to content
US20070203776A1 (en) * 2005-12-28 2007-08-30 Austin David J Method of displaying resume over the internet in a secure manner
JP5309570B2 (en) * 2008-01-11 2013-10-09 株式会社リコー Information retrieval apparatus, information retrieval method, and control program
US8229912B2 (en) 2009-05-06 2012-07-24 Mentis Technology, Llc Enhanced search engine
US8612845B2 (en) * 2009-06-29 2013-12-17 Palo Alto Research Center Incorporated Method and apparatus for facilitating directed reading of document portions based on information-sharing relevance
KR101548951B1 (en) 2014-01-24 2015-09-01 주식회사 인프라웨어 A server for providing an electrical document which is converted to an image, and a method for proving an electrical document using the same
US9535883B2 (en) 2014-10-24 2017-01-03 Dropbox, Inc. Modifying native document comments in a preview
US10015364B2 (en) * 2015-05-11 2018-07-03 Pictureworks Pte Ltd System and method for previewing digital content
US10140880B2 (en) * 2015-07-10 2018-11-27 Fujitsu Limited Ranking of segments of learning materials

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5881287A (en) * 1994-08-12 1999-03-09 Mast; Michael B. Method and apparatus for copy protection of images in a computer system
US5893101A (en) * 1994-06-08 1999-04-06 Systems Research & Applications Corporation Protection of an electronically stored image in a first color space by the alteration of digital component in a second color space
US20020032863A1 (en) * 2000-04-26 2002-03-14 Contents-Korea Co., Ltd. System and method for performing digital watermarking in realtime using encrypted algorithm
EP1359758A1 (en) * 2002-04-12 2003-11-05 Hewlett Packard Company, a Delaware Corporation Efficient encryption of image data

Family Cites Families (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5960448A (en) * 1995-12-15 1999-09-28 Legal Video Services Inc. System and method for displaying a graphically enhanced view of a region of a document image in which the enhanced view is correlated with text derived from the document image
JP2000113050A (en) * 1998-10-01 2000-04-21 Hitachi Ltd Electronic book system
JP3581048B2 (en) * 1999-06-29 2004-10-27 三菱電機株式会社 Contaminated content distribution system, contaminated content using apparatus, and contaminated content using method
AUPQ163899A0 (en) * 1999-07-14 1999-08-05 Canon Kabushiki Kaisha Aromated document production from a search environment
JP2001334709A (en) * 2000-05-24 2001-12-04 Seiko Epson Corp System for transmitting image data
JP2002091878A (en) * 2000-09-18 2002-03-29 Casio Comput Co Ltd Information terminal, information reading method, system and method for distributing information
US6799302B1 (en) * 2000-09-19 2004-09-28 Adobe Systems Incorporated Low-fidelity document rendering
US7015910B2 (en) * 2000-12-21 2006-03-21 Xerox Corporation Methods, systems, and computer program products for the display and operation of virtual three-dimensional books
US6883138B2 (en) * 2001-08-08 2005-04-19 Xerox Corporation Methods and systems for generating enhanced thumbnails usable for document navigation
JP2003132071A (en) * 2001-10-25 2003-05-09 Chunichi Shimbunsha Article providing system
JP2005025357A (en) * 2003-06-30 2005-01-27 National Institute Of Information & Communication Technology Medical information disclosure controller, medical information disclosure control method and computer program
GB2403558A (en) * 2003-07-02 2005-01-05 Sony Uk Ltd Document searching and method for presenting the results
JP2005222237A (en) * 2004-02-04 2005-08-18 Mitsubishi Electric Corp Document search display system

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5893101A (en) * 1994-06-08 1999-04-06 Systems Research & Applications Corporation Protection of an electronically stored image in a first color space by the alteration of digital component in a second color space
US5881287A (en) * 1994-08-12 1999-03-09 Mast; Michael B. Method and apparatus for copy protection of images in a computer system
US20020032863A1 (en) * 2000-04-26 2002-03-14 Contents-Korea Co., Ltd. System and method for performing digital watermarking in realtime using encrypted algorithm
EP1359758A1 (en) * 2002-04-12 2003-11-05 Hewlett Packard Company, a Delaware Corporation Efficient encryption of image data

Also Published As

Publication number Publication date
KR20070052344A (en) 2007-05-21
KR100948319B1 (en) 2010-03-17
AU2005289725A1 (en) 2006-04-06
CA2581366C (en) 2013-12-31
AU2005289725B2 (en) 2009-09-03
BRPI0515479A2 (en) 2009-02-03
EP1792476A1 (en) 2007-06-06
US20060061796A1 (en) 2006-03-23
US7561755B2 (en) 2009-07-14
CN101049007B (en) 2010-10-06
CA2581366A1 (en) 2006-04-06
CN101049007A (en) 2007-10-03
AU2005289725C1 (en) 2010-03-11
JP2008513898A (en) 2008-05-01

Similar Documents

Publication Publication Date Title
CA2581366C (en) Image distortion for content security
US8015418B2 (en) Method and apparatus for improved information transactions
US8494281B2 (en) Automated method and system for retrieving documents based on highlighted text from a scanned source
US8583637B2 (en) Coarse-to-fine navigation through paginated documents retrieved by a text search engine
US20040148274A1 (en) Method and apparatus for improved information transactions
US20100315688A1 (en) Method of scanning
Monteil et al. Pathological laughing as a symptom of a tentorial edge tumour.
Joslin National cancer control and cancer registration.
Li et al. Iterative Development of a Web application to support teleconferencing of a distributed tumor board
Harrington et al. Towards a science of document intent
Jowett et al. Review of Early English books online (EEBO)
London et al. possible to provide bearing witness to Dame
BEECHEY ROGERS BOOTS PURE DRUG CO. LTD.
Sare CD-ROM Filings at Trial and Beyond
Westman QUALITY ASSURANCE FOR FAMILY DOCTORS: Report of the quality assurance working party
Rafuse Students, practising MDs should be more aware of sexual, cultural influences, committee says
Wilson TAKE IT TO HEART (video)
Richards ALSON E. BRALEY, md
Gummi cause so continually met with in practiceinasmuch as reference
Murdoch et al. Aberdeen Art Gallery Image Database Project a prototype project to create and maintain a low-cost art image database
Piersol and Company (India), Limited, Calcutta
van der Velde Liebman’s Neuroanatomy, Made Easy and Understandable
Schofield THE DYING PATIENT—TRYING TO TELL: video
Gibbes et al. acceptable.
DENTISTS Copies of the above patents may be obtained for ten cents each by addressing John A. Sanl, Solicitor of Patents, Fen

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KM KP KR KZ LC LK LR LS LT LU LV LY MA MD MG MK MN MW MX MZ NA NG NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SM SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): BW GH GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LT LU LV MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
WWE Wipo information: entry into national phase

Ref document number: 2007532687

Country of ref document: JP

WWE Wipo information: entry into national phase

Ref document number: 2581366

Country of ref document: CA

NENP Non-entry into the national phase

Ref country code: DE

WWE Wipo information: entry into national phase

Ref document number: 2005289725

Country of ref document: AU

WWE Wipo information: entry into national phase

Ref document number: 2005798318

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 1020077008179

Country of ref document: KR

WWE Wipo information: entry into national phase

Ref document number: 552/MUMNP/2007

Country of ref document: IN

ENP Entry into the national phase

Ref document number: 2005289725

Country of ref document: AU

Date of ref document: 20050921

Kind code of ref document: A

WWP Wipo information: published in national office

Ref document number: 2005289725

Country of ref document: AU

WWE Wipo information: entry into national phase

Ref document number: 200580037231.9

Country of ref document: CN

WWP Wipo information: published in national office

Ref document number: 2005798318

Country of ref document: EP

ENP Entry into the national phase

Ref document number: PI0515479

Country of ref document: BR