AU2001272869A1 - Method and means for mobile capture, processing, storage and transmission of text and mixed information containing characters and images

Method and means for mobile capture, processing, storage and transmission of text and mixed information containing characters and images

Info

Publication number
AU2001272869A1
Authority
AU
Australia
Prior art keywords
text
image
information
original
interpreted
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
AU2001272869A
Other versions
AU2001272869B2 (en)
AU2001272869B8 (en)
Inventor
Jacob Weitman
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from SE0002736A (SE517295C2)
Priority claimed from SE0004231A (SE519405C2)
Application filed by Individual
Application granted
Publication of AU2001272869B8
Publication of AU2001272869A1
Publication of AU2001272869B2
Anticipated expiration
Ceased (current legal status)

Description

Method and means for mobile capture, processing, storage and transmission of text and mixed information containing characters and images
There are numerous situations in which there is a genuine need to capture large amounts of information, in the form of text or text plus images, quickly, efficiently and simply, without access to technical resources such as copying machines, scanners, fax machines and computers, which today are commonly available in offices. As an example of a situation where the present invention would be highly useful, consider a journey by air during which the traveller has just read an interesting article, possibly illustrated with images and diagrams, in, let us say, the Financial Times, and wishes either to transmit the corresponding information to a colleague as quickly as possible or to save the article as reference material for himself and others. Today, this reader has the option either to tear out the interesting pages or to take along the complete newspaper. During a conference trip or another longer journey the situation may repeat itself, resulting in a cumbersome practical paper-handling problem.
There is a vast number of similar situations in which one wishes to collect and/or transfer printed information that one has received, without being limited by or dependent on an office with modern resources, e.g., when reading or working in bed due to illness or laziness.
The aim of the present invention is to solve the problem thus indicated in an efficient, practical and flexible way. The solution is based on a combination and further development of available technologies, primarily digital photography, intelligent image processing including OCR, vector graphics, data compression, broadband data transmission and database handling. The basis for the invention is the use of a compact digital camera, preferably equipped with optics providing a wide angle, a large aperture and a large depth of field, also at short distances, where the intelligence is based on software that processes and interprets the entire image in such a way that those parts containing text are recognized, transformed to and stored as, e.g., ASCII or EBCDIC code, while the remaining parts are stored as an image with the desired resolution.
A special characteristic of the method according to the invention is furthermore that the software has intelligence for the interpretation of image qualities such as font and layout, and the ability to use the interpretation to recreate/synthesize a picture, which is matched against (laid over) the original text. If the result of the matching is acceptable, those parts of the original image which contain blocks of text are deleted, whereafter the information stored consists of coded text, layout information and uninterpreted image parts.
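The stored result described above (coded text, layout information and uninterpreted image parts) can be pictured as a small record type. The following is only an illustrative sketch: the class names `TextBlock`, `ImagePart` and `StoredPage`, the bounding-box convention and the field names are assumptions made for this example and do not come from the patent text. The `cp037` codec is Python's built-in codec for one EBCDIC variant and is used here merely to illustrate the EBCDIC alternative mentioned earlier.

```python
from dataclasses import dataclass, field
from typing import List, Tuple

# (left, top, width, height) in pixels; a hypothetical convention for this sketch
Box = Tuple[int, int, int, int]


@dataclass
class TextBlock:
    """An interpreted block of text together with its layout information."""
    box: Box
    text: str
    font: str = "unknown"
    bold: bool = False
    underlined: bool = False

    def coded(self, encoding: str = "ascii") -> bytes:
        # "ascii" or, e.g., "cp037" (an EBCDIC code page shipped with Python).
        # Characters outside the code are replaced by "?" in this sketch.
        return self.text.encode(encoding, errors="replace")


@dataclass
class ImagePart:
    """A region of the original image that was not interpreted as text."""
    box: Box
    pixels: bytes  # raw or compressed image data; the format is left open


@dataclass
class StoredPage:
    """What remains in memory after a successful match: text, layout, images."""
    text_blocks: List[TextBlock] = field(default_factory=list)
    image_parts: List[ImagePart] = field(default_factory=list)

    def stored_bytes(self) -> int:
        # Size of the coded text plus the size of the retained image parts.
        text = sum(len(block.coded()) for block in self.text_blocks)
        images = sum(len(part.pixels) for part in self.image_parts)
        return text + images


page = StoredPage(
    text_blocks=[TextBlock(box=(10, 10, 400, 30), text="Markets rally", bold=True)],
    image_parts=[ImagePart(box=(10, 50, 200, 150), pixels=bytes(64))],
)
print(page.stored_bytes())
```

Keeping the text blocks and image parts in separate lists also mirrors the idea in the claims that the two kinds of information can be processed and transmitted individually and recombined later.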
In those cases where an acceptable match between the original and the recreated/synthesized images of the text blocks has not been achieved, the raw image is stored in its original format. The result of the matching may, e.g., be expressed as the percentage of dots in agreement. Even when the match is very good in percentage terms, there may be single characters, words or passages which have not been correctly interpreted. Such uninterpreted or incorrectly interpreted original information is not deleted from the text block, but rather displayed as a suitably marked image insert in the interpreted text. The user thereby has the opportunity to intervene afterwards and help the programme with the interpretation of the sections thus marked.
A further characteristic of the method according to the invention is that the interpretation software, which in a preferred embodiment of the invention is installed in the camera itself but which may also be implemented in an external unit, includes algorithms based on vector graphical methods for analyzing and storing information about the layout of the original image. This information is used in connection with the matching procedure of the original and the synthesized images and, optionally, when later printing out the synthetic image, in order to recreate a layout which is adapted to the print-out format chosen (e.g. A4) and reproduces the original layout as closely as possible. This is important, because the layout (including aspects such as underlining, italics, subdivision in sections, etc.) may be important for the understanding of content and context.
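The matching criterion mentioned above, the percentage of dots in agreement between the original and the synthesized text image, can be sketched as follows. This is a minimal illustration, not the patented procedure: it assumes both images have already been binarized and aligned to the same size, and the function names and the 98 % threshold are hypothetical values chosen for the example.

```python
from typing import Sequence

# Rows of dots: 0 = background, 1 = ink. Assumed already binarized and aligned.
Bitmap = Sequence[Sequence[int]]


def match_percentage(original: Bitmap, synthetic: Bitmap) -> float:
    """Return the percentage of dots that agree between two equally sized bitmaps."""
    if len(original) != len(synthetic) or any(
        len(a) != len(b) for a, b in zip(original, synthetic)
    ):
        raise ValueError("bitmaps must have identical dimensions")
    total = sum(len(row) for row in original)
    if total == 0:
        raise ValueError("bitmaps must not be empty")
    agreeing = sum(
        o == s
        for row_o, row_s in zip(original, synthetic)
        for o, s in zip(row_o, row_s)
    )
    return 100.0 * agreeing / total


def may_delete_original(
    original: Bitmap, synthetic: Bitmap, threshold: float = 98.0
) -> bool:
    """Decide whether the original text image may be deleted.

    The threshold is a hypothetical value; the source text does not specify one.
    """
    return match_percentage(original, synthetic) >= threshold


original = [[0, 1, 1, 0], [0, 1, 0, 0]]
synthetic = [[0, 1, 1, 0], [0, 0, 0, 0]]
print(match_percentage(original, synthetic))     # 7 of 8 dots agree
print(may_delete_original(original, synthetic))  # below the assumed threshold
```

A global percentage alone cannot locate the individual characters that were misread, which is why the text additionally keeps marked image inserts for uninterpreted passages; a per-character or per-word variant of the same comparison would be one way to find them.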
As an option, the camera may be provided with framing functions, so that only specifically chosen parts of the image are stored and processed, whereby text or image information which is regarded as dispensable (such as a picture with a blue sky and a swaying cornfield in an article about our environment, or a picture of a provocative female in an article on the roles of the sexes) is eliminated already at source.
According to the invention, the information may be tagged already by the software of the intelligent camera, so that later handling of information in databases is facilitated. This is achieved by inherent functionality for the automatic recognition of such characteristics as headings and names of authors, as well as automatic selection of keywords out of headings.
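As an illustration of the automatic selection of keywords out of headings, the sketch below filters a heading against a small stop-word list. The source text does not say how keywords are chosen, so the stop-word approach, the word list and the function name `keywords_from_heading` are all assumptions made for this example.

```python
import re
from typing import List

# A deliberately small, hypothetical stop-word list for English headings.
STOP_WORDS = frozenset({
    "a", "an", "and", "the", "of", "in", "on", "for", "to",
    "with", "is", "are", "by", "at", "from", "as",
})


def keywords_from_heading(heading: str, max_keywords: int = 5) -> List[str]:
    """Pick candidate keywords from a heading, keeping their original order."""
    words = re.findall(r"[A-Za-z][A-Za-z'-]*", heading.lower())
    keywords: List[str] = []
    for word in words:
        # Skip stop words, very short words and repeats.
        if word in STOP_WORDS or len(word) <= 2 or word in keywords:
            continue
        keywords.append(word)
    return keywords[:max_keywords]


print(keywords_from_heading("The Future of Broadband in Europe"))
# → ['future', 'broadband', 'europe']
```

The resulting list could be stored as tags alongside the interpreted text so that later database retrieval does not need to re-analyze the page.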
For greater versatility, the software of the intelligent camera may be extended by options for translation between various languages and/or for interpretation of mathematical symbols and formulas and/or recognition of one or several handwritings. The handwriting recognition may preferably be based on algorithms for self-learning in neural systems. Depending on the state of development with respect to memory and processor capacities, as much as possible of the intelligence is located within the camera itself. However, functions and options which at a given state of development are regarded as too demanding from the point of view of memory or processor capacity and performance may be implemented and executed externally, whereby high-speed communication protocols (such as FireWire, IEEE 1394) may be very useful.
Connecting the intelligent mobile digital camera to a mobile phone with broadband transmission capacity will enable transmission of interpreted and compressed data to one's own database or to third parties. The transmission may be performed either in real time or delayed, based on stored data.
A practically important characteristic of the means according to the invention is that the camera may be equipped for ultra-wide-angle photography, so that, e.g., a whole page of the initially mentioned newspaper publication can be captured in one exposure at a normal distance of observation (0.3 to 0.5 m). This may be achieved either by means of special wide angle lenses, whereby distortions are corrected numerically, or by facet lenses according to the apposition or superposition principle, whereby a complete image is synthesized computationally, or by optics with a scanning arrangement such as a moving mirror, in which case the complete picture is also composed by the software.
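The numerical correction of wide-angle distortion can be illustrated with a one-parameter radial model. The source text does not name a distortion model, so the model used here (distorted point = undistorted point × (1 + k1·r²), in normalized coordinates centred on the optical axis), the coefficient value and the function names are assumptions made for this sketch only.

```python
from typing import Tuple

Point = Tuple[float, float]  # normalized image coordinates, origin on the optical axis


def distort(point: Point, k1: float) -> Point:
    """Apply the assumed radial model: p_d = p_u * (1 + k1 * r_u^2)."""
    x, y = point
    factor = 1.0 + k1 * (x * x + y * y)
    return (x * factor, y * factor)


def undistort(point: Point, k1: float, iterations: int = 20) -> Point:
    """Invert the radial model by fixed-point iteration.

    Starts from the distorted point and repeatedly divides by the distortion
    factor evaluated at the current estimate. This converges for moderate
    distortion; it is not guaranteed for extreme values of k1 * r^2.
    """
    xd, yd = point
    xu, yu = xd, yd
    for _ in range(iterations):
        factor = 1.0 + k1 * (xu * xu + yu * yu)
        xu, yu = xd / factor, yd / factor
    return (xu, yu)


k1 = -0.2                      # negative k1 models barrel distortion (hypothetical value)
true_point = (0.5, 0.3)
seen = distort(true_point, k1)
recovered = undistort(seen, k1)
print(seen, recovered)
```

To correct a whole image, the same inverse mapping would be evaluated for every output pixel and the source image sampled at the corresponding distorted position; a real lens would typically need more than one coefficient, determined by calibration.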
Within the scope of the invention, the intelligent camera may of course also be used as a conventional digital camera.

Claims (11)

  1. Method for mobile intelligent capture, processing, storage and transmission of text and mixed information of text and images, comprising a digital camera with microprocessor, memory and software, characterized thereby that the entire image taken by the camera is analyzed with respect to its text information, that said information is recognized and interpreted by, e.g., OCR techniques and is stored as compressed text code, for further processing and/or transmission.
  2. Method according to claim 1, characterized thereby that text properties such as font, underlining, bold print, etc., are recognized and added to the interpreted text.
  3. Method according to claims 1 and 2, characterized thereby that the original text is analyzed with respect to other specific information, such as subdivision in paragraphs and layout and that the total assembled information about the interpreted text is used to create a synthetic text image, which is compared to the original text image and that the latter is deleted from the memory of the camera when there is a sufficiently good match between the original and the synthetic image.
  4. Method according to claim 3, characterized thereby that text information, which could not be interpreted, is not deleted but displayed in the interpreted/synthetic text as a suitably marked image of the pertinent original character/word/paragraph.
  5. Method according to claims 1-4, characterized thereby that the original image is segmented into two blocks, whereby one block contains the interpreted text information and the other block the remaining relevant information from the original image and that these blocks are tagged such that they can be processed and transmitted individually and whenever desired recombined to create a reproduction of the original image.
  6. Method according to claims 1-5, characterized thereby that in context with reproduction of the recombined image on another format than the format of the original image, the reproduction is performed such that the layout of the reproduced image agrees as closely as possible with that of the original image.
  7. Method according to claims 1-6, characterized thereby that the text information is automatically analyzed with regard to and tagged by such characteristics as name of author and publication and keywords out of headings, thereby facilitating systematic storage and retrieval of information in databases.
  8. Means for mobile intelligent capture, processing, storage and transmission of text and mixed information of text and images, comprising a digital camera with microprocessor, memory and software, characterized thereby that the lens of the camera is designed for ultra-wide-angle.
  9. Means according to claim 8, characterized thereby that distortions in the lens are numerically corrected, so that an undistorted image can be recreated.
  10. Means according to claim 8, characterized thereby that the lens is designed as a facet lens according to the apposition principle, with certain overlapping between the partial images and that a continuous total image is produced by the software.
  11. Means according to claim 8, characterized thereby that the lens is designed as a facet lens according to the superposition principle and that, when required, distortions are corrected by the software.
AU2001272869A 2000-07-19 2001-07-16 Method and means for mobile capture, processing, storage and transmission of text and mixed information containing characters and images Ceased AU2001272869B2 (en)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
SE0002736A SE517295C2 (en) 2000-07-19 2000-07-19 Mobile text and images processing method for converting non electronic information by segmenting original image into blocks and comparing synthetic text image with original
SE0002736-7 2000-07-19
SE0004231-7 2000-11-17
SE0004231A SE519405C2 (en) 2000-07-19 2000-11-17 Applications for an advanced digital camera that interprets the captured image based on its information content, such as transferring the image, ordering a service, controlling a flow, etc.
PCT/SE2001/001637 WO2002013128A1 (en) 2000-07-19 2001-07-16 Method and means for mobile capture, processing, storage and transmission of text and mixed information containing characters and images

Publications (3)

Publication Number Publication Date
AU2001272869B8 (en) 2002-02-18
AU2001272869A1 (en) 2002-05-16
AU2001272869B2 (en) 2007-07-05

Family

ID=26655189

Family Applications (2)

Application Number Title Priority Date Filing Date
AU7286901A Pending AU7286901A (en) 2000-07-19 2001-07-16 Method and means for mobile capture, processing, storage and transmission of text and mixed information containing characters and images
AU2001272869A Ceased AU2001272869B2 (en) 2000-07-19 2001-07-16 Method and means for mobile capture, processing, storage and transmission of text and mixed information containing characters and images

Country Status (12)

Country Link
US (1) US20040101196A1 (en)
EP (1) EP1312041B1 (en)
JP (1) JP2004506274A (en)
KR (1) KR20030024786A (en)
CN (1) CN1443339A (en)
AT (1) ATE341034T1 (en)
AU (2) AU7286901A (en)
BR (1) BR0113000A (en)
DE (1) DE60123441T2 (en)
IL (1) IL153973A0 (en)
SE (1) SE519405C2 (en)
WO (1) WO2002013128A1 (en)

Families Citing this family (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7199804B2 (en) * 2002-05-14 2007-04-03 Microsoft Corporation Ink information in image files
US7009524B2 (en) * 2003-08-04 2006-03-07 Eastman Kodak Company Shelf talker having short and long term information
KR100547738B1 (en) * 2003-08-12 2006-01-31 삼성전자주식회사 Apparatus and method for managing address book in portable terminal with camera
US20060290789A1 (en) * 2005-06-22 2006-12-28 Nokia Corporation File naming with optical character recognition
US7787693B2 (en) * 2006-11-20 2010-08-31 Microsoft Corporation Text detection on mobile communications devices
US20090046306A1 (en) * 2007-08-13 2009-02-19 Green Darryl A Method and apparatus for ordering and printing annotated photographs
CN101753473B (en) * 2008-12-09 2012-08-08 宏碁股份有限公司 Method for instantaneously transmitting interactive image and system using method
WO2010096193A2 (en) * 2009-02-18 2010-08-26 Exbiblio B.V. Identifying a document by performing spectral analysis on the contents of the document
CN101788849B (en) * 2009-12-31 2011-11-16 优视科技有限公司 Optical character recognition input method used for mobile communication equipment system
US9028344B2 (en) * 2010-01-28 2015-05-12 Chsz, Llc Electronic golf assistant utilizing electronic storing
US20150131913A1 (en) * 2011-12-30 2015-05-14 Glen J. Anderson Interactive drawing recognition using status determination
US9430035B2 (en) * 2011-12-30 2016-08-30 Intel Corporation Interactive drawing recognition
US20140192210A1 (en) * 2013-01-04 2014-07-10 Qualcomm Incorporated Mobile device based text detection and tracking
US9292537B1 (en) 2013-02-23 2016-03-22 Bryant Christopher Lee Autocompletion of filename based on text in a file to be saved
DE102015102369A1 (en) * 2015-02-19 2016-08-25 Bundesdruckerei Gmbh Mobile device for detecting a text area on an identification document
KR102585645B1 (en) * 2018-02-20 2023-10-10 삼성전자주식회사 Electronic device and method for recognizing character
US10755090B2 (en) 2018-03-16 2020-08-25 Open Text Corporation On-device partial recognition systems and methods
KR102457337B1 (en) * 2020-01-15 2022-10-20 김태호 Custom furniture brokerage server

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7158654B2 (en) * 1993-11-18 2007-01-02 Digimarc Corporation Image processor and image processing method
GB9621295D0 (en) * 1995-12-07 1996-11-27 Cambridge Antibody Tech Specific binding members,materials and methods
US6366698B1 (en) * 1997-03-11 2002-04-02 Casio Computer Co., Ltd. Portable terminal device for transmitting image data via network and image processing device for performing an image processing based on recognition result of received image data
US6618117B2 (en) * 1997-07-12 2003-09-09 Silverbrook Research Pty Ltd Image sensing apparatus including a microcontroller
WO1999017259A1 (en) * 1997-09-29 1999-04-08 Intergraph Corporation Automatic frame accumulator
DE19812082A1 (en) * 1998-03-19 1999-09-23 Siemens Ag Digital camera with transmission module
US7129860B2 (en) * 1999-01-29 2006-10-31 Quickshift, Inc. System and method for performing scalable embedded parallel data decompression

Similar Documents

Publication Publication Date Title
AU2001272869B8 (en) Method and means for mobile capture, processing, storage and transmission of text and mixed information containing characters and images
AU2001272869A1 (en) Method and means for mobile capture, processing, storage and transmission of text and mixed information containing characters and images
JP3323535B2 (en) Image storage device and control method of image storage device
US7382939B2 (en) Information processing apparatus, method, storage medium and program
EP2040451B1 (en) Information processing apparatus and information processing method
JPH03204274A (en) Color picture transmission method
RU2287183C2 (en) Method and device for mobile capture, processing, storage and transfer of text and mixed information, containing symbols and images
JP4143245B2 (en) Image processing method and apparatus, and storage medium
JPH11110412A (en) System for processing and displaying information concerning image captured by camera
CN112259074A (en) Method and system for obtaining voice playing based on high-speed shooting instrument
CN100511267C (en) Graph and text image processing equipment and image processing method thereof
JP2002024799A (en) Device, method and recording medium for image processing
KR100708389B1 (en) The device which the compression and memorial to a PDF file of the security and method thereof
JP2002044318A (en) Scanner and method for controlling the scanner and storage medium
JP3524208B2 (en) Composite image processing apparatus and image processing method
JPH0646271A (en) Facsimile equipment
JP2730073B2 (en) Title list creation device
JP2899263B2 (en) Computer control method
TW399387B (en) Method for enhancing usability of fax on small devices
Arora Digitisation: Methods, Tools and Technology
JP2000306076A (en) Image processor, control method and storage medium
Gazerro Document Image Processing For Office Applications
JP2003067406A (en) Image-searching device and method for controlling the device
JPS6391781A (en) Image processing system
JPH0822535A (en) Picture electronic filing device