WO2014154457A1 - Systems and methods for context based scanning - Google Patents

Systems and methods for context based scanning Download PDF

Info

Publication number
WO2014154457A1
WO2014154457A1 PCT/EP2014/054267 EP2014054267W WO2014154457A1 WO 2014154457 A1 WO2014154457 A1 WO 2014154457A1 EP 2014054267 W EP2014054267 W EP 2014054267W WO 2014154457 A1 WO2014154457 A1 WO 2014154457A1
Authority
WO
WIPO (PCT)
Prior art keywords
computing system
text
context based
scanning
based scanning
Prior art date
Application number
PCT/EP2014/054267
Other languages
French (fr)
Inventor
Basavaraj Varji Siddappa
Original Assignee
Alcatel Lucent
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alcatel Lucent filed Critical Alcatel Lucent
Publication of WO2014154457A1 publication Critical patent/WO2014154457A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N1/00Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
    • H04N1/00127Connection or combination of a still picture apparatus with another apparatus, e.g. for storage, processing or transmission of still picture signals or of information associated with a still picture
    • H04N1/00326Connection or combination of a still picture apparatus with another apparatus, e.g. for storage, processing or transmission of still picture signals or of information associated with a still picture with a data reading, recognizing or recording apparatus, e.g. with a bar-code apparatus
    • H04N1/00328Connection or combination of a still picture apparatus with another apparatus, e.g. for storage, processing or transmission of still picture signals or of information associated with a still picture with a data reading, recognizing or recording apparatus, e.g. with a bar-code apparatus with an apparatus processing optically-read information
    • H04N1/00331Connection or combination of a still picture apparatus with another apparatus, e.g. for storage, processing or transmission of still picture signals or of information associated with a still picture with a data reading, recognizing or recording apparatus, e.g. with a bar-code apparatus with an apparatus processing optically-read information with an apparatus performing optical character recognition
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N2201/00Indexing scheme relating to scanning, transmission or reproduction of documents or the like, and to details thereof
    • H04N2201/0008Connection or combination of a still picture apparatus with another apparatus
    • H04N2201/0065Converting image data to a format usable by the connected apparatus or vice versa
    • H04N2201/0068Converting from still picture data
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N2201/00Indexing scheme relating to scanning, transmission or reproduction of documents or the like, and to details thereof
    • H04N2201/0077Types of the still picture apparatus
    • H04N2201/0081Image reader
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N2201/00Indexing scheme relating to scanning, transmission or reproduction of documents or the like, and to details thereof
    • H04N2201/32Circuits or arrangements for control or supervision between transmitter and receiver or between image input and image output device, e.g. between a still-image camera and its memory or between a still-image camera and a printer device
    • H04N2201/3201Display, printing, storage or transmission of additional information, e.g. ID code, date and time or title
    • H04N2201/3261Display, printing, storage or transmission of additional information, e.g. ID code, date and time or title of multimedia information, e.g. a sound signal
    • H04N2201/3266Display, printing, storage or transmission of additional information, e.g. ID code, date and time or title of multimedia information, e.g. a sound signal of text or character information, e.g. text accompanying an image
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N2201/00Indexing scheme relating to scanning, transmission or reproduction of documents or the like, and to details thereof
    • H04N2201/32Circuits or arrangements for control or supervision between transmitter and receiver or between image input and image output device, e.g. between a still-image camera and its memory or between a still-image camera and a printer device
    • H04N2201/3201Display, printing, storage or transmission of additional information, e.g. ID code, date and time or title
    • H04N2201/3274Storage or retrieval of prestored additional information
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N2201/00Indexing scheme relating to scanning, transmission or reproduction of documents or the like, and to details thereof
    • H04N2201/32Circuits or arrangements for control or supervision between transmitter and receiver or between image input and image output device, e.g. between a still-image camera and its memory or between a still-image camera and a printer device
    • H04N2201/3201Display, printing, storage or transmission of additional information, e.g. ID code, date and time or title
    • H04N2201/3278Transmission
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N2201/00Indexing scheme relating to scanning, transmission or reproduction of documents or the like, and to details thereof
    • H04N2201/32Circuits or arrangements for control or supervision between transmitter and receiver or between image input and image output device, e.g. between a still-image camera and its memory or between a still-image camera and a printer device
    • H04N2201/3201Display, printing, storage or transmission of additional information, e.g. ID code, date and time or title
    • H04N2201/328Processing of the additional information

Definitions

  • the present subject matter relates to scanning and, particularly, but not exclusively, to methods and systems for context based scanning.
  • Computing devices such as cellular phones, smart phones, personal digital assistants (PDAs), tablets, and laptops provide users with a variety of applications, services, and networking capabilities.
  • PDAs personal digital assistants
  • Such computing devices have seemingly become a ubiquitous part of today's lifestyle and digital technology has found its way into different aspects of human life, professional as well as personal.
  • Most computing systems include various accessories for providing additional functionalities, such as a scanner unit to capture an image.
  • the computing systems are often used for creating and transmitting content in various forms like electronic mails (e-mails), documents and messages.
  • the commercially available computing devices usually include various mechanisms, such as keypads, on-screen keyboards, voice to text conversion modules and handwriting recognition modules to facilitate the user to enter data for generating the content.
  • a method for context based scanning includes initiating context based scanning from an application running on a computing system by pressing at least one of a dedicated scan button and a pre-configured combination of keys of the computing system and scanning a document on initiation of the context based scanning. The method further comprises analyzing the scanned document, using optical recognition techniques to generate a digital document containing text and pasting the text onto the application.
  • a computing system for context based scanning is described. The computing system includes a processor. The computing system further includes a processor, and an input unit, coupled to the processor, to initiate the context based scanning from an application running on the computing system.
  • the computing system further includes a scanning module, coupled to the processor, to activate a scanner unit, communicatively coupled to the computing system, on initiation of the context based scanning and scans a document using the scanner unit.
  • the computing system further includes a character recognition module, coupled to the processor, to analyze the scanned document using optical character recognition technique and generate a digital document containing text based on the analysis.
  • the computing system also includes a format normalization module to paste the text onto the application.
  • a computer readable medium having embodied thereon computer readable instructions for context based scanning is described.
  • the computer readable instructions when executed, cause a computing system to receive a user input to initiate context based scanning from an application running on a computing system. Thereafter, the executed computer readable instructions cause the computing system to scan a document on initiation of the context based scanning and analyze the scanned document, using optical recognition techniques, to generate a digital document containing text. The computing system then facilitates a user to paste the text onto the application.
  • Figure 1 schematically illustrates the components of a computing system for context based scanning, according to an example of the present subject matter.
  • Figure 2 illustrates a method for context based scanning, according to an example of the present subject matter.
  • the present subject matter relates to systems and methods for context based scanning.
  • the methods and systems as described herein may be implemented as any computing system capable of capturing an image.
  • Example of such computing systems may include mobile phones, smart phones, tablets, laptops, personal computers and a desktop.
  • a user uses a computing system for generating content by inputting data or text using a keypad or an on-screen keyboard. This becomes time consuming and cumbersome if the volume of text is high.
  • Certain computing systems facilitate the input of data using voice. However, such systems do not produce text which is true conversion of the voice due to various reasons, such as variations in the accent and pronunciation amongst the users. Similarly, variance in handwriting of the users causes handwriting recognition modules, a popular feature in many commercially available computing devices, to generate erroneous results.
  • the text which the user wants to input may be available as a hardcopy, either in handwritten or printed form.
  • the user may wish to send his office address to the intended recipient.
  • the user's office address may already be available as a printed document in form of the user's business cards.
  • the user may have to quit his e-mail application, launch another application to scan his business card, copy the scanned image to his computing system's clipboard, launch the e-mail application again and paste the scanned image from his computing system's clipboard. This process is usually cumbersome, time consuming, and causes inconvenience to the user.
  • a computing system for context based scanning includes a button for initiating context based scanning.
  • the button may be implemented as a key on a keypad or on an on-screen keyboard of the computing system.
  • the button may be implemented as a special function button placed at a convenient location of the computing system, such as on any side of the computing system.
  • the user may press a pre-configured combination of keys or buttons to initiate context based scanning.
  • the user may initiate context based scanning while working within any application, such as a word processor, an e-mail client, and a messaging client.
  • the user may initiate context based scanning for writing onto an application some text that may be already available in the form of a hardcopy.
  • the initiation of context based scanning may activate a scanning module of the computing system.
  • the scanning module may activate a scanner unit, such as a camera, integrated within the computing system or communicatively coupled to the computing system using any port such as a universal serial bus (USB) port, a serial port, and a parallel port.
  • USB universal serial bus
  • a character recognition module of the computing system may convert the scanned document into a digital document containing text, using optical character recognition techniques.
  • the computing system may further normalize any formatting in the text, by removing formatting symbols, such as line breaks, section breaks, column breaks, and page breaks.
  • the computing system may format the scanned text based on the user input field from where the scanning is initiated. For example, in case the scanning is initiated from an address bar of a browser, the computing system may format the text in form of a uniform resource locator (URL). In another example, if the scanning is initiated from an Internet Protocol (IP) address field, the computing system may format the text in form of an IP address.
  • IP Internet Protocol
  • the computing system may operate in a language conversion mode.
  • a translation module of the computing system may convert the captured text to the default language of the application from which context based scanning has been launched or to the default language of the computing system.
  • the computing system may operate in a data correction mode.
  • a data correction module of the computing system may highlight errors, such as spelling errors and grammatical errors, in the captured text. Based on user configuration, the data correction module may prompt the user to correct the highlighted errors or may automatically correct errors in the captured text.
  • the user may then paste the captured text onto the application from which context based scanning was launched.
  • the user may further save the captured text in various file formats for future use.
  • the computing system for context based scanning as described above facilitates using of content, already available as hard copies, for inputting data in an application of the computing system, without having to exit or quit the application.
  • the computing system for context based scanning thus makes it convenient for the user to use captured text from various sources for generating content.
  • Figure 1 schematically illustrates the components of a computing system
  • the computing system 100 includes a processor 102, and a memory 104 connected to the processor 102.
  • the processor 102 may include microprocessors, microcomputers, microcontrollers, digital signal processors, central processing units, state machines, logic circuitries and/or any other devices that manipulate signals and data based on computer-readable instructions.
  • the processor 102 may fetch and execute computer-readable instructions stored in the memory 104.
  • the memory 104 communicatively coupled to the processor 102, can include any non-transitory computer-readable medium known in the art including, for example, volatile memory, such as static random access memory (SRAM) and dynamic random access memory (DRAM), and/or non-volatile memory, such as read only memory (ROM), erasable programmable ROM, flash memories, hard disks, optical disks, and magnetic tapes.
  • volatile memory such as static random access memory (SRAM) and dynamic random access memory (DRAM)
  • non-volatile memory such as read only memory (ROM), erasable programmable ROM, flash memories, hard disks, optical disks, and magnetic tapes.
  • the computing system 100 includes interfaces 106.
  • the interfaces 106 are described above.
  • I/O devices such as data input output devices, referred to as I/O devices, storage devices, network device.
  • the I/O device(s) may include Universal Serial Bus (USB) ports, Ethernet ports, host bus adaptors, and their corresponding device drivers.
  • USB Universal Serial Bus
  • the computing system 100 may include modules 108.
  • the modules 108 may be coupled to the processor 102.
  • the modules 108 include routines, programs, objects, components, and data structures, which perform particular tasks or implement particular abstract data types.
  • the modules 108 may also be implemented as, signal processor(s), state machine(s), logic circuitries, and/or any other device or component that manipulate signals based on computer- readable instructions.
  • the modules 108 include a scanning module 110, a character recognition module 112, a format recognition module 114, a translation module 116, a data correction module 118 and other module(s) 120.
  • the other module(s) 120 may include computer-readable instructions that supplement applications or functions performed by the computing system 100.
  • the computing system 100 may also include data 122.
  • the data 122 includes scan data 124 and other data 126.
  • the other data 126 may include data generated and saved by the modules 108 for providing various functionalities of the computing system 100.
  • the computing system 100 may include an input unit 128 which may be inform of a keypad comprising a plurality of keys or an on-screen keyboard comprising a plurality of keys.
  • the input unit 128 includes a dedicated scan button 130 to initiate context based scanning.
  • a pre- configured combination of keys may be used to initiate context based scanning.
  • the computing system 100 may include an integrated scanner unit 132, such as a camera, for scanning a document.
  • the scanner unit 132 may be communicatively coupled to the computing system 100 using various available ports of the computing system 100.
  • a user may press the pre-configured key combination or the dedicated scan button 130 to initiate context based scanning while working on any application running on the computing system 100, such as an e-mail client, a word processor and a messaging client.
  • the scanning module 110 may activate the integrated scanner unit 132 to capture a snapshot of or scan a hardcopy, whose contents the user wants to use in the application.
  • the scanning module 110 may set various scanner unit parameters, such as focal length, aperture depth, exposure time, film speed and usage of flash to optimally scan the hardcopy.
  • the scanning module 110 may prompt the user to set values for the various scanner unit parameters for scanning the hardcopy.
  • the scanning module 110 may further create and save the scanned document for future use.
  • the scanning module 110 may save the scanned document, as the scan data 124, in various file formats, such as portable document format (pdf).
  • pdf portable document format
  • the character recognition module 112 may use techniques, such as optical character recognition, to convert the scanned document into a digital document containing text.
  • the character recognition module 112 may prompt the user to mark the areas of the scanned document which contains text and graphics. The character recognition module 112 may then only process the areas which contain text so as to reduce processing load on the computing system 100.
  • the format normalization module 114 may then analyze the digital document to remove various formatting symbols, such as line breaks, section breaks, column breaks and page breaks, to generate unformatted text. For example, the user may have scanned a news paper which usually comprises of multiple columns. The format normalization module 114 may analyze the digital document to demarcate whether a gap between two words in a row is caused due to a space symbol between the words or a column break between the two words. In another implementation, the format normalization module 114 may facilitate the user to either retain the original formatting of the contents of the digital document, as is on the hardcopy, or format the contents of the digital document in accordance with the formatting styles of the application, from which the context based scanning was initiated.
  • various formatting symbols such as line breaks, section breaks, column breaks and page breaks
  • the format normalization module 114 may format the scanned text based on a user input field of the application, from where the scanning is initiated. For example, in case the scanning is initiated from an address bar of a browser, the computing system may format the text in form of a uniform resource locator (URL). Based on the selection of the user, the format normalization module 114 may format the document accordingly. Thus, the format normalization module 114 facilitates the user to capture text from documents having various formatting styles. The processed text may then be placed in a clipboard of the computing system 100. The user may then paste the contents of the clipboard onto the application from where the context based scanning was initiated.
  • URL uniform resource locator
  • the computing system 100 may be operated in a language conversion mode.
  • the translation module 116 may translate the captured text to a default language of the application from where the context based scanning was launched or to a default language of the computing system 100 or to any language specified by the user.
  • the user may initiate context based scanning while using an e-mail client whose default language is English. The user may then scan a newspaper in a different language, for example, a French newspaper.
  • the translation module 116 may convert the captured text in French to English.
  • the translation module 116 may retrieve data from databases located on the internet or from in-built databases of the computing system 100 to facilitate the translation of the captured text from one language to another.
  • the computing system 100 may be operated in a data correction mode.
  • the data correction module 118 may analyze the captured text to detect errors, such as spelling mistakes and grammatical errors.
  • the data correction module 118 may retrieve data from dictionaries available on the internet or in-built databases of the computing system 100 to facilitate error detection and error correction.
  • the data correction module 118 may automatically correct any detected errors in the captured text.
  • the data correction module 118 may highlight the detected errors and prompt the user to correct or ignore the same.
  • the computing system 100 for context based scanning facilitates using of content, already available in hardcopies, for inputting data in an application of the computing system, without having to exit or quit the application.
  • the computing system for context based scanning thus makes it convenient for the user to use captured text from various sources for generating content by providing the user with additional features such as formatting, translation, and error correction.
  • Figure 2 illustrates a method 200 for context based scanning, according to an example of the present subject matter.
  • the order in which the method 200 is described is not intended to be construed as a limitation, and any number of the described method blocks can be combined in any order to implement method 200, or an alternative method. Additionally, individual blocks may be deleted from the method 200 without departing from the spirit and scope of the subject matter described herein.
  • the method 200 may be implemented in any suitable hardware, machine readable instructions, firmware, or combination thereof.
  • the steps of the method 200 can be performed by programmed computers.
  • program storage devices for example, digital data storage media, which are machine or computer readable and encode machine-executable or computer-executable programs of instructions, where said instructions perform some or all of the steps of the described method 200.
  • the program storage devices may be, for example, digital memories, magnetic storage media such as a magnetic disks and magnetic tapes, hard drives, or optically readable digital data storage media.
  • a user input is received to scan a document.
  • the user may initiate context based scanning, while working on any application of the computing system 100, by pressing the pre-configured key combination or the dedicated scan button 130.
  • the scanning module 110 activates the integrated scanner unit 132 of the computing system 100.
  • the scanning module 110 may automatically set various scanner unit parameters associated with the integrated scanner unit 132, whereas in another implementation, the scanning module 110 may prompt the user to set the values for the various scanner unit parameters.
  • the contents of the scanned document are analyzed using optical character recognition techniques.
  • the contents of the scanned document are then extracted and a digital document containing text is generated.
  • the character recognition module 112 may implement optical character recognition techniques to extract contents from the scanned document and generate the digital document containing text.
  • the formatting of the digital document is normalized.
  • the format normalization module 114 analyzes the digital document to remove various formatting symbols to generate unformatted text.
  • the format normalization module 114 may also facilitate the user either retain the original formatting of the contents or format the contents of the digital document in accordance with the formatting styles of the application, from which the context based scanning was initiated. Based on the selection of the user, the format normalization module 114 may format the text of the digital document accordingly.
  • the language conversion mode is activated or on. If at block 210, it is determined that the language conversion mode is activated, then as depicted in block 212, the text of the digital document is converted into a language specified by the user. In one implementation, the translation module 116 may translate the text of the digital document to a default language of the application from where the context based scanning was launched or to a default language of the computing system 100 or to any language specified by the user. If at block 210, it is determined that the language conversion mode is deactivated or off then no changes are made to the digital document.
  • the data correction module 118 may analyze the captured text to detect errors, such as spelling mistakes and grammatical mistakes. The data correction module 118 may then automatically correct any detected errors in the captured text or highlight the detected errors and prompt the user to correct or ignore the same. If at block 214, it is determined that the language conversion mode is deactivated or off, then no changes are made to the digital document.
  • the contents of the digital document are then pasted onto an application selected by the user.
  • the format normalization module 114 may then place the text in a clipboard of the computing system 100. The user may then paste the contents of the clipboard onto the application from where the context based scanning was initiated.
  • the method 200 for context based scanning facilitates using of content, already available as hardcopies, for inputting data in an application of the computing system, without having to exit or quit the application.
  • the method 200 makes it convenient for the user to use captured text from various sources for generating content by providing the user with additional features such as formatting, translation and error correction.

Abstract

Systems and methods for context based scanning are described. In one implementation, the method for context based scanning comprises initiating context based scanning from an application running on a computing system (100) by pressing at least one of a dedicated scan button (130) and a pre-configured combination of keys of the computing system (100) and scanning a document on initiation of the context based scanning. The method further comprises analyzing the scanned document, using optical recognition techniques, to generate a digital document containing text and pasting the text onto the application.

Description

SYSTEMS AND METHODS FOR CONTEXT BASED SCANNING
FIELD OF INVENTION
[0001] The present subject matter relates to scanning and, particularly, but not exclusively, to methods and systems for context based scanning. BACKGROUND
[0002] Computing devices, such as cellular phones, smart phones, personal digital assistants (PDAs), tablets, and laptops provide users with a variety of applications, services, and networking capabilities. Such computing devices have seemingly become a ubiquitous part of today's lifestyle and digital technology has found its way into different aspects of human life, professional as well as personal. Most computing systems include various accessories for providing additional functionalities, such as a scanner unit to capture an image.
[0003] The computing systems are often used for creating and transmitting content in various forms like electronic mails (e-mails), documents and messages. The commercially available computing devices usually include various mechanisms, such as keypads, on-screen keyboards, voice to text conversion modules and handwriting recognition modules to facilitate the user to enter data for generating the content.
SUMMARY
[0004] This summary is provided to introduce concepts related to context based scanning. This summary is not intended to identify essential features of the claimed subject matter nor is it intended for use in determining or limiting the scope of the claimed subject matter.
[0005] According to an embodiment, a method for context based scanning is described. The method includes initiating context based scanning from an application running on a computing system by pressing at least one of a dedicated scan button and a pre-configured combination of keys of the computing system and scanning a document on initiation of the context based scanning. The method further comprises analyzing the scanned document, using optical recognition techniques to generate a digital document containing text and pasting the text onto the application. [0006] In another embodiment, a computing system for context based scanning is described. The computing system includes a processor. The computing system further includes a processor, and an input unit, coupled to the processor, to initiate the context based scanning from an application running on the computing system. The computing system further includes a scanning module, coupled to the processor, to activate a scanner unit, communicatively coupled to the computing system, on initiation of the context based scanning and scans a document using the scanner unit. The computing system further includes a character recognition module, coupled to the processor, to analyze the scanned document using optical character recognition technique and generate a digital document containing text based on the analysis. The computing system also includes a format normalization module to paste the text onto the application.
[0007] In one embodiment a computer readable medium having embodied thereon computer readable instructions for context based scanning is described. The computer readable instructions when executed, cause a computing system to receive a user input to initiate context based scanning from an application running on a computing system. Thereafter, the executed computer readable instructions cause the computing system to scan a document on initiation of the context based scanning and analyze the scanned document, using optical recognition techniques, to generate a digital document containing text. The computing system then facilitates a user to paste the text onto the application. BRIEF DESCRIPTION OF DRAWINGS
[0008] The detailed description is described with reference to the accompanying figures. In the figures, the left-most digit(s) of a reference number identifies the figure in which the reference number first appears. The same numbers are used throughout the figures to reference like features and components:
[0009] Figure 1 schematically illustrates the components of a computing system for context based scanning, according to an example of the present subject matter.
[0010] Figure 2 illustrates a method for context based scanning, according to an example of the present subject matter.
DETAILED DESCRIPTION [0011] The present subject matter relates to systems and methods for context based scanning. The methods and systems as described herein may be implemented as any computing system capable of capturing an image. Example of such computing systems may include mobile phones, smart phones, tablets, laptops, personal computers and a desktop.
[0012] Generally, a user uses a computing system for generating content by inputting data or text using a keypad or an on-screen keyboard. This becomes time consuming and cumbersome if the volume of text is high. Certain computing systems facilitate the input of data using voice. However, such systems do not produce text which is true conversion of the voice due to various reasons, such as variations in the accent and pronunciation amongst the users. Similarly, variance in handwriting of the users causes handwriting recognition modules, a popular feature in many commercially available computing devices, to generate erroneous results.
[0013] In certain scenarios, the text which the user wants to input may be available as a hardcopy, either in handwritten or printed form. For example, while drafting an e-mail the user may wish to send his office address to the intended recipient. In one example, the user's office address may already be available as a printed document in form of the user's business cards. In such a scenario, the user may have to quit his e-mail application, launch another application to scan his business card, copy the scanned image to his computing system's clipboard, launch the e-mail application again and paste the scanned image from his computing system's clipboard. This process is usually cumbersome, time consuming, and causes inconvenience to the user.
[0014] The systems and methods, described herein, implement context based scanning. In one implementation, a computing system for context based scanning includes a button for initiating context based scanning. In one example, the button may be implemented as a key on a keypad or on an on-screen keyboard of the computing system. In another example, the button may be implemented as a special function button placed at a convenient location of the computing system, such as on any side of the computing system. In yet another example, the user may press a pre-configured combination of keys or buttons to initiate context based scanning.
[0015] In operation, the user may initiate context based scanning while working within any application, such as a word processor, an e-mail client, and a messaging client. For example, the user may initiate context based scanning for writing onto an application some text that may be already available in the form of a hardcopy. The initiation of context based scanning may activate a scanning module of the computing system. The scanning module may activate a scanner unit, such as a camera, integrated within the computing system or communicatively coupled to the computing system using any port such as a universal serial bus (USB) port, a serial port, and a parallel port. Using the scanner unit, the user may scan the document, whose contents he intends to capture.
[0016] On successful scanning, a character recognition module of the computing system may convert the scanned document into a digital document containing text, using optical character recognition techniques. The computing system may further normalize any formatting in the text, by removing formatting symbols, such as line breaks, section breaks, column breaks, and page breaks. In one implementation, the computing system may format the scanned text based on the user input field from where the scanning is initiated. For example, in case the scanning is initiated from an address bar of a browser, the computing system may format the text in form of a uniform resource locator (URL). In another example, if the scanning is initiated from an Internet Protocol (IP) address field, the computing system may format the text in form of an IP address.
[0017] In one example, the computing system may operate in a language conversion mode. In the language conversion mode, a translation module of the computing system may convert the captured text to the default language of the application from which context based scanning has been launched or to the default language of the computing system.
[0018] In another example, the computing system may operate in a data correction mode. In the data correction mode, a data correction module of the computing system may highlight errors, such as spelling errors and grammatical errors, in the captured text. Based on user configuration, the data correction module may prompt the user to correct the highlighted errors or may automatically correct errors in the captured text.
[0019] The user may then paste the captured text onto the application from which context based scanning was launched. The user may further save the captured text in various file formats for future use.
[0020] Thus, the computing system for context based scanning as described above facilitates using of content, already available as hard copies, for inputting data in an application of the computing system, without having to exit or quit the application. The computing system for context based scanning thus makes it convenient for the user to use captured text from various sources for generating content.
[0021] The above systems and methods are further described in conjunction with the following figures. It should be noted that the description and figures merely illustrate the principles of the present subject matter. It will thus be appreciated that those skilled in the art will be able to devise various arrangements that, although not explicitly described or shown herein, embody the principles of the present subject matter and are included within its spirit and scope. Furthermore, all examples recited herein are principally intended expressly for pedagogical purposes to aid the reader in understanding the principles of the present subject matter and the concepts contributed by the inventor(s) to furthering the art, and are to be construed as being without limitation to such specifically recited examples and conditions. Moreover, all statements herein reciting principles, aspects, and examples of the present subject matter, as well as specific examples thereof, are intended to encompass equivalents thereof.
[0022] The manner in which the systems and methods for context based scanning are implemented shall be explained in details with respect to Figures 1 and 2. While aspects of described systems and methods for context based scanning can be implemented in any number of different computing systems, environments, and/or configurations, the examples and implementations are described in the context of the following system(s).
[0023] Figure 1 schematically illustrates the components of a computing system
100 for context based scanning, according to an example of the present subject matter. In one implementation, the computing system 100 includes a processor 102, and a memory 104 connected to the processor 102. The processor 102 may include microprocessors, microcomputers, microcontrollers, digital signal processors, central processing units, state machines, logic circuitries and/or any other devices that manipulate signals and data based on computer-readable instructions. Among other capabilities, the processor 102 may fetch and execute computer-readable instructions stored in the memory 104.
[0024] Functions of the various elements shown in the figures, including any functional blocks labeled as "processor(s)", may be provided through the use of dedicated hardware as well as hardware capable of executing computer-readable instructions. [0025] The memory 104, communicatively coupled to the processor 102, can include any non-transitory computer-readable medium known in the art including, for example, volatile memory, such as static random access memory (SRAM) and dynamic random access memory (DRAM), and/or non-volatile memory, such as read only memory (ROM), erasable programmable ROM, flash memories, hard disks, optical disks, and magnetic tapes.
[0026] Further the computing system 100 includes interfaces 106. The interfaces
106 may include a variety of commercially available interfaces, for example, interfaces for peripheral device(s), such as data input output devices, referred to as I/O devices, storage devices, network device. The I/O device(s) may include Universal Serial Bus (USB) ports, Ethernet ports, host bus adaptors, and their corresponding device drivers.
[0027] Further, the computing system 100 may include modules 108. The modules 108 may be coupled to the processor 102. The modules 108, amongst other things, include routines, programs, objects, components, and data structures, which perform particular tasks or implement particular abstract data types. The modules 108 may also be implemented as, signal processor(s), state machine(s), logic circuitries, and/or any other device or component that manipulate signals based on computer- readable instructions. [0028] In said implementation, the modules 108 include a scanning module 110, a character recognition module 112, a format recognition module 114, a translation module 116, a data correction module 118 and other module(s) 120. The other module(s) 120 may include computer-readable instructions that supplement applications or functions performed by the computing system 100.
[0029] Further, the computing system 100 may also include data 122. In one implementation, the data 122 includes scan data 124 and other data 126. The other data 126 may include data generated and saved by the modules 108 for providing various functionalities of the computing system 100.
[0030] In one example, the computing system 100 may include an input unit 128 which may be inform of a keypad comprising a plurality of keys or an on-screen keyboard comprising a plurality of keys. In said example, the input unit 128 includes a dedicated scan button 130 to initiate context based scanning. In other examples, a pre- configured combination of keys may be used to initiate context based scanning.
[0031] In one implementation, the computing system 100 may include an integrated scanner unit 132, such as a camera, for scanning a document. In other implementations, the scanner unit 132 may be communicatively coupled to the computing system 100 using various available ports of the computing system 100. [0032] In operation, a user may press the pre-configured key combination or the dedicated scan button 130 to initiate context based scanning while working on any application running on the computing system 100, such as an e-mail client, a word processor and a messaging client. On initiation of context based scanning, the scanning module 110 may activate the integrated scanner unit 132 to capture a snapshot of or scan a hardcopy, whose contents the user wants to use in the application. In one example, the scanning module 110 may set various scanner unit parameters, such as focal length, aperture depth, exposure time, film speed and usage of flash to optimally scan the hardcopy. In another example, the scanning module 110 may prompt the user to set values for the various scanner unit parameters for scanning the hardcopy. The scanning module 110 may further create and save the scanned document for future use. The scanning module 110 may save the scanned document, as the scan data 124, in various file formats, such as portable document format (pdf).
[0033] On successful scanning of the hardcopy and subsequent generation of the scanned document, the character recognition module 112 may use techniques, such as optical character recognition, to convert the scanned document into a digital document containing text. In one implementation, the character recognition module 112 may prompt the user to mark the areas of the scanned document which contains text and graphics. The character recognition module 112 may then only process the areas which contain text so as to reduce processing load on the computing system 100.
[0034] The format normalization module 114 may then analyze the digital document to remove various formatting symbols, such as line breaks, section breaks, column breaks and page breaks, to generate unformatted text. For example, the user may have scanned a news paper which usually comprises of multiple columns. The format normalization module 114 may analyze the digital document to demarcate whether a gap between two words in a row is caused due to a space symbol between the words or a column break between the two words. In another implementation, the format normalization module 114 may facilitate the user to either retain the original formatting of the contents of the digital document, as is on the hardcopy, or format the contents of the digital document in accordance with the formatting styles of the application, from which the context based scanning was initiated. In one implementation, the format normalization module 114 may format the scanned text based on a user input field of the application, from where the scanning is initiated. For example, in case the scanning is initiated from an address bar of a browser, the computing system may format the text in form of a uniform resource locator (URL). Based on the selection of the user, the format normalization module 114 may format the document accordingly. Thus, the format normalization module 114 facilitates the user to capture text from documents having various formatting styles. The processed text may then be placed in a clipboard of the computing system 100. The user may then paste the contents of the clipboard onto the application from where the context based scanning was initiated.
[0035] In one example, the computing system 100 may be operated in a language conversion mode. In the language conversion mode, the translation module 116 may translate the captured text to a default language of the application from where the context based scanning was launched or to a default language of the computing system 100 or to any language specified by the user. For example, the user may initiate context based scanning while using an e-mail client whose default language is English. The user may then scan a newspaper in a different language, for example, a French newspaper. In language conversion mode, the translation module 116 may convert the captured text in French to English. In one example, the translation module 116 may retrieve data from databases located on the internet or from in-built databases of the computing system 100 to facilitate the translation of the captured text from one language to another.
[0036] In another example, the computing system 100 may be operated in a data correction mode. In the data correction mode, the data correction module 118 may analyze the captured text to detect errors, such as spelling mistakes and grammatical errors. In one example, the data correction module 118 may retrieve data from dictionaries available on the internet or in-built databases of the computing system 100 to facilitate error detection and error correction. In one implementation, the data correction module 118 may automatically correct any detected errors in the captured text. In another implementation, the data correction module 118 may highlight the detected errors and prompt the user to correct or ignore the same.
[0037] Thus, the computing system 100 for context based scanning facilitates using of content, already available in hardcopies, for inputting data in an application of the computing system, without having to exit or quit the application. The computing system for context based scanning thus makes it convenient for the user to use captured text from various sources for generating content by providing the user with additional features such as formatting, translation, and error correction. [0038] Figure 2 illustrates a method 200 for context based scanning, according to an example of the present subject matter. The order in which the method 200 is described is not intended to be construed as a limitation, and any number of the described method blocks can be combined in any order to implement method 200, or an alternative method. Additionally, individual blocks may be deleted from the method 200 without departing from the spirit and scope of the subject matter described herein. Furthermore, the method 200 may be implemented in any suitable hardware, machine readable instructions, firmware, or combination thereof.
[0039] In one example, the steps of the method 200 can be performed by programmed computers. Herein, some examples are also intended to cover program storage devices, for example, digital data storage media, which are machine or computer readable and encode machine-executable or computer-executable programs of instructions, where said instructions perform some or all of the steps of the described method 200. The program storage devices may be, for example, digital memories, magnetic storage media such as a magnetic disks and magnetic tapes, hard drives, or optically readable digital data storage media.
[0040] With reference to method 200 as depicted in Figure 2, as depicted in block 202, a user input is received to scan a document. In one implementation, the user may initiate context based scanning, while working on any application of the computing system 100, by pressing the pre-configured key combination or the dedicated scan button 130.
[0041] As illustrated in block 204, a scanner unit 132 of the computing system
100 is activated to scan the document. In one example, the scanning module 110 activates the integrated scanner unit 132 of the computing system 100. In one implementation, the scanning module 110 may automatically set various scanner unit parameters associated with the integrated scanner unit 132, whereas in another implementation, the scanning module 110 may prompt the user to set the values for the various scanner unit parameters.
[0042] As shown at block 206, the contents of the scanned document are analyzed using optical character recognition techniques. The contents of the scanned document are then extracted and a digital document containing text is generated. In one implementation, the character recognition module 112 may implement optical character recognition techniques to extract contents from the scanned document and generate the digital document containing text. [0043] At block 208, the formatting of the digital document is normalized. In one implementation, the format normalization module 114 analyzes the digital document to remove various formatting symbols to generate unformatted text. In another example the format normalization module 114 may also facilitate the user either retain the original formatting of the contents or format the contents of the digital document in accordance with the formatting styles of the application, from which the context based scanning was initiated. Based on the selection of the user, the format normalization module 114 may format the text of the digital document accordingly.
[0044] As depicted in block 210, it is determined whether the language conversion mode is activated or on. If at block 210, it is determined that the language conversion mode is activated, then as depicted in block 212, the text of the digital document is converted into a language specified by the user. In one implementation, the translation module 116 may translate the text of the digital document to a default language of the application from where the context based scanning was launched or to a default language of the computing system 100 or to any language specified by the user. If at block 210, it is determined that the language conversion mode is deactivated or off then no changes are made to the digital document.
[0045] As illustrated in block 214, it is determined whether the data correction mode is activated or on. If at block 214, it is determined that the data correction mode is activated, as depicted in block 216, the text of the digital document is analyzed for error detection and error correction. In one implementation, the data correction module 118 may analyze the captured text to detect errors, such as spelling mistakes and grammatical mistakes. The data correction module 118 may then automatically correct any detected errors in the captured text or highlight the detected errors and prompt the user to correct or ignore the same. If at block 214, it is determined that the language conversion mode is deactivated or off, then no changes are made to the digital document.
[0046] As shown in block 218, the contents of the digital document are then pasted onto an application selected by the user. In one implementation, the format normalization module 114 may then place the text in a clipboard of the computing system 100. The user may then paste the contents of the clipboard onto the application from where the context based scanning was initiated.
[0047] Thus, the method 200 for context based scanning facilitates using of content, already available as hardcopies, for inputting data in an application of the computing system, without having to exit or quit the application. The method 200 makes it convenient for the user to use captured text from various sources for generating content by providing the user with additional features such as formatting, translation and error correction.
[0048] Although implementations for context based scanning have been described in language specific to structural features and/or methods, it is to be understood that the appended claims are not necessarily limited to the specific features or methods described. Rather, the specific features and methods are disclosed as examples of systems and methods for context based scanning over a network.

Claims

I/We claim:
1. A computing system (100) for performing context based scanning, the computing system (100) comprising:
a processor (102);
an input unit (128), coupled to the processor (102), to initiate the context based scanning from an application running on the computing system (100); a scanning module (110) coupled to the processor (102), to:
activate a scanner unit (132), communicatively coupled to the computing system (100), on initiation of the context based scanning; and scan a document using the scanner unit (132);
a character recognition module (112), coupled to the processor(102), to: analyze the scanned document using optical character recognition technique; and
generate, based on the analysis, a digital document containing text
; and
a format normalization module (114), coupled to the processor(102), to normalize the text in a format suitable for pasting the text onto the application.
2. The computing system (100) as claimed in claim 1, further comprising a dedicated scan button (130), to initiate context based scanning from the application running on the computing system (100).
3. The computing system (100) as claimed in claim 1, further comprising a translation module (116), coupled to the processor (102), to translate the text of the digital document to at least one of a default language of the application, a default language of the computing system (100), and a language specified by a user of the computing system (100).
4. The computing system (100) as claimed in claim 1, wherein the format normalization module (114) further removes formatting symbols from the text of the digital document to generate unformatted text.
The computing system (100) as claimed in claim 1, wherein the format normalization module (114) further processes the text of the digital document to format the text based on at least one of a formatting style of the application and a user input field from which the context based scanning is initiated.
The computing system (100) as claimed in claim 1, further comprising a data correction module (118), coupled to the processor (102), to analyze the text of the digital document for detection of errors present in the text.
The computing system (100) as claimed in claim 6, wherein the data correction module (118) further analyzes the text of the digital document to correct the detected errors.
A method to perform context based scanning, the method comprising:
initiating context based scanning from an application running on a computing system (100) by pressing at least one of a dedicated scan button (130) and a pre-configured combination of keys of the computing system (100);
scanning a document on initiation of the context based scanning;
analyzing the scanned document, using optical recognition techniques, to generate a digital document containing text;
normalizing the digital document; and
pasting the normalized text onto the application.
The method as claimed in claim 8, further comprising translating the text of the digital document to at least one of a default language of the application, a default language of the computing system (100) and a language specified by a user of the computing system (100).
The method as claimed in claim 9, further comprising removing formatting symbols from the text of the digital document to generate unformatted text.
The method as claimed in claim 8, further comprising processing the text of the digital document to format the text based on at least one formatting style of the application.
12. The method as claimed in claim 8, the method further comprising:
analyzing the text of the digital document for detection of errors present in the text; and
correcting the detected errors.
A non-transitory computer-readable medium having a set of computer readable instructions that, when executed, cause a computing system to:
receive a user input to initiate context based scanning from an application running on a computing system (100);
scan a document on initiation of the context based scanning; analyze the scanned document, using optical recognition techniques, to generate a digital document containing text; and
paste the text onto the application.
PCT/EP2014/054267 2013-03-29 2014-03-05 Systems and methods for context based scanning WO2014154457A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
IN964/DEL/2013 2013-03-29
IN964DE2013 2013-03-29

Publications (1)

Publication Number Publication Date
WO2014154457A1 true WO2014154457A1 (en) 2014-10-02

Family

ID=50272592

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/EP2014/054267 WO2014154457A1 (en) 2013-03-29 2014-03-05 Systems and methods for context based scanning

Country Status (1)

Country Link
WO (1) WO2014154457A1 (en)

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050157955A1 (en) * 2004-01-15 2005-07-21 Hitachi Global Storage Technologies Self-contained OCR system using hard disk drive
US20050205671A1 (en) * 2004-02-13 2005-09-22 Tito Gelsomini Cellular phone with scanning capability
WO2006011641A1 (en) * 2004-07-30 2006-02-02 Canon Kabushiki Kaisha Communication apparatus, information processing method, program, and storage medium
US20060085477A1 (en) * 2004-10-01 2006-04-20 Ricoh Company, Ltd. Techniques for retrieving documents using an image capture device
US20080237347A1 (en) * 2005-06-17 2008-10-02 Wolf Erich W PDA compatible text scanner
US20100188419A1 (en) * 2009-01-28 2010-07-29 Google Inc. Selective display of ocr'ed text and corresponding images from publications on a client device
US20120050819A1 (en) * 2010-08-30 2012-03-01 Jiang Hong Approach For Processing Scanned Document Data
WO2012075315A1 (en) * 2010-12-01 2012-06-07 Google Inc. Identifying matching canonical documents in response to a visual query

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050157955A1 (en) * 2004-01-15 2005-07-21 Hitachi Global Storage Technologies Self-contained OCR system using hard disk drive
US20050205671A1 (en) * 2004-02-13 2005-09-22 Tito Gelsomini Cellular phone with scanning capability
WO2006011641A1 (en) * 2004-07-30 2006-02-02 Canon Kabushiki Kaisha Communication apparatus, information processing method, program, and storage medium
US20060085477A1 (en) * 2004-10-01 2006-04-20 Ricoh Company, Ltd. Techniques for retrieving documents using an image capture device
US20080237347A1 (en) * 2005-06-17 2008-10-02 Wolf Erich W PDA compatible text scanner
US20100188419A1 (en) * 2009-01-28 2010-07-29 Google Inc. Selective display of ocr'ed text and corresponding images from publications on a client device
US20120050819A1 (en) * 2010-08-30 2012-03-01 Jiang Hong Approach For Processing Scanned Document Data
WO2012075315A1 (en) * 2010-12-01 2012-06-07 Google Inc. Identifying matching canonical documents in response to a visual query

Similar Documents

Publication Publication Date Title
CN107451128B (en) Multi-language code adaptation method, device and storage medium
US20170300128A1 (en) Multimodel Text Input by a Keyboard/Camera Text Input Module Replacing a Conventional Keyboard Text Input Module on a Mobile Device
CN107797754B (en) Method and device for text replication and medium product
CN108108342B (en) Structured text generation method, search method and device
WO2022057707A1 (en) Text recognition method, image recognition classification method, and document recognition processing method
US9081412B2 (en) System and method for using paper as an interface to computer applications
US20190370339A1 (en) System and method for real time translation
US9098759B2 (en) Image processing apparatus, method, and medium for character recognition
US20220108556A1 (en) Method of comparing documents, electronic device and readable storage medium
US11521365B2 (en) Image processing system, image processing apparatus, image processing method, and storage medium
US9977766B2 (en) Keyboard input corresponding to multiple languages
US10460192B2 (en) Method and system for optical character recognition (OCR) of multi-language content
US9733825B2 (en) East Asian character assist
CN111291572A (en) Character typesetting method and device and computer readable storage medium
CN112149680A (en) Wrong word detection and identification method and device, electronic equipment and storage medium
US11941903B2 (en) Image processing apparatus, image processing method, and non-transitory storage medium
US20210295033A1 (en) Information processing apparatus and non-transitory computer readable medium
CN107590136B (en) Translation device, translation system, and translation method
WO2014154457A1 (en) Systems and methods for context based scanning
JP2019057137A (en) Information processing apparatus and information processing program
US20150261746A1 (en) Translation system, information processing apparatus, information processing method, and computer program product
US11006015B2 (en) Information processing apparatus and non-transitory computer readable medium storing information processing program
JP7342518B2 (en) Image processing device and image processing program
Krejcar Smart implementation of text recognition (OCR) for smart mobile devices
US20230102476A1 (en) Information processing apparatus, non-transitory computer readable medium storing program, and information processing method

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 14709603

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 14709603

Country of ref document: EP

Kind code of ref document: A1