WO2014154457A1 - Systems and methods for context based scanning - Google Patents
Systems and methods for context based scanning Download PDFInfo
- Publication number
- WO2014154457A1 WO2014154457A1 PCT/EP2014/054267 EP2014054267W WO2014154457A1 WO 2014154457 A1 WO2014154457 A1 WO 2014154457A1 EP 2014054267 W EP2014054267 W EP 2014054267W WO 2014154457 A1 WO2014154457 A1 WO 2014154457A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- computing system
- text
- context based
- scanning
- based scanning
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 53
- 230000000977 initiatory effect Effects 0.000 claims abstract description 13
- 230000003287 optical effect Effects 0.000 claims abstract description 6
- 238000010606 normalization Methods 0.000 claims description 14
- 238000012015 optical character recognition Methods 0.000 claims description 6
- 238000001514 detection method Methods 0.000 claims description 4
- 238000006243 chemical reaction Methods 0.000 description 11
- 230000015654 memory Effects 0.000 description 8
- 230000006870 function Effects 0.000 description 3
- 238000013500 data storage Methods 0.000 description 2
- 230000001413 cellular effect Effects 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 230000006855 networking Effects 0.000 description 1
- 230000002093 peripheral effect Effects 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
- 239000013589 supplement Substances 0.000 description 1
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N1/00—Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
- H04N1/00127—Connection or combination of a still picture apparatus with another apparatus, e.g. for storage, processing or transmission of still picture signals or of information associated with a still picture
- H04N1/00326—Connection or combination of a still picture apparatus with another apparatus, e.g. for storage, processing or transmission of still picture signals or of information associated with a still picture with a data reading, recognizing or recording apparatus, e.g. with a bar-code apparatus
- H04N1/00328—Connection or combination of a still picture apparatus with another apparatus, e.g. for storage, processing or transmission of still picture signals or of information associated with a still picture with a data reading, recognizing or recording apparatus, e.g. with a bar-code apparatus with an apparatus processing optically-read information
- H04N1/00331—Connection or combination of a still picture apparatus with another apparatus, e.g. for storage, processing or transmission of still picture signals or of information associated with a still picture with a data reading, recognizing or recording apparatus, e.g. with a bar-code apparatus with an apparatus processing optically-read information with an apparatus performing optical character recognition
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N2201/00—Indexing scheme relating to scanning, transmission or reproduction of documents or the like, and to details thereof
- H04N2201/0008—Connection or combination of a still picture apparatus with another apparatus
- H04N2201/0065—Converting image data to a format usable by the connected apparatus or vice versa
- H04N2201/0068—Converting from still picture data
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N2201/00—Indexing scheme relating to scanning, transmission or reproduction of documents or the like, and to details thereof
- H04N2201/0077—Types of the still picture apparatus
- H04N2201/0081—Image reader
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N2201/00—Indexing scheme relating to scanning, transmission or reproduction of documents or the like, and to details thereof
- H04N2201/32—Circuits or arrangements for control or supervision between transmitter and receiver or between image input and image output device, e.g. between a still-image camera and its memory or between a still-image camera and a printer device
- H04N2201/3201—Display, printing, storage or transmission of additional information, e.g. ID code, date and time or title
- H04N2201/3261—Display, printing, storage or transmission of additional information, e.g. ID code, date and time or title of multimedia information, e.g. a sound signal
- H04N2201/3266—Display, printing, storage or transmission of additional information, e.g. ID code, date and time or title of multimedia information, e.g. a sound signal of text or character information, e.g. text accompanying an image
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N2201/00—Indexing scheme relating to scanning, transmission or reproduction of documents or the like, and to details thereof
- H04N2201/32—Circuits or arrangements for control or supervision between transmitter and receiver or between image input and image output device, e.g. between a still-image camera and its memory or between a still-image camera and a printer device
- H04N2201/3201—Display, printing, storage or transmission of additional information, e.g. ID code, date and time or title
- H04N2201/3274—Storage or retrieval of prestored additional information
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N2201/00—Indexing scheme relating to scanning, transmission or reproduction of documents or the like, and to details thereof
- H04N2201/32—Circuits or arrangements for control or supervision between transmitter and receiver or between image input and image output device, e.g. between a still-image camera and its memory or between a still-image camera and a printer device
- H04N2201/3201—Display, printing, storage or transmission of additional information, e.g. ID code, date and time or title
- H04N2201/3278—Transmission
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N2201/00—Indexing scheme relating to scanning, transmission or reproduction of documents or the like, and to details thereof
- H04N2201/32—Circuits or arrangements for control or supervision between transmitter and receiver or between image input and image output device, e.g. between a still-image camera and its memory or between a still-image camera and a printer device
- H04N2201/3201—Display, printing, storage or transmission of additional information, e.g. ID code, date and time or title
- H04N2201/328—Processing of the additional information
Definitions
- the present subject matter relates to scanning and, particularly, but not exclusively, to methods and systems for context based scanning.
- Computing devices such as cellular phones, smart phones, personal digital assistants (PDAs), tablets, and laptops provide users with a variety of applications, services, and networking capabilities.
- PDAs personal digital assistants
- Such computing devices have seemingly become a ubiquitous part of today's lifestyle and digital technology has found its way into different aspects of human life, professional as well as personal.
- Most computing systems include various accessories for providing additional functionalities, such as a scanner unit to capture an image.
- the computing systems are often used for creating and transmitting content in various forms like electronic mails (e-mails), documents and messages.
- the commercially available computing devices usually include various mechanisms, such as keypads, on-screen keyboards, voice to text conversion modules and handwriting recognition modules to facilitate the user to enter data for generating the content.
- a method for context based scanning includes initiating context based scanning from an application running on a computing system by pressing at least one of a dedicated scan button and a pre-configured combination of keys of the computing system and scanning a document on initiation of the context based scanning. The method further comprises analyzing the scanned document, using optical recognition techniques to generate a digital document containing text and pasting the text onto the application.
- a computing system for context based scanning is described. The computing system includes a processor. The computing system further includes a processor, and an input unit, coupled to the processor, to initiate the context based scanning from an application running on the computing system.
- the computing system further includes a scanning module, coupled to the processor, to activate a scanner unit, communicatively coupled to the computing system, on initiation of the context based scanning and scans a document using the scanner unit.
- the computing system further includes a character recognition module, coupled to the processor, to analyze the scanned document using optical character recognition technique and generate a digital document containing text based on the analysis.
- the computing system also includes a format normalization module to paste the text onto the application.
- a computer readable medium having embodied thereon computer readable instructions for context based scanning is described.
- the computer readable instructions when executed, cause a computing system to receive a user input to initiate context based scanning from an application running on a computing system. Thereafter, the executed computer readable instructions cause the computing system to scan a document on initiation of the context based scanning and analyze the scanned document, using optical recognition techniques, to generate a digital document containing text. The computing system then facilitates a user to paste the text onto the application.
- Figure 1 schematically illustrates the components of a computing system for context based scanning, according to an example of the present subject matter.
- Figure 2 illustrates a method for context based scanning, according to an example of the present subject matter.
- the present subject matter relates to systems and methods for context based scanning.
- the methods and systems as described herein may be implemented as any computing system capable of capturing an image.
- Example of such computing systems may include mobile phones, smart phones, tablets, laptops, personal computers and a desktop.
- a user uses a computing system for generating content by inputting data or text using a keypad or an on-screen keyboard. This becomes time consuming and cumbersome if the volume of text is high.
- Certain computing systems facilitate the input of data using voice. However, such systems do not produce text which is true conversion of the voice due to various reasons, such as variations in the accent and pronunciation amongst the users. Similarly, variance in handwriting of the users causes handwriting recognition modules, a popular feature in many commercially available computing devices, to generate erroneous results.
- the text which the user wants to input may be available as a hardcopy, either in handwritten or printed form.
- the user may wish to send his office address to the intended recipient.
- the user's office address may already be available as a printed document in form of the user's business cards.
- the user may have to quit his e-mail application, launch another application to scan his business card, copy the scanned image to his computing system's clipboard, launch the e-mail application again and paste the scanned image from his computing system's clipboard. This process is usually cumbersome, time consuming, and causes inconvenience to the user.
- a computing system for context based scanning includes a button for initiating context based scanning.
- the button may be implemented as a key on a keypad or on an on-screen keyboard of the computing system.
- the button may be implemented as a special function button placed at a convenient location of the computing system, such as on any side of the computing system.
- the user may press a pre-configured combination of keys or buttons to initiate context based scanning.
- the user may initiate context based scanning while working within any application, such as a word processor, an e-mail client, and a messaging client.
- the user may initiate context based scanning for writing onto an application some text that may be already available in the form of a hardcopy.
- the initiation of context based scanning may activate a scanning module of the computing system.
- the scanning module may activate a scanner unit, such as a camera, integrated within the computing system or communicatively coupled to the computing system using any port such as a universal serial bus (USB) port, a serial port, and a parallel port.
- USB universal serial bus
- a character recognition module of the computing system may convert the scanned document into a digital document containing text, using optical character recognition techniques.
- the computing system may further normalize any formatting in the text, by removing formatting symbols, such as line breaks, section breaks, column breaks, and page breaks.
- the computing system may format the scanned text based on the user input field from where the scanning is initiated. For example, in case the scanning is initiated from an address bar of a browser, the computing system may format the text in form of a uniform resource locator (URL). In another example, if the scanning is initiated from an Internet Protocol (IP) address field, the computing system may format the text in form of an IP address.
- IP Internet Protocol
- the computing system may operate in a language conversion mode.
- a translation module of the computing system may convert the captured text to the default language of the application from which context based scanning has been launched or to the default language of the computing system.
- the computing system may operate in a data correction mode.
- a data correction module of the computing system may highlight errors, such as spelling errors and grammatical errors, in the captured text. Based on user configuration, the data correction module may prompt the user to correct the highlighted errors or may automatically correct errors in the captured text.
- the user may then paste the captured text onto the application from which context based scanning was launched.
- the user may further save the captured text in various file formats for future use.
- the computing system for context based scanning as described above facilitates using of content, already available as hard copies, for inputting data in an application of the computing system, without having to exit or quit the application.
- the computing system for context based scanning thus makes it convenient for the user to use captured text from various sources for generating content.
- Figure 1 schematically illustrates the components of a computing system
- the computing system 100 includes a processor 102, and a memory 104 connected to the processor 102.
- the processor 102 may include microprocessors, microcomputers, microcontrollers, digital signal processors, central processing units, state machines, logic circuitries and/or any other devices that manipulate signals and data based on computer-readable instructions.
- the processor 102 may fetch and execute computer-readable instructions stored in the memory 104.
- the memory 104 communicatively coupled to the processor 102, can include any non-transitory computer-readable medium known in the art including, for example, volatile memory, such as static random access memory (SRAM) and dynamic random access memory (DRAM), and/or non-volatile memory, such as read only memory (ROM), erasable programmable ROM, flash memories, hard disks, optical disks, and magnetic tapes.
- volatile memory such as static random access memory (SRAM) and dynamic random access memory (DRAM)
- non-volatile memory such as read only memory (ROM), erasable programmable ROM, flash memories, hard disks, optical disks, and magnetic tapes.
- the computing system 100 includes interfaces 106.
- the interfaces 106 are described above.
- I/O devices such as data input output devices, referred to as I/O devices, storage devices, network device.
- the I/O device(s) may include Universal Serial Bus (USB) ports, Ethernet ports, host bus adaptors, and their corresponding device drivers.
- USB Universal Serial Bus
- the computing system 100 may include modules 108.
- the modules 108 may be coupled to the processor 102.
- the modules 108 include routines, programs, objects, components, and data structures, which perform particular tasks or implement particular abstract data types.
- the modules 108 may also be implemented as, signal processor(s), state machine(s), logic circuitries, and/or any other device or component that manipulate signals based on computer- readable instructions.
- the modules 108 include a scanning module 110, a character recognition module 112, a format recognition module 114, a translation module 116, a data correction module 118 and other module(s) 120.
- the other module(s) 120 may include computer-readable instructions that supplement applications or functions performed by the computing system 100.
- the computing system 100 may also include data 122.
- the data 122 includes scan data 124 and other data 126.
- the other data 126 may include data generated and saved by the modules 108 for providing various functionalities of the computing system 100.
- the computing system 100 may include an input unit 128 which may be inform of a keypad comprising a plurality of keys or an on-screen keyboard comprising a plurality of keys.
- the input unit 128 includes a dedicated scan button 130 to initiate context based scanning.
- a pre- configured combination of keys may be used to initiate context based scanning.
- the computing system 100 may include an integrated scanner unit 132, such as a camera, for scanning a document.
- the scanner unit 132 may be communicatively coupled to the computing system 100 using various available ports of the computing system 100.
- a user may press the pre-configured key combination or the dedicated scan button 130 to initiate context based scanning while working on any application running on the computing system 100, such as an e-mail client, a word processor and a messaging client.
- the scanning module 110 may activate the integrated scanner unit 132 to capture a snapshot of or scan a hardcopy, whose contents the user wants to use in the application.
- the scanning module 110 may set various scanner unit parameters, such as focal length, aperture depth, exposure time, film speed and usage of flash to optimally scan the hardcopy.
- the scanning module 110 may prompt the user to set values for the various scanner unit parameters for scanning the hardcopy.
- the scanning module 110 may further create and save the scanned document for future use.
- the scanning module 110 may save the scanned document, as the scan data 124, in various file formats, such as portable document format (pdf).
- pdf portable document format
- the character recognition module 112 may use techniques, such as optical character recognition, to convert the scanned document into a digital document containing text.
- the character recognition module 112 may prompt the user to mark the areas of the scanned document which contains text and graphics. The character recognition module 112 may then only process the areas which contain text so as to reduce processing load on the computing system 100.
- the format normalization module 114 may then analyze the digital document to remove various formatting symbols, such as line breaks, section breaks, column breaks and page breaks, to generate unformatted text. For example, the user may have scanned a news paper which usually comprises of multiple columns. The format normalization module 114 may analyze the digital document to demarcate whether a gap between two words in a row is caused due to a space symbol between the words or a column break between the two words. In another implementation, the format normalization module 114 may facilitate the user to either retain the original formatting of the contents of the digital document, as is on the hardcopy, or format the contents of the digital document in accordance with the formatting styles of the application, from which the context based scanning was initiated.
- various formatting symbols such as line breaks, section breaks, column breaks and page breaks
- the format normalization module 114 may format the scanned text based on a user input field of the application, from where the scanning is initiated. For example, in case the scanning is initiated from an address bar of a browser, the computing system may format the text in form of a uniform resource locator (URL). Based on the selection of the user, the format normalization module 114 may format the document accordingly. Thus, the format normalization module 114 facilitates the user to capture text from documents having various formatting styles. The processed text may then be placed in a clipboard of the computing system 100. The user may then paste the contents of the clipboard onto the application from where the context based scanning was initiated.
- URL uniform resource locator
- the computing system 100 may be operated in a language conversion mode.
- the translation module 116 may translate the captured text to a default language of the application from where the context based scanning was launched or to a default language of the computing system 100 or to any language specified by the user.
- the user may initiate context based scanning while using an e-mail client whose default language is English. The user may then scan a newspaper in a different language, for example, a French newspaper.
- the translation module 116 may convert the captured text in French to English.
- the translation module 116 may retrieve data from databases located on the internet or from in-built databases of the computing system 100 to facilitate the translation of the captured text from one language to another.
- the computing system 100 may be operated in a data correction mode.
- the data correction module 118 may analyze the captured text to detect errors, such as spelling mistakes and grammatical errors.
- the data correction module 118 may retrieve data from dictionaries available on the internet or in-built databases of the computing system 100 to facilitate error detection and error correction.
- the data correction module 118 may automatically correct any detected errors in the captured text.
- the data correction module 118 may highlight the detected errors and prompt the user to correct or ignore the same.
- the computing system 100 for context based scanning facilitates using of content, already available in hardcopies, for inputting data in an application of the computing system, without having to exit or quit the application.
- the computing system for context based scanning thus makes it convenient for the user to use captured text from various sources for generating content by providing the user with additional features such as formatting, translation, and error correction.
- Figure 2 illustrates a method 200 for context based scanning, according to an example of the present subject matter.
- the order in which the method 200 is described is not intended to be construed as a limitation, and any number of the described method blocks can be combined in any order to implement method 200, or an alternative method. Additionally, individual blocks may be deleted from the method 200 without departing from the spirit and scope of the subject matter described herein.
- the method 200 may be implemented in any suitable hardware, machine readable instructions, firmware, or combination thereof.
- the steps of the method 200 can be performed by programmed computers.
- program storage devices for example, digital data storage media, which are machine or computer readable and encode machine-executable or computer-executable programs of instructions, where said instructions perform some or all of the steps of the described method 200.
- the program storage devices may be, for example, digital memories, magnetic storage media such as a magnetic disks and magnetic tapes, hard drives, or optically readable digital data storage media.
- a user input is received to scan a document.
- the user may initiate context based scanning, while working on any application of the computing system 100, by pressing the pre-configured key combination or the dedicated scan button 130.
- the scanning module 110 activates the integrated scanner unit 132 of the computing system 100.
- the scanning module 110 may automatically set various scanner unit parameters associated with the integrated scanner unit 132, whereas in another implementation, the scanning module 110 may prompt the user to set the values for the various scanner unit parameters.
- the contents of the scanned document are analyzed using optical character recognition techniques.
- the contents of the scanned document are then extracted and a digital document containing text is generated.
- the character recognition module 112 may implement optical character recognition techniques to extract contents from the scanned document and generate the digital document containing text.
- the formatting of the digital document is normalized.
- the format normalization module 114 analyzes the digital document to remove various formatting symbols to generate unformatted text.
- the format normalization module 114 may also facilitate the user either retain the original formatting of the contents or format the contents of the digital document in accordance with the formatting styles of the application, from which the context based scanning was initiated. Based on the selection of the user, the format normalization module 114 may format the text of the digital document accordingly.
- the language conversion mode is activated or on. If at block 210, it is determined that the language conversion mode is activated, then as depicted in block 212, the text of the digital document is converted into a language specified by the user. In one implementation, the translation module 116 may translate the text of the digital document to a default language of the application from where the context based scanning was launched or to a default language of the computing system 100 or to any language specified by the user. If at block 210, it is determined that the language conversion mode is deactivated or off then no changes are made to the digital document.
- the data correction module 118 may analyze the captured text to detect errors, such as spelling mistakes and grammatical mistakes. The data correction module 118 may then automatically correct any detected errors in the captured text or highlight the detected errors and prompt the user to correct or ignore the same. If at block 214, it is determined that the language conversion mode is deactivated or off, then no changes are made to the digital document.
- the contents of the digital document are then pasted onto an application selected by the user.
- the format normalization module 114 may then place the text in a clipboard of the computing system 100. The user may then paste the contents of the clipboard onto the application from where the context based scanning was initiated.
- the method 200 for context based scanning facilitates using of content, already available as hardcopies, for inputting data in an application of the computing system, without having to exit or quit the application.
- the method 200 makes it convenient for the user to use captured text from various sources for generating content by providing the user with additional features such as formatting, translation and error correction.
Abstract
Systems and methods for context based scanning are described. In one implementation, the method for context based scanning comprises initiating context based scanning from an application running on a computing system (100) by pressing at least one of a dedicated scan button (130) and a pre-configured combination of keys of the computing system (100) and scanning a document on initiation of the context based scanning. The method further comprises analyzing the scanned document, using optical recognition techniques, to generate a digital document containing text and pasting the text onto the application.
Description
SYSTEMS AND METHODS FOR CONTEXT BASED SCANNING
FIELD OF INVENTION
[0001] The present subject matter relates to scanning and, particularly, but not exclusively, to methods and systems for context based scanning. BACKGROUND
[0002] Computing devices, such as cellular phones, smart phones, personal digital assistants (PDAs), tablets, and laptops provide users with a variety of applications, services, and networking capabilities. Such computing devices have seemingly become a ubiquitous part of today's lifestyle and digital technology has found its way into different aspects of human life, professional as well as personal. Most computing systems include various accessories for providing additional functionalities, such as a scanner unit to capture an image.
[0003] The computing systems are often used for creating and transmitting content in various forms like electronic mails (e-mails), documents and messages. The commercially available computing devices usually include various mechanisms, such as keypads, on-screen keyboards, voice to text conversion modules and handwriting recognition modules to facilitate the user to enter data for generating the content.
SUMMARY
[0004] This summary is provided to introduce concepts related to context based scanning. This summary is not intended to identify essential features of the claimed subject matter nor is it intended for use in determining or limiting the scope of the claimed subject matter.
[0005] According to an embodiment, a method for context based scanning is described. The method includes initiating context based scanning from an application running on a computing system by pressing at least one of a dedicated scan button and a pre-configured combination of keys of the computing system and scanning a document on initiation of the context based scanning. The method further comprises analyzing the scanned document, using optical recognition techniques to generate a digital document containing text and pasting the text onto the application. [0006] In another embodiment, a computing system for context based scanning is described. The computing system includes a processor. The computing system further
includes a processor, and an input unit, coupled to the processor, to initiate the context based scanning from an application running on the computing system. The computing system further includes a scanning module, coupled to the processor, to activate a scanner unit, communicatively coupled to the computing system, on initiation of the context based scanning and scans a document using the scanner unit. The computing system further includes a character recognition module, coupled to the processor, to analyze the scanned document using optical character recognition technique and generate a digital document containing text based on the analysis. The computing system also includes a format normalization module to paste the text onto the application.
[0007] In one embodiment a computer readable medium having embodied thereon computer readable instructions for context based scanning is described. The computer readable instructions when executed, cause a computing system to receive a user input to initiate context based scanning from an application running on a computing system. Thereafter, the executed computer readable instructions cause the computing system to scan a document on initiation of the context based scanning and analyze the scanned document, using optical recognition techniques, to generate a digital document containing text. The computing system then facilitates a user to paste the text onto the application. BRIEF DESCRIPTION OF DRAWINGS
[0008] The detailed description is described with reference to the accompanying figures. In the figures, the left-most digit(s) of a reference number identifies the figure in which the reference number first appears. The same numbers are used throughout the figures to reference like features and components:
[0009] Figure 1 schematically illustrates the components of a computing system for context based scanning, according to an example of the present subject matter.
[0010] Figure 2 illustrates a method for context based scanning, according to an example of the present subject matter.
DETAILED DESCRIPTION [0011] The present subject matter relates to systems and methods for context based scanning. The methods and systems as described herein may be implemented as any computing system capable of capturing an image. Example of such computing
systems may include mobile phones, smart phones, tablets, laptops, personal computers and a desktop.
[0012] Generally, a user uses a computing system for generating content by inputting data or text using a keypad or an on-screen keyboard. This becomes time consuming and cumbersome if the volume of text is high. Certain computing systems facilitate the input of data using voice. However, such systems do not produce text which is true conversion of the voice due to various reasons, such as variations in the accent and pronunciation amongst the users. Similarly, variance in handwriting of the users causes handwriting recognition modules, a popular feature in many commercially available computing devices, to generate erroneous results.
[0013] In certain scenarios, the text which the user wants to input may be available as a hardcopy, either in handwritten or printed form. For example, while drafting an e-mail the user may wish to send his office address to the intended recipient. In one example, the user's office address may already be available as a printed document in form of the user's business cards. In such a scenario, the user may have to quit his e-mail application, launch another application to scan his business card, copy the scanned image to his computing system's clipboard, launch the e-mail application again and paste the scanned image from his computing system's clipboard. This process is usually cumbersome, time consuming, and causes inconvenience to the user.
[0014] The systems and methods, described herein, implement context based scanning. In one implementation, a computing system for context based scanning includes a button for initiating context based scanning. In one example, the button may be implemented as a key on a keypad or on an on-screen keyboard of the computing system. In another example, the button may be implemented as a special function button placed at a convenient location of the computing system, such as on any side of the computing system. In yet another example, the user may press a pre-configured combination of keys or buttons to initiate context based scanning.
[0015] In operation, the user may initiate context based scanning while working within any application, such as a word processor, an e-mail client, and a messaging client. For example, the user may initiate context based scanning for writing onto an application some text that may be already available in the form of a hardcopy. The initiation of context based scanning may activate a scanning module of the computing system. The scanning module may activate a scanner unit, such as a camera, integrated within the computing system or communicatively coupled to the computing system
using any port such as a universal serial bus (USB) port, a serial port, and a parallel port. Using the scanner unit, the user may scan the document, whose contents he intends to capture.
[0016] On successful scanning, a character recognition module of the computing system may convert the scanned document into a digital document containing text, using optical character recognition techniques. The computing system may further normalize any formatting in the text, by removing formatting symbols, such as line breaks, section breaks, column breaks, and page breaks. In one implementation, the computing system may format the scanned text based on the user input field from where the scanning is initiated. For example, in case the scanning is initiated from an address bar of a browser, the computing system may format the text in form of a uniform resource locator (URL). In another example, if the scanning is initiated from an Internet Protocol (IP) address field, the computing system may format the text in form of an IP address.
[0017] In one example, the computing system may operate in a language conversion mode. In the language conversion mode, a translation module of the computing system may convert the captured text to the default language of the application from which context based scanning has been launched or to the default language of the computing system.
[0018] In another example, the computing system may operate in a data correction mode. In the data correction mode, a data correction module of the computing system may highlight errors, such as spelling errors and grammatical errors, in the captured text. Based on user configuration, the data correction module may prompt the user to correct the highlighted errors or may automatically correct errors in the captured text.
[0019] The user may then paste the captured text onto the application from which context based scanning was launched. The user may further save the captured text in various file formats for future use.
[0020] Thus, the computing system for context based scanning as described above facilitates using of content, already available as hard copies, for inputting data in an application of the computing system, without having to exit or quit the application. The computing system for context based scanning thus makes it convenient for the user to use captured text from various sources for generating content.
[0021] The above systems and methods are further described in conjunction with the following figures. It should be noted that the description and figures merely illustrate
the principles of the present subject matter. It will thus be appreciated that those skilled in the art will be able to devise various arrangements that, although not explicitly described or shown herein, embody the principles of the present subject matter and are included within its spirit and scope. Furthermore, all examples recited herein are principally intended expressly for pedagogical purposes to aid the reader in understanding the principles of the present subject matter and the concepts contributed by the inventor(s) to furthering the art, and are to be construed as being without limitation to such specifically recited examples and conditions. Moreover, all statements herein reciting principles, aspects, and examples of the present subject matter, as well as specific examples thereof, are intended to encompass equivalents thereof.
[0022] The manner in which the systems and methods for context based scanning are implemented shall be explained in details with respect to Figures 1 and 2. While aspects of described systems and methods for context based scanning can be implemented in any number of different computing systems, environments, and/or configurations, the examples and implementations are described in the context of the following system(s).
[0023] Figure 1 schematically illustrates the components of a computing system
100 for context based scanning, according to an example of the present subject matter. In one implementation, the computing system 100 includes a processor 102, and a memory 104 connected to the processor 102. The processor 102 may include microprocessors, microcomputers, microcontrollers, digital signal processors, central processing units, state machines, logic circuitries and/or any other devices that manipulate signals and data based on computer-readable instructions. Among other capabilities, the processor 102 may fetch and execute computer-readable instructions stored in the memory 104.
[0024] Functions of the various elements shown in the figures, including any functional blocks labeled as "processor(s)", may be provided through the use of dedicated hardware as well as hardware capable of executing computer-readable instructions. [0025] The memory 104, communicatively coupled to the processor 102, can include any non-transitory computer-readable medium known in the art including, for example, volatile memory, such as static random access memory (SRAM) and dynamic random access memory (DRAM), and/or non-volatile memory, such as read only
memory (ROM), erasable programmable ROM, flash memories, hard disks, optical disks, and magnetic tapes.
[0026] Further the computing system 100 includes interfaces 106. The interfaces
106 may include a variety of commercially available interfaces, for example, interfaces for peripheral device(s), such as data input output devices, referred to as I/O devices, storage devices, network device. The I/O device(s) may include Universal Serial Bus (USB) ports, Ethernet ports, host bus adaptors, and their corresponding device drivers.
[0027] Further, the computing system 100 may include modules 108. The modules 108 may be coupled to the processor 102. The modules 108, amongst other things, include routines, programs, objects, components, and data structures, which perform particular tasks or implement particular abstract data types. The modules 108 may also be implemented as, signal processor(s), state machine(s), logic circuitries, and/or any other device or component that manipulate signals based on computer- readable instructions. [0028] In said implementation, the modules 108 include a scanning module 110, a character recognition module 112, a format recognition module 114, a translation module 116, a data correction module 118 and other module(s) 120. The other module(s) 120 may include computer-readable instructions that supplement applications or functions performed by the computing system 100.
[0029] Further, the computing system 100 may also include data 122. In one implementation, the data 122 includes scan data 124 and other data 126. The other data 126 may include data generated and saved by the modules 108 for providing various functionalities of the computing system 100.
[0030] In one example, the computing system 100 may include an input unit 128 which may be inform of a keypad comprising a plurality of keys or an on-screen keyboard comprising a plurality of keys. In said example, the input unit 128 includes a dedicated scan button 130 to initiate context based scanning. In other examples, a pre- configured combination of keys may be used to initiate context based scanning.
[0031] In one implementation, the computing system 100 may include an integrated scanner unit 132, such as a camera, for scanning a document. In other implementations, the scanner unit 132 may be communicatively coupled to the computing system 100 using various available ports of the computing system 100.
[0032] In operation, a user may press the pre-configured key combination or the dedicated scan button 130 to initiate context based scanning while working on any application running on the computing system 100, such as an e-mail client, a word processor and a messaging client. On initiation of context based scanning, the scanning module 110 may activate the integrated scanner unit 132 to capture a snapshot of or scan a hardcopy, whose contents the user wants to use in the application. In one example, the scanning module 110 may set various scanner unit parameters, such as focal length, aperture depth, exposure time, film speed and usage of flash to optimally scan the hardcopy. In another example, the scanning module 110 may prompt the user to set values for the various scanner unit parameters for scanning the hardcopy. The scanning module 110 may further create and save the scanned document for future use. The scanning module 110 may save the scanned document, as the scan data 124, in various file formats, such as portable document format (pdf).
[0033] On successful scanning of the hardcopy and subsequent generation of the scanned document, the character recognition module 112 may use techniques, such as optical character recognition, to convert the scanned document into a digital document containing text. In one implementation, the character recognition module 112 may prompt the user to mark the areas of the scanned document which contains text and graphics. The character recognition module 112 may then only process the areas which contain text so as to reduce processing load on the computing system 100.
[0034] The format normalization module 114 may then analyze the digital document to remove various formatting symbols, such as line breaks, section breaks, column breaks and page breaks, to generate unformatted text. For example, the user may have scanned a news paper which usually comprises of multiple columns. The format normalization module 114 may analyze the digital document to demarcate whether a gap between two words in a row is caused due to a space symbol between the words or a column break between the two words. In another implementation, the format normalization module 114 may facilitate the user to either retain the original formatting of the contents of the digital document, as is on the hardcopy, or format the contents of the digital document in accordance with the formatting styles of the application, from which the context based scanning was initiated. In one implementation, the format normalization module 114 may format the scanned text based on a user input field of the application, from where the scanning is initiated. For example, in case the scanning is initiated from an address bar of a browser, the computing system may format the text in
form of a uniform resource locator (URL). Based on the selection of the user, the format normalization module 114 may format the document accordingly. Thus, the format normalization module 114 facilitates the user to capture text from documents having various formatting styles. The processed text may then be placed in a clipboard of the computing system 100. The user may then paste the contents of the clipboard onto the application from where the context based scanning was initiated.
[0035] In one example, the computing system 100 may be operated in a language conversion mode. In the language conversion mode, the translation module 116 may translate the captured text to a default language of the application from where the context based scanning was launched or to a default language of the computing system 100 or to any language specified by the user. For example, the user may initiate context based scanning while using an e-mail client whose default language is English. The user may then scan a newspaper in a different language, for example, a French newspaper. In language conversion mode, the translation module 116 may convert the captured text in French to English. In one example, the translation module 116 may retrieve data from databases located on the internet or from in-built databases of the computing system 100 to facilitate the translation of the captured text from one language to another.
[0036] In another example, the computing system 100 may be operated in a data correction mode. In the data correction mode, the data correction module 118 may analyze the captured text to detect errors, such as spelling mistakes and grammatical errors. In one example, the data correction module 118 may retrieve data from dictionaries available on the internet or in-built databases of the computing system 100 to facilitate error detection and error correction. In one implementation, the data correction module 118 may automatically correct any detected errors in the captured text. In another implementation, the data correction module 118 may highlight the detected errors and prompt the user to correct or ignore the same.
[0037] Thus, the computing system 100 for context based scanning facilitates using of content, already available in hardcopies, for inputting data in an application of the computing system, without having to exit or quit the application. The computing system for context based scanning thus makes it convenient for the user to use captured text from various sources for generating content by providing the user with additional features such as formatting, translation, and error correction.
[0038] Figure 2 illustrates a method 200 for context based scanning, according to an example of the present subject matter. The order in which the method 200 is described is not intended to be construed as a limitation, and any number of the described method blocks can be combined in any order to implement method 200, or an alternative method. Additionally, individual blocks may be deleted from the method 200 without departing from the spirit and scope of the subject matter described herein. Furthermore, the method 200 may be implemented in any suitable hardware, machine readable instructions, firmware, or combination thereof.
[0039] In one example, the steps of the method 200 can be performed by programmed computers. Herein, some examples are also intended to cover program storage devices, for example, digital data storage media, which are machine or computer readable and encode machine-executable or computer-executable programs of instructions, where said instructions perform some or all of the steps of the described method 200. The program storage devices may be, for example, digital memories, magnetic storage media such as a magnetic disks and magnetic tapes, hard drives, or optically readable digital data storage media.
[0040] With reference to method 200 as depicted in Figure 2, as depicted in block 202, a user input is received to scan a document. In one implementation, the user may initiate context based scanning, while working on any application of the computing system 100, by pressing the pre-configured key combination or the dedicated scan button 130.
[0041] As illustrated in block 204, a scanner unit 132 of the computing system
100 is activated to scan the document. In one example, the scanning module 110 activates the integrated scanner unit 132 of the computing system 100. In one implementation, the scanning module 110 may automatically set various scanner unit parameters associated with the integrated scanner unit 132, whereas in another implementation, the scanning module 110 may prompt the user to set the values for the various scanner unit parameters.
[0042] As shown at block 206, the contents of the scanned document are analyzed using optical character recognition techniques. The contents of the scanned document are then extracted and a digital document containing text is generated. In one implementation, the character recognition module 112 may implement optical character recognition techniques to extract contents from the scanned document and generate the digital document containing text.
[0043] At block 208, the formatting of the digital document is normalized. In one implementation, the format normalization module 114 analyzes the digital document to remove various formatting symbols to generate unformatted text. In another example the format normalization module 114 may also facilitate the user either retain the original formatting of the contents or format the contents of the digital document in accordance with the formatting styles of the application, from which the context based scanning was initiated. Based on the selection of the user, the format normalization module 114 may format the text of the digital document accordingly.
[0044] As depicted in block 210, it is determined whether the language conversion mode is activated or on. If at block 210, it is determined that the language conversion mode is activated, then as depicted in block 212, the text of the digital document is converted into a language specified by the user. In one implementation, the translation module 116 may translate the text of the digital document to a default language of the application from where the context based scanning was launched or to a default language of the computing system 100 or to any language specified by the user. If at block 210, it is determined that the language conversion mode is deactivated or off then no changes are made to the digital document.
[0045] As illustrated in block 214, it is determined whether the data correction mode is activated or on. If at block 214, it is determined that the data correction mode is activated, as depicted in block 216, the text of the digital document is analyzed for error detection and error correction. In one implementation, the data correction module 118 may analyze the captured text to detect errors, such as spelling mistakes and grammatical mistakes. The data correction module 118 may then automatically correct any detected errors in the captured text or highlight the detected errors and prompt the user to correct or ignore the same. If at block 214, it is determined that the language conversion mode is deactivated or off, then no changes are made to the digital document.
[0046] As shown in block 218, the contents of the digital document are then pasted onto an application selected by the user. In one implementation, the format normalization module 114 may then place the text in a clipboard of the computing system 100. The user may then paste the contents of the clipboard onto the application from where the context based scanning was initiated.
[0047] Thus, the method 200 for context based scanning facilitates using of content, already available as hardcopies, for inputting data in an application of the
computing system, without having to exit or quit the application. The method 200 makes it convenient for the user to use captured text from various sources for generating content by providing the user with additional features such as formatting, translation and error correction.
[0048] Although implementations for context based scanning have been described in language specific to structural features and/or methods, it is to be understood that the appended claims are not necessarily limited to the specific features or methods described. Rather, the specific features and methods are disclosed as examples of systems and methods for context based scanning over a network.
Claims
1. A computing system (100) for performing context based scanning, the computing system (100) comprising:
a processor (102);
an input unit (128), coupled to the processor (102), to initiate the context based scanning from an application running on the computing system (100); a scanning module (110) coupled to the processor (102), to:
activate a scanner unit (132), communicatively coupled to the computing system (100), on initiation of the context based scanning; and scan a document using the scanner unit (132);
a character recognition module (112), coupled to the processor(102), to: analyze the scanned document using optical character recognition technique; and
generate, based on the analysis, a digital document containing text
; and
a format normalization module (114), coupled to the processor(102), to normalize the text in a format suitable for pasting the text onto the application.
2. The computing system (100) as claimed in claim 1, further comprising a dedicated scan button (130), to initiate context based scanning from the application running on the computing system (100).
3. The computing system (100) as claimed in claim 1, further comprising a translation module (116), coupled to the processor (102), to translate the text of the digital document to at least one of a default language of the application, a default language of the computing system (100), and a language specified by a user of the computing system (100).
4. The computing system (100) as claimed in claim 1, wherein the format normalization module (114) further removes formatting symbols from the text of the digital document to generate unformatted text.
The computing system (100) as claimed in claim 1, wherein the format normalization module (114) further processes the text of the digital document to format the text based on at least one of a formatting style of the application and a user input field from which the context based scanning is initiated.
The computing system (100) as claimed in claim 1, further comprising a data correction module (118), coupled to the processor (102), to analyze the text of the digital document for detection of errors present in the text.
The computing system (100) as claimed in claim 6, wherein the data correction module (118) further analyzes the text of the digital document to correct the detected errors.
A method to perform context based scanning, the method comprising:
initiating context based scanning from an application running on a computing system (100) by pressing at least one of a dedicated scan button (130) and a pre-configured combination of keys of the computing system (100);
scanning a document on initiation of the context based scanning;
analyzing the scanned document, using optical recognition techniques, to generate a digital document containing text;
normalizing the digital document; and
pasting the normalized text onto the application.
The method as claimed in claim 8, further comprising translating the text of the digital document to at least one of a default language of the application, a default language of the computing system (100) and a language specified by a user of the computing system (100).
The method as claimed in claim 9, further comprising removing formatting symbols from the text of the digital document to generate unformatted text.
The method as claimed in claim 8, further comprising processing the text of the digital document to format the text based on at least one formatting style of the application.
12. The method as claimed in claim 8, the method further comprising:
analyzing the text of the digital document for detection of errors present in the text; and
correcting the detected errors.
A non-transitory computer-readable medium having a set of computer readable instructions that, when executed, cause a computing system to:
receive a user input to initiate context based scanning from an application running on a computing system (100);
scan a document on initiation of the context based scanning; analyze the scanned document, using optical recognition techniques, to generate a digital document containing text; and
paste the text onto the application.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
IN964/DEL/2013 | 2013-03-29 | ||
IN964DE2013 | 2013-03-29 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2014154457A1 true WO2014154457A1 (en) | 2014-10-02 |
Family
ID=50272592
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/EP2014/054267 WO2014154457A1 (en) | 2013-03-29 | 2014-03-05 | Systems and methods for context based scanning |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2014154457A1 (en) |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20050157955A1 (en) * | 2004-01-15 | 2005-07-21 | Hitachi Global Storage Technologies | Self-contained OCR system using hard disk drive |
US20050205671A1 (en) * | 2004-02-13 | 2005-09-22 | Tito Gelsomini | Cellular phone with scanning capability |
WO2006011641A1 (en) * | 2004-07-30 | 2006-02-02 | Canon Kabushiki Kaisha | Communication apparatus, information processing method, program, and storage medium |
US20060085477A1 (en) * | 2004-10-01 | 2006-04-20 | Ricoh Company, Ltd. | Techniques for retrieving documents using an image capture device |
US20080237347A1 (en) * | 2005-06-17 | 2008-10-02 | Wolf Erich W | PDA compatible text scanner |
US20100188419A1 (en) * | 2009-01-28 | 2010-07-29 | Google Inc. | Selective display of ocr'ed text and corresponding images from publications on a client device |
US20120050819A1 (en) * | 2010-08-30 | 2012-03-01 | Jiang Hong | Approach For Processing Scanned Document Data |
WO2012075315A1 (en) * | 2010-12-01 | 2012-06-07 | Google Inc. | Identifying matching canonical documents in response to a visual query |
-
2014
- 2014-03-05 WO PCT/EP2014/054267 patent/WO2014154457A1/en active Application Filing
Patent Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20050157955A1 (en) * | 2004-01-15 | 2005-07-21 | Hitachi Global Storage Technologies | Self-contained OCR system using hard disk drive |
US20050205671A1 (en) * | 2004-02-13 | 2005-09-22 | Tito Gelsomini | Cellular phone with scanning capability |
WO2006011641A1 (en) * | 2004-07-30 | 2006-02-02 | Canon Kabushiki Kaisha | Communication apparatus, information processing method, program, and storage medium |
US20060085477A1 (en) * | 2004-10-01 | 2006-04-20 | Ricoh Company, Ltd. | Techniques for retrieving documents using an image capture device |
US20080237347A1 (en) * | 2005-06-17 | 2008-10-02 | Wolf Erich W | PDA compatible text scanner |
US20100188419A1 (en) * | 2009-01-28 | 2010-07-29 | Google Inc. | Selective display of ocr'ed text and corresponding images from publications on a client device |
US20120050819A1 (en) * | 2010-08-30 | 2012-03-01 | Jiang Hong | Approach For Processing Scanned Document Data |
WO2012075315A1 (en) * | 2010-12-01 | 2012-06-07 | Google Inc. | Identifying matching canonical documents in response to a visual query |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN107451128B (en) | Multi-language code adaptation method, device and storage medium | |
US20170300128A1 (en) | Multimodel Text Input by a Keyboard/Camera Text Input Module Replacing a Conventional Keyboard Text Input Module on a Mobile Device | |
CN107797754B (en) | Method and device for text replication and medium product | |
CN108108342B (en) | Structured text generation method, search method and device | |
WO2022057707A1 (en) | Text recognition method, image recognition classification method, and document recognition processing method | |
US9081412B2 (en) | System and method for using paper as an interface to computer applications | |
US20190370339A1 (en) | System and method for real time translation | |
US9098759B2 (en) | Image processing apparatus, method, and medium for character recognition | |
US20220108556A1 (en) | Method of comparing documents, electronic device and readable storage medium | |
US11521365B2 (en) | Image processing system, image processing apparatus, image processing method, and storage medium | |
US9977766B2 (en) | Keyboard input corresponding to multiple languages | |
US10460192B2 (en) | Method and system for optical character recognition (OCR) of multi-language content | |
US9733825B2 (en) | East Asian character assist | |
CN111291572A (en) | Character typesetting method and device and computer readable storage medium | |
CN112149680A (en) | Wrong word detection and identification method and device, electronic equipment and storage medium | |
US11941903B2 (en) | Image processing apparatus, image processing method, and non-transitory storage medium | |
US20210295033A1 (en) | Information processing apparatus and non-transitory computer readable medium | |
CN107590136B (en) | Translation device, translation system, and translation method | |
WO2014154457A1 (en) | Systems and methods for context based scanning | |
JP2019057137A (en) | Information processing apparatus and information processing program | |
US20150261746A1 (en) | Translation system, information processing apparatus, information processing method, and computer program product | |
US11006015B2 (en) | Information processing apparatus and non-transitory computer readable medium storing information processing program | |
JP7342518B2 (en) | Image processing device and image processing program | |
Krejcar | Smart implementation of text recognition (OCR) for smart mobile devices | |
US20230102476A1 (en) | Information processing apparatus, non-transitory computer readable medium storing program, and information processing method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 14709603 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 14709603 Country of ref document: EP Kind code of ref document: A1 |