GB2596333A - Improvements relating to sequence recognition to enable communications - Google Patents
- Publication number
- GB2596333A
- Authority
- GB
- United Kingdom
- Prior art keywords
- character set
- sequence
- content
- valid character
- captured
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M1/00—Substation equipment, e.g. for use by subscribers
- H04M1/26—Devices for calling a subscriber
- H04M1/27—Devices whereby a plurality of signals may be stored simultaneously
- H04M1/274—Devices whereby a plurality of signals may be stored simultaneously with provision for storing more than one subscriber number at a time, e.g. using toothed disc
- H04M1/2745—Devices whereby a plurality of signals may be stored simultaneously with provision for storing more than one subscriber number at a time, e.g. using toothed disc using static electronic memories, e.g. chips
- H04M1/2753—Devices whereby a plurality of signals may be stored simultaneously with provision for storing more than one subscriber number at a time, e.g. using toothed disc using static electronic memories, e.g. chips providing data content
- H04M1/2755—Devices whereby a plurality of signals may be stored simultaneously with provision for storing more than one subscriber number at a time, e.g. using toothed disc using static electronic memories, e.g. chips providing data content by optical scanning
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M7/00—Arrangements for interconnection between switching centres
- H04M7/0024—Services and arrangements where telephone services are combined with data services
- H04M7/003—Click to dial services
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M7/00—Arrangements for interconnection between switching centres
- H04M7/0024—Services and arrangements where telephone services are combined with data services
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/30—Semantic analysis
Landscapes
- Engineering & Computer Science (AREA)
- Signal Processing (AREA)
- General Engineering & Computer Science (AREA)
- Telephonic Communication Services (AREA)
- Character Discrimination (AREA)
Abstract
Content is displayed on a display device. A content capture program is enabled, and an input is received from the capture program designating a portion of the content to be captured. The captured portion is analysed to recognise a sequence of characters and the recognised sequence is extracted. The recognised sequence is evaluated to determine a valid character set for enabling a communication. The valid character set may be provided to a telephony system to instigate the communication via a communication network using the valid character set. The display may be augmented to identify the valid character set, for example by altering its appearance, such as by placing a telephone icon next to the telephone number. Analysing the captured portion may use optical character recognition (OCR).
Description
IMPROVEMENTS RELATING TO SEQUENCE RECOGNITION TO ENABLE COMMUNICATIONS
The present invention relates to a method of recognizing a character sequence on a display for enabling communication over a communication network, and specifically to recognizing a telephone number to enable click-to-call functionality. The term "telephone number" encompasses any numerical or alphanumeric sequence which can be used to instigate a communication between at least two end-users.
Integrated software applications that enable calls to be made from other software applications wherever telephone numbers exist are known. Recently, however, new genres of software have been produced with which it is impossible to integrate natively in order to fetch numbers. One such application is "Microsoft Teams". Therefore, it is desirable to provide a method to allow dialling from any application where telephone numbers are displayed with only minimal configuration.
According to a first aspect of the present invention, there is provided a method of recognising a sequence of characters for enabling a communication via a communication network, the method comprising the steps: displaying content on a display device; enabling a content capture program; receiving an input from the capture program designating a portion of the content to be captured; analysing the captured portion to recognise a sequence of characters; extracting the recognised sequence; evaluating the recognized sequence to determine a valid character set for enabling a communication.
Advantageously, the method is application agnostic, useable on any displayed content to identify and retrieve valid character sets regardless of inherent presentation and tolerant of display quality, thereby enabling the valid character sets to be used to launch a communication.
In an embodiment, the method includes the further step of providing the valid character set to a telephony system to instigate the communication via the communication network using the valid character set.
In an embodiment, the method includes the further step of augmenting the display to identify the valid character set.
In an embodiment, the captured portion is captured in a lossless image format.
In an embodiment, the step of analysing comprises optical character recognition techniques.
In an embodiment the step of analysing further comprises upscaling techniques.
According to a second aspect of the present invention, there is provided a system comprising: a display device, a processing unit, and a memory comprising computer storage media including instructions that when executed by the processing unit: display content on the display device; enable a content capture program; receive an input from the capture program designating a portion of the content to be captured; analyse the captured portion to recognise a sequence of characters; extract the recognised sequence; evaluate the recognized sequence to determine a valid character set for enabling a communication.
In an embodiment, the computer storage media include instructions executable by the processing unit to provide the valid character set to a telephony system to instigate the communication via the communication network using the valid number.
In an embodiment, the display device includes a touch-sensitive or proximity sensitive display surface.
According to a third aspect of the present invention, there is provided at least one computer-readable storage medium containing computer-executable instructions for causing a computer device to perform operations comprising: displaying content on a display device; enabling a content capture program; receiving an input from the capture program designating a portion of the content to be captured; analysing the captured portion to recognise a sequence of characters; extracting the recognised sequence; evaluating the recognized sequence to determine a valid character set for enabling a communication.
In an embodiment, the operations further comprise: providing the valid character set to a telephony system to instigate the communication via the communication network using the valid character set.
In an embodiment, the operations further comprise: augmenting the display to identify the valid character set.
The invention may be produced in various ways and an embodiment thereof will now be described, by way of example only, reference being made to the accompanying drawings, in which:
- Figure 1 is a flow chart illustrating in overview an embodiment of the method according to the present invention;
- Figure 2 illustrates an embodiment of the system according to the present invention; and
- Figure 3 is a screen print showing operation of the method according to the present invention on a display in an embodiment as a software application.
Referring to Figure 1 of the drawings, there is illustrated a flow chart sequencing the steps associated with the method (100) according to an embodiment of the present invention. Prior to implementing the method, the content is displayed on a display device of a computer system (an example of which is described with reference to Figure 2). Once the content has been displayed, the first step (110) associated with the method (100) includes enabling a content capture program; this step may correspond to a user inputting a command on a device, to implement the subsequent method steps. The second step (120) comprises the user designating or selecting a portion of the content to be captured and may involve the user directing an input device cursor over a specific area of the displayed content. The third step (130) comprises analysing the captured portion to recognise a sequence of characters and extracting the recognised sequence. This step may involve the use of optical character recognition techniques. The fourth step (140) comprises evaluating the recognized sequence to determine a valid character set for enabling a communication and may involve the use of validation algorithms and/or lookups via stored databases of valid character sets.
A further step (150) includes providing the valid character set to a telephony system to instigate the communication via the communication network using the valid character set; this step may involve known protocols to enable the user to place a call, such as via a telephone system, for example a system including a VoIP (voice over internet protocol) enabled telephone connected to a computer system running the method.
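As a minimal sketch of handing a validated number to a telephony system, the Python below builds a `tel:` URI of the kind a softphone may register a handler for. The function name `to_tel_uri`, and the requirement that the number already be in international form, are assumptions made for illustration; the description does not specify how the number is passed on.

```python
import re


def to_tel_uri(number: str) -> str:
    """Build a tel: URI from a number written in international form.

    Assumption: `number` starts with '+' and the country code. Spaces,
    hyphens, dots and parentheses are treated as visual separators only.
    """
    compact = re.sub(r"[\s\-.()]", "", number)
    # Accept '+' followed by 4 to 15 digits; anything else is rejected.
    if not re.fullmatch(r"\+\d{4,15}", compact):
        raise ValueError(f"not an international number: {number!r}")
    return "tel:" + compact
```

A number that lacks the leading `+` is rejected rather than guessed at, so converting a national number to international form would have to happen in the validation step.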
A further step may include augmenting or modifying the display to identify the valid character set, once the valid character set has been found. This enables the user to understand that a valid number has been found and that a call can be made, for example a click to call function.
The present invention combines several technologies to allow a software application to read and understand telephone numbers from a user display. An overview of the exemplary usage of the method according to the invention can be described as follows:
- The user activates the method herein via a software application
- A section of the display is captured
- The section is analysed using optical character recognition techniques to extract readable characters
- The extracted characters are parsed and validated
- If a valid telephone number is found, the telephone number is presented to the user with an option to make a telephone call to the number.
The user can indicate that they want to call the extracted telephone number by clicking on the modified display; the method can then enable this using known telephony communication techniques. The specific implementation of each step is important to ensure sufficient usability.
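The overview above can be sketched as a small pipeline. This is an illustration only: the function name `find_callable_number` and its three injected callables are hypothetical stand-ins for the capture, OCR and validation stages, none of which the description ties to a particular library.

```python
from typing import Callable, Optional


def find_callable_number(
    capture: Callable[[], bytes],
    recognise: Callable[[bytes], str],
    validate: Callable[[str], Optional[str]],
) -> Optional[str]:
    """Run capture -> OCR -> validation once after the user activates it.

    Returns the validated number, or None when nothing valid was found.
    """
    image = capture()          # capture a section of the display
    text = recognise(image)    # extract readable characters (OCR stage)
    return validate(text)      # parse and validate the extracted characters


# Usage with stub stages standing in for real capture and OCR code.
number = find_callable_number(
    capture=lambda: b"",
    recognise=lambda image: "Call 01632 960123",
    validate=lambda text: "01632960123" if "01632" in text else None,
)
```

Passing the stages in as callables keeps the orchestration independent of whichever capture and OCR components are actually used.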
OCR can be computationally intensive and, in an envisaged use of the method, the computer terminals running it are likely to have restricted processing power due to cost and scale. Requiring the user to indicate activation (110) reduces the processing load and avoids the false positives that would be retrieved were the display continuously analysed for valid character sets. A potential embodiment may include a keypress input on a user input device such as a keyboard; activation may be maintained by holding the input key for the duration of the process, or toggled on and off. In an example, the method runs while the user holds the activation key and navigates the display with a mouse, and an area surrounding the cursor is analysed for valid character sets. This restricts the processing to the area of interest to the user, and to the time during which they are interested.
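The two activation styles described here, hold-to-activate and toggle, can be sketched as a small state holder. The class name `ActivationGate` and its method names are hypothetical, and the sketch assumes the caller filters out keyboard auto-repeat events before calling `key_down`.

```python
class ActivationGate:
    """Tracks whether analysis should run, in 'hold' or 'toggle' mode."""

    def __init__(self, mode: str = "hold") -> None:
        if mode not in ("hold", "toggle"):
            raise ValueError("mode must be 'hold' or 'toggle'")
        self.mode = mode
        self.active = False

    def key_down(self) -> None:
        if self.mode == "hold":
            self.active = True             # active while the key is held
        else:
            self.active = not self.active  # each press flips the state

    def key_up(self) -> None:
        if self.mode == "hold":
            self.active = False            # releasing the key stops analysis
        # In toggle mode a key release changes nothing.
```

The capture and OCR stages would then run only while `active` is true, which is what keeps the processing load down.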
Determining a section of the displayed content to be captured (120) reduces the processing load from that required to analyse the entirety of the displayed content constantly. In a potential embodiment a region surrounding a pointer cursor is shown on the display (as described with reference to Figure 3), which intuitively allows the user to indicate a telephone number by positioning the cursor close to the number. The region may be illustrated with a borderline having a preferred shape to denote the region of the displayed content to be analysed; for example, a rectangle can be used which is wider than it is tall to suit telephone numbers as commonly displayed. The size and shape of the section should be sufficiently large to surround on-screen displayed telephone numbers; however, if the region is too large then it can include extraneous data and waste CPU time. The captured section of the display can be in an uncompressed and/or lossless format, for example a bitmap may be used.
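A minimal sketch of choosing the capture region follows. The function name `capture_region` and the default size of 320 by 64 pixels are assumptions chosen only to give a rectangle that is wider than it is tall; the description does not state dimensions.

```python
def capture_region(cursor_x, cursor_y, screen_w, screen_h, width=320, height=64):
    """Return (left, top, right, bottom) centred on the cursor.

    The rectangle is shifted, not shrunk, when the cursor is near a screen
    edge, so the captured area keeps the same size wherever possible.
    """
    width = min(width, screen_w)
    height = min(height, screen_h)
    left = min(max(cursor_x - width // 2, 0), screen_w - width)
    top = min(max(cursor_y - height // 2, 0), screen_h - height)
    return left, top, left + width, top + height
```

Keeping the size constant near the edges means the later OCR stage always receives an image of the same dimensions.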
Analysing the captured portion (130) using optical character recognition (OCR) requires the technique to be resilient to variance in display quality and font style. OCR is more effective on high-quality images such as the printed word, and difficulty can arise with small images having a low number of pixels. In an exemplary embodiment, an upscaling technique (such as bicubic upscaling) can convert the captured content to a level suitable for recognition. OCR is typically done on print having a resolution of 300 dots per inch (DPI), thus allowing the OCR to work with a high confidence level. Computer displays commonly used in call centres generally operate at 96 DPI, while less common computers can also work at 240 DPI ("High DPI"). Known OCR techniques will work satisfactorily on High DPI; however, on common low-DPI displays, upscaling techniques render the images larger and, as such, the images become increasingly "blocky" or "fuzzy" and give no extra value to OCR. It has been found that using a bicubic upscaling technique will artificially create the needed missing content for the OCR to work. Other potential upscaling techniques such as angle compensation can also produce satisfactory results.
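To make the upscaling step concrete, the sketch below implements bicubic interpolation on a small greyscale grid in plain Python. It is illustrative rather than the patented implementation: the kernel parameter `a = -0.5`, the edge clamping and the names `cubic_weight` and `upscale_bicubic` are assumptions, and a real application would more likely call an imaging library. Going from 96 DPI to the 300 DPI typical of print corresponds to a scale factor of 300 / 96 = 3.125.

```python
import math

SCALE_96_TO_300 = 300 / 96  # 3.125: screen DPI up to typical print DPI


def cubic_weight(x: float, a: float = -0.5) -> float:
    """Cubic convolution kernel; a = -0.5 is a commonly used value."""
    x = abs(x)
    if x <= 1:
        return (a + 2) * x**3 - (a + 3) * x**2 + 1
    if x < 2:
        return a * x**3 - 5 * a * x**2 + 8 * a * x - 4 * a
    return 0.0


def upscale_bicubic(pixels, scale):
    """Upscale a 2-D list of grey levels (0..255) by `scale`."""
    src_h, src_w = len(pixels), len(pixels[0])
    dst_h, dst_w = round(src_h * scale), round(src_w * scale)
    out = []
    for j in range(dst_h):
        sy = (j + 0.5) / scale - 0.5          # source row for output row j
        y0 = math.floor(sy)
        row = []
        for i in range(dst_w):
            sx = (i + 0.5) / scale - 0.5      # source column for output column i
            x0 = math.floor(sx)
            total = 0.0
            for m in range(-1, 3):            # 4 x 4 neighbourhood
                yy = min(max(y0 + m, 0), src_h - 1)   # clamp at the edges
                wy = cubic_weight(sy - (y0 + m))
                for n in range(-1, 3):
                    xx = min(max(x0 + n, 0), src_w - 1)
                    total += pixels[yy][xx] * wy * cubic_weight(sx - (x0 + n))
            row.append(min(max(total, 0.0), 255.0))   # keep within grey range
        out.append(row)
    return out
```

Each output pixel is a weighted sum of the sixteen nearest source pixels, which yields smoother character edges than simply repeating pixels.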
The extracted numbers require parsing to form valid telephone numbers (140). The extracted numbers need to be distinguished from other similar strings of numbers, e.g. dates, account numbers, monetary values. Functions such as string searching algorithms or lookups are used to understand valid numbers, extract their location (area code) information and validate the length.
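The parsing and validation described above could look roughly like the following. Everything specific in it is an assumption made for illustration: the `AREA_CODES` table is a hypothetical three-entry lookup, the length limits of 10 to 11 digits are an assumed national-number convention, and the regular expressions are one possible way of separating telephone numbers from dates, account numbers and monetary values.

```python
import re

# Hypothetical lookup table (area-code prefix -> location), illustrative only.
AREA_CODES = {"020": "London", "0161": "Manchester", "01632": "Fictional range"}

# A digit run with optional spaces or hyphens, not glued to a word or a
# currency symbol on the left, and not followed by a word character.
CANDIDATE = re.compile(r"(?<![\w£$€])\d[\d \-]{8,14}\d(?!\w)")
DATE = re.compile(r"\d{1,4}[-/.]\d{1,2}[-/.]\d{1,4}")


def extract_phone_numbers(text, min_len=10, max_len=11):
    """Return (digits, location) pairs for plausible telephone numbers."""
    results = []
    prefixes = sorted(AREA_CODES, key=len, reverse=True)  # longest prefix first
    for match in CANDIDATE.finditer(text):
        raw = match.group()
        if DATE.fullmatch(raw):
            continue                      # looks like a date, not a number
        digits = re.sub(r"\D", "", raw)
        if not min_len <= len(digits) <= max_len:
            continue                      # wrong length, e.g. an account number
        code = next((p for p in prefixes if digits.startswith(p)), None)
        if code is None:
            continue                      # no known area code
        results.append((digits, AREA_CODES[code]))
    return results


sample = ("Invoice 25-06-2020, total £1,250.00, acct 1234567890123456. "
          "Call 0161 496 0123 today.")
found = extract_phone_numbers(sample)
```

In the sample, the date is rejected by the date pattern, the monetary value never forms a long enough candidate, and the 16-digit account number fails the length check, leaving only the telephone number.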
If a valid number is found, then the number is presented to the user with an option to make a call to the number. The number is represented in a similar style and displayed above its original location to make it stand out as an element to be clicked on. The display size is compensated for the displayed value such that it can be superimposed above the original, to make it clickable.
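One way to place the clickable element over the original number is to map the position found by OCR back to screen coordinates. The sketch below assumes, hypothetically, that the OCR stage reports a bounding box `(x, y, width, height)` in the pixel coordinates of the upscaled image; the function name `overlay_rect` is likewise an illustration, not part of the description.

```python
def overlay_rect(ocr_box, scale, capture_left, capture_top):
    """Map an OCR bounding box in the upscaled image to screen coordinates.

    Dividing by `scale` undoes the upscaling; adding the capture origin
    moves the box from capture-relative to screen-relative position.
    """
    x, y, w, h = ocr_box
    return (
        capture_left + round(x / scale),
        capture_top + round(y / scale),
        round(w / scale),
        round(h / scale),
    )
```

For example, a box at (125, 50) of size 500 by 75 in an image upscaled by 3.125, captured from a region whose top-left corner is at (800, 508), maps to a 160 by 24 pixel rectangle at (840, 524) on screen.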
Figure 2 illustrates an exemplary computer system (200) that may be used to implement aspects of the invention. In a basic configuration the system includes a display (210), input means (221/222), communication means (230), and standard components such as a processing unit and memory. The display (210) may be used to present content such as graphical user interfaces and data. Content (240) is displayed on the display (210) when the method (as described with reference to Figure 1) is launched, and a region or area around the cursor (250), corresponding to the input device position, is illustrated via the borderline. Within the area around the cursor (250) the method attempts to find valid character sets, which can be used to make a call via communication means (230) such as a Session Initiation Protocol (SIP) phone.
Figure 3 illustrates the steps of a method as they are performed on content on a display (300), including a portion that has been designated by a user in accordance with an embodiment of the invention. In the Figure, the displayed content is shown as it would appear to the user (a); the portion of content (340) is surrounded by a borderline (350) denoting where the method is extracting and analysing the content for valid character sets (b). Within the borderline the valid character set can be seen to have its appearance altered (360) to enable the user to interact with the number to make a call (c).
Select embodiments of the invention only have been described and illustrated, and it will be readily apparent that other embodiments, modifications, additions and omissions are possible within the scope of the invention.
The invention may be varied according to requirements, including but not limited to programming language or emulation, having as its objective the ability to recognise a sequence of characters for enabling a communication via a communication network.
Claims (12)
- 1. A method of recognising a sequence of characters for enabling a communication via a communication network, the method comprising the steps: displaying content on a display device; enabling a content capture program; receiving an input from the capture program designating a portion of the content to be captured; analysing the captured portion to recognise a sequence of characters; extracting the recognised sequence; evaluating the recognized sequence to determine a valid character set for enabling a communication.
- 2. The method according to claim 1, including the further step: providing the valid character set to a telephony system to instigate the communication via the communication network using the valid character set.
- 3. The method according to any previous claim, including the further step: augmenting the display to identify the valid character set.
- 4. The method according to any previous claim, wherein the captured portion is captured in a lossless image format.
- 5. The method according to any previous claim, wherein the step of analysing comprises optical character recognition techniques.
- 6. The method according to any previous claim, wherein the step of analysing further comprises upscaling techniques.
- 7. A system comprising: a display device, a processing unit, and a memory comprising computer storage media including instructions that when executed by the processing unit: display content on the display device; enable a content capture program; receive an input from the capture program designating a portion of the content to be captured; analyse the captured portion to recognise a sequence of characters; extract the recognised sequence; evaluate the recognized sequence to determine a valid character set for enabling a communication.
- 8. The system of claim 7, wherein the computer storage media include instructions executable by the processing unit to provide the valid character set to a telephony system to instigate the communication via the communication network using the valid number.
- 9. The system of claim 8, wherein the display device includes a touch-sensitive or proximity sensitive display surface.
- 10. At least one computer-readable storage medium containing computer-executable instructions for causing a computer device to perform operations comprising: displaying content on a display device; enabling a content capture program; receiving an input from the capture program designating a portion of the content to be captured; analysing the captured portion to recognise a sequence of characters; extracting the recognised sequence; evaluating the recognized sequence to determine a valid character set for enabling a communication.
- 11. The at least one computer-readable storage medium of claim 10, wherein the operations further comprise: providing the valid character set to a telephony system to instigate the communication via the communication network using the valid character set.
- 12. The at least one computer-readable storage medium of claim 11, wherein the operations further comprise: augmenting the display to identify the valid character set.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
GB2009716.8A GB2596333A (en) | 2020-06-25 | 2020-06-25 | Improvements relating to sequence recognition to enable communications |
Publications (2)
Publication Number | Publication Date |
---|---|
GB202009716D0 GB202009716D0 (en) | 2020-08-12 |
GB2596333A true GB2596333A (en) | 2021-12-29 |
Family
ID=71949629
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
GB2009716.8A Pending GB2596333A (en) | 2020-06-25 | 2020-06-25 | Improvements relating to sequence recognition to enable communications |
Country Status (1)
Country | Link |
---|---|
GB (1) | GB2596333A (en) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2007102045A1 (en) * | 2006-03-09 | 2007-09-13 | C.D.C. S.R.L. | Method for placing telephone, video or internet calls through automatic number seek from selections in electronic pages |
EP2091222A1 (en) * | 2008-02-18 | 2009-08-19 | Univerza v Ljubljani FAKULTETA ZA ELEKTROTEHNIKO | Click-to-dial service on IPTV |
EP2414969B1 (en) * | 2009-05-07 | 2018-05-23 | Skype | Communication system and method for browser-based phone number detection |
US20190166255A1 (en) * | 2017-11-30 | 2019-05-30 | T-Mobile Usa, Inc. | Dynamically Generated Call Triggers |
-
2020
- 2020-06-25 GB GB2009716.8A patent/GB2596333A/en active Pending
Also Published As
Publication number | Publication date |
---|---|
GB202009716D0 (en) | 2020-08-12 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US5457738A (en) | Method and system for searching an on-line directory at a telephone station | |
US8121413B2 (en) | Method and system for controlling browser by using image | |
KR101014075B1 (en) | Boxed and lined input panel | |
US11573939B2 (en) | Process and apparatus for selecting an item from a database | |
CN106484266B (en) | Text processing method and device | |
CN107785021B (en) | Voice input method, device, computer equipment and medium | |
KR100790700B1 (en) | Speech recognition assisted autocompletion of composite characters | |
EP2472372A1 (en) | Input method of contact information and system | |
US8831209B2 (en) | Conference call dialing | |
US20090079702A1 (en) | Method, Apparatus and Computer Program Product for Providing an Adaptive Keypad on Touch Display Devices | |
KR20140030361A (en) | Apparatus and method for recognizing a character in terminal equipment | |
US20050268231A1 (en) | Method and device for inputting Chinese phrases | |
CN108256523B (en) | Identification method and device based on mobile terminal and computer readable storage medium | |
US9934422B1 (en) | Digitized handwriting sample ingestion systems and methods | |
US9886626B1 (en) | Digitized handwriting sample ingestion and generation systems and methods | |
US20040176139A1 (en) | Method and wireless communication device using voice recognition for entering text characters | |
WO2020253368A1 (en) | Electronic reading display method, storage method, electronic device, computer device, and medium | |
US20040126017A1 (en) | Grammar-determined handwriting recognition | |
US20070139367A1 (en) | Apparatus and method for providing non-tactile text entry | |
CN111142683B (en) | Input assisting program, input assisting method, and input assisting device | |
CN112863495A (en) | Information processing method and device and electronic equipment | |
US10832081B2 (en) | Image processing apparatus and non-transitory computer-readable computer medium storing an image processing program | |
GB2596333A (en) | Improvements relating to sequence recognition to enable communications | |
KR100933270B1 (en) | Method, system and computer-readable recording medium for performing web search based on image information | |
CN109992121B (en) | Input method, input device and input device |