CA2653843A1 - Learning character segments during text input - Google Patents

Learning character segments during text input Download PDF

Info

Publication number
CA2653843A1
CA2653843A1 CA002653843A CA2653843A CA2653843A1 CA 2653843 A1 CA2653843 A1 CA 2653843A1 CA 002653843 A CA002653843 A CA 002653843A CA 2653843 A CA2653843 A CA 2653843A CA 2653843 A1 CA2653843 A1 CA 2653843A1
Authority
CA
Canada
Prior art keywords
character
characters
another
proposed
determination
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CA002653843A
Other languages
French (fr)
Other versions
CA2653843C (en
Inventor
Vadim Fux
Sergey Kolomiets
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
BlackBerry Ltd
Original Assignee
Research In Motion Limited
Vadim Fux
Sergey Kolomiets
2012244 Ontario Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Research In Motion Limited, Vadim Fux, Sergey Kolomiets, 2012244 Ontario Inc. filed Critical Research In Motion Limited
Publication of CA2653843A1 publication Critical patent/CA2653843A1/en
Application granted granted Critical
Publication of CA2653843C publication Critical patent/CA2653843C/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/018Input/output arrangements for oriental characters
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/12Use of codes for handling textual entities
    • G06F40/126Character encoding
    • G06F40/129Handling non-Latin characters, e.g. kana-to-kanji conversion
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/289Phrasal analysis, e.g. finite state techniques or chunking
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/40Processing or translation of natural language
    • G06F40/53Processing of non-Latin text

Abstract

An improved method of learning character segments during text input enables facilitated text input on an improved handheld electronic device. In response to a series of inputs, segments and other objects are analyzed to generate a proposed character interpretation of the series of inputs. Responsive to detecting a replacement of a character of the character interpretation with another character, a character learning string comprising the another character and a number of additional characters of the character interpretation are stored as a candidate. In response to another series of inputs, another proposed character interpretation is generated. Responsive to detecting another replacement of a character of the another character interpretation with a different character, another character learning string comprising the different character and a number of characters of the another character interpretation are compared with the stored candidate. If a set of characters in the another character learning string match characters in the candidate, the set of characters are stored as a segment.

Claims (22)

1. A method of enabling input on a handheld electronic device comprising a memory having stored therein a plurality of characters and a plurality of segments, the segments each comprising a plurality of the characters, the method comprising:
receiving as a first entry a plurality of first inputs, at least some of the first inputs each corresponding with a number of the characters;
comparing at least a portion of the first entry with at least some of the segments to identify for each of at least some of the first inputs a proposed first character of the characters with which the first input corresponds;
outputting the proposed first characters;
for at least one of the first inputs, detecting as an editing input a replacement of the proposed first character with another first character with which the first input corresponds;
responsive to said detecting, storing as a candidate character string a string of characters comprising the another first character and a number of the proposed first characters output adjacent thereto;
receiving as a second entry a plurality of second inputs, at least some of the second inputs each corresponding with a number of the characters;
comparing at least a portion of the second entry with at least some of the segments to identify for each of at least some of the second inputs a proposed second character of the characters with which the second input corresponds;
outputting the proposed second characters;
for at least one of the second inputs, detecting as an editing input a replacement of the proposed second character with another second character with which the second input corresponds;
making a determination that a series of characters comprising the another second character and a number of proposed second characters output adjacent thereto sequentially match at least a portion of the candidate character string, the at least portion of the candidate character string comprising the another first character and at least a subset of the number of proposed first characters; and responsive to said making a determination, storing the series of characters as a segment.
2. The method of Claim 1, further comprising making as at least a portion of said determination a determination that the series of characters sequentially match the another first character and a quantity of said proposed first characters disposed adjacent the another first character.
3. The method of Claim 1, wherein the handheld electronic device comprises an input apparatus comprising a plurality of input members, and further comprising detecting as the first entry a plurality of input member actuations, at least some of the first inputs each comprising a plurality of the input member actuations.
4. The method of Claim 1, further comprising storing as the candidate character string another first character, a quantity of the proposed first characters preceding the another first character, and a quantity of the proposed first characters following the another first character.
5. The method of Claim 4, further comprising making as at least a portion of said determination:
a determination that the another second character matches the another first character;
followed by a determination that a proposed second character that one of precedes and follows the another second character matches a proposed first character that the one of precedes and follows the another first character;
followed by a determination that a proposed second character that the other of precedes and follows the another second character matches a proposed first character that the other of precedes and follows the another first character.
6. The method of Claim 4, further comprising making as at least a portion of said determination:
a determination that the another second character matches the another first character;
followed by a number of determinations that a number of proposed second characters which alternately precede and follow the another second character in a fashion progressively moving outwardly from the another second character match a number of proposed first characters which similarly alternately precede and follow the another first character in a fashion progressively moving outwardly from the another first character;
followed by a determination that a particular proposed second character at a particular position in the series with respect to the another second character differs from a proposed first character similarly positioned in the candidate character string with respect to the another first character; and storing as the series of characters the another second character and the number of proposed second characters.
7. The method of Claim 6, further comprising:
responsive to said determination that the proposed second character differs, making as one of said number of determinations a determination that a quantity of additional proposed second characters positioned in the series at a side of the another second character opposite that of the particular proposed second character match a quantity of proposed first characters positioned in the candidate character string with respect to the another first character at a position similar to that of the quantity of additional proposed second characters with respect to the another second character; and storing the quantity of additional proposed second characters as a portion of the number of proposed second characters of the series of characters.
8. The method of Claim 6, further comprising, responsive to said determination that the proposed second character differs, initiating said storing the series of characters.
9. The method of Claim 1, further comprising deleting the candidate character string.
10. The method of Claim 1, further comprising determining that the series of characters comprises more than a predetermined quantity of characters and, responsive thereto:
determining that another segment matches a portion of the series of characters; and storing as the segment the portion of the series of characters not matched by the another segment.
11. The method of Claim 10, further comprising storing as a combination object at least a representation of the segment and at least a representation of the another segment.
12. A handheld electronic device comprising an input apparatus, a processor apparatus, and an output apparatus, the processor apparatus comprising a processor and a memory having stored therein a plurality of objects comprising a plurality of characters and a plurality of segments, the segments each comprising a plurality of the characters, the memory having stored therein a number of routines which, when executed by the processor, cause the handheld electronic device to be adapted to perform operations comprising:
receiving as a first entry a plurality of first inputs, at least some of the first inputs each corresponding with a number of the characters;
comparing at least a portion of the first entry with at least some of the segments to identify for each of at least some of the first inputs a proposed first character of the characters with which the first input corresponds;
outputting the proposed first characters;
for at least one of the first inputs, detecting as an editing input a replacement of the proposed first character with another first character with which the first input corresponds;
responsive to said detecting, storing as a candidate character string a string of characters comprising the another first character and a number of the proposed first characters output adjacent thereto;
receiving as a second entry a plurality of second inputs, at least some of the second inputs each corresponding with a number of the characters;
comparing at least a portion of the second entry with at least some of the segments to identify for each of at least some of the second inputs a proposed second character of the characters with which the second input corresponds;
outputting the proposed second characters;
for at least one of the second inputs, detecting as an editing input a replacement of the proposed second character with another second character with which the second input corresponds;
making a determination that a series of characters comprising the another second character and a number of proposed second characters output adjacent thereto sequentially match at least a portion of the candidate character string, the at least portion of the candidate character string comprising the another first character and at least a subset of the number of proposed first characters; and responsive to said making a determination, storing the series of characters as a segment.
13. The handheld electronic device of Claim 12 wherein the operations further comprise making as at least a portion of said determination a determination that the series of characters sequentially match the another first character and a quantity of said proposed first characters disposed adjacent the another first character.
14. The handheld electronic device of Claim 12, wherein the input apparatus comprises a plurality of input members, and wherein the operations further comprise detecting as the first entry a plurality of input member actuations, at least some of the first inputs each comprising a plurality of the input member actuations.
15. The handheld electronic device of Claim 12 wherein the operations further comprise storing as the candidate character string another first character, a quantity of the proposed first characters preceding the another first character, and a quantity of the proposed first characters following the another first character.
16. The handheld electronic device of Claim 15 wherein the operations further comprise making as at least a portion of said determination:
a determination that the another second character matches the another first character;
followed by a determination that a proposed second character that one of precedes and follows the another second character matches a proposed first character that the one of precedes and follows the another first character;
followed by a determination that a proposed second character that the other of precedes and follows the another second character matches a proposed first character that the other of precedes and follows the another first character.
17. The handheld electronic device of Claim 15 wherein the operations further comprise making as at least a portion of said determination:
18 a determination that the another second character matches the another first character;
followed by a number of determinations that a number of proposed second characters which alternately precede and follow the another second character in a fashion progressively moving outwardly from the another second character match a number of proposed first characters which similarly alternately precede and follow the another first character in a fashion progressively moving outwardly from the another first character;
followed by a determination that a particular proposed second character at a particular position in the series with respect to the another second character differs from a proposed first character similarly positioned in the candidate character string with respect to the another first character; and storing as the series of characters the another second character and the number of proposed second characters.

18. The handheld electronic device of Claim 17 wherein the operations further comprise:
responsive to said determination that the proposed second character differs, making as one of said number of determinations a determination that a quantity of additional proposed second characters positioned in the series at a side of the another second character opposite that of the particular proposed second character match a quantity of proposed first characters positioned in the candidate character string with respect to the another first character at a position similar to that of the quantity of additional proposed second characters with respect to the another second character; and storing the quantity of additional proposed second characters as a portion of the number of proposed second characters of the series of characters.
19. The handheld electronic device of Claim 17 wherein the operations further comprise, responsive to said determination that the proposed second character differs, initiating said storing the series of characters.
20. The handheld electronic device of Claim 12 wherein the operations further comprise deleting the candidate character string.
21. The handheld electronic device of Claim 12 wherein the operations further comprise determining that the series of characters comprises more than a predetermined quantity of characters and, responsive thereto:
determining that another segment matches a portion of the series of characters; and storing as the segment the portion of the series of characters not matched by the another segment.
22. The handheld electronic device of Claim 21 wherein the operations further comprise storing as a combination object at least a representation of the segment and at least a representation of the another segment.
CA2653843A 2006-06-30 2006-06-30 Learning character segments during text input Active CA2653843C (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CA2006/001089 WO2008000058A1 (en) 2006-06-30 2006-06-30 Learning character segments during text input

Publications (2)

Publication Number Publication Date
CA2653843A1 true CA2653843A1 (en) 2008-01-03
CA2653843C CA2653843C (en) 2012-02-07

Family

ID=38845060

Family Applications (1)

Application Number Title Priority Date Filing Date
CA2653843A Active CA2653843C (en) 2006-06-30 2006-06-30 Learning character segments during text input

Country Status (2)

Country Link
CA (1) CA2653843C (en)
WO (1) WO2008000058A1 (en)

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7124080B2 (en) * 2001-11-13 2006-10-17 Microsoft Corporation Method and apparatus for adapting a class entity dictionary used with language models
US7228267B2 (en) * 2002-07-03 2007-06-05 2012244 Ontario Inc. Method and system of creating and using Chinese language data and user-corrected data
US7478033B2 (en) * 2004-03-16 2009-01-13 Google Inc. Systems and methods for translating Chinese pinyin to Chinese characters
CA2496872C (en) * 2004-03-17 2010-06-08 America Online, Inc. Phonetic and stroke input methods of chinese characters and phrases

Also Published As

Publication number Publication date
CA2653843C (en) 2012-02-07
WO2008000058A1 (en) 2008-01-03

Similar Documents

Publication Publication Date Title
CA2647938A1 (en) Handheld electronic device and method for learning contextual data during disambiguation of text input
CA2509010A1 (en) Handheld electronic device with text disambiguation
GB2451035A (en) Handheld electronic device and method for performing optimized spell checking during text entry by providing a sequentially ordered series of spell-check algo
CA2635045A1 (en) Handheld electronic device and method for disambiguation of text input and providing spelling substitution
ATE407409T1 (en) AUTOMATIC GESTURE DETECTION
DE602006016846D1 (en) SYSTEM AND METHOD FOR BROWSING AND COMPARING DATA WITH IDEOGRAMMATIC CONTENTS
WO2005059672A3 (en) Communication device and method for inputting and predicting text
WO2019201511A8 (en) Method and data processing apparatus
CA2636207A1 (en) Handheld electronic device providing proposed corrected input in response to erroneous text entry in environment of text requiring multiple sequential actuations of the same key, and associated method
CA2509009A1 (en) Handheld electronic device with text disambiguation
CA2509012A1 (en) Handheld electronic device with text disambiguation
CA2509014A1 (en) Handheld electronic device with text disambiguation
DE60324585D1 (en) METHOD, METHOD AND COMPUTER PROGRAM FOR DETECTING POINT CORRESPONDENCES IN POINT QUANTITIES
Xiao et al. Data mining based on segmented time warping distance in time series database.
CA2653843A1 (en) Learning character segments during text input
WO2007033228A3 (en) Reducing false positives for automatic computerized detection of objects
WO2006105641B1 (en) Handheld electronic device with text disambiguation employing advanced editing feature
CA2583923A1 (en) Handheld electronic device and method for performing spell checking during text entry and for providing a spell-check learning feature
CA2554397A1 (en) Handheld electronic device with disambiguation of compound word text input employing separating input
WO2009013818A1 (en) Character recognition processing method and device
CA2658586A1 (en) Learning character segments from received text
CA2639224A1 (en) Handheld electronic device and associated method providing disambiguation of an ambiguous object during editing and selectively providing prediction of future characters
CA2605785A1 (en) Handheld electronic device with reduced keyboard and associated method of providing improved disambiguation with reduced degradation of device performance
CN102902918A (en) Malicious file detection method based on composite feature code
CA2635009A1 (en) Handheld electronic device and method for disambiguation of compound text input and that employs n-gram data to limit generation of low-probability compound language solutions

Legal Events

Date Code Title Description
EEER Examination request