CA2653843A1 - Learning character segments during text input - Google Patents
Learning character segments during text input Download PDFInfo
- Publication number
- CA2653843A1 CA2653843A1 CA002653843A CA2653843A CA2653843A1 CA 2653843 A1 CA2653843 A1 CA 2653843A1 CA 002653843 A CA002653843 A CA 002653843A CA 2653843 A CA2653843 A CA 2653843A CA 2653843 A1 CA2653843 A1 CA 2653843A1
- Authority
- CA
- Canada
- Prior art keywords
- character
- characters
- another
- proposed
- determination
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/018—Input/output arrangements for oriental characters
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/12—Use of codes for handling textual entities
- G06F40/126—Character encoding
- G06F40/129—Handling non-Latin characters, e.g. kana-to-kanji conversion
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
- G06F40/289—Phrasal analysis, e.g. finite state techniques or chunking
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/40—Processing or translation of natural language
- G06F40/53—Processing of non-Latin text
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- General Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- Human Computer Interaction (AREA)
- Machine Translation (AREA)
- User Interface Of Digital Computer (AREA)
Abstract
An improved method of learning character segments during text input enables facilitated text input on an improved handheld electronic device. In response to a series of inputs, segments and other objects are analyzed to generate a proposed character interpretation of the series of inputs. Responsive to detecting a replacement of a character of the character interpretation with another character, a character learning string comprising the another character and a number of additional characters of the character interpretation are stored as a candidate. In response to another series of inputs, another proposed character interpretation is generated. Responsive to detecting another replacement of a character of the another character interpretation with a different character, another character learning string comprising the different character and a number of characters of the another character interpretation are compared with the stored candidate. If a set of characters in the another character learning string match characters in the candidate, the set of characters are stored as a segment.
Claims (22)
1. A method of enabling input on a handheld electronic device comprising a memory having stored therein a plurality of characters and a plurality of segments, the segments each comprising a plurality of the characters, the method comprising:
receiving as a first entry a plurality of first inputs, at least some of the first inputs each corresponding with a number of the characters;
comparing at least a portion of the first entry with at least some of the segments to identify for each of at least some of the first inputs a proposed first character of the characters with which the first input corresponds;
outputting the proposed first characters;
for at least one of the first inputs, detecting as an editing input a replacement of the proposed first character with another first character with which the first input corresponds;
responsive to said detecting, storing as a candidate character string a string of characters comprising the another first character and a number of the proposed first characters output adjacent thereto;
receiving as a second entry a plurality of second inputs, at least some of the second inputs each corresponding with a number of the characters;
comparing at least a portion of the second entry with at least some of the segments to identify for each of at least some of the second inputs a proposed second character of the characters with which the second input corresponds;
outputting the proposed second characters;
for at least one of the second inputs, detecting as an editing input a replacement of the proposed second character with another second character with which the second input corresponds;
making a determination that a series of characters comprising the another second character and a number of proposed second characters output adjacent thereto sequentially match at least a portion of the candidate character string, the at least portion of the candidate character string comprising the another first character and at least a subset of the number of proposed first characters; and responsive to said making a determination, storing the series of characters as a segment.
receiving as a first entry a plurality of first inputs, at least some of the first inputs each corresponding with a number of the characters;
comparing at least a portion of the first entry with at least some of the segments to identify for each of at least some of the first inputs a proposed first character of the characters with which the first input corresponds;
outputting the proposed first characters;
for at least one of the first inputs, detecting as an editing input a replacement of the proposed first character with another first character with which the first input corresponds;
responsive to said detecting, storing as a candidate character string a string of characters comprising the another first character and a number of the proposed first characters output adjacent thereto;
receiving as a second entry a plurality of second inputs, at least some of the second inputs each corresponding with a number of the characters;
comparing at least a portion of the second entry with at least some of the segments to identify for each of at least some of the second inputs a proposed second character of the characters with which the second input corresponds;
outputting the proposed second characters;
for at least one of the second inputs, detecting as an editing input a replacement of the proposed second character with another second character with which the second input corresponds;
making a determination that a series of characters comprising the another second character and a number of proposed second characters output adjacent thereto sequentially match at least a portion of the candidate character string, the at least portion of the candidate character string comprising the another first character and at least a subset of the number of proposed first characters; and responsive to said making a determination, storing the series of characters as a segment.
2. The method of Claim 1, further comprising making as at least a portion of said determination a determination that the series of characters sequentially match the another first character and a quantity of said proposed first characters disposed adjacent the another first character.
3. The method of Claim 1, wherein the handheld electronic device comprises an input apparatus comprising a plurality of input members, and further comprising detecting as the first entry a plurality of input member actuations, at least some of the first inputs each comprising a plurality of the input member actuations.
4. The method of Claim 1, further comprising storing as the candidate character string another first character, a quantity of the proposed first characters preceding the another first character, and a quantity of the proposed first characters following the another first character.
5. The method of Claim 4, further comprising making as at least a portion of said determination:
a determination that the another second character matches the another first character;
followed by a determination that a proposed second character that one of precedes and follows the another second character matches a proposed first character that the one of precedes and follows the another first character;
followed by a determination that a proposed second character that the other of precedes and follows the another second character matches a proposed first character that the other of precedes and follows the another first character.
a determination that the another second character matches the another first character;
followed by a determination that a proposed second character that one of precedes and follows the another second character matches a proposed first character that the one of precedes and follows the another first character;
followed by a determination that a proposed second character that the other of precedes and follows the another second character matches a proposed first character that the other of precedes and follows the another first character.
6. The method of Claim 4, further comprising making as at least a portion of said determination:
a determination that the another second character matches the another first character;
followed by a number of determinations that a number of proposed second characters which alternately precede and follow the another second character in a fashion progressively moving outwardly from the another second character match a number of proposed first characters which similarly alternately precede and follow the another first character in a fashion progressively moving outwardly from the another first character;
followed by a determination that a particular proposed second character at a particular position in the series with respect to the another second character differs from a proposed first character similarly positioned in the candidate character string with respect to the another first character; and storing as the series of characters the another second character and the number of proposed second characters.
a determination that the another second character matches the another first character;
followed by a number of determinations that a number of proposed second characters which alternately precede and follow the another second character in a fashion progressively moving outwardly from the another second character match a number of proposed first characters which similarly alternately precede and follow the another first character in a fashion progressively moving outwardly from the another first character;
followed by a determination that a particular proposed second character at a particular position in the series with respect to the another second character differs from a proposed first character similarly positioned in the candidate character string with respect to the another first character; and storing as the series of characters the another second character and the number of proposed second characters.
7. The method of Claim 6, further comprising:
responsive to said determination that the proposed second character differs, making as one of said number of determinations a determination that a quantity of additional proposed second characters positioned in the series at a side of the another second character opposite that of the particular proposed second character match a quantity of proposed first characters positioned in the candidate character string with respect to the another first character at a position similar to that of the quantity of additional proposed second characters with respect to the another second character; and storing the quantity of additional proposed second characters as a portion of the number of proposed second characters of the series of characters.
responsive to said determination that the proposed second character differs, making as one of said number of determinations a determination that a quantity of additional proposed second characters positioned in the series at a side of the another second character opposite that of the particular proposed second character match a quantity of proposed first characters positioned in the candidate character string with respect to the another first character at a position similar to that of the quantity of additional proposed second characters with respect to the another second character; and storing the quantity of additional proposed second characters as a portion of the number of proposed second characters of the series of characters.
8. The method of Claim 6, further comprising, responsive to said determination that the proposed second character differs, initiating said storing the series of characters.
9. The method of Claim 1, further comprising deleting the candidate character string.
10. The method of Claim 1, further comprising determining that the series of characters comprises more than a predetermined quantity of characters and, responsive thereto:
determining that another segment matches a portion of the series of characters; and storing as the segment the portion of the series of characters not matched by the another segment.
determining that another segment matches a portion of the series of characters; and storing as the segment the portion of the series of characters not matched by the another segment.
11. The method of Claim 10, further comprising storing as a combination object at least a representation of the segment and at least a representation of the another segment.
12. A handheld electronic device comprising an input apparatus, a processor apparatus, and an output apparatus, the processor apparatus comprising a processor and a memory having stored therein a plurality of objects comprising a plurality of characters and a plurality of segments, the segments each comprising a plurality of the characters, the memory having stored therein a number of routines which, when executed by the processor, cause the handheld electronic device to be adapted to perform operations comprising:
receiving as a first entry a plurality of first inputs, at least some of the first inputs each corresponding with a number of the characters;
comparing at least a portion of the first entry with at least some of the segments to identify for each of at least some of the first inputs a proposed first character of the characters with which the first input corresponds;
outputting the proposed first characters;
for at least one of the first inputs, detecting as an editing input a replacement of the proposed first character with another first character with which the first input corresponds;
responsive to said detecting, storing as a candidate character string a string of characters comprising the another first character and a number of the proposed first characters output adjacent thereto;
receiving as a second entry a plurality of second inputs, at least some of the second inputs each corresponding with a number of the characters;
comparing at least a portion of the second entry with at least some of the segments to identify for each of at least some of the second inputs a proposed second character of the characters with which the second input corresponds;
outputting the proposed second characters;
for at least one of the second inputs, detecting as an editing input a replacement of the proposed second character with another second character with which the second input corresponds;
making a determination that a series of characters comprising the another second character and a number of proposed second characters output adjacent thereto sequentially match at least a portion of the candidate character string, the at least portion of the candidate character string comprising the another first character and at least a subset of the number of proposed first characters; and responsive to said making a determination, storing the series of characters as a segment.
receiving as a first entry a plurality of first inputs, at least some of the first inputs each corresponding with a number of the characters;
comparing at least a portion of the first entry with at least some of the segments to identify for each of at least some of the first inputs a proposed first character of the characters with which the first input corresponds;
outputting the proposed first characters;
for at least one of the first inputs, detecting as an editing input a replacement of the proposed first character with another first character with which the first input corresponds;
responsive to said detecting, storing as a candidate character string a string of characters comprising the another first character and a number of the proposed first characters output adjacent thereto;
receiving as a second entry a plurality of second inputs, at least some of the second inputs each corresponding with a number of the characters;
comparing at least a portion of the second entry with at least some of the segments to identify for each of at least some of the second inputs a proposed second character of the characters with which the second input corresponds;
outputting the proposed second characters;
for at least one of the second inputs, detecting as an editing input a replacement of the proposed second character with another second character with which the second input corresponds;
making a determination that a series of characters comprising the another second character and a number of proposed second characters output adjacent thereto sequentially match at least a portion of the candidate character string, the at least portion of the candidate character string comprising the another first character and at least a subset of the number of proposed first characters; and responsive to said making a determination, storing the series of characters as a segment.
13. The handheld electronic device of Claim 12 wherein the operations further comprise making as at least a portion of said determination a determination that the series of characters sequentially match the another first character and a quantity of said proposed first characters disposed adjacent the another first character.
14. The handheld electronic device of Claim 12, wherein the input apparatus comprises a plurality of input members, and wherein the operations further comprise detecting as the first entry a plurality of input member actuations, at least some of the first inputs each comprising a plurality of the input member actuations.
15. The handheld electronic device of Claim 12 wherein the operations further comprise storing as the candidate character string another first character, a quantity of the proposed first characters preceding the another first character, and a quantity of the proposed first characters following the another first character.
16. The handheld electronic device of Claim 15 wherein the operations further comprise making as at least a portion of said determination:
a determination that the another second character matches the another first character;
followed by a determination that a proposed second character that one of precedes and follows the another second character matches a proposed first character that the one of precedes and follows the another first character;
followed by a determination that a proposed second character that the other of precedes and follows the another second character matches a proposed first character that the other of precedes and follows the another first character.
a determination that the another second character matches the another first character;
followed by a determination that a proposed second character that one of precedes and follows the another second character matches a proposed first character that the one of precedes and follows the another first character;
followed by a determination that a proposed second character that the other of precedes and follows the another second character matches a proposed first character that the other of precedes and follows the another first character.
17. The handheld electronic device of Claim 15 wherein the operations further comprise making as at least a portion of said determination:
18 a determination that the another second character matches the another first character;
followed by a number of determinations that a number of proposed second characters which alternately precede and follow the another second character in a fashion progressively moving outwardly from the another second character match a number of proposed first characters which similarly alternately precede and follow the another first character in a fashion progressively moving outwardly from the another first character;
followed by a determination that a particular proposed second character at a particular position in the series with respect to the another second character differs from a proposed first character similarly positioned in the candidate character string with respect to the another first character; and storing as the series of characters the another second character and the number of proposed second characters.
18. The handheld electronic device of Claim 17 wherein the operations further comprise:
responsive to said determination that the proposed second character differs, making as one of said number of determinations a determination that a quantity of additional proposed second characters positioned in the series at a side of the another second character opposite that of the particular proposed second character match a quantity of proposed first characters positioned in the candidate character string with respect to the another first character at a position similar to that of the quantity of additional proposed second characters with respect to the another second character; and storing the quantity of additional proposed second characters as a portion of the number of proposed second characters of the series of characters.
followed by a number of determinations that a number of proposed second characters which alternately precede and follow the another second character in a fashion progressively moving outwardly from the another second character match a number of proposed first characters which similarly alternately precede and follow the another first character in a fashion progressively moving outwardly from the another first character;
followed by a determination that a particular proposed second character at a particular position in the series with respect to the another second character differs from a proposed first character similarly positioned in the candidate character string with respect to the another first character; and storing as the series of characters the another second character and the number of proposed second characters.
18. The handheld electronic device of Claim 17 wherein the operations further comprise:
responsive to said determination that the proposed second character differs, making as one of said number of determinations a determination that a quantity of additional proposed second characters positioned in the series at a side of the another second character opposite that of the particular proposed second character match a quantity of proposed first characters positioned in the candidate character string with respect to the another first character at a position similar to that of the quantity of additional proposed second characters with respect to the another second character; and storing the quantity of additional proposed second characters as a portion of the number of proposed second characters of the series of characters.
19. The handheld electronic device of Claim 17 wherein the operations further comprise, responsive to said determination that the proposed second character differs, initiating said storing the series of characters.
20. The handheld electronic device of Claim 12 wherein the operations further comprise deleting the candidate character string.
21. The handheld electronic device of Claim 12 wherein the operations further comprise determining that the series of characters comprises more than a predetermined quantity of characters and, responsive thereto:
determining that another segment matches a portion of the series of characters; and storing as the segment the portion of the series of characters not matched by the another segment.
determining that another segment matches a portion of the series of characters; and storing as the segment the portion of the series of characters not matched by the another segment.
22. The handheld electronic device of Claim 21 wherein the operations further comprise storing as a combination object at least a representation of the segment and at least a representation of the another segment.
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/CA2006/001089 WO2008000058A1 (en) | 2006-06-30 | 2006-06-30 | Learning character segments during text input |
Publications (2)
Publication Number | Publication Date |
---|---|
CA2653843A1 true CA2653843A1 (en) | 2008-01-03 |
CA2653843C CA2653843C (en) | 2012-02-07 |
Family
ID=38845060
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CA2653843A Active CA2653843C (en) | 2006-06-30 | 2006-06-30 | Learning character segments during text input |
Country Status (2)
Country | Link |
---|---|
CA (1) | CA2653843C (en) |
WO (1) | WO2008000058A1 (en) |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7124080B2 (en) * | 2001-11-13 | 2006-10-17 | Microsoft Corporation | Method and apparatus for adapting a class entity dictionary used with language models |
CA2413055C (en) * | 2002-07-03 | 2006-08-22 | 2012244 Ontario Inc. | Method and system of creating and using chinese language data and user-corrected data |
US7478033B2 (en) * | 2004-03-16 | 2009-01-13 | Google Inc. | Systems and methods for translating Chinese pinyin to Chinese characters |
CA2496872C (en) * | 2004-03-17 | 2010-06-08 | America Online, Inc. | Phonetic and stroke input methods of chinese characters and phrases |
-
2006
- 2006-06-30 CA CA2653843A patent/CA2653843C/en active Active
- 2006-06-30 WO PCT/CA2006/001089 patent/WO2008000058A1/en active Application Filing
Also Published As
Publication number | Publication date |
---|---|
WO2008000058A1 (en) | 2008-01-03 |
CA2653843C (en) | 2012-02-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CA2647938A1 (en) | Handheld electronic device and method for learning contextual data during disambiguation of text input | |
CA2509010A1 (en) | Handheld electronic device with text disambiguation | |
GB2451035A (en) | Handheld electronic device and method for performing optimized spell checking during text entry by providing a sequentially ordered series of spell-check algo | |
CA2635045A1 (en) | Handheld electronic device and method for disambiguation of text input and providing spelling substitution | |
ATE407409T1 (en) | AUTOMATIC GESTURE DETECTION | |
DE602006016846D1 (en) | SYSTEM AND METHOD FOR BROWSING AND COMPARING DATA WITH IDEOGRAMMATIC CONTENTS | |
WO2019201511A8 (en) | Method and data processing apparatus | |
CA2636207A1 (en) | Handheld electronic device providing proposed corrected input in response to erroneous text entry in environment of text requiring multiple sequential actuations of the same key, and associated method | |
CA2509014A1 (en) | Handheld electronic device with text disambiguation | |
CN103577547A (en) | Webpage type identification method and device | |
CA2647934A1 (en) | Handheld electronic device and method for employing contextual data for disambiguation of text input | |
Xiao et al. | Data mining based on segmented time warping distance in time series database. | |
CA2653843A1 (en) | Learning character segments during text input | |
CN105630769A (en) | Document subject term extraction method and device | |
WO2007033228A3 (en) | Reducing false positives for automatic computerized detection of objects | |
WO2006105641B1 (en) | Handheld electronic device with text disambiguation employing advanced editing feature | |
CA2583923A1 (en) | Handheld electronic device and method for performing spell checking during text entry and for providing a spell-check learning feature | |
CA2554397A1 (en) | Handheld electronic device with disambiguation of compound word text input employing separating input | |
CA2658586A1 (en) | Learning character segments from received text | |
CA2639224A1 (en) | Handheld electronic device and associated method providing disambiguation of an ambiguous object during editing and selectively providing prediction of future characters | |
CA2605785A1 (en) | Handheld electronic device with reduced keyboard and associated method of providing improved disambiguation with reduced degradation of device performance | |
CA2635009A1 (en) | Handheld electronic device and method for disambiguation of compound text input and that employs n-gram data to limit generation of low-probability compound language solutions | |
CA2610116A1 (en) | Method for automatically preferring a diacritical version of a linguistic element on a handheld electronic device based on linguistic source and associated apparatus | |
JP2011076264A5 (en) | ||
KR101763329B1 (en) | Sentence pattern classification method based on multi combination keyword of syllables |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
EEER | Examination request |