AU2022228148B2 - Method and system - Google Patents

Method and system Download PDF

Info

Publication number
AU2022228148B2
AU2022228148B2 AU2022228148A AU2022228148A AU2022228148B2 AU 2022228148 B2 AU2022228148 B2 AU 2022228148B2 AU 2022228148 A AU2022228148 A AU 2022228148A AU 2022228148 A AU2022228148 A AU 2022228148A AU 2022228148 B2 AU2022228148 B2 AU 2022228148B2
Authority
AU
Australia
Prior art keywords
characters
sound
word
character
words
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
AU2022228148A
Other versions
AU2022228148A1 (en
Inventor
Christopher STEPHEN
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Family has litigation
First worldwide family litigation filed litigation Critical https://patents.darts-ip.com/?family=86499269&utm_source=google_patent&utm_medium=platform_link&utm_campaign=public_patent_search&patent=AU2022228148(B2) "Global patent litigation dataset” by Darts-ip is licensed under a Creative Commons Attribution 4.0 International License.
Priority claimed from AU2021903667A external-priority patent/AU2021903667A0/en
Application filed by Individual filed Critical Individual
Priority to CA3238661A priority Critical patent/CA3238661A1/en
Priority to PCT/AU2022/051361 priority patent/WO2023087051A1/en
Publication of AU2022228148A1 publication Critical patent/AU2022228148A1/en
Application granted granted Critical
Publication of AU2022228148B2 publication Critical patent/AU2022228148B2/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G09EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
    • G09BEDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
    • G09B17/00Teaching reading
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/103Formatting, i.e. changing of presentation of documents
    • G06F40/117Tagging; Marking up; Designating a block; Setting of attributes
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/166Editing, e.g. inserting or deleting
    • G06F40/169Annotation, e.g. comment data or footnotes
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • GPHYSICS
    • G09EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
    • G09BEDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
    • G09B19/00Teaching not covered by other main groups of this subclass
    • G09B19/04Speaking
    • GPHYSICS
    • G09EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
    • G09BEDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
    • G09B5/00Electrically-operated educational appliances
    • G09B5/04Electrically-operated educational appliances with audible presentation of the material to be studied
    • GPHYSICS
    • G09EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
    • G09BEDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
    • G09B5/00Electrically-operated educational appliances
    • G09B5/06Electrically-operated educational appliances with both visual and audible presentation of the material to be studied
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/02Methods for producing synthetic speech; Speech synthesisers

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Business, Economics & Management (AREA)
  • Educational Technology (AREA)
  • Educational Administration (AREA)
  • Software Systems (AREA)
  • Mathematical Physics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Computing Systems (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Medical Informatics (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Molecular Biology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Electrical Discharge Machining, Electrochemical Machining, And Combined Machining (AREA)
  • Electrically Operated Instructional Devices (AREA)

Abstract

A method of encoding a word to make the encoded word phonetic and intuitive to the reader by encoding the word by: using respective compound characters that each include the spelling character and a sound character (which can be represented by a superscript), wherein the sound characters: are human-readable characters in the base alphabet and/or in one or more secondary alphabets, are added to spelling characters to indicate that the spelling character makes the usual sound of the sound character, are displayed so that it is easy to discriminate spelling characters from sound characters, are added such that it is easy for a skilled reader to recognize the word by sight (sightword read the word), and are added to the spelling characters such that the spelling characters and the sound characters remain human-readable such that the spelling character and the sound character of each compound character are within one visual field; and outputting the encoded word in a human-readable form/format such that the compound characters in the encoded word visually indicate which of the spelling characters have a sound other than a usual sound (for that character). 23347610.1:DCC 4/10 -4 0 t 410 .MA~ .u -c U a si _ ) -' CL~4 t'- ~ -~ CL 0 mEa '- go0~ x w m '-- o a) 0) - 0) 0 wMA' ( a -

Description

4/10
-4
t0 410 .MA~ .u
-c U
a si _ ) -' CL 0 CL~4 t'- ~ -~
x mEa '- go0~ w
mwMA' '-- oa) 0) - 0) 0 (
a -
METHOD ANDSYSTEM TECHNICAL FIELD
[0001] The present disclosure relates to an integrated publishing method and system of converting/encoding a text document to a new format or formats that can be optimized for groups or individuals, including to improve text readability/pronounceability, improved auditory discrimination, vocabulary acquisition by a reader and/or comprehension by a reader and can be extended with integrated interactive intelligent systems to optimize the alphabet characters a reader will use, to optimize vocabulary acquisition by a reader, to optimize reading skills and/or reader comprehension in the language being studied, and to efficiently improve verbal communication skills, including auditory discrimination and pronunciation in the language being studied.
BACKGROUND
[0002] Reading, speaking and hearing a language are fundamentally related activities that can be optimized. A child will learn to hear and speak its first language without instruction because a speech region of the human brain has evolved. There is no such region of the brain for reading, and several parts of the brain must work rapidly and efficiently in a systematic way for reading to occur. Reading required the development of new neural pathways and physically changes the brain.
[0003] Understanding written and/or spoken communication requires vocabulary: knowing what words mean. Teaching how words are spelled and what words look like, how they are pronounced, what they sound like, and what they mean all contribute to mastery of the language and mastery of a set of skills, such as being a medical practitioner.
[0004] Reading is decoding the sounds of written words and the reader hearing the sounds of the words in their heads using the speech regions of the brain. On hearing the sound, the reader recalls the meaning of the words and then understands the meaning of the text. Therefore, accurately decoding the sound of a written word is a condition precedent to understanding the meaning of the text representing that word.
[0005] Cognitive Load Theory is an experimentally developed theory focusing on how to efficiently transfer new information from working memory to long term memory, a process often called learning or understanding. Working memory can hold 3-5 pieces of new information for 20-30 seconds before the new information needs to be refreshed. Having all the new information displayed in the one place allows the information to be refreshed without the learner having to stop learning to refresh the information. Learning can be assisted by breaking up learning tasks into smaller sequential learning tasks to minimize new information in working memory. Overloading working memory with too much information stops learning.
[0006] Speed is important: humans need to hear the words in their heads at about the same rate as spoken language to maximize their comprehension, which is called fluent reading. Fluent reading is required for good comprehension. Read too slowly, and the reader will find it hard to comprehend the text as the reader will forget the words at the beginning of the text. To develop fluent reading and good comprehension, students therefore need to develop sightword recognition - seeing a word and instantly knowing its sound and meaning. An example of sightword recognition is seeing a STOP sign. As soon as someone sees the STOP sign, they hear the word STOP in their mind. Cognitive psychology and cognitive load theory tell us that the fastest way to develop sightword recognition is by sounding out words one phoneme at a time ("phonemes" are sounds, and "characters" are symbols that represent the sounds) - our memories remember things that make sense, but are very poor at remembering random information like the sound of a word and its shape when the word does not sound the way it is spelled. If the word does not sound the way it is spelled, it must be learned by rote. Rote learning requires a lot of repetition. A reader phonetically sounding out a new word 3-5 times may develop sightword recognition, because there is a simple relationship between the sounds and the letters, but 20-50 repetitions may be needed for rote learning of a word which does not sound the way it is spelled because there may be little relationship between the sound of the word and its spelling. Humans have evolved to remember things that make sense, but we have not evolved to remember random information.
[0007] An example of a non phonetic language is English, as more than half the words in English are not pronounced as they are spelled. Examples include: "baked" and "naked" have similar spelling and are pronounced differently, the character "U" has 7 sounds (up, use, put, fruit, busy, quick and bury), and in the word "signature", the character "g" is pronounced, but not in the word "sign".
[0008] English has 42-45 phonemes (depending on definition) and 26 characters. This means that many English characters must make more than 1 sound, which is confusing for readers.
[0009] Erratic English spelling is a huge literacy problem. A study conducted in Europe found Finns learned to read phonetic Finnish in 3-6 months, whereas English language students took 2.5-3 years to reach the same level. A second study of 1200 Italian students found a number of dyslexic students could read Italian (a phonetic language) well enough to progress to university without needing special assistance. Clearly it is much easier to learn to decode a language written phonetically as there are no rules to learn: readers learn a new word by sounding out that word character by character with no rules or exceptions. Readers can therefore be confident in their decoding skills and in their ability to correctly pronounce that word.
[0010] The erratic spelling of English also causes problems with accurate auditory discrimination of English phonemes, syllables and words, and pronunciation problems, resulting in poor spoken communications. Consider the following: a person is in a meeting and hears a foreign language speaker say their name. The person hearing the name does not recognize what was said. But when handed a card with the foreigner's name written on it, the person hearing the name can discriminate the name because that person knows what to listen for.
[0011] Similarly, having words marked up phonetically allows people to accurately know how to pronounce a word, and experiments published in peer reviewed academic journals show better pronunciation from written words than just by hearing the words.
[0012] People learn by absorbing information by reading and hearing information. It is critically important that people are able to understand what they read and hear. They need to read fluently, at a similar rate to spoken communication. They need good auditory discrimination so that they can accurately recognize individual words. They need to be able to pronounce words accurately enough for people to understand them. They need to know what the words mean to be able to understand what they read or hear. The erratic spelling of English makes it more difficult for people to master reading and spoken communications.
[0013] The current methods of teaching how to decode the sound of English words that is used in many countries is a system called synthetic phonics. This system has multiple rules and exceptions. When a student encounters a new word, the student does not know if the word is phonetic and can be sounded out, if a pronunciation rule applies, e.g., "cake" is pronounced /cayk/, or if the word is an exception. (The characters /cat/ represent the sound of the word "cat" when pronounced by an English speaker.) There are many exceptions to such pronunciation rules. This can be very confusing to students.
[0014] Another method of teaching the decoding of English words that eliminates the rules of synthetic phonics is to mark up words so they can be sounded out phonetically. One system uses glyphs to inform a reader what the sound of a character makes in a particular word (watch), what characters are not pronounced in that word (sign), and where the syllable breaks are in that word (baked and naked). This system works considerably better than synthetic phonics. However, there are problems that became apparent with this approach: the glyphs did not intuitively mean something to the reader, and had to be learned, which took time and created significant barriers to wide spread adoption, not all
English words could be marked up using this system, and the mark up system was still ambiguous, because syllable stress was not explicitly displayed.
[0015] Existing technologies for generating educational texts and running training exercises are very limited, and many language teaching texts and exercises are manually taught, or poorly automated, e.g., often being difficult for a particular learner.
[0016] Long and complex sentences can be hard for everyone to understand, and can be especially hard for someone learning a new language that has a different word order in a sentence from the native language of the learner. To understand a complex paragraph, readers may need to first read a sentence or a collection of sentences several times to be able to break the sentences into meaningful groups of words. Then the reader needs to be able to relate the different groups of words together to understand what is written in the text.
[0017] The above examples show the complex and interrelated nature of any language learning system. It also demonstrated the need, in any communication system that uses English, for a system that makes all English words phonetic without the student having to learn anything, allowing anybody who knows the sounds of English characters to decode the sound of any word.
[0018] It is desired to address or ameliorate one or more disadvantages or limitations associated with the prior art, or to at least provide a useful alternative.
SUMMARY
[0019] Disclosed herein is an integrated publishing method for converting/encoding a text document to a new format or formats, the integrated publishing method including:
a computing system receiving data representing a source text that includes a plurality of human-readable characters in a base alphabet forming a plurality of words;
the computing system encoding the source text by:
for each word in the source text that has one or more spelling characters that are identified as having a sound other than a usual sound for that character, using a replacement word with respective compound characters that each include the spelling character and a sound character, wherein the sound characters:
are human-readable characters in the base alphabet and/or in one or more secondary alphabets, are added to spelling characters to indicate that the spelling character makes the usual sound of the sound character, are displayed so that it is easy to discriminate spelling characters from sound characters, and are added to the spelling characters such that the spelling characters and the sound characters remain human-readable such that the spelling character and the sound character of each compound character are within one visual field; and the computing system outputting the encoded word in a human-readable form/format such that the encoded text includes the plurality of words from the source text with the compound characters visually indicate which of the spelling characters have a sound other than a usual sound for that character, and such that the spelling characters in the words in the encoded text are the same and in the same order as the characters in the respective words in the source text.
[0020] Disclosed herein is an integrated publishing system for converting/encoding a text document to a new format or formats that can be optimized for groups or individuals, the integrated publishing system configured to perform the method above.
BRIEF DESCRIPTION OF THE DRAWINGS
[0021] Some embodiments of the present invention are hereinafter described, by way of non-limiting example only, with reference to the accompanying drawing, in which:
a. FIG. 1 is a chart of a method for learning described herein;
b. FIG. 2 is a block diagram of a computing system configured to perform the method;
c. FIG. 3 are diagrams of compound characters used in the method;
d. FIG. 4 is a chart of English phonemes and corresponding marked-up phonetic words (including the compound characters) with IPA and Japanese superscripts;
e. FIG. 5 is a screenshot of a program that can take an English word and the translation of that word into the IPS and automatically produce a marked up version of the English word in "Fonetic English" using the compound characters;
f. FIG. 6 is a screenshot of a user interface (UI) for selecting compound characters to replace original characters in an English word to form a marked-up phonetic word for a database of English words linked to corresponding marked-up phonetic words; g. FIG. 7A is an example of 3 paragraphs of a book; h. FIG. 7B is an example of encoded text using the compound characters, with the text of 7A encoded into Fonetic English; i. FIG. 8A is a list of English examples of displayed silent characters; j. FIG. 8B is a list of English examples of displayed stressed/unstressed syllable breaks; k. FIG. 9A is an example text in English in a non-standard font; and
I. FIG. 9B is encoded text in the non-standard font, encoded in Fonetic English, corresponding to the text in FIG. 9A.
DETAILED DESCRIPTION
[0022] Described herein is an education system that can improve the ability of students to develop mastery in reading, reading comprehension, spoken communication, comprehension of spoken communication and acquisition of a vocabulary, which in turn will lead to improved learning outcomes in many other subject areas.
[0023] Central to this education system is a method of converting/encoding a text document, the method including:
a. receiving data representing a source text that includes a plurality of human readable characters in a base alphabet forming a plurality of words;
b. encoding the source text by:
i. for each word in the source text (i.e., in a word-by-word search of a database of words and corresponding marked-up phonetic words words) that has or more characters ("spelling characters") that are identified as having a sound other than a usual sound (for that character), using a replacement word with respective compound characters that each include the spelling character and a sound character (which can be represented by a superscript), wherein the sound characters:
1. are human-readable characters in the base alphabet and/or in one or more secondary alphabets (the sounds of whose letters are already known to a reader),
2. are added to spelling characters to indicate that the spelling character makes the usual sound of the sound character,
3. are displayed so that it is easy to discriminate spelling characters from sound characters, and
4. are added to the spelling characters such that the spelling characters and the sound remain human-readable such that the spelling character and the sound character of each compound character are within one visual field; and
c. outputting the encoded text in a human-readable form/format such that the encoded text includes the plurality of words from the source text with the compound characters visually indicating which of the spelling characters have a sound other than a usual sound (for that character) (and such that shapes made by the spelling characters in the words in the encoded text are substantially the same as shapes of the respective words in the source text).
[0024] Described herein is a method of converting/encoding a text document, the method including:
a. receiving data representing a source text that includes a plurality of human readable characters in a base alphabet forming a plurality of words;
b. encoding the source text by:
i. for each word in the source text (i.e., in a word-by-word search of a database of word and corresponding marked-up phonetic words ) that has one or more characters ("spelling characters") that are identified as having a sound other than a usual sound (for that character), using a replacement word with respective compound characters that each include the spelling character and a sound character (which can be represented by a superscript), wherein the sound characters::
1. are human-readable characters in the base alphabet and/or in one or more secondary alphabets (the sounds of whose letters are already known to a reader),
2. are added to spelling characters to indicate that the spelling character makes the usual sound of the sound character,
3. are displayed to that it is easy to discriminate spelling characters from sound characters, and
4. are added to the spelling characters such that the spelling characters and the sound remain human-readable such that the spelling character and the sound character of each compound character are within one visual field; and
c. outputting the encoded text in a human-readable form/format such that the encoded text includes the plurality of words from the source text with the compound characters visually indicating which of the spelling characters have a sound other than a usual sound (for that character), and such that shapes made by the spelling characters in the words in the encoded text are substantially the same as shapes of the respective words in the source text.
[0025] The preselected phoneme set may include:
a. a plurality of sound characters in a secondary alphabet associated with phonemes in the primary alphabet that can be defined/pronounced using characters in the secondary alphabet;
b. a plurality of sound characters in the primary alphabet associated with phonemes in the primary alphabet that do not exist in the secondary alphabet; and/or
c. a plurality of sound characters in the primary alphabet associated with phonemes in the IPA that do not exist in the secondary alphabet.
[0026] The method may include indicating a syllable break.
[0027] The method may include indicating a syllable break by adding a symbol preceding the syllable.
[0028] The method may include indicating silent characters, optionally by visually differentiating the silent characters from the spelling characters without changing shapes of the silent characters.
[0029] The method may include indicating silent characters and syllable breaks.
[0030] The adding of the one or more size sound characters may include adding a gap/space between the sound characters and the respective spelling characters such that the words in the encoded text are clearly visible and not touching the sound characters.
[0031] The method may include indicating a syllable break, wherein the syllable break indicates if a syllable following the syllable break is stressed or unstressed. One or more of any lowercase sound characters may be shaped differently from the corresponding uppercase characters, including having a different font.
[0032] The outputted text may be in human-readable form/format, including in a physical printed book and/or in an electronic book, optionally including printing the physical book and/or storing the electronic book in a non-transient computer-readable medium.
[0033] The sound characters may have a font size ("sound font size") based on a font size ("source font size") of the source text in a ratio of 6:9 (sound character font size:spelling character font size).
[0034] The sound characters may have a font size of at least 6 point.
[0035] The name of the compound characters may be /spelling character/ says /sound character/ and/or /spelling character/ rhymes with /sound character/.
[0036] The method may include:
a. receiving user inputs from a user of a computing system;
b. the computing system classifying the user into one of a plurality of categories based on the user inputs; and
c. the computing system selecting the phoneme set from the plurality of sets based on the user category using a predefined mapping between user categories and phoneme sets.
[0037] The method may include:
a. the computing system generating measured values of the user's knowledge; and
b. the computing system classifying the user into one of the plurality of categories based on the measured values.
[0038] The method may include:
a. the computing system presenting a test text in the primary alphabet to the user by displaying the test text visibly or playing the test text audibly, wherein the test text includes: a plurality of words that can be selected by the user using the user interface ("user-selectable words") including at least one test word and one or more distractor words (which are not the test word); and
b. the computing system measuring the values from user selections of the user selectable words, including measuring how many of the least one test words are user selected, and/or how much time is taken to select the test words.
[0039] The method may include generating the measured values by analysing user pronunciation using a voice analysis tool.
[0040] The method may include generating the measured values by:
[0041] Playing the sounds of different syllables containing phonemes to be learned in the order that enabled other students in the same student category to score more correct answers in learning exercises or tests;
[0042] Playing the sounds in an accent that enables the student to score more correct answers in learning exercises or tests; and/or
[0043] When the student can discriminate the syllables and phonemes pronounced in an accent the student finds easier than a native speaker of the base language, transition the student to hearing phonemes and syllables pronounced by a native speaker of the base language.
[0044] Generating the measured values may include:
[0045] Playing the sounds of different syllables containing phonemes to be learned in the order that enabled other students in the same student category to score more correct answers in learning exercises or tests;
[0046] Playing the sounds in an accent that the student to score more correct answers in learning exercises or tests; and/or
[0047] When the student can discriminate the syllables and phonemes pronounced in an accent the student finds easier than a native speaker of the base language, transition the student to hearing phonemes and syllables pronounced by a native speaker of the base language, by first playing the sound of the syllables and phonemes that the student found easier when listening to a person, e.g., with the same native language as the student speaking these syllables and phonemes.
[0048] The method may include generating the measured values by:
a. the computing system displaying at least 3 marked up words, 2 of which are wrong; and
b. the computing system measuring how many wrong words the user selects.
[0049] The method may include generating measured values by:
a. the computing system playing a multi-syllable word with at least 2 syllables defining at least 2 respective correct syllables;
b. the computing system displaying at least 2 respective blank boxes and a plurality of user-selectable syllables greater than at least 2;
c. the computing system receiving input from the user selecting at least 2 of the user-selectable syllables; and
d. the computing system measuring how many correct syllables are in the user selected syllables.
[0050] The method may include generating measured values by:
a. the computing system playing the sound of a word defining a first plurality of correct characters;
b. the computing system displaying blank boxes equal to the first plurality;
c. the computing system displaying a second plurality of user-selectable characters, wherein the second plurality is greater than the first plurality; d. the computing system receiving input from the user selecting the user-selectable characters; and e. the computing system measuring how many correct characters are in the user selected characters.
[0051] The method may include generating measured values by:
a. the computing system playing the sounds of a respective plurality of words with a time of silence between adjacent ones of the played sounds of the words;
b. the computing system reducing the time of silence between the played sounds based on the input from the user;
c. the computing system playing the sounds of the words slowly in a continuous sound (as used in human to human speech) based on input from the user; and
d. the computing system playing the sounds of the words at the speed of normal speech in a continuous sound (as used in human to human speech) based on input from the user.
[0052] The computing system can take a word and pass the word to an automated text to speech software module that is configured to produce an audio file containing the sound of the word which can be stored in a database linking the word text to the word sound. A sentence can be played one word at a time by selecting a word, looking up the database for the sound of that word, and playing that word. The sentence as played will not contain intonation and cadence.
[0053] The computing system can take a sentence and pass the sentence to an automated text to speech software module that is configured to produce an audio file of the sentence being read aloud with intonation and cadence. The audio file of the sentence can then be passed to an automated speech segmentation system, such as Aligner developed by Dr Stefan Rapp, which can break the continuous speech containing the sounds of the words in the sentence into discrete words and phonemes. Because the intonation and cadence applying to a word can be different in different sentences, a database linking the word to its sound also contains the document identifier, a sentence identifier and a word identifier, which allows the words in a sentence to be played sequentially with cadence and intonation, with a variable gap between the words.
[0054] By breaking the audio signal into the sounds of discrete words, and linking the text of the word to its sound in a database, each word in a sentence can be highlighted as its sound is played. This may assist some readers, e.g., those readers who were never read to as children, and will help to improve auditory discrimination be seeing the word and hearing it pronounced. Multiple choice questions can be added to measure the comprehension of the reader.
[0055] Playing the sound of a word whilst highlighting the text of the word may be a useful feature in a reading practice tool.
[0056] Using the above methodology, a reading practice tool can also play a sentence in phrases whilst highlighting the sentences.
[0057] The split concentration principle of Cognitive Load Theory states that information from multiple sources at once can reduce learning because attention is split. Once readers develop some reading competence, attention will not be split if the student only reads, or if the student just listens.
[0058] By clicking on a phoneme, syllable, word, phrase, phrasal verb, idiom etc., the reading practice tool could also play individual words, syllable, phonemes, phrases, phrasal verbs (to come up with means to think of) and idioms with or without intonation and cadence.
[0059] Similarly, the reader could click a word, phrase, phrasal verb, idiom etc and be presented with the definition or translation of words, phrases, phrasal verbs, idioms etc.
[0060] The user inputs may include user-selected values, and the method includes receiving the user-selected values by:
a. the computing system presenting text input prompts and/or selectable lists to the user; and
b. the computing system receiving the user-selected values as inputs in the text input prompts and/or selectable lists,
wherein the method includes the computing system classifying the user into one of the plurality of categories based on the user-selected values.
[0061] The text input prompts and/or the selectable lists may define a plurality of user-selectable values defining: age, sex, native language, education level, and/or English or other language skill level.
[0062] The method may include the computing system authenticating and identifying the user by connecting to a learning management system in which the user has a user account.
[0063] The method may include the computing system authenticating and identifying the user by connecting to a learning management system in which the user has a user account through an application programming interface (API) of the LMS.
[0064] The method may include sending the user category and/or the measured values to the LMS associated with the user account for storage with a user record in the LMS.
[0065] The method may include the computing system classifying the user into one of the plurality of categories using a trained classifier, trained on data from users in each of the plurality of the user category groups.
[0066] The method may include:
a. the computing system dividing a word from the source text into a plurality of individual phonemes/partial syllables/syllables, including the first character/letter and progressively more characters/letters/digraphs (two letters representing a different sound, e.g. the digraph "ph" represents the sound /f/), incrementing by one phoneme (character or digraph other than a silent character) for each partial syllable/syllable;
b. the computing system sounding out (playing) the partial syllables/syllables of the phoneme progressively in order of length from the user interface of the computing system for the user to hear;
c. the computing system instructing the user to repeat the partial syllables/syllables via the user interface;
d. the computing system recording the user speaking the syllables/partial syllables; and
e. the computing system sounding out the set of the partial syllables/syllables and then the set of the user's recorded partial syllables/syllables, optionally including repeating the sounding out step two or more times.
[0067] The method may include the computing system displaying characters, syllables and words on the user interface and simultaneously sounding out (playing) the sound files containing the phonemes, syllables and words.
[0068] The method may include:
a. the computing system playing a test sound for a/the user to hear;
b. the computing system displaying a plurality of characters including a test character or characters representing the test sounds in the language; and
c. the computing system measuring whether the user clicks on the test character or characters.
[0069] The method may include: the computing system selecting phonemes that are in the primary alphabet but not the secondary alphabet based on data representing a predefined phoneme chart (e.g., the Japanese phoneme chart in FIG. 4) including phonemes of the primary alphabet and phonemes of the secondary alphabet connected/linked by a corresponding IPA symbol
[0070] The method may include:
a. the computing system recording the user pronouncing a test character (and/or phoneme/syllable/word); and
b. the computing system repeatedly playing the user's recording for the user to hear;
c. the computing system repeatedly playing a prerecorded pronunciation of the test character (and/or phoneme/syllable/word) after the user's recording such that the user can hear a difference between the user's recording and the prerecorded pronunciation.
[0071] The method may include identifying syllables in the source text, and adding spaces/syllable breaks between adjacent syllables in the encoded text.
[0072] The method may include encoding the source text by:
a. identifying at least one word ("identified word") in the source text that matches one of a plurality of preselected words in a preselected set of words formed of the base alphabet, wherein the identified words includes at least one stressed syllable and/or at least one unstressed syllable defined in the preselected set, wherein each syllable includes one or more of the spelling characters, and
b. replacing/adjusting the identified word by adding a dot preceding each syllable, wherein the spelling characters of the syllable remain unchanged, and wherein the dot is a closed dot for the stressed syllable, and/or an open dot for the unstressed syllable.
[0073] The method may include:
a. receiving user inputs from a user of a computing system;
b. the computing system classifying the user into one of a plurality of categories based on the user inputs; and
c. the computing system selecting phoneme set from the plurality of sets based on the user category using a predefined mapping between user categories and phoneme sets.
[0074] Described herein is a method (of automatically generating a database of words with compound characters, each including a sound character) including: a. receiving each word written in English ("English word") and written in the international phonetic alphabet ("IPA word"); b. comparing characters ("English characters") in the English word to characters in the IPA word to identify whether one or more characters have a sound other than a usual sound (for that one or more characters), and if so c. using a compound character instead of the English character to automatically generate a word ("marked-up phonetic word") that includes the spelling character and a sound character, wherein the sound characters: d. are human-readable characters in the base alphabet and/or in one or more secondary alphabets, e. have a selected secondary font/character size smaller than a primary font/character size selected for the spelling character in each compound character, and f. are added to the compound characters such that the spelling characters remain human-readable such that the spelling character and the sound character of each compound character are within one visual field; g. outputting a database of English words and marked-up phonetic words such that shapes of the marked-up phonetic words are substantially the same as shapes of the respective English words.
[0075] The same process can be used to mark up words in languages other than English for which there is an IPA marked up word, wherein the method received each word written in the other language, compares characters in the other language, and outputs a database of words in the other language.
[0076] The method may include providing a user interface for a user to manually select marked-up phonetic words for the English words.
[0077] Described herein is a method of encoding a word to make the encoded word phonetic and intuitive to the reader by encoding the word by:
a. using respective compound characters that each include the spelling character and a sound character (which can be represented by a superscript), wherein the sound characters:
i. are human-readable characters in the base alphabet and/or in one or more secondary alphabets,
ii. are added to spelling characters to indicate that the spelling character makes the usual sound of the sound character, iii. are displayed so that it is easy to discriminate spelling characters from sound characters, iv. are added such that it is easy for a skilled reader to recognize the word by sight (sightword read the word), and v. are added to the spelling characters such that the spelling characters and the sound characters remain human-readable such that the spelling character and the sound character of each compound character are within one visual field; and b. outputting the encoded word in a human-readable form/format such that the compound characters in the encoded word visually indicate which of the spelling characters have a sound other than a usual sound (for that character).
[0078] The method may include adding characters and/or symbols to represent sounds that are not unambiguously specified by characters in the base alphabet.
[0079] The method may include adding syllable breaks, wherein the syllable breaks explicitly and unambiguously inform the reader whether the syllable is stressed or unstressed and/or adding a symbol to explicitly and unambiguously inform the reader whether a phoneme or digraph is voiced.
[0080] The syllable breaks may be added in a way to minimize the number of syllables.
[0081] The shape of the word as spelled by the spelling characters may be substantially preserved.
[0082] The sound characters may be easily visible so that a reader can quickly and efficiently sound out a new word, and at the same time, the sound characters may be displayed so that they do not interfere with the efficiency of reading by a reader skilling in reading words as spelled using the spelling characters.
[0083] A word may be fully or partially marked up using the mark up in the IPA of that word.
[0084] Described herein is a method including:
a. receiving data representing a source text that includes a plurality of human readable characters in a base alphabet forming a plurality of words;
b. encoding the source text by:
i. for each word in the source text (i.e., in a word-by-word search of a database of words and corresponding marked-up phonetic words words) that has or more pieces of explicit information about the decoding the sound of that word; and c. outputting the encoded text in a human-readable form/format such that the encoded text includes the plurality of words from the source text with the explicit information about decoding the sound of that word.
[0085] Described herein is non-volatile computer-readable storage including machine-readable instructions configured to cause a computing system to perform of any one of the above methods when the machine-readable instructions are executed/performed by one or more microprocessors of the computing system.
Overview
[0086] To be commercially successful, a mark up system may need to:
a. be intuitive to readers so learning is absolutely minimized and thus efficient;
b. be comprehensive so that all English words can be marked up; and/or
c. explicitly and unambiguously display everything a reader needs to know to accurately decode the sound of a written word, requiring that:
i. all English phonemes are explicitly and uniquely represented in the mark up,
ii. all stressed syllables are explicitly marked, and/or
iii. voiced letters and digraphs are marked (for example, the digraph "th" can be voiced, meaning that the larynx is engaged as in the word "the" but the "th" digraph is not engaged in the word "with"; a speaker icon can be used to indicate voiced letters or digraphs).
[0087] To overcome the above mark up system problems, and to make a phonetic word mark up system intuitive, instead of assigning new symbols which need to be learned to inform a reader what sound a character makes in a particular word, instead symbols are used in this disclosure that represent a specific sound that is known to the reader. Symbols with a known sound are characters in an alphabet that is known to the reader. The characters in the word as it is spelled are called the spelling characters, and the added symbols informing the reader of what sound that character actually makes in the word is called the sound character. In the example of English, sound characters could include:
a. English characters (if the reader is a native English speaker or knows the sound of English characters), b. IPA characters if the reader knows the IPA, or c. characters from the native language of the reader if each character makes the same sound as an English phoneme.
[0088] In English, there is an additional complication because the word the digraph "oo" can make two sounds as in the word "too" and the word "foot". Two Greek letter lowercase sigma characters "ac" are used to represent the sound of "oo" as in "foot".
[0089] One application of the new phonetic format is for readers to quickly and efficiently learn the sounds of new words whilst they are reading, e.g.. an academic text or a technical document. It is important that readers can:
a. quickly and conveniently read words they know by sight so the shape of the original words are preserved, and the sound characters are formatted so that they do not distract a reader when reading; and
b. be able to efficiently decode the sound of new words, so when the reader wants to decode the sound of a new word, the sound characters are clearly visible and explicitly inform the reader the sound that each character makes in that word.
[0090] Animals need to instantly see and recognize dangerous animals so they can escape or defend themselves. Humans are very good at seeing snakes even when the snakes are hiding. When a human sees a snake, the reaction is instant. Scientists call this the Snake Detection Theory. Reading likely uses the skill to detect snakes. Readers recognize the shape of the word and have the word shape, sound and meaning almost instantaneously recalled from long term memory. And it doesn't matter how the word is written - it can be printed normally, can be in a curly type, can be printed badly or the letters can be handwritten, and we can still recognize it by its shape. Being able to recognize disguised words is how readers can prove to some websites that they are humans, not computers. Provided the word shape is preserved, readers will be able to recognize the shape of a word even if mark up is added to the word, so readers can read marked up words by sight.
[0091] Breaking words into syllables means that the sound and meaning of syllables can be taught independently. Some 13,802 words can be made from the most frequently used 500 syllables. In medicine, the term "ectomy" means removal of something. For example, a tonsillectomy is the removal of the tonsils, and an appendectomy is removal of the appendix. In a similar way to the way the sounds of words can be simply decoded from the sounds of syllables, the meaning of a word can often be decoded from the meaning of the components of a word. This is called morphology. The central part of a word is called the root. It is important to clearly visually discriminate the prefixes and suffixes of a word, and be able to clearly identify the root of a word. Prefixes and suffixes are added to the root to give additional meaning. In addition, the meaning of the prefixes, suffixes and root word must be known for the student to understand the compound word made up of prefixes and suffixes appended to the root word.
[0092] Recall vs recognition: recognition of a word is easier to achieve than recall of the word. Recognition is required for reading - you recognize the word when you see it; recall is required for speaking and/or writing when there is no memory cue. Reading practice will improve recall.
[0093] Some medical practitioners and medical academics consider the acquisition of a medical vocabulary one of the most time consuming tasks in becoming a doctor. Many medical terms are non phonetic, meaning they are not spelled as they sound, which significantly complicates vocabulary acquisition. Many medical schools leave medical vocabulary learning to the students without instruction on how best to learn the required vocabulary.
[0094] Displaying the words in a sentence as meaningful groups may reduce the time, effort and eye movement to understand a complex document, and improve comprehension outcomes.
Encoding
[0095] In some languages, every letter in the alphabet of the language, makes only one sound. In some languages like English, where there are 42-45 phonemes (depending on definition), and only 26 letters, some letters make more than one sound. The term "usual sound for a letter" means the most common sound that the letter makes. The usual sounds for the 26 letters in English are shown in Table A in Appendix B. Table B in Appendix B lists the sounds of English Vowels saying their name. Table C in Appendix B lists the sounds of English digraphs. In English, the letter A can make the sound in "at" and the sound in "ape". When A makes the sound in "ape", it is described as "a saying its name". The characters on the left column of the three tables (Table A, Table B and Table C in Appendix B) can be used to phonetically spell out the sound of every word in English to a high degree ofaccuracy.
[0096] Some linguists have introduced characters like the schwa, which sounds like the sound the character "a" in the word "about". It has a sound similar to the sound the character "u" makes in the word "up". To simplify the system, fine distinctions like the schwa above have not been adopted.
[0097] To make a phonetic word mark up system intuitive, instead of assigning new symbols which need to be learned to inform a reader what the sound a character makes in a particular word, instead symbols are used in this disclosure that represent a specific sound that is known to the reader. Symbols with a known sound are characters in an alphabet that is known to the reader. In the example of English, characters could include: a. English characters (if the reader is a native English speaker or knows the sound of English characters), b. IPA characters if the reader knows the IPA, or c. characters from the native language of the reader if each character makes the same sound as an English phoneme.
[0098] In English, many words are not pronounced as they are spelled, with some characters making the sound of other characters-these may be described as having sounds that are not unambiguously specified by characters in the base alphabet. Take the English word "any" for example. To be able to pronounce the word "any", a reader needs to know that the letter "a" is pronounced as the usual sound of the letter "e", the letter "n" makes its usual sound, and the letter "y" makes the sound when the letter "e" says its name. Because characters can make different sounds, a reader needs to know how each character in an English word is pronounced. The English characters in the left column of the three tables make one and only one sound, and are used to uniquely indicate to the reader that the associated character makes in an English word.
[0099] Therefore the information to recognize the English word "any" are 3 character pairs (a:e)(n:n)(y:E) which indicates that the character "a" in "any" makes the usual sound of the letter "e", "n" makes its usual sound, and "y" makes the sound of the character E saying its name. This character and pronunciation information forms a novel composite phonetic alphabet made up of character pairs representing the character in the English word as the first character (referred to herein as "the spelled character" or "spelling character", or "base character") and the second character representing the sound that the first character actually makes (referred to herein as "the sound character"). The term "Fonetic Alphabet" refers herein to any composite alphabet made up of, i.e., including/comprising character pairs. The term "Fonetic English" refers herein to the composite alphabet where the spelled characters and the sound characters are English characters. Examples of Fonetic English characters are set out in the table in Appendix C, which includes example words that have been marked up using compound characters (which are described hereinafter), not including example words where a compound character has not yet been used to mark up a word.
[0100] With this information suitably displayed, including syllable breaks and silent characters, people with a knowledge of the usual sounds of English letters and digraphs will be able intuitively to accurately decode the sound of any character within a written word in English with little or no learning.
[0101] The composite alphabet can be further extended by adding as sound characters from other alphabets. For example, the Japanese and English alphabets share a number of common phonemes. To assist a Japanese learner to learn the English alphabet, an English sound character could be replaced by the Japanese character making the same sound as the English phoneme. There is a character (or combination of characters) in International Phonetic Alphabet (IPA) that makes the sound of every English phoneme, so an IPA character or IPA characters could be used as the sound character in the Fonetic English alphabet, which will be of assistance to a person who knows the IPA and wishes to learn the sounds of English characters.
[0102] Disclosed herein are methods to capture data about how to pronounce non phonetic English words in a Fonetic Alphabet format and in particular, English words using English spelling characters and English sound characters (called the Fonetic English format herein), and various display rules that can be used to display the words in the Fonetic English format in an optimal way.
[0103] The principles of displaying a word in a Fonetic Alphabet and in particular the Fonetic English format are to keep the spelling and shape of the original word intact so that it can be recognized by sight, add information to tell a reader the sound a character makes in a word, and do not add information that is redundant. For example, if a character in an English word makes its usual sound, then it can be displayed as the spelled character with no sound character as the sound character is not needed. One way of displaying the spelled character with the sound character is to have the spelled character as a smaller superscript character. However other ways of displaying the sound character are possible, including superscripts, subscripts, placing the sound character wholly or partially within the spelled character, and a combination of the above.
[0104] The English alphabet is not a complete representation of all English phonemes or the sounds made by particular phonemes.
[0105] There is currently no way to distinguish the usual sound of a vowel (/a/ and in "at"), and the vowel saying its own name (/a/ as in "ape"). In Fonetic English, a capital as a sound character vowel says its name, and a lower case vowel as a sound character makes its usual sound.
[0106] There is no character in English that uniquely makes the sound of oo in "foot", or the sound u in "put", which is the same sound. Fonetic English introduces a new digraph co to represent the sound of oo in "foot", or the sound u in "put". Experiments were run using oo in different fonts to distinguish oo as in "too" from oo as in "foot". The most recognizable was to use the Greek letter lower case sigma.
[0107] In addition, there is no character in English to display the difference between the unvoiced th in the word "with", or in the voiced word "them". A small speaker symbol is used to indicate voiced "th". This does not change th sound, but the way the sound is spoken.
[0108] Silent characters are not displayed differently from other characters.
[0109] Syllable breaks are not marked in English, leading to ambiguous pronunciations. Syllables are marked in Fonetic English with a solid syllable mark indicating a stressed syllable, and a hollow syllable mark indicating an unstressed syllable.
[0110] The pronunciation of words marked up using other technologies, e.g., with glyphs as described in WO 2012/071630 Al (ACCESSIBLE PUBLISHING SYSTEMS PTY LTD) 07 June 2012 (referred to herein as "Readable English"), is ambiguous because stressed syllables are not marked. The example used in Readable English, where the stress in a word can be inferred by use of the schwa (which cannot be stressed), may work with 2 syllable words where one syllable has a schwa, but not in other cases.
[0111] With the above features, Fonetic English alphabet can:
a. represent every English phoneme unambiguously;
b. display explicitly and unambiguously display everything a reader needs to know about the sound a word makes using letters whose sound the reader already knows (unlike the glyphs used in WO 2012/071630 Al which need to be learned by the reader before the reader can decode the sounds of words marked up with the glyphs); and
c. by displaying the spelling and sound characters unchanged, and by keeping the shape of the word the same as in standard English, readers (e.g., of an academic document) may read the document marked up into Fonetic English as if it were in Standard English, but if they come across a new word, they can sound out that word and quickly develop sightword recognition. It is important for fast reading that the English word shape is preserved. It is important for learning new words that all the information about the sound of a word is explicitly and unambiguously displayed for simple and accurate decoding of the word sound by a reader.
[0112] Disclosed herein is a specific instance of a method of converting/encoding a text document, the method including:
a. receiving data representing a source text that includes a plurality of human readable characters in a base alphabet forming a plurality of words;
b. encoding the source text by: i. for each word in the source text (i.e., in a word-by-word search of a database of words and for each word, the word marked up as a phonetic word) that has one or more characters ("spelled characters") that are identified as having a sound other than a usual sound for that character (that is, they are not unambiguously specified by characters in the base alphabet), using a replacement word with respective compound characters (a compound or combined character is a character containing both a spelled and sound character different from the spelled character, and combined together as a new character) that each include the spelled character and a sound character (which can be represented by a superscript), wherein the sound characters:
1. are human-readable characters in the base alphabet and/or in one or more secondary alphabets (the sounds of whose letters are already known to a reader), each representing a sound in the base alphabet (and each sound character represents the same sound in the base alphabet regardless of which spelled character it is above-for example, the superscript "i" can be used with both of the following spelled characters in English because they both have the same sound/phoneme: "e" in the word "English" makes the sound /i/, and "o" also makes the sound /i/ in the word "women"),
2. have a selected secondary font/character size smaller than a primary font/character size selected for the spelled character in each compound character, and
3. are added to the compound characters such that the spelled characters remain human-readable such that the spelled character and the sound character of each compound character are within one visual field (this means that a reader does not have to move their eyes to see both characters); and
c. outputting the encoded text in a human-readable form/format (including electronic display and/or a printed page) such that the encoded text includes the plurality of words from the source text with the compound characters visually indicating which of the spelled characters have a sound other than a usual sound (for that character), and such that shapes of the words in the encoded text are substantially the same as shapes of the respective words in the source text.
[0113] When English is the language of the source text (and thus primary language, which may be referred to as Fonetic English), the human-readable characters in the base alphabet include: a, b, c, d, e, f, g, h, i,j, k, I, m, n, o, p, q, r, s, t, u, v, w, x, y, and z.
[0114] For computational simplicity, every compound character is given a unique ID which has two components: the spelling character and a number that refers to the sound character. In an example, aO is a silent a, al makes its usual sound, a2 says its name, and a3, a4, a5 etc. all refer to the spelling character a with a different sound character, which could include characters from different alphabets. In addition, ali indicates that the symbol represented by al is presented as italic, alb is bold, and alc is bold italic as shown in Appendix D.
[0115] The names of example compound characters are set out in Appendix C. The compound character (a:o) is called /a/ says /o/, where /a/ makes the sound in the word "at" and /o/ makes the sound in the word "off". To make the table in Appendix C more readable, /a/ says /o/ is displayed as "a says o". In addition, the name of a compound character can be specified with reference to a word or sounds it rhymes with. For example, in the word "coin", the sound /o/ makes rhymes with /oy/, the sound of the word "boy". If a person knows the sounds of the English phonemes, they will intuitively know the name of each compound character.
[0116] The sound character (font) size and the spelled character (font) size are fixed in relation at least such that the sound character (font) size is smaller than a spelled character (font) size, e.g., at least 2 points smaller; however, the absolute (font) sizes can change, e.g., based on a selected printing font size for the physical book, and/or a user selected digital font size for the electronic book, e.g., with the compound character and marked-up phonetic words defined using a TrueType(TM) font.
[0117] The method may include printing the physical book and/or storing the electronic book in a non-transient computer-readable medium, e.g., on a CD ROM or a memory stick, on a hard drive or on a cloud server.
[0118] The encoding of the text can be performed at the word level, including marking up the words automatically using specific processes, e.g.: if there is only one vowel, it will be a one syllable word and there will be no syllable breaks; if the word is being translated into the IPA and there are the same number of IPA phonemes (characters or digraphs representing an English phoneme or sound) as in the English word, there are no silent characters in the IPA.
[0119] Where it may be difficult to discriminate superscript characters, different fonts can be used to make the superscripts easily recognizable and distinguishable. For example, if there are two superscripts U and u, if they were in the same font, it could be hard to recognize which is which because the only difference will be the size of the superscript. For example, a curly u over an "a" means a small u making the sound as in /up/, a larger straight U makes the sound /you/, i.e., says its name. The characters are easy to see because the lower case vowels are curly and smaller, and the uppercase vowels are straighter and larger. Readers are accustomed to recognizing even little capitals and can distinguish the capitals from little lowercase characters because the fonts of the superscript characters look different. Some superscripts like 0 and o can be distinguished by their roundness and by their placement: the lower case o can be round and sits in the centre of an uppercase A, when the capital 0 can be more elliptical and sits slightly to the side of the A.
[0120] The sound characters (which can be displayed as superscripts) are selected to be at least 6-8 point in size. The spelling characters are selected to be at least 2 points larger than the sound characters. In addition to the spelled characters being larger, the spelled characters are in a straight horizontal line (as words are currently displayed in typeset documents), and the superscripts will only occur when there is a sound character to be displayed, and its placement will vary with the spelled and sound characters.
[0121] A Japanese-fluent reader learning the sounds of English characters may select-as part of the initial alphabet English-characters with Japanese characters over them (i.e., Japanese sound characters) so they learn the sounds of the English characters that are the same as Japanese characters. This would be chosen so that there was a mapping of English to Japanese for some characters (i.e., with Japanese sound characters), and Japanese to IPA for the other characters (i.e., IPA sound characters) if the reader knew the IPA.
[0122] For example, for someone who knows English and the English phonemes, the system will not need the second alphabet to read English words marked up phonetically the spelling characters and the sound character will come from the same base alphabet. The use of the second alphabet is to teach the sound syllable association between the base alphabet and the second alphabet - e.g., telling a Japanese speaker that a particular English character makes the sound of the Japanese character (second alphabet) added to the English character (base alphabet).
[0123] The preselected character set may include:
a. a plurality of sound characters in the secondary alphabet associated with spelling words in the primary alphabet for common phonemes that exist in both alphabets, i.e., for spelling characters in the primary alphabet that can be defined/pronounced using characters in the secondary alphabet (e.g., as set out in the table in FIG. 4 with Japanese sound characters and IPA sound characters); b. a plurality of sound characters in the primary alphabet associated with phonemes in the primary alphabet for phonemes that do not exist in the secondary alphabet; and/or c. a plurality of sound characters in the primary alphabet associated with phonemes in the IPA for phonemes that do not exist in the secondary alphabet. (Thus, if there is a sound that is in English but not in Japanese, then the method can use English characters or use an IPA character. Accordingly, the preselected character set may be a combination of primary alphabet (English) spelling characters with sound characters in the primary alphabet (English), the IPA and the secondary alphabet. If an English speaker is learning say Italian or Spanish, the Spanish or Italian sounds can be spelled to reasonable accuracy for English speakers using English sound characters. "LL" in Spanish make the sound /y/, and in Italian, "ce" as in "cello" makes the sound /che/. The system and method can include foreign alphabets using Roman Alphabets with sound characters that tell an English speaker the approx. sounds made by the foreign words.)
[0124] The reader pronounces the compound character following the usual sound represented by the sound characters (superscript), which might be one or more characters, and the sound characters (superscript) represent the same sound regardless of which spelling characters they have been added to.
[0125] The sound characters can include diphthongs, e.g., "th", "oy" "ow", "ch", "sh" etc, which may be a set of characters, optionally typeset as an overlapping combination, e.g.,"m".
[0126] The compound characters displaying the character pairs, which in the above example are the spelled characters with sound character (superscripts) as well as the Roman spelling characters when these characters make their usual sound, effectively define a new alphabet with combinations of characters from one or more alphabets, e.g., English, IPA, other language characters and Roman Alphabet/Roman Alphabet characters from different primary and secondary alphabets which both use the Roman Alphabet. An example is the Fonetic English Alphabet which is set out in Appendix C.
[0127] The method shows the sound of a character with another character that is already understood by the reader. The phoneme set may be based on the secondary alphabet such that they are appropriate for teaching the primary alphabet to speakers of the secondary alphabet. If a foreign language speaker is learning English, the sound/character association can be taught by having English as the primary language (defining the base alphabet) and the foreign language as secondary language (defining the second alphabet). Once the sound/character association has been taught/learned, then the foreign language speaker will be able to use English characters as both the primary and secondary alphabets, or could continue to use English as the primary alphabet and the foreign language characters as the secondary alphabet. In electronic form, a custom alphabet best suited to the reader can be selected by the reader or by an AI system based on reader usage information collected by the system. Documents that are printed locally can have a customized alphabet. With bound books in printed form that require print runs to be produce cost effectively, a cost effective solution will be to have English as both the primary and secondary alphabets. But individualized print solutions for bound books can be provided by using print on demand.
[0128] The phoneme set may include, in the superscript sound characters, a plurality of each Roman character in the base alphabet, wherein these characters in the plurality are mutually visually distinguished by a font, including a bold font, an uppercase font, a cursive font, and/or an italic font. For example, vowels may have two sounds (e.g., their usual sound /a/ as in /at/ and /ay/ as in /day/), so /a/ may be shown as a superscript "a" and /ay/ as "A". For example, to differentiate the characters "u" and "U", the method and system use different fonts with the capital character an upright capital and the lower case character a visually different representation (e.g., a more curvy representation), i.e. using a different font to make it clear which is an uppercase and which is a lower case vowel. See Table B of Appendix B to see the way in which the 5 vowels of English "say their name".
[0129] The plurality of compound characters thus formed may be referred as a compound alphabet or alphabets that enable the reader to decode the sound of the word with the identified phonemes by visually indicating the sound to the reader in a manner that may minimize additional learning of symbols by the reader. In the case of someone learning English, this alphabet may include characters with sound characters from a plurality of mutually different alphabets, e.g., English, the IPA and a non-English alphabet. What is important is that the shape of the word in English looks the same without the superscripts and with superscripts, i.e., the shape of the word without mark-up looks the same compared to the shape of the word with mark-up. This is necessary to efficiently develop sight word recognition.
[0130] The primary alphabet may be English and the secondary alphabet may be English, e.g., for native English speakers to manage the decoding of the sound of non phonetic English words and by foreign alphabet speakers who have mastery of the sounds of English letters. The primary alphabet may be English and the secondary alphabet may be the International Phonetic Alphabet, e.g., for foreign language speakers who know the IPA. The primary alphabet may be English and the secondary alphabet may be a non English alphabet, thus using the non-English language characters to tell the reader what sound is being made for those sounds which are the same in the non-English language and
English. The primary alphabet may be non-English and the secondary alphabet may be English. Both the primary alphabet and the secondary alphabet may be non-English and mutually different, e.g., Hindi alphabet characters and Japanese sound characters. However,as English becomes the lingua franca, more focus is on English to a foreign language and a foreign language to English. The most common format may be English with English sound characters.
[0131] The phonemes of a language are the sounds the language uses to create word sounds by using an ordered set of phonemes to create a word. In an alphabetic language, every phoneme can be represented by one or more characters. There are no words whose cannot be represented by the characters in the alphabet of that language. Take English for example. By extending the alphabet to spelling and sound characters, every word in English can be represented by a set of character pairs of English characters. Take the non phonetic word "colonel". It can be represented as a set of character pairs: <sbst><c:c><o:ur><l:O><o:O><unsb><n:n><e:e><l:l>. The character pairs <l:0(zero)> and <o:O (zero)> indicate that the I and o are silent characters which is explained below.
[0132] The adding of the one or more sound characters includes adding a gap/space between the sound characters and the respective spelling characters such that the words in the encoded text are clearly visible and not touching the sound characters (e.g., so the ink of a printed sound character has a visible gap from the ink of its corresponding spelling character), or selecting a maximum overlap so that is less than 5%, and optionally less than 1% of the length of the character, i.e., the length of a 2 dimensional character in an xy plane is the character's projection onto the 1 dimensional x plane.
[0133] The space between the spelling character and the superscript allows the shape of the word in which the spelling character sits to be clearly visible. The space does not have to be on top of the primary alphabet character, for example, it can use the space between the uprights on the character U above the lower portion of the character U.
[0134] The spelling characters (e.g., English spelling characters) are bigger than the sound characters (e.g., superscripts) so the reader can clearly see the shape of the word in which the spelling character sits (e.g., the English word in the source text), e.g., as shown in FIG. 4. Problem characters are lower case u and uppercase U and lowercase o and uppercase 0. The sound character (e.g., a superscript) may be arranged in different positions for different compound characters: e.g., the lowercase o can go on the top of an A but the capital 0 can go a little to one side.
[0135] The sound characters (e.g., superscripts) are smaller and may be shaped differently from the spelling characters, e.g., straight and curly, the lowercase "o" is round but the capital 0 is more an oval shape 0. In the case of the "curly u", if the two dimensional curly u in the xy plane is projected onto the x plane, then the curvy u will exceed the projection of the straight u with the same width and height by 5%. With the oval 0, the width of the horizontal cross section will be at least 10% smaller than the height of the vertical cross section.
[0136] Keeping the word in which the spelling character sits in the source text allows and encourages the development of sightword recognition (seeing the word and instantly knowing its sound and meaning), and once sightword recognition has developed, the sound characters are no longer required. In other words, the encoded text retains the source text such that the words in the source text can be read by sightword recognition even after the sound characters have been added, thus allowing the reader to use the sound characters if they need to (e.g., if there is a word they do not recognise by sight) but simultaneously allowing the reader to read the words they do recognise by sight (thus faster). FIG. 7A shows 3 paragraphs from Alice in Wonderland. As shown in FIG. 7B, the words in FIG. 7A are in the encoded text (in this example, in Fonetic English outputted text) and retain the shapes of the words in the original source text in FIG. 7A: e.g., "the" retains the same shape in the source text and the encoded text and can be easily read by people who can read English words by sight. The size and placement of the sound character superscripts do not interfere with eye tracking. For example, in a word like "conversations", which has considerable markup, interference with sightword recognition or eye tracking along the line of text in which that word appears is minimal. (It should be noted that solid syllable breaks are not displayed, and that in some words, syllable breaks have not been displayed (e.g., the word "pleasure" in the second paragraph and there are some other irregularities, e.g. the word "close" was not translated into Fonetic English.)
[0137] The outputted text in human-readable form/format may include a properly formatted printed document, a printed and bound book with text and a cover (i.e., a printed with ink on paper) that can be printed in a print run or printed on demand, and an electronic book (i.e., displayed on a computer/e-reader screen, including a PC, laptop, Kindle device, tablet or mobile phone).
[0138] The method may provide a phonetic encoded text, even if the source text is non-phonetic (e.g., plain English) by unambiguously telling the reader what sound a character makes, when it does not make its usual sound, by adding the sound characters.
[0139] The method may also include: adding syllable breaks; and retaining / showing silent characters. The method may include identifying syllables in the source text, and adding spaces/syllable breaks between adjacent syllables in the encoded text.
[0140] The silent characters are retained, but may be de-emphasized by using a visual differentiator from the spelling characters that is consistent for the silent characters in the word/text, without changing shapes of the silent characters, e.g., using a different font colour, e.g., grey instead of black. There are multiple ways of indicating silent characters with some examples shown in FIG. 8A. Other ways of indicating silent characters include showing an outline of the character, a different coloured outline (e.g. grey), reverse video, different font, different character size and so on. It is also possible to indicate a silent character using single or double strike through, using outline, a different font, by crossing it out with a watermark, such as an X, or placing an x above the silent character in a similar position and with a similar shape as a sound character. Examples of different ways to indicate a silent character and different ways to indicate a stressed or unstressed syllable are shown on FIGs. 8A and 8B. It is possible that users of a logographic language, such as Mandarin or Japanese may prefer different ways of indicating a silent character than people whose native language is alphabetic, which will be experimentally determined.
[0141] A syllable break at the beginning of a word will only be necessary if the first syllable of the word is stressed. Syllable breaks, and stressed and unstressed syllable breaks can be shown in a large variety of different ways, with some examples of stressed and unstressed syllable breaks shown in FIG. 8B.
[0142] In digital type, an "Em" is a space, subdivided into a grid, referred to as Units Per eM (UPM) size. With 12 point type, the eM is 12 point, and not the specific size of a character within the eM. For example, a 12 pt Verdana character is larger than a 12 pt Calibri character. The UPM for True Type fonts is often 2048, which means that there are 2048 grid rectangles in each eM.
[0143] As shown in FIG. 3, examples of the compound characters may be formed of the following pairs of sound characters and spelling characters:
a. a sound character "e" above a capital spelling character "a" (i.e., "A");
b. a sound character "e" above a lower-case spelling character "a";
c. a sound character "s" above a capital spelling character "c" (i.e., "C");
d. a sound character "s" above a lower-case spelling character "c";
e. a sound character "J"above a capital spelling character "d" (i.e., "D");
f. a sound character "J"above a lower-case spelling character "d";
g. a sound character "A" above a capital spelling character "e" (i.e., "E"); and
h. a sound character "E" above a lower-case spelling character "e".
[0144] In FIG. 3, the left hand column shows compound characters with the spelling character an uppercase character and on the left hand column, the spelling character is a lowercase character. The box around the letter represents the size of the eM and in the example in FIG. 3 the height of every character is 3700 UPM for all spelling characters without or without a superscript.
[0145] The vertical height between the base of the EM and the base of the spelling character (excluding the descender) is represented by the numeral 1, and in the example in FIG. 3 is 500 UPM. In the current example in FIG. 3, the font is selected for the spelling characters, and the spelling characters appear in compound fonts unchanged, including the descenders of lower case letters, which is why they are not separately marked. Therefore the vertical spaces represented by numerals 1 and 2 are not changed from the original spelling font, whether uppercase or lowercase, with or without descenders or ascenders.
[0146] The vertical spelling letter height is represented by the numeral 2, and in the example in FIG. 3 is 2000 UPM for uppercase spelling characters and for lowercase spelling characters with ascenders.
[0147] The vertical space between the top of the spelling character and the base of the sound character is represented by the numeral 3, which is 150 UPM for uppercase spelling characters and for lowercase spelling characters with ascenders (the lowercase letter "d" has an ascender that is 500 UPM high, so that the character has the same overall height as an uppercase letter). For lowercase spelling characters without ascenders, the vertical space between the top of the spelling character and the base of the sound character is 200 UPM.
[0148] The vertical height of the sound character is represented by the numeral 4, and is 700 UPM except for the special cases of the vowels o and U which are discussed below.
[0149] The vertical distance between the top of the sound character and the top of the eM is represented by the numeral 5, and with uppercase spelling characters and lowercase spelling characters with ascenders, the height is 350 UPM. With lowercase spelling characters without ascenders, the height is 800 UPM.
[0150] In the current example in FIG. 3, the ratio of the height of the superscript sound character is 700 UPM compared to the height of an uppercase spelling characters and lowercase spelling characters with ascenders is 35%, and the ratio of the height of the superscript sound character is 700 UPM compared to the height of an uppercase spelling characters and lowercase spelling characters without ascenders is 46.6%.
[0151] There are exceptions to a standard sound character superscript size, including the following.
[0152] With digraphs, such as "ur" or "or", the limiting dimension can be the digraph character width, not the height.
[0153] It is possible to increase the width of the spelling character slightly (5% or less) as increases above this amount can affect the horizontal spacing of characters and therefore the shape of the word.
[0154] Spelling characters in words are separated with vertical separation spaces. Superscript sound characters that are digraphs are treated as a single character so vertical separation space between digraph characters is not required. For example, the digraph "oy", the "o" and "y" can be fitted more closely together by having a curved separation line, enabling a larger superscript.
[0155] The lowercase letter "o" and the uppercase "O" are more easily visually discriminated by having the lowercase "o" substantially smaller than the usual superscript size, and by making the uppercase "O" an elliptical shape, keeping it as the same size as the other superscripts and for certain characters like uppercase "A" locating the uppercase "0" to the side of the A. The smaller "o" can be easily recognized as it is a circle and is always centred over the spelling character. A similar exception is made for the character IU".
[0156] As described above, the lowercase vowel represents the usual sound of a vowel and the uppercase vowel makes the sound of its name. To make the superscript vowel characters easier to visually discriminate, the lower case vowels are presented in a "curly" font. The uppercase character "I" can be presented in a serif font to make it easier to visually discriminate.
[0157] There is a tradeoff with the size of the superscript character: the larger the superscript sound character, the easier it is to recognize, but the more the superscript sound character may impact the easy recognition of the word shape. The strategies to overcome this include the following.
[0158] There are a limited number of spelling and sound character combinations that actually occur in English words: the spelling character "a" can make the sound /o/ as in the word "watch" but there is no word yet found where "a" makes the sound "t". When there is just one sound character found, such as the spelling character "z" saying "zh", a reader seeing "z" with a sound character will quickly learn it is the character pair<z:zh>. The spelling character "d" can have two character pairs <d:j> as in "dew", and <d:t> as in "baked". In these cases, the reader will quickly learn to discriminate the "D" and "T" used as superscript sound characters. With only a small amount of practice, the reader will start to recognize the combined spelling and sound character, which enables faster reading and smaller superscripts. Larger superscripts may be helpful for people when they first encounter Fonetic English.
[0159] As noted above, where there are multiple sound characters for a spelling character, e.g. with vowels, the size of the superscript characters can be reduced without losing recognizability of the compound character by choosing the shape of the superscript sound characters to be easy to differentiate from other sound characters associated with a particular by their shape, the font and the position of the superscript sound character relative to the sound character.
[0160] The size of characters can be manipulated in many electronic displays, allowing the reader to size the compound character to best suit their eyesight and reading styles. Print on demand also allow scaling of the size of the compound characters for optimization for individual readability. A smaller print run with fewer pages in a book may be provide a lower cost print edition for people with moderate to good eyesight, and a large print edition can be produced print on demand for those who prefer larger type. Multiple Fonetic English fonts can be produced with different sized sound character superscripts, although these additional fonts may not be needed.
[0161] The reader may select a size of font such that they can read the superscript. Small characters are hard to discriminate and can slow the speed of a reader, especially when the reader is first introduced to the Fonetic English font and comes across a new word that they need to sound out. The reader will need to visually discriminate both the spelling character and the sound character to sound out the word. Once sight recognition for those words has been achieved, the reader will be able to read the word using just the spelling characters.
[0162] As shown in FIG. 3:
a. the example sound characters may be 24% to 36% of the size of the capital letters of the spelling characters-measured vertically from a bottom of each character to a top ofeach character;
b. a 12 point uppercase spelling character will produce an approximately 22 point compound character with a 4.2 pt superscript sound character;
c. the uppercase spelling characters are approximately 5mm, with the superscript sound characters approximately 2.1mm;
d. superscript sound characters should be at least 1.8mm so they can be visually discriminated by a significant number readers with normal eyesight; and
e. the size of the sound characters should be 3.7 point or more so that they can be visually discriminated by the reader.
[0163] A variety of different compound characters can be created for the same character pair which have different sizes of the spelling character and the sound character, different sound and spelling character fonts and different locations of the sound character relative to the spelling character. There are several ways that the best compound characters can be selected for a reader which include the following:
[0164] A user can interactively select online the Fonetic English compound character font that the user judges is easiest for the user to read by choosing one or more of the Fonetic English compound character font options.
[0165] Another selection method is to flash compound characters onto a screen and have the user select the compound characters that are the easiest to read.
[0166] A further way to select compound characters that best suit the reader is to present text in different Fonetic English compound character fonts, which the user can read and is tested to see which Fonetic English compound character font is fastest for them to read and/or provides the highest comprehension score.
[0167] The collection of this empirical data may allow machine learning systems to be trained to intelligently suggest fonts to users to improve their reading fluency and comprehension.
[0168] As explained above, the superscript sound characters can be in different alphabets, such as Japanese and Mandarin. Larger superscripts may be needed where the superscript sound characters are very intricate or where two characters may look alike. An alternative is to use a combination of different alphabets, replacing a character that looks similar to another character with say the IPA or a Roman character.
[0169] The method described herein adds the sound characters to the spelling characters, rather than replacing the characters as one would when translating a text into the International Phonetic Alphabet (IPA), so the spelling characters from the source text remaining legible in the encoded text, so the encoded text is at least as easy to read as the source text for a reader of the primary alphabet (in contrast to IPA, which needs to be learned separately, and words represented in the IPA do not look anything like words in the base alphabet (spelling characters)). As the encoding method does not change the spelling of the source text, (this is why the method doesn't delete the silent characters, i.e., silent characters are retained to keep the original words in the source text in their original shapes but are de-emphasized to show they are silent characters) when a reader sounds out a word and develops sightword recognition of that word, the reader will recognize the word by shape, and can read that word whether it is encoded (or "marked up") or not: in other words, the encoding process adds information to the source text without removing or moving any of the original source characters. In an example, if the source text is in
English, which is non phonetic, making the English phonetic by showing unambiguously the sounds of a word in the right order in the encoded text may enable an efficient way to teach reading.
[0170] Learning how to syllabify words is only needed to enable readers to decode the sound of new words. In the new mark up system, syllables are marked for the reader. When sightword recognition of the encoded word is achieved, it is unnecessary separately to teach syllabification (which is complex to teach with multiple rules and exceptions) because the shape of the word and its sound have been learned. Many people can learn syllabification simply from seeing words broken into syllables without further teaching. A key is to teach sightword recognition: once a reader can recognize a word by sight, the reader can in general read that word in standard English without any mark up.
[0171] Furthermore, by adding the sound characters to the spelling characters, i.e., substantially above or below the spelling character (i.e., so they do not substantially overlap when printed with the spelling character), the sound characters lie within the same visual field as the corresponding spelling characters when in the encoded text, thus the encoded text may have the same ocular load (i.e., how much eye movement is required for reading) as the base text, thus the information content of an encoded character can be higher without increasing the ocular load on the reader. The reader can look at (and recognize) the spelling character and superscript/subscript sound character as a single visual unit with a compound character name expressing the sound that the character makes (a says o). This enables the reader to recognize the spelling characters in the word and also be able to sound out the word, efficiently developing sightword recognition.
[0172] As the sound characters lie within the same visual field as the corresponding spelling characters when in the encoded text, and when the sound characters are from the primary or secondary alphabet, the level of concentration and/or cognitive load may be minimised relative to reading a text in a pure phonetic alphabet like IPA. If a reader has to concentrate to visually discriminate and recognize a character, the reader may have less concentration for understanding the meaning. A word spelled out in the IPA does not look anything like the word spelled in say the English alphabet. Sounding out a word in IPA will not develop sightword recognition in a base alphabet. Providing the spelling characters with the sound characters may assist recall of the sound character.
[0173] The sound characters have been designed to lie within the same visual field as the corresponding spelling characters when in the encoded text, the reader will typically recognise the combination as a compound character which makes the sound of superscript character. As the reader becomes more and more familiar with the allowed combinations of spelling characters and sound characters, the reader should intuitively know what the sound character is, because they will know there are only certain combinations of spelling and sound characters, and the visual memory part of their brain will match the entire compound character.
[0174] The method may include including encoding the source text by:
a. identifying at least one word ("identified word") in the source text that matches one of a plurality of preselected words in a preselected set of words formed of the base alphabet, wherein the identified words includes at least one stressed syllable and/or at least one unstressed syllable defined in the preselected set (including words with equal stress on each syllable), wherein each syllable includes one or more of the spelling characters, and
b. replacing/adjusting the identified word by adding a dot preceding each syllable, wherein the spelling characters of the syllable remain unchanged, and wherein the dot is a closed dot for the stressed syllable, and/or an open dot for the unstressed syllable.
[0175] The diameters of the dots are substantially the width of the thinnest character, usually an i in a sans serif font and no more than 2x the width to preserve word shape.
[0176] The method may include showing stress in a word with a closed dot preceding a stressed syllable, and an open dot preceding an unstressed syllable. For example, "CONTRACT" with a stress on the first syllable may be encoded as "[CLOSED DOT]Con[OPEN DOT]tract", whereas "CONTRACT" with a stress on the second syllable may be encoded as "Con[CLOSED DOT]tract". One way to show stress is to have a solid dot and a circle. Another way is to have a dot and a square. Any pair of mutually different syllable marks may show the difference between syllable marks. There may be 3 levels of stress: high stress, medium stress and low stress. The closed and open dots may be used while retaining the overall word shape for sightword recognition.
[0177] "Fonetic English" ("FE") refers herein to an alphabet, where the spelling characters and the sound characters are English characters.
[0178] As shown in FIG. 5, the method may include automatically generating and displaying to a user, via a human-machine interface (HMI), a word (e.g., "acknowledgement" 502) in the source text together with the corresponding encoded word (e.g., "acknowledgement" in FE 504) in the encoded text, and optionally with the corresponding IPA word (e.g., "acknowledgement" in IPA 506). The method can include providing a user-interface control via the HMI (e.g., a "translate" button 508) that receives an input command from the user to commence / perform the encoding steps of the method. Fonetic English aims to use only absolutely necessary mark up. For example, in FIG. 5, the dictionary has marked up the syllable "ment" replacing the character "e" with a schwa.
Pronouncing the syllable without the schwa, i.e., as "ment", will not confuse English speakers, and is simpler, and so the marked up word will be postprocessed to change this schwa mark up back to the letter "e".
[0179] The method may include automatically encoding / marking up an English word into Fonetic English (including silent characters, syllable breaks, stress syllables and the sound each character makes) based on inputs from a dictionary/database of word-IPA pairs, i.e., a plurality of words in the primary language and the IPA representations of those words, e.g., the word "Thoroughgoing" and its IPA representation "/,Ar.a'gau.ir/" (or the word "acknowledgement" 502 and its IPA representation "/ek'nDl.rd3.ment/" 506, as shown in FIG. 5), as follows:
a. break the IPA characters into phonemes (sounds) and add in compound characters that make the same sound as the IPA phonemes, e.g., <sbun><6:th><A:u><r:r><.:sbun><a):u><':sbst><g :g><au:O><.:sbun><i:i><rj:ng> for "Thoroughgoing"-or (ek)'(nl).(d3).(ment) for "acknowledgement" 502), where <sbun> means unstressed syllable break with no syllable break indicated as it was at the start of the word, or <.:sbun> means unstressed syllable break with a syllable break marked with "." in the IPA, and <':sbst> means stressed syllable break with the stressed syllable break marked with "'" in the IPA;
b. sequentially number phonemes and stress marks (a word will have an unstressed first syllable unless the first syllable is stressed), which is the last number in the brackets <> for each character/diphthong in the IPA, e.g., <sbun1><6:th2><A:u3><r:r4><.:sbun5><a):u6><7':sbst7><g
:g8> <au: 09> <.:sbun10> <i: i11><rj: ng12>;
c. add the IPA characters to the primary language characters by referring to the IPA to compound-word translation table (e.g., as shown in Appendix A) progressively by adding the IPA information showing the sound the spelled characters make which is invariant (which may dramatically reduce the number of possible ways the word can be marked up);
d. the characters th, r and ng are added first because each of these characters makes only one English sound: there are two instances of the character "g", but only one phoneme making the sound /g/, so the character "g" is left for a further iteration, e.g., Thoroughgoing becomes <Th:6:th2>o<r:r:r4>oughgoi<ng:r:ng12>,
e. going from right to left, the first syllable is not marked as stressed so it is marked as unstressed, e.g., an unstressed syllable break follows the character
"r", there is only one character between "th" and "r", so it must have the sound A, thus "Thoroughgoing" is <sbunl><Th:6:th2><A:u3><r:r:r4><.:sbun5>oughgoi<ng::ng12>,
f. going from right to left, if there are three characters, "ing", in the last syllable of the English word, the character "i" makes the sound /i/ and is preceded by an unstressed syllable break, thus "Thoroughgoing" becomes <sbunl><Th:6:th2><A:u3><r:r:r4><.:sbun5>oughgo<.:sbun10><i:i11><ng: O:ng12>
g. continuing the progression from right to left, the next two phonemes may be 0 and /g/, and the two English characters are "o" and "g", so the IPA phonemes can be added to these characters, thus <sbunl><Th:6:th2><o:A:u3><r:r:r4><.:sbun5>ough<g:g :g8><o:au:09><.:sbunlO><i:ill><ng::ngl2>
h. continuing the progression from right to left, a stressed syllable break can be added in front of the second "g", thus <sbunl><Th:6:th2><o:A:u3><r:r:r4><.:sbun5>ough<':sbst7><g:g :g8><o:au:09><.:sbun10><i:i:i11><ng: :ng12>,
i. by counting the phonemes, it is clear that the characters "ough" make 1 sound: /u/ with the characters "o", "g" and "h" are silent and the character "u" makes its usual sound /u/, thus <sbunl><Th:6:th2><o:A:u3><r:r:r4><.:sbun5><o:O><u:e:u6><g:O><h:O> <':sbst7><g:g:g8><o:au:09><.:sbunlO><i:i:i11><ng::ng12> (where <o:0>, <g:0>, and <h:0> indicate that o, g and h are silent.) (The US pronunciation is slightly different: in the 6th entry the "u" is silent and the character "o" makes the sound /0/.)
[0180] Checking the mark up with the following automated checks:
a. Is there an IPA character or IPA characters in the IPA mark up that is not in the IPA to HA translation table? (An English phoneme may sometimes be represented by different IPA symbols.)
b. Checking that the characters pairs in the spelled word marked up into Fonetic English are all valid character pairs
c. Can the IPA be extracted from the marked up word by taking the middle character from the mark up, together with the syllable breaks?
d. Can the English word can be extracted by taking the first character in the brackets<>?
[0181] The words marked up in the IPA may not comply with the standards of Fonetic English for the following reasons:
a. There can be mistakes in the IPA dictionary mark up. For example, in one well known dictionary, the word "acknowledge" is translated as /ek'nl.1d3/, which has 3 syllable breaks. However, the word "acknowledgement" in that same dictionary has only one syllable break /ak'nolid3mant/.
b. The objectives of dictionaries marking up words are different from those of Fonetic English. The aim of many dictionaries is to have the pronunciation as close as possible to the way the word is pronounced by an English speaker in the country where the dictionary is published.
[0182] The objectives of Fonetic English may include:
a. To make the marked up word as easy to read as possible. This means that additional information about the sounds of words will be added only if not adding this information will make it impossible to decode the sound of the written word. In turn this means that if the original spelling can be decoded so that the word when spoken is recognizable, then additional sound characters are not needed. Take the word "real" for example. One mark up for the word in an IPA translation is "rE.ul" where the character "a" is made a schwa. But this is an unnecessary addition, as the marked up word "rE.al" can be decoded to make a sound very similar to the sound "rE.ul".
b. To make prefixes, suffixes and root easily recognizable. The word "getting" is marked up in some dictionaries as "get.ting". The Fonetic English mark up is "getl.ing" (with the second t a silent character). This has a several benefits: the suffix "ing" is clearly distinguished, which reduces the number of syllables a student needs to learn, and clearly identified the suffix "ing", and the root word "get" is clearly recognizable.
c. To standardize the mark up of the root words to make then as recognizable as possible. Take the word "real". The following mark up is in different dictionaries "rEal" (the "e" says its name, one syllable with a silent "a"), "rE.ul" with the "a" a schwa, and "rE.al" with the unstressed "a" making its usual sound. The word "reality" is marked up as "rEoaloiotE" with the "a" in "real" stressed and making its usual sound. Therefore the most consistent mark up of the root word "real" will be "rEoal", and this mark up is therefore selected for Fonetic English.
d. To mark the word mark up consistent with methods currently used to teach reading. There is a system called "The Science of Reading" where there is a specific mark up. An open syllable is a syllable that ends in a vowel or a vowel diphthong. The vowel in an open syllable usually says its name, e.g. paper, where the word is pronounced pAper. In closed syllables, the vowel tends to say its usual sound. If there are 2 valid mark up options, then the preference will be to use the open and closed syllable mark up.
[0183] The marked up words can be checked programmatically using one or more of the following methods:
a. Translating the IPA mark up from more than one dictionary, or from other systems such as the word mark up developed by DARPA, and comparing the translations. If there are differences, then these words will be edited by computer algorithms (including machine learning derived algorithms) and/or edited manually.
b. Using computer comparison tools to locate and standardize words marked up with prefixes and suffixes. For example, "getting" will be standardized to "getting" with the second "t" as a silent character.
c. Using computer algorithms to analyze the marked up words to locate root words like "real" discussed above to ensure that the mark up of the root word is as simple and standard as possible. For example, the mark up if the word "real" is "real", replacing the sound character schwa in the IPA version with the original sound character being the same as the spelling character. This can be done automatically by changing the mark up of the root word automatically to a predefined and then having the changed checked automatically by comparing it to similar words or checked manually or both.
d. Comparing the translation from IPA of similar words, like "acknowledge" and "acknowledgement" to check that the mark up is consistent. In the above described example, additional syllable breaks can be added to the translation of the word "acknowledgement".
e. Checking for syllable break rule compliance. Every syllable has a vowel. So a word with one vowel will necessarily be a one syllable word. A word with 3 vowels that are separated by consonants will have 3 syllables. A word with only 2 adjacent vowels will likely be a one syllable word, but it is possible that there could be a syllable break between the vowels, although this has a low probability. Words which do not comply with the open and closed syllable rule should be highlighted for inspection by an editor.
f. If a new syllable is created, then the word with the new syllable can be checked. New syllables in which all the spelling characters are the same as the sound characters are likely to be correct, and will have a lower priority for checking than syllables in which some spelling characters have different sound characters.
[0184] Words with the same root can be marked up using the following process:
a. A comprehensive table of prefixes and suffixes is created and the suffixes and prefixes are marked up, e.g. "ly" makes the sound "olE". Some prefixes and suffixes can have more than one syllable e.g. the prefix "inter", and the suffix "ectomy".
b. A search of a comprehensive dictionary for a root word like "real" will locate the words containing this root word, called the searched words.
c. The root word is marked up and the marked up root word (with syllable breaks as needed) will replace the root word in the English word.
d. The prefixes and suffixes can in the English word can be replaced by the marked up prefixes and suffixes (with the syllable breaks but not so there are 2 syllable breaks which are not separated by a character).
[0185] A word can be checked manually or by comparing that word to the mark up of that word derived by the IPA mark up process discussed above.
[0186] The information collected in the mark up operations described above may be used to train a machine learning system to edit words marked up into Fonetic English and suggest accurate mark up for new words. The training information may include:
a. for each English word, the English word and one or more IPA representation of the sounds of the English word, and, if available, the word correctly marked up into Fonetic English;
b. multiple IPA representations to assist in the detection of errors in the IPA mark up and to assist in selecting the least complex Fonetic English mark up;
c. analysing the mark up of root words (e.g., "real" described above) to ensure that the mark up is consistent across all the words that contain the root word;
d. the table of prefixes and suffixes, and their mark up;
e. a database of the syllables in the marked up Fonetic English and their frequency, which will enable the machine learning system to give higher probability to the syllable with the higher frequency; and
f. a database of rules, such as each syllable must have at least one sound, each syllable must have at least one vowel, each prefix and suffix must contain at least one syllable, if there are two vowels in a word and these vowels are separated by a consonant, there must be 2 syllables, is the root word consistently marked up across all the words that contain the root (see the discussion of "real" above), and so on.
[0187] The method may include automatically generating a database of words with compound characters, each including a sound character by:
a. receiving each word written in English ("English word") and written in the international phonetic alphabet ("IPA word");
b. comparing characters ("English characters") in the English word to characters in the IPA word to identify whether one or more characters have a sound other than a usual sound (for that one or more characters), and if so
c. using a compound character instead of the English character to automatically generate a word ("marked-up phonetic word") that includes the spelling character and a sound character, wherein the sound characters:
i. are human-readable characters in the base alphabet and/or in one or more secondary alphabets,
ii. have a selected secondary font/character size smaller than a primary font/character size selected for the spelling character in each compound character, and
iii. are added to the compound characters such that the spelling characters remain human-readable such that the spelling character and the sound character of each compound character are within one visual field;
d. outputting a database of English words and marked-up phonetic words such that shapes of the marked-up phonetic words are substantially the same as shapes of the respective English words.
[0188] The same process can be used to mark up words in languages other than English for which there is an IPA marked up word, wherein the method received each word written in the other language, compares characters in the other language, and outputs a database of words in the other language.
[0189] The method may include providing a user interface of the human-machine interface (HMI) for a user to manually select marked-up phonetic words for the English words, e.g., as shown in FIG. 6, thus manually generating the database of marked-up phonetic words. The mark up system allows an editor to select appropriate sound character for the spelling character in a word, which either is greyed out (meaning it is silent), has a sound character superscript, which indicates the spelling character makes the sound of the sound character superscript, or the spelling character by itself, in which case the spelling character makes the usual sound of the spelling character. The lower boxes between the spelling characters indicate unstressed syllable breaks, and the upper boxes indicate stressed syllable breaks.
[0190] With documents that are not standard typeset documents, e.g., children's picture books or comics, where text might be in artistic fonts and may not be horizontal or vertical, converting the text to Fonetic English fonts may include (in order to substantially retain the look of the book/page):
a. machine scanning a page of the non-standard typeset document, and using automatic optical character recognition (OCR) convert the non-standard typeset text on the page to machine readable English text;
b. converting machine readable English text to Fonetic English as described herein;
c. automatically comparing (by a first program) the machine readable English text with the text in Fonetic English, and automatically creating a table of additional markings that are needed for each word on the page, which could include adding sound characters, syllable breaks or changing the colour of silent characters; and
d. automatically and directly inserting (by a second program) the additional information into the non-standard typeset text on the scanned non-standard typeset page with:
i. the size and location of the sound characters calculated based on the size of the type of the non-standard typeset text, with the default location based on the location of the sound characters in the Fonetic English font (in addition, irregular type may require that the second program changes the location to fit the situation),
ii. new characters superimposed over a non-standard typeset character (in the non-standard typeset text) to indicate a silent character, for example, a letter is imaged, converted to a light grey and them superimposed over the existing character in the non-standard typeset text, or alternatively the silent character could be lightly crossed out so that it does not substantially change the shape of the word, and
iii. syllable breaks added where there is room for them, which may involve the syllable breaks being added lower than the syllable breaks in standard Fonetic English, e.g., substantially under the spelling characters.
[0191] An example of how words can be marked up into Fonetic English without changing the original typeset words is set out in FIG. 9B. A portion of a page of a book is reproduced at the top, and the words in this page portion are marked up into Fonetic English and displayed in the lower portion.
[0192] The publishing system can also present sentences broken up into meaningful groups of words, which are called phrases in this document even though some academics may hold the opinion that some groups of words would not technically be phrases. Breaking sentences into phrases is of considerable assistance when the sentence is long, complex and/or convoluted, because even skilled readers may need to read a sentence several times to determine where the phrase breaks are. Adding phrase breaks may assist readers who are reading a foreign language especially if that foreign language has a different word order than their native language.
[0193] Sentences can be broken into phrases by:
a. using punctuation in the sentence
b. recognizing conjunctions like "because" which usually start a new phrase: conjunctions like "and" are more complex as they may denote a phrase break where two sentences are joined together in one sentence by using an "and", or may not denote a syllable break e.g. in the phrase "bat and ball".
[0194] Analysing the grammatical structure of a sentence using a tool like AutoMap developed at Carnegie Mellon University to be able to classify each word by its parts of speech, and then using this information to group words into phrases e.g. nouns and adjectives may for the subject of a sentence, verbs and adverbs will be grouped together, and nouns and adjectives grouped together as the object of the sentence. This is the way to classify the effect of conjunctions like "and".
[0195] Requesting teachers and academics to break text into phrases manually and then using this information, together will rules such as in a-c above to train a machine learning system to analyse text and break it into phrases. This method can be extended to students marking up documents, and then by using a statistical to accept only those phrase breaks that have a high probability of being right.
[0196] There are many ways that phrase breaks can be displayed in multiple ways including such ways as retaining the text format but increasing the word spacing between phrases to indicate a phrase break, using an easily recognizable symbol, such as o phrase 1 o phrase 2 o to separate the phrases using the same block format, or by displaying each phrase on separate lines.
[0197] Breaking sentences into phrases is an example of the Comprehension Improvement Tools in step 12 of the Method 100 and is further explained below.
Testing and practice
[0198] Described above are a number of ways that an alphabet can be created that will contain additional information to assist readers to learn to read a language, learn a language more efficiently, efficiently improve their language skills, and/or better comprehend documents written in that language. In order to assist a particular class of readers, or individual readers, information needs to be collected in the User Progress Database, step 9 of Method 100, and analysed to optimize learning and comprehension outcomes.
[0199] The methods to achieve the optimization of learning activities of the Method 100 Step 8, Exercise Skill Improvement Database, Step 10, Vocabulary Improvement Tools, Step 11, Reading Improvement Tools and Step 12, Comprehension Improvement Tools include:
a. Measure what a class of people (or an individual) know and what they don't know and teach what they don't know e.g. does an individual know what sound a particular character makes,
b. Measure the skill levels of individuals or classes of individuals e.g. how well can someone accurately discriminate the sounds of particular phonemes, and/or
c. The number of times a person needs to learn something for that knowledge to be stored in their relatively non volatile long term memory.
[0200] The optimization of the learning activities of the Method 100 Step 8, Exercise Skill Improvement Database, Step 10, Vocabulary Improvement Tools, Step 11, Reading Improvement Tools and Step 12, Comprehension Improvement Tools will usually involve interactive learning modules and this interactivity will enable the system to progressively record and analyse information such as
a. Information about what a person needs to know e.g. they are studying medicine and they need to acquire a medical vocabulary,
b. Information and/or predictions of what a person knows, and what they need to know, from which the system can calculate what they don't know and need to know,
c. Their current skill levels, and/or
d. What revision plan works best for that individual person or group of individuals with similar learning profiles.
[0201] Step 9 of the Method 100 is the User Progress DB. This system logs everything and logs the answers and compares the answers to the correct answers and decoded whether the answer is correct or not.
[0202] The method of logging and storing user information may include:
a. receiving user inputs from a user of a computing system which the computing system stores;
b. the computing system generating measured values of the user's knowledge;
c. the computing system classifying the user into one of the plurality of categories based on the measured values; and
d. the computing system predicting what activities will be optimal for the user based on the stored user data of users in the same category or categories.
[0203] An example of the method of logging and storing user information may include:
a. receiving user inputs from a user of a computing system;
b. the computing system classifying the user into one of a plurality of categories based on the user inputs; and
c. the computing system selecting the alphabet characters from the plurality of alphabet characters based on the user category using a predefined mapping between user categories and character sets.
[0204] Another example of the method of logging and storing user information may include:
a. the computing system presenting a test text in the primary alphabet to the user by displaying the test text visibly or playing the test text audibly, wherein the test text includes: a plurality of words that can be selected by the user using the user interface ("user-selectable words") including at least one test word and one or more distractor words (which are not the test word); and
b. the computing system measuring the values from user selections of the user selectable words, including measuring how many of the least one test words are user selected, whether the at least one test word was correctly chosen, and/or how much time is taken to select the test words.
[0205] Another example of the method of logging and storing user information may include generating the measured values by:
a. the computing system displaying a plurality (e.g., 3) marked up words, a subset (e.g., 2) of which are wrong; and
b. the computing system measuring how many correct and incorrect words the user selects. The incorrect words will be logged in a database and the system will continue to reteach these incorrect answers until the answers are correct, say 3 times in a row.
[0206] The correct answers will be presented to the user occasionally to revise the correct answers to ensure they will be retained in long term memory, as even long term memory requires revision but less frequently. This will be measured and an individual pattern will be developed to ensure long term memory retention with minimized time spent in revision.
[0207] Another example of the method of logging and storing user information may include generating measured values by:
a. the computing system playing the sound of a word with a plurality of syllables (e.g., a multi-syllable word with at least 2 syllables, e.g., a 3 syllable word) defining a corresponding plurality of correct syllables (e.g., at least 2 respective correct syllables, e.g., 3);
b. the computing system displaying a corresponding plurality of blank boxes (e.g., at least 2 respective blank boxes) and a plurality of user-selectable syllables greater than the corresponding plurality (i.e., there are more user-selectable syllables than blank boxes, e.g., more than 2 or more than 3);
c. the computing system receiving input from the user selecting a corresponding plurality (e.g., at least 2, or 3) of the user-selectable syllables;
d. the computing system measuring how many of the correct and incorrect syllables are in the user-selected syllables; and
e. reteaching the incorrect syllables until they are learned, thus until the measured number of incorrect syllables is below a selected threshold (the correct syllables will be revised occasionally as discussed above).
[0208] This process (the above example of the method of logging and storing user information) can be used to teach spelling, i.e., being able to have the correct syllables selected so that the spelling characters accurately spell out the word and the sound characters make the correct sound. This process can be used to improve sightword recognition, and enable a student to detect misspellings because "the word doesn't look right".
[0209] Another example of the method of logging and storing user information may include generating measured values by:
a. the computing system playing the sound of a word defining a first plurality of correct characters; b. the computing system displaying blank boxes equal to the first plurality; c. the computing system displaying a second plurality of user-selectable characters, wherein the second plurality is greater than the first plurality; d. the computing system receiving input from the user selecting the user-selectable characters; and e. the computing system measuring how many correct characters are in the user selected characters.
[0210] This process (the above example of the method of logging and storing user information) can be used to teach a student to improve their sightword recognition and recognize the word correctly spelled. This will allow the student to detect a word that is misspelled because "the word doesn't look right".
[0211] Another example of the method of logging and storing user information may include generating measured values by:
a. the computing system playing the sounds of a respective plurality of words with a time of silence between adjacent ones of the played sounds of the words;
b. the computing system reducing the time of silence between the played sounds based on the input from the user;
c. the computing system playing the sounds of the words slowly in a continuous sound (as used in human to human speech) based on input from the user with the tone corrected for speed (with some audio players, as the speed is slowed, the tone of speech drops); and
d. the computing system playing the sounds of the words at the speed of normal speech in a continuous sound (as used in human to human speech) based on input from the user.
[0212] The computing system can take a word and pass the word to an automated text to speech software module that is configured to produce an audio file containing the sound of the word which can be stored in a database linking the word text to the word sound. A sentence can be played one word at a time by selecting a word, looking up the database for the sound of that word, and playing that word. The sentence as played will not contain intonation and cadence.
[0213] The computing system can take a sentence and pass the sentence to an automated text to speech software module that is configured to produce an audio file of the sentence being read aloud with intonation and cadence. The audio file of the sentence can then be passed to an automated speech segmentation system, such as Aligner developed by
Dr Stefan Rapp, which can break the continuous speech containing the sounds of the words in the sentence into discrete words and phonemes. Because the intonation and cadence applying to a word can be different in different sentences, a database linking the word to its sound also contains the document identifier, a sentence identifier and a word identifier, which allows the words in a sentence to be played sequentially with cadence and intonation, with a variable gap between the words.
[0214] By breaking the audio signal into the sounds of discrete words, and linking the text of the word to its sound in a database, each word in a sentence can be highlighted as its sound is played. This may assist some readers, e.g. those readers who were never read to as children, and will help to improve auditory discrimination be seeing the word and hearing it pronounced. Multiple choice questions can be added to measure the comprehension of the reader.
[0215] Playing the sound of a word whilst highlighting the text of the word may be a useful feature in a reading practice tool.
[0216] Using the above methodology, a reading practice tool can also play a sentence in phrases whilst highlighting the sentences.
[0217] The split concentration principle of Cognitive Load Theory states that information from multiple sources at once can reduce learning because attention is split. Once readers develop some reading competence, attention will not be split if the student only reads, or if the student just listens.
[0218] By clicking on a phoneme, syllable, word, phrase, phrasal verb, idiom etc., the reading practice tool could also play individual words, syllable, phonemes, phrases, phrasal verbs (to come up with means to think of) and idioms with or without intonation and cadence.
[0219] Similarly, the reader could click a word, phrase, phrasal verb, idiom etc and be presented with the definition or translation of words, phrases, phrasal verbs, idioms etc.
[0220] The user inputs may include user-selected values, and the method includes receiving the user-selected values by:
a. the computing system presenting text input prompts and/or selectable lists to the user; and
b. the computing system receiving the user-selected values as inputs in the text input prompts and/or selectable lists,
wherein the method includes the computing system classifying the user into one of the plurality of categories based on the user-selected values.
[0221] In step (2) of the method 100, information about the user collected at registration may include:
a. Name and contact details (phone, email etc.)
b. Country of residence
c. Native language
d. Age
e. Sex
f. Educational attainment in native language (primary, secondary, tertiary)
g. Professional qualifications
h. Self assessment of reading ability in native language
i. Years spent learning English (or other language being taught)
j. Educational attainment in English (e.g., IELTS 5)
k. Self assessment of reading ability in English
I. Self assessment of spoken English communication in English
m. What does the user want to learn English for? E.g. to study accounting.
[0222] Step 4 of method 100 is the User Classification System which uses information from the Registration System, step 2 of the method 100, to form user groups where the members in the group have similar knowledge and skills.
[0223] This will allow users to be conveniently categorized into groups by answering a small number of questions, and then the usage data of the group can be used to predict the optimal learning strategies for the new member.
[0224] For example, a new user responds to the questions stating their native language is Mandarin, they are 18, male and have IELTS grade 5. By analysing the recorded student data, many students in this category can recognize the English alphabet, have a 3-500 word vocabulary, can accurately discriminate about half the sounds of English phonemes, but find the discrimination of other phonemes problematical. Their pronunciation of English words containing the difficult phonemes is often poor.
[0225] From the database recording usage of students, it can be calculated that it is statistically likely that the new user will benefit from taking a specific course to improve auditory discrimination, and when auditory discrimination has improved, take a specific course on pronunciation.
[0226] The specific auditory discrimination course might teach syllables with difficult phonemes by teaching words with the phonemes that other users in the same group could hear and progressing to more difficult to hear syllables and then phonemes. In addition, a student may be better able to hear these words spoken by someone with the same native language as the user. When the student can accurately discriminate the sounds of syllables and phonemes when pronounced by a person with the same native language, the student can start working with sounds made by a native speaker of the base language, the language being learned.
[0227] These predictions will then be tested by giving the user activities and recording and analysing the outcomes and then modifying the course based on user data. For example, the user may have better auditory discrimination than the group and not need to have time spent on something the user already knew or had the skills. This user information will then be analysed to provide better group predictions in the future, and may cause a larger group to be split into two or more groups with more defined knowledge and skills.
[0228] If the user wished to study accounting, a vocabulary list can be generated which the student will need to know. The student will be tested on the vocabulary to determine what is known and what needs to be taught and the system will predict what words are known and what words need to be learned and this prediction will be tested by the interactive course the user will undertake.
[0229] In step 3 of method 100, exercises can provide additional data for classification of users into user groups using the User Classification System, step 4 of method 100, which may include:
a. measuring a user of a computing system (i.e., testing reader knowledge and skills) by:
b. the computing system presenting a test text in the primary alphabet to the user (speaker of the secondary alphabet) by displaying the test text visibly or playing the test text audibly, wherein the test text includes: a plurality of words that can be selected by the user using the user interface ("user-selectable words") including at least one test word and one or more distractor words (which are not the test word), and
c. the computing system measuring user selections of the user-selectable words, including measuring how many of the least one test words are user selected, and/or how much time is taken to select the test words-wherein the user is the speaker of the secondary alphabet reading the test text (e.g., identifying how many errors and/or what errors are in the text test, measuring a time lag between the word playing and receiving the user selection); d. the computing system classifying the user into one of a plurality of categories based on the measurements (and optionally based on additional user-selected values, described hereinafter, and optionally using a trained classifier, e.g., a trained artificial neural network, ANN, trained on data from training users in each of the plurality of the user category groups); and e. the computing system selecting the phoneme set from the plurality of sets based on the user category (i.e., use the category of the user to select the character set for the encoding) using a predefined mapping between user categories and character sets (which is a many to one mapping, as many different groups/categories map to English/English alphabets).
[0230] Together the user classification system, step 4 of method 100, data for the user classification system, step 5 of method 100 and user classification algorithm improvement system step 6 of method 100 form and Step 7 of method 100, User assignment to a user group, form a heuristic system illustrated by the following examples.
[0231] A Chinese student aged 11 has 3 years of English training. Testing shows that other Chinese students aged 11 with 3 years of English training have difficulty clearly discrimination hearing certain English phonemes but these students can hear a number of syllables containing these phonemes that are pronounced by a native Chinese speaker. It has also been found that if these students learn additional syllables containing the difficult phonemes pronounced by a Chinese speaker, their auditory discrimination will improve and that they can then start correctly discriminating the easier syllables when a native English speaker speaks these syllables. The computer system predicts from the usage results of other students that the student should be presented with the same syllables in the same order. As more students use the system, some students may learn more quickly than others, and there may be a fast track for students finding the learning easier. The system analyses the results of the student and based on the logged usage, will provide learning activities to maximize the individual performance of that student.
[0232] Further information for the user classification improvement algorithm, steps 5 and 6 of method 100, is illustrated in the following examples.
[0233] The training users are asked test questions like "how many errors are this in this short passage of text?", and the errors they identify are recorded. Based on their answers to these questions from manually categorized users, the ANN is trained to categorize users. ANN categorized users have their usage information logged and to provide information as to the accuracy of the prediction/classification. The data with the manually corrected user classification may be added to the ANN in training.
[0234] Once the user registration data and the additional data based on student activity have been received, the user is assigned to a user group in step 7 of the method 100 based on the probability that the user will have the best learning outcomes in that group. Once the user is assigned to a user group, the user is assigned a User ID and a group ID.
[0235] The English skill improvement database, step 8 of method 100, the Vocabulary Improvement Tools, step 10 of method 100, the Reading Improvement Tools, step 11 of methos 100 and the Comprehension Improvement Tools, step 12 of method 100, use logged user information to select exercises, exercise content, alphabet selection, word selection and priorities based on the calculated probability of optimal group learning outcomes, and when sufficient data is collected, individually optimized exercises, exercise content, alphabet selection and priorities.
[0236] For example, using the user information collected as described above, different optimized alphabets for different reader user categories ("groups") can be provided. Each group (e.g., of language students) has a learning profile. For example, for each language, there may be three (or more) groups: e.g., beginner, intermediate and advanced. By dividing users into groups via the classification system, the teaching steps described hereinafter may be optimized for these users based on their groups by predicting what they know and don't know, teaching what they don't know, and not teaching what they do know.
[0237] The classification system can select the phoneme set that is individualized and optimized for the group of individuals based on their age, their language mastery (including of the IPA), their education level, their English literacy (e.g., if someone knows the English phoneme set then they can use the English/English alphabet).
[0238] Critical to the method 100 may be the collection of all user registration and activity in a database that can measure user progress which is step 9 in method 100. The data is collected with as much detail as possible and non SQL databases like DymanoDB from Amazon will allow for the restructuring the database if required.
[0239] The method may include the computing system authenticating and identifying a user by connecting to a learning management system (LMS, e.g., from Blackboard Inc., DC USA) in which the user has a user account, e.g., through an application programming interface (API) of the LMS. The method may include sending the user category and/or the measurements (measured values) to the LMS associated with the user account for storage with a user record in the LMS. There are several formats which can be used here which include: xAPI, Tin Can, Sharable Content Object Storage Model (SCORM).
[0240] There are a number of Reading Improvement Tools, step 11 of the method 100 which are described below.
[0241] The method described herein encodes text document in a manner optimised for personal learning of a language, e.g., English, based on the language of the learner, e.g., Japanese: thus the encoded text is optimised for each individual/student. The method uses sound characters that are also characters in the primary alphabet (e.g., English) or in the secondary alphabet (e.g., Japanese) so the human reader does not need to learn new symbols (e.g., glyphs).
[0242] The method includes testing the user to find the best character set for that user based on their known language (the "secondary language" herein) and other listening, reading and pronunciation abilities.
[0243] The reader's mental overheads may be reduced if the symbols have optimal automaticity both in ease of recognition and also in recall of sound. For beginners, characters in the native language of the reader will be easily recognizable visually and also the reader will know what sound it makes. So if there are phonemes in the primary language (e.g., English) that are the same in the secondary language (e.g., not English), and if the reader has not mastered English letter/sound association (i.e., the sounds English characters make as their usual sound)-referred to as "full automaticity", a fast way to learn these sounds is to see the English character and the secondary character making the same sound. (Obviously if there are sounds in English that are not in the secondary language, then there will be no characters making those sounds. Either the English characters will be used, or IPA characters. The user will need to be taught how to hear and make those sounds. Students cannot self correct their pronunciation unless they can accurately discriminate the differences between their pronunciation and that of a native speaker of the language they are learning.) In addition, the reader may be more confident in hearing and speaking the sounds if they see sounds in their native (secondary) language rather than the IPA if the students are not confident of their knowledge of the IPA. In contrast, countries like Japan and China teach the IPA to their students. The secondary alphabet may be the IPA, e.g., by having IPA characters over Chinese characters.
[0244] As the method includes identifying pre-existing reader knowledge, efficiency of teaching the reader the primary language may be improved (it is not efficient to teach someone something they already know). As the method includes classifying the reader into one of a plurality of reader groups (or "categories"), the level of the reader's knowledge may be estimated quickly, with only a few questions.
[0245] General comprehension of any language requires accurate auditory discrimination, accurate pronunciation and fluent reading. In addition, nobody will be able to understand spoken or written information if they do not understand the meaning of the words used in the communication. Development of vocabulary is essential for improved comprehension. Comprehension Improvement Tools, step 12 of the method 100 provide of efficient ways to improve reader comprehension, examples of which are described below. There is overlap between the tools. An example is auditory discrimination. Improving a user's auditory discrimination will improve that user's ability to hear and understand spoken words (accurate listening and improved comprehension of oral communication), will improve the pronunciation (accurately discrimination spoken phonemes will allow more effective self correction of pronunciation), and knowing how to spell a word enable the user to know what to listen for.
[0246] A second example is learning to sound out a word, which will improve auditory discrimination, pronunciation and will efficiently develop sightword recognition which will improve reading fluency and comprehension.
[0247] Given the interrelated nature of language learning, a reference to developing a specific skill, e.g., sounding out words, will include the other benefits such as increasing the number of words a user can recognize by sight, improvements to auditory discrimination and improvements to pronunciation.
[0248] Examples of the Reading Improvement Tools, step 11 of the method are set out below. The elements of step 11 include:
a. the computing system dividing a word (from the source text , e.g., "cat") into a plurality of individual phonemes/partial syllables/syllables including the first character/letter and progressively more characters/letters/digraphs (two letters representing a different sound, e.g. the digraph "ph" represents the sound/f/), incrementing by one phoneme (character or digraph other than a silent character) for each partial syllable/syllable (each including the first phoneme of the word plus zero or more phonemes of the word in order, up to reconstructing the full word);
b. the computing system sounding out (playing) the partial syllables/syllables progressively in order of length (e.g., /c/, /ca/, /cat/) from the user interface of the computing system for the user to hear (i.e., playing the sequence) (the symbols /cat/ represent the sound of the word "cat" when pronounced by a native English speaker);
c. the computing system instructing the user to repeat the partial syllables/syllables via the user interface; d. the computing system recording the user speaking the syllables/partial syllables; and e. the computing system sounding out the set of the partial syllables/syllables and then the set of the user's recorded partial syllables/syllables (e.g., the user's recording of /c/, /ca/, /cat/ and then the official recording of /c/, /ca/, /cat/) and the sounding out step can be repeated two or more times.
[0249] Students may need to learn blending: e.g., that /c/, /a/, /t/ makes the sound /cat/. Some students already know this, especially students whose native language is phonetic, like Finnish, Italian or Spanish. Most systems do not teach the progressive phoneme blending where a student learns to progressively implement the partial syllable sound phoneme by phoneme /c/, /ca/, /cat/. Usually a student hears /c/ /a/ /t/ and is told this makes the sound /cat/. The current teaching practice requires at least 4 bits of information in short term memory for a three phoneme word. Longer words will cause higher loads on working memory. The strategy of breaking words into syllables, learning how to blend each syllable, and then pronouncing the syllables quickly one after another to pronounce the word can significantly reduce the load on short term memory. Blending can be taught online using a system where students hear /c/, /ca/, /cat/ etc and then they are recorded and they can play back their recordings to compare with a native speaker making the same recording. Some people whose native language is logographic (like Mandarin) may not be good at sounding out words (blending) and these people will require teaching.
[0250] Computerized voice recognition systems such as those downloadable from www.speechace.com can be used to measure the reading skills of users by measuring the words that were read correctly and those words that were not read correctly, and the accuracy of the pronunciation of users. This information is logged and used to classify users into groups, optimize exercises, predict optimal exercises for particular users, and provide feedback on the accuracy of the predictions.
[0251] In learning to sound out words, students will learn to read syllables by sight and this means if a student sees a syllable, they will instantly know its sound. There are at least 13,802 words made up solely of the most common 500 syllables. For many students, learning just 500 syllables will enable them to read basic English. This is vastly more efficient than having to learn 13,802 words individually, especially as many words are non phonetic and need to be learned by rote, which can take a lot of repetition. The efficiency of the system is further enhanced by the standardization of the Fonetic English mark up, by clearing marking prefixes and suffixes, which makes it easier to decode the meaning of a word, and by the standardization of the Fonetic English syllable breaks, which reduces the number of syllables that need to be learned.
[0252] Over half of the most common English words are non phonetic, meaning that these words are not pronounced as they are spelled. These non phonetic words must be learned by rote, requiring a lot of repetition to learn each word. There are over 5000 words that need to be learned by repetition. Compare this approach to teaching people to quickly learn 500 syllables by sight by sounding them out 2 or 3 times. Syllables are short and easy to learn. Knowing the syllable sounds makes it easy to sound out words, as we pronounce words syllable by syllable. Breaking words into phonetic syllables and teaching the most common syllables is significantly more efficient than the current rote learning approach.
[0253] Several word frequency lists have been produced by academics that rank English words by their frequency in documents. Syllables in the most common 1000 words will necessarily occur more frequently that those syllables occurring in say the 5000-6000 most common words. A database of all marked up Fonetic English words together with the frequency of the word in English documents will enable the creation of a database of all Fonetic English syllables with their weighted frequency, which takes into account both the frequency of the words containing a syllable and the number of words in which that syllable occurs. For example, a syllable may occur in 30 of the most common 1000 English words. A second syllable may occur in words whose frequency in English documents is less frequent than the most common 9000 words but more frequent than the most common 10000 words. Clearly the syllable occurring in the first 1000 words will be viewed more frequently than words whose frequency lies between 9-10000.
[0254] An example of the method of improving reading by using syllabification may include an efficient way of teaching the sounds of a list of words that a user needs to learn, e.g. for a medical course:
a. the words in the list are marked up into Fonetic English if they are not already marked up;
b. the computing system analyses the words in the list and creates a list of syllables together with the frequency that the syllables occur in the list;
c. the computing system prioritize the syllables to be taught by giving those syllables that the user is predicted to know a lower priority, based on their user classification;
[0255] Students learning English for the first time may be taught the sounds of the most common words and syllables, e.g., the most common 1-200 or more syllables, to significantly reduce the cognitive load required to learn to read because the reader will often only need to sound out a few characters in a new syllable in a new word containing a common syllable.
[0256] An example of the method of improving auditory discrimination may include:
a. the computing system displaying characters representing phonemes, syllables and words on the user interface and simultaneously sounding out (playing) the sound files containing the phonemes, syllables and words (the method includes showing the characters so that the user can see what to listen for, and can simultaneously hear the word pronounced, or pronounced syllable by syllable while seeing the words, syllables or phonemes displayed); and
b. requiring the user to take some form of action to provide user input: e.g., hearing a sound and clicking on the collection of characters that best represent that sound.
[0257] Auditory discrimination of English phonemes, syllables and words is improved because a student knows what to listen for. Many people have heard a person say their name and were not able to accurately discriminate the phonemes of the name. When given a card with the name spelled out, people can often discriminate what was said. The system may make this easier by also finding out what accents the person can hear and presenting the sounds in the accents a person can hear and then progressively moving the accents towards a native English speaker.
[0258] An example of the method to improve auditory discrimination may include:
a. the computing system measuring a user of a computing system (i.e., testing reader skill and knowledge) by:
b. the computing system presenting a test text in the primary alphabet to the user (speaker of the secondary alphabet) by displaying a plurality of test texts visibly ("a" and "e") and playing one of the test texts audibly (which can be the sound for "a" or "e"), and
c. the computing system measuring user selections of the user-selectable words, including measuring how many of the at least one test words are user selected, and/or how much time, to determine auditory discrimination (e.g., between "a" and "e"); and
d. if the measurement is below a preselected threshold, the computing system providing a plurality of syllables and words with the test texts and additional characters and phonemes (visibly and audibly in one or more accents) for the user to train their auditory discrimination, including providing the plurality of syllables and words starting with short syllables and progressing to long syllables and then words increasing in length.
[0259] By recording and analysing the user interactions, the system can intelligently determine which words, syllables and phonemes are problematical, and which are not, and develop different learning strategies for the problematical words, syllables and phonemes which can include using logged user data to intelligently predict other activities to improve learning outcomes, such as different accents, finding words that the user can discriminate, adding difficult phonemes and/or syllables to test whether this improves learning and skill development outcomes and so on.
[0260] An example of the method of improving auditory discrimination may include:
a. the computing system selecting phonemes (sounds) that are in the primary alphabet but not the secondary alphabet based on a predefined phoneme chart (or data representing a phoneme chart), e.g., the Japanese phoneme chart in FIG. 4, which may include phonemes of the primary alphabet and phonemes of the secondary alphabet connected or linked by a corresponding IPA symbol making the same sound.
[0261] For auditory discrimination, only those sounds that are not the same need to be taught.
[0262] An example of the method of improving auditory discrimination may include teaching auditory discrimination by:
a. the computing system playing a test sound for the user to hear;
b. the computing system displaying a plurality of characters including a test character or characters representing the test sounds in the language; and
c. the computing system measuring whether the user clicks on the test character or characters.
[0263] An example of the method of improving auditory discrimination may include recording the test sound by someone from the same language background as the student recording the sounds (i.e., whose native language is the secondary language), or by a native language speaker of the primary language. This method may allow the user/student to build new neural networks in the brain when the student can discriminate something approximating the sound, and/or allow the student to be more motivated to practice if they have initial success.
[0264] An example of the method of improving auditory discrimination may include recording and playing words and syllables to teach sounds that are in the base alphabet but are not in the language with which the learner is familiar, e.g., not in the secondary alphabet. An analysis of the IPA characters in 2 languages will show what phonemes are the same in each language ("common phonemes"). This information can be used to measure the accuracy of the auditory discrimination of different learners when they hear phonemes, syllables or words containing only common phonemes pronounced in different accents. If a learner can hear accurately discriminate some phonemes in a foreign language, they will be able to efficiently expand their discrimination to include other phonemes in a syllable if they can accurately discriminate some phonemes in that syllable. If a learner cannot discriminate a phoneme, simple repetition will not teach that phoneme to that learner. For example, if someone cannot hear the difference between /a/ and /e/, they are likely to be able to hear the difference in words like /cat/ and /ket/ if the letters "c", "k" and "t" are common phonemes. The system experimentally determines what the syllables and words are that are easiest for the student to accurately discriminate by testing different combinations with a lot of students, and using statistical analysis to determine the best combinations for each of the different student groups. The result will be an ordered set of syllables that will be taught to learners using different accents spoken at different speeds that will rapidly, effectively and efficiently build the auditory discrimination skills of the learner.
[0265] An example of the method may include teaching auditory discrimination by:
a. the computing system recording the user pronouncing a test character (and/or phoneme/syllable/word); and
b. the computing system repeatedly playing the user's recording for the user to hear;
c. the computing system repeatedly playing a prerecorded pronunciation of the test character (and/or phoneme/syllable/word) after the user's recording such that the user can hear a difference between the user's recording and the prerecorded pronunciation; and
d. the user's recording and the prerecorded recording may be played one after the other.
[0266] The prerecorded pronunciation can be prerecorded in different accents to test what accents the user can hear, and then progress the user to being able to hear all the sounds of the language they want to learn pronounced by a native speaker.
[0267] In order to understand what is being spoken, someone listening needs to be able to discriminate the sounds of words as they are being spoken. When a native speaker speaks in their native language, their words are not pronounced one after the other with a space of silence between the words. Instead, the words are run together with a lower volume of sound indicating the separation between the words. When learning a new language, a person must learn not only to discriminate the sounds of individual words, but also to discriminate individual words spoken rapidly when there is no silence between the spoken words.
[0268] One way to teach the ability to accurately discriminate words spoken by a native speaker of that language is to teach the sounds of the individual words played independently. This may involve teaching the sounds of the syllables and then teaching the sound of the word by combining the syllables, e.g.:
a. Play the sounds of individual words with discrete silences between the words, reducing the duration of the silences between the words;
b. Play the sounds of the words as if spoken slowly by a native speaker; and
c. Play the sound of the words spoken at a normal rate.
[0269] The efficacy of the system can be further improved by having the sounds of the words spoken by someone with the same language as the person learning, and when the learner can accurately discriminate the words spoken with a familiar accent, the learner can then go through the same process with words spoken by a native speaker.
[0270] The system can be made more efficient by predicting the words and syllables that the learner is likely to hear based on the experiences of other learners in the same group, and teaching the words and syllables that the learner is most likely to encounter, using the prediction techniques described in this document.
[0271] If a person is taught to spell a word, and if they know the sounds of the phonemes and how to blend phonemes, they may master that word. If they do not know the meaning of a word, it is unlikely they will be able to understand a sentence in which the word is used. Vocabulary is therefore essential to being able to read and understand a document.
[0272] Vocabulary Improvement Tools are step 12 of the method 100 addresses the need for efficient vocabulary acquisition, with examples provided below.
[0273] The vocabulary acquisition burden varies subject by subject. In many subjects, the acquisition of vocabulary is a critically important to being able to understand the topic. In medicine, for example, some doctors estimate that vocabulary acquisition is the single most time consuming part of the course.
[0274] Vocabulary acquisition is made more difficult because many medical terms are not pronounced as they are spelled. This means that simply recognizing the word and its sound - sightword recognition - will take considerable repetition. Much lower repetition would be needed if the word could be sounded out character by character. So having words marked up phonetically will significantly improve any vocabulary acquisition activity.
Vocabulary acquisition will also be improved by efficiently teaching the sounds of the most common syllables in the list of words students are required to learn.
[0275] Recognition is seeing or hearing a word and knowing its meaning. Recall is hearing a definition of a word or seeing an image of a word, and recalling its name. Recognition is easier to acquire than recall and is usually acquired first.
[0276] Vocabulary recognition can be acquired by having people pair the marked up word with an image, a definition, a translation or getting the student to fill in the correct word into a sentence, choosing one word from maybe 4 words. Recall acquisition exercises include seeing a definition of a term or an image related to the term, and having to type in the name, i.e. there is no prompt to trigger recognition.
[0277] There are various formats to do this. Think of a diagram of the bones in a foot. The system shows students the names of a few bones. Then the system has the names of say 4 or 5 of these bones in a list and they have to click on an arrow pointing to a bone to develop recognition. Then when there is good recognition, the system helps people to acquire recall, as they have to fill in the name by typing without any recognition prompts.
[0278] The system can do the same with statistics. Showing lines on a bell curve can show e.g. what a standard deviation is. Eventually the aim is getting the student to learn and understand the equation. Again, going from recognition to recall.
[0279] Humans have evolved to remember information they understand far more easily than random information. So separating the acquisition of vocabulary from learning about e.g. the function of the vocabulary terms to be acquired can actually impede long term memory development. Consider the human foot example. If a student needed not just to name the bones, but had to name adjacent bones, and how these adjacent bones could move relative to each other, the student would know a lot more about the foot than the names of the bones on a diagram. Describing how the weight of the body was distributed across the bones of the foot would provide the student with further information about the anatomy of the foot at the same time as acquiring the vocabulary. This kind of understanding will require less repetition and revision to remain in long term memory.
[0280] The benefits of a deeper understanding of a vocabulary term, such as in the foot example above, include:
a. saving student time and effort by acquiring information about a term at the same time as learning the sound or the term as written and how it is spelled;
b. making the vocabulary acquisition process more interesting for the student, so that the student will be motivated to spend more time and make greater effort; and/or c. vocabulary learned in this way is more likely to be able to be recalled for longer with far less revision.
[0281] For many people, vocabulary acquisition is to learn a word, forget it, learn it again, forget it again and so on until the word is stored in long term memory. Once in long term memory, there is still a need to revise the word, but this does not require the frequency of repetition to learn the word initially.
[0282] There is a need to optimize the repetition needed to put the meaning of words into long term memory, to minimize the student time and effort involved. In order to recall the words in long term memory, repetition is required. When the word is in long term memory, the repetition periods will get considerably longer. The aim is to develop a system to minimize the number of repetitions required. The information to do this will be based on testing a student with different words at different elapse times to estimate when the student will start forgetting the meaning of words and ensure repetition and the refreshing of the long term memory happens before the students forgets that word.
[0283] The method of an integrated vocabulary acquisition that may contain the following elements:
a. providing a phonetic rendering of the vocabulary terms to be acquired;
b. optimizing the phonetic rendering to make the root word and the prefixes and suffixes easily recognizable and assisting students to understand the meaning of the word by understanding the prefixes, suffixes and the root word;
c. experimentally testing different interactive vocabulary acquisition methods and logging all user activity to create a database that can be used to predict the optimal learning strategy of an individual student;
d. collecting and analyzing user registration data to create user groups that are predicted to have similar learning outcomes;
e. using the user registration information to assign a new user to a user group;
f. using the usage data collected by members of the group to which the new user has been assigned to predict the optimal interactive exercises for the new user, and the optimal revision regime;
g. experimentally testing the predictions and modifying the predictions for a user based on actual user data; and
h. using this new user data to modify the user group classifications and the predictions based on user group membership.
[0284] The method may provide a comprehensive, integrated language-teaching system for reading, hearing speaking, spelling and vocabulary built around a phonetic alphabet with characters meaningful to a group of readers made up of the primary language characters (e.g., English characters) and where the primary language character does not make its usual sound, a sound character (in size) from the primary language, the secondary language, or the IPA, e.g., as a superscript over the primary-language character.
Implementations
[0285] The method described herein may be embodied in computer readable/machine-readable instructions compiled from source code written in the C# programming language.
[0286] The method of encoding the text document is performed/executed by a specifically programmed computing system 200.
[0287] FIG. 2 is a schematic block diagram of the specifically programmed computing system 200 ("computer system 200").
[0288] In the described embodiment, the system 200 includes a standard computer system 200 such as a commercially available personal computer or server computer system based on a 32-bit or 64-bit Intel architecture, and the processes and/or methods executed or performed by microprocessors of the system 200 are implemented in the form of programming instructions of one or more software components or modules 202 stored on non-volatile (e.g., hard disk) computer-readable storage 204 associated with the computer system 200, as shown in FIG. 2. At least parts of the software modules 202 could alternatively be implemented as one or more dedicated hardware components, such as application-specific integrated circuits (ASICs) and/or field programmable gate arrays (FPGAs).
[0289] The computer system 200 includes at least one or more of the following standard, commercially available, computer components, all interconnected by a bus 216: random access memory (RAM) 206, at least one computer processor 208, and external computer interfaces. The external computer interfaces include: universal serial bus (USB) interfaces 210 (at least one of which is connected to one or more user-interface devices, such as a keyboard, a pointing device (e.g., a mouse 218 or touchpad), a network interface connector (NIC) 212 which connects the computer system 200 to a data communications network such as the Internet 220, and a display adapter 214, which is connected to a display device Q322 such as a liquid-crystal display (LCD) panel device.
[0290] The computer system 200 includes a plurality of standard software modules, including: an operating system (OS) 224 (e.g., Linux or Microsoft Windows); web server software 226 (e.g., Apache, available at http://www.apache.org); scripting language modules 228 (e.g., personal home page or PHP, available at http://www.php.net, or Microsoft ASP); and structured query language (SQL) modules 230 (e.g., MySQL, available from http://www.mysql.com), which allow data to be stored in and retrieved/accessed from an SQL database 232.
[0291] Together, the web server 226, scripting language 228, and SQL modules 230 provide the computer system 200 with the general ability to allow users of the Internet 220 with standard computing devices equipped with standard web browser software to access the computer system 200 and in particular to provide data to and receive data from the database 232. It will be understood by those skilled in the art that the specific functionality provided by the system 200 to such users is provided by scripts accessible by the web server 226, including the one or more software modules 202 implementing the processes described herein, and also any other scripts and supporting data 234, including markup language (e.g., HTML, XML)scripts, PHP (or ASP), and/or CGI scripts, image files, style sheets, and the like.
[0292] The boundaries between the modules and components in the software modules 202 are exemplary, and alternative embodiments may merge modules or impose an alternative decomposition of functionality of modules. For example, the modules discussed herein may be decomposed into submodules to be executed as multiple computer processes, and, optionally, on multiple computers. Moreover, alternative embodiments may combine multiple instances of a particular module or submodule. Furthermore, the operations may be combined or the functionality of the operations may be distributed in additional operations in accordance with the invention. Alternatively, such actions may be embodied in the structure of circuitry that implements such functionality, such as the micro code of a complex instruction set computer (CISC), reduced instruction set computer (RISC), firmware programmed into programmable or erasable/programmable devices, the configuration of a field- programmable gate array (FPGA), the design of a gate array or full custom application-specific integrated circuit (ASIC), or the like.
[0293] Each of the blocks of the flow diagrams of the processes of the computer system 200 may be executed by a module (of software modules 202) or a portion of a module. The processes may be embodied in a machine-readable and/or computer readable medium for configuring a computer system to execute the method. The software modules may be stored within and/or transmitted to a computer system memory to configure the computer system to perform the functions of the module.
[0294] The computer system 200 normally processes information according to a program (a list of internally stored machine-readable instructions such as a particular application program and/or an operating system) and produces resultant output information via input/output (I/0) devices. A computer process typically includes an executing (running) program or portion of a program, current program values and state information, and the resources used by the operating system to manage the execution of the process. A parent process may spawn other, child processes to help perform the overall functionality of the parent process. Because the parent process specifically spawns the child processes to perform a portion of the overall functionality of the parent process, the functions performed by child processes (and grandchild processes, etc.) may sometimes be described as being performed by the parent process.
[0295] The data generation, data storage and data communications operations relate to digital data operations. The digital data may include electronic data defined by logic circuits-which may include binary logic circuits-represented by electronic quantities, which may include voltage, current and/or resistance.
Interpretation
[0296] Many modifications will be apparent to those skilled in the art without departing from the scope of the present invention.
[0297] The presence of "/" in a FIG. or text herein is understood to mean "and/or", i.e., "X/Y" is to mean "X" or "Y" or "both X and Y", unless otherwise indicated.
[0298] The term "letter" and the term "character" have the same meaning in this document, unless a contrary intention is clear from the usage.
[0299] A lowercase letter within lines such as /a/ represents the usual sound made by the letter "a" in e.g. the word "at". An uppercase vowel within lines such as /A/ says the names of the vowel, such as the sound A makes in the word "ape".
[0300] As used herein, the term "set" corresponds to or is defined as a non-empty finite organization of elements that mathematically exhibits a cardinality of at least 1 (i.e., a set as defined herein can correspond to a unit, singlet, or single element set, or a multiple element set), in accordance with known mathematical definitions (for instance, in a manner corresponding to that described in An Introduction to Mathematical Reasoning: Numbers, Sets, and Functions , "Chapter 11 : Properties of Finite Sets" (e.g., as indicated on p. 140), by Peter J. Eccles, Cambridge University Press (1998)). Thus, a set includes at least one element. In general, an element of a set can include or be one or more portions of a system, an apparatus, a device, a structure, an object, a process, a procedure, physical parameter, or a value depending upon the type of set under consideration.
[0301] The reference in this specification to any prior publication (or information derived from it), or to any matter which is known, is not, and should not be taken as an acknowledgment or admission or any form of suggestion that prior publication (or information derived from it) or known matter forms part of the common general knowledge in the field of endeavor to which this specification relates.
[0302] Throughout this specification and the claims which follow, unless the context requires otherwise, the word "comprise", and variations such as "comprises" and "comprising", will be understood to imply the inclusion of a stated integer or step or group of integers or steps but not the exclusion of any other integer or step or group of integers or steps.
[0303] It is to be noted that the discussions contained in the "Background" section should not be interpreted as a representation by the present inventor(s) or the patent applicant that such discussion, or referenced documents or device, in any way form part of the common general knowledge in the art.
APPENDIX A
Example IPA to compound-word translation table
M0 0 0 C UU=
ae /a/ a al a <a:a> S /e/ e el e, a, u <e:e> <e:a> <e:u>
o /o/ 0 01 o, a, e <0:0> <o:a> <o:e> A andr /uh/ u U1 a, e,i,0o,u,y <u:u> <u:a> <u:e> <u:i> <u:o> <~
ei /ay/ A a,e, i,0o,u,y <A:a> <A:e> <A:i> <A:o> <A:u> <A:y
l ee! E a, e,i,0o,u,y <E:e> <E:a> <E:i> <E:o> <E:u> <E:y
ax i i, a, e <1.1> <I~a> Ie
or /oh/ 0 o, a, e <0:0> <0:a> <o:e> ju
or /yoo/ U u, e <U:u> <U:e>
U /00/ 00 0, e, u <00:00> <oo:e> <00:u> U /00/ Go 00, U <00c:00> < GO: u>
au loU! ow ow, ou <ow:ow> <ow:ou>
01 /ay! oy Oy,01i <oy:oy> <oy:oi> or /ar/ ar a r, a, <ar:ar> <ar:a> or /or/ or or, o <or:or> <or:o> 3r /Ur/ ur ur, er < ur: u r> <ur:er> b /b/ b b b> k /k/ k k k> d /d/ d d <d:d> f If/ f f, ph <f:f> <f:ph> g /g/ g g <g:g> h /h/ h h <h:h> d3 hi/ j j, d, g <j:j> <j:d> jg I /I/ I I <1:1>
m /M/ m m M> n /n/ n n n> p /p/ p p p> r /I/ r r<r:> s /s/ S S, c <s:s> S> t It! t t t> v /V/ v v, ph <v:v> <v:ph> w /w/ w W, U, 0 <w:w> <w:u> <w:o> ks /X/ x x <x:x> j /y/ y y y> z /z/ z z <z:z> tJ /ch/ ch ch <ch:ch> f /sh/ sh sh <sh:sh> r0 /ng/ ng ng <ng:ng>
6 /th/ th th <th:th>
6 th/ th the LOUD th <th:th>
3 /zh/ zh g , Z <zh:zh>
APPENDIX B
TABLE A: Usual Sounds of English Characters
Usual Sound of English International Example word English Characters phoneme Phonetic Alphabet
a /a/ o apple e /e/ e egg i /ih/ 1 igloo 0 /o/ D octopus u /uh/ A and/or a umbrella b /b/ b bed c /k/ k cat d /d/ d dog f /f/ f fun g /g/ g goat h /h/ h hat /j/ d3 jump k /k/ k king I /I/ I like m /m/ m man n /n/ n neck p /p/ p pen q /k/ k quick r /r/ r rat s /s/ s sun t I/ t ten V /v/ v van w /w/ w wind x /x/ ks six y /y/ y yes z /z/ z zoo
TABLE B: Sounds of English Vowels saying their name
Sounds of English English International Example word vowels saying phoneme Phonetic their name Alphabet
A /ay/ er ape E /ee/ i eat I /i/ ai ice o /oh/ ou open U /yoo/ yu unit
TABLE C: Usual Sounds of English Digraphs
Sounds of English English International Example word Digraphs Digraph Phonetic Sound Alphabet ch /ch/ ti cheese sh /sh/ I shoe ng /ng/ 0 long th /th/ e thumb th Ith/ 6 them zh /zh/ 3 vision 00 /oo/ u boot 00 /oo/ u Book ow /ow/ au cow oy /oy/ 31 oil ar /ar/ ar art or /or/ or horse ur /ur/ 3r fur
APPENDIX C
The Fonetic English Alphabet
Fonetic English
Letter name Character Pair Example Word FE Code Lowercase Uppercase
syllable
stressed syllable
a <a:a> and al a A a says A <a:A> gr at a2 aA a says ar <a:ar> patn a3 a a says e <a:e> any a4 a A a says i <a:i> a5 a A a says o <a:o> wach a6 a A 0 0 a says o <a:o> 'biOreau a7 a A a says or <a:or> all a8 A a says u <a:u> a-bout a9 a A b or B <b:b> before b1 b B c or C <c:c> coroner c1 C C c says ch <c:ch> celIo c2 C C c says s <c:s> Aolide c3 C c c says sh <c:sh> inoo°i0n c4 c C d or D <d:d> golden d1 d D d says j <d:j> de d2 a O d says t <d:t> Walk d d3 d D
e <e:e> over el e E e rhymes with too <e:oo> crew e2 1 E esays A <e:A> they e3 eE e says E <e:E> be-fore e4 e ____ e says i ce:i> tn-glish e5 e E e says o <e:o> e6 E esayso0 <e:O> *bLY'reaul e7 e E e says u <e:U> the e eE esays U <e:u> fIle, e9 e E f or F <f>f irst ft f F g or G ______ get ~gl G g says f <g____> e-nough g2 g G g says j _________ rouge g3 g G h or H ______ hen hi h H <:>ink iiI I says E Stb-jdi-o *iE Q I saysI <i ____ fil-t BI
Ssaysu _________ epenodi j4 I jaor]I <j:j> jug 1___ __
k or K <k_____ kin ki k K I or L <1:1> lent 11 _ __ 1 L
mor M MM> men Ml m M n rN<~>nun ft n N o <0:0> cot O1 0 0 o says i <a:i> woomen oW 10 0 o sayso0 <o:O> moment 03 10 6 o says oo <o:oo> rouge 00 0 0 o says or <o:or> th0 ght 03 0 0 o says u <o:u> inococion 06 0 6 o says ur <o:ur> C -loonel 07 0 o says w <o:w> on.o8 0 0 o rhymes with boy <o:oy> coin 09 o as in foot <0:5> 10 0100 0 o rhymes with cow <o:ow> i-bout o1l 0 0 p or P <p:p> pin p1 p P ph says f <p:f> photo p2 ph ph says v <p:v> Ste°phef p3 ph q or Q <q:q> qiZ q Q r or R <r:r> red ri r R s or S <s:s> sent S1 S S SHo S.
s says sh <s:sh> sur s3 5 S s says z <s:z> •inoches s2 S S s says zh <s:zh> viO s4 S S t ot T <t:t> ten t t T elH CH t says ch <t:ch> ttin t2 F. t says sh <t:sh> loton t3 t T The Loud TH <th:xx that h2 t0 TH u <u:u> but Ul U U u says e <u:e> u2 U l u says i <u:i> u3 U u as in foot <u:> p4 0 U 00 rhymes with too <u:oo> bW0O U5 U 0 u says o <u:o> sure u6 U u says U <u:U> oS u7 says w <u:w> qu n u8 U v or V <v:v> vent v1 V V w or W <w:w> wet w1 w W x or X <x:x> Xe°n00 x1 X X y or Y <y:y> yes yl Y y says E <y:E> eveorV y2 y y says I <y:I> sk9 y3
' y says i <y:i> sync y4 V y says u <y:u> sy°ringe y5 y Y z or Z <z:z> zen z1 Z Z z says zh <z:zh> ViozIOn z2 Z Z
APPENDIX D
Some Bold, Italic and Bold Italic Fonetic English Characters in Fonetic English Alphabet
Bold, Italic and BoldItalic Fonts
Letter name Character Pair Regular Bold Italic BoldItalic
a <a:a> a a C a a says A <a:A> a a 3 a a says ar <a:ar> a a a a says e <a:e> a _ _ _ a a says i <a:i>a a a says o <a:o> 0 0 0 0 a says O <a:O> a a a or of, a says or <a:or> 5 a a a a says u <a:u>a a

Claims (12)

THE CLAIMS DEFINING THE INVENTION ARE AS FOLLOWS:
1. An integrated publishing method for converting/encoding a text document to a new format or formats, the integrated publishing method including:
a computing system receiving data representing a source text that includes a plurality of human-readable characters in a base alphabet forming a plurality of words;
the computing system encoding the source text by:
for each word in the source text that has one or more spelling characters that are identified as having a sound other than a usual sound for that character, using a replacement word with respective compound characters that each include the spelling character and a sound character, wherein the sound characters:
are human-readable characters in the base alphabet and/or in one or more secondary alphabets,
are added to spelling characters to indicate that the spelling character makes the usual sound of the sound character,
are displayed so that it is easy to discriminate spelling characters from sound characters, and
are added to the spelling characters such that the spelling characters and the sound characters remain human-readable such that the spelling character and the sound character of each compound character are within one visual field; and
the computing system outputting the encoded word in a human-readable form/format such that the encoded text includes the plurality of words from the source text with the compound characters visually indicate which of the spelling characters have a sound other than a usual sound for that character, and such that the spelling characters in the words in the encoded text are the same and in the same order as the characters in the respective words in the source text.
2. The method of claim 1, wherein the sound characters are in a preselected phoneme set that includes:
a plurality of sound characters in a secondary alphabet associated with phonemes in the base alphabet that can be defined/pronounced using characters in the secondary alphabet; a plurality of sound characters in the base alphabet associated with phonemes in the base alphabet that do not exist in the secondary alphabet; and/or a plurality of sound characters in the base alphabet associated with phonemes in the IPA that do not exist in the secondary alphabet.
3. The method of claim 1 or 2, including indicating silent characters, optionally by visually differentiating the silent characters from the spelling characters without changing shapes of the silent characters.
4. The method of any one of claims 1 to 3, wherein the adding of the one or more size sound characters includes adding a gap/space between the sound characters and the respective spelling characters such that, in the words in the encoded text, the spelling characters are not touching the sound characters or if the sound characters do touch the spelling characters, selecting a maximum overlap that is less than 5% of the length of the character less than 5%.
5. The method of any one of claims 1 to 4, wherein one or more of any lowercase sound characters are shaped differently from the corresponding uppercase characters, including having a different font.
6. The method of any one of claims 1 to 5, wherein the sound characters have a sound font size based on a source font size of the source text in a ratio of 6:9 and/or wherein the sound characters have a font size of at least 6 point.
7. The method of any one of claims 1 to 6, including:
receiving user inputs from a user of a computing system;
the computing system classifying the user into one of a plurality of categories based on the user inputs; and
the computing system selecting the phoneme set from the plurality of sets based on the user category using a predefined mapping between user categories and phoneme sets,
wherein the classifying includes:
the computing system generating measured values of the user's knowledge;
the computing system classifying the user into one of the plurality of categories based on the measured values;
the computing system presenting a test text in the primary alphabet to the user by displaying the test text visibly or playing the test text audibly, wherein the test text includes: a plurality of user-selectable words that can be selected by the user using the user interface including at least one test word and one or more distractor words which are not the test word; and the computing system measuring the values from user selections of the user selectable words, including measuring how many of the least one test words are user selected, and/or how much time is taken to select the test words.
8. The method of any one of claims 1 to 7, including encoding the source text by:
identifying at least one word in the source text that matches one of a plurality of preselected words in a preselected set of words formed of the base alphabet, wherein the identified words includes at least one stressed syllable and/or at least one unstressed syllable defined in the preselected set, wherein each syllable includes one or more of the spelling characters, and
replacing/adjusting the identified word by adding a dot preceding each syllable, wherein the spelling characters of the syllable remain unchanged, and wherein the dot is a closed dot for the stressed syllable, and/or an open dot for the unstressed syllable.
9. The method of any one of claims 1 to 8, including automatically generating a database of words for the encoding.
10. The method of claim 9, including providing a user interface for a user to manually select marked-up phonetic words for the base alphabet.
11. The method of any one of claims 1 to 10, including automatically encoding / marking up an English word into an encoded word, including silent characters, syllable breaks, stress syllables and/or the sound each character makes, based on inputs from a dictionary/database of word-IPA pairs comprising a plurality of words in the base alphabet and the International Phonetic Alphabet (IPA) representations of those words.
12. An integrated publishing system for converting/encoding a text document to a new format or formats that can be optimized for groups or individuals, the integrated publishing system configured to perform the method of any one of claims 1 to 11.
AU2022228148A 2021-11-16 2022-09-08 Method and system Active AU2022228148B2 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CA3238661A CA3238661A1 (en) 2021-11-16 2022-11-15 Method and system
PCT/AU2022/051361 WO2023087051A1 (en) 2021-11-16 2022-11-15 Method and system

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
AU2021903667A AU2021903667A0 (en) 2021-11-16 Method and System
AU2021903667 2021-11-16

Publications (2)

Publication Number Publication Date
AU2022228148A1 AU2022228148A1 (en) 2023-06-01
AU2022228148B2 true AU2022228148B2 (en) 2023-07-27

Family

ID=86499269

Family Applications (1)

Application Number Title Priority Date Filing Date
AU2022228148A Active AU2022228148B2 (en) 2021-11-16 2022-09-08 Method and system

Country Status (1)

Country Link
AU (1) AU2022228148B2 (en)

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2012071630A1 (en) * 2010-12-02 2012-06-07 Accessible Publishing Systems Pty Ltd Text conversion and representation system

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2012071630A1 (en) * 2010-12-02 2012-06-07 Accessible Publishing Systems Pty Ltd Text conversion and representation system

Also Published As

Publication number Publication date
AU2022228148A1 (en) 2023-06-01

Similar Documents

Publication Publication Date Title
Reed Why Teach Spelling?.
Nassaji The role and importance of lower-level processes in second language reading
Cook et al. An introduction to researching second language writing systems
Liberman et al. Speech, the alphabet, and teaching to read.
Kamii Early Literacy: A Constructivist Foundation for Whole Language. NEA Early Childhood Education Series.
Venezky In search of the perfect orthography
Carreker et al. Multisensory teaching of basic language skills activity book
Miller et al. Differences in the reading of shallow and deep orthography: Developmental evidence from Hebrew and Turkish readers
Moore et al. A cross-orthographic view of dyslexia identification
Lü et al. The roles of phonological awareness and oral vocabulary knowledge in English-Chinese biliteracy acquisition among Chinese heritage language learners
Hoover et al. The cognitive foundations of reading acquisition
Russak How can a fine-grained analysis of spelling errors inform our understanding of the development of spelling in EFL?
AU2022228148B2 (en) Method and system
Desberg et al. Word Attack Skills: Review of Literature.
Alipour et al. An Overview of the Persian EFL learners’ Spelling Difficulties
Salas Early development of text writing in two contrasting orthographies: English and Spanish
CA3238661A1 (en) Method and system
Genelza et al. Phonological Awareness and Word Decoding Ability of Second Year BSED-English Students in USEP–Tagum Campus
Vagh et al. The assessment of emergent and early literacy skills in the akshara languages
Gibreel et al. An investigation of English spelling achievement among second level Saudi students at King Khalid University
Martin L1 impacts on L2 component reading skills, word skills, and overall reading achievement
Hughes et al. Learning from children who read at an early age
Kart VISUAL PHONICS: RELEVANT FOR ALL EARLY READERS, ESPECIALLY STRUGGLING READERS.
Randall Second language reading Proficiency and word recognition: The concept of saliency and its application across different scripts
Awramiuk et al. Sound form signalization in L1 Polish, Czech and Slovak textbooks: In search of best practices

Legal Events

Date Code Title Description
CB Opposition filed

Opponent name: READABLE ENGLISH, LLC

CW Opposition withdrawn - proceed to grant

Opponent name: READABLE ENGLISH, LLC

FGA Letters patent sealed or granted (standard patent)