WO2021000756A1 - Method for spelling and labeling English text, spelling method, devices thereof, storage medium, and electronic device - Google Patents

Method for spelling and labeling English text, spelling method, devices thereof, storage medium, and electronic device

Info

Publication number
WO2021000756A1
Authority
WO
WIPO (PCT)
Prior art keywords
spelling
pronunciation
mark
unit
data
Prior art date
Application number
PCT/CN2020/097548
Other languages
English (en)
French (fr)
Inventor
陈俪
Original Assignee
陈俪
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 陈俪
Publication of WO2021000756A1

Classifications

    • G PHYSICS
    • G09 EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
    • G09B EDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
    • G09B5/00 Electrically-operated educational appliances
    • G09B5/06 Electrically-operated educational appliances with both visual and audible presentation of the material to be studied

Definitions

  • the embodiments of the present invention relate to information processing technology, and in particular to a method for spelling and marking English text, a method for spelling, a device, a storage medium, and an electronic device.
  • a word is the smallest independently meaningful unit in English, so becoming familiar with the logic of English starts with the word.
  • English words are composed of syllables, and each pronounced vowel forms a syllable. English is written in an alphabetic script, and 84% of English words conform to the pronunciation rules. Even words that do not conform often contain parts that do, or show special regularities. For students whose mother tongue is not English, the first step toward learning English well is reading words aloud: on seeing an English word, the learner can pronounce it quickly and, in most cases, correctly.
  • Phonics, that is, "the pronunciation of words", is also called "natural spelling".
  • Unlike pinyin, English phonics is not 100% regular, so the spelling of an English word alone often admits several possible pronunciations.
  • This is similar to Chinese characters: the glyph can be used to guess the pronunciation of a character, but a teacher or a dictionary is still needed to confirm it. Therefore, in actual English teaching, most schools and teachers still rely on the "holistic recognition method", which leaves phonics as a crutch that cannot play its due role.
  • the current English auxiliary pronunciation method namely phonetic transcription
  • the purpose of the embodiments of the present invention is to provide a spelling labeling scheme and a spelling scheme for English texts, so as to add pronunciation labels to English texts and effectively help learners learn English through natural spelling.
  • a method for spelling and labeling English text, including: acquiring English text data to be processed and pronunciation data of the English text data; dividing the English text data into at least one spelling unit, the spelling unit including one or more letters that form a basic pronunciation unit or a combined pronunciation unit; performing pronunciation annotation on at least one spelling unit according to the pronunciation data, so that the pronunciation annotations are integrated into the English text data; and providing the English text data integrated with the pronunciation annotations of the spelling units, so that the reader can read the English text directly with the help of the pronunciation annotations.
  • performing pronunciation annotation on the spelling unit according to the pronunciation data includes: adding a display mark to the spelling unit according to the pronunciation data, and/or adding a spelling prompt mark to the spelling unit according to the pronunciation data.
  • adding a display mark to the spelling unit according to the pronunciation data includes: according to the pronunciation data of the spelling unit, adding a color mark, an underline mark, and a cross-out mark to the spelling unit.
  • adding a spelling prompt mark to the spelling unit according to the pronunciation data includes: according to the pronunciation data of the spelling unit, adding a corresponding spelling prompt symbol mark or spelling prompt number mark to a spelling unit that has at least two pronunciations.
  • the spelling unit includes letters or letter combinations corresponding to single vowels, single consonants, compound vowels, compound consonants, or specific pronunciation combinations.
  • performing pronunciation annotation on at least one of the spelling units according to the pronunciation data, so that the pronunciation annotations are integrated into the English text data, includes: according to the pronunciation data, adding a corresponding spelling prompt symbol mark to the letter corresponding to a softened consonant, the letters corresponding to softened consonants including the letter c or the letter g; and/or, according to the pronunciation data, adding an underline mark to the letter combination corresponding to a silent consonant; and/or, according to the pronunciation data, adding an underline mark to the letter combination corresponding to a compound consonant; and/or, according to the pronunciation data, adding an underline mark to the letter combination corresponding to a specific pronunciation combination; and/or, according to the pronunciation data, adding a corresponding spelling prompt number mark or a corresponding prompt phonetic symbol to the letter combination corresponding to a compound consonant with at least two pronunciations.
  • performing pronunciation annotation on at least one of the spelling units according to the pronunciation data, so that the pronunciation annotations are integrated into the English text data, includes: according to the pronunciation data, adding color marks to the letters corresponding to single vowels with a long vowel or short vowel pronunciation, and adding different spelling prompt symbol marks to the letters corresponding to long-vowel single vowels and to the letters corresponding to short-vowel single vowels, respectively; and/or, according to the pronunciation data, adding a cross-out mark to the letter corresponding to a silent word-final vowel; and/or, according to the pronunciation data, adding an underline mark to the letter combination corresponding to a compound vowel; and/or, according to the pronunciation data, adding a corresponding spelling prompt mark to the letter combination corresponding to a compound vowel with at least two pronunciations.
  • providing the English text data integrated with the pronunciation annotations of the spelling units includes: writing the English text data integrated with the pronunciation annotations of the spelling units into a designated file; and/or displaying the English text data integrated with the pronunciation annotations of the spelling units.
  • the English text data is words, phrases, sentence fragments, entire sentences or one or more text paragraphs.
  • a method for spelling out English text, including: acquiring English text data integrated with pronunciation annotations, where at least one spelling unit in the English text data has been given pronunciation annotations by any of the foregoing methods for spelling and labeling English text; and spelling out each of the spelling units according to the pronunciation annotations.
  • the pronunciation mark includes a display mark and/or a spelling prompt mark of the spelling unit.
  • the display mark includes a color mark, an underline mark, and a cross-out mark;
  • the spelling prompt mark includes a spelling prompt symbol mark or a spelling prompt number mark.
  • spelling out each of the spelling units according to the pronunciation annotations includes: for a spelling unit without pronunciation annotation, spelling out the spelling unit according to its conventional pronunciation; for a spelling unit with pronunciation annotation, distinguishing the spelling unit according to its display mark, and spelling out the distinguished spelling unit according to its spelling prompt mark.
  • an English text spelling and labeling device, which includes: an acquisition module for acquiring English text data to be processed and pronunciation data of the English text data; a segmentation module for dividing the English text data into at least one spelling unit, the spelling unit including one or more letters forming a basic pronunciation unit or a combined pronunciation unit; a pronunciation labeling module for performing pronunciation annotation on at least one of the spelling units according to the pronunciation data, so that the pronunciation annotations are integrated into the English text data; and a providing module for providing the English text data integrated with the pronunciation annotations of the spelling units, so that readers can read the English text directly with the help of the pronunciation annotations.
  • the pronunciation labeling module is configured to: display and mark the spelling unit based on the pronunciation data, and/or add a spelling prompt mark to the spelling unit based on the pronunciation data.
  • the pronunciation labeling module is configured to: according to the pronunciation data in the spelling unit, color-mark the spelling unit, add an underline mark, and add a cross-out mark.
  • the pronunciation labeling module is configured to: add corresponding spelling prompt symbol marks or spelling prompt number marks to a spelling unit with at least two pronunciations according to the pronunciation data in the spelling unit.
  • the spelling unit includes letters or letter combinations corresponding to single vowels, single consonants, compound vowels, compound consonants, or specific pronunciation combinations.
  • the pronunciation labeling module is configured to: according to the pronunciation data, add a corresponding spelling prompt symbol mark to the letter corresponding to a softened consonant, the letters corresponding to softened consonants including the letter c or the letter g; and/or, according to the pronunciation data, add an underline mark to the letter combination corresponding to a silent consonant; and/or, according to the pronunciation data, add an underline mark to the letter combination corresponding to a compound consonant; and/or, according to the pronunciation data, add an underline mark to the letter combination corresponding to a specific pronunciation combination; and/or, according to the pronunciation data, add a corresponding spelling prompt number mark or a corresponding prompt phonetic symbol to the letter combination corresponding to a compound consonant with at least two pronunciations.
  • the pronunciation labeling module is configured to: according to the pronunciation data, add color marks to the letters corresponding to single vowels with a long vowel or short vowel pronunciation, and add different spelling prompt symbol marks to the letters corresponding to long-vowel single vowels and to the letters corresponding to short-vowel single vowels, respectively; and/or, according to the pronunciation data, add a cross-out mark to the letter corresponding to a silent word-final vowel; and/or, according to the pronunciation data, add an underline mark to the letter combination corresponding to a compound vowel; and/or, according to the pronunciation data, add a corresponding spelling prompt mark to the letter combination corresponding to a compound vowel with at least two pronunciations.
  • the providing module is configured to: write the English text data integrated with the pronunciation annotations of the spelling units into a designated file; and/or display the English text data integrated with the pronunciation annotations of the spelling units.
  • the English text data is words, phrases, sentence fragments, entire sentences or one or more text paragraphs.
  • an English text spelling device, including: an acquisition module for acquiring English text data integrated with pronunciation annotations, where at least one spelling unit in the English text data carries pronunciation annotations made by the device for spelling and labeling English text according to any one of claims 14-22; and a spelling module for spelling out each of the spelling units according to the pronunciation annotations.
  • the pronunciation mark includes a display mark and/or a spelling prompt mark of the spelling unit.
  • the display mark includes a color mark, an underline mark, and a cross-out mark;
  • the spelling prompt mark includes a spelling prompt symbol mark or a spelling prompt number mark.
  • the spelling module includes: a first spelling unit, used to spell out, according to the conventional pronunciation, a spelling unit that has not been marked with pronunciation; and a second spelling unit, used, for a spelling unit that has been marked with pronunciation, to distinguish the spelling unit according to its display mark and/or to spell out the distinguished spelling unit according to its spelling prompt mark.
  • a computer-readable storage medium having computer program instructions stored thereon, wherein the program instructions, when executed by a processor, implement the steps of any of the aforementioned methods for spelling out English text.
  • a computer-readable storage medium having computer program instructions stored thereon, wherein the program instructions, when executed by a processor, implement the steps of any of the aforementioned English text spelling methods.
  • an electronic device including: a processor, a memory, a communication element, and a communication bus.
  • the processor, the memory, and the communication element communicate with each other through the communication bus; the memory is used to store at least one executable instruction, and the executable instruction causes the processor to perform the operations corresponding to any of the methods for spelling and labeling English text.
  • an electronic device including: a processor, a memory, a communication element, and a communication bus.
  • the processor, the memory, and the communication element communicate with each other through the communication bus; the memory is used to store at least one executable instruction, and the executable instruction causes the processor to perform the operations corresponding to any of the methods for spelling and labeling English text.
  • English text of different granularities, such as single English words and phrases, can be divided into finer-grained spelling units, and each spelling unit can be given a pronunciation annotation according to the pronunciation data. The pronunciation annotations are integrated into the English text data as pronunciation prompt information, and the English text data integrated with the pronunciation annotations is provided, so that the reader can read the corresponding English text directly from the integrated prompts. This upgrades traditional natural phonics, which is about 80% reliable, into a 100% reliable method, effectively helping a learner with basic training read words accurately and confidently without looking up their pronunciation in a dictionary.
  • the marked text can also shift the learner's instant observation of an English word from the word as a whole down to the syllables that make it up. The meaning of a word is ultimately related to its form, so closer and more detailed observation of word forms helps learners naturally notice the regular changes in spelling while reading, thereby strengthening their feel for English words, building an intuition for meaning in reading, and enhancing their overall sense of English.
  • FIG. 1 is a flowchart illustrating a method for spelling and labeling English text according to some embodiments of the present invention
  • FIG. 2 is a flowchart showing a method for spelling out English texts according to other embodiments of the present invention.
  • FIGS. 3-12 are schematic diagrams respectively illustrating pronunciation annotation of English text by the phonetic annotation method according to some embodiments of the present invention.
  • Figure 13 shows an example of fusion of English text annotated by the aforementioned phonetic annotation method
  • FIG. 14 is a schematic diagram showing the structure of an electronic device according to an embodiment of the present invention.
  • FIG. 15 is a logical block diagram showing a device for spelling and labeling English text according to some embodiments of the present invention.
  • FIG. 16 is a logical block diagram showing a phonetic device for English text according to some embodiments of the present invention.
  • plural refers to two or more than two, and “at least one” refers to one, two or more than two.
  • any component, data, or structure mentioned in this application can be understood as one or more unless it is explicitly limited to one.
  • FIG. 1 is a flowchart illustrating a method for spelling and labeling English text according to some embodiments of the present invention.
  • English phonics establishes the association between letters or letter combinations and their pronunciation by directly teaching the pronunciation rules of the 26 letters and of letter combinations in words, allowing students to learn English spelling rules in a relaxed and pleasant atmosphere, so that they can read a word when they see it and spell it when they hear it.
  • Natural phonics is a practical tool and method. A learner who masters it can read 80% of English words without using phonetic symbols, which makes tedious vocabulary study simpler and achieves twice the result with half the effort.
  • the overall inventive concept of the present invention is to provide a method for pronunciation labeling of English text, so that the pronunciation labels integrated into the English text prompt English learners, effectively help them spell out English text using English phonics, and improve the efficiency of English learning.
  • step S110 the English text data to be processed and pronunciation data of the English text data are acquired.
  • the English text data to be processed may be a single word or a combination of any number of words, for example, a phrase including at least two words, a sentence fragment, an entire sentence, or one or more text paragraphs.
  • the phonetic transcription of each English word can be acquired by querying a dictionary, and these phonetic transcriptions constitute the pronunciation data of the English text data; alternatively, if each word in the acquired English text data has already been marked with phonetic symbols, these phonetic symbols constitute the pronunciation data of the English text data.
  • other methods can also be used to obtain the pronunciation data of the English text data, which is not limited here.
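The acquisition in step S110 can be sketched as a dictionary lookup with an override for text that already carries phonetic symbols. This is a minimal sketch, not the patent's implementation: the names `TOY_DICTIONARY` and `get_pronunciation_data` and the transcriptions in the toy dictionary are assumptions made for illustration.

```python
from typing import Dict, List, Optional, Tuple

# Hypothetical toy dictionary; a real system would query a full pronouncing dictionary.
TOY_DICTIONARY: Dict[str, str] = {
    "delay": "dɪˈleɪ",
    "lab": "læb",
    "joke": "dʒəʊk",
}


def get_pronunciation_data(
    text: str, preannotated: Optional[Dict[str, str]] = None
) -> List[Tuple[str, Optional[str]]]:
    """Return (word, transcription) pairs for every word in the text.

    Transcriptions supplied with the text (preannotated) take priority;
    otherwise the dictionary is queried. Unknown words map to None.
    """
    pairs: List[Tuple[str, Optional[str]]] = []
    for raw in text.split():
        word = raw.strip(".,;:!?\"'").lower()
        if not word:
            continue
        if preannotated and word in preannotated:
            transcription: Optional[str] = preannotated[word]
        else:
            transcription = TOY_DICTIONARY.get(word)
        pairs.append((word, transcription))
    return pairs
```

Words that return `None` would need another source of pronunciation data, which the text leaves open.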
  • step S120 the English text data is divided into at least one spelling unit.
  • English words can be divided into one or more syllables, and the pronunciation of the entire word can be spelled out according to the pronunciation of each syllable.
  • for example, the word delay is divided into two syllables, de-'lay, and the word can be read from the pronunciation of these two syllables.
  • the spelling units corresponding to basic pronunciation units or determined combined pronunciation units include single vowels, single consonants, compound vowels, compound consonants, and specific pronunciation combinations; any spelling unit obtained by segmentation may be any of these.
  • each spelling unit is determined to be a single vowel, a single consonant, a compound vowel, or a compound consonant.
  • a spelling unit pronounced as a vowel (single or compound) may correspond to vowel letters, or to a letter combination that contains consonant letters.
  • similarly, a spelling unit pronounced as a consonant may correspond to consonant letters or to a letter combination that contains vowel letters. In addition, a specific pronunciation combination is a pronunciation combination with a fixed and common pronunciation in English; it may combine vowels and consonants, as in [ʃən] (tion), which contains the pronunciation of both a vowel and consonants.
  • vowels can be single vowels or compound vowels
  • consonants can be single consonants or compound consonants.
  • for example, the word delay can be divided into four spelling units, [d](d), (e), [l](l) and (ay), where (ay) is a compound vowel.
  • the phonetic transcription of any word can be used to segment the word into spelling units. For example, according to the phonetic transcription of the word delay, the word is divided into the four spelling units d, e, l and ay.
  • the spelling unit defined herein may be a letter or a combination of letters corresponding to a basic pronunciation unit (for example, a single vowel, a single consonant, a compound vowel, or a compound consonant) or to a determined combined pronunciation unit (for example, a specific pronunciation combination).
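The segmentation in step S120 can be illustrated with a greedy longest-match split. This is a simplified sketch under stated assumptions: the text segments words with the help of their phonetic transcription, whereas this example relies only on a small, hypothetical inventory of letter combinations (`MULTI_LETTER_UNITS`); the function name is likewise an assumption.

```python
from typing import List

# Hypothetical, deliberately tiny inventory of multi-letter spelling units
# (compound vowels, compound consonants, specific pronunciation combinations).
MULTI_LETTER_UNITS = {
    "tion", "cial", "ay", "ei", "ue", "ew", "ou", "ar", "or", "er", "th", "sh", "ch",
}
MAX_UNIT_LEN = max(len(unit) for unit in MULTI_LETTER_UNITS)


def split_into_spelling_units(word: str) -> List[str]:
    """Greedily split a word into spelling units, longest known combination first."""
    word = word.lower()
    units: List[str] = []
    i = 0
    while i < len(word):
        # Try the longest candidate first, down to two letters.
        for size in range(min(MAX_UNIT_LEN, len(word) - i), 1, -1):
            candidate = word[i:i + size]
            if candidate in MULTI_LETTER_UNITS:
                units.append(candidate)
                i += size
                break
        else:
            # No multi-letter unit starts here: the single letter is its own unit.
            units.append(word[i])
            i += 1
    return units
```

For delay this yields d, e, l and ay, matching the four units named in the text. A spelling-only split cannot tell whether a combination is actually pronounced as one unit in a given word, which is why the text ties segmentation to the pronunciation data.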
  • step S130 according to the pronunciation data, at least one of the spelling units is respectively labeled with pronunciation, so that the pronunciation labels are integrated into the English text data.
  • the text of each spelling unit can be given a pronunciation annotation according to its composition and/or its phonetic features as a single vowel, single consonant, compound vowel, or compound consonant, so that the pronunciation annotation is integrated into the English text data as pronunciation prompt information to help the reader spell out the text.
  • the following processing may be performed on the spelling unit to mark the pronunciation: according to the pronunciation data, the spelling unit is marked for display, and/or a spelling prompt mark is added to the spelling unit.
  • the process of adding a display mark to the spelling unit based on the pronunciation data may include: according to the pronunciation data corresponding to the spelling unit, adding a color mark, an underline mark, and a cross-out mark to the spelling unit.
  • the process of adding a spelling prompt mark to the spelling unit includes: according to the pronunciation data corresponding to the spelling unit, adding a corresponding spelling prompt symbol mark or spelling prompt number mark to a spelling unit that has at least two pronunciations.
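One possible way to represent the two kinds of marks from step S130 is a small record per spelling unit. The sketch below is illustrative only: the `AnnotatedUnit` fields, the `annotate_unit` function, the colour "red", and the policy of numbering alternatives by how common they are (with no mark on the most common one) are all assumptions, not details confirmed by the text.

```python
from dataclasses import dataclass
from typing import Optional, Tuple

VOWEL_LETTERS = set("aeiou")


@dataclass
class AnnotatedUnit:
    letters: str
    color: Optional[str] = None    # display mark: colour
    underline: bool = False        # display mark: underline
    crossed_out: bool = False      # display mark: cross-out (silent letters)
    prompt: Optional[str] = None   # spelling prompt symbol or number


def annotate_unit(letters: str, sound: str, alternatives: Tuple[str, ...] = ()) -> AnnotatedUnit:
    """Annotate one spelling unit from its pronunciation.

    sound: transcription of the unit in this word; "" means the unit is silent.
    alternatives: known pronunciations of the unit, most common first (assumed input).
    """
    unit = AnnotatedUnit(letters)
    if sound == "":
        unit.crossed_out = True          # silent letters are crossed out
        return unit
    if len(letters) > 1:
        unit.underline = True            # letter combinations are underlined
    elif letters in VOWEL_LETTERS:
        unit.color = "red"               # assumed colour for single vowel letters
    if len(alternatives) >= 2 and sound in alternatives:
        rank = alternatives.index(sound) + 1
        if rank > 1:
            unit.prompt = str(rank)      # number mark for a less common pronunciation
    return unit
```

With `alternatives=("iː", "eɪ")`, the combination ei pronounced "eɪ" receives the number mark "2", while ei pronounced "iː" is only underlined.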
  • step S140 the English text data integrated with pronunciation annotations for each of the spelling units is provided, so that the reader can directly read the English text through the pronunciation annotations.
  • the English text data integrated with the pronunciation annotations of the spelling units can be written into a designated file or displayed directly; alternatively, it can be sent to a designated host or server via a network.
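As one way of providing the annotated text in step S140, the marks can be rendered as HTML that is then written to a file or displayed. This is a sketch under assumptions: the text does not name an output format, and the function names, the inline CSS, and the choice to show prompt marks as superscripts are all illustrative.

```python
from html import escape
from typing import Dict, List, Optional


def render_unit_html(
    letters: str,
    color: Optional[str] = None,
    underline: bool = False,
    crossed_out: bool = False,
    prompt: Optional[str] = None,
) -> str:
    """Render one annotated spelling unit as an HTML fragment."""
    styles: List[str] = []
    if color:
        styles.append(f"color:{color}")
    decorations: List[str] = []
    if underline:
        decorations.append("underline")
    if crossed_out:
        decorations.append("line-through")
    if decorations:
        styles.append("text-decoration:" + " ".join(decorations))
    fragment = escape(letters)
    if styles:
        style = ";".join(styles)
        fragment = f'<span style="{style}">{fragment}</span>'
    if prompt:
        # Assumption: the prompt mark is shown as a superscript after the unit.
        fragment += f"<sup>{escape(prompt)}</sup>"
    return fragment


def render_word_html(units: List[Dict]) -> str:
    """Concatenate the fragments of all units of one word."""
    return "".join(render_unit_html(**unit) for unit in units)
```

The resulting string can be saved with ordinary file I/O or embedded in a page, covering both the "write to a designated file" and the "display" options.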
  • English text of different granularities, such as single English words and phrases, can be divided into finer-grained spelling units, and the pronunciation of each spelling unit can be marked according to the pronunciation data. The pronunciation annotations are integrated into the English text data as pronunciation prompt information, and the English text data integrated with the pronunciation annotations is provided, so that the reader can read the corresponding English text directly from the integrated prompts. This upgrades traditional natural phonics, which is about 80% reliable, into a 100% reliable method, effectively helping a learner with basic training read the pronunciation of words accurately and confidently without looking them up in a dictionary.
  • the marked text can also shift the learner's instant observation of an English word from the word as a whole down to the syllables that make it up. The meaning of a word is ultimately related to its form, so closer and more detailed observation of word forms helps learners naturally notice the regular changes in spelling while reading, thereby strengthening their feel for English words, building an intuition for meaning in reading, and enhancing their overall sense of English.
  • Various exemplary processes of step S130 will be described below with reference to FIGS. 3 to 12.
  • the vowel letters (such as a in lab and e in men) may be marked in a different color from the other letters in the word (l, b, m, and n).
  • the letters corresponding to long vowel sounds and the letters corresponding to short vowel sounds are given different spelling prompt symbols.
  • the letters in spelling units with a short vowel pronunciation (such as a in lab) are given a "" spelling prompt mark, and the letters in spelling units with a long vowel pronunciation (such as the o in joke) are given a "-" spelling prompt mark.
  • a cross-out mark is added to the letter corresponding to a silent word-final vowel. As shown in Figure 4, the silent e in each word is crossed out to indicate that the letter does not need to be spelled or pronounced.
  • the letter combinations ar, or, er, ei, ue, ew, ou, etc. corresponding to compound vowels are all underlined. The spelling prompt marks for other compound vowels are not listed here.
  • corresponding spelling prompt marks are added to letter combinations corresponding to compound vowels with at least two pronunciations.
  • for example, the letter combination ei corresponding to a compound vowel has two pronunciations, [i:] and [ei]. Since [i:] is the common pronunciation of ei, the spelling prompt number mark "2" is added to the letter combination ei in words where it is pronounced [ei].
  • the letter combination ue corresponding to a compound vowel has two pronunciations, [u:] and [ju:]. The spelling prompt mark “” is added to the letter combination ue in words pronounced [u:], and the spelling prompt mark “” is added to the letter combination ue in words pronounced [ju:].
  • the letter combination ou has 6 pronunciations, which makes it a more complex and special spelling unit. As shown in Figure 8, a different spelling prompt mark is added for each of these 6 pronunciations, including symbol marks (such as "", “-”, “ ⁇ ”) and phonetic marks (such as [ ⁇ ]). Where appropriate, silent letters can also be crossed out, such as the u in would and the o in ouph.
  • corresponding spelling prompt marks can also be added to letter combinations corresponding to compound consonants with at least two pronunciations.
  • the letter combination th corresponding to a compound consonant has two pronunciations, [θ] and [ð]. The spelling prompt number mark "1" is added to the letter combination th in words with one of these pronunciations, and the spelling prompt number mark "2" is added to the letter combination th in words with the other.
  • a corresponding spelling prompt symbol mark is added to the letter corresponding to the softened consonant, and the letter corresponding to the softened consonant includes the letter c or the letter g.
  • the letter c When the letter c is placed before the vowels e and i, its pronunciation is softened and pronounced as [s].
  • a spelling prompt mark " ⁇ " is added to remind the reader that the softened sound is needed.
  • the spelling prompt mark " ⁇ " is also added.
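The softening rule stated above (c before e or i) can be turned into a quick candidate finder. This is only a sketch: the function name is an assumption, and applying the same spelling test to g is an assumption too, since the text confirms only that g can be softened. Because the method assigns marks according to the pronunciation data, spelling alone can only propose candidates.

```python
from typing import List


def soft_consonant_candidates(word: str) -> List[int]:
    """Return the positions of c or g that are followed by e or i.

    These are candidates for the softened-consonant prompt mark; each one
    still has to be confirmed against the word's pronunciation data.
    """
    word = word.lower()
    return [
        i
        for i in range(len(word) - 1)
        if word[i] in "cg" and word[i + 1] in "ei"
    ]
```

For city the function returns position 0 and for cat it returns nothing. It also flags the g in get, which is pronounced hard, showing why the final decision has to come from the pronunciation data rather than from spelling.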
  • an underline mark is added to a letter combination corresponding to a specific pronunciation combination.
  • letter combinations "cial”, “rial”, “tual”, etc. with specific pronunciations are all underlined to remind readers to spell the corresponding letter combinations together.
  • FIG. 13 shows an example of English text integrated with annotations made by the aforementioned phonetic annotation method. The letters or letter combinations corresponding to compound vowels, compound consonants, and specific pronunciation combinations have been underlined, the silent letters in the spelling units have been crossed out, and spelling prompt marks have been added to spelling units with at least two pronunciations. With the help of these pronunciation annotations, readers can easily spell out the spelling units and read the English text.
  • the embodiment of the present invention also provides a computer-readable storage medium storing instructions for performing the steps of any of the aforementioned methods for spelling out English text.
  • an embodiment of the present invention also provides a computer program product including at least one executable instruction, which is used to implement the spelling and marking method of any of the foregoing English texts when the executable instruction is executed by a processor.
  • Fig. 2 is a flowchart showing a method for spelling English text according to other embodiments of the present invention. Here, spelling is performed for the English text annotated by the method shown in FIG. 1.
  • step S210 English text data fused with pronunciation annotations are obtained, and at least one spelling unit in the English text data is integrated with pronunciation annotations as described in the aforementioned spelling annotation method for English text.
  • step S220 each of the spelling units is spelled out according to the pronunciation label.
  • each spelling unit can be easily spelled out according to the pronunciation of each spelling unit.
  • the pronunciation annotation includes a display mark and/or a spelling prompt mark of the spelling unit.
  • the display mark includes a color mark, an underline mark, and a cross-out mark;
  • the spelling prompt mark includes a spelling prompt symbol mark or a spelling prompt number mark.
  • step S220 includes: for a spelling unit that has not been marked with pronunciation, spelling out the spelling unit according to the conventional pronunciation; for a spelling unit that has been marked with pronunciation, distinguishing the spelling unit according to its display mark, and/or spelling out the distinguished spelling unit according to its spelling prompt mark.
  • the embodiment of the present invention also provides a computer-readable storage medium storing instructions for performing the steps of any of the foregoing methods for spelling English text.
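The reading procedure of step S220 can be sketched as a lookup that falls back to the conventional pronunciation for unmarked units. Everything named here is hypothetical: the `CONVENTIONAL` and `MARKED` tables, their transcriptions, and the tuple layout `(letters, crossed_out, prompt)` are assumptions chosen for illustration.

```python
from typing import Dict, List, Optional, Tuple

# Hypothetical conventional pronunciations of unmarked spelling units.
CONVENTIONAL: Dict[str, str] = {"j": "dʒ", "k": "k", "l": "l", "b": "b", "ei": "iː"}

# Hypothetical pronunciations selected by a prompt mark: (letters, mark) -> sound.
MARKED: Dict[Tuple[str, str], str] = {("o", "-"): "əʊ", ("ei", "2"): "eɪ"}

Unit = Tuple[str, bool, Optional[str]]  # (letters, crossed_out, prompt)


def spell_out(units: List[Unit]) -> str:
    """Spell out a word from its annotated spelling units."""
    sounds: List[str] = []
    for letters, crossed_out, prompt in units:
        if crossed_out:
            continue  # crossed-out letters are silent
        if prompt is None:
            # Unmarked unit: conventional pronunciation (falls back to the letters).
            sounds.append(CONVENTIONAL.get(letters, letters))
        else:
            # Marked unit: the prompt mark selects the pronunciation.
            sounds.append(MARKED[(letters, prompt)])
    return "".join(sounds)
```

Calling `spell_out` on the units of joke, with a "-" mark on o and a crossed-out e, produces "dʒəʊk" under these assumed tables.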
  • the embodiment of the present invention also provides a computer program product including at least one executable instruction, which is used to implement the spelling method of any of the foregoing English texts when the executable instruction is executed by a processor.
  • an apparatus 1500 for spelling and labeling English text includes a first obtaining module 1510, a segmentation module 1520, a pronunciation labeling module 1530, and a providing module 1540.
  • the first obtaining module 1510 is configured to obtain the English text data to be processed and pronunciation data of the English text data.
  • the English text data is words, phrases, sentence fragments, entire sentences or one or more text paragraphs.
  • the segmentation module 1520 is configured to segment the English text data acquired by the first acquisition module 1510 into at least one spelling unit, the spelling unit including one or more letters forming a basic pronunciation unit or a combined pronunciation unit.
  • the pronunciation tagging module 1530 is configured to respectively perform pronunciation tagging on at least one of the spelling units according to the pronunciation data, so that the pronunciation tagging is integrated into the English text data.
  • the spelling unit includes letters or letter combinations corresponding to single vowels (monophthongs), single consonants, compound vowels, compound consonants, or specific pronunciation combinations.
  • the pronunciation marking module 1530 is configured to: display and mark the spelling unit according to the pronunciation data, and/or add a spelling prompt mark to the spelling unit according to the pronunciation data.
  • the pronunciation marking module 1530 is configured to: according to the pronunciation data in the spelling unit, color-mark the spelling unit, add an underline mark, and add a cross-out mark.
  • the pronunciation labeling module 1530 is configured to add corresponding spelling prompt symbol marks or spelling prompt digital marks to the spelling unit with at least two pronunciations according to the pronunciation data in the spelling unit.
  • the pronunciation labeling module 1530 is configured to: according to the pronunciation data, add a corresponding spelling prompt symbol mark to the letter corresponding to a softened consonant, the letters corresponding to softened consonants including the letter c or the letter g; and/or, according to the pronunciation data, add an underline mark to the letter combination corresponding to a silent consonant; and/or, according to the pronunciation data, add an underline mark to the letter combination corresponding to a compound consonant; and/or, according to the pronunciation data, add an underline mark to the letter combination corresponding to a specific pronunciation combination; and/or, according to the pronunciation data, add a corresponding spelling prompt number mark, or a corresponding prompting phonetic symbol, to the letter combination corresponding to a compound consonant with at least two pronunciations.
  • the pronunciation labeling module 1530 is configured to: according to the pronunciation data, add a color mark to the letter corresponding to a single vowel that has both a long-vowel and a short-vowel pronunciation, and add correspondingly different spelling prompt symbol marks to the letter of the single vowel pronounced as a long vowel and to the letter of the single vowel pronounced as a short vowel; and/or, according to the pronunciation data, add a cross-out mark to the letter corresponding to a silent vowel at the end of a word; and/or, according to the pronunciation data, add an underline mark to the letter combination corresponding to a compound vowel; and/or, according to the pronunciation data, add a corresponding spelling prompt mark to the letter combination corresponding to a compound vowel with at least two pronunciations.
  • the providing module 1540 is configured to provide the English text data integrated with pronunciation annotations for each of the spelling units, so that the reader can directly read the English text through the pronunciation annotations.
  • the providing module 1540 is configured to: write the English text data fused with pronunciation annotations for each of the spelling units into a designated file; and/or display the English text data fused with pronunciation annotations for each of the spelling units.
  • the device for spelling and marking any English text can achieve the same effect as any of the aforementioned methods for spelling and marking any English text, which will not be repeated here.
  • an English text spelling device 1600 includes a second acquiring module 1610 and a spelling module 1620.
  • the second acquiring module 1610 is used to acquire English text data fused with pronunciation annotations, where at least one spelling unit in the English text data has been given a pronunciation annotation by the spelling and labeling apparatus for English text described above.
  • the spelling module 1620 is configured to spell out each of the spelling units according to the pronunciation tags.
  • the pronunciation mark includes a display mark and/or a spelling prompt mark of the spelling unit.
  • the display mark includes a color mark, an underline mark, and a cross-out mark;
  • the spelling prompt mark includes a spelling prompt symbol mark or a spelling prompt number mark.
  • the spelling module 1620 includes: a first spelling unit, used to spell out, according to the conventional pronunciation, a spelling unit that has not been given a pronunciation annotation; and a second spelling unit, used, for a spelling unit that has been given a pronunciation annotation, to distinguish the spelling unit according to its display mark, and/or to spell out the distinguished spelling unit according to its spelling prompt mark.
  • the spelling device of any English text according to the embodiment of the present invention can achieve the same effect as any of the aforementioned spelling methods of English text, which will not be repeated here.
  • FIG. 14 is a schematic diagram showing the structure of an electronic device according to an embodiment of the present invention.
  • the electronic device may be, for example, a mobile terminal, a personal computer (PC), a tablet computer, a server, etc.
  • FIG. 14 shows a schematic structural diagram of an electronic device suitable for implementing the image processing apparatus of the embodiment of the present invention:
  • the electronic device may include a memory and a processor.
  • the electronic device includes one or more processors, communication elements, etc. The one or more processors are, for example, one or more central processing units (CPU) 1401 and/or one or more graphics processors (GPU) 1413. The processor can perform various appropriate actions and processing according to executable instructions stored in a read-only memory (ROM) 1402 or executable instructions loaded from the storage portion 1408 into a random access memory (RAM) 1403.
  • the communication element includes a communication component 1412 and/or a communication interface 1409.
  • the communication component 1412 may include, but is not limited to, a network card.
  • the network card may include but is not limited to an IB (Infiniband) network card.
  • the communication interface 1409 includes a communication interface of a network interface card such as a LAN card or a modem, and the communication interface 1409 performs communication processing via a network such as the Internet.
  • the processor can communicate with the read-only memory 1402 and/or the random access memory 1403 to execute executable instructions, connect with the communication component 1412 through the communication bus 1404, and communicate with other target devices through the communication component 1412, thereby completing the operation corresponding to any broadcast-based anti-lost detection method provided by the embodiment of the present invention, for example: obtaining the English text data to be processed and the pronunciation data of the English text data; dividing the English text data into at least one spelling unit, the spelling unit including one or more letters that form a basic pronunciation unit or a combined pronunciation unit; performing pronunciation annotation on at least one of the spelling units according to the pronunciation data, so that the pronunciation annotation is integrated into the English text data; and providing the English text data fused with pronunciation annotations for each of the spelling units, so that the reader can read out the English text directly through the pronunciation annotations.
  • in the RAM 1403, various programs and data required for operation of the apparatus can also be stored.
  • the CPU 1401 or GPU 1413, ROM 1402, and RAM 1403 are connected to each other through a communication bus 1404.
  • where the RAM 1403 is present, the ROM 1402 is an optional module.
  • the RAM 1403 stores executable instructions, or writes executable instructions into the ROM 1402 at runtime, and the executable instructions enable the processor to perform the operations corresponding to the aforementioned communication methods.
  • An input/output (I/O) interface 1405 is also connected to the communication bus 1404.
  • the communication component 1412 may be integrated, or may be configured to have multiple sub-modules (for example, multiple IB network cards), and is linked to the communication bus.
  • the following components are connected to the I/O interface 1405: an input part 1406 including a keyboard, a mouse, etc.; an output part 1407 including a cathode ray tube (CRT), a liquid crystal display (LCD), etc., and speakers, etc.; a storage part 1408 including a hard disk, etc. ; And a communication interface 1409 including a network interface card such as a LAN card and a modem.
  • the drive 1410 is also connected to the I/O interface 1405 as needed.
  • a removable medium 1411 such as a magnetic disk, an optical disk, a magneto-optical disk, a semiconductor memory, etc., is installed on the drive 1410 as needed, so that the computer program read from it is installed into the storage portion 1408 as needed.
  • the architecture shown in FIG. 14 is only an optional implementation. In practice, the number and types of the components in FIG. 14 can be selected, deleted, added, or replaced according to actual needs. Different functional components can also be set up separately or in an integrated manner: for example, the GPU and the CPU can be separate or the GPU can be integrated on the CPU, and the communication elements can be separate or integrated on the CPU or GPU, and so on. These alternative embodiments all fall within the protection scope of the present disclosure.
  • the process described above with reference to the flowchart can be implemented as a computer software program.
  • the embodiment of the present invention includes a computer program product, which includes a computer program tangibly contained on a machine-readable medium.
  • the computer program includes program code for executing the method shown in the flowchart.
  • the program code may include instructions corresponding to the method steps provided in the embodiment of the present invention, for example: executable code for obtaining the English text data to be processed and the pronunciation data of the English text data; executable code for dividing the English text data into at least one spelling unit, the spelling unit including one or more letters forming a basic pronunciation unit or a combined pronunciation unit; executable code for performing pronunciation annotation on at least one of the spelling units according to the pronunciation data, so that the pronunciation annotation is integrated into the English text data; and executable code for providing the English text data fused with pronunciation annotations for each of the spelling units, so that the reader can read out the English text directly through the pronunciation annotations.
  • as another example: executable code for obtaining English text data fused with pronunciation annotations, where at least one spelling unit in the English text data has been annotated by any of the foregoing spelling and marking methods for English text; and executable code for spelling out each of the spelling units according to the pronunciation annotations.
  • the computer program may be downloaded and installed from the network through the communication element, and/or installed from the removable medium 1411.
  • when the computer program is executed by the central processing unit (CPU) 1401, the above-mentioned functions defined in the method of the embodiment of the present invention are executed.
  • the electronic device of the embodiment of the present invention can be used to implement the corresponding spelling and marking method or spelling method for English text in the above embodiments, and each component of the electronic device can be used to execute each step of the above method embodiments. For example, the spelling and marking method or spelling method for English text described above can be implemented by the processor of the electronic device invoking the relevant instructions stored in the memory. For brevity, this is not repeated here.
  • depending on implementation needs, each component/step described in this application can be split into more components/steps, or two or more components/steps, or partial operations of components/steps, can be combined into new components/steps to achieve the purpose of the embodiments of the present invention.
  • the method and apparatus, electronic equipment, and storage medium of the present disclosure may be implemented in many ways.
  • the method, device, electronic device, and storage medium of the embodiments of the present invention may be implemented by software, hardware, firmware or any combination of software, hardware, and firmware.
  • the above-mentioned order of the steps for the method is for illustration only, and the steps of the method in the embodiment of the present invention are not limited to the order specifically described above, unless specifically stated otherwise.
  • the present disclosure can also be implemented as programs recorded in a recording medium, and these programs include machine-readable instructions for implementing methods according to embodiments of the present invention.
  • the present disclosure also covers a recording medium storing a program for executing the method according to the embodiment of the present invention.

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Physics & Mathematics (AREA)
  • Educational Administration (AREA)
  • Educational Technology (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Electrically Operated Instructional Devices (AREA)
  • Document Processing Apparatus (AREA)

Abstract

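A spelling and marking method for English text, a spelling method, apparatuses therefor, a storage medium, and an electronic device. The spelling and marking method for English text includes: obtaining English text data to be processed and pronunciation data of the English text data (S110); segmenting the English text data into at least one spelling unit (S120), the spelling unit including one or more letters that form a basic pronunciation unit or a combined pronunciation unit; performing pronunciation annotation on at least one spelling unit according to the pronunciation data, so that the pronunciation annotation is fused into the English text data (S130); and providing the English text data fused with the pronunciation annotations of the spelling units, so that a reader can read out the English text directly by means of the pronunciation annotations (S140), thereby effectively helping English learners use phonics to spell out English text and improving the efficiency of English learning.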

Description

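Spelling and marking method for English text, spelling method, apparatuses therefor, storage medium, and electronic device
Technical Field
The embodiments of the present invention relate to information processing technology, and in particular to a spelling and marking method for English text, a spelling method, apparatuses therefor, a storage medium, and an electronic device.
Background
A word is the smallest unit of independent meaning in English, so becoming familiar with the logical system of English has to start from the word.
English words are composed of syllables, and each pronounced vowel forms a syllable. English is an alphabetic writing system, and 84% of English words follow the pronunciation rules. Even in words that do not, parts of the word often follow the rules or show special regularities. For students whose mother tongue is not English, the first skill to master is sounding out the written word: on seeing an English word, being able to pronounce it quickly and, usually, correctly. This is phonics. However, unlike pinyin, which follows its rules one hundred percent of the time, English phonics does not, so an English word judged only by its spelling may have several possible pronunciations. This is similar to Chinese characters: we may guess a character's pronunciation from its form, but still need a teacher to teach the pronunciation or a dictionary to confirm it. As a result, in actual English teaching most schools and teachers still default to the "whole-word recognition" approach, leaving phonics like a crutch that cannot do its job.
On the other hand, the current pronunciation aid for English, namely phonetic symbols, transcribes each word as a whole, separately from the word. First, the transcription can only be found by looking up each word in a dictionary. Second, even once found, the transcription is separate from the word itself, and attention cannot take in the spelling and the transcription at the same time. Students' instinct for observing English pronunciation therefore stays at the level of whole-word recognition and cannot descend to the level of syllables. On the one hand, students still cannot spell out a word's pronunciation accurately and confidently; on the other hand, because they cannot naturally observe the structure of English words while reading, their sensitivity to the meaning of English words is also low.
Summary of the Invention
The purpose of the embodiments of the present invention is to provide a spelling and marking scheme and a spelling scheme for English text, so as to add pronunciation annotations to English text and effectively help learners learn English through phonics.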
根据本发明实施例的第一方面,提供一种英语文本的拼读标注方法,包括:获取待处理的英语文本数据以及所述英语文本数据的发音数据;将所述英语文本数据切分为至少一个拼读单元,所述拼读单元包括形成基本发音单位或组合发音单位的一个或多个字母;根据所述发音数据,分别对至少一个所述拼读单元进行发音标注,以使所述发音标注融合在所述英语文本数据中;提供融合有对各个所述拼读单元进行发音标注的所述英语文本数据,以使读者能够通过所述发音标注直接读出英语文本。
可选地,所述根据所述发音数据,对所述拼读单元发音标注包括:根据所述发音数据,对所述拼读单元进行显示标记,以及/或者,根据所述发音数据,对所述拼读单元添加拼读提示标记。
可选地,所述根据所述发音数据,对所述拼读单元进行显示标记,包括:根据 所述拼读单元中的所述发音数据,将所述拼读单元进行颜色标记、添加下划线标记以及添加划掉标记。
可选地,所述根据所述发音数据,对所述拼读单元添加拼读提示标记,包括:根据所述拼读单元中的所述发音数据,对具有至少两种发音的拼读单元添加相应的拼读提示符号标记或拼读提示数字标记。
可选地,所述拼读单元包括与单元音、单辅音、复合元音、复合辅音或特定发音组合对应的字母或字母组合。
可选地,所述根据所述发音数据,分别对至少一个所述拼读单元进行发音标注,以使所述发音标注融合在所述英语文本数据中,包括:根据所述发音数据,对软化发音的辅音对应的字母添加相应的拼读提示符号标记,所述软化发音的辅音对应的字母包括字母c或字母g;以及/或者,根据所述发音数据,对哑辅音对应的字母组合添加下划线标记;以及/或者,根据所述发音数据,对复合辅音对应的字母组合添加下划线标记;以及/或者,根据所述发音数据,对特定发音组合对应的字母组合添加下划线标记;以及/或者,根据所述发音数据,对具有至少两种发音的复合辅音对应的字母组合添加相应的拼读提示数字标记或加注相应的提示发音音标。
可选地,所述根据所述发音数据,分别对至少一个所述拼读单元进行发音标注,以使所述发音标注融合在所述英语文本数据中,包括:根据所述发音数据,对具有长元音发音和短元音发音的单元音对应的字母进行添加颜色标记,并且对长元音发音的所述单元音对应的字母和短元音发音的所述单元音对应的字母分别添加相应不同的拼读提示符号标记;以及/或者,根据所述发音数据,对不发音的单词尾部元音对应的字母添加划掉标记;以及/或者,根据所述发音数据,对复合元音对应的字母组合添加下划线标记;以及/或者,根据所述发音数据,对具有至少两种发音的复合元音对应的字母组合添加相应的拼读提示标记。
可选地,所述提供融合有对各个所述拼读单元进行发音标注的所述英语文本数据,包括:将所述融合有对各个所述拼读单元进行发音标注的所述英语文本数据写入指定的文件;以及/或者,显示所述融合有对各个所述拼读单元进行发音标注的所述英语文本数据。
可选地,所述英语文本数据为单词、短语、语句片段、整个语句或一个或多个文本段落。
根据本发明实施例的第二方面,提供一种英语文本的拼读方法,包括:获取融合有发音标注的英语文本数据,所述英语文本数据中的至少一个拼读单元进行过如前任一英语文本的拼读标注方法的发音标注;根据所述发音标注拼读出各个所述拼读单元。
可选地,所述发音标注包括所述拼读单元的显示标记和/或拼读提示标记。
可选地,所述显示标记包括颜色标记、下划线标记以及划掉标记;所述拼读提示标记包括拼读提示符号标记或拼读提示数字标记。
可选地,所述根据所述发音标注拼读出各个所述拼读单元,包括:对于未进行过发音标注的拼读单元,按照常规读法拼读出所述拼读单元;对于进行过发音标注的拼读单元,根据所述拼读单元的显示标记区分出所述拼读单元,并且根据所述拼读提示标记,对区分出的拼读单元进行拼读。
根据本发明实施例的第三方面,提供一种英语文本的拼读标注装置,包括:获取模块,用于获取待处理的英语文本数据以及所述英语文本数据的发音数据;切分模块,用于将所述英语文本数据切分为至少一个拼读单元,所述拼读单元包括形成基本发音单位或组合发音单位的一个或多个字母;发音标注模块,用于根据所述发音数据,分别对至少一个所述拼读单元进行发音标注,以使所述发音标注融合在所述英语文本数据中;提供模块,用于提供融合有对各个所述拼读单元进行发音标注的所述英语文本数据,以使读者能够通过所述发音标注直接读出英语文本。
可选地,所述发音标注模块用于:根据所述发音数据,对所述拼读单元进行显示标记,以及/或者,根据所述发音数据,对所述拼读单元添加拼读提示标记。
可选地,所述发音标注模块用于:根据所述拼读单元中的所述发音数据,将所述拼读单元进行颜色标记、添加下划线标记以及添加划掉标记。
可选地,所述发音标注模块用于:根据所述拼读单元中的所述发音数据,对具有至少两种发音的拼读单元添加相应的拼读提示符号标记或拼读提示数字标记。
可选地,所述拼读单元包括与单元音、单辅音、复合元音、复合辅音或特定发音组合对应的字母或字母组合。
可选地,所述发音标注模块用于:根据所述发音数据,对软化发音的辅音对应的字母添加相应的拼读提示符号标记,所述软化发音的辅音对应的字母包括字母c或字母g;以及/或者,根据所述发音数据,对哑辅音对应的字母组合添加下划线标记;以及/或者,根据所述发音数据,对复合辅音对应的字母组合添加下划线标记;以及/或者,根据所述发音数据,对特定发音组合对应的字母组合添加下划线标记;以及/或者,根据所述发音数据,对具有至少两种发音的复合辅音对应的字母组合添加相应的拼读提示数字标记或加注相应的提示发音音标。
可选地,所述发音标注模块用于:根据所述发音数据,对具有长元音发音和短元音发音的单元音对应的字母进行添加颜色标记,并且对长元音发音的所述单元音对应的字母和短元音发音的所述单元音对应的字母分别添加相应不同的拼读提示符号标记;以及/或者,根据所述发音数据,对不发音的单词尾部元音对应的字母添加划掉标记;以及/或者,根据所述发音数据,对复合元音对应的字母组合添加下划线标记;以及/或者,根据所述发音数据,对具有至少两种发音的复合元音对应的字母组合添加相应的拼读提示标记。
可选地,所述提供模块用于:将所述融合有对各个所述拼读单元进行发音标注的所述英语文本数据写入指定的文件;以及/或者,显示所述融合有对各个所述拼读单元进行发音标注的所述英语文本数据。
可选地,所述英语文本数据为单词、短语、语句片段、整个语句或一个或多个文本段落。
根据本发明实施例的第四方面,提供一种英语文本的拼读装置,包括:获取模块,用于获取融合有发音标注的英语文本数据,所述英语文本数据中的至少一个拼读单元进行过如权利要求14~22任一项所述的英语文本的拼读标注装置的发音标注;拼读模块,用于根据所述发音标注拼读出各个所述拼读单元。
可选地,所述发音标注包括所述拼读单元的显示标记和/或拼读提示标记。
可选地,所述显示标记包括颜色标记、下划线标记以及划掉标记;所述拼读提 示标记包括拼读提示符号标记或拼读提示数字标记。
可选地,所述拼读模块包括:第一拼读单元,用于对于未进行过发音标注的拼读单元,按照常规读法拼读出所述拼读单元;第二拼读单元,用于对于进行过发音标注的拼读单元,根据所述拼读单元的显示标记区分出所述拼读单元,并且/或者,根据所述拼读提示标记,对区分出的拼读单元进行拼读。
根据本发明实施例的第五方面,提供一种计算机可读存储介质,其上存储有计算机程序指令,其中,所述程序指令被处理器执行时实现任一前述英语文本的拼读标注方法的步骤。
根据本发明实施例的第六方面,提供一种计算机可读存储介质,其上存储有计算机程序指令,其中,所述程序指令被处理器执行时实现任一前述英语文本的拼读方法的步骤。
根据本发明实施例的第七方面,提供一种电子设备,包括:处理器、存储器、通信元件和通信总线,所述处理器、所述存储器和所述通信元件通过所述通信总线完成相互间的通信;所述存储器用于存放至少一可执行指令,所述可执行指令使所述处理器执行任一前述英语文本的拼读标注方法对应的操作。
根据本发明实施例的第八方面,提供一种电子设备,包括:处理器、存储器、通信元件和通信总线,所述处理器、所述存储器和所述通信元件通过所述通信总线完成相互间的通信;所述存储器用于存放至少一可执行指令,所述可执行指令使所述处理器执行任一前述英语文本的拼读标注方法对应的操作。
根据本发明实施例提供的英语文本的拼读标注方案,能够将作为单个英语单词、短语等不同粒度的英语文本切分为颗粒度较小的拼读单元,根据发音数据对各个拼读单元进行发音标注,使作为发音提示信息的发音标注融合在英语文本数据中,并且提供融合有发音标注的所述英语文本数据,从而读者能够根据融合在内的发音提示信息,直接读出相应的英语文本,将传统八成可靠的自然拼读工具,提升成为百分之百可依靠的方法,有效地帮助一位经过基本训练的学习者,无需查考字典,就能通过拼读法百分百准确自信地读出单词的发音。更重要的是,标注好的文本可以将学习者对于英文单词的瞬间观察力从整体单词下行到组成单词的音节,单词的意思终究和它的外形相关联,所以对于单词外形深入细致的瞬间观察力,能帮助学习者在英文阅读中自然感受单词拼写中的规律性变化,从而提升学习者的英文词感,在阅读中建立对意思的直觉,提升综合英文语感。
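Brief Description of the Drawings
FIG. 1 is a flowchart showing a spelling and marking method for English text according to some embodiments of the present invention;
FIG. 2 is a flowchart showing a spelling method for English text according to other embodiments of the present invention;
FIG. 3 to FIG. 12 are schematic diagrams each showing pronunciation annotations applied to English text by the spelling and marking method according to some embodiments of the present invention;
FIG. 13 shows an example of English text annotated by the aforementioned spelling and marking method;
FIG. 14 is a schematic diagram showing the structure of an electronic device according to an embodiment of the present invention;
FIG. 15 is a logical block diagram showing a spelling and marking apparatus for English text according to some embodiments of the present invention;
FIG. 16 is a logical block diagram showing a spelling apparatus for English text according to some embodiments of the present invention.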
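Detailed Description
Exemplary embodiments of the present invention are described in detail below with reference to the accompanying drawings.
In this application, "a plurality of" means two or more, and "at least one" means one, two, or more. Any component, data, or structure mentioned in this application may be understood as one or more unless it is explicitly limited to one.
FIG. 1 is a flowchart showing a spelling and marking method for English text according to some embodiments of the present invention.
Children in English-speaking countries do not memorize words by rote; they learn them by sounding them out. This is phonics, also called "English phonics". Phonics is currently the mainstream method of teaching English internationally. It is not only the method by which children who are native speakers learn English pronunciation and spelling and improve their reading ability and comprehension, but also a method by which beginners learning English as a second language learn pronunciation rules and spelling skills. The method is simple, efficient, and fun, suits the way children learn languages, and greatly improves learning efficiency.
Phonics works by directly teaching the pronunciation rules of the 26 letters and of letter combinations within words, building a perceived link between letters or letter combinations and their sounds. In a relaxed and pleasant setting, students come to understand how English letter combinations work and master the regularities of English spelling, so that they can read a word when they see it and spell a word when they hear it.
For most English beginners, phonics is a practical tool and method. Once it is mastered, 80% of English words can be read out without the help of phonetic symbols, which turns the tedium of memorizing words into a simple matter and achieves twice the result with half the effort.
The general inventive concept of the present invention is to provide a method for adding pronunciation annotations to English text, so that these annotations, fused into the text, give English learners pronunciation hints, effectively help them use phonics to spell out English text, and improve the efficiency of English learning.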
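Referring to FIG. 1, in step S110, the English text data to be processed and the pronunciation data of the English text data are obtained.
Here, the English text data to be processed may be a single word or a combination of any number of words, for example a phrase of at least two words, a sentence fragment, an entire sentence, or one or more text paragraphs.
The obtained English text data includes one or more English words, and each English word has a corresponding phonetic transcription. The transcription data of each word can therefore be obtained, for example, by looking it up in a dictionary, and these transcriptions together constitute the pronunciation data of the English text data. Alternatively, if the words in the obtained English text data are already annotated with phonetic transcriptions, those transcriptions constitute the pronunciation data. The pronunciation data may of course be obtained in other ways, which are not limited here.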
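The dictionary-lookup variant of step S110 can be sketched as follows. This is a minimal illustration only: the `TOY_DICTIONARY` contents and the function name `get_pronunciation_data` are hypothetical and do not come from the patent, and a real system would presumably query a full lexicon.

```python
import re

# Hypothetical toy lexicon (an assumption for illustration only).
TOY_DICTIONARY = {
    "delay": "di'lei",
    "lab": "læb",
    "joke": "dʒəʊk",
}

def get_pronunciation_data(text, dictionary=TOY_DICTIONARY):
    """Return (word, transcription-or-None) pairs for every word in the text."""
    words = re.findall(r"[A-Za-z]+", text)
    # Words missing from the dictionary are kept with None so that a
    # later step can decide how to handle them.
    return [(word, dictionary.get(word.lower())) for word in words]
```

Words without an entry are returned with `None` rather than dropped, since the text only says that other sources of pronunciation data may be used.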
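In step S120, the English text data is segmented into at least one spelling unit.
Usually, an English word can be divided into one or more syllables, and the whole word can be read out from the pronunciation of its syllables. For example, the word delay is divided into two syllables, de-'lay, and the word can be read out from the pronunciation of these two syllables.
According to the general inventive concept of the present invention, to make spelling by phonics easier, English words are segmented more finely, into one or more spelling units that each serve as a basic pronunciation unit or a fixed combined pronunciation unit. These include single vowels, single consonants, compound vowels, compound consonants, and specific pronunciation combinations, and any spelling unit obtained may be any one of these. Here, whether a spelling unit counts as a single vowel, single consonant, compound vowel, or compound consonant is determined by the type of its pronunciation. A spelling unit pronounced as a vowel (single or compound) may correspond to a vowel letter or to a letter combination that contains consonant letters; likewise, a spelling unit pronounced as a consonant (single or compound) may correspond to a consonant letter or to a letter combination that contains vowel letters. In addition, a specific pronunciation combination is a combination with a conventionally fixed pronunciation in English, and it may combine vowel and consonant sounds; for example, [ʃn] (tion) contains both a vowel sound and a consonant sound. In this document, any general reference to a vowel pronunciation may mean a single vowel or a compound vowel, and any general reference to a consonant pronunciation may mean a single consonant or a compound consonant.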
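For example, the word delay can be segmented into four spelling units, [d] (d), [i] (e), [l] (l), and [ei] (ay), where [ei] (ay) is a compound vowel.
Because the phonetic transcription of any word already marks the pronunciation of each component of the word, the transcription can be used to segment the word into spelling units. For example, according to the phonetic transcription [di'lei] of the word delay, the word is segmented into the four spelling units d, e, l, and ay.
The spelling units defined here may be letters or letter combinations corresponding to basic pronunciation units (such as single vowels, single consonants, compound vowels, and compound consonants) or to fixed combined pronunciation units (such as specific pronunciation combinations).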
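The segmentation of step S120 can be illustrated with a deliberately simplified sketch. It assumes a small hand-written table of multi-letter spelling units and matches greedily on spelling alone, whereas the text above describes segmentation guided by the phonetic transcription. The table `MULTI_LETTER_UNITS` and the function `segment` are hypothetical names introduced here.

```python
# Hypothetical table of multi-letter spelling units, taken from the
# letter combinations mentioned in this description.
MULTI_LETTER_UNITS = {
    "tion", "cial", "rial", "tual",
    "ay", "ei", "ou", "ue", "ew", "ar", "or", "er",
    "th", "kn", "wh", "wr",
}
MAX_UNIT_LEN = max(len(unit) for unit in MULTI_LETTER_UNITS)

def segment(word):
    """Greedy longest-match split of a word into spelling units (spelling only)."""
    units = []
    lower = word.lower()
    i = 0
    while i < len(word):
        # Try the longest multi-letter unit first, down to length 2.
        for size in range(min(MAX_UNIT_LEN, len(word) - i), 1, -1):
            if lower[i:i + size] in MULTI_LETTER_UNITS:
                units.append(word[i:i + size])
                i += size
                break
        else:
            # No multi-letter unit starts here: the single letter is a unit.
            units.append(word[i])
            i += 1
    return units
```

A spelling-only match will mis-split words in which a listed letter pair is not pronounced as one unit, which is one reason the description relies on pronunciation data for this step.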
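In step S130, pronunciation annotation is performed on at least one of the spelling units according to the pronunciation data, so that the pronunciation annotation is fused into the English text data.
Any spelling unit obtained by segmentation can be annotated according to its characteristics. For example, the text of a spelling unit that is a single vowel, single consonant, compound vowel, or compound consonant can be annotated according to its composition and/or its spelling characteristics, so that the annotation is fused into the English text data as a spelling hint that helps the reader spell it out.
Specifically, the following processing can be performed on a spelling unit to annotate it: according to the pronunciation data, adding a display mark to the spelling unit, and/or adding a spelling prompt mark to the spelling unit.
According to an optional embodiment of the present invention, adding a display mark to a spelling unit according to the pronunciation data may include: according to the pronunciation data corresponding to the spelling unit, color-marking the spelling unit, adding an underline mark, and adding a cross-out mark.
According to an optional embodiment of the present invention, adding a spelling prompt mark to a spelling unit according to the pronunciation data includes: according to the pronunciation data corresponding to the spelling unit, adding a corresponding spelling prompt symbol mark or spelling prompt number mark to a spelling unit that has at least two pronunciations.
Various exemplary pronunciation annotations of spelling units are described in detail later with reference to FIG. 3 to FIG. 12.
In step S140, the English text data fused with the pronunciation annotations of the spelling units is provided, so that the reader can read out the English text directly by means of the pronunciation annotations.
The English text data fused with the pronunciation annotations of the spelling units can be written into a designated file, displayed directly, or sent over a network to a designated host or server.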
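One way to picture the "write to a designated file" option of step S140 is to render the display marks as HTML. This is only a sketch under assumptions: the patent does not name an output format, and the mark names (`"color"`, `"underline"`, `"strike"`), the red color, and both function names are hypothetical.

```python
import html

def render_html(units):
    """Render (letters, marks) pairs as an HTML fragment.

    `marks` is a set drawn from the hypothetical names
    "color", "underline" and "strike".
    """
    parts = []
    for letters, marks in units:
        out = html.escape(letters)
        if "strike" in marks:
            out = f"<s>{out}</s>"          # cross-out mark
        if "underline" in marks:
            out = f"<u>{out}</u>"          # underline mark
        if "color" in marks:
            out = f'<span style="color:red">{out}</span>'  # color mark
        parts.append(out)
    return "".join(parts)

def write_annotated(units, path):
    """Write the rendered fragment to the designated file."""
    with open(path, "w", encoding="utf-8") as handle:
        handle.write(render_html(units))
```

The same `units` list could equally be passed to a display routine or serialized for sending to a host or server, the other two options the paragraph mentions.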
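Through the processing of steps S110 to S140, English text of different granularities, such as single words and phrases, can be segmented into finer-grained spelling units; each spelling unit is given a pronunciation annotation according to the pronunciation data, so that the annotations, serving as pronunciation hints, are fused into the English text data; and the English text data fused with the annotations is provided, so that a reader can read out the text directly from the embedded hints. This raises phonics from a tool that is traditionally about eighty percent reliable to a method that can be relied on one hundred percent, and effectively helps a learner with basic training to read out the pronunciation of words accurately and confidently through phonics, without consulting a dictionary. More importantly, annotated text draws the learner's at-a-glance observation of an English word down from the whole word to the syllables that make it up. The meaning of a word is ultimately tied to its form, so close at-a-glance observation of word forms helps learners naturally perceive regular variations in spelling while reading, which improves their feel for English words, builds intuition for meaning during reading, and strengthens their overall sense of the language.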
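Various exemplary processing of step S130 is described below with reference to FIG. 3 to FIG. 12.
According to an optional embodiment of the present invention, for the letter corresponding to the single vowel in a monosyllabic English word, the vowel letter can be marked in a color different from the other letters of the word. For example, as shown in FIG. 3 and FIG. 4, in each displayed word the vowel letter (such as a in lab and e in men) is displayed in a color different from the other letters (l, b, m, and n).
According to an optional embodiment of the present invention, correspondingly different spelling prompt symbol marks are added to the letter of a single vowel pronounced as a long vowel and to the letter of a single vowel pronounced as a short vowel. For example, in the schematic diagram of FIG. 3, the letter in a spelling unit pronounced as a short vowel (such as a in lab) is given a spelling prompt mark of its own, while in the schematic diagram of FIG. 4, the letter in a spelling unit pronounced as a long vowel (such as o in joke) is given the spelling prompt mark "-".
According to an optional embodiment of the present invention, a cross-out mark is also added to the letter corresponding to a silent vowel at the end of a word. In all the words shown in FIG. 4, the silent e is crossed out to indicate that this letter need not be pronounced.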
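The long-vowel, short-vowel, and silent-letter marks described above can be approximated in plain text with Unicode combining characters. The choice of characters is an assumption made for this sketch: a combining macron stands in for the "-" mark, a combining breve for the short-vowel mark (whose glyph is not reproduced in this text), and a combining long stroke overlay for the cross-out.

```python
# Assumed stand-ins for the marks in FIG. 3 and FIG. 4.
SHORT_MARK = "\u0306"   # combining breve
LONG_MARK = "\u0304"    # combining macron, standing in for "-"
SILENT_MARK = "\u0336"  # combining long stroke overlay (cross-out)

MARKS = {"short": SHORT_MARK, "long": LONG_MARK, "silent": SILENT_MARK}

def annotate_vowels(units):
    """Join (letter, kind) pairs, appending the mark for each kind.

    `kind` is "short", "long", "silent", or None for an unmarked letter.
    """
    return "".join(letter + MARKS.get(kind, "") for letter, kind in units)
```

For the word joke, the call `annotate_vowels([("j", None), ("o", "long"), ("k", None), ("e", "silent")])` yields the letters j, o with a macron, k, and e with a stroke through it.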
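In addition, an underline mark can be added to the letter combination corresponding to a compound vowel in an English word. For example, in the schematic diagrams of FIG. 5 to FIG. 8, the letter combinations ar, or, er, ei, ue, ew, ou, and so on, which correspond to compound vowels, are all underlined. The spelling prompt marks for other compound vowels are not listed one by one here.
Similarly, an underline mark can be added to the letter combination corresponding to a compound consonant in an English word. For example, in the schematic diagrams of FIG. 9 and FIG. 10, the letter combinations kn, wh, wr, th, and so on, which correspond to compound consonants, are all underlined. The spelling prompt marks for other compound consonants are not listed one by one here.
According to an optional embodiment of the present invention, a corresponding spelling prompt mark is added to the letter combination corresponding to a compound vowel with at least two pronunciations. For example, in the schematic diagram of FIG. 6, the letter combination ei, which corresponds to a compound vowel, has two pronunciations, [i:] and [ei]. Because [i:] is treated as the usual pronunciation of ei, the spelling prompt number mark "2" is added to the letter combination ei in words where it is pronounced [ei].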
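The number-mark convention in the ei example (no mark for the usual pronunciation, a number for each alternative) can be sketched as a table lookup. The table `PRONUNCIATIONS` holds only the ei entry given in the text; the function name and the plain-digit rendering of the mark are assumptions.

```python
# Pronunciations listed with the usual one first, following the ei example.
PRONUNCIATIONS = {"ei": ["i:", "ei"]}

def number_mark(letters, sound):
    """Append a number mark when `sound` is not the usual pronunciation."""
    options = PRONUNCIATIONS.get(letters)
    if not options or sound not in options:
        return letters            # unknown unit or sound: leave unmarked
    index = options.index(sound)
    # Index 0 is the usual pronunciation and gets no mark;
    # the second pronunciation gets "2", and so on.
    return letters if index == 0 else f"{letters}{index + 1}"
```

This follows the ei example only; the th example in FIG. 10 numbers both pronunciations, so a fuller implementation would need a per-unit policy.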
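As another example, in the schematic diagram of FIG. 7, the letter combination ue, which corresponds to a compound vowel, has two pronunciations, [u:] and [ju:]. One spelling prompt mark is added to the letter combination ue in words where it is pronounced [u:], and another spelling prompt mark is added to the letter combination ue in words where it is pronounced [ju:].
As a further example, the letter combination ou has six pronunciations and is a relatively complex, special spelling unit. As shown in FIG. 8, a different spelling prompt mark is added for each of these six pronunciations, including symbol marks (such as "-" and "^", among others) and phonetic symbol marks (such as [ʌ]). Silent letters can also be crossed out where appropriate, for example the u in would and the o in ouph.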
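Similarly, a corresponding spelling prompt mark can be added to the letter combination corresponding to a compound consonant with at least two pronunciations. For example, in the schematic diagram of FIG. 10, the letter combination th, which corresponds to a compound consonant, has two pronunciations, [θ] and [ð]. The spelling prompt number mark "1" is added to the letter combination th in words where it is pronounced [ð], and the spelling prompt number mark "2" is added to the letter combination th in words where it is pronounced [θ].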
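According to an optional embodiment of the present invention, a corresponding spelling prompt symbol mark is added to the letter corresponding to a softened consonant, the letters corresponding to softened consonants including the letter c or the letter g. When the letter c comes before the vowel letters e and i, its pronunciation is softened to [s]. Accordingly, as shown in FIG. 11, the spelling prompt mark "~" is added above these softened letters c to tell the reader to use the soft sound. Similarly, when the letter g comes before the vowel letters e, i, and y, its pronunciation is softened to [dʒ], and the spelling prompt mark "~" is likewise added above these softened letters g.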
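The soft c and soft g rule can be sketched in two parts: a spelling-based search for candidate positions, and a marking step that places a combining tilde over the chosen letters. Both function names are hypothetical, and the combining tilde is an assumed plain-text stand-in for the "~" mark of FIG. 11.

```python
TILDE = "\u0303"  # combining tilde, standing in for the "~" mark

# Letters that may be softened, and the following letters that trigger it.
SOFTENERS = {"c": "ei", "g": "eiy"}

def soft_candidates(word):
    """Indices where c or g is followed by a softening vowel letter."""
    lower = word.lower()
    return [
        i for i in range(len(lower) - 1)
        if lower[i] in SOFTENERS and lower[i + 1] in SOFTENERS[lower[i]]
    ]

def mark_soft(word, indices):
    """Place the soft mark over the letters at the given indices."""
    chosen = set(indices)
    return "".join(ch + TILDE if i in chosen else ch for i, ch in enumerate(word))
```

The spelling rule only yields candidates: `soft_candidates("get")` returns `[0]` even though that g is hard, which is why the method decides the final marking from the pronunciation data rather than from spelling alone.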
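According to an optional embodiment of the present invention, an underline mark is added to the letter combination corresponding to a specific pronunciation combination. For example, in the schematic diagram of FIG. 12, the letter combinations "cial", "rial", "tual", and so on, which have specific pronunciations, are all underlined to prompt the reader to spell out each combination as a whole.
FIG. 13 shows an example of English text annotated by the aforementioned spelling and marking method. As can be seen, the letters or letter combinations corresponding to compound vowels, compound consonants, and specific pronunciation combinations are underlined, silent letters in the spelling units are crossed out, and spelling prompt marks are added to spelling units with at least two pronunciations. When spelling out these units, the reader can use the annotations to read out the English text easily.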
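An embodiment of the present invention also provides a computer-readable storage medium storing a program that performs the steps of any of the aforementioned spelling and marking methods for English text.
In addition, an embodiment of the present invention provides a computer program product including at least one executable instruction which, when executed by a processor, implements any of the aforementioned spelling and marking methods for English text.
FIG. 2 is a flowchart showing a spelling method for English text according to other embodiments of the present invention. Here, spelling is performed on English text annotated by the method shown in FIG. 1.
Referring to FIG. 2, in step S210, English text data fused with pronunciation annotations is obtained, where at least one spelling unit in the English text data carries a pronunciation annotation applied by the aforementioned spelling and marking method for English text.
For example, the English text data fused with pronunciation annotations may be read from a storage medium such as a hard disk, an optical disc, or a flash drive.
In step S220, each of the spelling units is spelled out according to the pronunciation annotations.
Because every spelling unit with a special pronunciation carries a spelling hint, each spelling unit can be spelled out easily from its pronunciation annotation.
As described above, the pronunciation annotation optionally includes a display mark and/or a spelling prompt mark of the spelling unit.
As described above, the display mark optionally includes a color mark, an underline mark, and a cross-out mark; the spelling prompt mark includes a spelling prompt symbol mark or a spelling prompt number mark.
Optionally, step S220 includes: for a spelling unit that has not been given a pronunciation annotation, spelling out the spelling unit according to its conventional pronunciation; and for a spelling unit that has been given a pronunciation annotation, distinguishing the spelling unit according to its display mark, and/or spelling out the distinguished spelling unit according to its spelling prompt mark.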
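An embodiment of the present invention also provides a computer-readable storage medium storing a program that performs the steps of any of the aforementioned spelling methods for English text.
In addition, an embodiment of the present invention provides a computer program product including at least one executable instruction which, when executed by a processor, implements any of the aforementioned spelling methods for English text.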
以下参照图15描述本发明一些实施例的英语文本的拼读标注装置。
如图15所示,一种英语文本的拼读标注装置1500包括第一获取模块1510、切分模块1520、发音标注模块1530和提供模块1540。
第一获取模块1510用于获取待处理的英语文本数据以及所述英语文本数据的发音数据。
可选地,所述英语文本数据为单词、短语、语句片段、整个语句或一个或多个文本段落。
切分模块1520用于将第一获取模块1510获取的英语文本数据切分为至少一个拼读单元,所述拼读单元包括形成基本发音单位或组合发音单位的一个或多个字母。
发音标注模块1530用于根据所述发音数据,分别对至少一个所述拼读单元进行发音标注,以使所述发音标注融合在所述英语文本数据中。
可选地,所述拼读单元包括与单元音、单辅音、复合元音、复合辅音或特定发音组合对应的字母或字母组合。
可选地,发音标注模块1530用于:根据所述发音数据,对所述拼读单元进行显示标记,以及/或者,根据所述发音数据,对所述拼读单元添加拼读提示标记。
可选地,发音标注模块1530用于:根据所述拼读单元中的所述发音数据,将所述拼读单元进行颜色标记、添加下划线标记以及添加划掉标记。
可选地,发音标注模块1530用于:根据所述拼读单元中的所述发音数据,对具有至少两种发音的拼读单元添加相应的拼读提示符号标记或拼读提示数字标记。
可选地,发音标注模块1530用于:根据所述发音数据,对软化发音的辅音对应的字母添加相应的拼读提示符号标记,所述软化发音的辅音对应的字母包括字母c或字母g;以及/或者,根据所述发音数据,对哑辅音对应的字母组合添加下划线标记;以及/或者,根据所述发音数据,对复合辅音对应的字母组合添加下划线标记;以及/或者,根据所述发音数据,对特定发音组合对应的字母组合添加下划线标记;以及/或者,根据所述发音数据,对具有至少两种发音的复合辅音对应的字母组合添加相应的拼读提示数字标记或加注相应的提示发音音标。
可选地,发音标注模块1530用于:根据所述发音数据,对具有长元音发音和短元音发音的单元音对应的字母进行添加颜色标记,并且对长元音发音的所述单元音对应的字母和短元音发音的所述单元音对应的字母分别添加相应不同的拼读提示符号标记;以及/或者,根据所述发音数据,对不发音的单词尾部元音对应的字母添加划掉标记;以及/或者,根据所述发音数据,对复合元音对应的字母组合添加下划线标记;以及/或者,根据所述发音数据,对具有至少两种发音的复合元音对应的字母组合添加相应的拼读提示标记。
提供模块1540用于提供融合有对各个所述拼读单元进行发音标注的所述英语文本数据,以使读者能够通过所述发音标注直接读出英语文本。
可选地,提供模块1540用于:将所述融合有对各个所述拼读单元进行发音标注的所述英语文本数据写入指定的文件;以及/或者,显示所述融合有对各个所述拼读单元进行发音标注的所述英语文本数据。
根据本发明实施例的任一英语文本的拼读标注装置能够实现与前述任一英语文本的拼读标注方法相同的效果,在此不予赘述。
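The four-module structure of the annotation apparatus 1500 (acquiring module 1510, segmentation module 1520, pronunciation marking module 1530, providing module 1540) can be pictured as a simple pipeline. The class below is a hypothetical sketch: the field names, type signatures, and the `run` method are assumptions, and each module is reduced to a plain callable.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

@dataclass
class AnnotationApparatus:
    """Hypothetical pipeline mirroring modules 1510, 1520, 1530 and 1540."""
    acquire: Callable[[], Tuple[str, dict]]      # text plus pronunciation data
    segment: Callable[[str, dict], List[str]]    # text -> spelling units
    mark: Callable[[List[str], dict], List[str]] # units -> annotated units
    provide: Callable[[List[str]], str]          # annotated units -> output

    def run(self) -> str:
        text, pronunciation = self.acquire()
        units = self.segment(text, pronunciation)
        marked = self.mark(units, pronunciation)
        return self.provide(marked)
```

A toy configuration shows the data flow; upper-casing vowels stands in for a real display mark:

```python
apparatus = AnnotationApparatus(
    acquire=lambda: ("lab", {"lab": "læb"}),
    segment=lambda text, pron: list(text),
    mark=lambda units, pron: [u.upper() if u in "aeiou" else u for u in units],
    provide="".join,
)
```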
以下参照图16描述本发明一些实施例的英语文本的拼读装置。
参照图16,一种英语文本的拼读装置1600包括第二获取模块1610和拼读模块1620。
第二获取模块用于获取融合有发音标注的英语文本数据,所述英语文本数据中的至少一个拼读单元进行过如前所述英语文本的拼读标注装置的发音标注。
拼读模块1620用于根据所述发音标注拼读出各个所述拼读单元。
可选地,所述发音标注包括所述拼读单元的显示标记和/或拼读提示标记。
可选地,所述显示标记包括颜色标记、下划线标记以及划掉标记;所述拼读提示标记包括拼读提示符号标记或拼读提示数字标记。
可选地,拼读模块1620包括:第一拼读单元,用于对于未进行过发音标注的拼读单元,按照常规读法拼读出所述拼读单元;第二拼读单元,用于对于进行过发音标注的拼读单元,根据所述拼读单元的显示标记区分出所述拼读单元,并且/或者,根据所述拼读提示标记,对区分出的拼读单元进行拼读。
根据本发明实施例的任一英语文本的拼读装置能够实现与前述任一英语文本的拼读方法相同的效果,在此不予赘述。
本发明实施例还提供了一种电子设备。图14是示出根据就本发明实施例的电子设备的结构示意图。该电子设备可以是例如移动终端、个人计算机(PC)、平板电脑、服务器等。下面参考图14,其示出了适于用来实现本发明实施例的图像处理装置的电子设备的结构示意图:如图14所示,电子设备可以包括存储器和处理器。具体地,电子设备包括一个或多个处理器、通信元件等,所述一个或多个处理器例如:一个或多个中央处理单元(CPU)1401,和/或一个或多个图像处理器(GPU)1413等,处理器可以根据存储在只读存储器(ROM)1402中的可执行指令或者从存储部分1408加载到随机访问存储器(RAM)1403中的可执行指令而执行各种适当的动作和处理。通信元件包括通信组件1412和/或通信接口1409。其中,通信组件1412可包括但不限于网卡,所述网卡可包括但不限于IB(Infiniband)网卡,通信接口1409包括诸如LAN卡、调制解调器等的网络接口卡的通信接口,通信接口1409经由诸如因特网的网络执行通信处理。
处理器可与只读存储器1402和/或随机访问存储器1403中通信以执行可执行指令,通过通信总线1404与通信组件1412相连、并经通信组件1412与其他目标设备通信,从而完成本发明实施例提供的任一项基于广播的防丢检测方法对应的操作,例如,获取待处理的英语文本数据以及所述英语文本数据的发音数据;将所述英语文本数据切分为至少一个拼读单元,所述拼读单元包括形成基本发音单位或组合发音单位的一个或多个字母;根据所述发音数据,分别对至少一个所述拼读单元进行发音标注,以使所述发音标注融合在所述英语文本数据中;提供融合有对各个所述拼读单元进行发音标注的所述英语文本数据,以使读者能够通过所述发音标注直接读出英语文本。
此外,在RAM 1403中,还可存储有装置操作所需的各种程序和数据。CPU 1401或GPU 1413、ROM 1402以及RAM 1403通过通信总线1404彼此相连。在有RAM 1403的情况下,ROM 1402为可选模块。RAM 1403存储可执行指令,或在运行时向ROM 1402中写入可执行指令,可执行指令使处理器执行上述通信方法对应的操 作。输入/输出(I/O)接口1405也连接至通信总线1404。通信组件1412可以集成设置,也可以设置为具有多个子模块(例如多个IB网卡),并在通信总线链接上。
以下部件连接至I/O接口1405:包括键盘、鼠标等的输入部分1406;包括诸如阴极射线管(CRT)、液晶显示器(LCD)等以及扬声器等的输出部分1407;包括硬盘等的存储部分1408;以及包括诸如LAN卡、调制解调器等的网络接口卡的通信接口1409。驱动器1410也根据需要连接至I/O接口1405。可拆卸介质1411,诸如磁盘、光盘、磁光盘、半导体存储器等等,根据需要安装在驱动器1410上,以便于从其上读出的计算机程序根据需要被安装入存储部分1408。
需要说明的是,如图14所示的架构仅为一种可选实现方式,在具体实践过程中,可根据实际需要对上述图14的部件数量和类型进行选择、删减、增加或替换;在不同功能部件设置上,也可采用分离设置或集成设置等实现方式,例如GPU和CPU可分离设置或者可将GPU集成在CPU上,通信元件可分离设置,也可集成设置在CPU或GPU上,等等。这些可替换的实施方式均落入本公开的保护范围。
特别地,根据本发明实施例,上文参考流程图描述的过程可以被实现为计算机软件程序。例如,本发明实施例包括一种计算机程序产品,其包括有形地包含在机器可读介质上的计算机程序,计算机程序包含用于执行流程图所示的方法的程序代码,程序代码可包括对应执行本发明实施例提供的方法步骤对应的指令,例如,用于获取待处理的英语文本数据以及所述英语文本数据的发音数据的可执行代码;用于将所述英语文本数据切分为至少一个拼读单元,所述拼读单元包括形成基本发音单位或组合发音单位的一个或多个字母的可执行代码;用于根据所述发音数据,分别对至少一个所述拼读单元进行发音标注,以使所述发音标注融合在所述英语文本数据中的可执行代码;用于提供融合有对各个所述拼读单元进行发音标注的所述英语文本数据,以使读者能够通过所述发音标注直接读出英语文本的可执行代码。再例如,用于获取融合有发音标注的英语文本数据的可执行代码,所述英语文本数据中的至少一个拼读单元进行过如前任一英语文本的拼读标注方法的发音标注;用于根据所述发音标注拼读出各个所述拼读单元的可执行代码。
在这样的实施例中,该计算机程序可以通过通信元件从网络上被下载和安装,和/或从可拆卸介质1411被安装。在该计算机程序被中央处理单元(CPU)1401执行时,执行本发明实施例的方法中限定的上述功能。
本发明实施例的电子设备可以用于实现上述实施例中相应的英语文本的拼读标注方法或拼读方法,该电子设备中的各个器件可以用于执行上述方法实施例中的各个步骤,例如,上文中描述的英语文本的拼读标注方法或拼读方法可以通过电子设备的处理器调用存储器存储的相关指令来实现,为了简洁,在此不再赘述。
需要指出,根据实施的需要,可将本申请中描述的各个部件/步骤拆分为更多部件/步骤,也可将两个或多个部件/步骤或者部件/步骤的部分操作组合成新的部件/步骤,以实现本发明实施例的目的。
可能以许多方式来实现本公开的方法和装置、电子设备和存储介质。例如,可通过软件、硬件、固件或者软件、硬件、固件的任何组合来实现本发明实施例的方法和装置、电子设备和存储介质。用于方法的步骤的上述顺序仅是为了进行说明,本发明实施例的方法的步骤不限于以上具体描述的顺序,除非以其它方式特别说 明。此外,在一些实施例中,还可将本公开实施为记录在记录介质中的程序,这些程序包括用于实现根据本发明实施例的方法的机器可读指令。因而,本公开还覆盖存储用于执行根据本发明实施例的方法的程序的记录介质。
本发明实施例的描述是为了示例和描述起见而给出的,而并不是无遗漏的或者将本公开限于所公开的形式,很多修改和变化对于本领域的普通技术人员而言是显然的。选择和描述实施例是为了更好说明本公开的原理和实际应用,并且使本领域的普通技术人员能够理解本公开从而设计适于特定用途的带有各种修改的各种实施例。

Claims (29)

  1. 一种英语文本的拼读标注方法,包括:
    获取待处理的英语文本数据以及所述英语文本数据的发音数据;
    将所述英语文本数据切分为至少一个拼读单元,所述拼读单元包括形成基本发音单位或组合发音单位的一个或多个字母;
    根据所述发音数据,分别对至少一个所述拼读单元进行发音标注,以使所述发音标注融合在所述英语文本数据中;
    提供融合有对各个所述拼读单元进行发音标注的所述英语文本数据,以使读者能够通过所述发音标注直接读出英语文本。
  2. 根据权利要求1所述的方法,其中,所述根据所述发音数据,对所述拼读单元发音标注包括:
    根据所述发音数据,对所述拼读单元进行显示标记,以及/或者,
    根据所述发音数据,对所述拼读单元添加拼读提示标记。
  3. 根据权利要求2所述的方法,其中,所述根据所述发音数据,对所述拼读单元进行显示标记,包括:
    根据所述拼读单元中的所述发音数据,将所述拼读单元进行颜色标记、添加下划线标记以及添加划掉标记。
  4. 根据权利要求2所述的方法,其中,所述根据所述发音数据,对所述拼读单元添加拼读提示标记,包括:
    根据所述拼读单元中的所述发音数据,对具有至少两种发音的拼读单元添加相应的拼读提示符号标记或拼读提示数字标记。
  5. 根据权利要求1~4任一项所述的方法,其中,所述拼读单元包括与单元音、单辅音、复合元音、复合辅音或特定发音组合对应的字母或字母组合。
  6. 根据权利要求5所述的方法,其中,所述根据所述发音数据,分别对至少一个所述拼读单元进行发音标注,以使所述发音标注融合在所述英语文本数据中,包括:
    根据所述发音数据,对软化发音的辅音对应的字母添加相应的拼读提示符号标记,所述软化发音的辅音对应的字母包括字母c或字母g;以及/或者,
    根据所述发音数据,对哑辅音对应的字母组合添加下划线标记;以及/或者,
    根据所述发音数据,对复合辅音对应的字母组合添加下划线标记;以及/或者,
    根据所述发音数据,对特定发音组合对应的字母组合添加下划线标记;以及/或者,
    根据所述发音数据,对具有至少两种发音的复合辅音对应的字母组合添加相应的拼读提示数字标记或加注相应的提示发音音标。
  7. 根据权利要求5所述的方法,其中,所述根据所述发音数据,分别对至少一个所述拼读单元进行发音标注,以使所述发音标注融合在所述英语文本数据中,包括:
    根据所述发音数据,对具有长元音发音和短元音发音的单元音对应的字母进行添加颜色标记,并且对长元音发音的所述单元音对应的字母和短元音发音的所述单元音对应的字母分别添加相应不同的拼读提示符号标记;以及/或者,
    根据所述发音数据,对不发音的单词尾部元音对应的字母添加划掉标记;以及/或者,
    根据所述发音数据,对复合元音对应的字母组合添加下划线标记;以及/或者,
    根据所述发音数据,对具有至少两种发音的复合元音对应的字母组合添加相应的拼读提示标记。
  8. 根据权利要求1~4任一项所述的方法,其中,所述提供融合有对各个所述拼读单元进行发音标注的所述英语文本数据,包括:
    将所述融合有对各个所述拼读单元进行发音标注的所述英语文本数据写入指定的文件;以及/或者,
    显示所述融合有对各个所述拼读单元进行发音标注的所述英语文本数据。
  9. 根据权利要求1~4任一项所述的方法,其中,所述英语文本数据为单词、短语、语句片段、整个语句或一个或多个文本段落。
  10. 一种英语文本的拼读方法,包括:
    获取融合有发音标注的英语文本数据,所述英语文本数据中的至少一个拼读单元进行过如权利要求1~9任一项所述的英语文本的拼读标注方法的发音标注;
    根据所述发音标注拼读出各个所述拼读单元。
  11. 根据权利要求10所述的方法,其中,所述发音标注包括所述拼读单元的显示标记和/或拼读提示标记。
  12. 根据权利要求11所述的方法,其中,所述显示标记包括颜色标记、下划线标记以及划掉标记;所述拼读提示标记包括拼读提示符号标记或拼读提示数字标记。
  13. 根据权利要求12所述的方法,其中,所述根据所述发音标注拼读出各个所述拼读单元,包括:
    对于未进行过发音标注的拼读单元,按照常规读法拼读出所述拼读单元;
    对于进行过发音标注的拼读单元,根据所述拼读单元的显示标记区分出所述拼读单元,并且/或者,根据所述拼读提示标记,对区分出的拼读单元进行拼读。
  14. 一种英语文本的拼读标注装置,包括:
    获取模块,用于获取待处理的英语文本数据以及所述英语文本数据的发音数据;
    切分模块,用于将所述英语文本数据切分为至少一个拼读单元,所述拼读单元包括形成基本发音单位或组合发音单位的一个或多个字母;
    发音标注模块,用于根据所述发音数据,分别对至少一个所述拼读单元进行发音标注,以使所述发音标注融合在所述英语文本数据中;
    提供模块,用于提供融合有对各个所述拼读单元进行发音标注的所述英语文本数据,以使读者能够通过所述发音标注直接读出英语文本。
  15. 根据权利要求14所述的装置,其中,所述发音标注模块用于:
    根据所述发音数据,对所述拼读单元进行显示标记,以及/或者,
    根据所述发音数据,对所述拼读单元添加拼读提示标记。
  16. 根据权利要求15所述的装置,其中,所述发音标注模块用于:
    根据所述拼读单元中的所述发音数据,将所述拼读单元进行颜色标记、添加下 划线标记以及添加划掉标记。
  17. 根据权利要求15所述的装置,其中,所述发音标注模块用于:
    根据所述拼读单元中的所述发音数据,对具有至少两种发音的拼读单元添加相应的拼读提示符号标记或拼读提示数字标记。
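  18. The apparatus according to any one of claims 14 to 17, wherein the spelling unit comprises a letter or letter combination corresponding to a single vowel, a single consonant, a compound vowel, a compound consonant, or a specific pronunciation combination.
  19. The apparatus according to claim 18, wherein the pronunciation annotation module is configured to:
    add, according to the pronunciation data, a corresponding spelling hint symbol mark to a letter corresponding to a consonant with a softened pronunciation, the letter corresponding to the consonant with a softened pronunciation including the letter c or the letter g; and/or
    add, according to the pronunciation data, an underline mark to a letter combination corresponding to a silent consonant; and/or
    add, according to the pronunciation data, an underline mark to a letter combination corresponding to a compound consonant; and/or
    add, according to the pronunciation data, an underline mark to a letter combination corresponding to a specific pronunciation combination; and/or
    add, according to the pronunciation data, a corresponding spelling hint number mark or a corresponding hint phonetic symbol to a letter combination corresponding to a compound consonant that has at least two pronunciations.
  20. The apparatus according to claim 18, wherein the pronunciation annotation module is configured to:
    add, according to the pronunciation data, a color mark to a letter corresponding to a single vowel that has both a long-vowel pronunciation and a short-vowel pronunciation, and add respectively different spelling hint symbol marks to the letter of the single vowel pronounced as a long vowel and to the letter of the single vowel pronounced as a short vowel; and/or
    add, according to the pronunciation data, a strikethrough mark to a letter corresponding to a silent vowel at the end of a word; and/or
    add, according to the pronunciation data, an underline mark to a letter combination corresponding to a compound vowel; and/or
    add, according to the pronunciation data, a corresponding spelling hint mark to a letter combination corresponding to a compound vowel that has at least two pronunciations.
  21. The apparatus according to any one of claims 14 to 17, wherein the providing module is configured to:
    write the English text data merged with the pronunciation annotation of each of the spelling units to a specified file; and/or
    display the English text data merged with the pronunciation annotation of each of the spelling units.
  22. The apparatus according to any one of claims 14 to 17, wherein the English text data is a word, a phrase, a sentence fragment, a whole sentence, or one or more text paragraphs.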
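  23. A spelling apparatus for English text, comprising:
    an acquisition module, configured to acquire English text data merged with pronunciation annotations, at least one spelling unit in the English text data having been annotated by the spelling annotation apparatus for English text according to any one of claims 14 to 22;
    a spelling module, configured to spell out each of the spelling units according to the pronunciation annotations.
  24. The apparatus according to claim 23, wherein the pronunciation annotation comprises a display mark and/or a spelling hint mark of the spelling unit.
  25. The apparatus according to claim 24, wherein the display mark comprises a color mark, an underline mark, and a strikethrough mark; and the spelling hint mark comprises a spelling hint symbol mark or a spelling hint number mark.
  26. The apparatus according to claim 25, wherein the spelling module comprises:
    a first spelling unit, configured to, for a spelling unit that has not been annotated, spell out the spelling unit according to its regular pronunciation;
    a second spelling unit, configured to, for a spelling unit that has been annotated, distinguish the spelling unit according to its display mark, and/or spell out the distinguished spelling unit according to the spelling hint mark.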
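  27. A computer-readable storage medium storing computer program instructions, wherein the program instructions, when executed by a processor, implement the steps of the spelling annotation method for English text according to any one of claims 1 to 9.
  28. A computer-readable storage medium storing computer program instructions, wherein the program instructions, when executed by a processor, implement the steps of the spelling method for English text according to any one of claims 10 to 14.
  29. An electronic device, comprising: a processor, a memory, a communication element, and a communication bus, wherein the processor, the memory, and the communication element communicate with one another via the communication bus;
    the memory is configured to store at least one executable instruction, and the executable instruction causes the processor to perform operations corresponding to the spelling annotation method for English text according to any one of claims 1 to 9.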
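
The annotation steps recited in the claims above (underlining multi-letter units, striking out a silent word-final vowel, and adding a hint symbol to a softened c or g) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and is not taken from the application: the `SpellingUnit` class, the `annotate` and `render` functions, the use of HTML `<u>` and `<s>` tags as display marks, and the `*` character as the spelling hint symbol. The sketch also assumes that the text has already been segmented into spelling units and that each unit carries its pronunciation data.

```python
from dataclasses import dataclass, field

VOWEL_LETTERS = set("aeiou")
# Hypothetical table: letter -> its softened sound (IPA).
SOFT_SOUNDS = {"c": "s", "g": "dʒ"}


@dataclass
class SpellingUnit:
    letters: str                      # one letter or a letter combination
    sound: str                        # pronunciation data; "" means silent
    marks: list = field(default_factory=list)


def annotate(units):
    """Attach display marks and hint marks to already segmented units."""
    for index, unit in enumerate(units):
        is_last = index == len(units) - 1
        # Letter combinations that form one sound get an underline mark.
        if len(unit.letters) > 1 and unit.sound:
            unit.marks.append("underline")
        # A silent vowel at the end of the word gets a strikethrough mark.
        if is_last and unit.letters in VOWEL_LETTERS and not unit.sound:
            unit.marks.append("strike")
        # A softened c or g gets a spelling hint symbol mark.
        if SOFT_SOUNDS.get(unit.letters) == unit.sound:
            unit.marks.append("soft")
    return units


def render(units):
    """Merge the marks into the text so the result can be shown or saved."""
    parts = []
    for unit in units:
        text = unit.letters
        if "underline" in unit.marks:
            text = f"<u>{text}</u>"
        if "strike" in unit.marks:
            text = f"<s>{text}</s>"
        if "soft" in unit.marks:
            text = f"{text}*"         # "*" is an illustrative hint symbol
        parts.append(text)
    return "".join(parts)


ship = [SpellingUnit("sh", "ʃ"), SpellingUnit("i", "ɪ"), SpellingUnit("p", "p")]
place = [
    SpellingUnit("p", "p"), SpellingUnit("l", "l"), SpellingUnit("a", "eɪ"),
    SpellingUnit("c", "s"), SpellingUnit("e", ""),
]
print(render(annotate(ship)))   # <u>sh</u>ip
print(render(annotate(place)))  # plac*<s>e</s>
```

Keeping the marks on each unit separate from the rendering step means the same annotated units could be written to a file or displayed with a different mark style, which corresponds to the two ways of providing the annotated text recited in claim 8.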
PCT/CN2020/097548 2019-07-04 2020-06-22 Spelling annotation method for English text, spelling method, apparatus therefor, storage medium, and electronic device WO2021000756A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910598740.2 2019-07-04
CN201910598740.2A CN110442839A (zh) 2019-07-04 2019-07-04 Spelling annotation method for English text, spelling method, storage medium, and electronic device

Publications (1)

Publication Number Publication Date
WO2021000756A1 (zh) 2021-01-07

Family

ID=68428510

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/097548 WO2021000756A1 (zh) 2019-07-04 2020-06-22 Spelling annotation method for English text, spelling method, apparatus therefor, storage medium, and electronic device

Country Status (3)

Country Link
CN (1) CN110442839A (zh)
TW (1) TW202103121A (zh)
WO (1) WO2021000756A1 (zh)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
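CN110442839A (zh) * 2019-07-04 2019-11-12 陈俪 Spelling annotation method for English text, spelling method, storage medium, and electronic device
CN111681467B (zh) * 2020-06-01 2022-09-23 广东小天才科技有限公司 Vocabulary learning method, electronic device, and storage medium
CN112906360A (zh) * 2021-02-05 2021-06-04 李四艳 Spelling annotation method and apparatus for English text
CN113257234A (zh) * 2021-04-15 2021-08-13 北京百度网讯科技有限公司 Method and apparatus for generating a dictionary and for speech recognition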

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
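CN1376963A (zh) * 2001-03-27 2002-10-30 肖水清 Phonetically annotated English and input method therefor
CN1959764A (zh) * 2006-11-06 2007-05-09 张伟 English spelling method and spelling tool therefor
CN101636774A (zh) * 2007-06-29 2010-01-27 李如云 Method for segmenting English words and marking stress, and application thereof
US20100209895A1 (en) * 2009-01-24 2010-08-19 Ricciardi Geoffrey S Playing Cards with the Added Function of Teaching and Learning English Phonics
CN108352126A (zh) * 2015-11-11 2018-07-31 株式会社Mglish Foreign-language pronunciation and marking device and method thereof, including a motion learning device and motion learning method based on a foreign-language rhythm motion sensor that use the device and method, and an electronic medium and learning materials recording the same
CN110442839A (zh) * 2019-07-04 2019-11-12 陈俪 Spelling annotation method for English text, spelling method, storage medium, and electronic device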

Also Published As

Publication number Publication date
CN110442839A (zh) 2019-11-12
TW202103121A (zh) 2021-01-16

Similar Documents

Publication Publication Date Title
WO2021000756A1 (zh) Spelling annotation method for English text, spelling method, apparatus therefor, storage medium, and electronic device
Moran et al. The Unicode Cookbook for Linguists: Managing writing systems using orthography profiles
Trudgill et al. International English: A guide to the varieties of standard English
Taouka et al. The cognitive processes involved in learning to read in Arabic
Lüpke Orthography development
AU2011335900B2 (en) Text conversion and representation system
Cutler Representation of second language phonology
Probert et al. Word recognition strategies amongst isiXhosa/English bilingual learners: The interaction of orthography and language of learning and teaching
US20240203396A1 (en) Unambiguous phonics system
Yin et al. Unspoken knowledge: kindergarteners are sensitive to patterns in Chinese pinyin before formally learning it
Kan Colloquial Chinese: the complete course for beginners
WO2009003308A2 (fr) Method for stress marking and word segmentation in English, and associated application
Protopapas From diacritics to the mental lexicon
Ahmed Different types of spelling errors made by Kurdish EFL learners and their potential causes.
Nag Learning to read Kannada and other languages of South Asia
Iyengar Variation in Perso-Arabic and Devanāgarī Sindhī orthographies: An overview
Al-Jarf Absence of vowels in the English spelling of Arabic personal names on social media
Nakamura et al. Biliteracy spelling acquisition in akshara and English
Garcia et al. The Role of Diacritics in Reading Urdu. Can children read without “the dots”?
Oladiipo et al. Spelling Error Patterns in Typed Yorùbá Text Documents
Garabík et al. A cross linguistic database of children's printed words in three Slavic languages
Zohirovna Implementation of different language teaching methods in culture
Metruk Determining the priority in vocabulary when learning English through electronic dictionaries
B. Silva The Importance of General and Academic Vocabulary Learning
TWI621105B (zh) English pronunciation prompting system and method thereof

Legal Events

Date Code Title Description
121 EP: The EPO has been informed by WIPO that EP was designated in this application (Ref document number: 20834810; Country of ref document: EP; Kind code of ref document: A1)
NENP Non-entry into the national phase (Ref country code: DE)
122 EP: PCT application non-entry in European phase (Ref document number: 20834810; Country of ref document: EP; Kind code of ref document: A1)