Embodiment
The present invention is described in further detail below through specific embodiments, in conjunction with the accompanying drawings.
First embodiment
Fig. 1 is a flowchart of the assisted recitation document display method provided by the first embodiment of the present invention. The method comprises the following steps:
Step 100: when a start request is received, determine the type of the start request, and display the target document in the first progress mode according to that type;
Step 200: perform speech recognition on the collected speech, taking the recognition unit as the unit of recognition, and display the recognized recognition units in the second progress mode according to the speech recognition result;
Step 300: determine whether the displayed target document still contains a recognition unit that has not been recognized; if so, return to step 200; if not, the display of this document ends.
The assisted recitation document display method of this embodiment can be implemented by an assisted recitation document display system. The document display system comprises the equipment generally used for displaying documents, for example a memory, a control device, a display screen and buttons. The target document is an article selected by the learner and stored in the document display system in advance. The learner can input a start request to the document display system through a keyboard, touch-screen buttons or the like, after which the document display system, under the control of the control device, performs the document display operation in step with the learner's progress in reading aloud or reciting.
When a start request is received, the document display system first displays the target document in the first progress mode, then recognizes the collected speech by speech recognition technology, and displays the recognized content in the second progress mode. The speech recognition operation may specifically be as follows: speech of a certain duration or quantity is collected, for example the speech of several words, several phrases or several sentences, and the collected speech is then recognized phrase by phrase or as a whole. Whether the recognition result is correct, however, is judged per recognition unit; for example, one word may be set as the recognition unit, or one phrase may be set as the recognition unit. After speech collected intermittently or continuously has been recognized as a whole or piece by piece, whether a given recognition unit in it is correct is judged. When the target document is in English, one or several words, or a phrase, can be set as the recognition unit. A phrase here is a unit with an independent meaning and a minimal number of words; it may also be a single word, and is generally predefined according to context.
The first progress mode and the second progress mode can take various concrete forms. One option is to distinguish progress by cursor position; Fig. 2 is a schematic diagram of distinguishing display progress by cursor position in the first embodiment of the present invention. In the first progress mode the displayed document content lies after the current cursor position, and in the second progress mode the displayed document content lies before the current cursor position. Alternatively, the first progress mode and the second progress mode can distinguish display progress by different colors.
Alternatively, the first progress mode can display a recognition unit with a placeholder in place of its text, and the second progress mode directly displays the text of the recognition unit. Fig. 3 is a schematic diagram of distinguishing display progress with placeholders and text in the first embodiment of the present invention. That is, the first progress mode does not display the text content and shows only the position that the text occupies; for example, placeholders such as blanks, colors or patterns can be displayed in place of the text. The text content is then revealed progressively according to the learner's recitation progress, so that the learner can check whether the recitation is correct.
In addition, the first progress mode and the second progress mode can take many other forms and combinations; for example, the display mode can be an emphasized form such as highlighting, bold type, italic type, color or enlargement. In short, the first progress mode and the second progress mode are different display modes, so that the current progress can be distinguished.
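As an illustration only, the cursor-based and placeholder-based progress modes described above could be rendered as plain text roughly as follows. The function names, the "|" cursor character and the "_" placeholder character are assumptions made for this sketch and do not come from the embodiment.

```python
from typing import List


def render_with_cursor(units: List[str], done: int, cursor: str = "|") -> str:
    """Cursor style: recognized units sit before the cursor, the rest after it."""
    before = " ".join(units[:done])   # second progress mode
    after = " ".join(units[done:])    # first progress mode
    return f"{before} {cursor} {after}".strip()


def render_with_placeholders(units: List[str], done: int, fill: str = "_") -> str:
    """Placeholder style: unrecognized units keep their width but hide their text."""
    shown = units[:done]                             # text is displayed directly
    hidden = [fill * len(u) for u in units[done:]]   # only the occupied position remains
    return " ".join(shown + hidden)
```

Here `done` is the number of recognition units already recognized; any richer styling (color, bold, enlargement) would replace the string formatting in a real display layer.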
As shown in Fig. 4, step 200 in this embodiment specifically comprises the following steps:
Step 210a: the document display system performs speech recognition on the collected speech for the set current unit to be recognized, taking the recognition unit as the unit of recognition, and judges whether the speech recognition result is correct; if so, step 220a is executed; if not, step 230a is executed. Initially, the current unit to be recognized can be the first word or phrase, or the word or phrase at a position selected by the learner;
Step 220a: display the recognized recognition unit in the second progress mode, set the next recognition unit as the current unit to be recognized, and proceed to step 300;
Step 230a: display an error flag, and return to step 210a.
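The loop formed by steps 210a, 220a, 230a and 300 can be sketched as below. This is a simplified model under stated assumptions: speech recognition is replaced by a list of already recognized strings (`heard`), correctness is an exact case-insensitive match, and the event names "shown" and "error" are hypothetical labels for the second progress mode and the error flag.

```python
from typing import List, Tuple


def recite_units(units: List[str], heard: List[str]) -> List[Tuple[str, str]]:
    """Walk through `units`, consuming one recognized utterance per attempt.

    Returns (event, unit) pairs in the order the display would change.
    """
    events: List[Tuple[str, str]] = []
    current = 0  # index of the current unit to be recognized
    for utterance in heard:
        if current >= len(units):      # step 300: nothing left to recognize
            break
        target = units[current]
        if utterance.strip().lower() == target.lower():   # step 210a
            events.append(("shown", target))              # step 220a
            current += 1                                  # next unit becomes current
        else:
            events.append(("error", target))              # step 230a, then retry
    return events
```

Note that a wrong utterance leaves `current` unchanged, which mirrors the requirement that the unit stays in the to-be-recognized state until it is recognized correctly.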
Speech recognition technology is generally technology that lets a machine convert a speech signal into corresponding text or commands through a process of recognition and understanding. During recognition, the acoustic model and the language model act together, and the word string that maximizes a certain probability is taken as the recognition result. Whether the result is an error can be judged from its similarity to the unit to be recognized according to a prescribed rule. For example, if the text recognized from the collected speech differs from the text to be recognized, the result is judged to be an error.
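The embodiment does not specify the similarity rule. One possible rule, shown here purely as an assumption, compares the recognized text with the expected unit using the character-level ratio from Python's standard `difflib` module; the function names and the 0.8 threshold are illustrative choices, not values from the source.

```python
from difflib import SequenceMatcher


def normalize(text: str) -> str:
    """Lower-case the text and collapse runs of whitespace."""
    return " ".join(text.lower().split())


def is_correct(recognized: str, expected: str, threshold: float = 0.8) -> bool:
    """Judge a recognition result against the unit to be recognized.

    An exact match after normalization always passes; otherwise the
    character-level similarity ratio must reach `threshold`.
    """
    a, b = normalize(recognized), normalize(expected)
    if a == b:
        return True
    return SequenceMatcher(None, a, b).ratio() >= threshold
```

Setting `threshold=1.0` reduces this to the strict rule given in the text, where any difference between the recognized text and the expected text counts as an error.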
In this embodiment, display progress can be determined by the correctness of the speech recognition result. When the current unit to be recognized is correctly recognized, that is, when the content the learner utters is correct and the pronunciation is standard, the content that has been read is displayed in the second progress mode. When the learner does not pronounce correctly, the document display system can display an error flag and prompt the learner to pronounce again. In a specific implementation, a status flag can be set to mark whether a recognition unit is the current unit to be recognized. When recognition is correct, the current unit to be recognized is updated to the next recognition unit; when recognition is not correct, the recognition unit remains in the current to-be-recognized state until it is correctly recognized.
In the above technical solution, the current recognized unit is the most recently recognized current unit to be recognized: once the current unit to be recognized has been correctly recognized, it becomes the current recognized unit, and the next recognition unit becomes the current unit to be recognized. After a recognition unit is correctly recognized, it can also be displayed in the first difference mode for a certain time, for example by displaying the phrase just correctly recognized in highlighted, bold or italic form; display in the first difference mode stops when a first trigger event is detected. Fig. 5 is a schematic diagram of distinguishing a correctly recognized recognition unit in bold in the first embodiment of the present invention.
The error flag displayed in step 230a above can also take various forms. A preferred implementation is:
Display the current unit to be recognized in the second difference mode; after a second trigger event is detected, stop the display in the second difference mode and return to step 210a.
The second trigger event can be reaching a set timer value; for example, timing is started, and the second trigger event is generated when the timer value is detected to have reached the set value. Alternatively, correct recognition of the recognition unit can serve as the second trigger event.
The second difference mode and the first difference mode can be the same or different, and preferably use different display modes. For example, when the first difference mode is bold display, the second difference mode can be highlighted display. Fig. 6 is a schematic diagram of distinguishing an incorrectly recognized recognition unit by highlighting in the first embodiment of the present invention.
In practical applications, the pre-stored standard pronunciation corresponding to the recognition unit can additionally be played when the error flag is displayed, to prompt or correct the learner's pronunciation.
Step 300 in this embodiment judges whether the displayed target document has been completely recognized. Specifically, either the document being displayed entirely in the second progress mode, or the failure to obtain a next recognition unit, can serve as the end marker. When step 300 determines that the displayed target document contains no unrecognized recognition unit, step 400 can further be executed: the recognition units that were displayed in the second difference mode are displayed in the third difference mode.
This technical solution displays once more, all together, the content that the learner read incorrectly or pronounced in a non-standard way. The third difference mode and the second difference mode can be the same or different, and can specifically be an emphasized form, for example highlighting, bold, italics or enlargement.
As shown in Fig. 7, while the target document is displayed in step 200, the following steps can also be included:
Step 210b: display the set current continuing unit in the fourth difference mode. The continuing unit can be a complete sentence, a phrase or a word; for a complete sentence, the start position of the sentence can be determined by a setting or by recognizing punctuation marks;
Step 220b: judge whether the current continuing unit still contains an unrecognized recognition unit; if so, continue speech recognition; if not, set the next continuing unit as the current continuing unit, so as to update the content displayed in the fourth difference mode.
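Steps 210b and 220b can be sketched as follows for the case where the continuing unit is a sentence. The sketch assumes English text whose sentences end in ".", "!" or "?", and both function names are hypothetical; the embodiment itself only says that sentence boundaries may be set or found from punctuation.

```python
import re
from typing import List


def split_sentences(text: str) -> List[str]:
    """Split text into sentences at whitespace that follows ., ! or ?."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def current_sentence_index(sentence_sizes: List[int], recognized: int) -> int:
    """Return the index of the sentence holding the first unrecognized unit.

    `sentence_sizes[i]` is the number of recognition units in sentence i and
    `recognized` is the count of units recognized so far. The result equals
    len(sentence_sizes) once every unit has been recognized.
    """
    total = 0
    for i, size in enumerate(sentence_sizes):
        total += size
        if recognized < total:   # step 220b: this sentence still has work left
            return i
    return len(sentence_sizes)
```

The index returned by `current_sentence_index` identifies which sentence should be displayed in the fourth difference mode at any moment.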
This technical solution emphasizes the content that the learner currently needs to read aloud or recite, providing a prompt and an aid. The emphasis on the current continuing unit moves with the learner's progress and therefore adapts well to the learner's pace of study.
The above embodiment involves several display modes. To distinguish them clearly, they are introduced one by one:
The first progress mode and the second progress mode are used to distinguish the progress of current recognition. For example, in the above embodiment, the first progress mode displays the text of a recognition unit after the current cursor position, the second progress mode displays the text of a recognition unit before the current cursor position, and the recognition unit is specifically a phrase;
The first difference mode is used to distinguish recognition units that have been correctly recognized;
The second difference mode is used to distinguish recognition units that could not be correctly recognized, that is, incorrectly recognized recognition units;
The third difference mode is used, after recognition of the target document has finished, to display together the recognition units that were incorrectly recognized and shown in the second difference mode; it can use the same display mode as the second difference mode;
The fourth difference mode is used to distinguish the content of the current continuing unit.
The first, second, third and fourth difference modes can each be any emphasized form, for example highlighting, bold, italics, underlining, enlarged font or changed color. The four difference modes can be the same, but different emphasized forms are preferably used to make them easier to tell apart.
In implementing the above embodiment, the collected speech can further be saved while it is being collected, and the saved speech can be played back according to the document display progress or a playback request input by the learner, so that the learner can review.
When a click operation by the learner is recognized, the standard pronunciation of the recognition unit at the clicked position can also be played, so that the learner can compare against the standard pronunciation.
With the technical solution of the above embodiment, display progress in the document closely follows the learner's progress in reading aloud or reciting, based on the speech recognition result; the method adapts well to the learner's use and gives a better prompting effect. In addition, this embodiment can provide different display schemes according to the learner's pronunciation, automatically giving different prompts for correct and incorrect pronunciation, so the auxiliary functions are richer and more proactive.
The above embodiment divides the content of the target document into recognition units and continuing units, and can use set display schemes to emphasize set information according to the speech recognition result; the various display schemes can be implemented independently or in combination. To meet learners' needs in reading aloud and reciting from several angles, such as the language itself and the logic of memorization, and to provide more intelligent auxiliary functions that follow the learner's progress, the embodiments of the present invention also provide the following preferred embodiments of the assisted recitation document display method for different situations.
Second embodiment
Fig. 8 is a flowchart of the assisted recitation document display method provided by the second embodiment of the present invention. This embodiment can incorporate the technical means used in the first embodiment above, and is specifically suited to the situation in which the learner practices reading aloud. Different study situations can be distinguished by different types of start request; for example, different buttons can be provided to distinguish the type of start request, the document display system identifies the type of start request, and subsequent operations are completed according to the identified type.
In this embodiment, the recognition unit is a phrase and the continuing unit is a sentence; the first progress mode displays the text of a phrase after the current cursor position, and the second progress mode displays the text of a phrase before the current cursor position.
The specific steps of this embodiment are as follows:
Step 111: receive a start request; when the type of the start request is identified as a read-aloud start request, display the text of each phrase in the target document in the first progress mode according to the read-aloud start request, specifically by displaying the text of each phrase after the current cursor position;
Step 211: display the set current continuing sentence in the fourth difference mode; in this embodiment the fourth difference mode is specifically set to highlighting, and Fig. 9 is a schematic diagram of highlighting the current continuing sentence in the second embodiment of the present invention;
Step 212: perform speech recognition on the collected speech for the set current phrase to be recognized, taking the phrase as the recognition unit, and judge whether the speech recognition result is correct; if so, execute step 213; if not, execute step 215;
Step 213: display the current phrase to be recognized before the current cursor position, and judge whether the current continuing sentence contains an unrecognized phrase; if so, set the next phrase as the current phrase to be recognized and execute step 212; otherwise execute step 214;
Step 214: stop displaying the current continuing sentence in the fourth difference mode, that is, cancel the highlighting, and proceed to step 311;
Step 215: display the current phrase to be recognized in the second difference mode; after a second trigger event is detected, stop the display in the second difference mode and return to step 212. The second trigger event can specifically be an event such as reaching a set timer value. In this embodiment the second difference mode differs from the fourth difference mode and is specifically set to underlining; Fig. 10 is a schematic diagram of displaying an incorrectly recognized phrase with an underline in the second embodiment of the present invention. When the current phrase to be recognized is displayed in the second difference mode in this step, the standard pronunciation corresponding to the current phrase to be recognized can also be played, to remind the learner or correct the learner's pronunciation;
Step 311: judge whether any phrase exists after the current continuing sentence; if so, set the next sentence as the current continuing sentence and return to step 211; otherwise, end the display flow.
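The control flow of steps 211 to 311 can be sketched as a small event generator. This is a simplified model, not the embodiment's implementation: recognized speech is supplied as an iterable of strings, correctness is an exact match, the second trigger event is taken to be the next attempt, and the event names ("highlight", "reveal", "underline", "unhighlight") are hypothetical labels for the fourth difference mode, the second progress mode and the second difference mode.

```python
from typing import Iterable, List, Tuple


def read_aloud(sentences: List[List[str]], heard: Iterable[str]) -> List[Tuple[str, str]]:
    """Return the sequence of display events for a read-aloud session.

    `sentences` is a list of sentences, each a list of phrases.
    """
    log: List[Tuple[str, str]] = []
    stream = iter(heard)
    for sentence in sentences:                          # step 311 selects each sentence
        text = " ".join(sentence)
        log.append(("highlight", text))                 # step 211
        for phrase in sentence:
            while True:
                utterance = next(stream, None)
                if utterance is None:                   # no more speech collected
                    return log
                if utterance == phrase:                 # step 212: correct
                    log.append(("reveal", phrase))      # step 213
                    break
                log.append(("underline", phrase))       # step 215, then back to 212
        log.append(("unhighlight", text))               # step 214
    return log
```

A phrase that was ever logged as "underline" is a candidate for the unified review in the third difference mode once the whole document has been read.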
In step 213 above, when the current phrase to be recognized is correctly recognized and the recognized phrase is displayed in the second progress mode, the current phrase to be recognized can also be displayed in the first difference mode according to the read-aloud start request; after a first trigger event is detected, the display in the first difference mode stops. The first difference mode is preferably different from both the second difference mode and the fourth difference mode.
On the basis of this embodiment, when step 311 determines that no phrase exists after the current continuing sentence, step 400 can also be executed: the phrases that were displayed in the second difference mode are displayed in the third difference mode.
The technical solution of this embodiment provides a document display scheme suited to helping the learner read a document aloud, correct pronunciation and study the document closely. The learner can click to enter the read-aloud mode, and a recording can be made while speech is collected, for later playback. When the document display system identifies a read-aloud start request, it automatically distinguishes the sentence that the learner currently needs to read aloud, collects speech and performs recognition to display progress, and the learner reads aloud sentence by sentence under the text prompt. Real-time speech analysis automatically judges whether the speech read aloud is correct. When the reading is correct, the progress through the current continuing text is displayed sentence by sentence; phrases that are read incorrectly or mispronounced are emphasized, and the standard pronunciation of the incorrectly read phrase can be played automatically as a prompt, requiring the learner to read that phrase again. After the learner finishes a sentence, whether reading is complete can be judged from the display progress; if not, reading continues with the next sentence. Once the whole document has been read, the phrases that were read incorrectly or mispronounced during reading can be emphasized, so that the learner gets an overall sense of the reading result, reviews the reading process and reinforces memory.
Third embodiment
Fig. 11 is a flowchart of the assisted recitation document display method provided by the third embodiment of the present invention. This embodiment can incorporate the technical means of the first embodiment and is specifically suited to the clue memory method. The type of the start request is a clue-memory start request, the first progress mode displays each word with a placeholder, and the second progress mode displays the text of each word. The placeholder can be a blank figure or any other figure that can cover the original text, preserving only the position that the original text occupied. The method of this embodiment specifically comprises the following steps:
Step 121: when a clue-memory start request is received, display each word in the target document with a placeholder according to the clue-memory start request, and display the text of the set key phrases in the target document. The key phrases are predefined; for example, a key phrase can be the first phrase of a sentence or an important predicate;
Step 221: perform speech recognition on the collected speech for the set current phrase to be recognized, taking the phrase as the recognition unit, and judge whether the speech recognition result is correct; if so, execute step 222; if not, execute step 223;
Step 222: display the text of the current phrase to be recognized, and proceed to step 321;
Step 223: display an error flag; when a click operation is recognized, display the phrase at the clicked position and start timing; when the timer value is detected to have reached the first hiding set value, display the phrase at the clicked position with a placeholder again, and return to step 221;
Step 321: judge whether a phrase exists after the current phrase to be recognized; if so, set the next phrase as the current phrase to be recognized and return to step 221; otherwise, end the display flow.
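The display state of the clue memory method (step 121 together with step 222) can be sketched as a single rendering function. The function name, the set-of-indices representation and the "_" placeholder are assumptions for illustration only.

```python
from typing import List, Set


def clue_view(phrases: List[str], key: Set[int], recognized: Set[int],
              fill: str = "_") -> str:
    """Render the document for the clue memory method.

    Key phrases (indices in `key`) and correctly recited phrases (indices in
    `recognized`) show their text; every other phrase is replaced by a
    placeholder of the same width, so only its position remains visible.
    """
    out = []
    for i, phrase in enumerate(phrases):
        visible = i in key or i in recognized
        out.append(phrase if visible else fill * len(phrase))
    return " ".join(out)
```

For example, with `phrases = ["Once", "upon a time", "there was", "a king"]` and `key = {0}`, only "Once" is shown at the start; adding index 1 to `recognized` after a correct recitation reveals "upon a time" as well.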
The error flag can specifically be an error prompt sound or an error prompt pattern. Preferably, the text of the current phrase to be recognized is displayed, and after a second trigger event is detected, the current phrase to be recognized is displayed with a placeholder. For example, the text of the incorrectly recognized phrase can be shown for a period of time as a prompt and then hidden.
In step 321, after it is determined that no phrase exists after the current phrase to be recognized, the method can further comprise: displaying, in the third difference mode, the phrases that were incorrectly recognized and/or the phrases that were clicked and shown.
This embodiment helps the learner understand the thread of the document and grasp the central points that govern the whole text. After the document display system receives a clue-memory start request, only the key-phrase clues in the whole target document are displayed, and the other phrases are replaced with placeholders. The clues are the key phrases, the key points of the document; they help the learner better understand the meaning of the document and grasp its key points, and the learner can readily recall the surrounding context from these clues. When, during study and recitation, the learner needs the text of a phrase other than the key phrases, the learner can click the position of that phrase; the phrase at that position is then shown automatically and hidden automatically after a set time, for example after 5 seconds. This better helps the learner recite word by word and sentence by sentence. This embodiment performs real-time speech recognition analysis while collecting speech, and automatically displays the text of a phrase when it is correctly recognized. After the whole document has been recognized, the text of the whole document is presented, and the phrases the learner clicked during memorization and the phrases recited incorrectly are emphasized, indicating that the learner needs to concentrate on these phrases and helping the learner memorize the document better. A recording can be made while speech is collected, for playback, so that the learner can review the memorization process. The key phrases in the document can be marked manually one by one in advance, for example the first phrase of each sentence, connective phrases or semantically key phrases such as adverbials of time or place, all of which help the learner recite. The document display system can use a text editing tool to tag the phrases that need to be displayed with clue marks, following the format prescribed by the program script.
Fourth embodiment
Fig. 12 is a flowchart of the assisted recitation document display method provided by the fourth embodiment of the present invention. This embodiment can incorporate the technical means of the first embodiment and is specifically suited to sentence-by-sentence recitation. The type of the start request is a sentence-by-sentence recitation start request, the first progress mode displays each word with a placeholder after the current cursor position, and the second progress mode displays each word with a placeholder before the current cursor position. The method of this embodiment specifically comprises the following steps:
Step 131: when a sentence-by-sentence recitation start request is received, display the target document in the first progress mode according to that request, that is, display each word with a placeholder after the current cursor position;
Step 231: perform speech recognition on the collected speech for the set current phrase to be recognized, taking the phrase as the recognition unit, and judge whether the speech recognition result is correct; if so, execute step 232; if not, execute step 234;
Step 232: display the current phrase to be recognized in the second progress mode, that is, display it with a placeholder before the current cursor position, and judge whether the current continuing sentence contains an unrecognized phrase; if so, set the next phrase as the current phrase to be recognized and execute step 231; if not, execute step 233;
Step 233: display the text of the current continuing sentence, and execute step 331;
Step 234: display an error flag; when a click operation is recognized, display the text of the current continuing sentence and start timing; when the timer value is detected to have reached the second hiding set value, display the text of the current continuing sentence with placeholders, and return to step 231;
Step 331: judge whether a sentence exists after the current continuing sentence; if so, set the next sentence as the current continuing sentence, set the next phrase as the current phrase to be recognized, and return to step 231; otherwise, end the display flow.
This embodiment specifically helps the learner recite sentence by sentence. When the document display system receives a sentence-by-sentence recitation start request, the whole target document is displayed with blank placeholders, with the punctuation of the text retained. The learner starts reciting the document sentence by sentence while real-time speech recognition analysis is performed. When the current continuing sentence is recognized as correctly recited, its text is displayed and speech recognition analysis moves on to the next sentence. When the learner needs a prompt for some content, the learner can click anywhere or click that position; the document display system then displays the text and hides it after a certain time, for example after 5 seconds. When the text of the current continuing sentence is displayed, the standard pronunciation of the sentence can also be played, so that the learner memorizes the sentence both visually and aurally. A recording can be made while speech is collected, and after sentence-by-sentence recitation ends, recording playback can be offered so that the learner can review the memorization process.
Further, in this embodiment, the sentences that were clicked and shown can also be displayed together in the third difference mode after recitation is finished, to focus memorization on them.
Fifth embodiment
Fig. 13 is a flowchart of the assisted recitation document display method provided by the fifth embodiment of the present invention. This embodiment can incorporate the technical means of the first embodiment and is specifically suited to prompted recitation. The first progress mode displays each word with a placeholder after the current cursor position, and the second progress mode displays each word with a placeholder before the current cursor position. This embodiment specifically comprises the following steps:
Step 141: when a prompted-recitation start request is received, display the target document in the first progress mode according to that request, that is, display each word with a placeholder after the current cursor position;
Step 241: perform speech recognition on the collected speech for the set current phrase to be recognized, taking the phrase as the recognition unit, and judge whether the speech recognition result is correct; if so, execute step 242; if not, execute step 244;
Step 242: display the current phrase to be recognized in the second progress mode, that is, display it with a placeholder before the current cursor position, and judge whether the current continuing sentence contains an unrecognized phrase; if so, set the next phrase as the current phrase to be recognized and execute step 241; if not, execute step 243;
Step 243: display the text of the set key phrases in the current continuing sentence, and execute step 341;
Step 244: display an error flag; when a click operation is recognized, display the text of the set key phrases in the current continuing sentence and start timing; when the timer value is detected to have reached the third hiding set value, display the set key phrases in the current continuing sentence with placeholders, and return to step 241;
Step 341: judge whether a sentence exists after the current continuing sentence; if so, set the next sentence as the current continuing sentence, set the next phrase as the current phrase to be recognized, and return to step 241; otherwise, end the flow.
In this embodiment, after the text of the set key phrases in the current continuing sentence is displayed in step 244, the following operation can also be performed:
When a click operation is recognized, display the text of the phrase at the clicked position and start timing; when the timer value is detected to have reached the fourth hiding set value, display the phrase at the clicked position with a placeholder.
This embodiment provides a prompted-recitation situation, which can help the learner progress from a basic grasp of the document to fluent recitation. When the document display system receives a prompted-recitation start request, the whole document is displayed with blank placeholders, with the punctuation of the text retained. The learner recites the document sentence by sentence while real-time speech recognition analysis is performed. When the current continuing sentence is correctly recognized, the text of its key phrases, for example the first phrase of the sentence, is displayed, and speech recognition analysis moves on to the next sentence. When the learner needs a prompt, a first click makes the document display system show the key phrases, for example only the first phrase of the sentence, to help the learner recall the whole sentence. If further phrases need to be prompted, then when the document display system recognizes a second click in this mode, it displays the text of the phrase at the clicked position, which disappears automatically after a set time, for example after 5 seconds. This simulates glancing at the book during traditional recitation. In traditional recitation, however, glancing at the book risks seeing text that should not be seen, and finding the text to be checked can be inconvenient. This embodiment locates the required phrase quickly and shows only the phrase the learner wants to see while the other phrases stay hidden, which is more convenient during recitation.
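The click-to-reveal behaviour with a hiding timer can be sketched with a small helper class. The class name, its methods and the explicit `now` argument (seconds on any monotonic clock) are assumptions for this sketch; the 5-second default follows the example in the text, and a real system would read the clock and redraw the screen itself.

```python
from typing import Dict, Set


class TimedHints:
    """Track phrases revealed by a click and hide them after a set time."""

    def __init__(self, hide_after: float = 5.0) -> None:
        self.hide_after = hide_after            # the "hiding set value"
        self._shown_at: Dict[int, float] = {}   # phrase index -> time of last click

    def click(self, index: int, now: float) -> None:
        """Record a click on the phrase at `index` and restart its timer."""
        self._shown_at[index] = now

    def visible(self, now: float) -> Set[int]:
        """Indices whose text is still shown at time `now`."""
        return {i for i, t in self._shown_at.items() if now - t < self.hide_after}

    def ever_clicked(self) -> Set[int]:
        """Indices clicked at any time, for the review in the third difference mode."""
        return set(self._shown_at)
```

Keeping the full click history, rather than deleting entries when they expire, is what allows the clicked phrases to be emphasized together after the whole document has been recited.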
Behind the destination document end of identification, the literal that can also further show the entire chapter document, and can the 3rd difference mode highlight sentence and/or the phrase that the learner clicked, and these phrases of indication learner need be remembered emphatically, and assisted learning person better remembers document.Present embodiment also can provide the recording playback function, so that the learner understands the memory process.
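The click-to-reveal hint with timed hiding described above can be sketched as follows. This is a minimal illustration, not the implementation of the embodiment: the class name HintDisplay, the underscore placeholder, and the caller-supplied clock value `now` are all assumptions introduced here, and the 5-second default only mirrors the example value given in the text.

```python
from __future__ import annotations

from dataclasses import dataclass, field

PLACEHOLDER = "_"  # assumed placeholder glyph; the text only says "placeholder"


@dataclass
class HintDisplay:
    """Phrases stay masked; a clicked phrase is visible until its timer expires."""

    phrases: list[str]
    hide_after: float = 5.0  # assumed stand-in for the "hide setting value", in seconds
    revealed_at: dict[int, float] = field(default_factory=dict)
    clicked: set[int] = field(default_factory=set)

    def click(self, index: int, now: float) -> None:
        # Start timing when the learner clicks a masked phrase, and remember
        # the click so the phrase can be highlighted in the final review.
        self.revealed_at[index] = now
        self.clicked.add(index)

    def render(self, now: float) -> str:
        parts = []
        for i, phrase in enumerate(self.phrases):
            shown_at = self.revealed_at.get(i)
            if shown_at is not None and now - shown_at < self.hide_after:
                parts.append(phrase)  # still inside the reveal window
            else:
                parts.append(PLACEHOLDER * len(phrase))  # masked again
        return " ".join(parts)
```

Passing the time in explicitly keeps the sketch deterministic; a real display loop would presumably read a monotonic clock and redraw when the window expires.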
Sixth embodiment
Fig. 14 is a flowchart of the recitation-assisting document display method provided by the sixth embodiment of the invention. This embodiment can be combined with the technical means of the first embodiment and is specifically applicable to the challenge recitation scenario, in which the first progress mode is to display each word after the current cursor position with a placeholder, and the second progress mode is to display each word before the current cursor position with a placeholder. This embodiment specifically comprises the following steps:
Step 151, when a challenge recitation start request is received, display the target document in the first progress mode according to that request, that is, display each word after the current cursor position with a placeholder;
Step 251, taking the phrase as the recognition unit, perform a speech recognition operation on the set current phrase to be identified according to the collected voice, and judge whether the voice identification result is correct; if so, execute step 252; if not, execute step 254;
Step 252, judge whether an unidentified phrase exists in the current continuing sentence; if so, set the next phrase as the current phrase to be identified and execute step 251; if not, execute step 253;
Step 253, display the current continuing sentence in the second progress mode, that is, display the current continuing sentence with placeholders before the current cursor position, and proceed to step 351;
Step 254, display an error flag; when a click operation is recognized, display the text of the sentence at the clicked position and start timing; when the timer is found to have reached the fifth hide setting value, display the sentence at the clicked position with placeholders, and return to step 251;
Step 351, judge whether a sentence exists after the current continuing sentence; if so, set the next sentence as the current continuing sentence, set the next phrase as the current phrase to be identified, and return to step 251; otherwise, end the flow.
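The control flow of steps 251 to 351 can be sketched as a nested loop. The sketch below is illustrative only and rests on several assumptions: the function name challenge_recite is hypothetical, the speech recognizer is replaced by a plain sequence of already-recognized strings (`heard`), correctness is an exact string match, and display actions are recorded as log entries instead of being drawn. The click-and-hide branch of step 254 is omitted.

```python
from typing import Iterable, List


def challenge_recite(sentences: List[List[str]], heard: Iterable[str]) -> List[str]:
    """Walk the phrase-by-phrase flow and return a log of display events."""
    events: List[str] = []
    stream = iter(heard)
    for s_idx, sentence in enumerate(sentences):  # step 351 advances the sentence
        for phrase in sentence:  # step 252 advances the phrase
            while True:
                utterance = next(stream, None)  # step 251: next recognition result
                if utterance is None:
                    events.append("stopped")  # learner stopped before the end
                    return events
                if utterance == phrase:
                    break  # correct: move on to the next phrase
                events.append(f"error: {phrase}")  # step 254: show error flag, retry
        # Step 253: the whole sentence is done; the cursor moves past it.
        events.append(f"sentence {s_idx} done")
    events.append("end")
    return events
```

A wrongly recited phrase does not advance the flow, which matches the return from step 254 to step 251.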
In this embodiment, when the file display system recognizes a click operation, it can display the text of the sentence at the clicked position and start timing; when the timer is found to have reached the fifth hide setting value, it displays the sentence at the clicked position with placeholders again.
The technical scheme of this embodiment is applicable to the challenge recitation scenario and helps the learner check the recitation result and consolidate memory. After the file display system receives the challenge recitation start request, the whole document is displayed with blank placeholders, and only the punctuation marks are kept. The learner recites the document sentence by sentence. Through real-time speech recognition analysis, when correctly recited speech is recognized, the cursor moves to the next sentence and speech recognition analysis of the next sentence begins. When the learner needs a hint for certain content, a click operation displays the text at the corresponding position, and the text disappears after a set time, for example automatically after being shown for 3 seconds, which reduces the learner's dependence on the text. After the recitation ends, the file display system can also highlight the sentences the learner clicked in the third difference mode. In addition, this embodiment can provide a record-and-playback function to help the learner review the memorization process.
In the second to sixth embodiments above, the display prompting function can be further enriched by combining the various technical means of the first embodiment, for example:
Correctly identified phrases can be shown one by one in the first difference mode; that is, when the voice identification result is judged to be correct, the method further comprises:
displaying the current phrase to be identified in the first difference mode, and stopping display in the first difference mode after a first trigger event is detected.
Wrongly identified phrases can also be shown in the second difference mode; that is, when the voice identification result is judged to be incorrect, the method further comprises: displaying the current phrase to be identified in the second difference mode, and stopping display in the second difference mode after a second trigger event is detected. While a wrongly identified phrase is highlighted, the current phrase to be identified is not updated to the next phrase until it is correctly identified. While the current phrase to be identified is shown in the second difference mode, the standard pronunciation corresponding to it can also be played, so that the learner is prompted both visually and aurally.
When it is judged that no phrase exists after the current continuing sentence or the current phrase to be identified, step 400 can also be executed; step 400 comprises displaying, in the third difference mode, the phrases that were shown in the second difference mode and/or were displayed upon being clicked.
The current continuing content can be shown in the fourth difference mode; that is, when the target document is displayed, the method further comprises:
displaying the set current continuing unit in the fourth difference mode;
when it is judged that no unidentified phrase exists in the current continuing unit, setting the next continuing unit as the current continuing unit, so that the new current continuing unit is shown in the fourth difference mode in the next cycle.
As described above, the various difference display modes can take many forms. In the flow of each mode, the learner is not limited to clicking to obtain a hint, and can also obtain the corresponding voice playback and text display help through a mouse, buttons, shortcut keys, or similar operations.
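The final review of step 400, in which phrases that were wrongly identified or clicked are shown in the third difference mode, can be sketched as follows. The function name render_final_review and the asterisk marker are assumptions made for illustration; the text does not specify what the third difference mode looks like, so a real system might use color or underlining instead.

```python
from typing import Iterable, List


def render_final_review(
    phrases: List[str],
    wrong: Iterable[int],
    clicked: Iterable[int],
    mark: str = "*",
) -> str:
    """Show the full text, marking phrases that were misrecognized or clicked."""
    # A phrase is flagged if it appeared in the second difference mode (wrong)
    # and/or was revealed by a click; the union covers the "and/or" in step 400.
    flagged = set(wrong) | set(clicked)
    return " ".join(
        f"{mark}{phrase}{mark}" if i in flagged else phrase
        for i, phrase in enumerate(phrases)
    )
```

Indices are used instead of phrase strings so that a phrase occurring twice in the document is flagged only where the learner actually hesitated.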
The above embodiments of the recitation-assisting document display method of the present invention provide a complete set of flows needed for reading aloud and reciting a document. The document display method is not limited to English learning; it can also display other languages such as Chinese, Japanese, or French, and provides the learner with an intelligent auxiliary prompting function. Using the technical scheme of the present invention, the learner can independently and quickly finish reciting a document.
The present invention can show the learner's reading or recitation progress according to the recognition result, so it adapts well to the learner's pace, and it can further provide multiple display modes according to the usage progress and the recognition result, which enriches the prompting function. Specifically, the technical scheme of the present invention can provide several display modes: a read-aloud mode for gaining a preliminary understanding of the document content, a clue mnemonic mode that provides phrase-level memory assistance, a sentence-by-sentence recitation mode for sentence-level memory assistance, a prompt recitation mode for sentence-level prompted recitation, and a challenge recitation mode in which the learner recites entirely without the original text. By entering different start requests according to personal recitation progress and study habits, the learner can trigger the file display system to provide the corresponding document display schemes automatically. The system can thus simulate a human tutor assisting the learner in reciting the document, and its user-friendly functional design can substantially improve the efficiency and quality of recitation.
The present invention can specifically provide five modes, namely the read-aloud mode, the clue mnemonic mode, the sentence-by-sentence recitation mode, the prompt recitation mode, and the challenge recitation mode, which progress step by step from easy to difficult to achieve the purpose of assisting document recitation.
The read-aloud mode lets the learner become familiar with the document content and correct pronunciation mistakes. It uses real-time voice analysis to identify the learner's pronunciation, thereby judging the learner's reading progress and pronunciation accuracy, and provides standard-pronunciation help.
The clue mnemonic mode helps the learner better understand the document content. Through the clue thread of the document marked in advance by the system, it provides the key points and the key-phrase framework of the whole document, so that the learner can follow the clues and understand the document thoroughly. It summarizes the content of the whole document for the learner and provides phrase-level hints during recitation, helping the learner recite better.
The sentence-by-sentence recitation mode provides sentence-level hints and help. The document is divided into several sentences that are recited one sentence at a time and level by level, with sentence text hints and sentence voice hints provided during recitation to help the learner recite sentence by sentence.
The prompt recitation mode provides the first phrase of each sentence and assists the learner in moving from a basic grasp of the document to fluent recitation, letting the learner recite the document while largely detached from the original text.
The challenge recitation mode lets the learner recite the document completely without the original text, checking the recitation result while consolidating it.
All modes can provide a recording function for playback and study.
After entering the file display system, the learner can choose which mode to start from according to his or her own understanding of the document. Preferably, after the file display system receives a start request of a certain type, it follows the order of read-aloud mode, clue mnemonic mode, sentence-by-sentence recitation mode, prompt recitation mode, and challenge recitation mode: after the flow of one mode ends, the start request of the next mode is triggered in turn, so that the modes are offered to the learner in sequence. This lets the learner complete the recitation process step by step, recite faster, and make better use of the recitation method.
To better help the learner grasp the document as a whole and its semantic integrity, the present invention divides sentences at the phrase level. This keeps the learner from leaving the semantic context and reading or reciting isolated words; instead, the most basic phrase structure is preserved, so that the learner reads and recites within real semantics. This deepens the learner's understanding of the meaning of the document and makes the document easier to recite.
Seventh embodiment
Fig. 15 is a structural diagram of the recitation-assisting file display system provided by the seventh embodiment of the invention. The system comprises a clue mnemonic module 20, which specifically comprises: a second request receiving unit 21, a second initial display unit 22, a second voice recognition unit 23, a second progress display unit 24, a second error display unit 25, a second click display unit 26, and a second end judging unit 27. The second request receiving unit 21 is used to receive a clue mnemonic start request; the second initial display unit 22 is used to display each word of the target document with a placeholder according to the clue mnemonic start request, to display the text of the set key phrases in the target document, and to trigger the second end judging unit 27; the second voice recognition unit 23 is used, when triggered, to perform a speech recognition operation on the set current phrase to be identified according to the collected voice, taking the phrase as the recognition unit, and to produce a voice identification result; the second progress display unit 24 is used to display the text of the current phrase to be identified when the voice identification result is correct, and to trigger the second end judging unit 27; the second error display unit 25 is used to display an error flag when the voice identification result is incorrect, and to trigger the second voice recognition unit 23; the second click display unit 26 is used, when a click operation is recognized, in particular a click operation after the error flag has been displayed, to display the phrase at the clicked position and start timing, and, when the timer is found to have reached the first hide setting value, to display the phrase text at the clicked position with placeholders; the second end judging unit 27 is used, when triggered, to judge whether a phrase exists after the current phrase to be identified, and if so, to set the next phrase as the current phrase to be identified and trigger the second voice recognition unit 23.
The recitation-assisting file display system of this embodiment can be used to execute the flow of the recitation-assisting document display method provided by the third embodiment of the invention, specifically the clue mnemonic mode. It can present the learner with only the key phrases needed to understand the document and display the text progressively as recitation proceeds, which improves the auxiliary prompting function. The clue mnemonic module of this embodiment can also include other functional modules that implement the steps of the foregoing method embodiments, further enriching the display prompting function.
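The display rule of the clue mnemonic mode, where preset key phrases stay visible and every other phrase is masked until it is correctly recognized, can be sketched as follows. The function name render_clue_view, the index-based marking of key phrases, and the underscore placeholder are assumptions for illustration only.

```python
from typing import Iterable, List


def render_clue_view(
    phrases: List[str],
    key_indices: Iterable[int],
    recognized: Iterable[int],
    placeholder: str = "_",
) -> str:
    """Show key phrases and recognized phrases as text; mask everything else."""
    shown = set(key_indices) | set(recognized)
    parts = []
    for i, phrase in enumerate(phrases):
        if i in shown:
            parts.append(phrase)
        else:
            # Mask each non-space character so word lengths remain visible.
            parts.append("".join(" " if ch == " " else placeholder for ch in phrase))
    return " ".join(parts)
```

Calling the function again with a larger `recognized` set models the progressive display of text as recitation proceeds.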
Eighth embodiment
Fig. 16 is a structural diagram of the recitation-assisting file display system provided by the eighth embodiment of the invention. On the basis of the seventh embodiment above, this system can further comprise a read-aloud module 10, which comprises: a first request receiving unit 11, a first initial display unit 12, a first continuation display unit 13, a first voice recognition unit 14, a first progress display unit 15, a first error display unit 16, and a first end judging unit 17. The first request receiving unit 11 is used to receive a read-aloud start request; the first initial display unit 12 is used to display the text of each phrase in the target document in the first progress mode according to the read-aloud start request, and to trigger the first end judging unit 17; the first continuation display unit 13 is used to display the set current continuing sentence in the fourth difference mode; the first voice recognition unit 14 is used, when triggered, to perform a speech recognition operation on the set current phrase to be identified according to the collected voice, taking the phrase as the recognition unit, and to produce a voice identification result; the first progress display unit 15 is used, when the voice identification result is correct, to display the current phrase to be identified in the second progress mode and to judge whether an unidentified phrase exists in the current continuing sentence; if so, it sets the next phrase as the current phrase to be identified and triggers the first voice recognition unit 14; if not, it stops displaying the current continuing sentence in the fourth difference mode and triggers the first end judging unit 17; the first error display unit 16 is used, when the voice identification result is incorrect, to display the current phrase to be identified in the second difference mode, to stop displaying in the second difference mode after a second trigger event is detected, and to trigger the first voice recognition unit 14; the first end judging unit 17 is used, when triggered, to judge whether a phrase exists after the current continuing sentence, and if so, to set the next sentence as the current continuing sentence and trigger the first voice recognition unit 14 and the first continuation display unit 13.
The read-aloud module in the recitation-assisting file display system of this embodiment can be used to execute the flow of the recitation-assisting document display method provided by the second embodiment of the invention, specifically the read-aloud mode. It can display the document for the learner and highlight wrongly identified phrases and the reading progress according to the voice identification result, which improves the auxiliary prompting function. The read-aloud module of this embodiment can also include other functional modules that implement the steps of the foregoing method embodiments, further enriching the display prompting function.
Ninth embodiment
Fig. 17 is a structural diagram of the recitation-assisting file display system provided by the ninth embodiment of the invention. This embodiment can be based on the seventh or eighth embodiment and further comprises a sentence-by-sentence recitation module 30, which comprises: a third request receiving unit 31, a third initial display unit 32, a third voice recognition unit 33, a third progress display unit 34, a third error display unit 35, a third click display unit 36, and a third end judging unit 37. The third request receiving unit 31 is used to receive a sentence-by-sentence recitation start request; the third initial display unit 32 is used to display each word of the target document after the current cursor position with a placeholder according to the sentence-by-sentence recitation start request, and to trigger the third end judging unit 37; the third voice recognition unit 33 is used, when triggered, to perform a speech recognition operation on the set current phrase to be identified according to the collected voice, taking the phrase as the recognition unit, and to produce a voice identification result; the third progress display unit 34 is used, when the voice identification result is correct, to display the current phrase to be identified with placeholders before the current cursor position and to judge whether an unidentified phrase exists in the current continuing sentence; if so, it sets the next phrase as the current phrase to be identified and triggers the third voice recognition unit 33; if not, it displays the text of the current continuing sentence and triggers the third end judging unit 37; the third error display unit 35 is used to display an error flag when the voice identification result is incorrect, and to trigger the third voice recognition unit 33; the third click display unit 36 is used, when a click operation is recognized, in particular a click operation after the error flag has been displayed, to display the text of the current continuing sentence and start timing, and, when the timer is found to have reached the second hide setting value, to display the text of the current continuing sentence with placeholders; the third end judging unit 37 is used, when triggered, to judge whether a sentence exists after the current continuing sentence, and if so, to set the next sentence as the current continuing sentence, set the next phrase as the current phrase to be identified, and trigger the third voice recognition unit 33.
The sentence-by-sentence recitation module in the recitation-assisting file display system of this embodiment can be used to execute the flow of the recitation-assisting document display method provided by the fourth embodiment of the invention, specifically the sentence-by-sentence recitation mode. It can display the document for the learner sentence by sentence and highlight wrongly identified phrases and the reading progress according to the voice identification result, which improves the auxiliary prompting function. The sentence-by-sentence recitation module of this embodiment can also include other functional modules that implement the steps of the foregoing method embodiments, further enriching the display prompting function.
Tenth embodiment
Fig. 18 is a structural diagram of the recitation-assisting file display system provided by the tenth embodiment of the invention. This embodiment can be based on the seventh, eighth, or ninth embodiment and further comprises a prompt recitation module 40, which comprises: a fourth request receiving unit 41, a fourth initial display unit 42, a fourth voice recognition unit 43, a fourth progress display unit 44, a fourth error display unit 45, a fourth click display unit 46, and a fourth end judging unit 47. The fourth request receiving unit 41 is used to receive a prompt recitation start request; the fourth initial display unit 42 is used to display each word of the target document after the current cursor position with a placeholder according to the prompt recitation start request, and to trigger the fourth end judging unit 47; the fourth voice recognition unit 43 is used, when triggered, to perform a speech recognition operation on the set current phrase to be identified according to the collected voice, taking the phrase as the recognition unit, and to produce a voice identification result; the fourth progress display unit 44 is used, when the voice identification result is correct, to display the current phrase to be identified with placeholders before the current cursor position and to judge whether an unidentified phrase exists in the current continuing sentence; if so, it sets the next phrase as the current phrase to be identified and triggers the fourth voice recognition unit 43; if not, it displays the text of the set key phrase in the current continuing sentence and triggers the fourth end judging unit 47; the fourth error display unit 45 is used to display an error flag when the voice identification result is incorrect, and to trigger the fourth voice recognition unit 43; the fourth click display unit 46 is used, when a click operation is recognized, in particular a click operation after the error flag has been displayed, to display the text of the set key phrase in the current continuing sentence and start timing, and, when the timer is found to have reached the third hide setting value, to display the set key phrase in the current continuing sentence with placeholders; the fourth end judging unit 47 is used, when triggered, to judge whether a sentence exists after the current continuing sentence, and if so, to set the next sentence as the current continuing sentence, set the next phrase as the current phrase to be identified, and trigger the fourth voice recognition unit 43.
The prompt recitation module in the recitation-assisting file display system of this embodiment can be used to execute the flow of the recitation-assisting document display method provided by the fifth embodiment of the invention, specifically the prompt recitation mode. It can provide the learner with a sentence-by-sentence key-phrase prompting function and highlight wrongly identified phrases and the reading progress according to the voice identification result, which improves the auxiliary prompting function. The prompt recitation module of this embodiment can also include other functional modules that implement the steps of the foregoing method embodiments, further enriching the display prompting function.
Eleventh embodiment
Fig. 19 is a structural diagram of the recitation-assisting file display system provided by the eleventh embodiment of the invention. This embodiment can be based on the seventh, eighth, ninth, or tenth embodiment and further comprises a challenge recitation module 50, which comprises: a fifth request receiving unit 51, a fifth initial display unit 52, a fifth voice recognition unit 53, a fifth progress display unit 54, a fifth error display unit 55, a fifth click display unit 56, and a fifth end judging unit 57. The fifth request receiving unit 51 is used to receive a challenge recitation start request; the fifth initial display unit 52 is used to display each word of the target document after the current cursor position with a placeholder according to the challenge recitation start request, and to trigger the fifth end judging unit 57; the fifth voice recognition unit 53 is used, when triggered, to perform a speech recognition operation on the set current phrase to be identified according to the collected voice, taking the phrase as the recognition unit, and to produce a voice identification result; the fifth progress display unit 54 is used, when the voice identification result is correct, to judge whether an unidentified phrase exists in the current continuing sentence; if so, it sets the next phrase as the current phrase to be identified and triggers the fifth voice recognition unit 53; if not, it displays the current continuing sentence with placeholders before the current cursor position and triggers the fifth end judging unit 57; the fifth error display unit 55 is used to display an error flag when the voice identification result is incorrect, and to trigger the fifth voice recognition unit 53; the fifth click display unit 56 is used, when a click operation is recognized, in particular a click operation after the error flag has been displayed, to display the text of the sentence at the clicked position and start timing, and, when the timer is found to have reached the fifth hide setting value, to display the sentence at the clicked position with placeholders; the fifth end judging unit 57 is used, when triggered, to judge whether a sentence exists after the current continuing sentence, and if so, to set the next sentence as the current continuing sentence, set the next phrase as the current phrase to be identified, and trigger the fifth voice recognition unit 53.
The challenge recitation module in the recitation-assisting file display system of this embodiment can be used to execute the flow of the recitation-assisting document display method provided by the sixth embodiment of the invention, specifically the challenge recitation mode. It can hide the text from the learner behind placeholders and display the text only after a click, as a reminder, which improves the auxiliary prompting function. The challenge recitation module of this embodiment can also include other functional modules that implement the steps of the foregoing method embodiments, further enriching the display prompting function.
The embodiments of the recitation-assisting file display system of the present invention can be used to implement the technical scheme of any embodiment of the recitation-assisting document display method of the present invention. The functional modules that execute the above flows can be provided independently in the recitation-assisting file display system or used in combination, which enriches the ways in which the system displays documents and provides the learner with a better auxiliary prompting function.
The recitation-assisting file display system of the present invention preferably comprises the read-aloud module 10, the clue mnemonic module 20, the sentence-by-sentence recitation module 30, the prompt recitation module 40, and the challenge recitation module 50 at the same time, as shown in Fig. 20, where the units can be independent of one another or shared in combination. The modules corresponding to the modes help the learner recite the document step by step and provide user-friendly, intelligent auxiliary functions.
Preferably, the read-aloud module further comprises a clue mnemonic trigger unit, used to generate a clue mnemonic start request after recognizing that the read-aloud flow has finished and to send it to the second request receiving unit of the clue mnemonic module, automatically triggering the clue mnemonic mode. Similarly, the clue mnemonic module further comprises a sentence-by-sentence recitation trigger unit, used to generate a sentence-by-sentence recitation start request after recognizing that the clue mnemonic flow has finished and to send it to the third request receiving unit of the sentence-by-sentence recitation module; the sentence-by-sentence recitation module further comprises a prompt recitation trigger unit, used to generate a prompt recitation start request after recognizing that the sentence-by-sentence recitation flow has finished and to send it to the fourth request receiving unit of the prompt recitation module; the prompt recitation module further comprises a challenge recitation trigger unit, used to generate a challenge recitation start request after recognizing that the prompt recitation flow has finished and to send it to the fifth request receiving unit of the challenge recitation module.
With the above design, after the learner starts the workflow of the file display system, the system can automatically trigger the read-aloud module, the clue mnemonic module, the sentence-by-sentence recitation module, the prompt recitation module, and the challenge recitation module in order, providing display modes that match the learner's cognitive and memory patterns, enriching the display function, and improving the auxiliary prompting function.
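The automatic chaining of modes by the trigger units can be sketched as a simple ordered hand-off. The sketch is illustrative only: the mode identifiers, the function names next_start_request and run_from, and the `run_mode` callback standing in for each module's flow are all assumptions introduced here.

```python
from typing import Callable, List, Optional

# Assumed identifiers for the five modes, in the order given in the description.
MODE_ORDER = [
    "read_aloud",
    "clue_mnemonic",
    "sentence_by_sentence",
    "prompt_recitation",
    "challenge_recitation",
]


def next_start_request(finished_mode: str) -> Optional[str]:
    """Return the mode whose start request follows the finished one, if any."""
    i = MODE_ORDER.index(finished_mode)
    return MODE_ORDER[i + 1] if i + 1 < len(MODE_ORDER) else None


def run_from(start_mode: str, run_mode: Callable[[str], None]) -> List[str]:
    """Run the chosen mode, then each later mode in turn; return the visit order."""
    visited: List[str] = []
    mode: Optional[str] = start_mode
    while mode is not None:
        run_mode(mode)  # stands in for the module's whole flow
        visited.append(mode)
        mode = next_start_request(mode)  # the trigger unit fires the next request
    return visited
```

Starting from any mode covers the case where the learner already knows the document well enough to skip the earlier modes.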
A person of ordinary skill in the art will appreciate that all or part of the steps of the above method embodiments can be carried out by hardware under the control of program instructions. The program can be stored in a computer-readable storage medium, and when executed it performs the steps of the above method embodiments. The storage medium includes any medium that can store program code, such as ROM, RAM, a magnetic disk, or an optical disc.
Finally, it should be noted that the above embodiments are only intended to illustrate the technical scheme of the present invention, not to limit it. Although the present invention has been described in detail with reference to the foregoing embodiments, a person of ordinary skill in the art should understand that the technical schemes described in the foregoing embodiments can still be modified, or some of their technical features can be replaced by equivalents, and that such modifications or replacements do not cause the essence of the corresponding technical schemes to depart from the spirit and scope of the technical schemes of the embodiments of the present invention.