CN102346731A - File processing method and file processing device - Google Patents

Info

Publication number
CN102346731A
Authority
CN
China
Prior art keywords
word
file
character library
notes content
content
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN2010102435669A
Other languages
Chinese (zh)
Other versions
CN102346731B (en
Inventor
武亚强
张建忠
王哲鹏
徐超
王巍
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Lenovo Beijing Ltd
Original Assignee
Lenovo Beijing Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Lenovo Beijing Ltd
Priority to CN201010243566.9A (CN102346731B)
Priority to US13/813,720 (US10210148B2)
Priority to PCT/CN2011/077865 (WO2012016505A1)
Publication of CN102346731A
Application granted
Publication of CN102346731B
Legal status: Active
Anticipated expiration

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/12Use of codes for handling textual entities
    • G06F40/126Character encoding
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/166Editing, e.g. inserting or deleting
    • G06F40/169Annotation, e.g. comment data or footnotes

Abstract

The invention provides a file processing method and a file processing device, wherein the file processing method comprises the following steps of: acquiring a file, analyzing the file and obtaining a first character contained in the file; matching the first character with a preset matching character library; obtaining an annotated content corresponding to the first character when the first character meets the preset condition; and displaying the first character and the annotated content. In the file processing method and the file processing device, provided by the invention, the automatic annotation of specific characters in the file is realized, and the reading experience of a user is improved.

Description

File processing method and file processing device
Technical field
The present invention relates to the field of text processing, and in particular to a file processing method and a file processing device.
Background
When reading a file on an electronic device (such as a computer, a personal digital assistant (PDA), a mobile phone or an e-paper book), a user often encounters words that are unfamiliar or whose meaning or pronunciation is uncertain, such as uncommon words and/or polyphonic characters. Such words hinder the user's full understanding of the file content.
In the prior art, to fully understand the file content, a user who encounters, for example, an uncommon word while reading has to interrupt reading and look the word up, for example by consulting a dictionary to determine its pronunciation and meaning. Such a lookup forces the user to break off in the middle of absorbed reading, which destroys the continuity of reading and seriously degrades the reading experience.
Summary of the invention
The technical problem to be solved by the embodiments of the invention is to provide a file processing method and a file processing device that automatically annotate specific words in a file and thereby improve the user's reading experience.
To solve the above technical problem, the embodiments of the invention provide the following solutions:
A file processing method, comprising:
obtaining a file;
parsing the file to obtain a first word contained in the file;
matching the first word against a preset matching character library;
when the first word satisfies a predetermined condition, obtaining annotation content corresponding to the first word;
displaying the first word and the annotation content.
Preferably, in the above file processing method, displaying the first word and the annotation content comprises: displaying the first word according to a first display scheme having a first display effect, and displaying the annotation content according to a second display scheme having a second display effect, wherein the first display effect differs from the second display effect.
Preferably, in the above file processing method,
displaying the first word and the annotation content comprises:
obtaining the original layout of the file;
determining the display position of the annotation content relative to the first word;
judging whether there is space at the display position in the original layout to accommodate the annotation content;
when there is no space to accommodate the annotation content, re-laying out the file to obtain a new layout in which there is space at the display position to accommodate the annotation content, displaying the first word according to the new layout, and displaying the annotation content at the display position.
Preferably, in the above file processing method,
the predetermined condition is that the first word does not belong to the matching character library, or that the first word belongs to the matching character library.
Preferably, in the above file processing method,
the annotation content comprises at least one of: a phonetic notation marking the pronunciation and tone of the first word; definition information explaining the meaning of the first word; a playback control menu for controlling playback of an audio file of the pronunciation of the first word; and translation content that translates the first word into a language other than the language to which the first word belongs.
Preferably, in the above file processing method,
when the annotation content comprises the phonetic notation, the matching character library comprises a preset common character library and a preset error-prone character library, and the predetermined condition is that the first word does not belong to the common character library or that the first word belongs to the error-prone character library, wherein the common character library contains predefined commonly used words and the error-prone character library contains predefined words that are easily mispronounced.
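The combined condition can be illustrated with a minimal sketch. The function name `needs_phonetic` and the sample library contents are hypothetical and chosen only for illustration; the source does not prescribe any data structure.

```python
from typing import Set


def needs_phonetic(word: str, common: Set[str], error_prone: Set[str]) -> bool:
    """Annotate a word if it is outside the common library
    or inside the error-prone library (either condition suffices)."""
    return word not in common or word in error_prone


# Hypothetical sample libraries.
COMMON = {"人", "行"}
ERROR_PRONE = {"行"}  # a polyphonic character that is easily misread
```

Under these sample libraries, "行" is annotated even though it is common, because it is also listed as error-prone, while "人" is left unannotated.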
Preferably, in the above file processing method,
when the annotation content comprises the phonetic notation, the method further comprises, after parsing the file to obtain the first word contained in the file: performing word segmentation on the first word according to its context to obtain a word segmentation result;
obtaining the annotation content corresponding to the first word comprises: querying a predefined dictionary according to the word segmentation result to obtain the phonetic notation of the first word.
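One way context-based segmentation could resolve the reading of a polyphonic character is sketched below. This is an assumption-laden illustration, not the patent's algorithm: the longest-match strategy, the toy dictionaries `WORD_PINYIN` and `CHAR_PINYIN`, and the function `pinyin_in_context` are all hypothetical.

```python
from typing import Optional

# Hypothetical toy dictionary: multi-character words map to per-character pinyin.
WORD_PINYIN = {
    "银行": ["yín", "háng"],
    "行走": ["xíng", "zǒu"],
}
# Assumed default reading for a character that stands alone.
CHAR_PINYIN = {"行": "xíng"}


def pinyin_in_context(text: str, index: int, max_len: int = 4) -> Optional[str]:
    """Return the reading of text[index], preferring the longest
    dictionary word that covers that position."""
    for length in range(max_len, 1, -1):
        # Try every window of this length that contains position `index`.
        for start in range(max(0, index - length + 1), index + 1):
            word = text[start:start + length]
            if len(word) == length and word in WORD_PINYIN:
                return WORD_PINYIN[word][index - start]
    # No covering word found: fall back to the single-character reading.
    return CHAR_PINYIN.get(text[index])
```

With this toy data, the character "行" is read "háng" inside "银行" and "xíng" inside "行走", which is the kind of disambiguation the segmentation step is meant to provide.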
Preferably, in the above file processing method,
there are at least two predetermined character libraries, and the words contained in the predetermined character libraries are not all identical;
before obtaining the file, the method further comprises:
receiving matching character library setting information;
setting, according to the matching character library setting information, one of the at least two predetermined character libraries as the matching character library.
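A minimal sketch of selecting the matching character library from several predetermined libraries follows. The library names, their contents and the function `select_matching_library` are hypothetical; the source only requires that the libraries differ and that one is chosen from received setting information.

```python
from typing import Dict, FrozenSet, Set

# Hypothetical predetermined libraries with differing contents.
PREDETERMINED_LIBRARIES: Dict[str, Set[str]] = {
    "primary_school": {"人", "口", "手"},
    "general": {"人", "口", "手", "阅", "读"},
}


def select_matching_library(setting: str) -> FrozenSet[str]:
    """Return the predetermined library named by the received setting information."""
    if setting not in PREDETERMINED_LIBRARIES:
        raise ValueError(f"unknown library: {setting}")
    return frozenset(PREDETERMINED_LIBRARIES[setting])
```

Choosing a smaller library (for example one aimed at younger readers) causes more words to fall outside it, so more words would be annotated under a "not in library" condition.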
The embodiments of the invention also provide a file processing device, comprising:
a first obtaining unit, configured to obtain a file;
a parsing unit, configured to parse the file to obtain a first word contained in the file;
a matching unit, configured to match the first word against a preset matching character library;
an annotation unit, configured to obtain annotation content corresponding to the first word when the first word satisfies a predetermined condition;
a display unit, configured to display the first word and the annotation content.
Preferably, in the above file processing device, the display unit comprises:
an effect determining unit, configured to determine a first display scheme for the first word and a second display scheme for the annotation content, wherein the first display effect of the first display scheme differs from the second display effect of the second display scheme;
a display processing unit, configured to display the first word according to the first display scheme and to display the annotation content according to the second display scheme.
Preferably, the above file processing device further comprises:
a second obtaining unit, configured to obtain the original layout of the file;
a position determining unit, configured to determine the display position of the annotation content relative to the first word;
a judging unit, configured to judge whether there is space at the display position in the original layout to accommodate the annotation content;
a layout unit, configured to re-lay out the file, when there is no space to accommodate the annotation content, to obtain a new layout in which there is space at the display position to accommodate the annotation content;
the display unit is further configured to display the first word according to the new layout obtained by the layout unit and to display the annotation content at the display position.
Preferably, in the above file processing device,
the annotation unit is further configured to obtain the annotation content corresponding to the first word when the first word does not belong to the matching character library, or to obtain the annotation content corresponding to the first word when the first word belongs to the matching character library.
Preferably, the above file processing device further comprises:
a storage unit, configured to store the annotation content, wherein the annotation content comprises at least one of: a phonetic notation marking the pronunciation and tone of the first word; definition information explaining the meaning of the first word; a playback control menu for controlling playback of an audio file of the pronunciation of the first word; and translation content that translates the first word into a language other than the language to which the first word belongs.
Preferably, in the above file processing device,
the storage unit is further configured to store a predefined dictionary;
when the annotation content comprises the phonetic notation, the file processing device further comprises:
a word segmentation unit, configured to perform, after the parsing unit obtains the first word, word segmentation on the first word according to its context to obtain a word segmentation result;
a query unit, configured to query the dictionary stored in the storage unit according to the word segmentation result to obtain the phonetic notation of the first word.
Preferably, the above file processing device further comprises:
a storage unit, configured to store at least two predetermined character libraries, wherein the words contained in the predetermined character libraries are not all identical;
a receiving unit, configured to receive matching character library setting information;
a setting unit, configured to set, according to the matching character library setting information, one of the predetermined character libraries stored in the storage unit as the matching character library.
As can be seen from the above, the file processing method and file processing device provided by the embodiments of the invention automatically annotate the words in a file that satisfy a predetermined condition. The user no longer has to interrupt reading to look these words up, so reading continuity is preserved; the user also gains an opportunity to learn while reading. Both improve the user's reading experience.
Description of drawings
Fig. 1 is a schematic flowchart of the file processing method according to embodiment one of the invention;
Fig. 2 is a schematic flowchart of the file processing method according to embodiment two of the invention;
Fig. 3 is a schematic flowchart of the file processing method according to embodiment five of the invention;
Fig. 4 is a schematic structural diagram of the file processing device according to an embodiment of the invention.
Embodiments
In the embodiments of the invention, the words obtained by parsing a file are matched against a preset matching character library to determine which words need automatic annotation and what their annotation content is; the annotation is then added automatically at display time, improving the user's reading experience. The invention is further described below through specific embodiments with reference to the accompanying drawings.
< embodiment one >
As shown in Fig. 1, the file processing method of this embodiment can be applied in various electronic devices such as computers, PDAs, mobile phones, MP4 players and e-paper books, and comprises the following steps:
Step 11: obtain a file.
Here, obtaining the file may mean that the electronic device reads a locally stored file, downloads the file from a network or another device, or reads the file online over a network. The file in this embodiment is not limited to a specific file format; any file from which words can be obtained after parsing qualifies. Such files fall into the following three categories:
(1) Files containing only text content, such as Word document files and WPS document files.
(2) Files containing only non-text content, such as PDF files and picture files.
(3) Files containing both text content and non-text content, such as video files and streaming media files that include subtitle information.
The words in this embodiment may be in any written language, for example Chinese characters, English words or French words.
Step 12: parse the file to obtain a first word contained in the file.
Here, the file is parsed according to its file format to obtain the words it contains. The three categories of files above are handled as follows:
(1) Files containing only text content: after the file is read, the text content it contains is obtained, which yields the words contained in the file, for example the words contained in a Word file.
(2) Files containing only non-text content: after the file is read, character recognition is performed on the file to convert its non-text content into text content, which yields the words contained in the file. For example, character recognition is performed on the image in a picture to obtain the words the image represents.
(3) Files containing both text content and non-text content: after the file is read, the non-text content is ignored and the text content is extracted, which yields the words contained in the text content. For example, for a video file, the video images are ignored and the words in the subtitle content are extracted; for an e-book containing images, the images are ignored and the words in the text content are extracted. If the images also contain content that needs to be recognized, they can be further processed in the manner of category (2) above.
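The three parsing paths can be sketched as a small dispatcher. Everything named here is an assumption made for illustration: the category labels "text", "non_text" and "mixed", the payload shapes, and the caller-supplied `recognize` function standing in for whatever character-recognition engine a device would use. The word pattern covers only basic CJK ideographs and ASCII-letter words.

```python
import re
from typing import Callable, List, Optional

# One CJK ideograph, or a run of ASCII letters (an English word).
WORD_PATTERN = re.compile(r"[\u4e00-\u9fff]|[A-Za-z]+")


def extract_words(text: str) -> List[str]:
    """Split plain text into Chinese characters and English words."""
    return WORD_PATTERN.findall(text)


def parse_file(kind: str, payload,
               recognize: Optional[Callable[[bytes], str]] = None) -> List[str]:
    """Dispatch on the three file categories.

    kind: "text", "non_text" or "mixed" (hypothetical labels).
    payload: str for "text"; bytes for "non_text"; dict with a "text" key for "mixed".
    recognize: caller-supplied character recognition (assumed, not specified in the source).
    """
    if kind == "text":
        return extract_words(payload)
    if kind == "non_text":
        if recognize is None:
            raise ValueError("a recognizer is required for non-text files")
        return extract_words(recognize(payload))
    if kind == "mixed":
        # Ignore the non-text part (e.g. video frames); keep the text part (e.g. subtitles).
        return extract_words(payload["text"])
    raise ValueError(f"unknown kind: {kind}")
```

Injecting the recognizer as a parameter keeps the sketch independent of any particular recognition library.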
The following description takes a first word obtained by parsing as an example.
Step 13: match the first word against the preset matching character library. There may be one, two or more matching character libraries.
Step 14: when the first word satisfies the predetermined condition, obtain the annotation content corresponding to the first word.
Here, step 13 matches the first word against the preset matching character library and obtains a matching result for the first word; in step 14, the annotation content corresponding to the first word is obtained when the matching result satisfies the predetermined condition. In one preferred embodiment, the predetermined condition is that the first word does not belong to the matching character library, in which case first words that are not in the matching character library are annotated. In another preferred embodiment, the predetermined condition is that the first word belongs to the matching character library, in which case first words that are in the matching character library are annotated.
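The two alternative predetermined conditions can be expressed as a single predicate. This is a sketch under assumptions: the `Condition` enum and the function `needs_annotation` are hypothetical names, and the library is modeled as a plain set.

```python
from enum import Enum
from typing import Set


class Condition(Enum):
    NOT_IN_LIBRARY = "not_in"  # e.g. the library lists common words
    IN_LIBRARY = "in"          # e.g. the library lists hard or error-prone words


def needs_annotation(word: str, library: Set[str], condition: Condition) -> bool:
    """Steps 13 and 14: match the word, then test the predetermined condition."""
    in_library = word in library
    if condition is Condition.IN_LIBRARY:
        return in_library
    return not in_library
```

The same matching step serves both embodiments; only the interpretation of the matching result changes.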
In this embodiment, the annotation content can be stored in advance in a database, which may be kept in a local storage unit of the electronic device or in a storage unit on a network to which the electronic device is connected. In step 14 above, when the first word satisfies the predetermined condition, the database is searched with the first word as the index to determine the annotation content corresponding to the first word. The annotation content comprises at least one of the following four kinds:
A) A phonetic notation marking the pronunciation and tone of the first word. For a Chinese character this can be its pinyin and tone; for an English word it can be its phonetic transcription with the stress mark on the stressed syllable.
B) Definition information explaining the meaning of the first word. The definition in a standard dictionary of the respective language can be used; for a Chinese character, for example, the definition in the "Xinhua Dictionary" or an "Ancient Chinese Dictionary" can be used.
C) A playback control menu for controlling playback of an audio file of the pronunciation of the first word. Through this menu the audio file can be played to demonstrate the actual pronunciation of the first word by voice.
D) Translation content that translates the first word into a language other than the language to which the first word belongs. For a first word that is a Chinese character, this can be its translation into English, French or another language; for a first word that is an English word, this can be its translation into Chinese.
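A record with four optional fields is one plausible way to model the four kinds of annotation content and the word-indexed lookup. The `Annotation` class, its field names, the in-memory `ANNOTATION_DB` and its single sample entry are hypothetical; the source only says that a database is searched with the first word as the index.

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass
class Annotation:
    phonetic: Optional[str] = None     # kind A: pronunciation and tone
    definition: Optional[str] = None   # kind B: dictionary definition
    audio_path: Optional[str] = None   # kind C: audio file behind the playback menu
    translation: Optional[str] = None  # kind D: translation into another language

    def is_valid(self) -> bool:
        """At least one of the four kinds must be present."""
        fields = (self.phonetic, self.definition, self.audio_path, self.translation)
        return any(value is not None for value in fields)


# Hypothetical in-memory stand-in for the annotation database.
ANNOTATION_DB: Dict[str, Annotation] = {
    "饕": Annotation(phonetic="tāo", translation="gluttonous"),
}


def lookup_annotation(word: str) -> Optional[Annotation]:
    """Search the database with the word as the index."""
    return ANNOTATION_DB.get(word)
```

A networked database could replace the dictionary without changing the lookup interface.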
Step 15: display the first word and the annotation content.
Here, through steps 13 and 14 above, the first word is matched against the matching character library and it is judged whether the matching result satisfies the predetermined condition. If it does, annotation content needs to be added for the first word, and the annotation content corresponding to the first word is determined; when the file is displayed in step 15, the first word and its annotation content are displayed together.
If the matching result obtained in step 13 does not satisfy the predetermined condition, no annotation content needs to be added for the first word, and the first word is simply displayed.
Through the above steps, this embodiment automatically annotates, when the file is displayed, the first words in the file content that satisfy the predetermined condition. The user obtains the relevant annotation information for a first word without having to look it up while reading. This supplies needed knowledge during reading, improves the user's understanding of the file content, simplifies the user's reading operations and improves the reading experience.
In a preferred embodiment, the words contained in the content to be displayed are processed in real time while the file is being displayed. In this case, step 12 above comprises:
Step 121: parse the file to obtain first content that is about to be displayed. For a document file, for example, the content to be displayed may be one page of the document; for a streaming media file, it may be one frame of data.
Step 122: extract the first word contained in the first content. Here, the first word is a word contained in the first content.
If the first word satisfies the predetermined condition, the annotation content corresponding to the first word is obtained, and in step 15 the first content containing the first word is displayed together with the annotation content corresponding to the first word.
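The real-time variant can be sketched as a function applied to one page (or frame) of words at a time. The function `annotate_page`, the "not in the common library" condition and the phonetic dictionary are assumptions for illustration; an empty string marks a word displayed without annotation.

```python
from typing import Dict, List, Set, Tuple


def annotate_page(page_words: List[str], common: Set[str],
                  phonetics: Dict[str, str]) -> List[Tuple[str, str]]:
    """Pair each word of the content to be displayed with its annotation.

    A word is annotated when it is outside the common library (assumed
    condition) and an annotation is available; otherwise it gets "".
    """
    result = []
    for word in page_words:
        if word not in common and word in phonetics:
            result.append((word, phonetics[word]))
        else:
            result.append((word, ""))
    return result
```

Because only the current page is processed, the cost per display is bounded by the page size rather than the file size.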
A preferred embodiment of step 15 above is described below. It comprises:
Step 151: obtain the original layout of the file.
Step 152: determine the display position of the annotation content relative to the first word.
In step 152, the display position of the annotation content can be determined according to reading habits. For example, when the annotation content is the pinyin of a Chinese character, the display position is normally directly above the character; when the annotation content is the phonetic transcription of an English word, the display position normally follows the word immediately on the same line (if it does not fit on the same line, it can continue on the next line).
Step 153: judge whether there is space at the display position in the original layout to accommodate the annotation content.
Here, there may be no space to accommodate the annotation content because the line spacing of the text is too small, so that annotation content displayed between lines would cover the original text; or because the word spacing is too small, so that annotation content displayed between words would cover the original text; and so on.
Step 154: when there is no space to accommodate the annotation content, re-lay out the file to obtain a new layout in which there is space at the display position to accommodate the annotation content, display the first word according to the new layout, and display the annotation content at the display position.
In step 154, when there is no space to accommodate the annotation content, the new layout can, as needed, adjust the line spacing (for example increase it), add a new line for displaying the annotation content, or increase the spacing between the first word and the word that follows it, so that the annotation content has enough room.
As an alternative to step 154, the layout can be left unchanged. Instead, the annotation content is first made transparent, with its transparency raised to a predetermined value, and is then superimposed on the position of the first word. The annotation content is thus visible without obscuring the first word.
Step 155: when there is space to accommodate the annotation content, display the first word according to the original layout and display the annotation content at the display position.
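The space check and re-layout of steps 153 to 155 can be sketched for the simplest case, an annotation placed above the word, where only line spacing matters. The `Layout` record, its fields and the function `layout_for_annotation` are hypothetical simplifications; real layouts carry far more state, and word spacing or added lines are not modeled here.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class Layout:
    font_size: float     # height of the body text, in points
    line_spacing: float  # blank space between two text lines, in points


def layout_for_annotation(original: Layout, annotation_height: float) -> Layout:
    """Return the original layout if the annotation fits between lines,
    otherwise a new layout with just enough line spacing."""
    if original.line_spacing >= annotation_height:
        return original  # step 155: enough space, keep the original layout
    # Step 154: re-lay out by increasing the line spacing.
    return replace(original, line_spacing=annotation_height)
```

Returning the original object unchanged when space suffices makes it easy for a caller to tell whether a re-layout occurred.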
In a preferred embodiment, different display effects can also be used in step 15 above to display the first word and the annotation content: the first word is displayed according to a first display scheme having a first display effect, and the annotation content is displayed according to a second display scheme having a second display effect, wherein the first display effect differs from the second display effect.
Here, the user can set different display schemes for the first word and the annotation content in advance according to personal preference, or the electronic device can provide preset default display schemes for the first word and the annotation content. A display scheme includes parameters such as font type, size, color, transparency, whether the display is static, and whether it is dynamic (for example blinking or gradual-change display). Before the display in step 15, the display schemes corresponding to the first word and to the annotation content are determined; each is then displayed according to its own display scheme, achieving different display effects.
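A display scheme can be modeled as a small immutable record, with user preferences overriding device defaults. The `DisplayScheme` fields, the default values and the function `resolve_schemes` are hypothetical; the source lists the parameter kinds but no concrete values.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class DisplayScheme:
    font: str = "serif"
    size: float = 12.0
    color: str = "#000000"
    opacity: float = 1.0    # 1.0 is fully opaque
    blinking: bool = False  # dynamic display, e.g. flicker

# Assumed device defaults for the word and for its annotation.
DEFAULT_WORD_SCHEME = DisplayScheme()
DEFAULT_ANNOTATION_SCHEME = DisplayScheme(size=7.0, color="#aa0000")


def resolve_schemes(
    user_word: Optional[DisplayScheme] = None,
    user_annotation: Optional[DisplayScheme] = None,
) -> Tuple[DisplayScheme, DisplayScheme]:
    """Prefer user-set schemes, fall back to defaults, and require that they differ."""
    word = user_word if user_word is not None else DEFAULT_WORD_SCHEME
    annotation = user_annotation if user_annotation is not None else DEFAULT_ANNOTATION_SCHEME
    if word == annotation:
        raise ValueError("the two display effects must differ")
    return word, annotation
```

The equality check enforces the requirement that the first and second display effects are different.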
In another preferred implementation, the above display schemes can be combined with steps 151 to 155. In step 154, when the first word is displayed according to the new layout and the annotation content is displayed at the display position, the first word can further be displayed according to the first display scheme and the annotation content according to the second display scheme. Likewise, in step 155, when the first word is displayed according to the original layout and the annotation content is displayed at the display position, the first word can further be displayed according to the first display scheme and the annotation content according to the second display scheme.
Based on the above file processing method, this embodiment also provides a file processing device. As shown in Fig. 4, the file processing device 80 comprises:
a first obtaining unit, configured to obtain a file;
a parsing unit, configured to parse the file to obtain a first word contained in the file;
a matching unit, configured to match the first word against a preset matching character library;
an annotation unit, configured to obtain annotation content corresponding to the first word when the first word satisfies a predetermined condition;
a display unit, configured to display the first word and the annotation content.
In a preferred implementation, the display unit comprises:
an effect determining unit, configured to determine a first display scheme for the first word and a second display scheme for the annotation content, wherein the first display effect of the first display scheme differs from the second display effect of the second display scheme;
a display processing unit, configured to display the first word according to the first display scheme and to display the annotation content according to the second display scheme.
In a preferred implementation, the file processing device further comprises:
a second obtaining unit, configured to obtain the original layout of the file;
a position determining unit, configured to determine the display position of the annotation content relative to the first word;
a judging unit, configured to judge whether there is space at the display position in the original layout to accommodate the annotation content;
a layout unit, configured to re-lay out the file, when there is no space to accommodate the annotation content, to obtain a new layout in which there is space at the display position to accommodate the annotation content;
the display unit is further configured to display the first word according to the new layout obtained by the layout unit and to display the annotation content at the display position.
In a preferred implementation, the annotation unit is further configured to obtain the annotation content corresponding to the first word when the first word does not belong to the matching character library, or to obtain the annotation content corresponding to the first word when the first word belongs to the matching character library.
In a preferred implementation, the file processing device further comprises:
a storage unit, configured to store the annotation content, wherein the annotation content comprises at least one of: a phonetic notation marking the pronunciation and tone of the first word; definition information explaining the meaning of the first word; a playback control menu for controlling playback of an audio file of the pronunciation of the first word; and translation content that translates the first word into a language other than the language to which the first word belongs.
Figure 4 further shows the display of Chinese characters in a file by the file processing device of this embodiment: the left side shows the display in the file's original layout, and the right side shows the display produced by this embodiment, in which the pinyin of the uncommon word "Ye ugly" has been added: "y? ày [Figure BSA00000215244600101]".
< embodiment two >
As a preferred embodiment, present embodiment can be in advance carries out pre-service to the full content of said file, obtains all words that said file comprises, and then determines whether to need to show the first kind word and the corresponding notes content of notes content; Then, when showing said file, confirm again will content displayed in included said first kind word, thereby when showing its pairing notes content.
As shown in Figure 2, the described document handling method of present embodiment can be used in the various electronic equipments, specifically may further comprise the steps:
Step 21 obtains file.
Step 22 is resolved said file, obtains the full content of said file, extracts all words that comprise in the said full content.
Step 23 is mated said all words one by one with the coupling character library that is provided with in advance, therefrom select the first kind word that satisfies predetermined condition.
Here predetermined condition can be identical with embodiment one with the set-up mode of coupling character library.
Step 24 obtains the corresponding notes content of said first kind word.
Step 25 is confirmed the first content that will show in the said file, from said first kind word, selects the second type of word that belongs to said first content;
Step 26 shows the said first content that comprises said second type of word, shows the corresponding notes content of said second type of word simultaneously.
This embodiment is described taking all the words contained in the file as an example. When the file is parsed, the first-type words that satisfy the predetermined condition are selected from all the words in the file by matching against the matching word library, and the corresponding annotation content is obtained for them. Then, when particular content is displayed, the annotation content of the first-type words in that content is displayed at the same time, which likewise achieves automatic annotation of particular words in the file.
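The two phases of this embodiment (preprocess the whole file once, then annotate only what is on screen) can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the whitespace tokenizer, the `COMMON_WORDS` set, the `ANNOTATIONS` table, and both function names are hypothetical, and the predetermined condition is assumed to be "the word is not in the matching word library".

```python
# Hypothetical matching word library and annotation store.
COMMON_WORDS = {"the", "cat", "sat", "on", "a", "mat"}
ANNOTATIONS = {
    "ottoman": "a low upholstered seat",
    "quixotic": "unrealistically idealistic",
}

def preprocess(text):
    """Steps 22-24: extract all words, keep those outside the library
    (first-type words), and fetch their annotation content."""
    words = text.lower().split()
    return {w: ANNOTATIONS[w] for w in words
            if w not in COMMON_WORDS and w in ANNOTATIONS}

def render(displayed_text, annotated):
    """Steps 25-26: among the words being displayed, annotate those that
    were selected during preprocessing (second-type words)."""
    out = []
    for word in displayed_text.split():
        note = annotated.get(word.lower())
        out.append(f"{word} [{note}]" if note else word)
    return " ".join(out)

document = "the cat sat on a quixotic ottoman"
notes = preprocess(document)                  # done once per file
page = render("a quixotic ottoman", notes)    # done per displayed page
```

Doing the matching once per file means that displaying a page only requires a dictionary lookup per word, at the cost of holding the annotation table for the whole file in memory.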
Below, building on embodiment one, the present invention is further described through additional embodiments.
< embodiment three >
This embodiment is further described taking the case where the annotation content includes a phonetic notation as an example.
When the annotation content includes a phonetic notation, the matching word library in this embodiment comprises a preset common-word library, and the predetermined condition is that the first word does not belong to the common-word library. The common-word library contains predefined common words. For example, for Chinese characters, the characters in the level-1 character set of the Chinese national standard GB2312 may be used as common words; for English words, the words required for the College English Test Band 6 (CET-6) may be used as common words; and so on.
The file processing method of this embodiment can be applied in various electronic devices such as computers, PDAs, mobile phones, MP4 players, and e-book readers, and comprises the following steps:
Step 31: obtain a file.
Step 32: parse the file to obtain a first word contained in the file.
Step 33: match the first word against a preset matching word library; here, the matching word library comprises a predefined common-word library.
Step 34: when the first word does not belong to the common-word library, obtain the annotation content corresponding to the first word; the annotation content includes a phonetic notation and may also include other content such as definition information.
Step 35: display the first word and the annotation content.
Through the above steps, this embodiment automatically annotates uncommon words, so that the user can learn uncommon words while reading. This makes reading and learning more efficient and improves the user's reading experience.
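As an illustration of the GB2312 example above, the sketch below treats a Chinese character as common when it lies in the GB2312 level-1 area. It relies on the fact that level-1 characters occupy rows 16 to 55 of GB2312, which correspond to lead bytes 0xB0 to 0xD7 in the `gb2312` encoding. The `PINYIN` table and the function names are hypothetical, and annotating only characters found in that table is an assumption made to keep the example small.

```python
def in_gb2312_level1(ch):
    """Return True if ch is a GB2312 level-1 Chinese character."""
    try:
        raw = ch.encode("gb2312")
    except UnicodeEncodeError:
        return False                      # not in GB2312 at all
    # Level-1 characters sit in rows 16-55, i.e. lead bytes 0xB0-0xD7.
    return len(raw) == 2 and 0xB0 <= raw[0] <= 0xD7

# Hypothetical annotation store: character -> pinyin.
PINYIN = {"亍": "chù", "龘": "dá"}

def annotate(text):
    """Steps 32-35: append pinyin to each uncommon Chinese character."""
    out = []
    for ch in text:
        is_hanzi = "\u4e00" <= ch <= "\u9fff"
        if is_hanzi and not in_gb2312_level1(ch) and ch in PINYIN:
            out.append(f"{ch}({PINYIN[ch]})")
        else:
            out.append(ch)
    return "".join(out)

print(annotate("的亍"))
```

Here "的" is a level-1 character and is left alone, while "亍" is a level-2 character and receives its pinyin.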
< embodiment four >
This embodiment is further described taking the case where the annotation content includes a phonetic notation as an example.
In the prior art, when a user encounters an uncommon word while reading, the user can actively look it up in a dictionary to obtain its pronunciation, definition, and other information. For some error-prone words, however, a user who has taken a wrong pronunciation to be the correct one will usually not go back and check the pronunciation during reading, so the mistake is never corrected and the correct pronunciation is never learned. This embodiment sets up an error-prone word library and, during reading, automatically marks the pronunciation of the words in that library, giving the user a chance to learn the correct pronunciation and improving the user's reading experience.
When the annotation content includes a phonetic notation, the matching word library in this embodiment comprises a preset error-prone word library, and the predetermined condition is that the first word belongs to the error-prone word library. The error-prone word library contains predefined words that are easily misread. For example, polyphonic Chinese characters such as "行" are pronounced differently in "银行" (bank) and "行人" (pedestrian). As another example, the English place name "San Jose" is derived from Spanish and is often mispronounced. To determine the pronunciation of such an easily misread word from its context, word segmentation must be performed, and a dictionary that stores pronunciation information is then looked up according to the segmentation result to determine the accurate pronunciation.
The file processing method of this embodiment can be applied in various electronic devices such as computers, PDAs, mobile phones, MP4 players, and e-book readers, and comprises the following steps:
Step 41: obtain a file.
Step 42: parse the file to obtain a first word contained in the file.
Step 43: match the first word against a preset matching word library; here, the matching word library comprises a predefined error-prone word library.
Step 44: when the first word belongs to the error-prone word library, obtain the annotation content corresponding to the first word; the annotation content includes a phonetic notation and may also include other content such as definition information.
Step 45: display the first word and the annotation content.
Here, as a preferred implementation, when the first word is a Chinese character, step 42 further comprises, after parsing the file to obtain the first word: performing word segmentation on the first word according to its context to obtain a segmentation result. Step 44 then further comprises: querying a predefined dictionary according to the segmentation result to obtain the phonetic notation of the first word.
Through the above steps, this embodiment automatically annotates easily misread words, so that the user can learn the correct pronunciation of error-prone words while reading. This makes reading and learning more efficient and improves the user's reading experience.
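The segmentation-then-lookup idea can be sketched as below. The text does not say which segmentation algorithm is used; this example substitutes a toy forward-maximum-matching segmenter, and the `WORD_PINYIN` dictionary, the `ERROR_PRONE` set, and the function names are hypothetical.

```python
# Hypothetical dictionary: segmented word -> pinyin of each character.
WORD_PINYIN = {
    "银行": ["yín", "háng"],
    "行人": ["xíng", "rén"],
}
# Hypothetical error-prone library (polyphonic characters).
ERROR_PRONE = {"行"}

def segment(text, max_len=2):
    """Forward maximum matching: take the longest dictionary word at each
    position, falling back to a single character."""
    tokens, i = [], 0
    while i < len(text):
        for n in range(max_len, 0, -1):
            piece = text[i:i + n]
            if n == 1 or piece in WORD_PINYIN:
                tokens.append(piece)
                i += len(piece)
                break
    return tokens

def annotate(text):
    """Annotate error-prone characters with the reading that the
    surrounding word (the segmentation result) selects."""
    out = []
    for token in segment(text):
        readings = WORD_PINYIN.get(token)
        for k, ch in enumerate(token):
            if ch in ERROR_PRONE and readings:
                out.append(f"{ch}({readings[k]})")
            else:
                out.append(ch)
    return "".join(out)

print(annotate("去银行的行人"))
```

The same character "行" receives "háng" inside "银行" and "xíng" inside "行人", which is why the lookup key is the segmented word rather than the single character.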
Similarly, this embodiment also provides a file processing device, which comprises:
a first obtaining unit, configured to obtain a file;
a parsing unit, configured to parse the file to obtain a first word contained in the file;
a matching unit, configured to match the first word against a preset matching word library;
an annotation unit, configured to obtain the annotation content corresponding to the first word when the first word satisfies a predetermined condition;
a display unit, configured to display the first word and the annotation content;
a storage unit, configured to store the annotation content, wherein the annotation content comprises at least one of: a phonetic notation marking the pronunciation and intonation of the first word; definition information explaining the meaning of the first word; a playback control menu for controlling playback of an audio file of the pronunciation of the first word; and translation content in which the first word is translated into a language other than the language to which the first word belongs.
As a preferred implementation, the storage unit is also configured to store a predefined dictionary.
When the annotation content includes the phonetic notation, the file processing device further comprises:
a word segmentation unit, configured to perform word segmentation on the first word according to its context after the parsing unit obtains the first word, to obtain a segmentation result;
a query unit, configured to query the dictionary stored in the storage unit according to the segmentation result, to obtain the phonetic notation of the first word.
< embodiment five >
In the file processing method of this embodiment, the matching word library comprises a common-word library and an error-prone word library, and the predetermined condition is that the first word does not belong to the common-word library or the first word belongs to the error-prone word library. In this case, as shown in Figure 3, the file processing method of this embodiment comprises the following steps:
Step 51: obtain a file.
Step 52: parse the file to obtain a first word contained in the file.
Step 53: match the first word against the preset common-word library. If the first word belongs to the common-word library, go to step 54; if it does not, go to step 55.
Step 54: match the first word against the preset error-prone word library. If the first word belongs to the error-prone word library, go to step 55; if it does not, go to step 57.
Step 55: obtain the annotation content corresponding to the first word, then go to step 56.
Step 56: display the first word and the annotation content, the annotation content including a phonetic notation.
Step 57: display the first word.
The above steps first match the first word against the common-word library; if the first word belongs to the common-word library, it is further matched against the error-prone word library. This finally determines whether the first word is an error-prone word or an uncommon word. If it is, the annotation content of the first word is determined, and both the first word and its annotation content are displayed.
Of course, this embodiment may also change the order of matching: the first word is first matched against the error-prone word library, and if it does not belong to that library it is further matched against the common-word library, finally determining whether the first word is an error-prone word or an uncommon word.
Through the above steps, when the first word is an error-prone word or an uncommon word, this embodiment adds the corresponding annotation content when displaying the first word, improving the user's reading experience.
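The branching in steps 53 to 57 reduces to a short decision function. The sketch below is illustrative only: the library contents, the `NOTES` table, and the function names are hypothetical, and skipping annotation when no note exists for a word is an added assumption.

```python
def needs_annotation(word, common, error_prone):
    """Steps 53-54: check the common-word library first, then the
    error-prone word library."""
    if word not in common:
        return True                 # uncommon word: go to step 55
    return word in error_prone      # common word: annotate only if error-prone

def needs_annotation_reversed(word, common, error_prone):
    """The same decision with the matching order swapped."""
    if word in error_prone:
        return True
    return word not in common

def display(word, common, error_prone, notes):
    """Steps 55-57: show the word with its note, or the word alone."""
    if needs_annotation(word, common, error_prone) and word in notes:
        return f"{word} [{notes[word]}]"
    return word

# Hypothetical libraries and annotation content.
COMMON = {"cat", "lead"}
ERROR_PRONE = {"lead"}
NOTES = {"lead": "/led/ (metal) or /liːd/ (to guide)", "axolotl": "a salamander"}

shown = [display(w, COMMON, ERROR_PRONE, NOTES) for w in ["cat", "lead", "axolotl"]]
```

Both orderings compute the same condition (not common, or error-prone), so the choice of order only affects which library is consulted first.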
< embodiment six >
When reading a file, different users may have different levels of knowledge. For example, a primary school pupil usually knows fewer Chinese characters, and fewer English words, than a university student. Multiple word libraries can therefore be set up in advance. For English words, libraries of different levels can be provided, such as a College English Test Band 4 (CET-4) word library and a CET-6 word library, each containing English words of the corresponding level. For Chinese characters, a library can be provided for each school grade: for example, a first-grade library containing the characters a first-grade pupil should master, a second-grade library containing the characters a second-grade pupil should master, and so on.
For this purpose, this embodiment sets up at least two predetermined word libraries in advance, and the words contained in the predetermined word libraries are not all the same. The file processing method of this embodiment can be applied in various electronic devices such as computers, PDAs, mobile phones, MP4 players, and e-book readers, and comprises the following steps:
Step 61: receive matching-library setting information entered by the user.
Step 62: according to the matching-library setting information, set one of the at least two predetermined word libraries as the matching word library.
Step 63: obtain a file.
Step 64: parse the file to obtain a first word contained in the file.
Step 65: match the first word against the matching word library.
Step 66: when the matching result for the first word satisfies a predetermined condition, obtain the annotation content corresponding to the first word.
Step 67: display the first word and the annotation content.
Here, if the matching result obtained for the first word in step 65 does not satisfy the predetermined condition, the annotation content of the first word need not be displayed.
Through the above steps, this embodiment automatically annotates uncommon words, so that the user can learn uncommon words while reading. This makes reading and learning more efficient and improves the user's reading experience.
While reading the file, the user can learn the first word whose annotation content is shown. After reading the file a certain number of times, the user may have mastered that annotation content, and the need to display it again is greatly reduced. Therefore, after the matching word library is set, this embodiment may further count the number of times the file has been displayed. Before the first word and the annotation content are displayed in step 67, it is judged whether the number of times the file has been displayed has reached a preset count corresponding to the matching word library. If it has, the first word is displayed without the annotation content; if it has not, the first word and the annotation content are displayed together.
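A small sketch of this embodiment follows: the user's setting selects one predetermined library, and annotations are suppressed once the file has been displayed a preset number of times. All names and values here (`LIBRARIES`, `DISPLAY_LIMIT`, `NOTES`, the `Reader` class) are hypothetical, and the predetermined condition is assumed to be "the word is not in the selected library".

```python
# Hypothetical predetermined libraries and per-library display limits.
LIBRARIES = {
    "CET-4": {"a", "book", "apple"},
    "CET-6": {"a", "book", "apple", "ubiquitous"},
}
DISPLAY_LIMIT = {"CET-4": 2, "CET-6": 2}
NOTES = {"ubiquitous": "found everywhere"}

class Reader:
    def __init__(self, library_name):
        # Steps 61-62: the user's setting picks the matching word library.
        self.library = LIBRARIES[library_name]
        self.limit = DISPLAY_LIMIT[library_name]
        self.times_shown = 0

    def show(self, text):
        """Steps 64-67, plus the display-count check before step 67."""
        with_notes = self.times_shown < self.limit
        self.times_shown += 1
        out = []
        for word in text.split():
            note = NOTES.get(word)
            if with_notes and note and word not in self.library:
                out.append(f"{word} [{note}]")
            else:
                out.append(word)
        return " ".join(out)

reader = Reader("CET-4")
first = reader.show("a ubiquitous book")
second = reader.show("a ubiquitous book")
third = reader.show("a ubiquitous book")   # limit reached: no annotation
```

A reader configured with the larger "CET-6" library would see no annotation at all for this sentence, since every word is already in that library.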
Similarly, this embodiment also provides a file processing device, which comprises:
a first obtaining unit, configured to obtain a file;
a parsing unit, configured to parse the file to obtain a first word contained in the file;
a matching unit, configured to match the first word against a preset matching word library;
an annotation unit, configured to obtain the annotation content corresponding to the first word when the first word satisfies a predetermined condition;
a display unit, configured to display the first word and the annotation content;
a storage unit, configured to store at least two predetermined word libraries, the words contained in the predetermined word libraries being not all the same;
a receiving unit, configured to receive matching-library setting information;
a setting unit, configured to set, according to the matching-library setting information, one of the predetermined word libraries stored in the storage unit as the matching word library.
< embodiment seven >
The cognitive level of a single user may also change over time: as the user reads the file more times, the user learns more words and the user's cognitive level rises. For this reason, this embodiment sets the current matching word library according to the counted number of times the user has read the file, so that the matching word library suits the user's current cognitive level, as described below.
This embodiment sets up at least two predetermined word libraries in advance, and the words contained in the predetermined word libraries are not all the same. This embodiment also sets a count threshold for each predetermined word library in advance, the count thresholds of the predetermined word libraries being different from one another. The file processing method of this embodiment can be applied in various electronic devices such as computers, PDAs, mobile phones, MP4 players, and e-book readers, and comprises the following steps:
Step 71: count the number of times the file has been displayed (the display count).
Step 72: according to the display count, select a first predetermined word library from the at least two predetermined word libraries, thereby obtaining matching-library setting information that includes information on the first predetermined word library. The first predetermined word library is the one with the smallest count threshold among the predetermined word libraries whose count thresholds are greater than the display count.
Step 73: according to the matching-library setting information, set the first predetermined word library as the current matching word library.
Step 74: obtain a file.
Step 75: parse the file to obtain a first word contained in the file.
Step 76: match the first word against the matching word library.
Step 77: when the matching result for the first word satisfies a predetermined condition, obtain the annotation content corresponding to the first word.
Step 78: display the first word and the annotation content.
Step 73 of this embodiment differs from step 62 of embodiment six. In step 73, the electronic device automatically generates the matching-library setting information according to a predetermined policy and then sets the corresponding predetermined word library as the matching word library according to that information. In steps 61 and 62 of embodiment six, by contrast, the matching-library setting information is entered by the user and the matching word library is set according to that input.
The above steps of this embodiment automatically set the current matching word library according to the number of times the file has been read (displayed), so that the matching word library suits the user's current cognitive level. An example follows.
Suppose the predetermined word libraries are common-word libraries at three levels. The level-1 library contains fewer common words than the level-2 library, which in turn contains fewer than the level-3 library, and the count thresholds are set so that the level-1 threshold is lower than the level-2 threshold, which is lower than the level-3 threshold. The following table gives one possible example:
                          Level-1 common-word library   Level-2 common-word library   Level-3 common-word library
Number of common words    3600                          6000                          9200
Count threshold           3                             10                            30
Here, the count threshold means the following: once the display count reaches the count threshold of the current matching word library, a predetermined word library with a higher count threshold should be selected as the matching word library. For example, when the current matching word library is the level-1 common-word library and the file has been displayed 3 times, then from the level-2 and level-3 common-word libraries, whose thresholds are higher than 3, the level-2 library, which has the smaller threshold (10), is selected as the matching word library. If the file has been displayed 30 times or more, no common-word library has a threshold higher than the display count, so no matching word library is set. By then the file has been displayed many times and the user has had ample opportunity to learn the uncommon words in it, so the annotation content no longer needs to be displayed.
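The selection rule of step 72, applied to the thresholds in the example table, can be sketched as below. The library names and the `select_library` function are hypothetical; only the thresholds 3, 10, and 30 come from the table.

```python
# (library name, count threshold), using the thresholds from the example table.
PREDETERMINED = [("level-1", 3), ("level-2", 10), ("level-3", 30)]

def select_library(display_count):
    """Step 72: among the libraries whose threshold exceeds the display
    count, pick the one with the smallest threshold. Return None when no
    library qualifies, meaning that no annotation is shown."""
    candidates = [(threshold, name) for name, threshold in PREDETERMINED
                  if threshold > display_count]
    return min(candidates)[1] if candidates else None

for count in (0, 2, 3, 9, 10, 29, 30):
    print(count, select_library(count))
```

Because the libraries grow with their thresholds, a higher display count selects a larger common-word library, so fewer words count as uncommon and fewer annotations are shown.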
< embodiment eight >
Some words are pronounced differently in different countries. For example, some English words have an American pronunciation in the United States and a British pronunciation in Britain. Other words have different dialect pronunciations in different regions. In other words, the pronunciation of these words depends on geographic location. For this reason, this embodiment sets up a predetermined word library in advance, in which each word has at least two pronunciations: a first pronunciation corresponding to a first geographic location and a second pronunciation corresponding to a second geographic location, the first geographic location being different from the second. In addition, a phonetic notation database is set up, which stores the phonetic notations of the different pronunciations that the words in the predetermined word library have at different geographic locations.
The file processing method of this embodiment is applied in an electronic device and comprises the following steps:
Step 81: obtain a file.
Step 82: parse the file to obtain a first word contained in the file.
Step 83: match the first word against the predetermined word library.
Step 84: when the first word belongs to the predetermined word library, obtain the current geographic location of the electronic device.
Here, the current geographic location of the electronic device may be obtained by querying, with the device's IP address, a database that stores the correspondence between geographic locations and IP addresses. Alternatively, the Global Positioning System (GPS) may be used to locate the electronic device and obtain its current geographic location.
Step 85: according to the current geographic location of the electronic device, look up the phonetic notation database to determine a first phonetic notation, which represents the pronunciation of the first word at the current geographic location.
Step 86: display the first word and the first phonetic notation.
In this way, this embodiment can show the user how a word is pronounced at the user's current geographic location, so that the user can follow local custom ("when in Rome, do as the Romans do"), which helps the user communicate with local residents.
In summary, the file processing method and file processing device provided by the embodiments of the present invention can automatically annotate the words in a file that satisfy a predetermined condition. The user therefore does not have to interrupt reading to look these words up, which keeps reading continuous. The embodiments of the present invention also give the user an opportunity to learn more while reading. All of this improves the user's reading experience.
The above is only an embodiment of the present invention. It should be pointed out that those of ordinary skill in the art can make improvements and refinements without departing from the principle of the present invention, and these improvements and refinements shall also be regarded as falling within the protection scope of the present invention.
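The location-dependent lookup can be sketched as follows. The region is passed in as a parameter rather than derived from an IP address or GPS fix, because the text does not specify a concrete location service; the `PHONETIC_DB` contents, the region codes, and the function name are hypothetical.

```python
# Hypothetical phonetic notation database: word -> region code -> notation.
PHONETIC_DB = {
    "schedule": {"US": "/ˈskedʒuːl/", "GB": "/ˈʃedjuːl/"},
    "tomato": {"US": "/təˈmeɪtoʊ/", "GB": "/təˈmɑːtəʊ/"},
}

def annotate_for_region(word, region):
    """Look the word up in the predetermined library and, if it has a
    notation for the device's current region, display it with that
    notation; otherwise display the word alone."""
    by_region = PHONETIC_DB.get(word)
    if by_region is None:
        return word                      # word not in the predetermined library
    notation = by_region.get(region)
    return f"{word} {notation}" if notation else word

print(annotate_for_region("schedule", "US"))
print(annotate_for_region("schedule", "GB"))
```

Keeping the region lookup outside this function lets the same code serve either location source mentioned above (an IP-to-location database or GPS).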

Claims (15)

1. A file processing method, characterized by comprising:
obtaining a file;
parsing the file to obtain a first word contained in the file;
matching the first word against a preset matching word library;
when the first word satisfies a predetermined condition, obtaining annotation content corresponding to the first word;
displaying the first word and the annotation content.
2. The file processing method according to claim 1, characterized in that displaying the first word and the annotation content comprises: displaying the first word according to a first display scheme having a first display effect, and displaying the annotation content according to a second display scheme having a second display effect, wherein the first display effect is different from the second display effect.
3. The file processing method according to claim 1, characterized in that
displaying the first word and the annotation content comprises:
obtaining the original layout of the file;
determining the display position of the annotation content relative to the first word;
judging whether there is space at the display position in the original layout to hold the annotation content;
when there is no space to hold the annotation content, laying out the file again to obtain a new layout in which there is space at the display position to hold the annotation content, displaying the first word according to the new layout, and displaying the annotation content at the display position.
4. The file processing method according to claim 1, characterized in that
the predetermined condition is that the first word does not belong to the matching word library, or that the first word belongs to the matching word library.
5. The file processing method according to claim 1, characterized in that
the annotation content comprises at least one of: a phonetic notation marking the pronunciation and intonation of the first word; definition information explaining the meaning of the first word; a playback control menu for controlling playback of an audio file of the pronunciation of the first word; and translation content in which the first word is translated into a language other than the language to which the first word belongs.
6. The file processing method according to claim 5, characterized in that
when the annotation content includes the phonetic notation, the matching word library comprises a preset common-word library and a preset error-prone word library, and the predetermined condition is that the first word does not belong to the common-word library or the first word belongs to the error-prone word library, wherein the common-word library contains predefined common words and the error-prone word library contains predefined words that are easily misread.
7. The file processing method according to claim 5, characterized in that
when the annotation content includes the phonetic notation, the method further comprises, after parsing the file to obtain the first word contained in the file: performing word segmentation on the first word according to its context to obtain a segmentation result;
obtaining the annotation content corresponding to the first word comprises: querying a predefined dictionary according to the segmentation result to obtain the phonetic notation of the first word.
8. The file processing method according to claim 1, characterized in that
there are at least two predetermined word libraries, and the words contained in the predetermined word libraries are not all the same;
before obtaining the file, the method further comprises:
receiving matching-library setting information;
setting, according to the matching-library setting information, one of the at least two predetermined word libraries as the matching word library.
9. A file processing device, characterized by comprising:
a first obtaining unit, configured to obtain a file;
a parsing unit, configured to parse the file to obtain a first word contained in the file;
a matching unit, configured to match the first word against a preset matching word library;
an annotation unit, configured to obtain annotation content corresponding to the first word when the first word satisfies a predetermined condition;
a display unit, configured to display the first word and the annotation content.
10. The file processing device according to claim 9, characterized in that the display unit comprises:
an effect determination unit, configured to determine a first display scheme for the first word and a second display scheme for the annotation content, wherein a first display effect of the first display scheme is different from a second display effect of the second display scheme;
a display processing unit, configured to display the first word according to the first display scheme when displaying the first word, and to display the annotation content according to the second display scheme when displaying the annotation content.
11. The file processing device according to claim 9, characterized by further comprising:
a second obtaining unit, configured to obtain the original layout of the file;
a position determination unit, configured to determine the display position of the annotation content relative to the first word;
a judging unit, configured to judge whether there is space at the display position in the original layout to hold the annotation content;
a layout unit, configured to lay out the file again when there is no space to hold the annotation content, to obtain a new layout in which there is space at the display position to hold the annotation content;
the display unit being further configured to display the first word according to the new layout obtained by the layout unit, and to display the annotation content at the display position.
12. The file processing device according to claim 9, characterized in that
the annotation unit is further configured to obtain the annotation content corresponding to the first word when the first word does not belong to the matching word library, or to obtain the annotation content corresponding to the first word when the first word belongs to the matching word library.
13. The file processing device according to claim 8, characterized by further comprising:
a storage unit, configured to store the annotation content, wherein the annotation content comprises at least one of: a phonetic notation marking the pronunciation and intonation of the first word; definition information explaining the meaning of the first word; a playback control menu for controlling playback of an audio file of the pronunciation of the first word; and translation content in which the first word is translated into a language other than the language to which the first word belongs.
14. The file processing device as claimed in claim 13, characterized in that:
Said storage unit is further configured to store a predefined dictionary;
When said notes content comprises said phonetic notation symbol, said file processing device further comprises:
A word segmentation unit, configured to, after said parsing unit obtains said first word, perform word segmentation on said first word according to the context of said first word to obtain a word segmentation result;
A query unit, configured to query said dictionary stored in said storage unit according to said word segmentation result, to obtain the phonetic notation symbol of said first word.
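Segmentation matters for claim 14 because a polyphonic character takes its reading from the word it belongs to. The sketch below uses forward maximum matching as a stand-in segmentation technique (the patent does not name one) and a tiny hypothetical dictionary; `DICTIONARY`, `segment` and `phonetic_for` are illustrative names only.

```python
# Hypothetical predefined dictionary: word -> pinyin of each character.
DICTIONARY = {
    "银行": ["yín", "háng"],
    "行走": ["xíng", "zǒu"],
    "行": ["xíng"],
    "我": ["wǒ"],
    "去": ["qù"],
}
MAX_WORD_LEN = max(len(word) for word in DICTIONARY)


def segment(sentence):
    """Forward maximum matching: greedily take the longest dictionary word.

    Returns (start_index, word) pairs; unknown characters become
    single-character words.
    """
    words, i = [], 0
    while i < len(sentence):
        for size in range(min(MAX_WORD_LEN, len(sentence) - i), 0, -1):
            candidate = sentence[i:i + size]
            if size == 1 or candidate in DICTIONARY:
                words.append((i, candidate))
                i += size
                break
    return words


def phonetic_for(sentence, index):
    """Look up the reading of the character at `index` via its segmented word."""
    for start, word in segment(sentence):
        if start <= index < start + len(word):
            readings = DICTIONARY.get(word)
            return readings[index - start] if readings else None
    return None


bank = phonetic_for("我去银行", 3)   # 行 inside 银行
walk = phonetic_for("我行走", 1)     # 行 inside 行走
```

The same character 行 receives a different reading in each sentence because the lookup is keyed on the segmented word rather than on the character alone.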
15. The file processing device as claimed in claim 9, characterized in that it further comprises:
A storage unit, configured to store at least two predetermined character libraries, the words contained in each said predetermined character library being not entirely identical;
A receiving unit, configured to receive matching character library setting information;
A setting unit, configured to set, according to said matching character library setting information, one predetermined character library stored in said storage unit as said matching character library.
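Claim 15 describes choosing one of several stored character libraries as the matching character library. A minimal sketch follows, assuming the setting information is simply the name of the chosen library; the class `LibrarySettings` and the library names `grade1` and `grade3` are hypothetical.

```python
class LibrarySettings:
    """Holds the predetermined character libraries and the current selection."""

    def __init__(self, libraries):
        if len(libraries) < 2:
            raise ValueError("at least two predetermined character libraries are required")
        self._libraries = dict(libraries)
        self.matching = None  # no matching character library selected yet

    def apply(self, setting_info):
        """Set the library named by `setting_info` as the matching library."""
        if setting_info not in self._libraries:
            raise KeyError(f"unknown character library: {setting_info}")
        self.matching = self._libraries[setting_info]
        return self.matching


# Hypothetical libraries whose contents overlap but are not identical.
settings = LibrarySettings({
    "grade1": frozenset("一二三人"),
    "grade3": frozenset("一二三人银行"),
})
selected = settings.apply("grade3")
```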
CN201010243566.9A 2010-08-02 2010-08-02 File processing method and file processing device Active CN102346731B (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
CN201010243566.9A CN102346731B (en) 2010-08-02 2010-08-02 File processing method and file processing device
US13/813,720 US10210148B2 (en) 2010-08-02 2011-08-01 Method and apparatus for file processing
PCT/CN2011/077865 WO2012016505A1 (en) 2010-08-02 2011-08-01 File processing method and file processing device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201010243566.9A CN102346731B (en) 2010-08-02 2010-08-02 File processing method and file processing device

Publications (2)

Publication Number Publication Date
CN102346731A true CN102346731A (en) 2012-02-08
CN102346731B CN102346731B (en) 2014-09-03

Family

ID=45545419

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201010243566.9A Active CN102346731B (en) 2010-08-02 2010-08-02 File processing method and file processing device

Country Status (3)

Country Link
US (1) US10210148B2 (en)
CN (1) CN102346731B (en)
WO (1) WO2012016505A1 (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104933033A (en) * 2015-07-08 2015-09-23 邱行中 System for automatic pinyin annotation of Chinese characters and annotation method of system
CN105989099A (en) * 2015-02-13 2016-10-05 晨星半导体股份有限公司 Relevant information display method and electronic device capable of automatically displaying relevant information
CN107239441A (en) * 2017-04-26 2017-10-10 广东小天才科技有限公司 A kind of dictionary definition method and device
CN108804002A (en) * 2018-04-25 2018-11-13 广州视源电子科技股份有限公司 The text annotation method and apparatus of interactive intelligence equipment
CN110321535A (en) * 2018-03-30 2019-10-11 富士施乐实业发展(中国)有限公司 Children's book processing method and processing device
CN110874527A (en) * 2018-08-28 2020-03-10 游险峰 Cloud-based intelligent paraphrasing and phonetic notation system
CN111274352A (en) * 2020-01-14 2020-06-12 北大方正集团有限公司 Method and equipment for marking characteristic characters in tool book
CN116484052A (en) * 2023-06-26 2023-07-25 广州宏途数字科技有限公司 Educational resource sharing system based on big data

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP5753769B2 (en) * 2011-11-18 2015-07-22 株式会社日立製作所 Voice data retrieval system and program therefor
CN104346375B (en) * 2013-07-31 2017-10-13 北大方正集团有限公司 A kind of method and device for making middle character library
CN103941981B (en) * 2014-04-24 2017-09-19 江西迈思科技有限公司 A kind of method and device of information processing
WO2023121681A1 (en) * 2021-12-20 2023-06-29 Google Llc Automated text-to-speech pronunciation editing for long form text documents

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030214528A1 (en) * 2002-03-15 2003-11-20 Pitney Bowes Incorporated Method for managing the annotation of documents
CN101196874A (en) * 2007-12-28 2008-06-11 宇龙计算机通信科技(深圳)有限公司 Method and apparatus for machine aid reading
CN101408874A (en) * 2007-10-09 2009-04-15 深圳富泰宏精密工业有限公司 Apparatus and method for translating image and character
CN201259670Y (en) * 2008-07-22 2009-06-17 青岛海信移动通信技术股份有限公司 Text message processing apparatus and equipment
CN101765840A (en) * 2006-09-15 2010-06-30 埃克斯比布里奥公司 Capture and display of annotations in paper and electronic documents

Family Cites Families (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA2020748A1 (en) * 1989-08-22 1991-02-23 Thomas F. Look Method and apparatus for machine reading of retroreflective vehicle identification articles
DE69232493T2 (en) * 1991-10-21 2003-01-09 Canon Kk Method and device for character recognition
US5369704A (en) * 1993-03-24 1994-11-29 Engate Incorporated Down-line transcription system for manipulating real-time testimony
US6128632A (en) * 1997-03-06 2000-10-03 Apple Computer, Inc. Methods for applying rubi annotation characters over base text characters
US6262728B1 (en) * 1998-11-03 2001-07-17 Agilent Technologies, Inc. System and method for annotating a graphical user interface display in a computer-based system
US6551357B1 (en) * 1999-02-12 2003-04-22 International Business Machines Corporation Method, system, and program for storing and retrieving markings for display to an electronic media file
JP2000330902A (en) * 1999-05-25 2000-11-30 Sony Corp Device and method for information processing, and medium
JP2004505563A (en) 2000-07-27 2004-02-19 コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ Transcript trigger information for video enhancement
US20020086269A1 (en) * 2000-12-18 2002-07-04 Zeev Shpiro Spoken language teaching system based on language unit segmentation
US20040267798A1 (en) * 2003-06-20 2004-12-30 International Business Machines Corporation Federated annotation browser
US7418656B1 (en) * 2003-10-03 2008-08-26 Adobe Systems Incorporated Dynamic annotations for electronics documents
WO2005116863A1 (en) * 2004-05-24 2005-12-08 Swinburne University Of Technology A character display system
WO2006029259A2 (en) 2004-09-08 2006-03-16 Sharedbook Ltd Creating an annotated web page
US7779347B2 (en) * 2005-09-02 2010-08-17 Fourteen40, Inc. Systems and methods for collaboratively annotating electronic documents
CN100483416C (en) 2007-05-22 2009-04-29 北京搜狗科技发展有限公司 Character input method, input method system and method for updating word stock
CN101420313B (en) 2007-10-22 2011-01-12 北京搜狗科技发展有限公司 Method and system for clustering customer terminal user group
CN101645088B (en) 2008-08-05 2016-06-01 北京搜狗科技发展有限公司 Determine the method for auxiliary lexicon, device and the input method system that need to load
CN101645190B (en) 2009-07-22 2011-03-30 合肥讯飞数码科技有限公司 Word inquiring system and inquiring method thereof

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105989099A (en) * 2015-02-13 2016-10-05 晨星半导体股份有限公司 Relevant information display method and electronic device capable of automatically displaying relevant information
CN104933033A (en) * 2015-07-08 2015-09-23 邱行中 System for automatic pinyin annotation of Chinese characters and annotation method of system
CN107239441A (en) * 2017-04-26 2017-10-10 广东小天才科技有限公司 A kind of dictionary definition method and device
CN107239441B (en) * 2017-04-26 2020-09-01 广东小天才科技有限公司 Dictionary paraphrasing method and device
CN110321535B (en) * 2018-03-30 2023-08-18 富士胶片实业发展(上海)有限公司 Child reading material processing method and device
CN110321535A (en) * 2018-03-30 2019-10-11 富士施乐实业发展(中国)有限公司 Children's book processing method and processing device
CN108804002A (en) * 2018-04-25 2018-11-13 广州视源电子科技股份有限公司 The text annotation method and apparatus of interactive intelligence equipment
CN108804002B (en) * 2018-04-25 2022-03-08 广州视源电子科技股份有限公司 Text annotation method and device for interactive intelligent equipment
CN110874527A (en) * 2018-08-28 2020-03-10 游险峰 Cloud-based intelligent paraphrasing and phonetic notation system
CN111274352A (en) * 2020-01-14 2020-06-12 北大方正集团有限公司 Method and equipment for marking characteristic characters in tool book
CN111274352B (en) * 2020-01-14 2023-05-26 北大方正集团有限公司 Method and equipment for marking characteristic words in tool book
CN116484052A (en) * 2023-06-26 2023-07-25 广州宏途数字科技有限公司 Educational resource sharing system based on big data
CN116484052B (en) * 2023-06-26 2023-12-01 广州宏途数字科技有限公司 Educational resource sharing system based on big data

Also Published As

Publication number Publication date
CN102346731B (en) 2014-09-03
WO2012016505A1 (en) 2012-02-09
US10210148B2 (en) 2019-02-19
US20130132816A1 (en) 2013-05-23


Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant