CN106708793B - Annotate footnote recognition methods, device and electronic equipment - Google Patents

Annotate footnote recognition methods, device and electronic equipment Download PDF

Info

Publication number
CN106708793B
CN106708793B CN201611108286.0A CN201611108286A CN106708793B CN 106708793 B CN106708793 B CN 106708793B CN 201611108286 A CN201611108286 A CN 201611108286A CN 106708793 B CN106708793 B CN 106708793B
Authority
CN
China
Prior art keywords
footnote
annotation
recognizer
current file
notes content
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201611108286.0A
Other languages
Chinese (zh)
Other versions
CN106708793A (en
Inventor
于刚
胡元琪
孙上斌
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhangyue Technology Co Ltd
Original Assignee
Zhangyue Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhangyue Technology Co Ltd filed Critical Zhangyue Technology Co Ltd
Priority to CN201611108286.0A priority Critical patent/CN106708793B/en
Publication of CN106708793A publication Critical patent/CN106708793A/en
Application granted granted Critical
Publication of CN106708793B publication Critical patent/CN106708793B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/166Editing, e.g. inserting or deleting
    • G06F40/169Annotation, e.g. comment data or footnotes

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Document Processing Apparatus (AREA)

Abstract

The invention discloses a kind of annotation footnote recognition methods, device and electronic equipment, methods to include:It is chosen from preset multiple annotation footnote recognizers and is suitable for the highest annotation footnote recognizer of current file matching degree;The annotation footnote of current file is identified using annotation footnote recognizer;The notes content of current file is obtained, the corresponding notes content of footnote will be annotated and be associated.Utilize the program, select the highest annotation footnote recognizer of matching degree, the annotation footnote of current file is identified using the annotation footnote recognizer, and annotation footnote is associated with annotation, facilitate user that can be directly viewable corresponding annotation in reading file, query when user reads is solved in time, while avoids the troublesome operation of page turning before and after user.Further, situations such as being identified using the highest annotation footnote recognizer of matching degree, the annotation footnote in file can be found out to greatest extent, being reduced identification mistake or omit.

Description

Annotate footnote recognition methods, device and electronic equipment
Technical field
The present invention relates to computer software fields, and in particular to a kind of annotation footnote recognition methods, device and electronics are set It is standby.
Background technology
Hereof, it is related to specialized vocabulary, reference vocabulary when the vocabulary that authors' needs are explained further, often author Annotation footnote can be increased on these vocabulary.Correspondingly, at the end of each chapters and sections of file or at the end of the end of file, increase Add annotation corresponding with these annotation footnotes.By annotating footnote and annotation, it is more advantageous to that reader is helped to understand the interior of file Hold.
Since annotation footnote and annotation are separately positioned on different positions, when checking annotation, need file page turning backward It is checked to the position where annotation.User is caused to need file Page forward backward constantly in this way, annotation footnote with Annotation cannot directly be checked that the reading experience of user is ineffective.Simultaneously as the annotation footnote in file can sometimes Can there is a situation where that form is inconsistent, user is searching all annotation footnotes and during associated annotation, also easily occur to search mistake or Situations such as omission.
Invention content
In view of the above problems, it is proposed that the present invention overcomes the above problem in order to provide one kind or solves at least partly The recognition methods of annotation footnote, device and the electronic equipment of the above problem.
According to an aspect of the invention, there is provided a kind of annotation footnote recognition methods, including:
It is chosen from preset multiple annotation footnote recognizers and is suitable for the highest annotation footnote of current file matching degree Recognizer;
The annotation footnote of current file is identified using annotation footnote recognizer;
The notes content of current file is obtained, the corresponding notes content of footnote will be annotated and be associated.
According to another aspect of the present invention, a kind of annotation footnote identification device is provided, including:
Algorithm picks module is suitable for current file suitable for being chosen from preset multiple annotation footnote recognizers With the highest annotation footnote recognizer of degree;
Footnote identification module is annotated, suitable for identifying the annotation footnote of current file using annotation footnote recognizer;
Relating module suitable for obtaining the notes content of current file, will annotate the corresponding notes content of footnote and carry out Association.
According to another aspect of the invention, a kind of electronic equipment is provided, including:Processor, memory, communication interface And communication bus, the processor, the memory and the communication interface complete mutual lead to by the communication bus Letter;
For the memory for storing an at least executable instruction, the executable instruction performs the processor State the corresponding operation of annotation footnote recognition methods.
In accordance with a further aspect of the present invention, provide a kind of computer storage media, be stored in the storage medium to A few executable instruction, the executable instruction make the processor perform such as the corresponding behaviour of above-mentioned annotation footnote recognition methods Make.
According to annotation footnote recognition methods provided by the invention, device and electronic equipment, from preset multiple annotation footnotes It is chosen in recognizer and is suitable for the highest annotation footnote recognizer of current file matching degree;It is calculated using annotation footnote identification Method identifies the annotation footnote of current file;The notes content of current file is obtained, in the annotation corresponding by footnote is annotated Appearance is associated.The highest annotation footnote recognizer of matching degree is selected, is identified and worked as using the annotation footnote recognizer The annotation footnote of preceding document, and annotation footnote is associated with annotation, facilitate user that can be directly viewable in reading file Corresponding annotation solves query when user reads, while avoids the troublesome operation of page turning before and after user in time.Further, It is identified using the highest annotation footnote recognizer of matching degree, the annotation foot in file can be found out to greatest extent Mark, situations such as reducing identification mistake or omit.
Above description is only the general introduction of technical solution of the present invention, in order to better understand the technological means of the present invention, And it can be implemented in accordance with the contents of the specification, and in order to allow above and other objects of the present invention, feature and advantage can It is clearer and more comprehensible, below the special specific embodiment for lifting the present invention.
Description of the drawings
By reading the detailed description of hereafter preferred embodiment, it is various other the advantages of and benefit it is general for this field Logical technical staff will become clear.Attached drawing is only used for showing the purpose of preferred embodiment, and is not considered as to this hair Bright limitation.And throughout the drawings, the same reference numbers will be used to refer to the same parts.In the accompanying drawings:
Fig. 1 shows the flow chart of according to embodiments of the present invention one annotation footnote recognition methods;
Fig. 2 shows the flow charts of according to embodiments of the present invention one selection annotation footnote recognizer;
Fig. 3 shows the flow chart of according to embodiments of the present invention two annotation footnote recognition methods;
Fig. 4 shows the functional block diagram of according to embodiments of the present invention three annotation footnote identification device;
Fig. 5 shows the functional block diagram of according to embodiments of the present invention four annotation footnote identification device;
Fig. 6 shows the structure diagram of according to embodiments of the present invention six a kind of electronic equipment.
Specific embodiment
The exemplary embodiment of the disclosure is more fully described below with reference to accompanying drawings.Although this public affairs is shown in attached drawing The exemplary embodiment opened, it being understood, however, that may be realized in various forms the disclosure without the implementation that should be illustrated here Example is limited.On the contrary, these embodiments are provided to facilitate a more thoroughly understanding of the present invention, and can be by the disclosure Range is completely communicated to those skilled in the art.
Embodiment one
Fig. 1 shows the flow chart of according to embodiments of the present invention one annotation footnote recognition methods, as shown in Fig. 1, annotation Footnote recognition methods includes the following steps:
Step S101 chooses from preset multiple annotation footnote recognizers and is suitable for current file matching degree highest Annotation footnote recognizer.
Annotate the form of footnote due to different authors write that custom is different or correction personnel edit be accustomed to it is different etc. it is various because Element and there are different forms, therefore the recognizer of corresponding annotation footnote is also required to the annotation foot suitable for different-format Mark.A kind of recognizer for annotating footnote cannot meet the annotation footnote of multiple format.By a large amount of multiple experiments, to not Annotation footnote with form is trained, and obtains multiple annotation footnote recognizers.Wherein, annotation footnote recognizer can be with Using such as regular expression, by such as using regular expression in identification process, obtained from file and meet the regular expression Annotation footnote.
For current file, need to choose suitable for current file from preset multiple annotation footnote recognizers The highest annotation footnote recognizer of matching degree.It is specifically chosen from preset multiple annotation footnote recognizers and is suitable for working as The process of the highest annotation footnote recognizer of preceding document matching degree, as shown in Figure 2 includes the following steps:
S1011, statistics obtain the total number of all annotation footnotes in current file.
Annotation footnote is normally at the upper right corner or inferior horn of word, is set with the normal texts such as title, text in file Difference, font size are less than the font size of normal text, and the font of font and normal text may be inconsistent.According to annotation footnote and just The different situation of normal word, can be counted from current file, obtain total of all annotation footnotes in current file Number.In addition to carrying out statistics according to the annotation footnote situation different from normal text and obtaining number, it can also be obtained according to notes content Take the number of annotation footnote.It, can be to note because including annotation footnote and the corresponding notes content of annotation footnote in notes content It releases content to be parsed, obtains the number of annotation footnote.
S1012, successively using the annotation footnote in preset multiple annotation footnote recognizer identification current files, system Count the annotation footnote number identified.
S1013 selects annotation footnote identification corresponding with the annotation footnote number of the immediate identification of the total number of footnote Algorithm, annotation footnote recognizer are suitable for the highest annotation footnote recognizer of current file matching degree.
The annotation footnote in identification current file is removed using preset multiple annotation footnote recognizers successively, statistics is each The number of annotation footnote that a annotation footnote recognizer is identified.Obtained each annotation footnote recognizer institute will be counted The number of the annotation footnote of identification in step S1011 with obtaining in current file compared with the total number of all annotation footnotes, such as The total number of all annotation footnotes is 100 in obtained current file, the annotation that difference annotation footnote recognizer is identified The number of footnote is respectively 98,96,90 etc..From wherein selection and the total number of all annotation footnotes of current file Immediate number, that is, the number of annotation footnote identified is 98.The corresponding annotation footnote recognizer of the number is suitable For the highest annotation footnote recognizer of current file matching degree.If the annotation footnote that annotation footnote recognizer is identified Number be 100 when, i.e., the annotation footnote recognizer can recognize that all annotation footnotes in current file, no The annotation footnote that other preset multiple annotation footnote recognizers are gone in identification current file is reused, can be directly selected The annotation footnote recognizer is suitable for the highest annotation footnote recognizer of current file matching degree.
Step S102 identifies the annotation footnote of current file using annotation footnote recognizer.
Identification is gone to work as using the highest annotation footnote recognizer of current file matching degree that is suitable for that step S101 chooses The annotation footnote of preceding document.It can be according to the sortord difference of different file notes footnotes, as noted in current file during identification Footnote is released in a manner that full text sorts successively, the entirety of current file can be identified, is disposably identified current Annotation footnote in file;Or if the annotation footnote in current file is in a manner that often chapters and sections content sorts successively, i.e., currently Annotation footnote in every chapters and sections content of file is sorted since 1, using annotation footnote recognizer to current file point Chapters and sections are identified, and only identify the annotation footnote in one chapters and sections of current file every time, until having identified current file;Or such as Annotation footnote in current file is in a manner that every page of content sorts successively, i.e., the annotation in current file in every page of content Footnote is sorted since 1, and current file paging is identified using annotation footnote recognizer, is only identified every time Annotation footnote in current file one page, until having identified current file.
It is to be configured for example, being embodied according to actual conditions above, does not limit herein.
Step S103 obtains the notes content of current file, will annotate the corresponding notes content of footnote and closes Connection.
The notes content of file is generally located at the end of file, at end of the file per chapters and sections or every page of file At end, some can add horizontal line at each end, to distinguish text and notes content.Title in notes content and file, just The setting of the normal texts such as text may be different, and such as font size of the font size less than normal text, the font of font and normal text is inconsistent Situations such as.According to the feature of notes content, notes content can be got from current file.
Include each annotation footnote and the corresponding notes content of annotation footnote in notes content.It, can according to annotation footnote It is associated one by one with the annotation footnote in current file so that the corresponding notes content of footnote will be annotated.During association, can such as it incite somebody to action The corresponding notes content of annotation footnote establishes correspondence with the annotation footnote in current character, when user wants to check annotation During the corresponding notes content of footnote, annotation footnote can be operated, corresponding annotation can be viewed at annotation footnote Content.
According to annotation footnote recognition methods provided by the invention, chosen from preset multiple annotation footnote recognizers Suitable for the highest annotation footnote recognizer of current file matching degree;Being identified using annotation footnote recognizer ought be above The annotation footnote of part;The notes content of current file is obtained, the corresponding notes content of footnote will be annotated and be associated.Choosing The highest annotation footnote recognizer of matching degree is taken out, the annotation of current file is identified using the annotation footnote recognizer Footnote, and annotation footnote is associated with annotation, facilitate user that can be directly viewable corresponding annotation in reading file, Query when user reads is solved in time, while avoids the troublesome operation of page turning before and after user.Further, using matching degree Highest annotation footnote recognizer is identified, and can find out the annotation footnote in file to greatest extent, reduce and know Situations such as wrong or omission.
Embodiment two
Fig. 3 shows the flow chart of according to embodiments of the present invention two annotation footnote recognition methods, as shown in Fig. 3, annotation Footnote recognition methods includes the following steps:
Step S301 chooses from preset multiple annotation footnote recognizers and is suitable for current file matching degree highest Annotation footnote recognizer.
Step S302 identifies the annotation footnote of current file using annotation footnote recognizer.
Above step can refer to the step S101-S102 in Fig. 1 embodiments one, and details are not described herein.
It should be noted that after annotation footnote recognizer is modified, calculated using modified annotation footnote identification Method continues to identify the annotation footnote of current file.
Step S303, judges whether the number of the annotation footnote of identification is equal to the number of the annotation footnote of current file.
If using annotation footnote recognizer identify be current file all annotation footnotes, by the annotation of identification The number of footnote judges of the annotation footnote of identification compared with the total number of all annotation footnotes of current file carries out Whether number is equal to the number of the annotation footnote of current file;If what it is using annotation footnote recognizer identification is current file The annotation footnote of a certain chapters and sections first obtains the total number that footnote is annotated in the chapters and sections, by the number of the annotation footnote of identification with being somebody's turn to do The total number for annotating footnote in chapters and sections is compared, and is judged whether the number of the annotation footnote of identification is equal in the chapters and sections and is annotated The total number of footnote;If using annotation footnote recognizer identify be current file certain one page annotation footnote, first obtain The total number of footnote is annotated in this page, the number of the annotation footnote of identification is subjected to phase with annotating the total number of footnote in this page Than judging whether the number of the annotation footnote of identification is equal to the total number that footnote is annotated in this page.
If the number of the annotation footnote of identification is equal to the number of the annotation footnote of current file or the annotation footnote of identification Number, which is equal to equal to the number of the total number of annotation footnote in the chapters and sections or the annotation footnote of identification in this page, annotates the total of footnote During number, i.e., whole annotation footnotes is identified using annotation footnote recognizer, has performed step S305;Otherwise, even if Do not identify whole annotation footnotes with annotation footnote recognizer, need to continue to the annotation footnote in current file into Row identification, performs step S304.
Step S304, if the number of the annotation footnote of identification is not equal to the number of the annotation footnote of current file, modification note Release footnote recognizer.
When modifying to annotation footnote recognizer, the various limits in footnote recognizer can be annotated by reduction Fixed condition modification annotation footnote recognizer.Footnote recognizer is such as annotated to include such as regular expression, the regular expression It is pre-set to be suitable for the character of annotation footnote and the regular character string that forms of character combination.Pass through regular expression pair Current file is filtered, and identification obtains annotation footnote.Such as regular expression can identify that (1), (2), (3) include number The annotation footnote of word and round bracket form.If annotation footnote is caused due to hand mistake in current file when author writes etc. Also have such as (4】During the annotation footnote of form, which goes out (4】The annotation footnote of form.Therefore it is right The regular expression is modified, and such as removal annotation footnote must include the limitation of () round bracket form, reduces annotation footnote Qualifications in recognizer, and then can identify more annotation footnotes.
When modifying to annotation footnote recognizer, it can also be annotated by modification various in footnote recognizer Qualifications annotate footnote recognizer to change.When modification annotates the various qualifications in footnote recognizer, Ke Yiru The former qualifications in annotation footnote recognizer are replaced using corresponding qualifications.Footnote recognizer is such as annotated to include Such as regular expression, regular expression can identify that (1), (2), (3) etc. include Arabic numerals and round bracket form Annotate footnote.It, can be by the corresponding canonical of Arabic numerals if also there is the annotation footnote such as (four) form in current file Expression formula is replaced accordingly, and use can identify the corresponding regular expression of Chinese-character digital, and then can identify more More annotation footnotes.
Be above for example, in implementing to operate, may annotate regular expression that footnote recognizer uses or During other modes identification annotation footnote, when multiple qualifications that annotation footnote recognizer includes are modified, it can subtract Few one or more qualifications replace one or more corresponding qualifications using corresponding qualifications, can also Reduce and change simultaneously one or more qualifications.Specific modification mode is configured according to scene is implemented, and is not done herein It limits.
Modification annotate footnote recognizer after, perform step S303, using it is modified annotation footnote recognizer after Annotation footnote in continuous identification current file.
Step S305, the notes content type specified according to user obtain the notes content of current file, will annotate foot Corresponding notes content is marked to be associated.
Notes content type can be divided into general remarks, combination annotation etc..The notes content of general remarks is according to annotation foot That marks a rule lists the corresponding notes content of annotation footnote, and such as every notes content is shown in the form of a line.Combination The notes content of annotation can put the corresponding notes content of all annotation footnotes together, with whole such as the side of one whole section of content Formula shows all notes contents.
The notes content type specified according to user if notes content type is combination annotation, is obtaining notes content When, first whole section of notes content is segmented according to annotation footnote.As notes content is:(1) XXXX.(2)XXXX.(3) XXXX.First notes content according to each annotation footnote is segmented, respective annotation footnote is got and annotation footnote is corresponding Notes content.The corresponding notes content of footnote will be annotated to be associated;Or such as notes content type is general remarks, it can To directly acquire the corresponding notes content of each annotation footnote, the corresponding notes content of footnote will be annotated and be associated.
Step S306 obtains the behavior of user's operation annotation footnote, will be shown with the annotation associated notes content of footnote Show.
It will be annotated after the corresponding notes content of footnote is associated by step S305, obtain user's operation annotation The behavior of footnote such as obtains the behavior that user double-clicked or clicked annotation footnote, the corresponding notes content of display at annotation footnote So that user checks;Or obtain being grasped as setting annotates the related of the associated notes content show or hide of footnote for user's operation Make behavior, show that corresponding notes content checks or hide corresponding notes content with convenient for user at annotation footnote User is to reading of body matter etc..
According to annotation footnote recognition methods provided by the invention, chosen from preset multiple annotation footnote recognizers Suitable for the highest annotation footnote recognizer of current file matching degree;Being identified using annotation footnote recognizer ought be above The annotation footnote of part;Judge whether the number of the annotation footnote of identification is equal to the number of the annotation footnote of current file, if identification Annotation footnote number not equal to current file annotation footnote number, modification annotation footnote recognizer.Use modification Annotation footnote recognizer afterwards continues to identify the annotation footnote in current file, until number of annotation footnote of identification etc. In the number of the annotation footnote of current file.The notes content type specified according to user, obtains in the annotation of current file Hold, the corresponding notes content of footnote will be annotated and be associated.The highest annotation footnote recognizer of matching degree is selected, The annotation footnote of current file is identified using the annotation footnote recognizer, and annotation footnote is associated with annotation, side Just user can be directly viewable corresponding annotation in reading file, solve query when user reads in time, avoid simultaneously The troublesome operation of page turning before and after user.It is identified using the highest annotation footnote recognizer of matching degree, it can be to greatest extent The annotation footnote found out in file, reduce identification mistake or omit situations such as.Further, when footnote annotation algorithm can not It when identifying the annotation footnote in all files, modifies to footnote annotation algorithm, and is calculated using modified annotation footnote Method further searches for out the annotation footnote in file, can perform repeatedly, until identifying annotation footnote all in file. Meanwhile different types of notes content is handled, notes content is associated with annotation footnote.Obtaining user's operation note After the behavior for releasing footnote, it will be shown with the annotation associated notes content of footnote.Cause user while reading file, Notes content can be got, user experience is preferable.
Embodiment three
Fig. 4 shows the functional block diagram of according to embodiments of the present invention three annotation footnote identification device.As shown in figure 4, note It releases footnote identification device and includes following module:
Algorithm picks module 410 is suitable for current file suitable for being chosen from preset multiple annotation footnote recognizers The highest annotation footnote recognizer of matching degree.
Annotate the form of footnote due to different authors write that custom is different or correction personnel edit be accustomed to it is different etc. it is various because Element and there are different forms, therefore the recognizer of corresponding annotation footnote is also required to the annotation foot suitable for different-format Mark.A kind of recognizer for annotating footnote cannot meet the annotation footnote of multiple format.The annotation footnote of different-format is passing through After a large amount of multiple training, multiple annotation footnote recognizers can be obtained.Wherein, annotation footnote recognizer may be used Such as regular expression, by such as using regular expression in identification process, the note for meeting the regular expression is obtained from file Release footnote.
For current file, need through algorithm picks module 410 from preset multiple annotation footnote recognizers It chooses and is suitable for the highest annotation footnote recognizer of current file matching degree.Specific algorithm picks module 410 includes as follows Module:
Statistical module 411 obtains the total number of all annotation footnotes in current file suitable for statistics.
Annotation footnote is normally at the upper right corner or inferior horn of word, is set with the normal texts such as title, text in file Difference, font size are less than the font size of normal text, and the font of font and normal text may be inconsistent.Statistical module 411 is according to note The footnote situation different from normal text is released, can be counted from current file, obtains all annotation feet in current file Target total number.In addition to carrying out statistics according to the annotation footnote situation different from normal text and obtaining number, statistical module 411 The number of annotation footnote can also be obtained according to notes content.Because including annotation footnote and annotation footnote pair in notes content The notes content answered, statistical module 411 can parse notes content, obtain the number of annotation footnote.
Identification module 412 is tested, suitable for successively using in preset multiple annotation footnote recognizer identification current files Annotation footnote, count the annotation footnote number that is identified.
Test identification module 412 is gone using preset multiple annotation footnote recognizers in identification current file successively Footnote is annotated, counts the number of annotation footnote that each annotation footnote recognizer is identified.Algorithm picks module 410 will unite The number and statistical module 411 for counting the annotation footnote that obtained each annotation footnote recognizer is identified obtain current file In the total numbers of all annotation footnotes compare, all annotation footnotes is total in the current file obtained such as statistical module 411 Number is 100, and test identification module 412 obtains the number difference for the annotation footnote that different annotation footnote recognizers are identified It is 98,96,90 etc..Algorithm picks module 410 from wherein selection with current file it is all annotation footnotes total numbers Immediate number, that is, the number of annotation footnote identified is 98.The corresponding annotation footnote recognizer of the number is suitable For the highest annotation footnote recognizer of current file matching degree.If test identification module 412 obtains annotation, footnote identification is calculated When the number of annotation footnote that method is identified is 100, i.e., the annotation footnote recognizer can recognize that in current file All annotation footnotes, test identification module 412 do not use other preset multiple annotation footnote recognizers and identification are gone to work as Annotation footnote in preceding document, it is suitable for current that algorithm picks module 410, which can directly select the annotation footnote recognizer, The highest annotation footnote recognizer of file matching degree.
Footnote identification module 420 is annotated, suitable for identifying the annotation foot of current file using annotation footnote recognizer Mark.
It annotates footnote identification module 420 and is suitable for current file matching degree highest using what algorithm picks module 410 was chosen Annotation footnote recognizer go identification current file annotation footnote.Annotating can basis when footnote identification module 420 identifies The sortord of different file notes footnotes is different, as annotated footnote in current file in a manner that full text sorts successively, note The entirety of current file can be identified by releasing footnote identification module 420, disposably identify the annotation in current file Footnote;Or if the annotation footnote in current file is in a manner that often chapters and sections content sorts successively, i.e. every chapters and sections of current file Annotation footnote in content is sorted since 1, and annotation footnote identification module 420 is using annotation footnote recognizer to working as Preceding document divides chapters and sections to be identified, and only identifies the annotation footnote in one chapters and sections of current file every time, ought be above up to having identified Part;Or if the annotation footnote in current file is in a manner that every page of content sorts successively, i.e., in current file in every page of content Annotation footnote be all to sort since 1, annotation footnote identification module 420 is using annotation footnote recognizer to current file Paging is identified, and only identifies the annotation footnote in current file one page every time, until having identified current file.
It is to be configured for example, being embodied according to actual conditions above, does not limit herein.
Relating module 430, suitable for obtaining the notes content of current file, the notes content corresponding by footnote is annotated It is associated.
The notes content of file is generally located at the end of file, at end of the file per chapters and sections or every page of file At end, some can add horizontal line at each end, to distinguish text and notes content.Title in notes content and file, just The setting of the normal texts such as text may be different, and such as font size of the font size less than normal text, the font of font and normal text is inconsistent Situations such as.Relating module 430 can get notes content according to the feature of notes content from current file.
Include each annotation footnote and the corresponding notes content of annotation footnote in notes content.Notes content is according to class Type can be divided into general remarks, combination annotation etc..The notes content of general remarks lists annotation according to annotation one rule of footnote The corresponding notes content of footnote, such as every notes content are shown in the form of a line.Combining the notes content of annotation can incite somebody to action All corresponding notes contents of footnote that annotate are put together, are shown in all annotations in a manner of whole such as one whole section of content Hold.
The notes content type specified according to user, relating module 430 further comprise following module:
It is annotated if combining relating module 431 and being suitable for notes content type for combination.It combines relating module 431 and obtains annotation During content, first whole section of notes content is segmented according to annotation footnote.As notes content is:(1)XXXX.(2)XXXX. (3)XXXX.First notes content according to each annotation footnote is segmented, gets respective annotation footnote and annotation footnote pair The notes content answered.The corresponding notes content of footnote will be annotated to be associated.
It is general remarks that if common association module 432, which is suitable for notes content type,.Common association module 432 can be direct The corresponding notes content of each annotation footnote is obtained, the corresponding notes content of footnote will be annotated and be associated.
It is corresponding such as can will to annotate footnote in the corresponding notes content of association annotation footnote for relating module 430 Notes content establishes correspondence with the annotation footnote in current character, when user wants to check the corresponding annotation of annotation footnote During content, annotation footnote can be operated, corresponding notes content can be viewed at annotation footnote.
According to annotation footnote identification device provided by the invention, chosen from preset multiple annotation footnote recognizers Suitable for the highest annotation footnote recognizer of current file matching degree;Being identified using annotation footnote recognizer ought be above The annotation footnote of part;The notes content of current file is obtained, the corresponding notes content of footnote will be annotated and be associated.Choosing The highest annotation footnote recognizer of matching degree is taken out, the annotation of current file is identified using the annotation footnote recognizer Footnote, and annotation footnote is associated with annotation, facilitate user that can be directly viewable corresponding annotation in reading file, Query when user reads is solved in time, while avoids the troublesome operation of page turning before and after user.Further, using matching degree Highest annotation footnote recognizer is identified, and can find out the annotation footnote in file to greatest extent, reduce and know Situations such as wrong or omission.
Example IV
Fig. 5 shows the functional block diagram of according to embodiments of the present invention four annotation footnote identification device.As shown in figure 5, with The difference lies in annotation footnote identification device further includes following module to Fig. 4:
Whether judgment module 440, the number suitable for judging the annotation footnote of identification are equal to the annotation footnote of current file Number.
If annotation footnote identification module 420 using annotation footnote recognizer identify be current file all notes Release footnote, judgment module 440 by the number of the annotation footnote of identification and the total numbers of all annotation footnotes of current file into Row is compared, and judges whether the number of the annotation footnote of identification is equal to the number of the annotation footnote of current file;If it annotates footnote to know Other module 420 using annotation footnote recognizer identify be current file a certain chapters and sections annotation footnote, judgment module 440 first obtain the total number that footnote is annotated in the chapters and sections, by the number of the annotation footnote of identification and annotation footnote in the chapters and sections Total number is compared, and judges whether the number of the annotation footnote of identification is equal to the total number that footnote is annotated in the chapters and sections;If note Release footnote identification module 420 using annotation footnote recognizer identify be current file certain one page annotation footnote, sentence Disconnected module 440 first obtains the total number that footnote is annotated in this page, will annotate footnote in the number of the annotation footnote of identification and the page Total number compared, judge identification annotation footnote number whether be equal to this page in annotate footnote total number.
Judgment module 440 judges that the number of the annotation footnote of identification is equal to number or the knowledge of the annotation footnote of current file The number of other annotation footnote is equal to the total number of annotation footnote or the number of the annotation footnote of identification in the chapters and sections and is equal to the page During the total number of middle annotation footnote, that is, annotate footnote identification module 420 and identified whole using annotation footnote recognizer Annotation footnote, perform relating module 430.Otherwise, i.e., annotation footnote identification module 420 is not had using annotation footnote recognizer Have and identify whole annotation footnotes, need to continue that the annotation footnote in current file is identified, perform modified module 450。
Modified module 450, if the number suitable for the annotation footnote of identification is a not equal to the annotation footnote of current file Number, modification annotation footnote recognizer;
When modified module 450 modifies to annotation footnote recognizer, can footnote recognizer be annotated by reduction In various qualifications modification annotation footnote recognizer.Footnote recognizer is such as annotated to include such as regular expression, it should The regular character string that regular expression is the pre-set character for being suitable for annotation footnote and character combination forms.Annotate foot Mark identification module 420 is filtered current file using regular expression, and identification obtains annotation footnote.Such as annotate footnote knowledge Other module 420 can identify that (1), (2), (3) etc. include the annotation foot of number and round bracket form using regular expression Mark.If annotation footnote is caused also to have such as (4 due to hand mistake in current file when author writes etc.】The annotation of form During footnote, annotation footnote identification module 420 goes out (4 using the regular expression None- identified】The annotation footnote of form.Therefore it repaiies Change module 450 to modify to the regular expression, must include () round bracket shape as modified module 450 removes annotation footnote The limitation of formula reduces the qualifications in annotation footnote recognizer, and then can identify more annotation footnotes.
When modified module 450 modifies to annotation footnote recognizer, footnote identification can also be annotated by modification and is calculated Various qualifications in method annotate footnote recognizer to change.In the modification annotation footnote recognizer of modified module 450 It, can be as corresponding qualifications be used to replace the former qualifications in annotation footnote recognizer during various qualifications.Such as Annotation footnote recognizer includes such as regular expression, and annotation footnote identification module 420 can be identified using regular expression Go out (1), (2), (3) etc. and include Arabic numerals and the annotation footnote of round bracket form.If also there are such as (four) in current file During the annotation footnote of form, modified module 450 can be replaced the corresponding regular expression of Arabic numerals accordingly, Using can identify the corresponding regular expression of Chinese-character digital, and then it can identify more annotation footnotes.
Be above for example, in implementing to operate, may annotate regular expression that footnote recognizer uses or During other modes identification annotation footnote, when multiple qualifications that annotation footnote recognizer includes are modified, mould is changed Block 450 can reduce one or more qualifications or replace one or more corresponding restrictions using corresponding qualifications Condition can also reduce and change simultaneously one or more qualifications.Specific modification mode is set according to scene is implemented It puts, does not limit herein.
After modified module 450 changes annotation footnote recognizer, annotation footnote identification module 420 is continued to execute.Annotation Footnote identification module 420 continues to identify the annotation footnote in current file using modified annotation footnote recognizer.It repeats Judgment module 440 and modified module 450 are performed, until judgment module 440 judges that the annotation footnote of identification is equal to current file Until the number for annotating footnote.
Display module 460, will be with annotating in the associated annotation of footnote suitable for obtaining the behavior of user's operation annotation footnote Appearance is shown.
It will be annotated after the corresponding notes content of footnote is associated by relating module 430, display module 460 obtains The behavior of user's operation annotation footnote is taken, if display module 460 obtains user's double-click or clicks the behavior of annotation footnote, is being noted It releases and shows corresponding notes content so that user checks at footnote;Or display module 460 obtains being annotated as set for user's operation The relevant operation behavior of the associated notes content show or hide of footnote, at annotation footnote the corresponding notes content of display with It is checked for user or hides corresponding notes content to facilitate user to reading of body matter etc..
According to annotation footnote identification device provided by the invention, chosen from preset multiple annotation footnote recognizers Suitable for the highest annotation footnote recognizer of current file matching degree;Being identified using annotation footnote recognizer ought be above The annotation footnote of part;Judge whether the number of the annotation footnote of identification is equal to the number of the annotation footnote of current file, if identification Annotation footnote number not equal to current file annotation footnote number, modification annotation footnote recognizer.Use modification Annotation footnote recognizer afterwards continues to identify the annotation footnote in current file, until number of annotation footnote of identification etc. In the number of the annotation footnote of current file.The notes content of current file is obtained, in the annotation corresponding by footnote is annotated Appearance is associated.The highest annotation footnote recognizer of matching degree is selected, is identified using the annotation footnote recognizer The annotation footnote of current file, and annotation footnote is associated with annotation, facilitate user that can directly be looked into reading file It sees corresponding annotation, solves query when user reads in time, while avoid the troublesome operation of page turning before and after user.It uses The highest annotation footnote recognizer of matching degree is identified, and can find out the annotation footnote in file to greatest extent, subtract Situations such as few identification mistake or omission.Further, the annotation footnote in all files is gone out when footnote annotation algorithm None- identified When, it modifies to footnote annotation algorithm, and the annotation in file is further searched for out using modified annotation footnote algorithm Footnote can perform repeatedly, until identifying annotation footnote all in file.Meanwhile obtaining user's operation annotation foot After target behavior, it will be shown with the annotation associated notes content of footnote.So that user is while reading file, you can Notes content is got, user experience is preferable.
Embodiment five
The embodiment of the present application five provides a kind of nonvolatile computer storage media, and the computer storage media is deposited An at least executable instruction is contained, which can perform the annotation footnote in above-mentioned any means embodiment Recognition methods.
Embodiment six
Fig. 6 shows the structure diagram of according to embodiments of the present invention six a kind of electronic equipment, present invention specific implementation Example does not limit the specific implementation of electronic equipment.
As shown in fig. 6, the electronic equipment can include:Processor (processor) 602, communication interface (Communications Interface) 604, memory (memory) 606 and communication bus 608.
Wherein:
Processor 602, communication interface 604 and memory 606 complete mutual communication by communication bus 608.
Communication interface 604, for communicating with the network element of miscellaneous equipment such as client or other servers etc..
Processor 602 for performing program 610, can be specifically performed in above-mentioned annotation footnote recognition methods embodiment Correlation step.
Specifically, program 610 can include program code, which includes computer-managed instruction.
Processor 602 may be central processor CPU or specific integrated circuit ASIC (Application Specific Integrated Circuit) or be arranged to implement the embodiment of the present invention one or more integrate Circuit.The one or more processors that electronic equipment includes can be same type of processor, such as one or more CPU; Can also be different types of processor, such as one or more CPU and one or more ASIC.
Memory 606, for storing the first data acquisition system, the second data set and program 610.Memory 606 may Include high-speed RAM memory, it is also possible to further include nonvolatile memory (non-volatile memory), for example, at least one A magnetic disk storage.
Program 610 specifically can be used for so that processor 602 performs following operation:Know from preset multiple annotation footnotes It is chosen in other algorithm and is suitable for the highest annotation footnote recognizer of current file matching degree;Use annotation footnote recognizer Identify the annotation footnote of current file;Obtain the notes content of current file, the notes content corresponding by footnote is annotated It is associated.
In a kind of optional embodiment, program 610 is for so that the statistics of processor 602 obtains institute in current file There is the total number of annotation footnote;Successively using the annotation foot in preset multiple annotation footnote recognizer identification current files Mark counts the annotation footnote number identified;The annotation footnote number pair of selection and the immediate identification of total number of footnote The annotation footnote recognizer answered;It is to know suitable for the highest annotation footnote of current file matching degree to annotate footnote recognizer Other algorithm.
In a kind of optional embodiment, program 610 is for so that processor 602 judges the annotation footnote of identification Whether number is equal to the number of the annotation footnote of current file, if the number of the annotation footnote of identification is not equal to current file Annotate the number of footnote, modification annotation footnote recognizer;Use modified annotation footnote recognizer identification current file Annotation footnote;The step is repeated, until judging that the annotation footnote of identification is equal to of the annotation footnote of current file Number is terminated and is performed.
In a kind of optional embodiment, program 610 is for so that processor 602 reduces and/or change annotation footnote The qualifications of recognizer.
In a kind of optional embodiment, program 610 is for so that in the annotation that processor 602 is specified according to user Hold type, obtain the notes content of current file, the corresponding notes content of footnote will be annotated and be associated.
In a kind of optional embodiment, notes content type is general remarks or combination annotation.Program 610 is used for So that the notes content type that processor 602 is specified according to user, obtain the notes content of current file, will annotation footnote with Its corresponding notes content, which is associated, to be further comprised:If notes content type is annotated for combination, notes content is obtained;Root Notes content is segmented according to annotation footnote;The corresponding notes content of footnote will be annotated to be associated;If notes content Type is general remarks, obtains notes content;The corresponding notes content of footnote will be annotated to be associated.
In a kind of optional embodiment, program 610 is for so that processor 602 obtains user's operation annotation footnote Behavior, will with annotation the associated notes content of footnote show.
The specific implementation of each step may refer to the corresponding steps in above-mentioned annotation footnote identification embodiment in program 610 The corresponding description in unit, this will not be repeated here.It is apparent to those skilled in the art that the side for description Just and succinctly, the specific work process of the equipment of foregoing description and module can refer to corresponding in preceding method embodiment Journey describes, and details are not described herein.
The scheme provided through this embodiment is chosen from preset multiple annotation footnote recognizers and is suitable for currently The highest annotation footnote recognizer of file matching degree;The annotation foot of current file is identified using annotation footnote recognizer Mark;The notes content of current file is obtained, the corresponding notes content of footnote will be annotated and be associated.Select matching degree Highest annotation footnote recognizer identifies the annotation footnote of current file using the annotation footnote recognizer, and will note It releases footnote to be associated with annotation, facilitates user that can be directly viewable corresponding annotation in reading file, solve user in time Query during reading, while avoid the troublesome operation of page turning before and after user.Further, using the highest annotation foot of matching degree Mark recognizer is identified, and can find out the annotation footnote in file to greatest extent, reduces identification mistake or omits Situations such as.
Algorithm and display be not inherently related to any certain computer, virtual system or miscellaneous equipment provided herein. Various general-purpose systems can also be used together with teaching based on this.As described above, required by constructing this kind of system Structure be obvious.In addition, the present invention is not also directed to any certain programmed language.It should be understood that it can utilize various Programming language realizes the content of invention described herein, and the description done above to language-specific is to disclose this The preferred forms of invention.
In the specification provided in this place, numerous specific details are set forth.It is to be appreciated, however, that the implementation of the present invention Example can be put into practice without these specific details.In some instances, well known method, knot is not been shown in detail Structure and technology, so as not to obscure the understanding of this description.
Similarly, it should be understood that in order to simplify the disclosure and help to understand one or more of each inventive aspect, Above in the description of exemplary embodiment of the present invention, each feature of the invention is grouped together into single reality sometimes It applies in example, figure or descriptions thereof.However, the method for the disclosure should be construed to reflect following intention:Want Ask protection the present invention claims the more features of feature than being expressly recited in each claim.More precisely, such as As following claims reflect, inventive aspect is all features less than single embodiment disclosed above. Therefore, it then follows thus claims of specific embodiment are expressly incorporated in the specific embodiment, wherein each right It is required that in itself all as separate embodiments of the invention.
Those skilled in the art, which are appreciated that, to carry out adaptivity to the module in the equipment in embodiment Ground changes and they is arranged in one or more equipment different from the embodiment.It can be the module in embodiment Or unit or component are combined into a module or unit or component and can be divided into multiple submodule or son in addition Unit or sub-component.It, can be with other than such feature and/or at least some of process or unit exclude each other Using any combinations to all features disclosed in this specification (including adjoint claim, abstract and attached drawing) and such as Any method of the displosure or all processes or unit of equipment are combined.Unless expressly stated otherwise, this specification Each feature disclosed in (including adjoint claim, abstract and attached drawing) can be by providing identical, equivalent or similar mesh Alternative features replace.
In addition, it will be appreciated by those of skill in the art that although some embodiments described herein include other embodiments In included certain features rather than other feature, but the combination of the feature of different embodiments means in the present invention Within the scope of and form different embodiments.For example, in the following claims, embodiment claimed One of arbitrary mode can use in any combination.
The all parts embodiment of the present invention can be with hardware realization or to be transported on one or more processor Capable software module is realized or is realized with combination thereof.It it will be understood by those of skill in the art that can be in practice Annotation footnote identification device according to embodiments of the present invention is realized using microprocessor or digital signal processor (DSP) In some or all components some or all functions.The present invention is also implemented as described here for performing Some or all equipment of method or program of device (for example, computer program and computer program product).This The program of the realization present invention of sample can may be stored on the computer-readable medium or can have one or more signal Form.Such signal can be downloaded from internet website to be obtained either providing or with any on carrier signal Other forms provide.
It should be noted that the present invention will be described rather than limits the invention for above-described embodiment, and this Field technology personnel can design alternative embodiment without departing from the scope of the appended claims.In claim In, any reference mark between bracket should not be configured to limitations on claims.Word "comprising" is not excluded for depositing In element or step not listed in the claims.Word "a" or "an" before element does not exclude the presence of multiple Such element.The present invention can be by means of including the hardware of several different elements and by means of properly programmed calculating Machine is realized.If in the unit claim for listing equipment for drying, several in these devices can be by same Hardware branch embodies.The use of word first, second, and third does not indicate that any sequence.It can be by these word solutions It is interpreted as title.

Claims (14)

1. a kind of annotation footnote recognition methods, including:
Being suitable for from the preset selection suitable for multiple annotation footnote recognizers of the annotation footnote of different-format ought be above The highest annotation footnote recognizer of part matching degree;
The annotation footnote of current file is identified using the annotation footnote recognizer;
Judge whether the number of the annotation footnote of the identification is equal to the number of the annotation footnote of current file, if the annotation of identification The number of footnote changes the annotation footnote recognizer not equal to the number of the annotation footnote of current file;After modification Annotation footnote recognizer identification current file annotation footnote;The step is repeated, until judging the note of the identification The number for releasing footnote is equal to the number for annotating footnote of current file, terminates and performs;
The notes content of current file is obtained, the corresponding notes content of the annotation footnote is associated.
It is 2. described from the multiple of the preset annotation footnote suitable for different-format according to the method described in claim 1, wherein It chooses in annotation footnote recognizer and further comprises suitable for the highest annotation footnote recognizer of current file matching degree:
Statistics obtains the total number of all annotation footnotes in current file;
Successively using the annotation footnote in preset multiple annotation footnote recognizer identification current files, count what is identified Annotate footnote number;
Select annotation foot corresponding with the annotation footnote number of the immediate identification of total number of all annotation footnotes Mark recognizer;The annotation footnote recognizer is suitable for the highest annotation footnote recognizer of current file matching degree.
3. according to the method described in claim 1, wherein, the modification annotation footnote recognizer further comprises:
Reduce and/or change the qualifications of the annotation footnote recognizer.
4. according to the method described in claim 1, wherein, the notes content for obtaining current file, by the annotation footnote Corresponding notes content, which is associated, to be further comprised:
The notes content type specified according to user obtains the notes content of current file, and the annotation footnote is corresponding Notes content be associated.
5. according to the method described in claim 4, wherein, the notes content type is general remarks or combination annotation;
The notes content type specified according to user obtains the notes content of current file, by the annotation footnote and its Corresponding notes content, which is associated, to be further comprised:
If the notes content type is annotated for combination, the notes content is obtained;According to the footnote that annotates by the annotation Content is segmented;The corresponding notes content of the annotation footnote is associated;
If the notes content type is general remarks, the notes content is obtained;By the corresponding note of the annotation footnote Content is released to be associated.
6. according to claim 1-5 any one of them methods, wherein, the method further includes:
The behavior of user's operation annotation footnote is obtained, will be shown with the annotation associated notes content of footnote.
7. a kind of annotation footnote identification device, including:
Algorithm picks module, suitable for from the preset multiple annotation footnote recognizers for annotating footnote suitable for different-format It chooses and is suitable for the highest annotation footnote recognizer of current file matching degree;
Footnote identification module is annotated, suitable for identifying the annotation footnote of current file using the annotation footnote recognizer;
Suitable for obtaining the notes content of current file, the corresponding notes content of the annotation footnote is carried out for relating module Association;
The annotation footnote identification module further comprises:
Whether judgment module, the number suitable for judging the annotation footnote of the identification are equal to of the annotation footnote of current file Number;
Modified module if the number suitable for the annotation footnote of identification is not equal to the number for annotating footnote of current file, changes institute State annotation footnote recognizer;
The annotation footnote identification module is further adapted for:Use modified annotation footnote recognizer identification current file Annotate footnote;The judgment module and the modified module are repeated, until judging the number of the annotation footnote of the identification Until number equal to the annotation footnote of current file, terminate and perform.
8. device according to claim 7, wherein, the algorithm picks module further comprises:
Statistical module obtains the total number of all annotation footnotes in current file suitable for statistics;
Identification module is tested, suitable for successively using the annotation foot in preset multiple annotation footnote recognizer identification current files Mark counts the annotation footnote number identified;
The algorithm picks module is further adapted for:Selection and the immediate identification of total number of all annotation footnotes Annotation footnote number it is corresponding annotation footnote recognizer;The annotation footnote recognizer is to be matched suitable for current file Spend highest annotation footnote recognizer.
9. device according to claim 7, wherein, the modified module is further adapted for:
Reduce and/or change the qualifications of the annotation footnote recognizer.
10. device according to claim 7, wherein, the relating module is further adapted for:
The notes content type specified according to user obtains the notes content of current file, and the annotation footnote is corresponding Notes content be associated.
11. device according to claim 10, wherein, the notes content type is general remarks or combination annotation;
The relating module further comprises:
Relating module is combined, if being annotated suitable for the notes content type for combination, obtains the notes content;According to the note Footnote is released to be segmented the notes content;The corresponding notes content of the annotation footnote is associated;
Common association module if being general remarks suitable for the notes content type, obtains the notes content;By the annotation The corresponding notes content of footnote is associated.
12. according to claim 7-11 any one of them devices, wherein, described device further includes:
Display module, suitable for obtain user's operation annotation footnote behavior, will with it is described annotation the associated notes content of footnote into Row display.
13. a kind of electronic equipment, including:Processor, memory, communication interface and communication bus, the processor, the storage Device and the communication interface complete mutual communication by the communication bus;
For the memory for storing an at least executable instruction, the executable instruction makes the processor perform right such as will Ask the corresponding operation of annotation footnote recognition methods described in any one of 1-6.
14. a kind of computer storage media, an at least executable instruction, the executable instruction are stored in the storage medium Processor is made to perform the corresponding operation of annotation footnote recognition methods as described in any one of claim 1-6.
CN201611108286.0A 2016-12-06 2016-12-06 Annotate footnote recognition methods, device and electronic equipment Active CN106708793B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201611108286.0A CN106708793B (en) 2016-12-06 2016-12-06 Annotate footnote recognition methods, device and electronic equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201611108286.0A CN106708793B (en) 2016-12-06 2016-12-06 Annotate footnote recognition methods, device and electronic equipment

Publications (2)

Publication Number Publication Date
CN106708793A CN106708793A (en) 2017-05-24
CN106708793B true CN106708793B (en) 2018-06-08

Family

ID=58935932

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201611108286.0A Active CN106708793B (en) 2016-12-06 2016-12-06 Annotate footnote recognition methods, device and electronic equipment

Country Status (1)

Country Link
CN (1) CN106708793B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110825376B (en) * 2018-08-07 2024-03-12 深圳Tcl数字技术有限公司 Method, storage medium and device for analyzing annotated JSON file
CN110399801A (en) * 2019-06-26 2019-11-01 南京智录信息科技有限公司 Number note identification technology is arranged at the table bottom in file and picture

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3452774B2 (en) * 1997-10-16 2003-09-29 富士通株式会社 Character recognition method
KR101443404B1 (en) * 2006-09-15 2014-10-02 구글 인코포레이티드 Capture and display of annotations in paper and electronic documents
CN102982027A (en) * 2011-09-02 2013-03-20 北大方正集团有限公司 Method and device for abstracting contents in document
CN104750661B (en) * 2013-12-30 2018-09-28 腾讯科技(深圳)有限公司 A kind of method and apparatus that selected words and phrases are carried out to text
CN105913093B (en) * 2016-05-03 2019-06-21 电子科技大学 A kind of template matching method for Text region processing

Also Published As

Publication number Publication date
CN106708793A (en) 2017-05-24

Similar Documents

Publication Publication Date Title
US11093698B2 (en) Method and apparatus and computer device for automatic semantic annotation for an image
EP1672537B1 (en) Data semanticizer
KR100467638B1 (en) Method for fast searching and analyzing inter-relations between patents from a patent database
CN109657694A (en) Picture automatic classification method, device and computer readable storage medium
US9501455B2 (en) Systems and methods for processing data
CN112749547A (en) Generation of text classifier training data
CN109063055A (en) Homologous binary file search method and device
CN110321437B (en) Corpus data processing method and device, electronic equipment and medium
CN111259627A (en) Document analysis method and device, computer storage medium and equipment
CN106557463A (en) Sentiment analysis method and device
CN103530386B (en) The edit methods and browser of browsing device net page
CN106708793B (en) Annotate footnote recognition methods, device and electronic equipment
CN108182174A (en) New words extraction method, electronic equipment and computer storage media
CN107678968A (en) Sample extraction method, apparatus, computing device and the storage medium of source code function
CN110209780A (en) A kind of question template generation method, device, server and storage medium
CN106815253A (en) A kind of method for digging based on mixed data type data
CN108062422A (en) A kind of sort method of paging query, intelligent terminal, system and storage medium
CN109657043B (en) Method, device and equipment for automatically generating article and storage medium
CN116798053B (en) Icon generation method and device
CN107329756A (en) Generation method, device, storage medium, processor and the terminal of program file
CN110287460A (en) The methods of exhibiting of e-book calculates equipment and computer storage medium
CN104778202B (en) The analysis method and system of event evolutionary process based on keyword
CN111164560B (en) Techniques for dynamically defining data record formats
CN110503378A (en) A kind of BOM standardized method, system and electronic equipment and storage medium
CN116226526A (en) Intellectual property intelligent retrieval platform and method

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant