CN103678353B - Method and device for checking post information in a manuscript - Google Patents

Publication number
CN103678353B
Authority
CN
China
Prior art keywords
post
name
contribution
database
word
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201210335592.3A
Other languages
Chinese (zh)
Other versions
CN103678353A (en)
Inventor
周志扬
朱建波
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
New Founder Holdings Development Co ltd
Peking University
Beijing Founder Electronics Co Ltd
Original Assignee
Peking University
Peking University Founder Group Co Ltd
Beijing Founder Electronics Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Peking University, Peking University Founder Group Co Ltd, Beijing Founder Electronics Co Ltd
Priority to CN201210335592.3A
Publication of CN103678353A
Application granted
Publication of CN103678353B
Expired - Fee Related (current legal status)
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/205 Parsing
    • G06F40/226 Validation
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24 Querying
    • G06F16/245 Query processing

Abstract

The invention provides a method for checking post information in a manuscript, including: performing a full-text search on the manuscript using a name database to determine the names in the manuscript; searching a post database with each determined name to determine the post associated with the name; and judging, using the determined post, whether the information relating to the name in the manuscript is correct. The invention also provides a device for checking post information in a manuscript, including: a name module for performing a full-text search on the manuscript using a name database to determine the names in the manuscript; a post module for searching a post database with each determined name to determine the post associated with the name; and a judging module for judging, using the determined post, whether the information relating to the name in the manuscript is correct. The invention improves manuscript quality.

Description

Method and device for checking post information in a manuscript
Technical field
The present invention relates to the field of information processing, and in particular to a method and device for checking post information in a manuscript.
Background
Names and the posts (official positions) of the people named often appear in manuscripts. During editing, the post information attached to each name has to be checked, and when several names appear side by side they should be ordered according to the rank of their posts.
At present, posts in a manuscript are proofread manually, generally in the following steps:
(1) Print the manuscript to be proofread from the editorial system.
(2) Read the paper manuscript manually; on encountering a leader's post that seems doubtful, look up the relevant post information by hand or consult an experienced proofreader.
(3) Mark corrections to erroneous posts by hand.
(4) Enter the manual corrections into the editorial system.
Manual proofreading relies too heavily on the proofreader's knowledge and experience and is prone to slips, so that incorrect post information appears in the newspaper and harms the quality of the publication.
Summary of the invention
The present invention aims to provide a method and device for checking post information in a manuscript, to replace manual proofreading of the name information in manuscripts.
According to one aspect of the invention, a method for checking post information in a manuscript is provided, including: performing a full-text search on the manuscript using a name database to determine the names in the manuscript; searching a post database with each determined name to determine the post associated with the name; and judging, using the determined post, whether the information relating to the name in the manuscript is correct.
According to another aspect of the invention, a device for checking post information in a manuscript is provided, including: a name module for performing a full-text search on the manuscript using a name database to determine the names in the manuscript; a post module for searching a post database with each determined name to determine the post associated with the name; and a judging module for judging, using the determined post, whether the information relating to the name in the manuscript is correct.
Because the method and device of the invention use databases to check name information, they overcome the error-proneness of manual proofreading and thereby improve manuscript quality.
Brief description of the drawings
The accompanying drawings described here provide a further understanding of the invention and form part of this application. The illustrative embodiments of the invention and their description serve to explain the invention and do not unduly limit it. In the drawings:
Fig. 1 shows a flow chart of the method for checking post information in a manuscript according to an embodiment of the invention;
Fig. 2 shows a schematic diagram of the device for checking post information in a manuscript according to an embodiment of the invention.
Detailed description of the embodiments
The invention is described in detail below with reference to the accompanying drawings and in conjunction with the embodiments.
Fig. 1 shows a flow chart of the method for checking post information in a manuscript according to an embodiment of the invention, including:
Step S10: perform a full-text search on the manuscript using a name database to determine the names in the manuscript;
Step S20: search a post database with each determined name to determine the post associated with the name;
Step S30: judge, using the determined post, whether the information relating to the name in the manuscript is correct.
In the prior art, the name and post information in a manuscript is checked by manual proofreading. In this embodiment, a name database and a post database are used to analyze the name and post information in the manuscript, so the whole process is automated in software. This overcomes the error-proneness of manual proofreading and thereby improves manuscript quality.
Preferably, the checking method further includes: creating the name database in advance, the database including a plurality of records, each record including a field for recording a name. Performing the full-text search on the manuscript using the name database then includes: matching the name recorded in each record against the full text of the manuscript; and, if a word identical to the name in a record is matched in the manuscript, determining the matched word to be a name in the manuscript. The name database scheme of this embodiment is fairly simple and easy to implement. By maintaining the name database, it can also be updated dynamically. Note that a "word" here means a linguistic unit: it may be a single character that constitutes a word or several characters that constitute a word, and a character may be a punctuation mark or a written character.
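The matching step above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the name database is modeled as an in-memory list of single-field records, matching is plain exact substring search, and `NameRecord` and `find_names` are hypothetical names that do not come from the patent.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class NameRecord:
    """One record of the (hypothetical) name database: a single name field."""
    name: str


def find_names(manuscript: str, name_db: list[NameRecord]) -> list[tuple[int, str]]:
    """Return (offset, name) for every occurrence of a recorded name in the text."""
    hits = []
    for record in name_db:
        # Match the name held in this record against the full text.
        start = manuscript.find(record.name)
        while start != -1:
            hits.append((start, record.name))
            start = manuscript.find(record.name, start + 1)
    return sorted(hits)  # order the hits by their position in the manuscript


# Illustrative data only.
name_db = [NameRecord("Zhang San"), NameRecord("Li Si")]
text = "Minister Zhang San met Bureau Chief Li Si. Zhang San spoke first."
matches = find_names(text, name_db)
```

Keeping the character offset of each hit is a design choice of this sketch; it lets a later step inspect the words next to each name.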
Preferably, the checking method further includes: creating the post database in advance, the database including a plurality of records, each record including a first field for recording a name and a second field for recording a post. Searching the post database with the determined name then includes: matching the determined name against each record; if the name is matched in the first field of a record, extracting the post in the second field of that record; and determining the extracted post to be the post associated with the name. The post database scheme of this embodiment is fairly simple and easy to implement. By maintaining the post database, it can also be updated dynamically. For example, a proofreader who finds that post information in the post database is wrong can manually modify the relevant field of the post database.
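A minimal sketch of the two-field post database and its lookup follows, assuming an in-memory list of records rather than a real database engine. `PostRecord`, `lookup_post` and `update_post` are hypothetical names introduced for illustration; `update_post` stands in for the manual correction by a proofreader mentioned above.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class PostRecord:
    name: str  # first field: the person's name
    post: str  # second field: the post associated with that name


def lookup_post(name: str, post_db: list[PostRecord]) -> Optional[str]:
    """Return the post in the second field of the record whose first field matches."""
    for record in post_db:
        if record.name == name:
            return record.post
    return None  # no record for this name


def update_post(name: str, new_post: str, post_db: list[PostRecord]) -> bool:
    """Overwrite the post of an existing record, as a manual correction would."""
    for record in post_db:
        if record.name == name:
            record.post = new_post
            return True
    return False


# Illustrative data only.
post_db = [PostRecord("Zhang San", "Vice-Minister"), PostRecord("Li Si", "Bureau Chief")]
found = lookup_post("Zhang San", post_db)
```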
Preferably, step S30 includes:
extracting the word adjacent to the name in the manuscript; judging whether the adjacent word is a post;
if the adjacent word is a post, determining whether the adjacent word matches the post determined from the post database;
if it does not match, marking the adjacent word.
With a simple matching operation, this embodiment automatically judges whether the post information is accurate, greatly reducing the proofreader's workload. For example, if the text in the manuscript is "Minister Zhang San", "Minister" is compared against the post database; on finding that the second field of the "Zhang San" record in the post database is "Vice-Minister", the system can automatically mark "Minister Zhang San", for example by displaying it in red, to prompt the proofreader to judge whether the manuscript is wrong.
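The adjacent-word check can be sketched as below. The sketch assumes English-style text in which the post is the single space-delimited word immediately before the name, and it represents the post database as a plain dictionary; `check_adjacent_post` and `known_posts` are hypothetical names, and a real system would need its own way of deciding which words count as posts.

```python
from typing import Optional


def check_adjacent_post(manuscript: str, name: str,
                        post_db: dict[str, str],
                        known_posts: set[str]) -> Optional[str]:
    """Return the post written before `name` if it disagrees with the database.

    Returns None when the name is absent, when the adjacent word is not a
    post, when the name has no database record, or when the posts agree.
    Only the first occurrence of the name is examined.
    """
    position = manuscript.find(name)
    if position == -1:
        return None
    words_before = manuscript[:position].split()
    if not words_before:
        return None
    adjacent = words_before[-1]       # the word adjacent to the name
    if adjacent not in known_posts:   # is the adjacent word a post at all?
        return None
    expected = post_db.get(name)      # post determined from the post database
    if expected is None or adjacent == expected:
        return None
    return adjacent                   # mismatch: the caller should mark this word


# Illustrative data only, mirroring the "Minister Zhang San" example.
posts = {"Minister", "Vice-Minister"}
database = {"Zhang San": "Vice-Minister"}
flagged = check_adjacent_post("Yesterday Minister Zhang San visited the plant.",
                              "Zhang San", database, posts)
```

How the flagged word is then displayed (for example in red) is left to the caller.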
Preferably, the post database is created in advance and includes a plurality of records, each record including: a first field for recording a name; a second field for recording a post; and a third field for recording an index of the post, the magnitude of the index being linearly related to the rank of the post. When the post database is searched with the determined name, the index of the associated post is determined at the same time as the post associated with the name. This preferred embodiment digitizes the rank of each post in the post database, so that the rank information of posts can be checked automatically.
Preferably, step S30 includes: determining the names that stand in a parallel relation (side by side) in the manuscript; judging whether the order in which the indexes of the parallel names occur in the parallel relation conforms to the ranks of the posts associated with the names running from high to low; and, if it does not, marking the parallel names. Because the rank of each post has been digitized in the post database, sorting the indexes is enough to determine whether the names in the manuscript are ordered by rank. For example, suppose the text in the manuscript is "Bureau Chief Li Si, Minister Zhang San, Section Chief Wang Wu". If in the post database the index of Minister is 1, that of Bureau Chief is 2 and that of Section Chief is 3, the index sequence obtained from the text is "2, 1, 3", which does not follow the order "1, 2, 3". This preferred embodiment can automatically mark "Bureau Chief Li Si, Minister Zhang San, Section Chief Wang Wu" in blue, prompting the proofreader to check whether the names in this passage are ordered wrongly.
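The ordering check in the example reduces to testing whether the index sequence is non-decreasing. The sketch below assumes, as in the example, that a smaller index means a higher rank, and it treats equal indexes as acceptable, which the text does not specify. `out_of_rank_order` and the two dictionaries are hypothetical.

```python
def out_of_rank_order(parallel_names: list[str],
                      post_of: dict[str, str],
                      index_of_post: dict[str, int]) -> bool:
    """Return True if the parallel names are NOT listed from high rank to low.

    Assumes a smaller index means a higher rank; equal indexes are accepted.
    """
    indexes = [index_of_post[post_of[name]] for name in parallel_names]
    # A violation exists if any name is followed by one with a smaller index.
    return any(earlier > later for earlier, later in zip(indexes, indexes[1:]))


# Illustrative data mirroring the example in the text.
post_of = {"Li Si": "Bureau Chief", "Zhang San": "Minister", "Wang Wu": "Section Chief"}
index_of_post = {"Minister": 1, "Bureau Chief": 2, "Section Chief": 3}

needs_marking = out_of_rank_order(["Li Si", "Zhang San", "Wang Wu"],
                                  post_of, index_of_post)
```

Here the index sequence is 2, 1, 3, so `needs_marking` is true and the passage would be marked for the proofreader.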
Preferably, the parallel relation has the following pattern: post 1, post 2, ..., post m1 name 1, post 1, post 2, ..., post m2 name 2, ..., post 1, post 2, ..., post mn name n, where n is a natural number greater than or equal to 2, and m1, m2, ..., mn are all non-negative integers; that is, a post is not required. For example, the following forms are possible:
1) leader 1
2) leader 1, leader 2
3) post 1 leader 1
4) post 1, post 2 leader 1
5) post 1, post 2 leader 1, leader 2.
These cover the name orderings commonly found in manuscripts.
Preferably, determining the names that stand in a parallel relation in the manuscript includes:
A) judging the word immediately preceding the current name in the manuscript;
B) if there is no such word, or the word is not a post, determining that no parallel name precedes the current name, and ending the judgment of the parallel relation for the current name;
C) if the word is a punctuation mark or a post, repeating steps B and C on the next preceding word;
D) if the word is a name, adding that name to the parallel relation, and repeating the above steps of judging the parallel relation with that name as the current name.
The above is a simple loop and is easy to implement on a computer.
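The backward scan of steps A to D can be sketched as a single loop. The sketch assumes the manuscript has already been split into tokens (names, posts, punctuation marks and other words), and it reads step B as "stop at any word that is neither a post, a punctuation mark nor a name", which is an interpretation. `parallel_group` and `LIST_PUNCTUATION` are hypothetical, and which punctuation marks may separate parallel names is also an assumption.

```python
# Punctuation assumed to separate parallel names (an assumption of this sketch).
LIST_PUNCTUATION = {",", "、", ";"}


def parallel_group(tokens: list[str], start: int,
                   names: set[str], posts: set[str]) -> list[str]:
    """Collect the names standing side by side with tokens[start], scanning backward."""
    group = [tokens[start]]
    i = start - 1
    while i >= 0:
        word = tokens[i]                  # step A: the preceding word
        if word in names:
            group.append(word)            # step D: a parallel name was found
        elif word in LIST_PUNCTUATION or word in posts:
            pass                          # step C: skip it and keep scanning
        else:
            break                         # step B: no further parallel names
        i -= 1
    group.reverse()                       # restore manuscript order
    return group


# Illustrative data only.
tokens = ["Today", "Bureau Chief", "Li Si", ",", "Minister", "Zhang San", ",",
          "Section Chief", "Wang Wu", "attended"]
names = {"Li Si", "Zhang San", "Wang Wu"}
posts = {"Bureau Chief", "Minister", "Section Chief"}
group = parallel_group(tokens, tokens.index("Wang Wu"), names, posts)
```

Starting from the last name in a passage collects the whole group; the result can then be passed to a rank-order check.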
Fig. 2 shows a schematic diagram of the device for checking post information in a manuscript according to an embodiment of the invention, including:
a name module 10 for performing a full-text search on the manuscript using a name database to determine the names in the manuscript;
a post module 20 for searching a post database with each determined name to determine the post associated with the name;
a judging module 30 for judging, using the determined post, whether the information relating to the name in the manuscript is correct.
The device overcomes the error-proneness of manually proofreading names and posts, and thereby improves manuscript quality.
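One way to picture how the three modules cooperate is the sketch below. It is not the patented device: the class names, the dictionary-backed databases and the single-word post assumption are all illustrative, and only the first occurrence of each name is checked.

```python
from typing import Optional


class NameModule:
    """Module 10: finds database names in the manuscript."""
    def __init__(self, names: list[str]):
        self.names = names

    def find(self, manuscript: str) -> list[str]:
        return [name for name in self.names if name in manuscript]


class PostModule:
    """Module 20: looks up the post associated with a name."""
    def __init__(self, post_of: dict[str, str]):
        self.post_of = post_of

    def lookup(self, name: str) -> Optional[str]:
        return self.post_of.get(name)


class JudgeModule:
    """Module 30: compares the written post with the database post."""
    def __init__(self, known_posts: set[str]):
        self.known_posts = known_posts

    def is_correct(self, manuscript: str, name: str, post: Optional[str]) -> bool:
        before = manuscript[:manuscript.find(name)].split()
        written = before[-1] if before else None
        if post is None or written not in self.known_posts:
            return True  # nothing to compare against
        return written == post


def check_manuscript(manuscript: str, name_module: NameModule,
                     post_module: PostModule, judge_module: JudgeModule) -> list[str]:
    """Return the names whose written post disagrees with the post database."""
    return [name for name in name_module.find(manuscript)
            if not judge_module.is_correct(manuscript, name, post_module.lookup(name))]


# Illustrative data only.
flagged_names = check_manuscript(
    "Minister Zhang San and Director Li Si attended.",
    NameModule(["Zhang San", "Li Si"]),
    PostModule({"Zhang San": "Vice-Minister", "Li Si": "Director"}),
    JudgeModule({"Minister", "Vice-Minister", "Director"}),
)
```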
Preferably, the judging module 30 includes: an extraction module for extracting the word adjacent to the name in the manuscript; a post judging module for judging whether the adjacent word is a post; a matching module for determining, if the adjacent word is a post, whether the adjacent word matches the post determined from the post database; and a marking module for marking the adjacent word if it does not match.
In this embodiment, a name database and a post database are used to analyze the name and post information in the manuscript, so the whole process is automated in software. This overcomes the error-proneness of manual proofreading and thereby improves manuscript quality.
Preferably, the post database is created in advance and includes a plurality of records, each record including: a first field for recording a name; a second field for recording a post; and a third field for recording an index of the post, the magnitude of the index being linearly related to the rank of the post. The post module, while determining the post associated with the name, also determines the index of the associated post. The judging module 30 includes: a parallel-relation module for determining the names that stand in a parallel relation in the manuscript; an ordering module for judging whether the order in which the indexes of the parallel names occur in the parallel relation conforms to the ranks of the posts associated with the names running from high to low; and a marking module for marking the parallel names if it does not.
Leaders are often mentioned in manuscripts of all kinds, such as those for websites and publications, and a leader's name is usually preceded by the leader's post. An incorrect post or a confused ordering of leaders seriously harms the quality of a manuscript. From the description above it can be seen that the invention proofreads the name and post information of a manuscript automatically, thereby improving manuscript quality.
Obviously, those skilled in the art should understand that the modules or steps of the invention described above can be implemented with general-purpose computing devices. They can be concentrated on a single computing device or distributed over a network formed by multiple computing devices. Alternatively, they can be implemented as program code executable by a computing device, so that they can be stored in a storage device and executed by a computing device; or they can each be made into a separate integrated circuit module, or several of the modules or steps can be made into a single integrated circuit module. The invention is thus not restricted to any specific combination of hardware and software.
The above are only preferred embodiments of the invention and are not intended to limit it. For those skilled in the art, the invention may have various modifications and variations. Any modification, equivalent substitution, improvement and the like made within the spirit and principles of the invention shall fall within its scope of protection.

Claims (10)

1. A method for checking post information in a manuscript, characterized by comprising:
performing a full-text search on the manuscript using a name database to determine the names in the manuscript;
searching a post database with the determined name to determine the post associated with the name and the index of the associated post, wherein the magnitude of the index is linearly related to the rank of the post;
judging, using the determined post, whether the information relating to the name in the manuscript is correct, wherein this step comprises: determining the names that stand in a parallel relation in the manuscript, wherein the parallel relation has the following pattern: post 1, post 2, ..., post m1 name 1, post 1, post 2, ..., post m2 name 2, ..., post 1, post 2, ..., post mn name n, where n is a natural number greater than or equal to 2, and m1, m2, ..., mn are all non-negative integers, a post not being required; and judging whether the order of the indexes of the parallel names within the parallel relation conforms to the ranks of the posts associated with the names running from high to low.
2. The method according to claim 1, characterized by further comprising: creating the name database in advance, the database comprising a plurality of records, each record comprising a field for recording a name; wherein performing the full-text search on the manuscript using the name database comprises:
matching the name recorded in each record against the full text of the manuscript;
if a word identical to the name recorded in the record is matched in the manuscript, determining the matched word to be a name in the manuscript.
3. The method according to claim 1, characterized by further comprising: creating the post database in advance, the database comprising a plurality of records, each record comprising a first field for recording a name and a second field for recording a post; wherein searching the post database with the determined name comprises:
matching the determined name against each record;
if the name is matched in the first field of a record, extracting the post in the second field of that record;
determining the extracted post to be the post associated with the name.
4. The method according to claim 1, characterized in that judging, using the determined post, whether the information relating to the name in the manuscript is correct comprises:
extracting the word adjacent to the name in the manuscript;
judging whether the adjacent word is a post;
if the adjacent word is a post, determining whether the adjacent word matches the post determined from the post database;
if it does not match, marking the adjacent word.
5. The method according to claim 1, characterized in that the post database is created in advance and comprises a plurality of records, each record comprising: a first field for recording a name; a second field for recording a post; and a third field for recording the index of the post, the magnitude of the index being linearly related to the rank of the post; wherein, when the post database is searched with the determined name, the index of the associated post is determined at the same time as the post associated with the name.
6. The method according to claim 5, characterized in that judging, using the determined post, whether the information relating to the name in the manuscript is correct comprises:
if the order of the indexes of the parallel names within the parallel relation does not conform to the ranks of the posts associated with the names running from high to low, marking the parallel names.
7. The method according to claim 6, characterized in that determining the names that stand in a parallel relation in the manuscript comprises:
A) judging the word immediately preceding the current name in the manuscript;
B) if there is no such word, or the word is not a post, determining that no parallel name precedes the current name, and ending the judgment of the parallel relation for the current name;
C) if the word is a punctuation mark or a post, repeating steps B and C on the next preceding word;
D) if the word is a name, adding that name to the parallel relation, and repeating the above steps of judging the parallel relation with that name as the current name.
8. A device for checking post information in a manuscript, characterized by comprising:
a name module for performing a full-text search on the manuscript using a name database to determine the names in the manuscript;
a post module for searching a post database with the determined name to determine the post associated with the name and the index of the associated post, wherein the magnitude of the index is linearly related to the rank of the post;
a judging module for judging, using the determined post, whether the information relating to the name in the manuscript is correct, wherein the judging module comprises: a parallel-relation module for determining the names that stand in a parallel relation in the manuscript, wherein the parallel relation has the following pattern: post 1, post 2, ..., post m1 name 1, post 1, post 2, ..., post m2 name 2, ..., post 1, post 2, ..., post mn name n, where n is a natural number greater than or equal to 2, and m1, m2, ..., mn are all non-negative integers, a post not being required; and an ordering module for judging whether the order of the indexes of the parallel names within the parallel relation conforms to the ranks of the posts associated with the names running from high to low.
9. The device according to claim 8, characterized in that the judging module comprises: an extraction module for extracting the word adjacent to the name in the manuscript;
a post judging module for judging whether the adjacent word is a post;
a matching module for determining, if the adjacent word is a post, whether the adjacent word matches the post determined from the post database;
a marking module for marking the adjacent word if it does not match.
10. The device according to claim 8, characterized in that the post database is created in advance and comprises a plurality of records, each record comprising: a first field for recording a name; a second field for recording a post; and a third field for recording the index of the post, the magnitude of the index being linearly related to the rank of the post; wherein the post module, while determining the post associated with the name, also determines the index of the associated post, and the judging module comprises:
a marking module for marking the parallel names if the order of the indexes of the parallel names within the parallel relation does not conform to the ranks of the posts associated with the names running from high to low.
CN201210335592.3A 2012-09-11 2012-09-11 Method and device for checking post information in a manuscript Expired - Fee Related CN103678353B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210335592.3A CN103678353B (en) 2012-09-11 2012-09-11 Method and device for checking post information in a manuscript

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210335592.3A CN103678353B (en) 2012-09-11 2012-09-11 Method and device for checking post information in a manuscript

Publications (2)

Publication Number Publication Date
CN103678353A CN103678353A (en) 2014-03-26
CN103678353B true CN103678353B (en) 2017-06-20

Family

ID=50315946

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210335592.3A Expired - Fee Related CN103678353B (en) 2012-09-11 2012-09-11 Method and device for checking post information in a manuscript

Country Status (1)

Country Link
CN (1) CN103678353B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104023124B (en) * 2014-05-14 2017-09-26 上海卓悠网络科技有限公司 Automatic identification and the method and device for extracting name in short message
CN108197110B (en) * 2018-01-03 2021-07-27 北京方寸开元科技发展有限公司 Method, device and storage medium for acquiring and correcting names and jobs

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2007328533A (en) * 2006-06-07 2007-12-20 Toshiba Corp Data processor and data processing program
CN101547326A (en) * 2008-03-27 2009-09-30 株式会社东芝 Device and method for notifying content scene appearance
US7792837B1 (en) * 2007-11-14 2010-09-07 Google Inc. Entity name recognition
CN102043763A (en) * 2009-10-23 2011-05-04 北大方正集团有限公司 Method and device for automatically checking names
CN102567374A (en) * 2010-12-16 2012-07-11 北大方正集团有限公司 Manuscript proofreading method and system
CN102656554A (en) * 2009-09-16 2012-09-05 起元技术有限责任公司 Mapping dataset elements

Also Published As

Publication number Publication date
CN103678353A (en) 2014-03-26

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20220621

Address after: No. 5 Yiheyuan Road, Haidian District, Beijing 100871

Patentee after: Peking University

Patentee after: New founder holdings development Co.,Ltd.

Patentee after: BEIJING FOUNDER ELECTRONICS Co.,Ltd.

Address before: No. 5 Yiheyuan Road, Haidian District, Beijing 100871

Patentee before: Peking University

Patentee before: PEKING UNIVERSITY FOUNDER GROUP Co.,Ltd.

Patentee before: BEIJING FOUNDER ELECTRONICS Co.,Ltd.

TR01 Transfer of patent right
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20170620

CF01 Termination of patent right due to non-payment of annual fee