CN107870991A - A kind of similarity calculating method and computer-readable recording medium of paper metadata - Google Patents

A kind of similarity calculating method and computer-readable recording medium of paper metadata Download PDF

Info

Publication number
CN107870991A
CN107870991A CN201711022946.8A CN201711022946A CN107870991A CN 107870991 A CN107870991 A CN 107870991A CN 201711022946 A CN201711022946 A CN 201711022946A CN 107870991 A CN107870991 A CN 107870991A
Authority
CN
China
Prior art keywords
metadata
similarity
character string
feature
paper
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201711022946.8A
Other languages
Chinese (zh)
Inventor
龙开亮
王志
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hunan Latitude Mdt Infotech Ltd
Original Assignee
Hunan Latitude Mdt Infotech Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hunan Latitude Mdt Infotech Ltd filed Critical Hunan Latitude Mdt Infotech Ltd
Priority to CN201711022946.8A priority Critical patent/CN107870991A/en
Publication of CN107870991A publication Critical patent/CN107870991A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/38Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Library & Information Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

A kind of similarity calculating method and computer-readable recording medium of paper metadata, are related to microcomputer data processing field, this method comprises the following steps:To the two paper metadata that need to compare, at least two features in first paper metadata are extracted, extract corresponding feature in second paper metadata;Determine the similarity of each individual features being extracted of the two paper metadata;The weight being endowed according to each feature, the similarity of each feature is weighted, draw the similarity of the two paper metadata, the computer-readable recording medium, it is stored with computer program, the step of program realizes above method when being performed, the present invention can improve the accuracy of paper metadata redundancy detection.

Description

A kind of similarity calculating method and computer-readable recording medium of paper metadata
Technical field
The present invention relates to microcomputer data processing field, more particularly to a kind of Similarity Measure of paper metadata Method and computer-readable recording medium.
Background technology
Currently, scientific achievement turns into the evaluation mark that the units such as domestic each colleges and universities and scientific research institutions weigh its basic research strength Standard, the evaluations of professional titles of numerous colleges and universities, performance appraisal, department's bonus etc. links directly with science research output and influence power, while is also School and relevant departments carry out the very important decisions such as department's restructuring, development adjustment and planning and provide objective fact foundation.Paper is made For the important carrier of scientific achievement, critical role is occupied in the statistics of scientific achievement.With the development and scientific research of information technology The globalization of propagation, various Digest Databases provide the retrieval service of paper metadata in different levels different range, from difference It is the important means assessed mechanism Publications that Digest Database, which collects paper metadata and carries out statistical analysis, but due to The different degrees of overlapping paper metadata for be gathered together be present and certain redundancy be present in the data source of each database, This have impact on the accuracy of statistical analysis to a certain extent, therefore the metadata to being converged from disparate databases carries out redundancy inspection Survey and remove duplicate data be carry out follow-up data processing basis, wherein how to judge metadata it is whether identical be redundancy detection Key.
All the time, in the industry for the non-structural data of network sentence weight problem research it is more, also layer goes out various algorithm achievements It is not poor, and have utilization in current all kinds of search engines.Judge whether paper metadata is identical mainly by more first at present Whether the relevant field value in data is consistent, such as doi (Digital Object Unique Identifier (Digital Object Unique Whether Identifier-DOI)) identical, whether author's title mechanism is identical, and whether publisher, year, phase, the page number are consistent etc. Deng.But metadata is as the structural data with semanteme, and it sentences weight standard and the requirement of the degree of accuracy is all more accurate.It is therefore existing Sentence double recipe case for unstructured data, the requirement that metadata sentences weight can not be fully met.In addition, it is commonly used to data The accurate of storehouse sentences double recipe case and can not be adapted in the environment of this partial data mistake that may be present of metadata itself.
The content of the invention
A kind of similarity of paper metadata is provided it is an object of the invention to avoid weak point of the prior art Computational methods and computer-readable recording medium, the similarity calculating method and computer-readable recording medium of the paper metadata The accuracy of paper metadata redundancy detection can be improved.
The purpose of the present invention is achieved through the following technical solutions:
A kind of similarity calculating method of paper metadata is provided, this method comprises the following steps:
To the two paper metadata that need to compare, at least two features in first paper metadata are extracted, extraction the Corresponding feature in two paper metadata;
Determine the similarity of each individual features being extracted of the two paper metadata;
The weight being endowed according to each feature, the similarity of each feature is weighted, draws the two opinions The similarity of literary metadata.
Wherein, data cleansing and regular is carried out to the relevant field of metadata before feature is extracted.
Wherein, the characteristic type is included in author, summary, title, doi, publication issue, publication reel number and the page number extremely It is few two.
Wherein, among the feature of extraction:
Author:The author field in metadata is extracted, character string is split with decollator to obtain list of authors, author is arranged Table and field character string save as traits of author value;
Summary:The abstract fields in metadata are extracted, the word list made a summary are segmented to character string, by word list Summary characteristic value is saved as with field character string;
Title:The header field in metadata is extracted, character string is segmented to obtain the word list of title, by word list Title feature value is saved as with field character string;
doi:The doi fields in metadata are extracted, field character string saves as doi characteristic values;
Publish issue:The publication issue field in metadata is extracted, it is the publication phase that field character string, which is converted to digital halftoning, Number characteristic value;
Publish reel number:The publication reel number field in metadata is extracted, field character string is converted to digital halftoning and rolled up to publish Number characteristic value;
The page number:The page number field in metadata is extracted, numeral therein is gone out to text string extracting, according to the size point of numerical value Do not matched with start page, sign-off sheet, using field character string and start page, sign-off sheet as page number characteristic value.
Wherein, the similarity calculation module of the feature performs following method:
Author:1 is set to if field character string is identical, otherwise counts same position value identical member prime number in list of authors Amount is designated as N, and the maximum length of two list of authors of note is M, then Similarity value is set to N/M,
Summary:1 is set to if field character string is identical, otherwise Similarity value is set to the cosine distances between word list,
Title:1 is set to if field character string is identical, otherwise Similarity value is set to the cosine distances between word list,
doi:1 is set to if field character string is identical, is otherwise set to 0,
Publish issue:1 is set to if field character string or publication issue value are identical, is otherwise set to 0,
Publish reel number:1 is set to if field character string or publication reel number value are identical, is otherwise set to 0,
The page number:1 is set to if field character string or start page, sign-off sheet are identical, is otherwise set to 0.
Wherein, for all features, the similarity of this feature is put if some characteristic value between metadata pair is sky For 0.5, while weight corresponding to this feature is set to 1.
Wherein, the calculation of the cosine distances between the word list is:Word in two word lists is subjected to rope Invite index the mapping table map, map (term) of word=N to represent that word term index be N, note m for map size, for the The newly-built size of one word list is m array vec1 and all elements is initially into 0, travels through word list, its index is map to word t (t) vec1 [map (t)]=vec1 [map (t)]+1 is then put, traversal terminates to obtain the term vector array vec1 of the first word list, right Obtaining term vector array vec2, cosine distance in the second word list progress same operation is
Wherein, characteristic similarity is weighted to obtain the similarity of metadata, calculation is:Assuming that it have selected Characteristic type F1,F2,...,Fn, wherein n is the quantity of selection feature, and corresponding weight is arranged to W1,W2,...,Wn, for The similarity that metadata calculates all features is S1,S2,...,Sn, then the similarity between metadata be
A kind of computer-readable recording medium, is stored with computer program, and the program realizes side described above when being performed The step of method.
Wherein, imparting of the user to the weight of each feature is realized when described program is performed.
Beneficial effects of the present invention:
A kind of similarity calculating method of paper metadata of the present invention, two metadata are passed sequentially through with initialization, spy Sign extraction and Similarity Measure obtain the similarity of described two metadata, i.e., first carry out data cleansing to metadata relevant field With it is regular, can remove unnecessary impurity and noise, ensure the accuracy of data, by the feature class for being configured to Similarity Measure Type, by combining different characteristic type, it can be advantageous to lift redundancy inspection from different dimensions, fineness ratio compared with the difference of metadata The recall rate of survey, by quantifying the similarity of metadata different characteristic, the difference of subtly descriptive metadata can be compared, favorably In the error rate for reducing redundancy detection, by the way that to weight corresponding to each feature configuration, both having considered keynote message, (big weight is special Sign) leading position in redundancy detection, it is also considered that influence of the auxiliary information (small weight feature) to similarity, therefore carrying Can be by lower error rate while the recall rate of liter redundancy detection.
The present invention a kind of computer-readable recording medium, be stored with computer program, when the program is performed realize with The step of upper methods described, the accuracy of paper metadata redundancy detection can be improved.
Brief description of the drawings
Invention is described further using accompanying drawing, but the embodiment in accompanying drawing does not form any limitation of the invention, For one of ordinary skill in the art, on the premise of not paying creative work, it can also be obtained according to the following drawings Its accompanying drawing.
Fig. 1 is a kind of flow chart of the similarity calculating method of paper metadata of the present invention.
Embodiment
The invention will be further described with the following Examples.
A kind of similarity calculating method of paper metadata of the present embodiment, it is simple schematic diagram to see Fig. 1, Fig. 1.This reality Example is applied to illustrate with following two metadata:
Metadata 1:{"pageRange":"25-26","keywords":" electric EMU;Structure design;Drivers' cab;Outside Type;Head;Advanced international standard;Train running speed;Railway Design;China Star;Design level;Railway technology;Air hinders Power;Mechanics Phenomenon;Locomotive ", " description ":" in recent years, China railways design, produced " pioneer " number, " blue arrow ", The electric EMU more advanced than China conventional locomotive such as " star in Central Plains ", " China Star ".The design of these novel electric vehicle groups Level has promoted the development of China railways entirety cause, electric EMU also has become me significantly close to advanced international standard The direction of state's railway technology development.The speed of service of these electric EMUs is substantially accelerated, and train running speed is higher, and it is by air The influence " of resistance, " docTitle ":" a kind of novel electric vehicle group head external form and the cab structure design ", " number ":" 06","org":" Central South University's Electrical and Mechanical Engineering College ", " author ":" Chen Nanyi;Zhang Gongming ", " volume ":""," issn":"1672-0954","journalTitle":" invention and innovation (general news column) ", " doi ":""}.
Metadata 2:{"pageRange":"25-26","keywords":" electric EMU;Structure design;Head configuration; Drivers' cab;" pioneer " number;Railway Design;China Star;Railway technology;Advanced international standard;The speed of service ", " description":"","docTitle":" a kind of novel electric vehicle group head configuration and the cab structure design ", " number":"06","org":" Central South University's Electrical and Mechanical Engineering College ", " author ":" Chen Nanyi;Zhang Gongming ", " colleges":[177],"volume":"","issn":"1672-0954","journalTitle":" invent and innovate (in Generation in class hour) ", " doi ":""}.
Feature doi, author, title, summary, publication issue, publication reel number, the page number are selected, corresponding weight distinguishes assignment For 20,3,4,5,2,2,4, the Similarity Measure of each feature of metadata is as follows:
doi:Doi fields are sky, therefore similarity is directly set to 0.5, and weight is directly set to 1, and default weight 20 is cancelled;
Author:Author field character strings are identical, similarity 1, weight 3;
Title:DocTitle field character strings differ, then similarity is the cosine distances of participle list 0.8947368, weight 4;
Summary:The description fields missing of one metadata, phase knowledge and magnanimity are 0.5, weight 1;
Publish issue:Number field character strings are identical, similarity 1, weight 2;
Publish reel number:Volume fields are sky, and phase knowledge and magnanimity are 0.5, weight 1;
The page number:PageRange section character strings are identical, similarity 1, weight 4.
The similarity of metadata is
A kind of computer-readable recording medium of the present embodiment, is stored with computer program, and the program is realized when being performed The step of above method.
Imparting of the user to the weight of each feature is realized when described program is performed.
The similarity calculating method and computer-readable recording medium of a kind of paper metadata of the present embodiment, to two members Data pass sequentially through initialization, feature extraction and Similarity Measure and obtain the similarity of described two metadata, i.e., first to first number Data cleansing and regular is carried out according to relevant field, unnecessary impurity and noise is can remove, ensures the accuracy of data, pass through and configure , can be from different dimensions, fineness ratio compared with metadata by combining different characteristic type for the characteristic type of Similarity Measure Difference, be advantageous to be lifted the recall rate of redundancy detection, by quantifying the similarity of metadata different characteristic, can compare subtly The difference of descriptive metadata, the error rate of redundancy detection is advantageously reduced, by weight corresponding to each feature configuration, both examining Leading position of the keynote message (big weight feature) in redundancy detection is considered, it is also considered that auxiliary information (small weight feature) is right The influence of similarity, therefore can be by lower error rate while the recall rate of redundancy detection is lifted.
Finally it should be noted that the above embodiments are merely illustrative of the technical solutions of the present invention, rather than the present invention is protected The limitation of scope is protected, although being explained with reference to preferred embodiment to the present invention, one of ordinary skill in the art should Work as understanding, technical scheme can be modified or equivalent substitution, without departing from the reality of technical solution of the present invention Matter and scope.

Claims (10)

  1. A kind of 1. computational methods of paper metadata similarity, it is characterised in that:Comprise the following steps:
    To the two paper metadata that need to compare, at least two features in first paper metadata are extracted, extract second Corresponding feature in paper metadata;
    Determine the similarity of each individual features being extracted of the two paper metadata;
    The weight being endowed according to each feature, the similarity of each feature is weighted, draws the two papers member The similarity of data.
  2. A kind of 2. similarity calculating method of paper metadata as claimed in claim 1, it is characterised in that:Extraction feature it The preceding relevant field to metadata carries out data cleansing and regular.
  3. A kind of 3. similarity calculating method of paper metadata as claimed in claim 1, it is characterised in that:The characteristic type Including at least two in author, summary, title, doi, publication issue, publication reel number and the page number.
  4. A kind of 4. similarity calculating method of paper metadata as claimed in claim 3, it is characterised in that:The feature of extraction is worked as In:
    Author:Extract the author field in metadata, character string split with decollator to obtain list of authors, by list of authors and Field character string saves as traits of author value;
    Summary:The abstract fields in metadata are extracted, the word list made a summary are segmented to character string, by word list and word Section character string saves as summary characteristic value;
    Title:The header field in metadata is extracted, character string is segmented to obtain the word list of title, by word list and word Section character string saves as title feature value;
    doi:The doi fields in metadata are extracted, field character string saves as doi characteristic values;
    Publish issue:The publication issue field in metadata is extracted, it is special to publish issue that field character string is converted to digital halftoning Value indicative;
    Publish reel number:The publication reel number field in metadata is extracted, it is special to publish reel number that field character string is converted to digital halftoning Value indicative;
    The page number:Extract metadata in page number field, numeral therein is gone out to text string extracting, according to numerical value size respectively with Start page, sign-off sheet are matched, using field character string and start page, sign-off sheet as page number characteristic value.
  5. A kind of 5. similarity calculating method of paper metadata as claimed in claim 4, it is characterised in that:The phase of the feature It is like degree computational methods:
    Author:1 is set to if field character string is identical, same position value identical number of elements in list of authors is otherwise counted and remembers For N, the maximum length of two list of authors of note is M, then Similarity value is set to N/M,
    Summary:1 is set to if field character string is identical, otherwise Similarity value is set to the cosine distances between word list,
    Title:1 is set to if field character string is identical, otherwise Similarity value is set to the cosine distances between word list,
    doi:1 is set to if field character string is identical, is otherwise set to 0,
    Publish issue:1 is set to if field character string or publication issue value are identical, is otherwise set to 0,
    Publish reel number:1 is set to if field character string or publication reel number value are identical, is otherwise set to 0,
    The page number:1 is set to if field character string or start page, sign-off sheet are identical, is otherwise set to 0.
  6. A kind of 6. similarity calculating method of paper metadata as claimed in claim 5, it is characterised in that:For all spies Sign, the similarity of this feature is set to 0.5 if some characteristic value between metadata pair is sky, while by corresponding to this feature Weight is set to 1.
  7. A kind of 7. similarity calculating method of paper metadata as claimed in claim 6, it is characterised in that:The word list it Between the calculations of cosine distances be:Word in two word lists is indexed to obtain the index mapping table map of word, Map (term)=N represents that word term index is N, the size that note m is map, for the number that the newly-built size of the first word list is m All elements are simultaneously initially 0 by group vec1, travel through word list, to its index of word t for map (t) then put vec1 [map (t)]= Vec1 [map (t)]+1, traversal terminate to obtain the term vector array vec1 of the first word list, are carried out for the second word list identical Operation obtains term vector array vec2, cosine distance and is
  8. A kind of 8. similarity calculating method of paper metadata as claimed in claim 1, it is characterised in that:To characteristic similarity It is weighted to obtain the similarity of metadata, calculation is:Assuming that it have selected characteristic type F1,F2,...,Fn, wherein n To select the quantity of feature, corresponding weight is arranged to W1,W2,...,Wn, for the similarity of all features of metadata calculating For S1,S2,...,Sn, then the similarity between metadata be
  9. 9. a kind of computer-readable recording medium, is stored with computer program, it is characterised in that:Power is realized when the program is performed Profit requires the step of 1 to 8 methods described.
  10. A kind of 10. computer-readable recording medium according to claim 9, it is characterised in that:When described program is performed Realize imparting of the user to the weight of each feature.
CN201711022946.8A 2017-10-27 2017-10-27 A kind of similarity calculating method and computer-readable recording medium of paper metadata Pending CN107870991A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711022946.8A CN107870991A (en) 2017-10-27 2017-10-27 A kind of similarity calculating method and computer-readable recording medium of paper metadata

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711022946.8A CN107870991A (en) 2017-10-27 2017-10-27 A kind of similarity calculating method and computer-readable recording medium of paper metadata

Publications (1)

Publication Number Publication Date
CN107870991A true CN107870991A (en) 2018-04-03

Family

ID=61753575

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711022946.8A Pending CN107870991A (en) 2017-10-27 2017-10-27 A kind of similarity calculating method and computer-readable recording medium of paper metadata

Country Status (1)

Country Link
CN (1) CN107870991A (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109241008A (en) * 2018-08-07 2019-01-18 北京诺道认知医学科技有限公司 Document De-weight method and device
CN109255122A (en) * 2018-08-06 2019-01-22 浙江工业大学 A kind of method of pair of paper adduction relationship classification marker
CN109615001A (en) * 2018-12-05 2019-04-12 上海恺英网络科技有限公司 A kind of method and apparatus identifying similar article
CN110059180A (en) * 2019-03-13 2019-07-26 百度在线网络技术(北京)有限公司 Author identification and assessment models training method, device and storage medium
CN110362741A (en) * 2019-06-11 2019-10-22 新浪网技术(中国)有限公司 A kind of intelligent delivery method and system of Feed stream information
CN114462384A (en) * 2022-04-12 2022-05-10 北京大学 Metadata automatic generation device for digital object modeling

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101286156A (en) * 2007-05-29 2008-10-15 北大方正集团有限公司 Method for removing repeated object based on metadata
CN101369279A (en) * 2008-09-19 2009-02-18 江苏大学 Detection method for academic dissertation similarity based on computer searching system
CN105808738A (en) * 2016-03-10 2016-07-27 哈尔滨工程大学 Duplication elimination method based on search results of metasearch engine
CN106294677A (en) * 2016-08-04 2017-01-04 浙江大学 A kind of towards the name disambiguation method of China author in english literature

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101286156A (en) * 2007-05-29 2008-10-15 北大方正集团有限公司 Method for removing repeated object based on metadata
CN101369279A (en) * 2008-09-19 2009-02-18 江苏大学 Detection method for academic dissertation similarity based on computer searching system
CN105808738A (en) * 2016-03-10 2016-07-27 哈尔滨工程大学 Duplication elimination method based on search results of metasearch engine
CN106294677A (en) * 2016-08-04 2017-01-04 浙江大学 A kind of towards the name disambiguation method of China author in english literature

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
徐川: "论文相似度分析系统设计", 《中国优秀硕士学位论文全文数据库》 *

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109255122A (en) * 2018-08-06 2019-01-22 浙江工业大学 A kind of method of pair of paper adduction relationship classification marker
CN109255122B (en) * 2018-08-06 2023-07-11 浙江工业大学 Method for classifying and marking thesis citation relation
CN109241008A (en) * 2018-08-07 2019-01-18 北京诺道认知医学科技有限公司 Document De-weight method and device
CN109241008B (en) * 2018-08-07 2020-10-27 北京大学第三医院 Document de-duplication method and device
CN109615001A (en) * 2018-12-05 2019-04-12 上海恺英网络科技有限公司 A kind of method and apparatus identifying similar article
CN110059180A (en) * 2019-03-13 2019-07-26 百度在线网络技术(北京)有限公司 Author identification and assessment models training method, device and storage medium
CN110362741A (en) * 2019-06-11 2019-10-22 新浪网技术(中国)有限公司 A kind of intelligent delivery method and system of Feed stream information
CN110362741B (en) * 2019-06-11 2022-02-25 新浪网技术(中国)有限公司 Intelligent issuing method and system of Feed stream information
CN114462384A (en) * 2022-04-12 2022-05-10 北京大学 Metadata automatic generation device for digital object modeling
CN114462384B (en) * 2022-04-12 2022-07-12 北京大学 Metadata automatic generation device for digital object modeling

Similar Documents

Publication Publication Date Title
CN107870991A (en) A kind of similarity calculating method and computer-readable recording medium of paper metadata
JP5116775B2 (en) Information retrieval method and apparatus, program, and computer-readable recording medium
CN103559191B (en) Based on latent space study and Bidirectional sort study across media sort method
CN104346438B (en) Based on big data data management service system
García et al. A lexicon based sentiment analysis retrieval system for tourism domain
CN106708966A (en) Similarity calculation-based junk comment detection method
CN110781670B (en) Chinese place name semantic disambiguation method based on encyclopedic knowledge base and word vectors
Nguyen et al. A lattice-based approach for mathematical search using formal concept analysis
WO2014000518A1 (en) Public opinion information display system and method
CN103150333A (en) Opinion leader identification method in microblog media
CN103995903B (en) Cross-media search method based on isomorphic subspace mapping and optimization
US20120047175A1 (en) Short Point-Of-Interest Title Generation
CN108763496A (en) A kind of sound state data fusion client segmentation algorithm based on grid and density
CN107305545A (en) A kind of recognition methods of the network opinion leader based on text tendency analysis
CN109582783B (en) Hot topic detection method and device
CN103186509A (en) Wildcard character class template generalization method and device and general template generalization method and system
CN107463711A (en) A kind of tag match method and device of data
CN111090994A (en) Chinese-internet-forum-text-oriented event place attribution province identification method
CN111680506A (en) External key mapping method and device of database table, electronic equipment and storage medium
CN105389338B (en) A kind of analytic method of buying acceptance of the bid data
CN104133861B (en) A kind of method of the international air ticket freight rate list of intelligently parsing excel forms
CN103257961B (en) Bibliography disappear weight method, Apparatus and system
CN106844539A (en) Real-time data analysis method and system
CN103530421A (en) Micro-blog based event similarity measuring method and system
JP5963310B2 (en) Information processing apparatus, information processing method, and information processing program

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20180403