CN107357933A - A kind of label for multi-source heterogeneous science and technology information resource describes method and apparatus - Google Patents

A kind of label for multi-source heterogeneous science and technology information resource describes method and apparatus Download PDF

Info

Publication number
CN107357933A
CN107357933A CN201710658922.5A CN201710658922A CN107357933A CN 107357933 A CN107357933 A CN 107357933A CN 201710658922 A CN201710658922 A CN 201710658922A CN 107357933 A CN107357933 A CN 107357933A
Authority
CN
China
Prior art keywords
entity
attribute
science
label
information resource
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201710658922.5A
Other languages
Chinese (zh)
Other versions
CN107357933B (en
Inventor
刘应波
王�锋
文若瑾
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Yunnan Academy Of Scientific & Technical Information
Original Assignee
Yunnan Academy Of Scientific & Technical Information
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Yunnan Academy Of Scientific & Technical Information filed Critical Yunnan Academy Of Scientific & Technical Information
Priority to CN201710658922.5A priority Critical patent/CN107357933B/en
Publication of CN107357933A publication Critical patent/CN107357933A/en
Application granted granted Critical
Publication of CN107357933B publication Critical patent/CN107357933B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/284Relational databases

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present invention relates to a kind of label for multi-source heterogeneous science and technology information resource to describe method and apparatus.The present invention includes:Science and technology information resource is classified and then builds object entity storehouse;Extracting object entity attributes build attribute library;Analyze object entity and relation on attributes structure relation storehouse;Specification and symbolism entity object, attribute and incidence relation;With reference to XML markup language, the syntax rule of definition label description language, design specialized describes the Schema specifications of label and language;Description example is parsed according to the syntax rule of markup language using resolver and obtains description object;Conversion and interactive service with other data formats is provided.The present invention apparent can identify multi-source heterogeneous scientific and technological resources, support isomeric data standardization, realize the Unify legislation of polynary isomery science and technology information resource;Isomeric data fusion, data storage can be met and calculate the science and technology information resource data modeling needs handled under occasion.

Description

A kind of label for multi-source heterogeneous science and technology information resource describes method and apparatus
Technical field
Scientific and technological information file information management and computer application crossing domain.
Background technology
At present, various science and technology information resources are generated in scientific and technical innovation and technology evolution, these information Resource exists in a manner of platform and database.Science and technology information resource service platform include Experimental Base and large scientific instrument, Collaborative share platform, nature scientific and technological resources shared platform, Platform of Scientific data Sharing, achievements conversion public service platform etc.;Number Include science popularization database, innovative approach storehouse, small enterprise's service database, large scientific instrument database according to storehouse.These information There is scattered, independence and isomerism, and their source formation is various, data format is different, the mark that follows between resource It is accurate different, lack effectively unified resource description mode all the time, so as to cause scientific and technological information data cleaning, depositing Be difficult to be uniformly processed in terms of storage, analysis calculating, thus software overlapping development design be present, difficult in maintenance, availability is low etc. asks Topic.
The invention provides a kind of label based on XML to describe method and apparatus, helps solve multi-source heterogeneous science and technology letter Cease the Unify legislation problem of resource.This method can describe all kinds of isomeric datas in current science and technology information resource field, also can Can be multi-source heterogeneous data fusion enough as the data information exchange agreement under a kind of multi-source heterogeneous storage and computing environment, Large data sets provide technical foundation support into shared service application.
The content of the invention
The application problem to be solved is to provide a kind of label for multi-source heterogeneous science and technology information resource and describes method And device, Unify legislation and data modeling can be carried out to isomery science and technology information resource.
In order to solve the above problems, the present invention publishes a kind of label for multi-source heterogeneous science and technology information resource and described Method, including the following steps:Step 1, science and technology information resource is classified, structure object entity storehouse;Step 2, analyze Relation between extracting object entity attributes and entity forms attribute library and incidence relation library respectively;Step 3, standardize Entity object information, attribute and incidence relation;Step 4, with reference to XML markup language, (the follow-up narration of label description language is defined Convenient, be designated as HSTL) syntax rule, design its label and XML Schema specifications.
Preferably, step 1 includes:According to category classification method, using flattening taxonomic methods, science and technology information resource is entered Row classification.
Preferably, step 2 includes:Extract the entity attributes each classified and form attribute library;Closed with reference to " entity-attribute " System obtains entity attributes, according to the standard of entity, the attribute mark being standardized, has the attribute of standard, will be made with this For label, no longer separately define, no standard then needs customized label, and label value is used as using corresponding English word.
Preferably, step 3 includes:In entity storehouse, corresponding attribute is obtained by " entity-attribute ";According to entity Attribute, the two attributes similarity is checked with this, filter table reaches the entity of redundancy.Obtain the attribute with stronger expressive faculty and pass It is that the attribute and relation after specification will arrange title, determine the synonym and concept field of same attribute.
Preferably, step 4 includes:For every a kind of entity object design XML associated description labels;Define entity attributes Label;According to the relation between entity, the correlation tag of entity is defined;For the markup language syntax rule designed, design Schema specifications.
In order to parse and understand foregoing description language, this application discloses be used to multi-source heterogeneous science and technology described in one kind believe The markup language resolver of resource is ceased, is mainly included:Entity object automatic describing module;Isomery science and technology information resource label language Speech identification, legitimate verification module;The tag resolution module of language;Entity object semantic interpretation and object entity data format turn Change the mold block.
Preferably, entity object automatic describing module can be classified automatically according to resource to be described, for judging Classification belonging to the resource, and be described from matched label information.
Preferably, label description language normalization correction verification module includes the normative inspection mould of language identification, language legitimacy Block;Science and technology information resource for being described to needs carries out legitimate verification, judges that HSTL is described according to HSTL syntax rule Whether example meets the requirements;Whether current version, which can be resolved engine, parses, compatibility between version etc..
Preferably, linguistic labelses parsing module includes the parsing to HSTL label, attribute and relationship description rule, is used for The tree structure of HSTL descriptions is parsed, obtains each label, description value corresponding to attribute.
Preferably, entity object semantic interpretation and object entity data format conversion module include the Schema according to HSTL Specification understands specific meaning tag in description example;Respective labels processing method, the object after Dynamic Maintenance parsing are provided;There is provided The data interchange format of current main-stream is supported, and the object information being stored in after parsing in internal memory is wanted according to specific as needed Derivation goes out.
The beneficial effects of the invention are as follows:Unify legislation can be carried out to multi-source heterogeneous science and technology information resource, solve separate sources, The Unify legislation problem of different structure data, therefore storage model can be optimized, data modeling is supported, reduces setting for processing routine Count complexity;Alternatively, it is also possible to which in this, as the data exchange agreement between each science and technology information resource management application, having can The advantages that autgmentability, readability, normalization and usability.
Brief description of the drawings
In order to illustrate the embodiments of the present invention more clearly or prior art, embodiment or prior art will be retouched below Accompanying drawing required in stating is briefly described, it should be apparent that, drawings in the following description are only some implementations of the present invention Example, for those of ordinary skill in the art, under the premise of creative labor is not paid, can also be obtained according to these accompanying drawings Take other accompanying drawings.
Fig. 1 is the overall procedure that a kind of label for multi-source heterogeneous science and technology information resource of the invention describes method and apparatus Figure.
Fig. 2 is the Schema specifications of the present invention.
Fig. 3 is the classification entity in the present invention.
Fig. 4 is the entity attribute in the present invention.
Fig. 5 is the scope classification chart of entity and attribute in the present invention.
Fig. 6 is the tag resolution method flow in the present invention.
Fig. 7 is a specific descriptions example of HSTL in the present invention.
Embodiment
It is below in conjunction with the accompanying drawings and specific real to enable the above-mentioned purpose of the application, feature and advantage more obvious, understandable Mode is applied to be described in further details the application.
Embodiment 1:Fig. 1 give a kind of markup language design for multi-source heterogeneous science and technology information resource of the present invention with Analytic method overview flow chart, the process of markup language design include following several nucleus modules, such as F1-1, F1-2 in Fig. 1, F1-3 and F1-4;Wherein, F1-1, for building entity, attribute and relation storehouse;F1-2, proofreaded using existing metadata standard Entity, attribute and the relation of extraction, are mainly used in standardization processing;F1-3, specific HSTL label design process;F1-4, it is HSTL Schema, for constraining HSTL.Flow relation between this four modules has succession, includes following several cores Step:
Step 1:Entity structure is described.With reference to Scientific Documents Classification method, current main science and technology information resource is believed science and technology Breath resource is classified, and in addition to the sorting technique including standard, in order to the descriptive power of extended description language, is extended Classification entity, the HSTL entities after extension and F5-1 in the entity relationship in Scientific Documents Classification such as Fig. 5;Entity after extension, As shown in figure 3, including:Data set, periodical, article, figure, chart, report, meeting, books, blog, webpage, patent, calculating Machine program, dictionary, paper, electronics article, formula, film etc..
Explanation is needed exist for, identifies, the partially flat sorting technique of use, avoids as far as possible tree-like for the ease of XML tag Hierarchical classification, so it is easy to the expression descriptive semantics directly perceived of HSTL labels.For example, information resources " article " classification, is needed further Be subdivided into " journal of writings " and " meeting article ", corresponding subsequent descriptions label be designed as " JournalArticle " and " ConferenceArticle ", complexity is reduced on describing mode.How effectively sharp it take into account in addition on attributive classification With existing entity metadata standard, for example, " data set " in the metadata of Dublin is " Dataset ", HSTL is then made with this Describe, be designed as label<Dataset>, " title " of data set then corresponds to " Title ", former according to design in HSTL Then, corresponding label is referred to as with the name.And for entity not in the metadata, then need it is self-defined, such as:Chinese Science There is a lot " subtitles " in information resources, this is no in metadata specification, so for these attributes, it is necessary to from Definition, uses its English word:" Subhead " is used as attribute tags;
Step 2:Entity about subtracts.Type characteristic according to science and technology information resource itself is classified to science and technology information resource, is obtained Classification entity storehouseObj 1 , Obj 1 , Obj 1 ,…, Obj n , it is designated as:Γ, traversal check that each classification entity checks whether it deposits In corresponding standard, if in the presence of being marked, Ψ is designated as;
Step 3:The entity of standard is obtained by entity storehouse, entity attributes is obtained with reference to " entity-attribute " relation, is designated as: Θ, further according to corresponding standard, travel through corresponding attribute and check whether meta data match corresponding with substantive standard, if unanimously, entering Line flag, it is designated as:Do not have respective standard and specification in Ω, entity and attribute library is designated as Γ-Ψ and Θ-Ω respectively;
Step 4:Entity attribute build, on the basis of step 1, extract each classify entity i attributeAttr i1 , Attr i2 , Attr i3 ,…, Attr im ,, establish " entity-attribute ", form attribute library and relation storehouse;
Step 5:, it is necessary to object beyond the Ψ that standardizes, i.e. Γ-Ψ in entity storehouse, check other Object Semantemes whether with Ψ In it is synonymous or close, if in the presence of obtaining corresponding attribute by " entity-attribute ";
Step 6:On the basis of step 1, two entity attributes similitudes are calculated using vector space cosine similarity:If similarity is more than 70%, then the entity will be given up, because the content of its statement is deposited Repeating, the entity with compared with high rule complexity is obtained after the completion of this step, is designated as:Γ
Step 7:Entity symbolism, travel through ΓMiddle attribute, to conventional attribute according to respective meta-data standard to same class object Different attribute merges, and the attribute and relation that are of little use are given up;Attribute and relation after specification carry out title agreement, it is determined that The synonym of same attribute;HSTL entity description mark will be used as the unique title of each type definition, and using the title Label, Verbose Listing are illustrated in fig. 3 shown below, and HSTL entities storehouse can be built after the completion of the step;
Step 8:For its bookmark name of each attribute definition, title also using as the attribute description label under HSTL entities, HSTL entity attributes after equally being extended with entity are illustrated in fig. 5 shown below, and will form HSTL entity attributes storehouse;
Step 9:Relation is built between entity, on the basis of step 1, defines the relation between entity, has one between entity Fixed relation, define 3 kinds of description relations:Similar, dependence, adduction relationship, for example, existing between electronic journal and periodical similar Relation, dependence between equipment and instrument and unit be present, adduction relationship etc. between document be present;
Step 10:Entity attribute relation is built, on the basis of step 2, the relation between defined attribute, and main definitions at present 1 kind of description relation:It is similar.For example, there is similarity relation in author, chief editor, director;
Step 11:Using entity storehouse and attribute library as the description foundation of each entity, the two storehouses are set as entity integrity The restriction range of description;
Step 12:According to the specification of XML language, use<HSTL>Root node of the label as whole description language, under root node Increase by 4 label nodes, be respectively<STInfoRes>:For description information resource;<Name>:For currently describing example life Name;<Version>:For controlling version;<Description>:For being illustrated to current description example;< ModifiedData>:The modification time of description language;<CreatedDate>:The creation time of description language;< STInfoRes >4 child nodes of lower structure, it is respectively<Resources>:Resource type;<Organization>:Resource institute Belong to mechanism;<Services>:The service that Current resource can be provided;<Security>:The security information of Current resource.Wherein< Resources >Under<Resource>Type be exactly HSTL entity, be defined as<Resource>, it has 4 property values, It is id respectively:The identification number of Current resource;name:Corresponding to the title of entity;Order is used to sort;ref:Closed for indicating System.<Resource>Under child node<Fields>It is a set for including multiple attributes, each attribute has been given a definition name, Order, id and ref attribute tags, description implication is such as<Resource>;For example, Fig. 4 shows Dataset attribute word Section, dotted line frame is Optional Field;
It should be noted that entity tag here<Resource>And attribute tags<Field>Pass through science and technology information resource In classification structure and statistics of attributes analysis draw that there is statistical nature.Especially for attribute field, except basic several category Property meet outside metadata standard, some other attached attributes are obtained by statistical method, in this embodiment if some category Property occur more than 90% in given sample, will just be used as label value, due to the specified occurrence of the embodiment, therefore can be with Obtain good autgmentability;
Fig. 7 gives HSTL one example of description, and the example has 2 different types of resource informations, and one is " Article ", another is " Book ", and this resource can pass through " Yunnan Academy of Science & The graduate Document Services of Technical Information " (LiterDelivery) obtain, and user can pass through IP " 192.168.1.22 " is accessed, and need to provide username and password during access, can be by configuring generation if the network address can not access Reason " 10.0.1.22 " is conducted interviews, and thus can be seen, and the Unify legislation of isomery science and technology information resource is realized by the language.
Present invention also provides a kind of resolver of the markup language for multi-source heterogeneous science and technology information resource, flow chart As shown in fig. 6, the device includes the following steps:
Step 1:HSTL description example is obtained, description example can be obtained in a manner of document form, manifold formula, can be located at It can also locally be located at long-range;
Step 2:Such as F6-2 in Fig. 6, HSTL file validation is verified first, whether verification file type meets the requirements parsing HSTL examples, the syntax rule defined according to HSTL, HSTL root node are obtained first, is then found according to HSTL under the node 's<Name>,<Version>Etc. information, according to version information, verification version information whether the version one with current analytics engine Cause property, is exited if inconsistent, otherwise into process of analysis;
It should be noted that HSTL version and the version of analytics engine need to be mutually matched, analytics engine can be to HSTL forward It is compatible.
Step 3:Such as F6-3 in Fig. 6, parsing extracts the type of science and technology information resource, passes through the scientific and technical literature built in system Type entities tag set finds corresponding type, further obtains the property value belonging to science and technology information resource by entity< Fields>, then verify whether attribute is object properties built in current version, if not then ignoring.Meanwhile it can be somebody's turn to do The method of service of entity;In addition, being limited according to the HSTL agency informations defined and resource access security, current entity is obtained The access control policy of object, the parsing of primary label is completed by the step;
Step 4:Such as the F6-3 in Fig. 6, after label substance parsing is completed, the essential information of current entity object is just obtained, so F6-3 modules are utilized afterwards, entity tag is handled, and if security restriction, then obtain safe handling rule;
Step 5:Such as the F6-4 in Fig. 6, HSTL memory object version is built in internal memory, processing is with safety regulation limitation Object, filter non-public object, attribute and relation;
Step 6:Such as the F6-5 in Fig. 6, object after processing can with to being disclosed using visitor, can with CSV and JSON form export;Other storage programs and processing routine can be understood with used acquisition HSTL Schema specifications CSV and JSON semantic information;
It should be noted that safety filtering here is mainly used in the processing of basic object attribute, the control of real resource is each The mechanism of individual science and technology information resource is managed.
To sum up, the present invention has the advantages that:Due to lacking the consistent of unification in current science and technology information resource field Property standard, also due to the multi-source of science and technology information resource, isomery diversity, result in the need for carrying out different Object tables for them State, sub-category design is also required in data storage and data processing, therefore bring great complexity, for these problems, A kind of isomery science and technology information resource of the present invention describes method and apparatus, and the multi-source Document Information Resources of Science & Technology of isomery can be entered The unified effective description of row, the object after description are easy to unified storage and processing so that storage and processing method can be to differences The science and technology information resource of type uses the processing mode of uniformity, and user obtains the example after description by the method for the present invention, Description content can be understood that with reference to the Schema specifications of description language;Also there is good autgmentability, compatibility and can manage Xie Xing.
A kind of label for multi-source heterogeneous science and technology information resource provided above the embodiment of the present invention describes method It is described in detail with device, applies specific case herein and the principle and implementation of the present invention are set forth, The explanation of above example is only intended to help the method and core concept for understanding the present invention;Meanwhile the present invention is not limited to Embodiment is stated, in those of ordinary skill in the art's possessed knowledge, present inventive concept can also not departed from Under the premise of various changes can be made, in summary, this specification content should not be construed as the present invention limitation.

Claims (5)

1. a kind of label for multi-source heterogeneous science and technology information resource describes method, it is characterised in that including:First, science and technology is believed Cease the object in resource category according to scientific and technological resources classification of type, extract each object of classification entity attribute and entity it Between relation form entity, attribute and relation storehouse;Secondly, to the statement standardization and symbolism in these storehouses, then, language is defined Method rule, for the special description labels of the XML of entity, object and relational design towards science and technology information resource field, formation is made by oneself Label description language that is adopted and possessing self-described;Finally, designed by the XML Schema of the description language, it is possessed rule Plasticity, markup language can be verified and constrained.
2. according to the method for claim 1, it is characterised in that the object in science and technology information resource category is provided according to science and technology Source Type is classified, and forms entity storehouse, and then extract the relation shape between the general entity attribute and entity of each object of classification Into entity, attribute and relation storehouse, the following steps can be divided into:
Step 1:Type characteristic according to science and technology information resource itself is classified to science and technology information resource, obtains classification entity storehouse {Obj 1 , Obj 1 , Obj 1 ,…, Obj n , it is designated as:Γ, traversal check that classification entity storehouse checks it with the presence or absence of corresponding mark Standard, if in the presence of being marked, form set Ψ;
Step 2:On the basis of step 1, extract each classification entity i attributeAttr i1 , Attr i2 , Attr i3 ,…, Attr im ,, attribute library is built, according to the relationship of the two, establishes " entity-attribute " relation storehouse;
Step 3:On entity sets Ψ, obtain entity attributes set with reference to " entity-attribute " relation and be designated as:Θ, further according to Respective standard, travel through corresponding attribute and check whether meta data match corresponding with substantive standard, if unanimously, being marked, formed Set:Ω;
Step 4:The data for not having standard in entity and attribute library are designated as set Γ-Ψ and Θ-Ω respectively.
3. according to the method for claim 1, it is characterised in that on the basis in entity, attribute and the relation storehouse formed On be described as follows standardization and symbolism processing:
Step 1:In entity storehouse, for the object beyond Ψ, i.e. Γ-Ψ, check its Object Semanteme whether with it is synonymous in Ψ or It is close, if in the presence of obtaining corresponding attribute by " entity-attribute ";
Step 2:On the basis of step 1 two entity attributes similitudes are calculated using vector space cosine similarity:<math display = 'block'> <mrow> <mi>sim</mi> <mo stretchy='false'>(</mo> <mi>X</mi> <mo>,</mo> <mi>Y</mi> <mo stretchy='false'>)</mo> <mo>=</mo> <mi>cos</mi> <mi>&amp;theta;</mi> <mo>=</mo> <mfrac> <mrow> <mover> <mi>x</mi> <mo>&amp;rarr;</mo> </mover> <mo>&amp;sdot;</mo> <mover> <mi>y</mi> <mo>&amp;rarr;</mo> </mover> </mrow> <mrow> <mo>|</mo> <mo>|</mo> <mi>x</mi> <mo>|</mo> <mo>|</mo> <mo>&amp;sdot;</mo> <mo>|</mo> <mo>|</mo> <mi>y</mi> <mo>|</mo> <mo>|</mo> </mrow> </mfrac> </mrow> </math>If similarity is more than 70%, then the entity will be given up, because the content of its description is deposited Repeating, will obtain the entity sets with compared with high rule complexity after the completion of this step, be designated as:Γ’;
Step 3:Travel through ΓMiddle attribute, the different attribute of same class object is merged according to corresponding metadata standard, The attribute and relation being of little use are given up, and the attribute and relation after specification carry out title agreement, and determine the synonym of same attribute.
4. according to the method for claim 1, it is characterised in that for the design XML associated description marks per a kind of entity object Label, it is characterised in that:1)According to the type of science and technology information resource to one XML tag of every a kind of setting, label substance is according to power The classification entity of profit 1;2)According to each entity attributes, entity attributes label is defined;3)It is fixed according to the relation between entity The correlation tag of adopted entity;4)For the markup language designed, XML Schema specifications are further designed, formulate related constraint Scope, in order to the normalization of markup language, while also allow for being modeled instantiation using current XML Software tools.
A kind of 5. label description language resolver for multi-source heterogeneous science and technology information resource, it is characterised in that including:Entity Object automatic describing module;The identification of isomery science and technology information resource markup language, legitimate verification module;The tag resolution mould of language Block;Entity object semantic interpretation and object entity data format conversion module;
Device according to claim 5, it is characterised in that the entity object automatic describing module, be used for:According to object, Attribute with associate the affiliated type of automatic identification object, then carried out automatically using corresponding label according to resolution system semantic rules Sign;
Device according to claim 5, it is characterised in that the isomery science and technology information resource markup language identification, legitimacy Authentication module, it is used for:The content described to the language of science and technology information resource is identified, and whether the syntax rule of assertion language closes Method, and the front and rear version compatibility of verification description language;
Device according to claim 5, it is characterised in that the tag resolution module of the language, be used for:If pair of description Image structures mode is legal, then parses corresponding label by syntax rule, further obtains the value that is identified of label and clearly Relation between entity, so as to obtain the object of description;
Device according to claim 5, it is characterised in that entity object semantic interpretation and the object entity data format turns Block is changed the mold, is used for:By the value of tag identifier, the semantic information of entity object is parsed, for example, what object is, belongs to assorted Type etc.;By serializing mode object can be allowed to enter row information with other processing routines to exchange, realize CSV, JSON etc. its The conversion of his form;Other programs can understand the description content of instantiation by Schema specifications;According to pair after parsing As semantic action scope calls respective handling module.
CN201710658922.5A 2017-08-04 2017-08-04 Label description method and device for multi-source heterogeneous scientific and technological information resources Expired - Fee Related CN107357933B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710658922.5A CN107357933B (en) 2017-08-04 2017-08-04 Label description method and device for multi-source heterogeneous scientific and technological information resources

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710658922.5A CN107357933B (en) 2017-08-04 2017-08-04 Label description method and device for multi-source heterogeneous scientific and technological information resources

Publications (2)

Publication Number Publication Date
CN107357933A true CN107357933A (en) 2017-11-17
CN107357933B CN107357933B (en) 2020-08-21

Family

ID=60286161

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710658922.5A Expired - Fee Related CN107357933B (en) 2017-08-04 2017-08-04 Label description method and device for multi-source heterogeneous scientific and technological information resources

Country Status (1)

Country Link
CN (1) CN107357933B (en)

Cited By (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108509595A (en) * 2018-04-02 2018-09-07 深圳市华傲数据技术有限公司 Method for sorting, device, storage medium and the equipment of isomeric data
CN108595421A (en) * 2018-04-13 2018-09-28 北京神州泰岳软件股份有限公司 A kind of abstracting method, the apparatus and system of Chinese entity associated relationship
CN109471957A (en) * 2018-09-19 2019-03-15 北京悦图遥感科技发展有限公司 A kind of metadata conversion method and device based on unified label
CN109919469A (en) * 2019-02-27 2019-06-21 浪潮软件集团有限公司 A kind of holography science data processing method
CN110647587A (en) * 2019-09-29 2020-01-03 肖凯泽 Heterogeneous resource mapping method based on two-stage model
CN110688421A (en) * 2018-06-20 2020-01-14 南京网感至察信息科技有限公司 Intelligent customizable data management and analysis method
CN110750647A (en) * 2019-10-17 2020-02-04 北京华宇信息技术有限公司 Construction method of ELP model of multi-source heterogeneous information data
CN111125383A (en) * 2019-12-25 2020-05-08 新华智云科技有限公司 Event model-based media asset tag storage and retrieval method
CN111177372A (en) * 2019-12-06 2020-05-19 绍兴市上虞区理工高等研究院 Scientific and technological achievement classification method, device, equipment and medium
CN111190602A (en) * 2019-12-30 2020-05-22 富通云腾科技有限公司 Heterogeneous cloud resource-oriented conversion method
CN111681775A (en) * 2020-06-03 2020-09-18 北京启云数联科技有限公司 Medicine application analysis method, system and device based on medicine big data
CN111681776A (en) * 2020-06-03 2020-09-18 北京启云数联科技有限公司 Medicine object relation analysis method and system based on medicine big data
CN112148741A (en) * 2020-10-16 2020-12-29 中石化重庆涪陵页岩气勘探开发有限公司 Petroleum geological data loading method and device, server and storage medium
CN112163248A (en) * 2020-10-12 2021-01-01 重庆大学 Rule-based process resource environmental load data normalization method
CN113220911A (en) * 2021-05-25 2021-08-06 中国农业科学院农业信息研究所 Agricultural multi-source heterogeneous data analysis and mining method and application thereof
CN113361979A (en) * 2021-08-10 2021-09-07 湖南高至科技有限公司 Profile-oriented ontology modeling method and device, computer equipment and storage medium
CN113992769A (en) * 2021-10-26 2022-01-28 重庆斯欧智能科技研究院有限公司 Industrial internet information exchange method
CN114844786A (en) * 2022-03-31 2022-08-02 广州大学 Internet of things resource credibility assessment method based on heterogeneous information map

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130086063A1 (en) * 2011-08-31 2013-04-04 Trista P. Chen Deriving User Influences on Topics from Visual and Social Content
CN103886046A (en) * 2014-03-11 2014-06-25 中国信息安全测评中心 Automatic semanteme extraction method for Web data exchange
CN104182454A (en) * 2014-07-04 2014-12-03 重庆科技学院 Multi-source heterogeneous data semantic integration model constructed based on domain ontology and method
CN105488056A (en) * 2014-09-17 2016-04-13 阿里巴巴集团控股有限公司 Object processing method and equipment

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130086063A1 (en) * 2011-08-31 2013-04-04 Trista P. Chen Deriving User Influences on Topics from Visual and Social Content
CN103886046A (en) * 2014-03-11 2014-06-25 中国信息安全测评中心 Automatic semanteme extraction method for Web data exchange
CN104182454A (en) * 2014-07-04 2014-12-03 重庆科技学院 Multi-source heterogeneous data semantic integration model constructed based on domain ontology and method
CN105488056A (en) * 2014-09-17 2016-04-13 阿里巴巴集团控股有限公司 Object processing method and equipment

Cited By (25)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108509595A (en) * 2018-04-02 2018-09-07 深圳市华傲数据技术有限公司 Method for sorting, device, storage medium and the equipment of isomeric data
CN108595421A (en) * 2018-04-13 2018-09-28 北京神州泰岳软件股份有限公司 A kind of abstracting method, the apparatus and system of Chinese entity associated relationship
CN110688421A (en) * 2018-06-20 2020-01-14 南京网感至察信息科技有限公司 Intelligent customizable data management and analysis method
CN109471957A (en) * 2018-09-19 2019-03-15 北京悦图遥感科技发展有限公司 A kind of metadata conversion method and device based on unified label
CN109471957B (en) * 2018-09-19 2020-08-04 北京悦图数据科技发展有限公司 Metadata conversion method and device based on uniform tags
CN109919469A (en) * 2019-02-27 2019-06-21 浪潮软件集团有限公司 A kind of holography science data processing method
CN110647587A (en) * 2019-09-29 2020-01-03 肖凯泽 Heterogeneous resource mapping method based on two-stage model
CN110750647A (en) * 2019-10-17 2020-02-04 北京华宇信息技术有限公司 Construction method of ELP model of multi-source heterogeneous information data
CN111177372A (en) * 2019-12-06 2020-05-19 绍兴市上虞区理工高等研究院 Scientific and technological achievement classification method, device, equipment and medium
CN111125383A (en) * 2019-12-25 2020-05-08 新华智云科技有限公司 Event model-based media asset tag storage and retrieval method
CN111125383B (en) * 2019-12-25 2023-08-11 新华智云科技有限公司 Event model-based media resource tag storage and retrieval method
CN111190602A (en) * 2019-12-30 2020-05-22 富通云腾科技有限公司 Heterogeneous cloud resource-oriented conversion method
CN111681776A (en) * 2020-06-03 2020-09-18 北京启云数联科技有限公司 Medicine object relation analysis method and system based on medicine big data
CN111681775A (en) * 2020-06-03 2020-09-18 北京启云数联科技有限公司 Medicine application analysis method, system and device based on medicine big data
CN111681776B (en) * 2020-06-03 2023-09-29 北京启云数联科技有限公司 Medical object relation analysis method and system based on medical big data
CN111681775B (en) * 2020-06-03 2023-09-29 北京启云数联科技有限公司 Medicine application analysis method, system and device based on medicine big data
CN112163248A (en) * 2020-10-12 2021-01-01 重庆大学 Rule-based process resource environmental load data normalization method
CN112148741A (en) * 2020-10-16 2020-12-29 中石化重庆涪陵页岩气勘探开发有限公司 Petroleum geological data loading method and device, server and storage medium
CN113220911A (en) * 2021-05-25 2021-08-06 中国农业科学院农业信息研究所 Agricultural multi-source heterogeneous data analysis and mining method and application thereof
CN113220911B (en) * 2021-05-25 2024-02-02 中国农业科学院农业信息研究所 Agricultural multi-source heterogeneous data analysis and mining method and application thereof
CN113361979A (en) * 2021-08-10 2021-09-07 湖南高至科技有限公司 Profile-oriented ontology modeling method and device, computer equipment and storage medium
CN113992769A (en) * 2021-10-26 2022-01-28 重庆斯欧智能科技研究院有限公司 Industrial internet information exchange method
CN113992769B (en) * 2021-10-26 2023-10-27 合肥斯欧互联科技股份有限公司 Industrial Internet information exchange method
CN114844786A (en) * 2022-03-31 2022-08-02 广州大学 Internet of things resource credibility assessment method based on heterogeneous information map
CN114844786B (en) * 2022-03-31 2023-11-14 广州大学 Internet of things resource credibility evaluation method based on heterogeneous information map

Also Published As

Publication number Publication date
CN107357933B (en) 2020-08-21

Similar Documents

Publication Publication Date Title
CN107357933A (en) A kind of label for multi-source heterogeneous science and technology information resource describes method and apparatus
Visser et al. Enabling technologies for interoperability
Song et al. An ontology-driven framework towards building enterprise semantic information layer
US20110246530A1 (en) Method and System for Semantically Unifying Data
CN106919689A (en) Professional domain knowledge mapping dynamic fixing method based on definitions blocks of knowledge
US8452772B1 (en) Methods, systems, and articles of manufacture for addressing popular topics in a socials sphere
CN106663101A (en) Ontology mapping method and apparatus
de Almeida Ferreira et al. RSL-PL: A linguistic pattern language for documenting software requirements
Stührenberg The TEI and current standards for structuring linguistic data. An overview
US10397326B2 (en) IRC-Infoid data standardization for use in a plurality of mobile applications
CN114443855A (en) Knowledge graph cross-language alignment method based on graph representation learning
CN103092973B (en) information extraction method and device
KR20220074576A (en) A method and an apparatus for extracting new words based on deep learning to generate marketing knowledge graphs
Chiarcos et al. On the linguistic linked open data infrastructure
CN114792145A (en) Standard digital management maintenance system and method based on knowledge graph
Jiang et al. Research on BIM-based Construction Domain Text Information Management.
Zhang et al. Constructing ontologies by mining deep semantics from XML Schemas and XML instance documents
Jou Schema extraction for deep web query interfaces using heuristics rules
CN117473054A (en) Knowledge graph-based general intelligent question-answering method and device
Yuxuan et al. Research on intelligent organization and application of multi-source heterogeneous knowledge resources for energy internet
US20090217156A1 (en) Method for Storing Localized XML Document Values
KR20220074572A (en) A method and an apparatus for extracting new words based on deep learning to generate marketing knowledge graphs
Wahid et al. XML semantic constraint validation for XML updates: A survey
Li et al. [Retracted] The Research of Multimedia Complex Intelligent System in Financial Reporting Mode
Sun Online algorithm design of english translation of film and television works under the background of media cultural information

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20200821

Termination date: 20210804