CN101576898A - Metadata proposal suitable for permanently filing and using network academic resources - Google Patents

Metadata proposal suitable for permanently filing and using network academic resources

Info

Publication number
CN101576898A
Authority
CN
China
Prior art keywords
resource
academic resources
network
metadata
network academic
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CNA2008103057557A
Other languages
Chinese (zh)
Inventor
刘玉良
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
BEIJING ZHONGJIAGUODAO TECHNOLOGY Co Ltd
Original Assignee
BEIJING ZHONGJIAGUODAO TECHNOLOGY Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by BEIJING ZHONGJIAGUODAO TECHNOLOGY Co Ltd
Priority to CNA2008103057557A
Publication of CN101576898A
Legal status: Pending

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a metadata proposal suitable for the permanent archiving and use of network academic resources, and belongs to the technical field of information storage and retrieval. Tailored to the characteristics of network academic resources, the proposal extends the Dublin Core metadata set with eight key extension elements, such as resource type, institution, and collection name, and effectively supports the disclosure, storage, and retrieval of network academic resources.

Description

A metadata proposal suitable for the permanent archiving and use of network academic resources
Technical field
The present invention is a metadata proposal suitable for the permanent archiving and use of network academic resources. It can be applied to the disclosure, permanent archiving, and use of academic resources on the Internet. By discipline, it belongs to modern library science and information science.
Background art
Network academic resources are digital resources on the Internet that relate to scholarship. They are vast in number, varied in kind, and broad in subject coverage, and they have real scholarly value; they are an important supplement to traditional academic resources such as academic journals.
The dynamic nature of the Internet makes network academic resources volatile: links often fail, and permanent archiving is quite difficult. The shared and open nature of the Internet also lets anyone publish academic resources online, which on the one hand leaves the resources disorganized and on the other makes their quality highly uneven. Against this background, establishing a targeted metadata specification and systematically organizing Internet academic resources becomes all the more important.
Metadata is data that describes data, or "data about data". It is used to describe the features and attributes of a database and is also a tool for describing and organizing information resources. In library science and information science, metadata is widely used to describe books, periodicals, conferences, regulations, intelligence, and other resources. Modern library science has largely completed the specification of MARC (Machine Readable Catalog: a catalog recorded on a computer storage medium in coded form and with a specific structure, to be recognized and read by computer). This standard has played a crucial role in the permanent archiving of books, but because it was designed around the characteristics of books, it is not suitable for network academic resources. The present proposal was devised to solve the problem of applying network academic resources in library and information work.
Formulating a workable metadata proposal is a complex infrastructure project: it requires the participation of experts from many fields and a long period of validation in use before it can be finalized. This metadata proposal draws fully on the Dublin Core metadata set standard and distills our long experience in organizing network academic resources and building databases of them.
Summary of the invention
The present invention discloses a metadata proposal suitable for the permanent archiving and use of network academic resources, and belongs to the technical field of information storage and retrieval. Tailored to the characteristics of network academic resources, the proposal extends the Dublin Core metadata set with 8 key extension elements, such as resource type, institution, and collection name, and effectively supports the disclosure, storage, and retrieval of network academic resources.
The metadata element set of the proposal is defined as follows:
(1) Core elements (8)
Element name: resource type
Definition: the type of the network academic resource.
Note: resources are divided by content into 11 major categories: papers, reports, meeting minutes, agenda proposals, column commentaries, regulations and standards, new product information, e-books, teaching materials, institutional publications, and patents.
Element name: institution
Definition: the name of the institution of the resource's creator.
Note: the name of the institution to which the creator (Creator) belongs.
Element name: collection name
Definition: content published serially or as a series is treated as a collection; the collection name is the title of that collection.
Note: for example, a proceedings title, conference name, periodical title, or newspaper title.
Element name: serial number
Definition: the sequential number of each resource within a collection.
Note: examples include technical report numbers, contract numbers, proceedings numbers, book numbers, edition numbers, patent numbers, and standard numbers.
Element name: region
Definition: the address of the institution that published the resource, or the name of the region where the creator is located.
Note: country and/or city. For example, the host city of a conference, the address of the institution of a paper's author, the location of a website, or the home location of the company to which a product belongs.
Element name: local mirror address
Definition: the mirror address of the network academic resource on the local network.
Note: the local network is made up of multiple mirror sites, and every resource is mirrored on several of them.
Element name: central mirror address
Definition: the mirror address of the network academic resource on the central mirror site.
Note: the central mirror site is a server cluster that provides mirrors of all available network academic resources. It is a large and complete mirror site.
Element name: score
Definition: the importance score of each resource.
Note: the importance score assigned to each resource by an importance scoring system.
(2) Remaining elements (the 15 elements of the Dublin Core metadata set)
Element name: contributor
Definition: another entity that has contributed to the content of the resource.
Note: examples of a contributor include a person, an organization, or a service. Typically, the name of the contributor is used to identify this entry.
Element name: coverage
Definition: the extent or scope of the content of the resource.
Note: coverage typically includes a spatial location (a place name or geographic coordinates), a temporal period (a period label, a date, or a date range), or a jurisdiction (such as a named administrative entity). It is recommended that coverage values be taken from a controlled vocabulary (for example, the Thesaurus of Geographic Names [TGN]) and that, where appropriate, named places and time periods be used in preference to numeric coordinates or date ranges.
Element name: creator
Definition: the entity primarily responsible for creating the content of the resource.
Note: examples of a creator include a person, an organization, or a service. Typically, the name of the creator is used to identify this entry.
Element name: date
Definition: a date associated with an event in the life cycle of the resource.
Note: typically, the date relates to the creation or availability of the resource. The recommended date format follows the ISO 8601 profile [W3CDTF] and uses the form YYYY-MM-DD.
Element name: description
Definition: an account of the content of the resource.
Note: a description may include, but is not limited to, an abstract, a table of contents, a caption for an image, or a free-text account of the content.
Element name: format
Definition: the physical or digital manifestation of the resource.
Note: typically, format may include the media type or the dimensions of the resource, and may be used to determine the software, hardware, or other equipment needed to display or operate the resource. Examples of dimensions include storage size and duration. It is recommended that values be taken from a controlled vocabulary (for example, the computer media formats defined by the Internet Media Types [MIME]).
Element name: identifier
Definition: an unambiguous reference to the resource within a given context.
Note: it is recommended that the resource be identified by a string or number conforming to a formal identification system. Examples of formal identification systems include the Uniform Resource Identifier (URI) (including the Uniform Resource Locator, URL), the Digital Object Identifier (DOI), and the International Standard Book Number (ISBN).
Element name: language
Definition: the language of the intellectual content of the resource.
Note: it is recommended that values of this element follow RFC 3066 [RFC3066], which, together with ISO 639 [ISO639], defines language tags consisting of a primary tag of two or three letters and optional subtags. For example, "en" or "eng" denotes English, "akk" denotes Akkadian, and "en-GB" denotes British English.
Element name: publisher
Definition: the entity responsible for making the resource available.
Note: examples of a publisher include a person, an organization, or a service. Typically, the name of the publisher is used to identify this entry.
Element name: relation
Definition: a reference to a related resource.
Note: it is recommended that the referenced resource be identified by a string or number conforming to a formal identification system.
Element name: rights
Definition: information about rights held in and over the resource.
Note: typically, the rights element contains a rights statement for the resource, or a reference to a service that provides such information. Rights generally cover intellectual property rights (IPR), copyright, and various other property rights. If the rights element is absent, no assumption can be made about the status of these or any other rights relating to the resource.
Element name: source
Definition: a reference to a resource from which the present resource is derived.
Note: the present resource may be derived in whole or in part from the resource identified by the source element. It is recommended that the source resource be identified by a string or number conforming to a formal identification system.
Element name: subject
Definition: the topic of the content of the resource.
Note: a subject is typically expressed as keywords, key phrases, or classification codes that describe a topic of the resource, preferably with values taken from a controlled vocabulary or a formal classification scheme.
Element name: title
Definition: the name given to the resource.
Note: typically, the name by which the resource is formally known.
Element name: type
Definition: the nature or genre of the content of the resource.
Note: type includes terms describing general categories, functions, genres, or aggregation levels of the content. It is recommended that values be taken from a controlled vocabulary (for example, the DCMI Type Vocabulary [DCMITYPE]). To describe the physical or digital manifestation of the resource, use the "format" element.
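The element set defined above can be sketched in code as follows. This is a minimal illustration only, not part of the proposal: the snake_case identifiers for the 8 extension elements, the function name validate_record, and the choice to treat the score as an integer (suggested by the table definition in the embodiment) are all assumptions made for the example. The date check follows the YYYY-MM-DD form recommended in the note on the date element.

```python
from datetime import datetime

# The 8 extension elements; these snake_case identifiers are illustrative
# assumptions, not names fixed by the proposal.
EXTENSION_ELEMENTS = (
    "resource_type", "institution", "collection_name", "serial_number",
    "region", "local_mirror_address", "central_mirror_address", "score",
)

# The 15 Dublin Core elements, under their standard English names.
DUBLIN_CORE_ELEMENTS = (
    "contributor", "coverage", "creator", "date", "description", "format",
    "identifier", "language", "publisher", "relation", "rights", "source",
    "subject", "title", "type",
)

ALL_ELEMENTS = EXTENSION_ELEMENTS + DUBLIN_CORE_ELEMENTS


def validate_record(record: dict) -> list:
    """Return a list of problems found in a metadata record (empty if none)."""
    problems = []
    # Every key must be one of the 23 elements of the set.
    for name in record:
        if name not in ALL_ELEMENTS:
            problems.append(f"unknown element: {name}")
    # The date, when present, should parse as year-month-day.
    if "date" in record:
        try:
            datetime.strptime(record["date"], "%Y-%m-%d")
        except (ValueError, TypeError):
            problems.append("date is not in YYYY-MM-DD form")
    # Assumption: the score is stored as an integer.
    if "score" in record and not isinstance(record["score"], int):
        problems.append("score is not an integer")
    return problems
```

All elements are optional in this sketch, which mirrors the nullable columns of the table in the embodiment; a stricter deployment could additionally require elements such as title or identifier.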
Advantages of the proposal
Tailored to the characteristics of network academic resources, the proposal introduces 8 key extension elements, which effectively ensure that data is stored permanently and is convenient to use.
1. The resource type element describes the type of a network academic resource. The types fall into 11 major categories: papers, reports, meeting minutes, agenda proposals, column commentaries, regulations and standards, new product information, e-books, teaching materials, institutional publications, and patents. This lets users search and navigate by a specific resource type.
2. The institution element describes the organization to which the creator of a network academic resource belongs, for example a school, a research institute, or the research department of a company. Besides providing an entry point for search and navigation, the institution element is also one important basis for assessing the importance of a network academic resource.
3. The score is a quantified value of a network academic resource within its field and can serve as one basis for ranking search results.
4. In addition to the "identifier (Identifier)" and "source (Source)" elements of the Dublin Core, the proposal adds the central mirror address and the local mirror address, so that besides the original network link each resource has at least two mirror-site links. All 4 elements point to ways of accessing the resource, and this multi-level storage arrangement improves the availability of network academic resources.
5. Network academic resources span many disciplines and many types and lack a unified publishing standard and a unified management mechanism. The proposal uses the "collection name" element to describe the title of a collection of resources that has some continuity. This is a newly devised method of description: it organizes otherwise scattered resources by the collections they belong to, making the resources quicker and more convenient to use.
6. Within a collection, each resource has a specific number; for example, each volume of proceedings has a proceedings number, and each issue of a periodical has a volume and issue number. The serial number element, established to complement "collection name", describes this number.
7. The region element indicates the geographic location of the publishing institution or of the creator, so that users can search or navigate by geographic location.
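The multi-level access arrangement described in item 4 can be illustrated with a short sketch. It is only an assumption about how a client might use the four access elements: the order in which they are tried, the snake_case field names, and the function names access_paths and first_reachable are hypothetical, and the reachability check is passed in as a parameter so that the example does not depend on any particular network library.

```python
from typing import Callable, Optional

# Elements that point to a way of accessing the resource. The trial order
# (original link first, mirrors afterwards) is an assumption of this sketch.
ACCESS_ELEMENTS = (
    "identifier",
    "source",
    "local_mirror_address",
    "central_mirror_address",
)


def access_paths(record: dict) -> list:
    """Collect the non-empty access paths of a record, in trial order."""
    return [record[name] for name in ACCESS_ELEMENTS if record.get(name)]


def first_reachable(record: dict,
                    is_reachable: Callable[[str], bool]) -> Optional[str]:
    """Return the first access path accepted by is_reachable, or None."""
    for address in access_paths(record):
        if is_reachable(address):
            return address
    return None
```

With this arrangement a failed original link does not make the resource unavailable as long as one of the mirror addresses still responds.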
Specific embodiments
The following uses a Microsoft SQL Server 2005 database to illustrate how the metadata proposal can be applied to archive and use network academic resources.
First, create a database named sample with the following statement:
CREATE DATABASE [sample]
Next, create a data table named NetSource, using an auto-incrementing ID field as the primary key of the table. The table is a simple relational table and can be created with the following statements:
USE [sample]
GO
-- One column per metadata element; ID is the auto-incrementing primary key.
CREATE TABLE [dbo].[NetSource](
    [ID] [int] IDENTITY(1,1) NOT NULL PRIMARY KEY,
    [resource type] [nvarchar](max) COLLATE Chinese_PRC_CI_AS NULL,
    [institution] [nvarchar](max) COLLATE Chinese_PRC_CI_AS NULL,
    [collection name] [nvarchar](max) COLLATE Chinese_PRC_CI_AS NULL,
    [serial number] [nvarchar](max) COLLATE Chinese_PRC_CI_AS NULL,
    [region] [nvarchar](max) COLLATE Chinese_PRC_CI_AS NULL,
    [local mirror address] [nvarchar](max) COLLATE Chinese_PRC_CI_AS NULL,
    [central mirror address] [nvarchar](max) COLLATE Chinese_PRC_CI_AS NULL,
    [score] [int] NULL,
    [title] [nvarchar](max) COLLATE Chinese_PRC_CI_AS NULL,
    [creator] [nvarchar](max) COLLATE Chinese_PRC_CI_AS NULL,
    [date] [datetime] NULL,
    [subject] [nvarchar](max) COLLATE Chinese_PRC_CI_AS NULL,
    [publisher] [nvarchar](max) COLLATE Chinese_PRC_CI_AS NULL,
    [type] [nvarchar](max) COLLATE Chinese_PRC_CI_AS NULL,
    [description] [nvarchar](max) COLLATE Chinese_PRC_CI_AS NULL,
    [contributor] [nvarchar](max) COLLATE Chinese_PRC_CI_AS NULL,
    [format] [nvarchar](max) COLLATE Chinese_PRC_CI_AS NULL,
    [source] [nvarchar](max) COLLATE Chinese_PRC_CI_AS NULL,
    [rights] [nvarchar](max) COLLATE Chinese_PRC_CI_AS NULL,
    [identifier] [nvarchar](max) COLLATE Chinese_PRC_CI_AS NULL,
    [language] [nvarchar](max) COLLATE Chinese_PRC_CI_AS NULL,
    [relation] [nvarchar](max) COLLATE Chinese_PRC_CI_AS NULL,
    [coverage] [nvarchar](max) COLLATE Chinese_PRC_CI_AS NULL
) ON [PRIMARY]
Once this data table has been created, a data entry tool can be used to enter network academic resources into it. After entry, the resources can be searched and used through the standard functions of the database.
Description of drawings
The accompanying drawing is a schematic diagram of the metadata proposal suitable for the permanent archiving and use of network academic resources. See the drawings of the specification for details.
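The entry and retrieval step can be sketched as follows. This is a hedged illustration rather than the embodiment itself: it uses Python's built-in sqlite3 module with an in-memory database as a stand-in for SQL Server, keeps only a few of the 23 columns, and the column names and the functions open_archive, add_record, and search_by_type are hypothetical. Ordering results by score follows the stated purpose of the score element as one basis for ranking search results.

```python
import sqlite3


def open_archive() -> sqlite3.Connection:
    """Create an in-memory stand-in for the NetSource table (reduced columns)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        """
        CREATE TABLE NetSource (
            ID INTEGER PRIMARY KEY AUTOINCREMENT,
            resource_type TEXT,
            institution TEXT,
            title TEXT,
            score INTEGER
        )
        """
    )
    return conn


def add_record(conn: sqlite3.Connection, resource_type: str,
               institution: str, title: str, score: int) -> int:
    """Enter one resource and return its automatically assigned ID."""
    cur = conn.execute(
        "INSERT INTO NetSource (resource_type, institution, title, score) "
        "VALUES (?, ?, ?, ?)",
        (resource_type, institution, title, score),
    )
    conn.commit()
    return cur.lastrowid


def search_by_type(conn: sqlite3.Connection, resource_type: str) -> list:
    """Return titles of one resource type, highest score first."""
    rows = conn.execute(
        "SELECT title FROM NetSource WHERE resource_type = ? "
        "ORDER BY score DESC, ID ASC",
        (resource_type,),
    ).fetchall()
    return [row[0] for row in rows]
```

The same pattern extends to the other navigation entry points named in the proposal, such as filtering by institution or region.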

Claims (3)

1. A metadata proposal suitable for the permanent archiving and use of network academic resources, characterized in that:
(1) it extends the Dublin Core metadata set with 8 elements;
(2) it is specifically suited to the disclosure, archiving, and retrieval of network academic resources.
2. The metadata proposal suitable for the permanent archiving and use of network academic resources according to claim 1, characterized in that it comprises 8 specific elements: resource type, institution, collection name, serial number, region, central mirror address, local mirror address, and score.
3. The metadata proposal suitable for the permanent archiving and use of network academic resources according to claim 1, characterized in that the metadata proposal can be mapped to a database table definition and thereby combined with various databases to support the permanent archiving and use of network academic resources.
CNA2008103057557A 2008-11-26 2008-11-26 Metadata proposal suitable for permanently filing and using network academic resources Pending CN101576898A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNA2008103057557A CN101576898A (en) 2008-11-26 2008-11-26 Metadata proposal suitable for permanently filing and using network academic resources

Publications (1)

Publication Number Publication Date
CN101576898A true CN101576898A (en) 2009-11-11

Family

ID=41271831

Family Applications (1)

Application Number Title Priority Date Filing Date
CNA2008103057557A Pending CN101576898A (en) 2008-11-26 2008-11-26 Metadata proposal suitable for permanently filing and using network academic resources

Country Status (1)

Country Link
CN (1) CN101576898A (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106874144A (en) * 2016-12-30 2017-06-20 上海中信信息发展股份有限公司 Storage backup policy evaluation method based on electronic record attribute
CN111984776A (en) * 2020-08-20 2020-11-24 中国农业科学院农业信息研究所 Mechanism name standardization method based on word vector model
CN111984776B (en) * 2020-08-20 2023-08-11 中国农业科学院农业信息研究所 Mechanism name standardization method based on word vector model
CN114462384A (en) * 2022-04-12 2022-05-10 北京大学 Metadata automatic generation device for digital object modeling
CN114462384B (en) * 2022-04-12 2022-07-12 北京大学 Metadata automatic generation device for digital object modeling

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Open date: 20091111