CN113177150A - Publication resource integration method and publication resource integration system - Google Patents

Publication resource integration method and publication resource integration system Download PDF

Info

Publication number
CN113177150A
CN113177150A CN202110448632.4A CN202110448632A CN113177150A CN 113177150 A CN113177150 A CN 113177150A CN 202110448632 A CN202110448632 A CN 202110448632A CN 113177150 A CN113177150 A CN 113177150A
Authority
CN
China
Prior art keywords
data
publication
publication resource
resource
processing
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202110448632.4A
Other languages
Chinese (zh)
Inventor
夏国兵
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xinhua Zhiyun Technology Co ltd
Original Assignee
Xinhua Zhiyun Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xinhua Zhiyun Technology Co ltd filed Critical Xinhua Zhiyun Technology Co ltd
Priority to CN202110448632.4A priority Critical patent/CN113177150A/en
Publication of CN113177150A publication Critical patent/CN113177150A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/958Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F8/00Arrangements for software engineering
    • G06F8/30Creation or generation of source code
    • G06F8/31Programming languages or programming paradigms

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Computing Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The application relates to a publication resource integration method and a publication resource integration system, which can realize the uniform storage and management, the uniform modeling and the uniform standardization of all media (audio-visual image-text) data of publication resources, and can construct the association relationship among the publication resource data based on the method, thereby effectively avoiding the data island problem among the publication resource data, so that a manager can provide data-driven management intelligent decision analysis through the analysis of the association relationship of the publication resource data.

Description

Publication resource integration method and publication resource integration system
Technical Field
The application relates to the technical field of publication resource management, in particular to a publication resource integration method and a publication resource integration system.
Background
Conventional publishing resource management systems generally include book management systems, course management systems, and illustration management systems. The three systems are independently constructed and operated, and each system only processes certain type of data or provides certain part of functional service. The book management system contains published book resources for retrieving book-related information. The course management system comprises online and offline course resources for retrieving course-related information. The picture-inserting management system comprises some picture materials in books and courses and is used for specially managing the picture materials.
In a traditional publishing resource management system, three systems are independently constructed and operated, each system only processes data of a certain type, or provides a certain part of functions, and resources such as books, courses, pictures and the like are dispersedly stored in databases of the respective systems. This creates a problem in that data islands are created between publication source data for individual systems. If a certain type of publication resources need to be edited and processed, for example, to process information related to courses, corresponding information needs to be acquired from three systems respectively and then manually gathered and analyzed, which not only consumes a large amount of manual workload and does not realize effective assistance of intelligent capability and publishing production capability, but also causes the publication company to be unable to effectively control the real-time overall situation of the publication resources and to be unable to timely, effectively and clearly manage the publication resources.
Disclosure of Invention
Based on this, it is necessary to provide a publication resource integration method and a publication resource integration system for solving the problem that data islanding is easily generated between publication resource data by a conventional publication resource management method.
The application provides a publication resource integration method, comprising:
acquiring a plurality of publication source data from a service system and synchronizing the publication source data to a data storage unit;
capturing operation data from a publication resource website and a service system, and sending the operation data to a data storage unit;
performing data processing on each publication resource data to standardize each type of publication resource data so as to establish association between different publication resource data;
establishing an Elasticissearch index based on all the publication resource data, and importing all the publication resource data into an Elasticissearch engine; the Elasticsearch index includes a plurality of records, each record corresponding to a publication resource data.
The application also provides a publication resource integration system, which is used in cooperation with a service system, and comprises:
a management unit for executing the publication resource integration method as mentioned in the foregoing;
the data storage unit is connected with the management unit and comprises a publication resource data storage unit and an operation data storage unit;
the data processing unit is respectively connected with the management unit and the data storage unit and comprises a data standardization unit and a material processing unit;
the URC material library is respectively connected with the management unit and the data processing unit and is used for storing text materials, picture materials and voice materials after media processing and content verification;
and the Elasticissearch search engine is connected with the management unit.
The application relates to a publication resource integration method and a publication resource integration system, which can realize the uniform storage and management, the uniform modeling and the uniform standardization of all media (audio-visual image-text) data of publication resources, and can construct the association relationship among the publication resource data based on the method, thereby effectively avoiding the data island problem among the publication resource data, so that a manager can provide data-driven management intelligent decision analysis through the analysis of the association relationship of the publication resource data.
Drawings
FIG. 1 is a schematic flow chart of a publication resource integration method provided by an embodiment of the present application;
fig. 2 is a schematic structural diagram of a publication resource integration system according to an embodiment of the present application.
Reference numerals:
10-publication resource integration system; 110-a management unit; 120-a data storage unit;
121-publication resource data storage unit; 122-operation data storage unit;
130-a data processing unit; 131-a data normalization unit; 132-material processing unit;
140-URC materials library; 150-elastic search engine; 20-service system
Detailed Description
In order to make the objects, technical solutions and advantages of the present application more apparent, the present application is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the present application and are not intended to limit the present application.
The present application provides a method for publication resource integration. It should be noted that the method for integrating publication resources provided by the present application is applicable to any kind of publication resources, including but not limited to books, video and audio materials, courses, information, topics, test questions, test papers, meetings, authors and periodicals.
In addition, the publication resource integration method provided by the application is not limited to the execution subject. Alternatively, the execution subject of the publication resource integration method provided by the present application may be a publication resource integration system 10. The publication resource integration system 10 is connected to a publisher's business system 20. Specifically, the execution subject of the publication resource integration method provided by the present application may be the management unit 110 in the publication resource integration system 10.
As shown in fig. 1, in an embodiment of the present application, the method includes the following steps S100 to S400:
s100, acquiring data of a plurality of publication sources from the service system 20 and synchronizing the data to the data storage unit 120.
Specifically, the local service system 20 of the publisher stores its own publication resource data. In this step, the management unit 110 in the publication resource integration system 10 may synchronize the data of the plurality of publication resources in the service system 20 to the data storage unit 120 in the publication resource integration system 10.
S200, capturing operation data from the publication resource website and the service system 20, and sending the operation data to the data storage unit 120.
Specifically, step S100 is to acquire publication resource data, and this step is to capture operation data. Operational data may encompass many aspects of information. For example, the number of pre-purchased persons for a particular publication resource (such as a book).
S300, performing data processing on each publication resource data to standardize each type of publication resource data, and establishing association among different publication resource data.
In particular, the publication resource data includes audio, video, picture and other unstructured data in the service system 20, and the purpose of this step is to convert these unstructured data into structured data. On the other hand, the data standards of various publication resources are unified, namely the data are standardized, so that the associated data among different publication resource data can be generated, and the purpose of eliminating data islands among different publication resource data is achieved.
S400, establishing an Elasticissearch index based on all the publication resource data, and importing all the publication resource data into the Elasticissearch search engine 150. The Elasticsearch index includes a plurality of records. Each record corresponds to a publication resource data.
Specifically, the Elasticsearch search engine 150 is a Lucene-based search server that provides a distributed multi-user capable full-text search engine, based on RESTful web interfaces. The Elasticissearch engine 150 has the characteristic of strong lateral expansion capability, publication resource data can be synchronized into the Elasticissearch engine 150 every day or every month, and data statistics and analysis are carried out through the Aggregation of the Elasticissearch engine 150. The indicators of the statistical analysis may include one or more of total amount of publication resources, total amount of departments, total amount of disciplines, newly added publication resources, newly added departments, newly added disciplines, product distribution, search frequency TOP5 and hit TOP 5.
In the embodiment, the application relates to a publication resource integration method and a publication resource integration system, which can realize the uniform storage and management, the uniform modeling and the uniform standardization of all media (audio-visual image-text) data of publication resources, and can construct the association relationship among the publication resource data based on the method, thereby effectively avoiding the data island problem among the publication resource data, and providing data-driven management intelligent decision analysis through the analysis of the association relationship among the publication resource data by a manager.
In an embodiment of the present application, the S100 includes the following steps:
s110, synchronizing the plurality of published resource data in the service system 20 to the data storage unit 120 based on the Dataworks data collection script.
In particular, dataword supports a variety of computing and storage engine services, including offline computing MaxCompute, open source big data engine E-MapReduce, Flink-based real-time computing, machine learning PAI, Graph computing service Graph computer, and interactive analytics service, etc., and supports user-defined access to computing and storage services. The data of a plurality of published resources in the business system 20 can be synchronized to the data storage unit 120 based on the Dataworks data collection script.
In an embodiment of the present application, the S200 includes the following steps:
s210, capturing operation data based on one or more of a crawler service, a buried point service, and a log service, and sending the operation data to the data storage unit 120.
In particular, based on a crawler service, web data for a publication resource or resources can be crawled from a publication selling website. For example, for the a book, based on the crawler service, information such as how many users pay attention to the book, how many users buy the book in advance, and how many users actually purchase the book in order can be captured on the Amazon website, which belong to one of the operation data.
The data objects captured by the buried point service and the crawler service are consistent, and the data objects are also used for acquiring network data of a certain publication resource or a plurality of publication resources. The difference with the crawler service is that the point burying service is to bury points in a publication selling website in advance and periodically and automatically capture the network data of the publication selling website.
Based on the log service, log files related to the operation data can be automatically captured in the business system 20 to help the business system 20 grasp its own operation status.
In this embodiment, capturing the operation data through one or more of the crawler service, the point burying service, and the log service can help the service system 20 to grasp its own operation state and external operation state, so that the locally stored publication resource data can be better maintained and managed.
In an embodiment of the present application, the step S300 includes the following steps S310 to S320:
and S310, establishing interface standards of various publication resources.
And S320, standardizing the data of each publication material according to the corresponding interface standard, and attaching labels.
Specifically, in order to establish an association relationship between different publication resource data, the interface standards of various types of publication resources need to be unified first. Different publication resource data, as long as they belong to the same type, must use this well-established interface standard. Different publication resource data can be understood, and a publication resource model is used as long as the data belong to the same type. By the method, key information and effective information in different publication resource data can be collected, so that the association relationship between different publication resource data is established.
In the embodiment, by establishing interface standards of various publication resources, standardizing each publication resource data according to the corresponding interface standard, and attaching a label, uniform modeling and uniform standardization of the same type of publication resource data are realized.
In an embodiment of the present application, the step S310 includes the following steps S311 to S315:
s311, establishing a book interface standard. The book interface standard comprises one or more labels of book name, original book name, author, original book author, translator, WBS number, material number, ISBN number, E-ISBN number, pricing and ERP pricing.
S312, establishing an audio interface standard. The audio interface standard includes one or more tags of a version number, subject classification, subject matter, total collection number, speaker, category, review, and duration.
And S313, establishing a video interface standard. The video interface standard includes one or more tags of version number, subject classification, subject matter, total collection number, speaker, category, review and duration.
And S314, establishing a picture interface standard. The video interface criteria include one or more tags of category, column, cover page, keyword, name, source, vignette, and thumbnail.
S315, establishing an information interface standard, wherein the information interface standard comprises one or more labels of reader ID, reader hierarchy, classification, province and city.
Specifically, of course, this embodiment is only illustrative of several common interface standard establishment methods for publication resource types. By taking audio-type publication resources as an example, the audio-type publication resource data is subjected to standardized processing based on the audio interface standard, so that one or more labels of version numbers, subject classifications, subject matters, total collection numbers, speakers, categories, audits and duration can be quickly attached to the audio-type publication resource data, and the core information of the audio-type publication resource data can be collected.
The book interface standard may further include one or more tags of cooperative publishers, ERP publication times, affiliated departments, first reader hierarchy, second reader hierarchy, applicable objects, keywords, key projects, prize winning information, and plan edits.
In an embodiment of the present application, the S300 includes the following steps:
and S330, performing media processing and content auditing on various materials in the data of each publication resource.
Specifically, in addition to performing media processing and content auditing on the material, media asset meta-information processing may also be performed on the material, and the meta-information of the material is acquired and asynchronously synchronized to the Elasticsearch engine 150. Only material of the video/audio/image type needs to acquire meta information.
The video meta information can acquire the duration, size and code rate of the video, and the video meta information can be asynchronously synchronized to the elastic search engine 150, so that the functions of size search, duration search and the like of the video can be realized.
The audio meta information includes the duration and size of the audio, and the audio meta information is asynchronously synchronized to the Elasticsearch engine 150, so that the functions of size search, duration search and the like of the audio can be realized.
The image meta information contains the size of the picture, and the meta information is asynchronously synchronized to the elastic search engine 150, so that the picture can be subjected to size search.
The content security audit can filter out some objectionable content related to yellowness and related to violence.
In the embodiment, media processing and content auditing are performed on various materials in each publication resource data, so that some unstructured data in the publication resource data can be converted into structured data, association can be conveniently established between different publication resource data in the follow-up process, and some yellow-related and riot-related bad contents can be filtered out through content security auditing.
In an embodiment of the present application, the S330 includes the following S331 to S336:
s331, selecting publication resource data.
And S332, performing document processing on the text materials in the publication resource data.
Specifically, the text-type material is generally a document in doc format, and document processing is performed on the document in doc format, so that one or more thumbnails can be generated based on the document in doc format, which is convenient for a user to know the approximate content of the document.
And S333, transcoding the picture material in the publication resource data to generate a transcoded picture material, and watermarking the transcoded picture material.
Specifically, the picture material is a picture in an image format. The conversion process may convert pictures of different resolution sizes into pictures of uniform resolution size. For example, several 1080P and 720P pictures are transcoded to collectively generate 480P pictures, which belongs to the standardization process of picture-class materials. The watermarking process is to add a watermark to the transcoded picture according to needs, for example, adding a watermark of a publisher name.
The thumbnail finally output in step S332 and the picture after the watermark is finally output in step S313 are both not unstructured data.
And S334, performing content identification processing on the transcoded picture material to generate structured picture material data.
Specifically, in this step, the transcoded picture material needs to be subjected to content identification processing, so as to generate structured picture material data. Optionally, json-structured text data is finally generated. The content recognition process may include face recognition, which may recognize the presence of a face in a picture.
And S335, performing audio transcoding processing and/or video transcoding processing and voice recognition processing on the audio and video material in the publication resource data to generate structured audio and video material data.
Specifically, the audio and video material includes video material and audio material. audio material needs to be video transcoded. Since video material may also contain audio, both audio transcoding and video transcoding are performed. And carrying out voice recognition on the transcoded data to finally generate structured audio and video material data. Optionally, json-structured text data is finally generated.
And S336, performing content security audit on the data generated after the processing of the four steps S332 to S335, and integrating the data into material data corresponding to the publication resource data after the audit is passed.
In particular, the content security audit may filter out some objectionable content related to yellowness and related to violence. These processed and content-secure audited material can be stored in the URC material library 140.
And S316, repeatedly executing the six steps S311 to S315 until all the various materials in the publication resource data are subjected to media processing and content auditing.
In the embodiment, different media processing is performed on different types of materials, so that the materials of the same type can be uniformly converted into the materials of the same format, and some unstructured data are converted into structured data, thereby facilitating subsequent processing.
In an embodiment of the present application, the S400 includes the following S410 to S420:
s410, establishing product data Elasticissearch indexes based on all the standardized publication resource data, converting all the standardized publication resource data into Mapping structures and storing the Mapping structures in an Elasticissearch engine.
And S420, establishing an Elasticissearch index of the material data based on all the material data, converting all the material data into a Mapping structure and storing the Mapping structure in an Elasticissearch engine.
In particular, the Mapping structure is a representation of data in the Elasticsearch engine 150. The embodiment generates the Elasticsearch index, and imports all the standardized publication resource data and all the material data into the Elasticsearch engine, so as to import all the publication resource data and the data management information related to the publication resource data into the Elasticsearch engine 150, and the beneficial effects are mainly 3 points:
the Elasticissearch search engine 150 has a fast search speed. The product name and summary field mapping are defined as text participles and can support full-text retrieval, and the keyword is used for keyword search.
The elastic search engine 150 is laterally compatible with the structure of various databases.
The Elasticissearch search engine 150 can make statistics of edition resources, and the statistics speed is high. The classification field of the product is defined as pattern participle, and tree structure statistics can be supported, namely a parent classification contains data of a child classification during query and statistics.
In one embodiment of the present application, the publication resource integration method further comprises the steps of:
s500, generating a searching DSL of the same ISBN publication resource, a searching DSL of the same subject publication resource, a searching DSL of the same name publication resource and a searching DSL of the same classified publication resource, so as to obtain an additionally expanded association relationship among different structured publication resource data.
Specifically, after the search DSL of the same ISBN publication resource is generated, it is possible to search multiple publication resources of the same ISBN, for example, multiple books of the same ISBN, in the Elasticsearch engine 150 to form a set of slave books. The search DSL of publication resources of the same ISBN can be as follows:
Figure BDA0003037766600000101
Figure BDA0003037766600000111
the DSL search for generating publication resources of the same topic can realize the search for multiple publication resources of the same topic, for example, multiple books of the same topic, in the Elasticsearch engine 150 to form a set of slave books. The search DSL of publication resources of the same topic may be as follows:
Figure BDA0003037766600000112
Figure BDA0003037766600000121
the search DSL for generating publication resources of the same name can realize the search of multiple publication resources of the same name, for example, multiple books of the same name, in the Elasticsearch engine 150 to form a collection of different publishers or different versions. The search DSL of a publication resource of the same name can be as follows:
Figure BDA0003037766600000122
Figure BDA0003037766600000131
a search DSL generating publication resources of the same category can implement a search for multiple publication resources of the same category within the Elasticsearch search engine 150. The search DSL for the same category of publication resources can be as follows:
Figure BDA0003037766600000132
in this embodiment, by generating different types of search DSLs, an additionally expanded association relationship between different structured publication resource data can be achieved, so that publication resources with the same characteristics can be associated, and uniform management is facilitated.
The present application also provides a publication resource integration system 10 for use with a business system 20.
As shown in fig. 2, in an embodiment of the present application, the publication resource integration system 10 includes a management unit 110, a data storage unit 120, a data processing unit 130, a URC material library 140, and an Elasticsearch search engine 150.
The management unit 110 is configured to execute the publication resource integration method provided in any one of the embodiments mentioned above. The data storage unit 120 is connected to the management unit 110. The data storage unit 120 includes a publication resource data storage unit 121 and an operation data storage unit 122. The data processing unit 130 is connected to the management unit 110. The data processing unit 130 is also connected to the data storage unit 120. The data processing unit 130 includes a data normalizing unit 131 and a material processing unit 132. The URC material library 140 is connected to the management unit 110. The Elasticsearch engine 150 is connected to the management unit 110.
Specifically, the URC material library 140 is used to store various types of material data after media processing and content review.
The technical features of the embodiments described above may be arbitrarily combined, the order of execution of the method steps is not limited, and for simplicity of description, all possible combinations of the technical features in the embodiments are not described, however, as long as there is no contradiction between the combinations of the technical features, the combinations of the technical features should be considered as the scope of the present description.
The above-mentioned embodiments only express several embodiments of the present application, and the description thereof is more specific and detailed, but not construed as limiting the scope of the present application. It should be noted that, for a person skilled in the art, several variations and modifications can be made without departing from the concept of the present application, which falls within the scope of protection of the present application. Therefore, the protection scope of the present application shall be subject to the appended claims.

Claims (10)

1. A method for publication resource integration, the method comprising:
acquiring a plurality of publication source data from a service system and synchronizing the publication source data to a data storage unit;
capturing operation data from a publication resource website and a service system, and sending the operation data to a data storage unit;
performing data processing on each publication resource data to standardize each type of publication resource data so as to establish association between different publication resource data;
establishing an Elasticissearch index based on all the publication resource data, and importing all the publication resource data into an Elasticissearch engine; the Elasticsearch index includes a plurality of records, each record corresponding to a publication resource data.
2. The method for integrating publication resources of claim 1, wherein the acquiring publication resource data from the business system and synchronizing the publication resource data to the data storage unit comprises:
and synchronizing a plurality of publishing resource data in the service system to the data storage unit based on the Dataworks data acquisition script.
3. The publication resource integration method of claim 2, wherein the capturing operation data from the publication resource website and the business system, and sending the operation data to the data storage unit, comprises:
and capturing operation data based on one or more of a crawler service, a buried point service and a log service, and sending the operation data to a data storage unit.
4. The publication resource integration method of claim 3, wherein the data processing of each publication resource data to standardize each type of publication resource data to establish an association between different publication resource data comprises:
establishing interface standards of various publication resources;
and carrying out standardization processing on the data of each publication resource according to the corresponding interface standard, and attaching a label.
5. The method for publication resource integration according to claim 4, wherein the establishing API interface standards for various types of publication resources comprises:
establishing a book interface standard, wherein the book interface standard comprises one or more labels of book name, original book name, author, original book author, translator, WBS number, material number, ISBN number, E-ISBN number, pricing and ERP pricing;
establishing audio interface standards, wherein the audio interface standards comprise one or more labels of version number, subject classification, subject matter, total collection number, speaker, category, audit and duration;
establishing a video interface standard, wherein the video interface standard comprises one or more labels of version number, subject classification, subject matter, total collection number, speaker, category, audit and duration;
establishing picture interface standards, wherein the video interface standards comprise one or more labels of classification, column, cover page, keyword, name, source, brief introduction and thumbnail;
establishing an information interface standard, wherein the information interface standard comprises one or more labels of reader ID, reader hierarchy, classification, province and city.
6. The method for integrating publication resources of claim 5, wherein the data processing of each publication resource data for standardizing each type of publication resource data to establish an association between different publication resource data further comprises:
and performing media processing and content auditing on various materials in the data of each publication resource.
7. The method for publication resource integration according to claim 6, wherein the media processing and content auditing of the various types of material in each publication resource data comprises:
selecting publication resource data;
performing document processing on the text materials in the publication resource data;
transcoding the picture materials in the publication resource data to generate transcoded picture materials, and watermarking the transcoded picture materials;
performing content identification processing on the transcoded picture material to generate structured picture material data;
performing audio transcoding processing and/or video transcoding processing and voice recognition processing on the audio and video material in the publication resource data to generate structured audio and video material data;
performing content security audit on the data generated after the four steps, and integrating the data into material data corresponding to the publication resource data after the audit is passed;
and repeatedly executing the six steps until all the materials in the publication material data are subjected to media processing and content verification.
8. The publication resource integration method of claim 7, wherein the creating an Elasticsearch index based on all the publication resources, and importing all the publication resources into an Elasticsearch engine comprises:
establishing an Elasticissearch index of product data based on all the publication resource data after the standardization processing, converting all the publication resource data after the standardization processing into a Mapping structure and storing the Mapping structure in an Elasticissearch engine;
establishing an Elasticissearch index of the material data based on all the material data, converting all the material data into a Mapping structure and storing the Mapping structure in an Elasticissearch engine.
9. The publication resource integration method of claim 8, further comprising:
and generating a searching DSL of the same ISBN publication resource, a searching DSL of the same subject publication resource, a searching DSL of the same name publication resource and a searching DSL of the same classified publication resource so as to obtain an additionally expanded association relationship among different structured publication resource data.
10. A publication resource integration system for use with a business system, comprising:
a management unit for performing the publication resource integration method of any one of claims 1-9;
the data storage unit is connected with the management unit and comprises a publication resource data storage unit and an operation data storage unit;
the data processing unit is respectively connected with the management unit and the data storage unit and comprises a data standardization unit and a material processing unit;
the URC material library is respectively connected with the management unit and the data processing unit;
and the Elasticissearch search engine is connected with the management unit.
CN202110448632.4A 2021-04-25 2021-04-25 Publication resource integration method and publication resource integration system Pending CN113177150A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110448632.4A CN113177150A (en) 2021-04-25 2021-04-25 Publication resource integration method and publication resource integration system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110448632.4A CN113177150A (en) 2021-04-25 2021-04-25 Publication resource integration method and publication resource integration system

Publications (1)

Publication Number Publication Date
CN113177150A true CN113177150A (en) 2021-07-27

Family

ID=76925465

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110448632.4A Pending CN113177150A (en) 2021-04-25 2021-04-25 Publication resource integration method and publication resource integration system

Country Status (1)

Country Link
CN (1) CN113177150A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116384947A (en) * 2023-06-01 2023-07-04 威海海洋职业学院 Publication issuing monitoring management system and method based on big data

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1924915A (en) * 2006-09-20 2007-03-07 中山大学 Database technique based library intelligent management system
US20090138430A1 (en) * 2007-11-28 2009-05-28 International Business Machines Corporation Method for assembly of personalized enterprise information integrators over conjunctive queries
CN102982701A (en) * 2012-12-03 2013-03-20 北京中加国道科技有限责任公司 Multimedia digital teaching resource system and establishing method
CN104699849A (en) * 2015-04-07 2015-06-10 同方知网数字出版技术股份有限公司 Digital library resource unified search system
CN111708773A (en) * 2020-08-13 2020-09-25 江苏宝和数据股份有限公司 Multi-source scientific and creative resource data fusion method
CN112131449A (en) * 2020-09-21 2020-12-25 西北大学 Implementation method of cultural resource cascade query interface based on elastic search

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1924915A (en) * 2006-09-20 2007-03-07 中山大学 Database technique based library intelligent management system
US20090138430A1 (en) * 2007-11-28 2009-05-28 International Business Machines Corporation Method for assembly of personalized enterprise information integrators over conjunctive queries
CN102982701A (en) * 2012-12-03 2013-03-20 北京中加国道科技有限责任公司 Multimedia digital teaching resource system and establishing method
CN104699849A (en) * 2015-04-07 2015-06-10 同方知网数字出版技术股份有限公司 Digital library resource unified search system
CN111708773A (en) * 2020-08-13 2020-09-25 江苏宝和数据股份有限公司 Multi-source scientific and creative resource data fusion method
CN112131449A (en) * 2020-09-21 2020-12-25 西北大学 Implementation method of cultural resource cascade query interface based on elastic search

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
何明: "物联网与数字营区 第2版", 国防工业出版社 *
张晓雁 等: "高校数字图书馆资源整合:破解现代远程教育"信息孤岛"效应", 《现代远程教育研究》 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116384947A (en) * 2023-06-01 2023-07-04 威海海洋职业学院 Publication issuing monitoring management system and method based on big data
CN116384947B (en) * 2023-06-01 2023-08-15 威海海洋职业学院 Publication issuing monitoring management system and method based on big data

Similar Documents

Publication Publication Date Title
CN109992645B (en) Data management system and method based on text data
Chowdhury et al. Introduction to digital libraries
CN109446344B (en) Intelligent analysis report automatic generation system based on big data
US9483464B2 (en) Method and system for managing semantic and syntactic metadata
KR20210040891A (en) Method and Apparatus of Recommending Information, Electronic Device, Computer-Readable Recording Medium, and Computer Program
CN103310025A (en) Unstructured-data description method and device
Ransom et al. Facets of user‐assigned tags and their effectiveness in image retrieval
Zadel et al. Web Services for Music Information Retrieval.
CN114356967A (en) Professional information collection and analysis application platform
Liu et al. Document processing and retrieval: texpros
Dobreski et al. Remodeling archival metadata descriptions for linked archives
Darmont et al. Data lakes for digital humanities
Weller et al. Folksonomy: the collaborative knowledge organization system
CN113177150A (en) Publication resource integration method and publication resource integration system
CN105159904A (en) Digital resource associated management method and system
Singh et al. Event-based modeling and processing of digital media
Scholtes et al. Big data analytics for e-discovery
Simon et al. Aspects of the Long-Term Preservation of Digitized Catalogue Data: Analysis of the Databases of Integrated Collection Management Systems
Prasad et al. Text analytics to data warehousing
CN110704421A (en) Data processing method, device, equipment and computer readable storage medium
Keiper et al. COLLATE-A Web-Based Collaboratory for Content-Based Access to and Work with Digitized Cultural Material.
Kroeze Towards a multidimensional linguistic database of Biblical Hebrew
Barbuti et al. A Pilot of Smart Digital Library Used-Centered: The Project SMARTER.
Wang et al. Research on Digital Resource Construction of University Library under Big Data
Zenkert et al. Practice-Oriented Approaches for Information and Metadata Management in a Content Management System-Learnings from the Smart City Project LOKAL-digital

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20210727

RJ01 Rejection of invention patent application after publication