WO2017193471A1 - Digital global sharing platform for preserving dongba ancient texts - Google Patents

Digital global sharing platform for preserving dongba ancient texts Download PDF

Info

Publication number
WO2017193471A1
WO2017193471A1 PCT/CN2016/090274 CN2016090274W WO2017193471A1 WO 2017193471 A1 WO2017193471 A1 WO 2017193471A1 CN 2016090274 W CN2016090274 W CN 2016090274W WO 2017193471 A1 WO2017193471 A1 WO 2017193471A1
Authority
WO
WIPO (PCT)
Prior art keywords
dongba
database
interpretation
meaning
event
Prior art date
Application number
PCT/CN2016/090274
Other languages
French (fr)
Chinese (zh)
Inventor
徐小力
吴国新
王红军
李宁
蒋章雷
王少红
Original Assignee
北京信息科技大学
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京信息科技大学 filed Critical 北京信息科技大学
Publication of WO2017193471A1 publication Critical patent/WO2017193471A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/68Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/683Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/78Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/783Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • G06F16/7844Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using original textual content or text extracted from visual content or transcript of audio data

Definitions

  • the invention relates to a text digital sharing platform, in particular to a digitized international sharing platform of the Dongba classic ancient book inheritance system.
  • the Naxi people are ethnic minorities living in the parallel area of Sanjiang (Nujiang, Minjiang, Jinshajiang) in the southwestern part of the Asian Himalayas.
  • the nation has written tens of thousands of classics with the oldest Dongba hieroglyphics.
  • Hieroglyphics are the only pictographs in the world that are still in use today. Many researchers believe that the character form of Dongba hieroglyphics is more primitive than the cuneiform writings of Sumer and Arabic, the holy books of ancient Egypt, and the Mayan scripts of Central America and Chinese oracle bones.
  • the Naxi Dongba classics written in this hieroglyphic form were listed as “World Memory Heritage” by UNESCO, which established that Naxi Dongba culture occupies an important and unique position in the history of cultural development in the world and China.
  • Dongba academic research is a worldwide research hotspot, and the collection agencies of various countries are eager to understand the contents of their collections, the academic research of Dongba classics has always been in a decentralized form. At the same time, due to the ability to interpret Dongba ancient books. Most of the old Dongba clergys have been over the ages. In order to realize the information sharing and dissemination of the Dongba classic world, it is urgent to establish a digital international sharing platform for the Dongba classic ancient books inheritance system.
  • the object of the present invention is to provide a digital international sharing platform for the Dongba classic ancient books inheritance system, which is based on modern information technology to share resources of Dongba classic ancient books, so that many people who are concerned about the Naxi Dongba culture can Seeing and using the Dongba classics from all over the world, we can provide resources and conditions for the worldwide study of Dongba classics.
  • the platform's digital network means is conducive to the rescue, protection and inheritance of Dongba classic ancient books.
  • the present invention adopts the following technical solutions: a digital international sharing platform for the Dongba classic ancient books inheritance system, which is characterized in that it includes a collecting institution, a Dongba classic ancient Chinese pictographic interpretation library, a world memory engineering database, and a management platform.
  • the collection institution transmits the collected information of the various Dongba classic ancient books to the Dongba classic ancient books pictographic interpretation library, and the Dongba classic ancient book pictographic interpretation library exchanges information with the world memory engineering database;
  • the Dongba Classic Ancient Pictographic Reading and Reading Library is registered and identity management, usage authority management, storage management, security management, and query management by the management platform module; the Dongba classic ancient books pictographic interpretation library will process the Dongba pictogram
  • the text is transmitted to the outside via the information dissemination module.
  • the method for establishing the Dongba classic ancient Chinese pictographic interpretation library is as follows: 1) collecting existing Dongba classic ancient books data and establishing a Dongba classic ancient pictographic interpretation database, the interpretation data
  • the library includes a graphic template library, an audio template library and a video template library;
  • the graphic template library includes a unique graphic code, a standard font, and a special shaped word; wherein the graphic content in the graphic template library includes Dongba pictograph, Dongba statement and east
  • the audio template library includes a unique audio code, an audio storage path, and a Naxi phonetic symbol; wherein the audio content in the audio template library also includes the Dongba text, the Dongba statement, and the Dongba event;
  • the video template library includes The only video code and video storage path;
  • the video content in the video template library also includes Dongba characters, Dongba statement and Dongba events; 2)
  • the Dongba Classical Ancient Pictographic Literature Reading Database transmits its Dongba ancient books information to the Dongba Classical Ancient Books Pictographic Literature Library, which contains the digital cataloging formats and rules required for the digital international sharing platform, according to the ancient books cataloging form. To classify and organize the Dongba classics in the Dongba classics, and complete the digital cataloging of Dongba classics.
  • the rule is: if the input condition attribute C1 is the value range A certain value Vc in V, then the decision attribute D is the corresponding decision value d1, that is, the attribute corresponding to the corresponding field when the output satisfies Vc1; if two condition attributes C2 and C3 are input, where C2 is one of the value fields V The value Vc2, C3 is a certain value Vc3 in the value field V, then the decision attribute D is the decision value d2, that is, the attribute corresponding to the corresponding field when the output satisfies Vc2 and Vc3.
  • the inference engine process is as follows: 1 matching: whether the relevant fact of the current solution problem in the interpretation database matches the condition part of the rule in the interpretation knowledge base, if two If the match is matched, the rules in the interpretation knowledge base are enabled, and the process proceeds to step 3 to execute according to the execution part of the rule; if the conditional part of the multiple rules simultaneously matches the facts related to the solution problem, proceed to step 2; 2 conflict resolution: Prioritize the condition parts of all rules in advance.
  • the fusion method of the semantic database, the semantic database and the event database comprises the following steps: 1 uniquely determining a Dongba text according to the semantic database encoding, according to Dongba
  • the graphic code, audio code and video code of the text synchronously retrieve the graphics, audio and video corresponding to the Dongba text, and present the content and meaning of the Dongba text; 2 fuzzy search according to the corresponding Chinese characters and the classification in the meaning database
  • the sentence database search for the Dongba statement that satisfies the same classification, that is, the Dongba statement is matched according to the individual Dongba characters, so that the graphic corresponding to the Dongba statement is retrieved according to the graphic code, the audio code and the video code in the sentence meaning database.
  • the frequent pattern mining process is: performing frequent pattern mining on the word meaning database, the sentence meaning database, and the event database in the interpretation database, and obtaining the meaning database and the sentence meaning.
  • Frequently used combination of words and meanings in the database and event database frequent pattern mining of rules in the interpretation knowledge base, summarizing the combination of frequently used conditional attributes and decision values; outputting frequent items to the interpretation knowledge base,
  • the word combination and syntactic combination which are the most suitable for the current interpretation are provided as the interpretation option.
  • the FP-growth algorithm is used to search the frequent meaning database as an example: 1 scan The meaning database, find out the frequently used words and the number of uses, make a list L of frequent items, sort by the number of times of use; 2 scan the database again, and continuously build FP-Tree by each word: FP-Tree
  • the root node is set to null; add each word item to the branch of FP-Tree one by one; 3 make the head node table and put all the same necklaces
  • find the path ending with a certain word meaning that is, the suffix pattern of the meaning of the word
  • the prefix path of the 5 word meaning constitutes the conditional pattern base of the meaning of the word
  • 6 constructing the meaning of the word according to the conditional pattern base
  • the conditional FP-tree obtains the conditional frequent item set; 7 the conditional frequent item set and the suffix pattern of the word meaning are combined to obtain the frequent item set of the word meaning.
  • the cluster analysis method is as follows: 1 using the number of events in the event database as the number of categories for cluster analysis, encoding the event, event name, classification, event
  • the content and graphic code information are used as the source data, and the fuzzy C-means method is used to calculate the clustering center of the event; 2 the meaning of the word meaning, the graphic code, the corresponding Chinese character, the classification in the meaning database corresponding to the two consecutive words in the interpretation process
  • the Chinese interpretation information is used as a sample to calculate the membership degree of the sample belonging to a cluster center; 3 the event corresponding to the membership degree with the largest value is regarded as the implicit event of the meaning of the word, and the relationship between the meaning and the event is Output to the interpretation of the knowledge base to improve the fusion method in the interpretation of the knowledge base.
  • the world memory engineering database includes a lost memory database, an endangered memory database, and a current activity database.
  • the information dissemination module outputs to the outside world by using a website, a virtual reality, a streaming media, a voice, and a text transmission mode, thereby realizing off-site text, graphics, 2D/3D animation, video, and sound multimedia playback. , the meaning of the phonetic information and the propagation of its common track information.
  • the present invention has the following advantages due to the above technical solutions: 1.
  • the present invention can provide For a rescue method based on modern information technology, to achieve the Dongba classic digital technology rescue and network technology dissemination.
  • the invention is beneficial to realize the global sharing and information exchange of Dongba classics, and has unique cultural protection value and international academic exchange value. It has profound social significance for the rescue and return of world-class cultural relics, the inheritance and spread of human and Chinese national cultures. Far-reaching historical significance.
  • Figure 1 is a schematic view showing the entire structure of the present invention.
  • the present invention provides a digital international sharing platform for the Dongba classic ancient books inheritance system, which includes a collecting institution, a Dongba classic ancient Chinese pictographic interpretation library, a world memory engineering database, a management platform module and an information dissemination module.
  • the collection agency will transfer the collected information of the various Dongba classics to the Dongba classics, and the Dongba classics will be exchanged with the world memory engineering database.
  • the Dongba classics will be recorded by the Pictographs.
  • the management platform module performs login and identity management, usage rights management, storage management, security management, query management, and access statistics.
  • the Dongba Classic Ancient Pictographic Reading and Reading Library will transmit the processed Dongba pictographs to the outside world through the information dissemination module.
  • the interpretation database includes a graphic template library, an audio template library and a video template library.
  • the graphic template library is to digitally record and image the Dongba classic ancient books data collected by the non-contact ancient book scanner and professional digital camera, and save it as a JPG file.
  • the graphic template library includes unique graphic codes, standard glyphs (JPG), and special-shaped characters (JPG); the graphic content in the graphic template library includes Dongba pictographs, Dongba statements, and Dongba events.
  • the audio template library uses audio editing software to clip the high-sampling digital frequency audio resources acquired by the digital recording device and save them as mp3 format files; the high sampling frequency is 320 kb/s.
  • the audio template library includes a unique audio code, an audio storage path (Nashi pronunciation) and a Naxi phonetic symbol; the audio content in the audio template library also includes Dongba text, Dongba statement and Dongba event.
  • the video template library is to edit the video resources of the collected Dongba classic ancient books data, load the commentary audio, explain the subtitles or soundtrack, and save them as wmv format files.
  • the video template library includes a unique video code and video storage path (video content including song and dance, ritual ceremony, etc.); the video content in the video template library also includes Dongba text, Dongba statement and Dongba event.
  • the interpretation database includes the semantic database, the sentence database and the event database.
  • Dongba pictographs from the existing Dongba classics (such as Mr. Fang Guoyu's "Nasi Pictographs" as standard fonts, encode each character in Unicode, and build Dongba using the existing TrueType method.
  • the pictographic standard template library the texts in the standard template library of Dongba pictograms have been compiled and recorded.
  • the fields of the semantic database include Unicode (word-like coding as the primary key), graphic code (PId), corresponding Chinese (chinese), classification (category), corresponding English (English), translator (interpreter), Chinese Translation, audio code (AId), Naxi phonetic (NaxiP) and video code (VId).
  • the sentence database includes sentence code, Dongba statement, corresponding Chinese, sentence meaning, classification, graphic code, audio code and video code.
  • the event database includes event name code, event name, classification, event content, graphics code, audio code, and video code.
  • the content categories include: philosophy, history, religion, medicine, astronomy, geography, folklore, flora and fauna, military, literature and art.
  • the rule is: if the input condition attribute C1 is a value Vc1 in the value field V, then the decision attribute D is the corresponding decision value d1, that is, the attribute corresponding to the corresponding field when the output meets Vc1; if two conditions are input Attributes C2, C3, where C2 is a value Vc2 in the range V, and C3 is a value Vc3 in the range V, then the decision attribute D is the decision value d2, that is, the output satisfies Vc2. Vc3 The attribute corresponding to the corresponding field.
  • the current solution problem is whether the relevant facts in the interpretation database match the conditional parts of the rules in the interpretation knowledge base. If the two match, the rules in the interpretation knowledge base are enabled, and the process proceeds to step 3 according to the execution part of the rule. Execution; if the conditional part of the multiple rules simultaneously matches the facts related to the solution problem, proceed to step 2;
  • the fusion method of the semantic database, the semantic database and the event database includes the following steps:
  • the fuzzy search meaning database is searched, and the Dongba statement that satisfies the same classification is searched, that is, the Dongba statement is matched according to the individual Dongba characters, so that the graphic code in the database according to the sentence meaning is , audio code, video code, retrieve the graphics, audio and video corresponding to the Dongba statement;
  • the fuzzy search event database is searched for the name of the Dongba event that satisfies the same classification, that is, the Dongba event is matched according to the individual Dongba characters, so that according to the graphic code in the event database,
  • the audio code and video code retrieve the graphics, audio and video corresponding to the Dongba event, thereby realizing the fusion of the semantic database, the semantic database and the event database.
  • the frequent pattern mining process is: mining the word meaning database, sentence meaning database and event database in the database for frequent pattern mining, and obtaining the word meaning database, the sentence meaning database, the frequently used word combination and the sentence combination in the event database. Frequent pattern mining of rules in the interpretation knowledge base, summarizing the combination of frequently used conditional attributes and decision values.
  • the frequent items are output to the interpretation knowledge base, and during the interpretation of the Dongba pictographic text, the word combination and sentence combination that best match the current interpretation sentence are provided as an interpretation option.
  • the word pattern database is used as an example to illustrate frequent pattern mining:
  • the prefix path of the 5 word meaning constitutes the conditional pattern base of the meaning of the word.
  • conditional FP-tree is constructed, and the conditional frequent itemset is obtained.
  • conditional frequent item set and the suffix pattern of the word meaning are merged to obtain a frequent item set of the word meaning.
  • the number of events in the event database is used as the number of categories for cluster analysis.
  • the event coding, event name, classification, event content, and graphic code are used as source data, and the fuzzy C-means method is used to calculate the cluster center of the event.
  • the Dongba Classical Ancient Books Pictographic Reading Database transmits its Dongba ancient books information to the Dongba Classical Ancient Books Pictographic Literature Library, which contains the digital cataloging formats and rules required for the digital international sharing platform, according to ancient books.
  • Catalogue form of Dongba classic ancient books pictographic interpretation database The Dongba classic ancient books in the middle class are sorted and sorted, and the digital cataloging of Dongba classic ancient books is completed.
  • the collection institution transmits the collected information of the various Dongba classics to the Dongba classics, and realizes the interconnection with the world's collection agencies, and brings together the world's famous libraries with Dongba classic collections. Relevant information on collections of museums, research institutes and institutions.
  • the World Memory Engineering Database includes a lost memory database, an endangered memory database, and a current activity database.
  • UNESCO Dongba Classical Ancient Pictographs Interpretation Library and the three databases in the World Memory Engineering Database are connected to exchange information, integrate existing resources, establish a book sharing query specification, and realize resource interconnection.
  • the information dissemination module uses a plurality of propagation modes such as a website, virtual reality, streaming media, voice, and text to output to the outside world, and realizes multimedia playback of text, graphics, 2D/3D animation, video, and sound in different places.
  • the transmission of sound and meaning information and its common track information showing the high-resolution image and audio information of the Naxi Dongba clergy on a specific classic verbatim sentence by word.

Abstract

A digital global sharing platform for preserving Dongba ancient texts comprises a collection mechanism, a Dongba ancient pictograph interpretive library, a global memory engineering database, a platform management module, and information transmission module. The collection mechanism transmits, to the Dongba ancient pictograph interpretive library, various Dongba ancient text information. The Dongba ancient pictograph interpretive library and the global memory engineering database exchange information with each other. The platform management module performs, for the Dongba ancient pictograph interpretive library, registration and identity management, usage permission management, storage management, security management, and query management. The Dongba ancient pictograph interpretive library transmits via the information transmission module to outside a processed Dongba pictograph. The solution can provide resources and create favorable conditions for studying Dongba classics in a global system.

Description

一种东巴经典古籍传承体系数字化国际共享平台A digital international sharing platform for Dongba classic ancient books inheritance system 技术领域Technical field
本发明涉及一种文字数字化共享平台,特别是关于一种东巴经典古籍传承体系数字化国际共享平台。The invention relates to a text digital sharing platform, in particular to a digitized international sharing platform of the Dongba classic ancient book inheritance system.
背景技术Background technique
纳西族是居住在亚洲喜马拉雅山以东中国西南部三江(怒江、澜沧江、金沙江)并流区域的少数民族,该民族用最古老的东巴象形文字写下了数万卷经典,其东巴象形文字是当今公认的世界上唯一还在使用的象形文字。众多学者认为东巴象形文字的文字形态比苏美尔和巴比伦的楔形文字、古埃及的圣书文字,以及中美洲的玛雅文字和中国甲骨文都更原始。2003年以该象形文字书写的纳西族东巴经典古籍被联合国教科文组织列为“世界记忆遗产”,确立了纳西东巴文化在世界及中国的文化发展史上占有重要独特的地位。The Naxi people are ethnic minorities living in the parallel area of Sanjiang (Nujiang, Minjiang, Jinshajiang) in the southwestern part of the Asian Himalayas. The nation has written tens of thousands of classics with the oldest Dongba hieroglyphics. Hieroglyphics are the only pictographs in the world that are still in use today. Many scholars believe that the character form of Dongba hieroglyphics is more primitive than the cuneiform writings of Sumer and Babylon, the holy books of ancient Egypt, and the Mayan scripts of Central America and Chinese oracle bones. In 2003, the Naxi Dongba classics written in this hieroglyphic form were listed as “World Memory Heritage” by UNESCO, which established that Naxi Dongba culture occupies an important and unique position in the history of cultural development in the world and China.
国际学界认为:对东巴文化的深入研究会进一步揭示世界古代人类文化之谜。东巴学术研究虽然是世界性的研究热点,而且各国收藏机构也都迫切地想了解自己收藏的经书的内容,但东巴经典的学术研究始终处于分散的型态;同时由于能够释读东巴古籍的老东巴祭司大都已年逾古稀,为了实现东巴经典世界范围的信息共享及传播,迫切需要建立东巴经典古籍传承体系数字化国际共享平台。The international academic community believes that the in-depth study of Dongba culture will further reveal the mystery of ancient human culture in the world. Although Dongba academic research is a worldwide research hotspot, and the collection agencies of various countries are eager to understand the contents of their collections, the academic research of Dongba classics has always been in a decentralized form. At the same time, due to the ability to interpret Dongba ancient books. Most of the old Dongba priests have been over the ages. In order to realize the information sharing and dissemination of the Dongba classic world, it is urgent to establish a digital international sharing platform for the Dongba classic ancient books inheritance system.
发明内容Summary of the invention
针对上述问题,本发明的目的是提供一种东巴经典古籍传承体系数字化国际共享平台,该平台基于现代信息化手段进行东巴经典古籍的资源共享,使得众多对纳西东巴文化关注的人群能够看到和使用世界各地收藏的东巴经典,能够为世界范围系统研究东巴经典提供资源及条件。同时,该平台的数字化网络化手段有利于东巴经典古籍的抢救、保护与传承。In view of the above problems, the object of the present invention is to provide a digital international sharing platform for the Dongba classic ancient books inheritance system, which is based on modern information technology to share resources of Dongba classic ancient books, so that many people who are concerned about the Naxi Dongba culture can Seeing and using the Dongba classics from all over the world, we can provide resources and conditions for the worldwide study of Dongba classics. At the same time, the platform's digital network means is conducive to the rescue, protection and inheritance of Dongba classic ancient books.
为实现上述目的,本发明采取以下技术方案:一种东巴经典古籍传承体系数字化国际共享平台,其特征在于:它包括收藏机构、东巴经典古籍象形文释读库、世界记忆工程数据库、管理平台模块和信息传播模块;所 述收藏机构将收藏到的各种东巴经典古籍信息传输至所述东巴经典古籍象形文释读库,所述东巴经典古籍象形文释读库与所述世界记忆工程数据库进行信息交互;所述东巴经典古籍象形文释读库由所述管理平台模块进行登录与身份管理、使用权限管理、存储管理、安全管理、查询管理;所述东巴经典古籍象形文释读库将处理后的东巴象形文字经所述信息传播模块传输至外界。In order to achieve the above object, the present invention adopts the following technical solutions: a digital international sharing platform for the Dongba classic ancient books inheritance system, which is characterized in that it includes a collecting institution, a Dongba classic ancient Chinese pictographic interpretation library, a world memory engineering database, and a management platform. Module and information dissemination module; The collection institution transmits the collected information of the various Dongba classic ancient books to the Dongba classic ancient books pictographic interpretation library, and the Dongba classic ancient book pictographic interpretation library exchanges information with the world memory engineering database; The Dongba Classic Ancient Pictographic Reading and Reading Library is registered and identity management, usage authority management, storage management, security management, and query management by the management platform module; the Dongba classic ancient books pictographic interpretation library will process the Dongba pictogram The text is transmitted to the outside via the information dissemination module.
在一个优选的实施例中,所述东巴经典古籍象形文释读库的建立方法如下:1)对现有东巴经典古籍资料进行采集并建立东巴经典古籍象形文释读资料库,该释读资料库包括图形模板库、音频模板库和视频模板库;所述图形模板库内包括唯一图形代码、标准字形、异形字;其中图形模板库中的图形内容有东巴象形文字、东巴语句和东巴事件;所述音频模板库内包括唯一音频代码、音频存储路径和纳西音标;其中音频模板库中的音频内容也包括东巴文字、东巴语句和东巴事件;所述视频模板库内包括唯一视频代码和视频存储路径;其中视频模板库中的视频内容也包括东巴文字、东巴语句和东巴事件;2)根据东巴经典古籍象形文释读资料库建立东巴经典古籍象形文释读数据库,该释读数据库包括词意数据库、句意数据库和事件数据库;所述词意数据库:提取现有东巴经典中的东巴象形文字作为标准字模,采用Unicode对每个字符进行编码,并利用现有TrueType方法建立东巴象形文标准模板库;将东巴象形文标准模板库中的文字已有释读资料进行整理录入;所述词意数据库的字段包括词意编码Unicode、图形代码、对应汉字、分类、对应英文、翻译员、中文释义、音频代码、纳西音标和视频代码;所述句意数据库包括句意编码、东巴语句、对应汉语、语句含义、分类、图形代码、音频代码和视频代码;所述事件数据库包括事件名称代码、事件名称、分类、事件内容、图形代码、音频代码和视频代码,其中内容分类包括:哲学、历史、宗教、医学、天文、地理、民俗、动植物、军事、文学和艺术;3)建立东巴经典古籍释读知识库对释读数据库进行管理:释读知识库根据释读规则对三种释读数据库进行释读内容的组合,并利用推理引擎促进释读数据库中词意数据库、句意数据库、事件数据库之间的融合;4)建立东巴经典古籍释读优化库,通过知识挖掘工具对释读数据库、释读知识库的内容进行频繁模式挖掘以及聚类分析,为释读数据库、释读知识库的释读规则优化及更新提供支持;5) 东巴经典古籍象形文释读资料库将其东巴古籍信息传输至东巴经典古籍象形文文献库,该文献库中预置有数字化国际共享平台所需的数字化编目格式和规则,根据古籍编目形式对东巴经典古籍象形文释读资料库中的东巴经典古籍进行分类、整理,完成东巴经典古籍的数字化编目。In a preferred embodiment, the method for establishing the Dongba classic ancient Chinese pictographic interpretation library is as follows: 1) collecting existing Dongba classic ancient books data and establishing a Dongba classic ancient pictographic interpretation database, the interpretation data The library includes a graphic template library, an audio template library and a video template library; the graphic template library includes a unique graphic code, a standard font, and a special shaped word; wherein the graphic content in the graphic template library includes Dongba pictograph, Dongba statement and east The audio template library includes a unique audio code, an audio storage path, and a Naxi phonetic symbol; wherein the audio content in the audio template library also includes the Dongba text, the Dongba statement, and the Dongba event; the video template library includes The only video code and video storage path; the video content in the video template library also includes Dongba characters, Dongba statement and Dongba events; 2) The interpretation of the Dongba classic ancient books based on the Dongba classics a database, the interpretation database includes a semantic database, a semantic database, and an event database; Take the Dongba pictograph in the existing Dongba classic as the standard font, encode each character in Unicode, and use the existing TrueType method to establish the Dongba pictographic standard template library; the text in the Dongba pictorial standard template library The reading data has been compiled and entered; the fields of the meaning database include Unicode, graphic code, corresponding Chinese characters, classification, corresponding English, translator, Chinese interpretation, audio code, Naxi phonetic symbol and video code; The database includes sentence code, Dongba statement, corresponding Chinese, sentence meaning, classification, graphic code, audio code and video code; the event database includes event name code, event name, classification, event content, graphic code, audio code And video code, including content classification: philosophy, history, religion, medicine, astronomy, geography, folklore, flora and fauna, military, literature and art; 3) establishing the Dongba classic ancient books interpretation knowledge base to manage the interpretation database: interpretation knowledge The library interprets the contents of the three interpretation databases according to the interpretation rules. Combine and use the inference engine to promote the integration of the semantic database, the semantic database and the event database in the database; 4) Establish the Dongba Classical Ancient Books Interpretation Optimization Library, and use the knowledge mining tools to interpret the database and interpret the knowledge base. Frequent pattern mining and cluster analysis provide support for interpretation and optimization of interpretation rules for reading and reading the knowledge base; 5) The Dongba Classical Ancient Pictographic Literature Reading Database transmits its Dongba ancient books information to the Dongba Classical Ancient Books Pictographic Literature Library, which contains the digital cataloging formats and rules required for the digital international sharing platform, according to the ancient books cataloging form. To classify and organize the Dongba classics in the Dongba classics, and complete the digital cataloging of Dongba classics.
在一个优选的实施例中,所述步骤3)中,所述释读规则如下:3.1)定义S为规则集,C={C1、C2...Cn}为条件属性集,V=(Vc1,Vc2...Vcn)是条件属性和决策属性的值域,D是决策属性集,(d1,d2,d3...dv)为决策值;3.2)规则为:如果输入条件属性C1为值域V中的某一值Vc1,那么决策属性D为对应的决策值d1,即输出满足Vc1时相应字段对应的属性;如果输入两个条件属性C2、C3,其中C2为值域V中的某一值Vc2,C3为值域V中的某一值Vc3,那么决策属性D为决策值d2,即输出满足Vc2、Vc3时相应字段对应的属性。In a preferred embodiment, in the step 3), the interpretation rule is as follows: 3.1) defining S as a rule set, C={C1, C2...Cn} as a conditional attribute set, V=(Vc1, Vc2...Vcn) is the range of conditional attributes and decision attributes, D is the set of decision attributes, (d1, d2, d3...dv) is the decision value; 3.2) The rule is: if the input condition attribute C1 is the value range A certain value Vc in V, then the decision attribute D is the corresponding decision value d1, that is, the attribute corresponding to the corresponding field when the output satisfies Vc1; if two condition attributes C2 and C3 are input, where C2 is one of the value fields V The value Vc2, C3 is a certain value Vc3 in the value field V, then the decision attribute D is the decision value d2, that is, the attribute corresponding to the corresponding field when the output satisfies Vc2 and Vc3.
在一个优选的实施例中,所述步骤3)中,所述推理引擎过程如下:①匹配:当前求解问题在释读数据库中的相关事实是否与释读知识库中规则的条件部分相匹配,如果两者匹配,则启用释读知识库中的规则,进入步骤③按规则的执行操作部分去执行;若同时存在多条规则的条件部分与求解问题相关事实相匹配,则进入步骤②;②冲突消解:预先给所有规则的条件部分设定优先级,当存在多条规则的条件部分与求解问题相关事实相匹配时,优先启用条件部分优先级较高的规则;③执行操作:执行启用规则的操作部分,经执行操作后,得到新的事实,将所得新事实送入当前释读数据库。In a preferred embodiment, in the step 3), the inference engine process is as follows: 1 matching: whether the relevant fact of the current solution problem in the interpretation database matches the condition part of the rule in the interpretation knowledge base, if two If the match is matched, the rules in the interpretation knowledge base are enabled, and the process proceeds to step 3 to execute according to the execution part of the rule; if the conditional part of the multiple rules simultaneously matches the facts related to the solution problem, proceed to step 2; 2 conflict resolution: Prioritize the condition parts of all rules in advance. When the condition part of the existence of multiple rules matches the facts related to solving the problem, the rule with higher priority condition is enabled preferentially; 3 Execution operation: operation part of executing the enable rule After the operation, new facts are obtained, and the new facts are sent to the current interpretation database.
在一个优选的实施例中,所述步骤3)中,所述词意数据库、句意数据库和事件数据库的融合方法包括以下步骤:①根据词意数据库编码唯一确定一个东巴文字,根据东巴文字的图形代码、音频代码、视频代码,同步检索出对应东巴文字的图形、音频及视频,呈现出东巴文字的内容与含义;②根据词意数据库中的对应汉字以及所属分类,模糊检索句意数据库,搜索出满足同一分类的东巴语句,即根据单独的东巴文字匹配出东巴语句,从而根据句意数据库中图形代码、音频代码、视频代码,检索出对应东巴语句的图形、音频及视频;③根据词意数据库中的对应汉字以及所属分类,模糊检索事件数据库,搜索出满足同一分类的东巴事件名称,即根据单独的东巴文字匹配出东巴事件,从而根据事件数据库中图形代码、音 频代码、视频代码,检索出对应东巴事件的图形、音频及视频,从而实现词意数据库、句意数据库、事件数据库的融合。In a preferred embodiment, in the step 3), the fusion method of the semantic database, the semantic database and the event database comprises the following steps: 1 uniquely determining a Dongba text according to the semantic database encoding, according to Dongba The graphic code, audio code and video code of the text synchronously retrieve the graphics, audio and video corresponding to the Dongba text, and present the content and meaning of the Dongba text; 2 fuzzy search according to the corresponding Chinese characters and the classification in the meaning database The sentence database, search for the Dongba statement that satisfies the same classification, that is, the Dongba statement is matched according to the individual Dongba characters, so that the graphic corresponding to the Dongba statement is retrieved according to the graphic code, the audio code and the video code in the sentence meaning database. , audio and video; 3 according to the corresponding Chinese characters in the word meaning database and the classification, fuzzy search event database, search for the name of the Dongba event that satisfies the same classification, that is, match the Dongba event according to the separate Dongba text, and thus according to the event Graphic code, sound in the database The frequency code and video code retrieve the graphics, audio and video corresponding to the Dongba event, thereby realizing the fusion of the semantic database, the sentence database and the event database.
在一个优选的实施例中,所述步骤4)中,所述频繁模式挖掘过程为:对释读数据库中的词意数据库、句意数据库、事件数据库进行频繁模式挖掘,得到词意数据库、句意数据库、事件数据库中频繁使用的词意组合、句意组合;对释读知识库中的规则进行频繁模式挖掘,归纳出频繁使用的条件属性与决策值的组合;将频繁项输出给释读知识库,在对东巴象形文进行释读过程中提供与当前释语句最匹配的词意组合、句意组合,作为释读选项供选择;采用FP-growth算法,频繁模式挖掘以词意数据库为例:①扫描词意数据库,找出频繁使用的词意以及使用次数,做出频繁项的列表L,按照使用次数递减排序;②再次扫描数据库,由每个词意不断构建FP-Tree:将FP-Tree的根节点设为null;把每个词意项逐个添加到FP-Tree的分枝上去;③做出头结点表,将所有相同的项链接起来;④根据头结点表找出以某个词意为结尾的路径,即词意的后缀模式;⑤词意的前缀路径构成词意的条件模式基;⑥根据条件模式基构建词意的条件FP-树,得到条件频繁项集;⑦条件频繁项集和词意的后缀模式合并,得到词意的频繁项集。In a preferred embodiment, in the step 4), the frequent pattern mining process is: performing frequent pattern mining on the word meaning database, the sentence meaning database, and the event database in the interpretation database, and obtaining the meaning database and the sentence meaning. Frequently used combination of words and meanings in the database and event database; frequent pattern mining of rules in the interpretation knowledge base, summarizing the combination of frequently used conditional attributes and decision values; outputting frequent items to the interpretation knowledge base, In the process of interpreting the Dongba pictograph, the word combination and syntactic combination which are the most suitable for the current interpretation are provided as the interpretation option. The FP-growth algorithm is used to search the frequent meaning database as an example: 1 scan The meaning database, find out the frequently used words and the number of uses, make a list L of frequent items, sort by the number of times of use; 2 scan the database again, and continuously build FP-Tree by each word: FP-Tree The root node is set to null; add each word item to the branch of FP-Tree one by one; 3 make the head node table and put all the same necklaces Then, according to the head node table, find the path ending with a certain word meaning, that is, the suffix pattern of the meaning of the word; the prefix path of the 5 word meaning constitutes the conditional pattern base of the meaning of the word; 6 constructing the meaning of the word according to the conditional pattern base The conditional FP-tree obtains the conditional frequent item set; 7 the conditional frequent item set and the suffix pattern of the word meaning are combined to obtain the frequent item set of the word meaning.
在一个优选的实施例中,所述步骤4)中,所述聚类分析方法如下:①将事件数据库中事件的个数作为聚类分析的类别数,将事件编码、事件名称、分类、事件内容、图形代码信息作为源数据,采用模糊C均值方法计算事件的聚类中心;②将释读过程中的连续两个词意对应的词意数据库中的词意编码、图形代码、对应汉字、分类、中文释义信息作为样本,计算样本隶属于某个聚类中心的隶属度;③将具有最大数值的隶属度所对应的事件作为词意的隐含事件,将词意与事件之间的关联关系输出给释读知识库,改进释读知识库中的融合方法。In a preferred embodiment, in the step 4), the cluster analysis method is as follows: 1 using the number of events in the event database as the number of categories for cluster analysis, encoding the event, event name, classification, event The content and graphic code information are used as the source data, and the fuzzy C-means method is used to calculate the clustering center of the event; 2 the meaning of the word meaning, the graphic code, the corresponding Chinese character, the classification in the meaning database corresponding to the two consecutive words in the interpretation process The Chinese interpretation information is used as a sample to calculate the membership degree of the sample belonging to a cluster center; 3 the event corresponding to the membership degree with the largest value is regarded as the implicit event of the meaning of the word, and the relationship between the meaning and the event is Output to the interpretation of the knowledge base to improve the fusion method in the interpretation of the knowledge base.
在一个优选的实施例中,所述世界记忆工程数据库包括失去的记忆数据库、濒危的记忆数据库和目前的活动数据库。In a preferred embodiment, the world memory engineering database includes a lost memory database, an endangered memory database, and a current activity database.
在一个优选的实施例中,所述信息传播模块采用网站、虚拟现实、流媒体、语音、文本传播方式向外界输出,实现异地的文本、图形、二维/三维动画、影像和声音多媒体的播放,音形义信息及其共轨信息的传播。In a preferred embodiment, the information dissemination module outputs to the outside world by using a website, a virtual reality, a streaming media, a voice, and a text transmission mode, thereby realizing off-site text, graphics, 2D/3D animation, video, and sound multimedia playback. , the meaning of the phonetic information and the propagation of its common track information.
本发明由于采取以上技术方案,其具有以下优点:1、本发明能够提 供一种基于现代信息化技术的抢救手段,实现东巴经典的数字化技术抢救及网络化技术传播。2、本发明有利于实现东巴经典的全球共享与信息交流,并具有独特文化保护价值和国际学术交流价值,对世界级文物抢救与回归、人类及中华民族文化传承与传播具有深刻社会意义及深远历史意义。The present invention has the following advantages due to the above technical solutions: 1. The present invention can provide For a rescue method based on modern information technology, to achieve the Dongba classic digital technology rescue and network technology dissemination. 2. The invention is beneficial to realize the global sharing and information exchange of Dongba classics, and has unique cultural protection value and international academic exchange value. It has profound social significance for the rescue and return of world-class cultural relics, the inheritance and spread of human and Chinese national cultures. Far-reaching historical significance.
附图说明DRAWINGS
图1是本发明的整体结构示意图。BRIEF DESCRIPTION OF THE DRAWINGS Figure 1 is a schematic view showing the entire structure of the present invention.
本发明最佳实施方式Best mode for carrying out the invention
下面结合附图和实施例对本发明进行详细的描述。The invention will now be described in detail in conjunction with the drawings and embodiments.
如图1所示,本发明提供一种东巴经典古籍传承体系数字化国际共享平台,其包括收藏机构、东巴经典古籍象形文释读库、世界记忆工程数据库、管理平台模块和信息传播模块。收藏机构将收藏到的各种东巴经典古籍信息传输至东巴经典古籍象形文释读库,东巴经典古籍象形文释读库与世界记忆工程数据库进行信息交互;东巴经典古籍象形文释读库由管理平台模块进行登录与身份管理、使用权限管理、存储管理、安全管理、查询管理、访问数量统计等。东巴经典古籍象形文释读库将处理后的东巴象形文字经信息传播模块传输至外界。As shown in FIG. 1 , the present invention provides a digital international sharing platform for the Dongba classic ancient books inheritance system, which includes a collecting institution, a Dongba classic ancient Chinese pictographic interpretation library, a world memory engineering database, a management platform module and an information dissemination module. The collection agency will transfer the collected information of the various Dongba classics to the Dongba classics, and the Dongba classics will be exchanged with the world memory engineering database. The Dongba classics will be recorded by the Pictographs. The management platform module performs login and identity management, usage rights management, storage management, security management, query management, and access statistics. The Dongba Classic Ancient Pictographic Reading and Reading Library will transmit the processed Dongba pictographs to the outside world through the information dissemination module.
上述实施例中,东巴经典古籍象形文释读库的建立方法如下:In the above embodiment, the method for establishing the Dongba Classical Ancient Pictograph Reading and Reading Library is as follows:
1)对现有东巴经典古籍资料进行采集并建立东巴经典古籍象形文释读资料库,该释读资料库包括图形模板库、音频模板库和视频模板库。1) Collecting the existing Dongba classic ancient books data and establishing a metaphysical interpretation database of Dongba classic ancient books. The interpretation database includes a graphic template library, an audio template library and a video template library.
图形模板库是将通过非接触式古籍扫描仪和专业数码照相机采集的东巴经典古籍资料图片进行数字化录入及图像处理,保存为JPG格式文件。图形模板库内包括唯一图形代码、标准字形(JPG)、异形字(JPG);其中图形模板库中的图形内容有东巴象形文字、东巴语句和东巴事件。The graphic template library is to digitally record and image the Dongba classic ancient books data collected by the non-contact ancient book scanner and professional digital camera, and save it as a JPG file. The graphic template library includes unique graphic codes, standard glyphs (JPG), and special-shaped characters (JPG); the graphic content in the graphic template library includes Dongba pictographs, Dongba statements, and Dongba events.
音频模板库是采用音频编辑软件对通过数字录音设备获取的高采样数字频率音频资源进行剪辑,保存为mp3格式文件;其中高采样频率为320kb/s。音频模版库内包括唯一音频代码、音频存储路径(纳西读音)和纳西音标;其中音频模板库中的音频内容也包括东巴文字、东巴语句和东巴事件。 The audio template library uses audio editing software to clip the high-sampling digital frequency audio resources acquired by the digital recording device and save them as mp3 format files; the high sampling frequency is 320 kb/s. The audio template library includes a unique audio code, an audio storage path (Nashi pronunciation) and a Naxi phonetic symbol; the audio content in the audio template library also includes Dongba text, Dongba statement and Dongba event.
视频模板库是将采集到的东巴经典古籍资料的视频资源进行剪辑,加载解说音频、解说字幕或配乐,保存为wmv格式文件。视频模版库内包括唯一视频代码和视频存储路径(视频内容包括歌舞、祭祀仪式等);其中视频模板库中的视频内容也包括东巴文字、东巴语句和东巴事件。The video template library is to edit the video resources of the collected Dongba classic ancient books data, load the commentary audio, explain the subtitles or soundtrack, and save them as wmv format files. The video template library includes a unique video code and video storage path (video content including song and dance, ritual ceremony, etc.); the video content in the video template library also includes Dongba text, Dongba statement and Dongba event.
2)根据东巴经典古籍象形文释读资料库建立东巴经典古籍象形文释读数据库,该释读数据库包括词意数据库、句意数据库和事件数据库。2) According to the Dongba Classical Ancient Pictographs Reading Database, the Dongba Classical Ancient Books Pictographic Reading Database is established. The interpretation database includes the semantic database, the sentence database and the event database.
词意数据库:提取现有东巴经典(例如方国瑜先生的《纳西象形文字谱》)中的东巴象形文字作为标准字模,采用Unicode对每个字符进行编码,并利用现有TrueType方法建立东巴象形文标准模板库;将东巴象形文标准模板库中的文字已有释读资料进行整理录入。Word Database: Extract the Dongba pictographs from the existing Dongba classics (such as Mr. Fang Guoyu's "Nasi Pictographs") as standard fonts, encode each character in Unicode, and build Dongba using the existing TrueType method. The pictographic standard template library; the texts in the standard template library of Dongba pictograms have been compiled and recorded.
词意数据库的字段包括词意编码(Unicode)(词意编码为主键)、图形代码(PId)、对应汉字(chinese)、分类(category)、对应英文(English)、翻译员(interpreter)、中文释义(Translation)、音频代码(AId)、纳西音标(NaxiP)和视频代码(VId)。The fields of the semantic database include Unicode (word-like coding as the primary key), graphic code (PId), corresponding Chinese (chinese), classification (category), corresponding English (English), translator (interpreter), Chinese Translation, audio code (AId), Naxi phonetic (NaxiP) and video code (VId).
句意数据库包括句意编码、东巴语句、对应汉语、语句含义、分类、图形代码、音频代码和视频代码。The sentence database includes sentence code, Dongba statement, corresponding Chinese, sentence meaning, classification, graphic code, audio code and video code.
事件数据库包括事件名称代码、事件名称、分类、事件内容、图形代码、音频代码和视频代码。其中内容分类包括:哲学、历史、宗教、医学、天文、地理、民俗、动植物、军事、文学和艺术。The event database includes event name code, event name, classification, event content, graphics code, audio code, and video code. The content categories include: philosophy, history, religion, medicine, astronomy, geography, folklore, flora and fauna, military, literature and art.
3)建立东巴经典古籍象形文释读知识库对释读数据库进行管理:释读知识库根据释读规则对三种释读数据库进行释读内容的组合,并利用推理引擎促进释读数据库中词意数据库、句意数据库、事件数据库之间的融合。3) Establishing the Dongba Classical Ancient Books Pictographic Reading Knowledge Base to manage the interpretation database: Interpreting the knowledge base to interpret the contents of the three interpretation databases according to the interpretation rules, and using the inference engine to facilitate the interpretation of the database of meanings and sentences in the database. , the fusion of event databases.
3.1)释读规则如下:3.1) The interpretation rules are as follows:
3.1.1)定义S为规则集,C={C1、C2...Cn}为条件属性集,V=(Vc1,Vc2...Vcn)是条件属性和决策属性的值域,D是决策属性集,(d1,d2,d3...dv)为决策值。3.1.1) Define S as the rule set, C={C1, C2...Cn} as the conditional attribute set, V=(Vc1, Vc2...Vcn) is the range of the conditional attribute and the decision attribute, and D is the decision The attribute set, (d1, d2, d3...dv) is the decision value.
3.1.2)规则为:如果输入条件属性C1为值域V中的某一值Vc1,那么决策属性D为对应的决策值d1,即输出满足Vc1时相应字段对应的属性;如果输入两个条件属性C2、C3,其中C2为值域V中的某一值Vc2,C3为值域V中的某一值Vc3,那么决策属性D为决策值d2,即输出满足Vc2、 Vc3时相应字段对应的属性。3.1.2) The rule is: if the input condition attribute C1 is a value Vc1 in the value field V, then the decision attribute D is the corresponding decision value d1, that is, the attribute corresponding to the corresponding field when the output meets Vc1; if two conditions are input Attributes C2, C3, where C2 is a value Vc2 in the range V, and C3 is a value Vc3 in the range V, then the decision attribute D is the decision value d2, that is, the output satisfies Vc2. Vc3 The attribute corresponding to the corresponding field.
例如:当输入条件属性C1为‘词意编码’属性时,通过规则进行判断,若Vc1=E900时,则D为d1,即输出相应字段对应的属性,如Category为天象,Chinese为天等信息,如表1所示。For example, when the input condition attribute C1 is the 'word meaning code' attribute, it is judged by the rule. If Vc1=E900, then D is d1, that is, the attribute corresponding to the corresponding field is output, such as Category for the sky, Chinese for the sky and the like. ,As shown in Table 1.
表1Table 1
Figure PCTCN2016090274-appb-000001
Figure PCTCN2016090274-appb-000001
3.2)推理引擎过程如下:3.2) The reasoning engine process is as follows:
①匹配:当前求解问题在释读数据库中的相关事实是否与释读知识库中规则的条件部分相匹配,如果两者匹配,则启用释读知识库中的规则,进入步骤③按规则的执行操作部分去执行;若同时存在多条规则的条件部分与求解问题相关事实相匹配,则进入步骤②;1 Match: The current solution problem is whether the relevant facts in the interpretation database match the conditional parts of the rules in the interpretation knowledge base. If the two match, the rules in the interpretation knowledge base are enabled, and the process proceeds to step 3 according to the execution part of the rule. Execution; if the conditional part of the multiple rules simultaneously matches the facts related to the solution problem, proceed to step 2;
②冲突消解:预先给所有规则的条件部分设定优先级,即值域V中的优先级为:Vc1>Vc2>…>Vcn,当存在多条规则的条件部分与求解问题相关事实相匹配时,优先启用条件部分优先级较高的规则;2 Conflict resolution: Prioritize the condition parts of all rules in advance, that is, the priority in the value field V is: Vc1>Vc2>...>Vcn, when the condition part of the existence of multiple rules matches the facts related to solving the problem , preferentially enable the rule with higher priority in the condition part;
③执行操作:执行启用规则的操作部分,经执行操作后,得到新的事实,将所得新事实送入当前释读数据库。3 Execute operation: Execute the operation part of the enable rule. After the operation, get the new fact and send the new fact to the current release database.
3.3)词意数据库、句意数据库和事件数据库的融合方法包括以下步骤:3.3) The fusion method of the semantic database, the semantic database and the event database includes the following steps:
①根据词意数据库编码唯一确定一个东巴文字,根据东巴文字的图形代码、音频代码、视频代码,同步检索出对应东巴文字的图形、音频及视频,呈现出东巴文字的内容与含义;1 According to the meaning database code, uniquely determine a Dongba text, according to the graphic code, audio code and video code of Dongba text, synchronously retrieve the graphics, audio and video corresponding to Dongba text, and present the content and meaning of Dongba text. ;
②根据词意数据库中的对应汉字以及所属分类,模糊检索句意数据库,搜索出满足同一分类的东巴语句,即根据单独的东巴文字匹配出东巴语句,从而根据句意数据库中图形代码、音频代码、视频代码,检索出对应东巴语句的图形、音频及视频;2 According to the corresponding Chinese characters in the meaning database and the classification, the fuzzy search meaning database is searched, and the Dongba statement that satisfies the same classification is searched, that is, the Dongba statement is matched according to the individual Dongba characters, so that the graphic code in the database according to the sentence meaning is , audio code, video code, retrieve the graphics, audio and video corresponding to the Dongba statement;
③根据词意数据库中的对应汉字以及所属分类,模糊检索事件数据库,搜索出满足同一分类的东巴事件名称,即根据单独的东巴文字匹配出东巴事件,从而根据事件数据库中图形代码、音频代码、视频代码,检索出对应东巴事件的图形、音频及视频,从而实现词意数据库、句意数据库、事件数据库的融合。3 According to the corresponding Chinese characters in the semantic database and the classification, the fuzzy search event database is searched for the name of the Dongba event that satisfies the same classification, that is, the Dongba event is matched according to the individual Dongba characters, so that according to the graphic code in the event database, The audio code and video code retrieve the graphics, audio and video corresponding to the Dongba event, thereby realizing the fusion of the semantic database, the semantic database and the event database.
4)建立东巴经典古籍释读优化库,通过知识挖掘工具对释读数据库、 释读知识库的内容进行频繁模式挖掘以及聚类分析,为释读数据库、释读知识库的释读规则优化及更新提供支持。4) Establish a Dongba Classical Ancient Books Interpretation and Optimization Library, and interpret the database through knowledge mining tools. Interpret the contents of the knowledge base for frequent pattern mining and cluster analysis, and provide support for the interpretation and optimization of interpretation rules for reading the database and interpreting the knowledge base.
4.1)频繁模式挖掘过程为:对释读数据库中的词意数据库、句意数据库、事件数据库进行频繁模式挖掘,得到词意数据库、句意数据库、事件数据库中频繁使用的词意组合、句意组合;对释读知识库中的规则进行频繁模式挖掘,归纳出频繁使用的条件属性与决策值的组合。将频繁项输出给释读知识库,在对东巴象形文进行释读过程中提供与当前释语句最匹配的词意组合、句意组合,作为释读选项供选择。4.1) The frequent pattern mining process is: mining the word meaning database, sentence meaning database and event database in the database for frequent pattern mining, and obtaining the word meaning database, the sentence meaning database, the frequently used word combination and the sentence combination in the event database. Frequent pattern mining of rules in the interpretation knowledge base, summarizing the combination of frequently used conditional attributes and decision values. The frequent items are output to the interpretation knowledge base, and during the interpretation of the Dongba pictographic text, the word combination and sentence combination that best match the current interpretation sentence are provided as an interpretation option.
采用FP-growth算法,以词意数据库为例阐述频繁模式挖掘:Using the FP-growth algorithm, the word pattern database is used as an example to illustrate frequent pattern mining:
①扫描词意数据库,找出频繁使用的词意以及使用次数,做出频繁项的列表L,按照使用次数递减排序。1 Scan the word database, find out the frequently used words and the number of uses, and make a list L of frequent items, sorted according to the number of uses.
②再次扫描数据库,由每个词意不断构建FP-Tree:将FP-Tree的根节点设为null;把每个词意项逐个添加到FP-Tree的分枝上。2 Scan the database again, and continue to build FP-Tree by each word: set the root node of FP-Tree to null; add each word meaning to the branch of FP-Tree one by one.
③做出头结点表,将所有相同的项链接起来。3 Make a head node table and link all the same items together.
④根据头结点表找出以某个词意为结尾的路径,即词意的后缀模式。4 Find the path ending with a certain meaning according to the head node table, that is, the suffix pattern of the meaning of the word.
⑤词意的前缀路径构成词意的条件模式基。The prefix path of the 5 word meaning constitutes the conditional pattern base of the meaning of the word.
⑥根据条件模式基构建词意的条件FP-树,得到条件频繁项集。6 According to the conditional pattern base, the conditional FP-tree is constructed, and the conditional frequent itemset is obtained.
⑦条件频繁项集和词意的后缀模式合并,得到词意的频繁项集。7 The conditional frequent item set and the suffix pattern of the word meaning are merged to obtain a frequent item set of the word meaning.
4.2)聚类分析方法如下:4.2) The cluster analysis method is as follows:
①将事件数据库中事件的个数作为聚类分析的类别数,将事件编码、事件名称、分类、事件内容、图形代码等信息作为源数据,采用模糊C均值方法计算事件的聚类中心。1 The number of events in the event database is used as the number of categories for cluster analysis. The event coding, event name, classification, event content, and graphic code are used as source data, and the fuzzy C-means method is used to calculate the cluster center of the event.
②将释读过程中的连续两个词意对应的词意数据库中的词意编码、图形代码、对应汉字、分类、中文释义等信息作为样本,计算样本隶属于某个聚类中心的隶属度。(2) The information of the word meaning coding, the graphic code, the corresponding Chinese characters, the classification, the Chinese interpretation and the like in the meaning database of the two consecutive words in the interpretation process are taken as samples, and the membership degree of the sample belonging to a certain cluster center is calculated.
③将具有最大数值的隶属度所对应的事件作为词意的隐含事件,将词意与事件之间的关联关系输出给释读知识库,改进释读知识库中的融合方法。3 The event corresponding to the membership degree with the largest value is taken as the implicit event of the meaning of the word, and the relationship between the meaning and the event is output to the interpretation knowledge base, and the fusion method in the interpretation knowledge base is improved.
5)东巴经典古籍象形文释读资料库将其东巴古籍信息传输至东巴经典古籍象形文文献库,该文献库中预置有数字化国际共享平台所需的数字化编目格式和规则,根据古籍编目形式对东巴经典古籍象形文释读资料库 中的东巴经典古籍进行分类、整理,完成东巴经典古籍的数字化编目。5) The Dongba Classical Ancient Books Pictographic Reading Database transmits its Dongba ancient books information to the Dongba Classical Ancient Books Pictographic Literature Library, which contains the digital cataloging formats and rules required for the digital international sharing platform, according to ancient books. Catalogue form of Dongba classic ancient books pictographic interpretation database The Dongba classic ancient books in the middle class are sorted and sorted, and the digital cataloging of Dongba classic ancient books is completed.
上述各实施例中,收藏机构将收藏到的各种东巴经典古籍信息传输至东巴经典古籍象形文释读库,实现与世界各收藏机构互联,汇集世界上拥有东巴经典藏品的著名图书馆、博物馆、研究所和院校收藏的相关资料信息。In each of the above embodiments, the collection institution transmits the collected information of the various Dongba classics to the Dongba classics, and realizes the interconnection with the world's collection agencies, and brings together the world's famous libraries with Dongba classic collections. Relevant information on collections of museums, research institutes and institutions.
收藏机构包括德国国家图书馆、哈佛大学燕京图书馆、华盛顿的美国国会图书馆、法国国家图书馆、法国巴黎语言文化大学图书馆、法国远东学院、法国吉美特博物馆、法国原始文化博物馆、英国国家图书馆、英国曼彻斯特大学图书馆,以及云南省博物馆、丽江东巴文化研究院、东巴文化博物院、北京东巴文化艺术发展促进会以及在大量田野调研中获得的资料。Collections include the German National Library, Harvard University Yanjing Library, the Library of Congress in Washington, the National Library of France, the Library of the University of Paris Language and Culture, the French Far East Institute, the French Gem Museum, the French Museum of Primitive Culture, and the United Kingdom. National Library, University of Manchester Library, and Yunnan Provincial Museum, Lijiang Dongba Culture Research Institute, Dongba Culture Museum, Beijing Dongba Culture and Art Development Promotion Association, and materials obtained in a large number of field research.
上述各实施例中,世界记忆工程数据库包括失去的记忆数据库、濒危的记忆数据库和目前的活动数据库。在联合国教科文组织支持下,东巴经典古籍象形文释读库与世界记忆工程数据库中的三个数据库连接进行信息交互,对现有资源进行整合,建立典籍共享查询规范,实现资源的互联互通。In each of the above embodiments, the World Memory Engineering Database includes a lost memory database, an endangered memory database, and a current activity database. With the support of UNESCO, the Dongba Classical Ancient Pictographs Interpretation Library and the three databases in the World Memory Engineering Database are connected to exchange information, integrate existing resources, establish a book sharing query specification, and realize resource interconnection.
上述各实施例中,信息传播模块采用网站、虚拟现实、流媒体、语音、文本等多种传播方式向外界输出,实现异地的文本、图形、二维/三维动画、影像和声音等多媒体的播放,音形义信息及其共轨信息的传播,展示纳西族东巴祭司对某册特定经典逐字逐句吟诵的高清晰度影像及音频信息。In the above embodiments, the information dissemination module uses a plurality of propagation modes such as a website, virtual reality, streaming media, voice, and text to output to the outside world, and realizes multimedia playback of text, graphics, 2D/3D animation, video, and sound in different places. The transmission of sound and meaning information and its common track information, showing the high-resolution image and audio information of the Naxi Dongba priest on a specific classic verbatim sentence by word.
上述各实施例仅用于说明本发明,各个步骤都是可以有所变化的,在本发明技术方案的基础上,凡根据本发明原理对个别步骤进行的改进和等同变换,均不应排除在本发明的保护范围之外。 The above embodiments are merely illustrative of the present invention, and various steps may be varied. On the basis of the technical solutions of the present invention, improvements and equivalent changes to individual steps in accordance with the principles of the present invention should not be excluded. Outside the scope of protection of the present invention.

Claims (9)

  1. 一种东巴经典古籍传承体系数字化国际共享平台,其特征在于:它包括收藏机构、东巴经典古籍象形文释读库、世界记忆工程数据库、管理平台模块和信息传播模块;所述收藏机构将收藏到的各种东巴经典古籍信息传输至所述东巴经典古籍象形文释读库,所述东巴经典古籍象形文释读库与所述世界记忆工程数据库进行信息交互;所述东巴经典古籍象形文释读库由所述管理平台模块进行登录与身份管理、使用权限管理、存储管理、安全管理、查询管理;所述东巴经典古籍象形文释读库将处理后的东巴象形文字经所述信息传播模块传输至外界。A digital international sharing platform for Dongba classic ancient books inheritance system, which is characterized in that it comprises a collecting institution, a Dongba classic ancient Chinese pictographic interpretation library, a world memory engineering database, a management platform module and an information dissemination module; the collection institution will collect The various Dongba classic ancient books information is transmitted to the Dongba classic ancient books pictographic interpretation library, and the Dongba classic ancient books pictographic interpretation library interacts with the world memory engineering database; the Dongba classic ancient books pictogram The document reading library is logged in and identity management, usage authority management, storage management, security management, and query management by the management platform module; the Dongba classic ancient book pictographic interpretation library will process the Dongba pictograph through the information. The propagation module is transmitted to the outside world.
  2. 如权利要求1所述的一种东巴经典古籍传承体系数字化国际共享平台,其特征在于:所述东巴经典古籍象形文释读库的建立方法如下:The digital international sharing platform of the Dongba classic ancient books inheritance system according to claim 1, wherein the method for establishing the Dongba classic ancient books pictographic interpretation library is as follows:
    1)对现有东巴经典古籍资料进行采集并建立东巴经典古籍象形文释读资料库,该释读资料库包括图形模板库、音频模板库和视频模板库;1) Collecting the existing Dongba classic ancient books data and establishing a metaphysical interpretation database of Dongba classic ancient books, the interpretation database includes a graphic template library, an audio template library and a video template library;
    所述图形模板库内包括唯一图形代码、标准字形、异形字;其中图形模板库中的图形内容有东巴象形文字、东巴语句和东巴事件;所述音频模板库内包括唯一音频代码、音频存储路径和纳西音标;其中音频模板库中的音频内容也包括东巴文字、东巴语句和东巴事件;所述视频模板库内包括唯一视频代码和视频存储路径;其中视频模板库中的视频内容也包括东巴文字、东巴语句和东巴事件;The graphic template library includes a unique graphic code, a standard font, and a special-shaped word; wherein the graphic content in the graphic template library includes Dongba pictograph, Dongba statement and Dongba event; the audio template library includes a unique audio code, The audio storage path and the Naxi phonetic symbol; wherein the audio content in the audio template library also includes the Dongba text, the Dongba statement, and the Dongba event; the video template library includes a unique video code and a video storage path; wherein the video template library The video content also includes Dongba script, Dongba statement and Dongba incident;
    2)根据东巴经典古籍象形文释读资料库建立东巴经典古籍象形文释读数据库,该释读数据库包括词意数据库、句意数据库和事件数据库;2) Establish a database of the Dongba classic ancient books pictographic interpretation according to the Dongba Classical Ancient Pictographs Reading Database, which includes the meaning database, the sentence database and the event database;
    所述词意数据库:提取现有东巴经典中的东巴象形文字作为标准字模,采用Unicode对每个字符进行编码,并利用现有TrueType方法建立东巴象形文标准模板库;将东巴象形文标准模板库中的文字已有释读资料进行整理录入;所述词意数据库的字段包括词意编码Unicode、图形代码、对应汉字、分类、对应英文、翻译员、中文释义、音频代码、纳西音标和视频代码;所述句意数据库包括句意编码、东巴语句、对应汉语、语句含义、分类、图形代码、音频代码和视频代码;所述事件数据库包括事件名称代码、事件名称、分类、事件内容、图形代码、音频代码和视频代码,其中内容分类包括:哲学、历史、宗教、医学、天文、地理、民俗、动植 物、军事、文学和艺术;The word meaning database: extract the Dongba pictograph in the existing Dongba classic as a standard font, encode each character in Unicode, and use the existing TrueType method to establish the Dongba pictographic standard template library; The text in the standard template library has been interpreted and entered; the fields of the meaning database include Unicode, graphic code, corresponding Chinese characters, classification, corresponding English, translator, Chinese interpretation, audio code, and Naxi phonetic symbols. And the video code; the sentence database includes sentence code, Dongba statement, corresponding Chinese, sentence meaning, classification, graphic code, audio code and video code; the event database includes event name code, event name, classification, event Content, graphic code, audio code and video code, including content classification: philosophy, history, religion, medicine, astronomy, geography, folklore, movement Things, military, literature and art;
    3)建立东巴经典古籍释读知识库对释读数据库进行管理:释读知识库根据释读规则对三种释读数据库进行释读内容的组合,并利用推理引擎促进释读数据库中词意数据库、句意数据库、事件数据库之间的融合;3) Establishing the Dongba Classical Ancient Books Interpretation Knowledge Base to manage the interpretation database: Interpreting the knowledge base to interpret the contents of the three interpretation databases according to the interpretation rules, and using the inference engine to facilitate the interpretation of the meaning database, sentence database and events in the database. Fusion between databases;
    4)建立东巴经典古籍释读优化库,通过知识挖掘工具对释读数据库、释读知识库的内容进行频繁模式挖掘以及聚类分析,为释读数据库、释读知识库的释读规则优化及更新提供支持;4) Establishing the Dongba Classical Ancient Books Interpretation and Optimization Library, and using the knowledge mining tools to perform frequent pattern mining and cluster analysis on the contents of the interpretation database and the interpretation of the knowledge base, and provide support for the interpretation and optimization of the interpretation rules of the interpretation database and the interpretation knowledge base;
    5)东巴经典古籍象形文释读资料库将其东巴古籍信息传输至东巴经典古籍象形文文献库,该文献库中预置有数字化国际共享平台所需的数字化编目格式和规则,根据古籍编目形式对东巴经典古籍象形文释读资料库中的东巴经典古籍进行分类、整理,完成东巴经典古籍的数字化编目。5) The Dongba Classical Ancient Books Pictographic Reading Database transmits its Dongba ancient books information to the Dongba Classical Ancient Books Pictographic Literature Library, which contains the digital cataloging formats and rules required for the digital international sharing platform, according to ancient books. The cataloguing form classifies and organizes the Dongba classic ancient books in the Dongba classic ancient books pictographic interpretation database, and completes the digital cataloging of Dongba classics.
  3. 如权利要求2所述的一种东巴经典古籍传承体系数字化国际共享平台,其特征在于:所述步骤3)中,所述释读规则如下:The digital international sharing platform of the Dongba classic ancient books inheritance system according to claim 2, wherein in the step 3), the reading rules are as follows:
    3.1)定义S为规则集,C={C1、C2...Cn}为条件属性集,V=(Vc1,Vc2...Vcn)是条件属性和决策属性的值域,D是决策属性集,(d1,d2,d3...dv)为决策值;3.1) Define S as the rule set, C={C1, C2...Cn} as the conditional attribute set, V=(Vc1, Vc2...Vcn) is the value field of the conditional attribute and the decision attribute, and D is the decision attribute set. , (d1, d2, d3...dv) is the decision value;
    3.2)规则为:如果输入条件属性C1为值域V中的某一值Vc1,那么决策属性D为对应的决策值d1,即输出满足Vc1时相应字段对应的属性;如果输入两个条件属性C2、C3,其中C2为值域V中的某一值Vc2,C3为值域V中的某一值Vc3,那么决策属性D为决策值d2,即输出满足Vc2、Vc3时相应字段对应的属性。3.2) The rule is: if the input condition attribute C1 is a value Vc1 in the range V, then the decision attribute D is the corresponding decision value d1, that is, the attribute corresponding to the corresponding field when the output satisfies Vc1; if two condition attributes C2 are input C3, where C2 is a value Vc2 in the range V, and C3 is a value Vc3 in the range V, then the decision attribute D is the decision value d2, that is, the attribute corresponding to the corresponding field when the output satisfies Vc2 and Vc3.
  4. 如权利要求2所述的一种东巴经典古籍传承体系数字化国际共享平台,其特征在于:所述步骤3)中,所述推理引擎过程如下:The digital international sharing platform of the Dongba classic ancient books inheritance system according to claim 2, wherein in the step 3), the inference engine process is as follows:
    ①匹配:当前求解问题在释读数据库中的相关事实是否与释读知识库中规则的条件部分相匹配,如果两者匹配,则启用释读知识库中的规则,进入步骤③按规则的执行操作部分去执行;若同时存在多条规则的条件部分与求解问题相关事实相匹配,则进入步骤②;1 Match: The current solution problem is whether the relevant facts in the interpretation database match the conditional parts of the rules in the interpretation knowledge base. If the two match, the rules in the interpretation knowledge base are enabled, and the process proceeds to step 3 according to the execution part of the rule. Execution; if the conditional part of the multiple rules simultaneously matches the facts related to the solution problem, proceed to step 2;
    ②冲突消解:预先给所有规则的条件部分设定优先级,当存在多条规则的条件部分与求解问题相关事实相匹配时,优先启用条件部分优先级较高的规则;2 conflict resolution: prioritize the condition parts of all rules in advance, and when the condition parts of the multiple rules match the facts related to solving the problem, the rules with higher priority conditions are preferentially enabled;
    ③执行操作:执行启用规则的操作部分,经执行操作后,得到新的事 实,将所得新事实送入当前释读数据库。3 Execute the operation: Execute the operation part of the enable rule, after the operation, get a new thing In fact, the new facts are sent to the current interpretation database.
  5. 如权利要求2所述的一种东巴经典古籍传承体系数字化国际共享平台,其特征在于:所述步骤3)中,所述词意数据库、句意数据库和事件数据库的融合方法包括以下步骤:The digital international sharing platform of the Dongba classic ancient books inheritance system according to claim 2, wherein in the step 3), the fusion method of the semantic database, the semantic database and the event database comprises the following steps:
    ①根据词意数据库编码唯一确定一个东巴文字,根据东巴文字的图形代码、音频代码、视频代码,同步检索出对应东巴文字的图形、音频及视频,呈现出东巴文字的内容与含义;1 According to the meaning database code, uniquely determine a Dongba text, according to the graphic code, audio code and video code of Dongba text, synchronously retrieve the graphics, audio and video corresponding to Dongba text, and present the content and meaning of Dongba text. ;
    ②根据词意数据库中的对应汉字以及所属分类,模糊检索句意数据库,搜索出满足同一分类的东巴语句,即根据单独的东巴文字匹配出东巴语句,从而根据句意数据库中图形代码、音频代码、视频代码,检索出对应东巴语句的图形、音频及视频;2 According to the corresponding Chinese characters in the meaning database and the classification, the fuzzy search meaning database is searched, and the Dongba statement that satisfies the same classification is searched, that is, the Dongba statement is matched according to the individual Dongba characters, so that the graphic code in the database according to the sentence meaning is , audio code, video code, retrieve the graphics, audio and video corresponding to the Dongba statement;
    ③根据词意数据库中的对应汉字以及所属分类,模糊检索事件数据库,搜索出满足同一分类的东巴事件名称,即根据单独的东巴文字匹配出东巴事件,从而根据事件数据库中图形代码、音频代码、视频代码,检索出对应东巴事件的图形、音频及视频,从而实现词意数据库、句意数据库、事件数据库的融合。3 According to the corresponding Chinese characters in the semantic database and the classification, the fuzzy search event database is searched for the name of the Dongba event that satisfies the same classification, that is, the Dongba event is matched according to the individual Dongba characters, so that according to the graphic code in the event database, The audio code and video code retrieve the graphics, audio and video corresponding to the Dongba event, thereby realizing the fusion of the semantic database, the semantic database and the event database.
  6. 如权利要求2所述的一种东巴经典古籍传承体系数字化国际共享平台,其特征在于:所述步骤4)中,所述频繁模式挖掘过程为:对释读数据库中的词意数据库、句意数据库、事件数据库进行频繁模式挖掘,得到词意数据库、句意数据库、事件数据库中频繁使用的词意组合、句意组合;对释读知识库中的规则进行频繁模式挖掘,归纳出频繁使用的条件属性与决策值的组合;将频繁项输出给释读知识库,在对东巴象形文进行释读过程中提供与当前释语句最匹配的词意组合、句意组合,作为释读选项供选择;采用FP-growth算法,频繁模式挖掘以词意数据库为例:The digitized international sharing platform of the Dongba classic ancient books inheritance system according to claim 2, wherein in the step 4), the frequent pattern mining process is: reading the meaning database and sentence meaning in the database. The database and event database are frequently modeled, and the word meaning database, the sentence meaning database, the frequently used word combination and the syntactic combination in the event database are obtained. The frequent patterns mining of the rules in the interpretation knowledge base are summarized, and the frequently used conditions are summarized. The combination of attribute and decision value; output frequent items to the interpretation knowledge base, provide the word combination and sentence combination that best match the current release sentence in the process of interpretation of Dongba pictogram, as an interpretation option for selection; adopt FP The -growth algorithm, frequent pattern mining uses the word meaning database as an example:
    ①扫描词意数据库,找出频繁使用的词意以及使用次数,做出频繁项的列表L,按照使用次数递减排序;1 Scan the word database, find out the frequently used words and the number of uses, and make a list L of frequent items, sorted according to the number of uses;
    ②再次扫描数据库,由每个词意不断构建FP-Tree:将FP-Tree的根节点设为null;把每个词意项逐个添加到FP-Tree的分枝上去;2 Scan the database again, and build FP-Tree by each word: set the root node of FP-Tree to null; add each word item to the branch of FP-Tree one by one;
    ③做出头结点表,将所有相同的项链接起来;3 Make a head node table and link all the same items together;
    ④根据头结点表找出以某个词意为结尾的路径,即词意的后缀模式;4 Find the path ending with a certain meaning according to the head node table, that is, the suffix pattern of the meaning of the word;
    ⑤词意的前缀路径构成词意的条件模式基; The prefix path of the 5 word meaning constitutes the conditional pattern base of the meaning of the word;
    ⑥根据条件模式基构建词意的条件FP-树,得到条件频繁项集;6 Build a conditional frequent item set based on the conditional pattern base to construct a conditional FP-tree;
    ⑦条件频繁项集和词意的后缀模式合并,得到词意的频繁项集。7 The conditional frequent item set and the suffix pattern of the word meaning are merged to obtain a frequent item set of the word meaning.
  7. 如权利要求2所述的一种东巴经典古籍传承体系数字化国际共享平台,其特征在于:所述步骤4)中,所述聚类分析方法如下:The digital international sharing platform of the Dongba classic ancient books inheritance system according to claim 2, wherein in the step 4), the cluster analysis method is as follows:
    ①将事件数据库中事件的个数作为聚类分析的类别数,将事件编码、事件名称、分类、事件内容、图形代码信息作为源数据,采用模糊C均值方法计算事件的聚类中心;1 The number of events in the event database is used as the number of categories for cluster analysis, and the event coding, event name, classification, event content, and graphic code information are used as source data, and the fuzzy C-means method is used to calculate the cluster center of the event;
    ②将释读过程中的连续两个词意对应的词意数据库中的词意编码、图形代码、对应汉字、分类、中文释义信息作为样本,计算样本隶属于某个聚类中心的隶属度;(2) The meaning code, the graphic code, the corresponding Chinese character, the classification, and the Chinese interpretation information in the meaning database corresponding to the two consecutive words in the interpretation process are taken as samples, and the membership degree of the sample belonging to a cluster center is calculated;
    ③将具有最大数值的隶属度所对应的事件作为词意的隐含事件,将词意与事件之间的关联关系输出给释读知识库,改进释读知识库中的融合方法。3 The event corresponding to the membership degree with the largest value is taken as the implicit event of the meaning of the word, and the relationship between the meaning and the event is output to the interpretation knowledge base, and the fusion method in the interpretation knowledge base is improved.
  8. 如权利要求1-7任一项所述的一种东巴经典古籍传承体系数字化国际共享平台,其特征在于:所述世界记忆工程数据库包括失去的记忆数据库、濒危的记忆数据库和目前的活动数据库。The digital international sharing platform of the Dongba classic ancient books inheritance system according to any one of claims 1 to 7, wherein the world memory engineering database comprises a lost memory database, an endangered memory database and a current activity database. .
  9. 如权利要求1-7任一项所述的一种东巴经典古籍传承体系数字化国际共享平台,其特征在于:所述信息传播模块采用网站、虚拟现实、流媒体、语音、文本传播方式向外界输出,实现异地的文本、图形、二维/三维动画、影像和声音多媒体的播放,音形义信息及其共轨信息的传播。 The digital international sharing platform of the Dongba classic ancient books inheritance system according to any one of claims 1 to 7, wherein the information dissemination module uses a website, a virtual reality, a streaming media, a voice, and a text transmission mode to the outside world. Output, to achieve off-site text, graphics, 2D / 3D animation, video and sound multimedia playback, sound and meaning information and its common track information.
PCT/CN2016/090274 2016-05-10 2016-07-18 Digital global sharing platform for preserving dongba ancient texts WO2017193471A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201610304528.7 2016-05-10
CN201610304528.7A CN105975597B (en) 2016-05-10 2016-05-10 A kind of international shared platform of Dongba classics ancient books succession system digitlization

Publications (1)

Publication Number Publication Date
WO2017193471A1 true WO2017193471A1 (en) 2017-11-16

Family

ID=56991547

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2016/090274 WO2017193471A1 (en) 2016-05-10 2016-07-18 Digital global sharing platform for preserving dongba ancient texts

Country Status (2)

Country Link
CN (1) CN105975597B (en)
WO (1) WO2017193471A1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111222190A (en) * 2020-01-16 2020-06-02 杭州四方博瑞科技股份有限公司 Ancient building management system
CN116149484A (en) * 2023-03-03 2023-05-23 湖北工业大学 Immersive experience method for assisting non-genetic culture propagation and related device
CN116303990A (en) * 2023-03-02 2023-06-23 越读(浙江)数字科技有限公司 Ancient book database management method, system, terminal and medium

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106503247A (en) * 2016-11-09 2017-03-15 天津赛因哲信息技术有限公司 Ancient book document management system and method based on knowledge discovery technology
CN109086257A (en) * 2017-06-14 2018-12-25 佛山辞荟源信息科技有限公司 Language coding processing method and system based on Chinese meaning
CN107609100A (en) * 2017-09-11 2018-01-19 叙永县图书馆 A kind of human body temperature type Library Resources Database Systems and method

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070130112A1 (en) * 2005-06-30 2007-06-07 Intelligentek Corp. Multimedia conceptual search system and associated search method
CN102033876A (en) * 2009-09-25 2011-04-27 叶高 Information management system method
CN104408559A (en) * 2014-11-19 2015-03-11 湖北福泰建筑装饰工程有限公司 Engineering information management system
CN104794470A (en) * 2015-05-04 2015-07-22 北京信息科技大学 Method of digital acquisition and image processing for Dongba pictograph
CN104794455A (en) * 2015-05-04 2015-07-22 北京信息科技大学 Dongba hieroglyphic recognizing method
CN104866607A (en) * 2015-06-04 2015-08-26 北京信息科技大学 Dongba character interpretation database building method

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102682338A (en) * 2011-03-17 2012-09-19 中国藏学研究中心北京藏医院 Information platform for arranging ancient Tibet medicine books
CN104111942B (en) * 2013-04-19 2017-11-28 新疆维吾尔自治区维吾尔医医院 Uighur medicine ancient books resource network searching platform

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070130112A1 (en) * 2005-06-30 2007-06-07 Intelligentek Corp. Multimedia conceptual search system and associated search method
CN102033876A (en) * 2009-09-25 2011-04-27 叶高 Information management system method
CN104408559A (en) * 2014-11-19 2015-03-11 湖北福泰建筑装饰工程有限公司 Engineering information management system
CN104794470A (en) * 2015-05-04 2015-07-22 北京信息科技大学 Method of digital acquisition and image processing for Dongba pictograph
CN104794455A (en) * 2015-05-04 2015-07-22 北京信息科技大学 Dongba hieroglyphic recognizing method
CN104866607A (en) * 2015-06-04 2015-08-26 北京信息科技大学 Dongba character interpretation database building method

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111222190A (en) * 2020-01-16 2020-06-02 杭州四方博瑞科技股份有限公司 Ancient building management system
CN111222190B (en) * 2020-01-16 2023-09-22 杭州四方博瑞科技股份有限公司 Ancient building management system
CN116303990A (en) * 2023-03-02 2023-06-23 越读(浙江)数字科技有限公司 Ancient book database management method, system, terminal and medium
CN116303990B (en) * 2023-03-02 2024-03-08 越读(浙江)数字科技有限公司 Ancient book database management method, system, terminal and medium
CN116149484A (en) * 2023-03-03 2023-05-23 湖北工业大学 Immersive experience method for assisting non-genetic culture propagation and related device
CN116149484B (en) * 2023-03-03 2023-11-07 湖北工业大学 Immersive experience method for assisting non-genetic culture propagation and related device

Also Published As

Publication number Publication date
CN105975597B (en) 2019-03-22
CN105975597A (en) 2016-09-28

Similar Documents

Publication Publication Date Title
WO2017193471A1 (en) Digital global sharing platform for preserving dongba ancient texts
CN111753099B (en) Method and system for enhancing relevance of archive entity based on knowledge graph
CN109271529B (en) Method for constructing bilingual knowledge graph of Xilier Mongolian and traditional Mongolian
CN111143479B (en) Knowledge graph relation extraction and REST service visualization fusion method based on DBSCAN clustering algorithm
CN112256888A (en) Geographic knowledge acquisition method
CN106502991B (en) Publication treating method and apparatus
WO2017193472A1 (en) Method of establishing digital dongba ancient text interpretive library
CN115080694A (en) Power industry information analysis method and equipment based on knowledge graph
Moncla et al. Mapping urban fingerprints of odonyms automatically extracted from French novels
CN116227594A (en) Construction method of high-credibility knowledge graph of medical industry facing multi-source data
El Abdouli et al. Mining tweets of Moroccan users using the framework Hadoop, NLP, K-means and basemap
Xiong et al. Oracle bone inscriptions big knowledge management and service platform
Fuller et al. Structuring, recording, and analyzing historical networks in the china biographical database
Coll Ardanuy et al. Person-centric mining of historical newspaper collections
Li et al. Artwork information embedding framework for multi-source Ukiyo-e record retrieval
Revanth et al. Nl2sql: Natural language to sql query translator
Chatzipanagiotou et al. Automated recognition of geographical named entities in titles of Ukiyo-e prints
Na et al. A method of collecting four character medicine effect phrases in TCM patents based on semi-supervised learning
Zeng et al. Construction of scenic spot knowledge graph based on ontology
Hong Application of Data Mining in Network Information Dynamic Push Software
Touya Lessons learned from research on multimedia summarization
CN113076468B (en) Nested event extraction method based on field pre-training
Xiong et al. OBSKP: Oracle Bone Studies Knowledge Pyramid Model With Applications
Li et al. Design of knowledge map construction based on convolutional neural network
Anand et al. Integrating and querying similar tables from PDF documents using deep learning

Legal Events

Date Code Title Description
NENP Non-entry into the national phase

Ref country code: DE

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16901426

Country of ref document: EP

Kind code of ref document: A1

122 Ep: pct application non-entry in european phase

Ref document number: 16901426

Country of ref document: EP

Kind code of ref document: A1