CN102006513B - Analytical engine system suitable for HSML markup language - Google Patents

Analytical engine system suitable for HSML markup language Download PDF

Info

Publication number
CN102006513B
CN102006513B CN 201010569758 CN201010569758A CN102006513B CN 102006513 B CN102006513 B CN 102006513B CN 201010569758 CN201010569758 CN 201010569758 CN 201010569758 A CN201010569758 A CN 201010569758A CN 102006513 B CN102006513 B CN 102006513B
Authority
CN
China
Prior art keywords
hsml
markup language
analytics engine
document
mode
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN 201010569758
Other languages
Chinese (zh)
Other versions
CN102006513A (en
Inventor
罗笑南
戴洪学
孟思明
曹庭毅
朱建宝
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
GUANGDONG XINGHAI DIGITAL HOME INDUSTRY TECHNOLOGY RESEARCH INSTITUTE Co Ltd
Sun Yat Sen University
Original Assignee
GUANGDONG XINGHAI DIGITAL HOME INDUSTRY TECHNOLOGY RESEARCH INSTITUTE Co Ltd
Sun Yat Sen University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by GUANGDONG XINGHAI DIGITAL HOME INDUSTRY TECHNOLOGY RESEARCH INSTITUTE Co Ltd, Sun Yat Sen University filed Critical GUANGDONG XINGHAI DIGITAL HOME INDUSTRY TECHNOLOGY RESEARCH INSTITUTE Co Ltd
Priority to CN 201010569758 priority Critical patent/CN102006513B/en
Publication of CN102006513A publication Critical patent/CN102006513A/en
Application granted granted Critical
Publication of CN102006513B publication Critical patent/CN102006513B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

The invention discloses an analytical engine system suitable for HSML markup language, which comprises an inquiry system, a storage system and an indexing system, wherein the inquiry system is used for realizing HSML route inquiry function and using HSML file as a text file to provide a text inquiry function as well as a channel of accessing HSML engine for external programs; the storage system is used for storing various files and providing storage support for the inquiry system; the indexing system is used for servicing the inquiry system to index the HSML file; and the indexing system adopts DOM (Document Object Model) analytical mode and SAX (Simple API for XML) analytical mode. The technical scheme of the invention can be used for analyzing different HSML pages by different analytical techniques, can save processing time and processing cost, and can obtain page data in real time.

Description

A kind of analytics engine system that is applicable to the HSML markup language
Technical field
The invention belongs to digital home technical field, be specifically related to a kind of analytics engine system of the HSML of being applicable to markup language.
Background technology
Along with development and perfect and digital product and information service continuous infiltration and the day by day fusion in the family of Digital Television correlation technique, digital television interaction is used also increasingly abundant and various.
The digital television interaction application and service has the huge market demand, yet the page marks of using for digital television interaction does not at present have unified standard, there is very large difference the aspects such as each manufacturer resolves at page rendering, page marks element, page info exchange and transmission, and concrete equipment and the system hardware close-coupled of the implementation of page marks and each manufacturer.The disunity of page marks technology and implementation, caused same application not move at the terminal equipment of different vendor, also be difficult to the exchange of the shared and data of the information of carrying out between the simultaneously different application, this has greatly hindered the development of digital television interaction application and has popularized.
Present on the not enough basis that the digital television interaction application page mark mode that digital TV middleware standard and the middleware product of the domestic and international main flow of research and analysis adopt exists in concrete the application, practical application and the conditions of demand used in conjunction with present domestic digital television interaction simultaneously, study and formulated digital television interaction and use page markup language (HSML, Home Service Markup Language) standard.
The markup language of digital television interactive service analytics engine is a kind of key technology, is the basis of Digital Television tradition audio frequency and video business and increment interactive service service development.Analytics engine is for " markup language of digital television interactive service (HSML) standard criterion " and " interactive application visual modeling platform and fast Development environment " task, realization is to parsing and the preliminary treatment of HSML file, support is unified collaborative encapsulation to the service of various criterion, different agreement, set up the serviced component storehouse, service content presents and cooperation interaction to customize flexibly, the exploitation of standard interactive application service and terminal equipment product, and then provide technical foundation for interactive application presentation layer middleware, interactive service function.
Therefore, people wish to provide a kind of analytics engine system of the HSML of being applicable to markup language.
Summary of the invention
The invention provides a kind of analytics engine system of the HSML of being applicable to markup language, by this analytics engine, can operate the HSML page very easily, realization is to parsing and the preliminary treatment of HSML file, service content presents and cooperation interaction to customize flexibly, the exploitation of the service of standard interactive application and terminal equipment product, this analytics engine have stronger autgmentability with compatible.
The invention provides a kind of analytics engine system of the HSML of being applicable to markup language:
This analytics engine system comprises: inquiry system, storage system and directory system;
Described inquiry system is used for realizing HSML path query function, and the HSML document is regarded as text, and the full-text query function is provided, as the passage of external program access HSML engine;
Described storage system is used for storing various files, for described inquiry system provides storage support;
Described directory system is used to described inquiry system service, finishes the index to the HSML document, and described directory system adopts DOM analysis mode and SAX analysis mode.
Wherein, described inquiry system is divided into content search and structure query two parts.
Wherein, described storage system is based on the file realization, tables of data realizes or tree is realized.
Wherein, the storage mode of described storage system comprises: file system storage mode, mode map mode and Document mapping mode.
Wherein, described directory system comprises content indexing and configuration index two parts.
Technique scheme can be found out:
The embodiment of the invention has adopted two kinds of methods that technology combines, the analytics engine system that designs by the method, can operate the HSML page very easily, realization is to parsing and the preliminary treatment of HSML file, save memory headroom, support random access and can be used for broadcast environment, have a great deal of practical meanings.
Description of drawings
In order to be illustrated more clearly in the embodiment of the invention or technical scheme of the prior art, the below will do to introduce simply to the accompanying drawing of required use in embodiment or the description of the Prior Art, apparently, accompanying drawing in the following describes only is some embodiments of the present invention, for those of ordinary skills, under the prerequisite of not paying creative work, can also obtain according to these accompanying drawings other accompanying drawing.
Fig. 1 is analytics engine system configuration schematic diagram of the present invention;
Fig. 2 is the functional block diagram of analytics engine of the present invention system;
Fig. 3 is analytics engine schematic diagram of the present invention.
Embodiment
Below in conjunction with the accompanying drawing in the embodiment of the invention, the technical scheme in the embodiment of the invention is clearly and completely described, obviously, described embodiment only is the present invention's part embodiment, rather than whole embodiment.Based on the embodiment among the present invention, those of ordinary skills belong to the scope of protection of the invention not making all other embodiment that obtain under the creative work prerequisite.
The invention provides a kind of analytics engine system of the HSML of being applicable to markup language, can operate the HSML page very easily, realize parsing and preliminary treatment to the HSML file, this analytics engine has stronger autgmentability with compatible.
The HSML language is the expansion of XML language, therefore the present invention has used for reference the XML document analytics engine, with the HSML engine design become a lightweight, flexibly, Embedded HSML document management parts, so engine comprises storage system, directory system and three important component parts of inquiry system.
HSML engine language Analytic principle of the present invention be with a undressed sequence of characters string as input, and it is carried out some specific operations.Check at first whether the HSML data meet syntactic rule, guarantee that beginning label has the end mark of coupling with it, and there is not overlapping element, and according to DTD (Document Type Definition, DTD) or XML Schema confirm, examine its structure and content, resolve at last output.
Wherein, HSML language analytics engine can adopt following two kinds of analytic techniques:
The DOM analytic technique: DOM Document Object Model (Document Object Model, DOM) be one based on the analytic technique of tree type, it is converted into an analytic tree that comprises its content to XML document, and can travel through tree.
SAX analytic technique: for king-sized document, resolving and load whole document may very slow and very expensive source, particularly for Digital Television embedded software environment, therefore using SAX (Simple API for XML, SAX) resolver to process can be better.
Therefore, technical solution of the present invention can adopt different analytic techniques to carry out the page for the different HSML pages and resolve, save the processing time, has saved processing cost, acquisition page data that can be real-time.
Below in conjunction with accompanying drawing the present invention program is described in detail.
As depicted in figs. 1 and 2 be the functional block diagram of analytics engine of the present invention system, have relation of interdependence between each submodule.Analytics engine provided by the invention system has comprised inquiry system, storage system and three important component parts of directory system.
105: inquiry system
Inquiry system is divided into 108 content searches and 109 structure query two parts.The 105 basic XPath grammers (XPath 1.0 standards that W3C recommends) of observing when realizing HSML path query function, are also regarded the HSML document as general text and are processed, and the full-text query function is provided.Inquiry system is the main thoroughfare of external program access HSML engine.
104 of HSML engine is designed to provide simultaneously the interface of C language and the interface of JAVA language.
The system of HSML engine can be stretched, can can be contracted to the single HSML document of processing such as it according to the cutting of Digital Television embedded system demand, weakened directory system this moment, inquiry system just can obtain the arbitrary portion in the document based on the API Calls of DOM.When the scale of application system is larger, also can increase as required the functions such as the user authenticates, safety management, transaction management.
111: storage system
111 can be based on file realizes, also can be based on tables of data and realize, or realize based on tree.111 also provide storage to support for 112, the Parameter File that uses such as index file, index etc.; 111 can also be 105 storages that temporary file is provided; When 105 will take out source data, need to access 111; 111 itself also provide interface 117 to access to external program.
It is the set of a HSML document and part thereof that the storage administration of HSML analytics engine is defined as, it can be managed and control these collection of document information systems by one and keep, except can storage organization and semi-structured data, the ability that also should have various management XML data is such as independence, integrality, access rights and the redundancy etc. of data.
Summary is got up, and the storage management technique of HSML analytics engine has following functions:
1) comprises all functions that typical data storehouse (such as relational database) has;
2) all information in the storage HSML document;
3) comprising a lot of Web links in the HSML document, the information resources on the managing internet;
4) design the api function of standard, thereby improved the interoperability between application program.
According to the characteristics of HSML document itself, namely can adopt three kinds of modes take data as the master or take content as main HSML storage means: the one, file system storage mode, the 2nd, mode map mode, the 3rd, Document mapping mode.
112, directory system
112 mainly finish the index to the HSML document, are divided into 114 content indexings and 115 configuration index two parts.112 realization can be adopted various ways, such as inverted file, signature form, B+ tree etc.112 are mainly 105 services, are 105 to realize that full-text queries, boolean queries, path query provide support; 112 itself also provide some interfaces, for the external program access, browse the individual character index file such as order, can go to the access index system without inquiry system.
The following two kinds of technology of HSML index technology:
1) base index technology.This technology is processed the HSML document as plain text,
2) path query index technology has adopted traversal HSML document tree, obtains each paths, then the path is stored, for the usefulness of inquiry.
The analytics engine schematic diagram of system of the present invention such as Fig. 3:
In the present invention, the method that analytics engine adopts the DOM technology to combine with the SAX technology, DOM analytic technique are usually used in the service that document needs change frequently, and SAX then for the page greatly and the service that is of little use.
DOM Document Object Model (Document Object Model, DOM) technology 202 be one based on the analytic technique of tree type, it is converted into an analytic tree that comprises its content to XML document, and can travel through tree.The tree that has shown the DOM analytic modell analytical model among the figure, document 205 are roots of all dom trees, and this root has at least one child node, i.e. root element Element id.Another node is Element title, is used for the DTD explanation.Element unit have child node, and its child node also has the child node of oneself.Child node can be element, text, note, processing instruction and similar information.
DOM uses the W3C of the official standard that represents XML document with the mode of platform and language independent.DOM is with node or the pieces of information of hierarchical structure tissue, and this hierarchical structure allows the developer to seek customizing messages in tree.Advantage with the DOM analytic modell analytical model is that programming is easy, and the developer only need to call the instruction of achievement, then utilizes the required tree node of traversal APIs access to finish the work.Can add easily and revise the element in the tree.Yet need to process whole document in the time of owing to use DOM resolver, thus higher to the requirement of performance and internal memory, when especially running into very large file.Therefore illustrate 202 and be usually used in the service that the document needs change frequently, namely when application program often during at random access document.
For king-sized document, resolving and load whole document may very slow and very expensive source, and particularly for Digital Television embedded software environment, therefore using SAX (Simple API for XML, SAX) analytic technique 203 to process can be better.203 process be very similar to Streaming Media, resolve when receiving data and can begin immediately, rather than wait for that all data are processed.And, because application program just checks data when reading out data, therefore not needing to store data in the internal memory, this is a huge advantage for large-scale document.Application program even can when certain condition is met, stop to resolve.In general, SAX is also many soon than its replacer DOM.
203 have adopted based on event driven " pushing away " model.203 do not resemble the tree type of setting up a whole document in the of 202 represents, but activates a series of event when reading document.These events are pushed away to event handler, and event handler then provides the access to document content.When finding given label, it can activate a callback method, and the label of telling the method to formulate finds.Event handler has three basic forms of it:
1) for the DTDHandler that accesses XML DTD content;
2) be used for the ErrorHandler of rudimentary access parse error;
3) be used for the general types ContentHandler of access document content.
Shown among the figure that how the SAX resolver is by a callback mechanism reporting event.Resolver reads the input document and when 206 process document each event is pushed away to 211.
Compare with 202,203 provide better performance advantage.Be all nodes establishment objects except need not to resemble in 202,203 models can be used for broadcast environment, can register a plurality of processors here, but the parallel receive event.203 shortcoming is to exchange metamessage, and the built-in navigation support that provides such as XPath is not provided yet, and adds single pass and resolves, and this just means that it does not support random access, needing only to be suitable for the application program of single pass reading of content.The technology that both combine then can be avoided this result's generation, thereby produce better parsing effect.
Technique scheme can be found out: the embodiment of the invention has adopted two kinds of methods that technology combines, the analytics engine system that designs by the method, can operate the HSML page very easily, realization is to parsing and the preliminary treatment of HSML file, save memory headroom, support random access and can be used for broadcast environment, have a great deal of practical meanings.
More than the analytics engine system of a kind of HSML of being applicable to markup language that the embodiment of the invention is provided, be described in detail, used specific case herein principle of the present invention and execution mode are set forth, the explanation of above embodiment just is used for helping to understand method of the present invention and core concept thereof; Simultaneously, for one of ordinary skill in the art, according to thought of the present invention, all will change in specific embodiments and applications, in sum, this description should not be construed as limitation of the present invention.

Claims (5)

1. analytics engine system that is applicable to the HSML markup language is characterized in that:
This analytics engine system comprises: inquiry system, storage system and directory system;
Described inquiry system is used for realizing markup language of digital television interactive service HSML path query function, and the HSML document is regarded as text, and the full-text query function is provided, as the passage of external program access HSML engine;
Described storage system is used for storing various files, for described inquiry system provides storage support;
Described directory system is used to described inquiry system service, finishes the index to the HSML document, and described directory system adopts DOM analysis mode and SAX analysis mode.
2. the analytics engine system that is applicable to the HSML markup language according to claim 1 is characterized in that:
Described inquiry system is divided into content search and structure query two parts.
3. the analytics engine system that is applicable to the HSML markup language according to claim 1 and 2 is characterized in that:
Described storage system is based on the file realization, tables of data realizes or tree is realized.
4. the analytics engine system that is applicable to the HSML markup language according to claim 1 and 2 is characterized in that:
The storage mode of described storage system comprises: file system storage mode, mode map mode and Document mapping mode.
5. the analytics engine system that is applicable to the HSML markup language according to claim 1 and 2 is characterized in that:
Described directory system comprises content indexing and configuration index two parts.
CN 201010569758 2010-11-30 2010-11-30 Analytical engine system suitable for HSML markup language Active CN102006513B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 201010569758 CN102006513B (en) 2010-11-30 2010-11-30 Analytical engine system suitable for HSML markup language

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN 201010569758 CN102006513B (en) 2010-11-30 2010-11-30 Analytical engine system suitable for HSML markup language

Publications (2)

Publication Number Publication Date
CN102006513A CN102006513A (en) 2011-04-06
CN102006513B true CN102006513B (en) 2013-02-13

Family

ID=43813519

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 201010569758 Active CN102006513B (en) 2010-11-30 2010-11-30 Analytical engine system suitable for HSML markup language

Country Status (1)

Country Link
CN (1) CN102006513B (en)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1783090A (en) * 2004-11-30 2006-06-07 国际商业机器公司 Sharable two way method and system for switching between object model and XML
CN1871600A (en) * 2003-10-31 2006-11-29 株式会社爱可信 Method, program and device for rendering web page

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4735068B2 (en) * 2005-06-15 2011-07-27 沖電気工業株式会社 COMMUNICATION SYSTEM, COMMUNICATION METHOD, AND COMMUNICATION DEVICE

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1871600A (en) * 2003-10-31 2006-11-29 株式会社爱可信 Method, program and device for rendering web page
CN1783090A (en) * 2004-11-30 2006-06-07 国际商业机器公司 Sharable two way method and system for switching between object model and XML

Also Published As

Publication number Publication date
CN102006513A (en) 2011-04-06

Similar Documents

Publication Publication Date Title
US9535966B1 (en) Techniques for aggregating data from multiple sources
CN101290625A (en) XML document storage and search method
CN110716952A (en) Multi-source heterogeneous data processing method and device and storage medium
Maluf et al. NASA Technology Transfer System
EP1337937B1 (en) Method and apparatus for xml data storage, query rewrites, visualization, mapping and references
KR20110070724A (en) Apparatus and method for search open api and generation mashup block skeleton code
US7103872B2 (en) System and method for collecting and transferring sets of related data from a mainframe to a workstation
CN102006513B (en) Analytical engine system suitable for HSML markup language
Maluf et al. Netmark: A schema-less extension for relational databases for managing semi-structured data dynamically
Zhu et al. A Complex XML Schema to Map the XML Documents of Distance Education Technical Specifications into Relational Database
Havlik Building environmental semantic web applications with Drupal
Bhowmick et al. Representation of web data in a web warehouse
Huang et al. Development of a tourism GIS based on Web2. 0
Xu et al. Applying semantic web technologies for geodata integration and visualization
WO2008113642A1 (en) A method for providing interaction between a first content set and a second content set
Cobosi et al. An architecture for fast semantic retrieval in the film heritage domain
Rajugan et al. Extensible Web (xWeb): An XML-view based web engineering methodology
Martins et al. Complex Data Transformations in Digital Libraries with Spatio-Temporal Information
Dong Designing and Implementing of Online Message System Based on XML Technology
Morishima et al. Drag and Drop: Amalgamation of Authoring, Querying, and Restructuring for Multimedia View Construction
Shulin et al. Research on the Implement Scheme of Reconfigurable Function Navigation Based on JAXB
Phan et al. From Extensional Data to Intensional Data: AXML for XML
Hu et al. Interoperability middleware between geoscience and geospatial catalog protocols
Schlachter et al. Towards a search driven system architecture for environmental information portals
Brentjens Opengis web feature service for editing cadastral data

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant