CN102135977A - Filter processing method and system using high spot makeup language (HSML) parsing engine - Google Patents

Filter processing method and system using high spot makeup language (HSML) parsing engine Download PDF

Info

Publication number
CN102135977A
CN102135977A CN2010105694275A CN201010569427A CN102135977A CN 102135977 A CN102135977 A CN 102135977A CN 2010105694275 A CN2010105694275 A CN 2010105694275A CN 201010569427 A CN201010569427 A CN 201010569427A CN 102135977 A CN102135977 A CN 102135977A
Authority
CN
China
Prior art keywords
hsml
document
content
resolved
configuration file
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2010105694275A
Other languages
Chinese (zh)
Inventor
罗笑南
魏筝
朱建宝
陈任
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
GUANGDONG XINGHAI DIGITAL HOME INDUSTRY TECHNOLOGY RESEARCH INSTITUTE Co Ltd
Original Assignee
GUANGDONG XINGHAI DIGITAL HOME INDUSTRY TECHNOLOGY RESEARCH INSTITUTE Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by GUANGDONG XINGHAI DIGITAL HOME INDUSTRY TECHNOLOGY RESEARCH INSTITUTE Co Ltd filed Critical GUANGDONG XINGHAI DIGITAL HOME INDUSTRY TECHNOLOGY RESEARCH INSTITUTE Co Ltd
Priority to CN2010105694275A priority Critical patent/CN102135977A/en
Publication of CN102135977A publication Critical patent/CN102135977A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention provides a filter processing system using a high spot makeup language (HSML) parsing engine, and permits users to filter keywords of an HSML file. The system comprises a user interaction interface, a configuration file, an HSML parsing unit and an HSML keyword filter unit. The invention also discloses a filter processing method using the HSML parsing engine. Designation can be filtered through the system and the method so as to avoid illegal users transmitting dangerous or unhealthy content by utilizing the HSML files to bring harm to the society.

Description

A kind of filtration treatment method and system of using the HSML analytics engine
Technical field
The present invention relates to digital home technical field, be specifically related to a kind of filtration treatment method and system of the HSML of application analytics engine.The invention belongs to interactive TV page marks language (HSML) category.
Background technology
Fast development along with social informatization, interactive television and multimedia technology become the focus that people pay close attention to already, China plan whole nation in 2015 stops the broadcast of simulated television, thereby realizes that the digital television broadcasting TV is limited, satellite and the wireless whole nation cover.Cable television digitalization, can increase the program capacity greatly, colourful specialization, variation, objectification program are provided, distinct image quality and graceful tonequality more are provided, the user can also enjoy the service of various informations when enjoying radio and television services.The exploitation of miscellaneous service and development need be carried out standard to digital interactive TV service information on services, help the information butt joint between provider and the numerous content and service provider.
Digital television interactive service mark language (HSML) standard is expanded the XML language, formulation is at the SGML of digital television interactive service, business presents and information interaction is described to carrying out towards the digital television interactive service of the integration of three networks in realization, be convenient to adopt Intel Virtualization Technology that needed cross-domain sharing with integrated service content carried out the function extraction with abstract, in order to break through this bottleneck of the current interactive service content and the high degree of coupling of digital TV platform, realize the high speed development of the high-end value-added service of Digital Television industry.
In order to prevent that the lawless person from scattering dangerous or unhealthy content endangers society, on the network some sensitive informations are carried out keyword filtration now, google for example, search engines such as baidu all provide the filtering function of responsive key word.Author of the present invention finds not only can control from the link that network scatters for the filtration of key word in practice, also can control in that document is resolved content, prevents from that unhealthy content is resolved to come out.At present, the expansion of interaction content mainly paid close attention in digital television interactive service mark language (HSML), can't satisfy requirement in this respect.
The present invention is directed to digital television interactive service language (HSML) analytics engine deficiency in this respect, added the function that data key words is filtered, can protect the user can not be forced to accept some bad information, is an improvement aspect information security.
Summary of the invention
The object of the present invention is to provide a kind of filtration treatment method of the HSML of application analytics engine, allow the user that the HSML document is carried out keyword filtration.Another object of the present invention is to provide a kind of filtration treatment system of the HSML of application analytics engine simultaneously.Can filter appointment by the present invention, prevent that the lawless person from utilizing the HSML document to pass dangerous or unhealthy content, harm society.
Purpose one of the present invention is to be achieved by the following technical programs:
Described a kind of filtration treatment system of using the HSML analytics engine comprises User Interface, configuration file, HSML resolution unit, HSML keyword filtration unit.
Described User Interface provides the interface of user and HSML analytics engine, upwards receives user's order, can call the HSML engine downwards and realize user's request.
Described configuration file provides the information that loads the HSML document, blacklist and some user's operation information that needs filtration.The HSML engine is resolved and keyword filtration document according to configuration file.
Described HSML resolution unit has adopted the DOM analysis mode, and can judge the parsing success or not, if resolve successfully, then carries out next step operation, if unsuccessful, then resolves again, and repetitive operation surpasses three times and then resolves failure.
Described HSML keyword filtration unit, after HSML resolution unit and document tentatively resolved, to the label of gained and content respectively with configuration file in the blacklist preserved mate, if the match is successful, then the key word for filtering shields or substitutes it.
Another object of the present invention is achieved through the following technical solutions:
Described a kind of filtration treatment method of using the HSML analytics engine, comprise following flow process: at first the user opens the HSML document, beginning HSML process of analysis; Read configuration file then, need to determine the functional module of loading; Document is resolved, the HSML document is resolved to tree structure in the internal memory, parse label and content according to Xpath then, for other cell processing by DOM; Judge the parsing success or not after having resolved document, if resolve successfully, then carry out next step operation, if unsuccessful, then resolve again, repetitive operation is then resolved failure above three times, returns; The HSML analytics engine carries out content match respectively to label and content; If the match is successful, then this partial content is replaced to " you filter at institute's viewing content "; If there is not coupling, this content health is described, can directly resolve reorganization; After handling, more scattered element is reassembled into the HSML document.
Can filter appointment by the present invention, prevent that the lawless person from utilizing the HSML document to pass dangerous or unhealthy content, harm society.
Description of drawings
In order to be illustrated more clearly in the embodiment of the invention or technical scheme of the prior art, to do to introduce simply to the accompanying drawing of required use in embodiment or the description of the Prior Art below, apparently, accompanying drawing in describing below only is some embodiments of the present invention, for those of ordinary skills, under the prerequisite of not paying creative work, can also obtain other accompanying drawing according to these accompanying drawings.
Fig. 1 is the structural drawing of the filtration treatment system of a kind of HSML of application analytics engine of the present invention;
Fig. 2 is the process flow diagram of the filtration treatment method of a kind of HSML of application analytics engine of the present invention.
Embodiment
Below in conjunction with the accompanying drawing in the embodiment of the invention, the technical scheme in the embodiment of the invention is clearly and completely described, obviously, described embodiment only is the present invention's part embodiment, rather than whole embodiment.Based on the embodiment among the present invention, those of ordinary skills belong to the scope of protection of the invention not making all other embodiment that obtained under the creative work prerequisite.
As shown in Figure 1, for the present invention is a kind of can be to the one-piece construction block diagram of the HSML engine of data keyword filtration.This system comprises: User Interface 101, configuration file 102, HSML resolution unit 103, HSML keyword filtration unit 104.
User Interface 101: the interface of user and HSML analytics engine is provided, has upwards received user's order, can call the HSML engine downwards and realize user's request.
Configuration file 102: the information, the key word blacklist that load the HSML document are provided, and some other user's operation information.The HSML engine is determined the HSML document module that needs load according to the information of configuration file, and carries out keyword matching according to the blacklist in the configuration file.Simultaneously, the replacement document after the keyword matching also is kept in the configuration file.
HSML resolution unit 103: by DOM the HSML document is resolved to tree structure in the internal memory, parse label and content according to Xpath then, for other cell processing.After handling, more scattered element is reassembled into the HSML document.Can judge the parsing success or not after having resolved document, if resolve successfully, then carry out next step operation, if unsuccessful, then resolve again, repetitive operation surpasses three times and then resolves failure.
HSML keyword filtration 104: HSML resolution unit 103 is resolved the blacklist of being preserved in gained label and content and the allocation unit mate.If the match is successful, this label of group profile or content are the content that will filter.It is shielded, and call replacement document in the configuration file.Raw content is " you seen content filter ".
In order further to understand the operational scheme of system of the present invention, but describe below in conjunction with the process flow diagram of the HSML analytics engine operation of the present invention's keyword filtration of Fig. 2:
Step 201, the user opens the HSML document, beginning HSML process of analysis.
Step 202 reads configuration file, need to determine the functional module of loading.
Step 203 is resolved document, by DOM the HSML document is resolved to tree structure in the internal memory, parses label and content according to Xpath then, for other cell processing.
Step 204 has been resolved and has been judged behind the document and resolve success or not, if resolves successfully, then carries out next step operation, if unsuccessful, then resolve again, repetitive operation surpass three times then parsing fail, return.
Step 205, the HSML analytics engine carries out content match respectively to label and content.
Step 206 if the match is successful, then replaces to this partial content " you filter at institute's viewing content ".If there is not coupling, this content health is described, can directly resolve reorganization.
Step 207 after handling, is reassembled into the HSML document with scattered element again.
Need to prove, contents such as the information interaction between said apparatus and intrasystem each unit, implementation since with the inventive method embodiment based on same design, particular content can repeat no more referring to the narration among the inventive method embodiment herein.
One of ordinary skill in the art will appreciate that all or part of step in the whole bag of tricks of the foregoing description is to instruct relevant hardware to finish by program, this program can be stored in the computer-readable recording medium, storage medium can comprise: ROM (read-only memory) (ROM, Read Only Memory), random access memory (RAM, Random Access Memory), disk or CD etc.
More than to the filtration treatment method and system of a kind of HSML of application analytics engine that the embodiment of the invention provided, be described in detail, used specific case herein principle of the present invention and embodiment are set forth, the explanation of above embodiment just is used for helping to understand method of the present invention and core concept thereof; Simultaneously, for one of ordinary skill in the art, according to thought of the present invention, the part that all can change in specific embodiments and applications, in sum, this description should not be construed as limitation of the present invention.

Claims (3)

1. filtration treatment system of using the HSML analytics engine, it is characterized in that: this system comprises User Interface, configuration file, HSML resolution unit, HSML keyword filtration unit;
Described User Interface provides the interface of user and HSML analytics engine, upwards receives user's order, can call the HSML engine downwards and realize user's request;
Described configuration file provides the information that loads the HSML document, blacklist and some user's operation information that needs filtration;
Described HSML resolution unit has adopted the DOM analysis mode, and can judge the parsing success or not, if resolve successfully, then carries out next step operation, if unsuccessful, then resolves again, and repetitive operation surpasses three times and then resolves failure;
Described HSML keyword filtration unit, after HSML resolution unit and document tentatively resolved, to the label of gained and content respectively with configuration file in the blacklist preserved mate, if the match is successful, then the key word for filtering shields or substitutes it.
2. system according to claim 1 is characterized in that: the HSML engine is resolved and keyword filtration document according to configuration file.
3. filtration treatment method of using the HSML analytics engine is characterized in that: may further comprise the steps:
Step 1), the user opens the HSML document, beginning HSML process of analysis;
Step 2), reads configuration file, need to determine the functional module of loading;
Step 3) is resolved document, by DOM the HSML document is resolved to tree structure in the internal memory, parses label and content according to Xpath then, for other cell processing;
Step 4) has been resolved and has been judged behind the document and resolve success or not, if resolves successfully, then carries out next step operation, if unsuccessful, then resolve again, repetitive operation surpass three times then parsing fail, return;
Step 5), the HSML analytics engine carries out content match respectively to label and content;
Step 6) if the match is successful, then replaces to this partial content " you filter at institute's viewing content "; If there is not coupling, this content health is described, can directly resolve reorganization;
Step 7) after handling, is reassembled into the HSML document with scattered element again.
CN2010105694275A 2010-11-30 2010-11-30 Filter processing method and system using high spot makeup language (HSML) parsing engine Pending CN102135977A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2010105694275A CN102135977A (en) 2010-11-30 2010-11-30 Filter processing method and system using high spot makeup language (HSML) parsing engine

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2010105694275A CN102135977A (en) 2010-11-30 2010-11-30 Filter processing method and system using high spot makeup language (HSML) parsing engine

Publications (1)

Publication Number Publication Date
CN102135977A true CN102135977A (en) 2011-07-27

Family

ID=44295765

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2010105694275A Pending CN102135977A (en) 2010-11-30 2010-11-30 Filter processing method and system using high spot makeup language (HSML) parsing engine

Country Status (1)

Country Link
CN (1) CN102135977A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102769610A (en) * 2012-04-26 2012-11-07 新奥特(北京)视频技术有限公司 Method for selecting information from XML document
CN112969991A (en) * 2018-11-02 2021-06-15 索尼集团公司 Display control device, display control method, and program

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1513142A (en) * 2001-06-04 2004-07-14 Nct���Ź�˾ System and method for modifying a data stream using element parsing
CN101888507A (en) * 2010-06-30 2010-11-17 中山大学 Method and equipment for converting interaction application page mode of digital television

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1513142A (en) * 2001-06-04 2004-07-14 Nct���Ź�˾ System and method for modifying a data stream using element parsing
CN101888507A (en) * 2010-06-30 2010-11-17 中山大学 Method and equipment for converting interaction application page mode of digital television

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
於志文等: "XML技术在电视节目个性化系统中的应用", 《计算机工程》, vol. 30, no. 11, 30 June 2004 (2004-06-30) *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102769610A (en) * 2012-04-26 2012-11-07 新奥特(北京)视频技术有限公司 Method for selecting information from XML document
CN112969991A (en) * 2018-11-02 2021-06-15 索尼集团公司 Display control device, display control method, and program

Similar Documents

Publication Publication Date Title
US11113333B2 (en) Automated content tag processing for mobile media
CN101305555B (en) Multimedia middleware apparatus using metadata, method for controlling multimedia middleware
US20130166580A1 (en) Media Processor
CN102197391B (en) Digital custom data content injection mechanism for a content delivery network
CN102902733A (en) Information push method, device and system based on content subscription
CN102750299B (en) A kind of method of network information convergence
US20100205276A1 (en) System and method for exploiting a media object by a fruition device
US20180255103A1 (en) Metadata supporting cyber content sharing and governance and application method thereof
CN105302908A (en) E-book related audio resource recommendation method and apparatus
US20090083141A1 (en) Methods, systems, and computer program products for detecting and predicting user content interest
Hammond IV Regulating Broadband Communication Networks
CN102135977A (en) Filter processing method and system using high spot makeup language (HSML) parsing engine
CN101316259B (en) Method, device and system for contents filtering
JP4642903B2 (en) Message conversion system and method with enhanced context recognition
CN107608660A (en) Shared technical ability application process and system
CN103581159A (en) System and method for controlling Internet access through white list based on various terminals
CN102571757A (en) Method and system for providing web services
CN106156081A (en) A kind of list verifying method and equipment
CN103258012A (en) Method and device for acquiring picture information
CN108182191B (en) Hotspot data processing method and device
CN115688068A (en) Authority management method, system, equipment and storage medium
EP1809034A1 (en) Method of managing service information by a device for receiving digital services and device implementing the method
CN101917426A (en) RSS (Really Simple Syndication) subscribing method and client thereof
Moscibroda et al. Legal and regulation issues, SPICE Service Platform for Innovative Communication Environment-FP6 Integrated Project, D1. 6
CN104598487A (en) Method for obtaining book information on basis of IP address searching strategy

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20110727

WD01 Invention patent application deemed withdrawn after publication