CN105354282A - XML file retrieval method and apparatus - Google Patents

XML file retrieval method and apparatus Download PDF

Info

Publication number
CN105354282A
CN105354282A CN201510728080.7A CN201510728080A CN105354282A CN 105354282 A CN105354282 A CN 105354282A CN 201510728080 A CN201510728080 A CN 201510728080A CN 105354282 A CN105354282 A CN 105354282A
Authority
CN
China
Prior art keywords
xml
selector switch
descriptor
rule
new
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201510728080.7A
Other languages
Chinese (zh)
Inventor
程少虹
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Qingdao Haier Intelligent Home Appliance Technology Co Ltd
Original Assignee
Qingdao Haier Intelligent Home Appliance Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Qingdao Haier Intelligent Home Appliance Technology Co Ltd filed Critical Qingdao Haier Intelligent Home Appliance Technology Co Ltd
Priority to CN201510728080.7A priority Critical patent/CN105354282A/en
Publication of CN105354282A publication Critical patent/CN105354282A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/80Information retrieval; Database structures therefor; File system structures therefor of semi-structured data, e.g. markup language structured data such as SGML, XML or HTML
    • G06F16/83Querying

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses an XML file retrieval method and apparatus. The method comprises: analyzing an input XML character string via an SAX parser; when a new element is analyzed, acquiring description information of the element; and when the acquired description information is matched with a preset retrieval rule, storing the description information of the element to a result set and simultaneously, informing a user to process a retrieval process. According to the XML file retrieval method provided by the invention, a file does not need to be read into a memory in advance, so that a memory space can be effectively saved; and by using a style selector as a description means for to-be-searched contents, most of HTML and XML information can be shown, and thus, the XML file retrieval method is very universal and development work is effectively saved.

Description

A kind of XML file search method and device
Technical field
The present invention relates to field of computer technology, particularly relate to a kind of XML file search method and device.
Background technology
NodeJS is a platform set up based on GoogleV8 engine, for building fast response time easily, being easy to the network application of expansion.NodeJS use case drives, Non-Blocking I/O model and be able to light weight and efficient, is highly suitable for the data-intensive real-time application that distributed apparatus runs.NodeJS is as service end application technology, slowly ripe, has a large amount of modules of increasing income based on NodeJS, it is easy to use a lot of in community.
XML (extensibleMarkupLanguage) is a kind of extend markup language of text formatting of simple and flexible, structural markup language is made it have for electroactive marker son file, can be used for flag data, definition data type, be the source language that the markup language of a kind of user of permission to oneself defines.As service end technology, XML is widely used as configuration file and Data Transport Protocol, and the content comprised is extremely abundant.But on NodeJS, use the retrieval of JavaScript language to XML information very inconvenient.At present, also analytical tool powerful as Dom4j does not carry out processing XML file.One as XML is expanded, HTML (HyperTextMarkupLanguage, HyperText Markup Language) is as the major technique of page presentation, and the quantity of file is huge especially, the quantity of information comprised is larger, necessarily very large to the Search Requirement of this class file.And NodeJS is not fine to the process support of HTML.
See Fig. 1, at present on NodeJS during analyzing XML file, the XML file that usually will operate is loaded in internal memory, then resolves to Document object, then carries out DOM (DocumentObjectModel DOM Document Object Model) operation.And by this mode of Document resolver structure DOM object, need whole XML file all to load into internal memory and just can carry out parse operation, if Documents Comparison is large, be very large to memory consumption; In addition, construct an exercisable Document object, need each element to be configured to object, can be lower for larger file detection.
Summary of the invention
Technical matters to be solved by this invention is to provide a kind of XML file search method and device, in order to solve the problem that when prior art retrieves XML large files on NodeJS, memory consumption seriously even cannot be resolved.
According to one aspect of the present invention, a kind of XML file search method is provided, comprises:
Resolved by the XML character string of SAX resolver to input;
After being resolved to new element, obtain the descriptor of element;
When the described descriptor obtained matches with the search rule pre-set, then described descriptor is saved to results set, notifies that user processes retrieval process simultaneously.
Further, when arranging search rule, comprising:
Utilize pattern resolver to resolve the style rule that user inputs, and generate the list of increment formula;
According to each the increment formula simple structure selector switch in the list of described increment formula;
According to the logical relation constitutive logic selector switch between increment formula;
Described simple selector and described logic selector switch are combined into one style selector switch, for describing described search rule.
Further, when the described descriptor obtained is mated with the search rule pre-set, comprising:
Obtain the descriptor of element, if described descriptor meets the rule that described simple selector describes, then proceed logical relation coupling;
When described simple selector meets the logical relation of described logic selector switch definition, then this Match of elemental composition success.
Further, described method comprises:
When the input of new XML character string is come in, after described XML character string is carried out buffer memory, more described SAX resolver is transferred to resolve.
Further, described method comprises:
When described SAX resolver resolves starts to new element, new element is pressed into storehouse;
When described SAX resolver resolves is to the element end mark of this element, this element is popped.
Further, described method comprises:
After having resolved one section of XML, current element stack is carried out buffer memory;
When there being new XML file input, then the described element stack of buffer memory before recovering to use.
According to one aspect of the present invention, a kind of XML file indexing unit is provided, comprises:
Parsing module, for resolving by the XML character string of SAX resolver to input;
Acquisition module, for after being resolved to new element, obtains the descriptor of element;
Matching module, for when the described descriptor obtained matches with the search rule pre-set, is then saved to results set by described descriptor, notifies that user processes retrieval process simultaneously.
Further, described device also comprises rule and arranges module:
For when arranging search rule, utilize pattern resolver to resolve the style rule that user inputs, and generate the list of increment formula;
According to each the increment formula simple structure selector switch in the list of described increment formula;
According to the logical relation constitutive logic selector switch between increment formula;
Described simple selector and described logic selector switch are combined into one style selector switch, for describing described search rule.
Further, described matching module specifically for:
Obtain the descriptor of element, if described descriptor meets described simple selector description rule, then proceed logical relation coupling;
When relation between described simple selector meets the logical relation of described logic selector switch definition, then this Match of elemental composition success.
Further, described device comprises the first cache module:
For when the input of new XML character string is come in, after described XML character string is carried out buffer memory, more described SAX resolver is transferred to resolve.
Further, described device comprises the second cache module:
When described SAX resolver resolves starts to new element, new element is pressed into storehouse;
When described SAX resolver resolves is to the element end mark of this element, this element is popped.
Further, described device comprises the 3rd cache module:
For after having resolved one section of XML, current element stack is carried out buffer memory;
When there being new XML file input, then the described element stack of buffer memory before recovering to use.
Beneficial effect of the present invention is as follows:
The present invention, by adopting SAX resolver, can process parsing limit, XML file limit, not need file to read into memory in advance, effectively can save memory headroom; Notify retrieval progress in the mode of event, the process retrieved can be controlled more easily; By using selector as the description means to the content that will search, the information of the XML overwhelming majority can be indicated, very general, can development be saved; Based on the characteristic of selector, can also resolve html file, the retrieval use for this unstructured data of AM/BAM HTML can be very large.
Above-mentioned explanation is only the general introduction of technical solution of the present invention, in order to technological means of the present invention can be better understood, and can be implemented according to the content of instructions, and can become apparent, below especially exemplified by the specific embodiment of the present invention to allow above and other objects of the present invention, feature and advantage.
Accompanying drawing explanation
In order to be illustrated more clearly in the embodiment of the present invention or technical scheme of the prior art, be briefly described to the accompanying drawing used required in embodiment or description of the prior art below, apparently, accompanying drawing in the following describes is only some embodiments of the present invention, for those of ordinary skill in the art, under the prerequisite not paying creative work, other accompanying drawing can also be obtained according to these accompanying drawings.
Fig. 1 is in prior art, the process flow diagram of analyzing XML file on NodeJS;
Fig. 2 is in the embodiment of the present invention, the schematic flow sheet of XML file search method;
Fig. 3 is in the embodiment of the present invention, the workflow schematic diagram of searcher;
Fig. 4 is in the embodiment of the present invention, arranges the schematic flow sheet of search rule;
Fig. 5 is in the embodiment of the present invention, to the retrieval flow schematic diagram of XML file;
Fig. 6 is in the embodiment of the present invention, the structural representation of XMl document search device.
Embodiment
Below in conjunction with the accompanying drawing in the embodiment of the present invention, be clearly and completely described the technical scheme in the embodiment of the present invention, obviously, described embodiment is only the present invention's part embodiment, instead of whole embodiments.Based on the embodiment in the present invention, those of ordinary skill in the art, not making the every other embodiment obtained under creative work prerequisite, belong to the scope of protection of the invention.
In order to solve the retrieval of class unstructured data on NodeJS, the especially search problem of the XML file that volume ratio is larger, the invention provides a kind of XML file search method based on JavaScript language.The method adopts the SAX (abbreviation of SimpleAPIforXML, a kind of engine of the analyzing XML based on event) process of parsing limit, resolver limit, do not need in advance XML file to be read into memory, effectively can save memory headroom, after retrieving the element matched in the present invention simultaneously, user can be notified in the mode of event, after not needing file to process completely, reinform user search result, user can determine whether according to current result for retrieval to continue retrieval process, the more convenient control of retrieval process.
As shown in Figure 2, XML file search method provided by the invention, specifically comprises the steps:
The XML character string of step S101, SAX resolver to input is resolved;
Step S102, after being resolved to new element, obtains the descriptor of element;
Step S103, when this descriptor matches with the search rule pre-set, is saved in results set by the descriptor of this element, notifies that user processes retrieval process simultaneously.
Below in conjunction with accompanying drawing and embodiment, the present invention is further elaborated.Should be appreciated that specific embodiment described herein only in order to explain the present invention, do not limit the present invention.
SAX resolver primary responsibility analyzing XML character string in the present invention, a scanning, resolves for one time.When being resolved to new element, in the mode of event, analysis result is notified searcher.Searcher is the object of external disclosure, according to the information that search rule coupling SAX resolver resolves arrives, informs the user progress and the result of retrieval in the mode of event.
See Fig. 3, before resolving, searcher need carry out initialization.User, by after the input of application program, constructs searcher object.Searcher, when initialization, constructs SAX object.SAX resolver carries out parsing to SAX object and generates SAX analysis object.The SAX analysis object registration retrieval event that searcher returns according to SAX.This event is used for when after SAX resolver resolves to respective element, and notice searcher carries out matching operation.After searcher has registered retrieval event, generate searcher object.Application program section monitors event according to this searcher object registration.When after retriever retrieves to matching content, the retrieval progress current with the mode notification application of event and result.User can control the process retrieved more easily, after finding the content that will look for, can stop retrieval immediately, and can hang up halfway, and can recover to continue to perform.
Before retrieval, in searcher, pre-set search rule.As shown in Figure 4, when arranging rule, user arranges the search rule of needs by application program.Pattern resolver is resolved the style rule that user inputs, and generates the list of increment formula.Searcher according to each the increment formula simple structure selector switch in the list of increment formula, according to the logical relation constitutive logic selector switch between increment formula.These simple selectors and logic selector switch are finally combined into one style selector switch (CSSSelector), form a selector switch tree, that selector switch outermost is tree root, and all simple selectors are leaf nodes, and logic selector switch is then branch.Selector switch tree just can indicate all search rules of user's input.
Particularly, simple selector is the not subdivisible smallest match unit of atom, such as, comprises the content of text etc. that masurium, attribute of an element and this element comprise.Logic selector switch, when user judges whether all simple selectors meet user-defined logical relation.Logic selector switch logically relation, can be divided into selector switch and selector switch or selector switch and non-selection device.
When mating, first obtain the descriptor of element, judge descriptor whether with meet the rule that simple selector describes and conform to: if conform to, then judge the logical relation between simple selector; When all simple logic selector switchs meet the logical relation of described logic selector switch definition, then this Match of elemental composition success.When logic selector switch is with selector switch, after needing all simple selectors all to mate, just the match is successful in calculation.When logic selector switch be or selector switch time, only need element describe with wherein arbitrarily simple selector match, namely the match is successful.Particularly, when mating, first from the simple selector of the bottom of selector switch tree, successively to upper strata coupling, it fails to match that then all it fails to match for any one-level, exits option program, continue next element.Until outermost selector switch the match is successful, then retrieve successfully once.
Because selector (CSSSelector) is a kind of pattern, select the html element element needing to add pattern.And selector has a variety of, conventional has typed selector, ID selector switch, attribute selector and pattern class selector etc.In numerous selector switch, be equally also applicable to the expression of XML element.Therefore, using abundant selector, almost can describe all information that HTML comprises, is also almost whole to the covering of XML content.Therefore, the present invention arranges search rule by pattern Chooser, can realize the retrieval to XML file and html file, achieve the general of gopher.
After search rule is provided with, searcher enters ready (preparation) state.After searcher enters ready state, before calling stop (stopping) search method, can continue to carry out to the action of input XML character string, like this in NodeJS, just can use the XML character string of reading in the input needs retrieval that flow object continues.Once have invoked stop method, resource will be released, and can not continue have input again.In the present embodiment, searcher provides the events such as onReady, onMatched, onFree, onFindFirst, provides the methods such as input, pause, resume, start, stop, for the control to retrieval process.
As shown in Figure 5, when carrying out XML file and resolving, searcher needs to carry out buffer memory to the XML content of input.When SAX resolver is resolved, because SAX object has the length of specifying, if once, can not resolve and need to remember current parse position.When new XML character string inputs, the situation before needing to recover, to continue to resolve from the parsing position of memory.
When SAX resolver resolves character string, when being resolved to element, trigger opentag/text event.After searcher knows situation, start to mate.When mating, element being pressed into storehouse, obtaining the descriptor of this element.Then, call tectonic style selector switch and mate with the descriptor of acquisition: upon a match, the corresponding information of element is saved in result for retrieval set, notify that user processes retrieval process simultaneously.User determines current retrieval process according to result for retrieval: or stop retrieval immediately, or midway hangs, or recover to continue to perform.When SAX is resolved to element end mark, when triggering closetag/text event, corresponding element is moved back stack by searcher, represents that this element mates end.For XML, its nested structure is complete, and each starts a label always corresponding end-tag, and there will not be label nested between dislocation.Element Down-Up arranges out one by one, just can obtain the path that from XML root node to present node one is unique.According to the information in this path, the structure of document can be known, and the particular location of element.
Further, after having resolved one section of XML, searcher has needed current element stack to carry out buffer memory, to ensure the continuity retrieved.When analyzing XML file, repeatedly may input into, there will be the incomplete situation of file, if at this moment disposed by current element stack, may there is the situations such as omission in follow-up result for retrieval.Therefore, in the present invention after one section of XML file is resolved and terminated, preserve the information of element stack, simultaneously when the input of new XML file is come in, the element stack before needing to continue to use, to ensure the accuracy retrieved and integrality.
See Fig. 6, present invention also offers a kind of XML file indexing unit, comprising:
Parsing module, for resolving by the XML character string of SAX resolver to input;
Acquisition module, for after being resolved to new element, obtains the descriptor of element;
Matching module, for when the descriptor obtained matches with the search rule pre-set, is then saved to results set by the descriptor of element, notifies that user processes result for retrieval simultaneously.
Further, this device also comprises rule and arranges module:
For when arranging search rule, utilize pattern resolver to resolve the style rule that user inputs, and generate the list of increment formula;
According to each the increment formula simple structure selector switch in the list of increment formula;
According to the logical relation constitutive logic selector switch between increment formula;
Simple selector and logic selector switch are combined into one style selector switch, for describing search rule.
Further, matching module specifically for:
Obtain the descriptor of element, if descriptor meets simple selector description rule, then proceed logical relation coupling;
When relation between simple selector meets the logical relation of logic selector switch definition, then this Match of elemental composition success.
Further, this device comprises the first cache module:
For when the input of new XML character string is come in, after XML character string is carried out buffer memory, then SAX resolver is transferred to resolve.
Further, this device comprises the second cache module:
For when SAX resolver resolves starts to new element, new element is pressed into storehouse;
When SAX resolver resolves is to the element end mark of this element, this element is popped.
Further, this device comprises the 3rd cache module:
For after having resolved one section of XML, current element stack is carried out buffer memory;
When there being new XML file input, then the element stack of buffer memory before recovering to use.
The present invention has following beneficial effect:
(1) by adopting SAX resolver, can process parsing limit, XML file limit, do not need file to read into memory in advance, then construct huge document object, effectively can save memory headroom, especially when Documents Comparison is large time, effect is more obvious.
(2) notify retrieval progress in the mode of event, the process retrieved can be controlled more easily, after finding the content that will look for, retrieval can be stopped immediately, and can hang up halfway, and can recover to continue to perform.
(3) use selector as the description means to the content that will search, the information of the XML overwhelming majority can be indicated, very general.Need not resolve for each like this or retrieval scene design treatment logic all separately, can development be saved.
(4) based on the characteristic of selector, the present invention can also resolve html file, and the retrieval use for this unstructured data of AM/BAM HTML can be very large.
One of ordinary skill in the art will appreciate that all or part of flow process realized in above-described embodiment method, that the hardware that can carry out instruction relevant by computer program has come, program can be stored in computer read/write memory medium, this program, when performing, can comprise the flow process of the embodiment as above-mentioned each side method.
Although describe the application by embodiment, those skilled in the art will appreciate that the application has many distortion and change and do not depart from the spirit and scope of the present invention.Like this, if these amendments of the present invention and modification belong within the scope of the claims in the present invention and equivalent technologies thereof, then the present invention is also intended to comprise these change and modification.

Claims (12)

1. an XML file search method, is characterized in that, comprising:
Resolved by the XML character string of SAX resolver to input;
After being resolved to new element, obtain the descriptor of element;
When the described descriptor obtained matches with the search rule pre-set, then described descriptor is saved in results set, notifies that user processes retrieval process simultaneously.
2. the method for claim 1, is characterized in that, when arranging search rule, comprising:
Utilize pattern resolver to resolve the style rule that user inputs, and generate the list of increment formula;
According to each the increment formula simple structure selector switch in the list of described increment formula;
According to the logical relation constitutive logic selector switch between increment formula;
Described simple selector and described logic selector switch are combined into one style selector switch, for describing described search rule.
3. method as claimed in claim 2, is characterized in that, when the described descriptor obtained being mated with the search rule pre-set, comprising:
Obtain the descriptor of element, if described descriptor meets the rule that described simple selector describes, then proceed logical relation coupling;
When described simple selector meets the logical relation of described logic selector switch definition, then this Match of elemental composition success.
4. the method for claim 1, is characterized in that, described method comprises:
When the input of new XML character string is come in, after described XML character string is carried out buffer memory, more described SAX resolver is transferred to resolve.
5. the method for claim 1, is characterized in that, described method comprises:
When described SAX resolver resolves starts to new element, new element is pressed into storehouse;
When described SAX resolver resolves is to the element end mark of this element, this element is popped.
6. method as claimed in claim 5, it is characterized in that, described method comprises:
After having resolved one section of XML, current element stack is carried out buffer memory;
When there being new XML file input, then the described element stack of buffer memory before recovering to use.
7. an XML file indexing unit, is characterized in that, comprising:
Parsing module, for resolving by the XML character string of SAX resolver to input;
Acquisition module, for after being resolved to new element, obtains the descriptor of element;
Matching module, for when the described descriptor obtained matches with the search rule pre-set, is then saved to results set by described descriptor, notifies that user processes retrieval process simultaneously.
8. device as claimed in claim 7, it is characterized in that, described device also comprises rule and arranges module:
For when arranging search rule, utilize pattern resolver to resolve the style rule that user inputs, and generate the list of increment formula;
According to each the increment formula simple structure selector switch in the list of described increment formula;
According to the logical relation constitutive logic selector switch between increment formula;
Described simple selector and described logic selector switch are combined into one style selector switch, for describing described search rule.
9. device as claimed in claim 8, is characterized in that, described matching module specifically for:
Obtain the descriptor of element, if described descriptor meets described simple selector description rule, then proceed logical relation coupling;
When relation between described simple selector meets the logical relation of described logic selector switch definition, then this Match of elemental composition success.
10. device as claimed in claim 7, it is characterized in that, described device comprises the first cache module:
For when the input of new XML character string is come in, after described XML character string is carried out buffer memory, more described SAX resolver is transferred to resolve.
11. devices as claimed in claim 8, it is characterized in that, described device comprises the second cache module:
When described SAX resolver resolves starts to new element, new element is pressed into storehouse;
When described SAX resolver resolves is to the element end mark of this element, this element is popped.
12. devices as claimed in claim 11, it is characterized in that, described device comprises the 3rd cache module:
For after having resolved one section of XML, current element stack is carried out buffer memory;
When there being new XML file input, then the described element stack of buffer memory before recovering to use.
CN201510728080.7A 2015-10-30 2015-10-30 XML file retrieval method and apparatus Pending CN105354282A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510728080.7A CN105354282A (en) 2015-10-30 2015-10-30 XML file retrieval method and apparatus

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510728080.7A CN105354282A (en) 2015-10-30 2015-10-30 XML file retrieval method and apparatus

Publications (1)

Publication Number Publication Date
CN105354282A true CN105354282A (en) 2016-02-24

Family

ID=55330254

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510728080.7A Pending CN105354282A (en) 2015-10-30 2015-10-30 XML file retrieval method and apparatus

Country Status (1)

Country Link
CN (1) CN105354282A (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105610701A (en) * 2016-03-18 2016-05-25 深圳市招华电子科技有限公司 Instant messaging method, device and system
CN105930470A (en) * 2016-04-25 2016-09-07 安徽富驰信息技术有限公司 File retrieval method based on feature weight analysis technology
WO2017185277A1 (en) * 2016-04-28 2017-11-02 华为技术有限公司 File storage method and electronic device
CN112528080A (en) * 2019-09-03 2021-03-19 北京国双科技有限公司 Method and device for extracting text content of docx file
CN112749180A (en) * 2021-01-19 2021-05-04 上海复佳信息科技有限公司 Data management method, electronic device, and computer-readable storage medium

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN100505752C (en) * 2005-01-21 2009-06-24 华为技术有限公司 Universal parser for text code protocols
CN101221556B (en) * 2008-02-01 2010-06-02 中国建设银行股份有限公司 Method and device for XML document analysis
US20140244692A1 (en) * 2013-02-27 2014-08-28 Cellco Partnership D/B/A Verizon Wireless Converting xml to json with configurable output

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN100505752C (en) * 2005-01-21 2009-06-24 华为技术有限公司 Universal parser for text code protocols
CN101221556B (en) * 2008-02-01 2010-06-02 中国建设银行股份有限公司 Method and device for XML document analysis
US20140244692A1 (en) * 2013-02-27 2014-08-28 Cellco Partnership D/B/A Verizon Wireless Converting xml to json with configurable output

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
ITEYE_4185: "XML之SAX解析XML实例", 《CSDN博客》 *
LIUJIZHONG: "SAX解析XML文件", 《CSDN博客》 *
MOSHALANYE: "SAX解析xml", 《IT610》 *
北京洪恩教育科技有限公司: "《SQL Server2000数据库技术与应用》", 31 October 2006 *
真悲剧: "SAX解析XML详解", 《博客园》 *

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105610701A (en) * 2016-03-18 2016-05-25 深圳市招华电子科技有限公司 Instant messaging method, device and system
CN105610701B (en) * 2016-03-18 2019-01-15 深圳市招华智能股份有限公司 A kind of method, apparatus and system of instant messaging
CN105930470A (en) * 2016-04-25 2016-09-07 安徽富驰信息技术有限公司 File retrieval method based on feature weight analysis technology
CN105930470B (en) * 2016-04-25 2019-03-26 安徽富驰信息技术有限公司 A kind of document retrieval method based on feature weight analytical technology
WO2017185277A1 (en) * 2016-04-28 2017-11-02 华为技术有限公司 File storage method and electronic device
RU2712823C1 (en) * 2016-04-28 2020-01-31 Хуавэй Текнолоджиз Ко., Лтд. Method of storing a file and an electronic device
US11308029B2 (en) 2016-04-28 2022-04-19 Huawei Technologies Co., Ltd. File saving method and electronic device
CN112528080A (en) * 2019-09-03 2021-03-19 北京国双科技有限公司 Method and device for extracting text content of docx file
CN112749180A (en) * 2021-01-19 2021-05-04 上海复佳信息科技有限公司 Data management method, electronic device, and computer-readable storage medium

Similar Documents

Publication Publication Date Title
US10824691B2 (en) Page rendering method, device, and data storage medium
CN105354282A (en) XML file retrieval method and apparatus
US10108715B2 (en) Transformation and presentation of on-demand native application crawling results
US10515142B2 (en) Method and apparatus for extracting webpage information
US8977626B2 (en) Indexing and searching a data collection
CN110955428A (en) Page display method and device, electronic equipment and medium
US9760551B2 (en) Generating regular expression
US20170277672A1 (en) Information processing device, information processing method, and computer program product
US20130339840A1 (en) System and method for logical chunking and restructuring websites
CN107256234A (en) A kind of web page text method of adjustment and its equipment
JP3220886B2 (en) Document search method and apparatus
CN111241793B (en) Method, system and medium for resolving rich text editor content to render to native client
CN111367595B (en) Data processing method, program running method, device and processing equipment
CN103678704A (en) Picture recognition method, system, equipment and device based on picture information
US11314757B2 (en) Search results modulator
CN104142985A (en) Semi-automatic vertical crawler generation tool and method
US11403078B2 (en) Interface layout interference detection
CN104270257A (en) Network element level network management service configuration adaptive system and method based on PB and XPATH
CN111831384A (en) Language switching method and device, equipment and storage medium
CN104391796A (en) Method for parsing test cases
CN108664546B (en) XML data structure conversion method and device
CN104809173A (en) Search result processing method and device
JP6292190B2 (en) Document association apparatus, document association system, and program
CN113360385A (en) Test method, device, equipment and medium
CN102609518A (en) Method and system for acquiring content of multistate AJAX (asynchronous javascript and extensible markup language) webpage

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20160224