CN105868280A - Internet information data processing system and method - Google Patents

Internet information data processing system and method Download PDF

Info

Publication number
CN105868280A
CN105868280A CN201610167633.0A CN201610167633A CN105868280A CN 105868280 A CN105868280 A CN 105868280A CN 201610167633 A CN201610167633 A CN 201610167633A CN 105868280 A CN105868280 A CN 105868280A
Authority
CN
China
Prior art keywords
data
internet information
character
data processing
information data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201610167633.0A
Other languages
Chinese (zh)
Inventor
沈文策
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CN201610167633.0A priority Critical patent/CN105868280A/en
Publication of CN105868280A publication Critical patent/CN105868280A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/957Browsing optimisation, e.g. caching or content distillation
    • G06F16/9577Optimising the visualization of content, e.g. distillation of HTML documents

Abstract

The present invention provides an Internet information data processing method. The method comprises the following steps: S1. obtaining original data of a web; S2. removing redundant data of the original data to obtain processed data; and S3. displaying the processed data on the app. The present invention further provides an Internet information data processing system. The system comprises a data obtaining module, used for obtaining the original data of the web; a data processing module, used for removing the redundant data of the original data to obtain the processed data; and a display module, used for displaying the processed data on the app. According to the system and method disclosed by the present invention, by processing data content on the web, useful content is extracted to display on the app, so that data processing and display efficiency is improved.

Description

A kind of internet information data handling system and method
Technical field
The present invention relates to the network communications technology, particularly relate to a kind of internet information data handling system and side Method.
Background technology
Along with the quickening of internet development process, information resources networking becomes a megatrend, with traditional Information resources are compared, network information resource quantity, structure, be distributed and propagate scope, carrier form, The aspects such as intension transfer means all show the feature made new advances, and these new features impart network information resource New intension.Network information resource refers to the information aggregate existing the most in every way and propagating. Network information resource is based on Internet information resource at present, also includes that other are not connected into the Internet simultaneously Information resources.Traditional document information content is also referred to as virtual information resource, and it is to remember in digitized form Record, express in the multimedia form and be stored in network computer magnetizing mediums, light medium and all kinds of communication Jie In matter, and by computer network communication mode row transmission information content set, be mainly manifested in it with Electronic data, as main carrying form, both can be text message, image information, can be again to regard Frequently, the multimedia form such as audio frequency.In addition network information resource is no matter in scale, in distribution, the most more The aspects such as new speed are all that traditional literature information resources are incomparable, so according to the feature of the network information Need network information resource is carried out effective data process with the information of the network user, be network information money Major issue the most urgent in the process of construction of source.Due to complexity and the multiformity of data, need logarithm Showing according to carrying out processing, such as, there is html label, spcial character and hiding word in the data on web Symbol, will not be revealed when showing on web, but can be revealed on app, so We need to process data after taking data, and these need not to prevent data from showing on app The content of display, affects user and uses.
Summary of the invention
One of the technical problem to be solved in the present invention, is to provide a kind of internet information data process side Method, by display complete for data content on web on app, improves data and processes and the efficiency of display With convenience.
One of the technical problem to be solved in the present invention is achieved in that a kind of internet information data process Method, comprises the steps:
Step 1, the initial data of acquisition web;
Step 2, the redundant data removed in described initial data, the data after being processed;
Step 3, on app, show the data after described process.
Further, the redundant data in described step 2 includes html label, spcial character and hides Character.
Further, the removing method of described html label is particularly as follows: once remove by recursion method , until there is not html label in data in a pair hmlt label.
Further, described spcial character and the removing method the hidden character method by replaceAll Carry out character replacement process.
The two of the technical problem to be solved in the present invention, are to provide a kind of internet information data to process system System, by display complete for data content on web on app, improves data and processes and the efficiency of display With convenience.
The two of the technical problem to be solved in the present invention are achieved in that a kind of internet information data process System, including
One data acquisition module, for obtaining the initial data of web;
One data processing module, for removing the redundant data in described initial data, after being processed Data;And
One display module, the data after showing described process on app.
Further, the redundant data in described data processing module includes html label, spcial character With hide character.
Further, the removing method of described html label is particularly as follows: once remove by recursion method , until there is not html label in data in a pair hmlt label.
Further, described spcial character and the removing method the hidden character method by replaceAll Carry out character replacement process.
Present invention have the advantage that processing the initial data to website by data content is processed place Reason, the information being formed, it is allowed to meet the demand of statistical analysis, and by the form of icon by data exhibition Show on app, thus improve the convenience of information transmission.
Accompanying drawing explanation
The present invention is further illustrated the most in conjunction with the embodiments.
Fig. 1 is that one internet information data processing method of the present invention performs flow chart.
Fig. 2 is one internet information data handling system block diagram of the present invention.
Detailed description of the invention
As it is shown in figure 1, a kind of internet information data processing method, comprise the steps:
Step 1, the initial data of acquisition web;
Step 2, the redundant data removed in described initial data, the data after being processed are described many Remainder is according to including html label, spcial character and hiding character, and the removing method of described html label has Body is: once removed a pair hmlt label by recursion method, until data do not exist html label, Because there being a lot of html label inside data, routine processes once can process pair of tag, so Use recursion method, self call self until returning us after there is not html label in data and wanting Data, described spcial character and the removing method hidden character carry out word by the method for replaceAll Symbol replacement processes, and such as: original data content is: A &&, replaces & character, data after process Content is, thus spcial character & is removed.This step by initial data is processed, The information being formed, is allowed to meet the demand of statistical analysis, and data display is gone out by the form of available icons Come, from the point of view of user finds the mode of information, can be processed by data and obtain in webpage initial data Html label makes this content can show the pattern effect of this label, thus is shown by complete for data content Original effect is shown;
Step 3, on app, show the data after described process.
As in figure 2 it is shown, a kind of internet information data handling system, including a data acquisition module,
One data processing module and a display module;
Described data acquisition module, for obtaining the initial data of web;
Described data processing module, for removing the redundant data in described initial data, after being processed Data, described redundant data includes html label, spcial character and hides character, described html mark The removing method signed once removes a pair hmlt label particularly as follows: pass through recursion method, until in data There is not html label, described spcial character and the removing method the hidden character side by replaceAll Method carries out character replacement process, such as: original data content is: A &&, replaces & character, place After reason, data content is, thus spcial character & is removed.This step is by carrying out initial data Processed, the information being formed, it is allowed to meet the demand of statistical analysis, the form of available icons will Data display out, from the point of view of user finds the mode of information, can process acquisition webpage by data former In beginning data, html label makes this content can show the pattern effect of this label, thus by data content Complete demonstrates original effect;
Described display module, the data after showing described process on app.
Although the foregoing describing the detailed description of the invention of the present invention, but it is familiar with the technology people of the art Member should be appreciated that our described specific embodiment is merely exemplary rather than for this The restriction of bright scope, those of ordinary skill in the art are in the equivalence made according to the spirit of the present invention Modify and change, all should contain in the scope of the claimed protection of the present invention.

Claims (8)

1. an internet information data processing method, it is characterised in that: comprise the steps:
Step 1, the initial data of acquisition web;
Step 2, the redundant data removed in described initial data, the data after being processed;
Step 3, on app, show the data after described process.
A kind of internet information data processing method the most according to claim 1, it is characterised in that: Redundant data in described step 2 includes html label, spcial character and hides character.
A kind of internet information data processing method the most according to claim 1, it is characterised in that: The removing method of described html label once removes a pair hmlt label particularly as follows: pass through recursion method, Until data do not exist html label.
A kind of internet information data processing method the most according to claim 1, it is characterised in that: Described spcial character and the removing method hidden character are carried out at character replacement by the method for replaceAll Reason.
5. an internet information data handling system, it is characterised in that: include
One data acquisition module, for obtaining the initial data of web;
One data processing module, for removing the redundant data in described initial data, after being processed Data;And
One display module, the data after showing described process on app.
A kind of internet information data handling system the most according to claim 5, it is characterised in that: Redundant data in described data processing module includes html label, spcial character and hides character.
A kind of internet information data handling system the most according to claim 5, it is characterised in that: The removing method of described html label once removes a pair hmlt label particularly as follows: pass through recursion method, Until data do not exist html label.
A kind of internet information data handling system the most according to claim 5, it is characterised in that: Described spcial character and the removing method hidden character are carried out at character replacement by the method for replaceAll Reason.
CN201610167633.0A 2016-03-23 2016-03-23 Internet information data processing system and method Pending CN105868280A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610167633.0A CN105868280A (en) 2016-03-23 2016-03-23 Internet information data processing system and method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610167633.0A CN105868280A (en) 2016-03-23 2016-03-23 Internet information data processing system and method

Publications (1)

Publication Number Publication Date
CN105868280A true CN105868280A (en) 2016-08-17

Family

ID=56625437

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610167633.0A Pending CN105868280A (en) 2016-03-23 2016-03-23 Internet information data processing system and method

Country Status (1)

Country Link
CN (1) CN105868280A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112463778A (en) * 2020-11-24 2021-03-09 史传波 Information processing method based on big data and application program and big data server

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101216842A (en) * 2008-01-07 2008-07-09 华为技术有限公司 Method for obtaining page key words and page information processing apparatus
CN101887432A (en) * 2009-05-13 2010-11-17 北京博越世纪科技有限公司 Technology for analyzing and purifying internet webpage for mobile terminal to display
US20100333201A1 (en) * 2009-06-30 2010-12-30 International Business Machines Corporation System, method, and program for determining validity of string

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101216842A (en) * 2008-01-07 2008-07-09 华为技术有限公司 Method for obtaining page key words and page information processing apparatus
CN101887432A (en) * 2009-05-13 2010-11-17 北京博越世纪科技有限公司 Technology for analyzing and purifying internet webpage for mobile terminal to display
US20100333201A1 (en) * 2009-06-30 2010-12-30 International Business Machines Corporation System, method, and program for determining validity of string

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
刘强: "java正则表达式去除html中所有标签和特殊HTML字符(以&开头的)", 《HTTPS://WWW.ANDROIDDEV.NET》 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112463778A (en) * 2020-11-24 2021-03-09 史传波 Information processing method based on big data and application program and big data server

Similar Documents

Publication Publication Date Title
US20130103385A1 (en) Performing sentiment analysis
CN105005568B (en) Web page front end processing device and method based on template
CN110049456A (en) A kind of method, device and mobile terminal showing Rich Media in mobile terminal
CN107948437B (en) Screen-off display method and device
CN104182430A (en) Method and device for displaying image in text message
US9330075B2 (en) Method and apparatus for identifying garbage template article
CN105094824B (en) A kind of notification message methods of exhibiting on smartwatch and a kind of smartwatch
CN104283765A (en) Information display method and device and terminal equipment
CN104915359A (en) Theme label recommending method and device
CN112580339B (en) Model training method and device, electronic equipment and storage medium
CN108229363A (en) Key frame dispatching method and device, electronic equipment, program and medium
CN106681598A (en) Information input method and device
CN105589631B (en) Information displaying method and device
CN102497580B (en) Video information synthesizing method based on audio feature information
CN112905753A (en) Method and device for distinguishing text information
JP2022091686A (en) Data annotation method, device, electronic apparatus and storage medium
CN104376304B (en) A kind of recognition methods of text advertisements image and device
CN113239256A (en) Method for generating website signature and method and device for identifying website
CN113378855A (en) Method for processing multitask, related device and computer program product
CN113904943A (en) Account detection method and device, electronic equipment and storage medium
WO2022156087A1 (en) Data blood relationship establishing method and apparatus, computer device, and storage medium
CN105868280A (en) Internet information data processing system and method
US20190332859A1 (en) Method for identifying main picture in web page
CN109919324B (en) Transfer learning classification method, system and equipment based on label proportion learning
CN111191242A (en) Vulnerability information determination method and device, computer readable storage medium and equipment

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20160817