CN105868280A - Internet information data processing system and method - Google Patents
- Publication number: CN105868280A
- Application number: CN201610167633.0A
- Authority: CN (China)
- Prior art keywords: data, internet information, character, data processing, information data
- Legal status: Pending (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Classifications
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/957—Browsing optimisation, e.g. caching or content distillation
- G06F16/9577—Optimising the visualization of content, e.g. distillation of HTML documents
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Transfer Between Computers (AREA)
Abstract
The present invention provides an Internet information data processing method comprising the following steps: S1, obtaining the original data of the web; S2, removing redundant data from the original data to obtain processed data; and S3, displaying the processed data in an app. The present invention further provides an Internet information data processing system comprising: a data acquisition module for obtaining the original data of the web; a data processing module for removing the redundant data from the original data to obtain the processed data; and a display module for displaying the processed data in the app. By processing the data content of the web, the system and method extract the useful content and display it in the app, which improves the efficiency of data processing and display.
Description
Technical field
The present invention relates to network communication technology, and in particular to an Internet information data processing system and method.
Background
As the Internet develops ever faster, moving information resources onto the network has become a major trend. Compared with traditional information resources, network information resources show new characteristics in quantity, structure, distribution and scope of dissemination, carrier form, and means of transmission, and these characteristics give network information resources a new meaning. Network information resources are the collection of information that exists and is disseminated in various ways over networks. At present they consist mainly of Internet information resources, but they also include information resources that are not connected to the Internet. This information content is also referred to as virtual information resources: a collection of information content that is recorded in digital form, expressed in multimedia form, stored on the magnetic media, optical media and communication media of networked computers, and transmitted by computer network communication. Its main carrier is electronic data, which may be text or image information, or multimedia such as video and audio.
Moreover, traditional document information resources cannot match network information resources in scale, distribution or update speed. Processing network information resources effectively, according to the characteristics of network information and the information needs of network users, is therefore one of the most pressing problems in building network information resources. Because data is complex and diverse, it must be processed before it is displayed. For example, data on the web contains html tags, special characters and hidden characters. These are not shown when the data is displayed on the web, but they are shown when the same data is displayed in an app. The data therefore needs to be processed after it is fetched, so that content that should not be displayed does not appear in the app and interfere with the user.
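The problem described above can be made concrete with a small sketch that lists what a piece of web data would expose if an app displayed it unprocessed. The source does not define which characters count as "special" or "hidden", so the three patterns below (anything shaped like a tag, HTML character references beginning with `&`, and a few zero-width code points) are assumptions chosen for illustration, and `find_redundant` is a hypothetical helper name.

```python
import re

# Assumed definitions; the source does not enumerate these categories.
TAG = re.compile(r"<[^>]+>")                      # anything shaped like an html tag
SPECIAL = re.compile(r"&(?:[A-Za-z][A-Za-z0-9]*|#[0-9]+);")  # e.g. &nbsp; or &#39;
HIDDEN = re.compile("[\u200b\u200c\u200d\u2060\ufeff]")      # zero-width characters


def find_redundant(original: str) -> dict:
    """Report the pieces of `original` that an app would show as literal text."""
    return {
        "tags": TAG.findall(original),
        "special": SPECIAL.findall(original),
        "hidden": HIDDEN.findall(original),
    }


sample = "<p>Hello&nbsp;world\u200b</p>"
report = find_redundant(sample)
print(report["tags"])     # ['<p>', '</p>']
print(report["special"])  # ['&nbsp;']
print(len(report["hidden"]))  # 1
```

A browser would render this sample as plain "Hello world", whereas a text widget that receives the string unchanged would show every reported fragment.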
Summary of the invention
The first technical problem to be solved by the present invention is to provide an Internet information data processing method that displays the data content of the web completely in an app, improving the efficiency and convenience of data processing and display.
This first problem is solved by an Internet information data processing method comprising the following steps:
Step 1: obtaining the original data of the web;
Step 2: removing the redundant data from the original data to obtain processed data;
Step 3: displaying the processed data in the app.
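The three steps can be sketched as a small pipeline. This is only an illustration under stated assumptions: the source does not say how the data is fetched or rendered, so `fetch` and `render` are hypothetical callables supplied by the caller, and the tag-stripping rule in step 2 is a stand-in for whatever cleaning the method actually applies.

```python
import re
from typing import Callable


def obtain_original_data(fetch: Callable[[], str]) -> str:
    """Step 1: obtain the original data of the web via a caller-supplied fetcher."""
    return fetch()


def remove_redundant_data(original: str) -> str:
    """Step 2: remove redundant data (stand-in rule: drop anything shaped like a tag)."""
    return re.sub(r"<[^>]*>", "", original)


def display_on_app(processed: str, render: Callable[[str], None]) -> None:
    """Step 3: hand the processed data to the app's rendering function."""
    render(processed)


def process(fetch: Callable[[], str], render: Callable[[str], None]) -> str:
    """Run steps 1 to 3 in order and return the processed data."""
    original = obtain_original_data(fetch)
    processed = remove_redundant_data(original)
    display_on_app(processed, render)
    return processed


# Usage with in-memory stand-ins for the network and the app screen.
screen = []
result = process(lambda: "<p>Hi</p>", screen.append)
print(result)  # Hi
```

Passing the fetcher and renderer in as parameters keeps the sketch runnable without a network connection or a real app.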
Further, the redundant data in step 2 includes html tags, special characters and hidden characters.
Further, the html tags are removed as follows: a recursive method removes one pair of html tags at a time, until no html tags remain in the data.
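A minimal sketch of the recursive removal might look like the following. The regular expression and the function name `remove_html_tags` are assumptions, since the source names the technique but gives no code. Each call removes the first innermost pair (an opening tag, text containing no further tag, and the matching closing tag) and then calls itself on the result.

```python
import re

# An innermost pair: opening tag, content without '<', matching closing tag.
PAIR = re.compile(r"<([A-Za-z][A-Za-z0-9]*)\b[^>]*>([^<]*)</\1\s*>", re.IGNORECASE)


def remove_html_tags(data: str) -> str:
    """Remove one pair of html tags per call, recursing until no pair is left."""
    match = PAIR.search(data)
    if match is None:
        return data  # base case: no tag pair remains
    # Keep the text enclosed by the pair and drop the two tags themselves.
    stripped = data[:match.start()] + match.group(2) + data[match.end():]
    return remove_html_tags(stripped)


print(remove_html_tags("<div><p>Hello <b>world</b></p></div>"))  # Hello world
```

This sketch only handles paired tags; unpaired tags such as `<br>` are left in place, and very large documents could exceed Python's default recursion limit, in which case a loop would do the same work.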
Further, the special characters and hidden characters are removed by character replacement using the replaceAll method.
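The replacement step can be sketched with Python's `re.sub`, which plays the same role here as a regex-based `replaceAll`. The source does not list the characters involved, so the two patterns are assumptions: HTML character references beginning with `&` for the special characters, and a handful of zero-width code points for the hidden characters. The helper names are hypothetical.

```python
import re

# Assumed character classes; the source does not enumerate them.
SPECIAL = re.compile(r"&(?:[A-Za-z][A-Za-z0-9]*|#[0-9]+|#[xX][0-9A-Fa-f]+);")
HIDDEN = re.compile("[\u200b\u200c\u200d\u2060\ufeff]")


def replace_all(pattern: re.Pattern, replacement: str, text: str) -> str:
    """Replace every match of `pattern` in `text`, like a regex replaceAll."""
    return pattern.sub(replacement, text)


def remove_special_and_hidden(text: str, replacement: str = "") -> str:
    """Apply the two replacements one after the other."""
    text = replace_all(SPECIAL, replacement, text)
    return replace_all(HIDDEN, replacement, text)


print(remove_special_and_hidden("Tom&amp;Jerry\u200b"))  # TomJerry
```

Replacing with an empty string matches the source's wording that the characters are removed. Passing `replacement=" "` instead would keep words separated where a reference such as `&nbsp;` stood in for a space.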
The second technical problem to be solved by the present invention is to provide an Internet information data processing system that displays the data content of the web completely in an app, improving the efficiency and convenience of data processing and display.
This second problem is solved by an Internet information data processing system comprising:
a data acquisition module, for obtaining the original data of the web;
a data processing module, for removing the redundant data from the original data to obtain processed data; and
a display module, for displaying the processed data in the app.
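The module structure can be sketched as three small classes wired together by a fourth. The class names mirror the modules named above, but the fields and methods are assumptions: the source describes each module only by its purpose. The data source is a hypothetical callable, the processing rule is a stand-in that strips tag-shaped text, and the display module merely records what it was asked to show.

```python
import re
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class DataAcquisitionModule:
    """Obtains the original data of the web."""
    fetch: Callable[[], str]  # hypothetical data source, e.g. an HTTP client call

    def obtain(self) -> str:
        return self.fetch()


class DataProcessingModule:
    """Removes redundant data from the original data."""

    def process(self, original: str) -> str:
        # Stand-in rule: strip anything that looks like a tag.
        return re.sub(r"<[^>]*>", "", original)


@dataclass
class DisplayModule:
    """Shows the processed data in the app; here it only records what was shown."""
    shown: List[str] = field(default_factory=list)

    def show(self, processed: str) -> None:
        self.shown.append(processed)


@dataclass
class InternetInformationSystem:
    """Connects the three modules in the order acquisition, processing, display."""
    acquisition: DataAcquisitionModule
    processing: DataProcessingModule
    display: DisplayModule

    def run(self) -> str:
        processed = self.processing.process(self.acquisition.obtain())
        self.display.show(processed)
        return processed


system = InternetInformationSystem(
    DataAcquisitionModule(lambda: "<p>News</p>"),
    DataProcessingModule(),
    DisplayModule(),
)
print(system.run())  # News
```

Keeping each module behind a single method makes it possible to swap the processing rule without touching acquisition or display.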
Further, the redundant data handled by the data processing module includes html tags, special characters and hidden characters.
Further, the html tags are removed as follows: a recursive method removes one pair of html tags at a time, until no html tags remain in the data.
Further, the special characters and hidden characters are removed by character replacement using the replaceAll method.
The present invention has the following advantages: the original data of a website is processed into information that meets the needs of statistical analysis, and the data is displayed in the app in the form of icons, which improves the convenience of information transmission.
Brief description of the drawings
The present invention is further described below with reference to the accompanying drawings and embodiments.
Fig. 1 is a flow chart of the Internet information data processing method of the present invention.
Fig. 2 is a block diagram of the Internet information data processing system of the present invention.
Detailed description
As shown in Fig. 1, an Internet information data processing method comprises the following steps:
Step 1: obtaining the original data of the web;
Step 2: removing the redundant data from the original data to obtain processed data. The redundant data includes html tags, special characters and hidden characters. The html tags are removed by a recursive method that removes one pair of html tags at a time until no html tags remain in the data. Because the data contains many html tags and a single pass of the program handles only one pair of tags, the method calls itself until no html tags remain in the data and then returns the desired data. The special characters and hidden characters are removed by character replacement using the replaceAll method. For example, if the original data content is "A &&", the & character is replaced, so that the special character & no longer appears in the processed data content. By processing the original data, this step forms information that meets the needs of statistical analysis, and the data can be presented in the form of icons. From the standpoint of how users look for information, the data processing can also obtain the html tags in the original web page data so that the content shows the style effect of those tags, and the data content is thus displayed completely with its original effect;
Step 3: displaying the processed data in the app.
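The replacement example in step 2 can be worked through as follows. The source does not state the replacement string or the resulting value, so this sketch assumes the & character is replaced with an empty string. The helper name `remove_character` is hypothetical.

```python
import re


def remove_character(data: str, character: str, replacement: str = "") -> str:
    """Replace every occurrence of `character`, treating it as literal text."""
    # A regex-based replaceAll interprets its first argument as a pattern,
    # so the character is escaped to keep symbols such as '.' or '$' literal.
    return re.sub(re.escape(character), replacement, data)


original = "A &&"
processed = remove_character(original, "&")
print(repr(processed))  # 'A '
```

Under that assumption, both & characters disappear and only "A" followed by the original space remains. The escaping matters for characters that carry meaning in a regular expression: an unescaped "." would match, and therefore remove, every character in the data.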
As shown in Fig. 2, an Internet information data processing system comprises a data acquisition module, a data processing module and a display module.
The data acquisition module is used for obtaining the original data of the web.
The data processing module is used for removing the redundant data from the original data to obtain processed data. The redundant data includes html tags, special characters and hidden characters. The html tags are removed by a recursive method that removes one pair of html tags at a time until no html tags remain in the data. The special characters and hidden characters are removed by character replacement using the replaceAll method. For example, if the original data content is "A &&", the & character is replaced, so that the special character & no longer appears in the processed data content. By processing the original data, the module forms information that meets the needs of statistical analysis, and the data can be presented in the form of icons. From the standpoint of how users look for information, the data processing can also obtain the html tags in the original web page data so that the content shows the style effect of those tags, and the data content is thus displayed completely with its original effect.
The display module is used for displaying the processed data in the app.
Although specific embodiments of the present invention have been described above, those skilled in the art should understand that the described embodiments are merely illustrative and do not limit the scope of the present invention. Equivalent modifications and changes made by those of ordinary skill in the art in accordance with the spirit of the present invention shall all fall within the scope of protection claimed by the present invention.
Claims (8)
1. An Internet information data processing method, characterized by comprising the following steps:
Step 1: obtaining the original data of the web;
Step 2: removing the redundant data from the original data to obtain processed data;
Step 3: displaying the processed data in the app.
2. The Internet information data processing method according to claim 1, characterized in that the redundant data in step 2 includes html tags, special characters and hidden characters.
3. The Internet information data processing method according to claim 1, characterized in that the html tags are removed as follows: a recursive method removes one pair of html tags at a time, until no html tags remain in the data.
4. The Internet information data processing method according to claim 1, characterized in that the special characters and hidden characters are removed by character replacement using the replaceAll method.
5. An Internet information data processing system, characterized by comprising:
a data acquisition module, for obtaining the original data of the web;
a data processing module, for removing the redundant data from the original data to obtain processed data; and
a display module, for displaying the processed data in the app.
6. The Internet information data processing system according to claim 5, characterized in that the redundant data handled by the data processing module includes html tags, special characters and hidden characters.
7. The Internet information data processing system according to claim 5, characterized in that the html tags are removed as follows: a recursive method removes one pair of html tags at a time, until no html tags remain in the data.
8. The Internet information data processing system according to claim 5, characterized in that the special characters and hidden characters are removed by character replacement using the replaceAll method.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610167633.0A CN105868280A (en) | 2016-03-23 | 2016-03-23 | Internet information data processing system and method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610167633.0A CN105868280A (en) | 2016-03-23 | 2016-03-23 | Internet information data processing system and method |
Publications (1)
Publication Number | Publication Date |
---|---|
CN105868280A (en) | 2016-08-17 |
Family
ID=56625437
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610167633.0A Pending CN105868280A (en) | 2016-03-23 | 2016-03-23 | Internet information data processing system and method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN105868280A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112463778A (en) * | 2020-11-24 | 2021-03-09 | 史传波 | Information processing method based on big data and application program and big data server |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101216842A (en) * | 2008-01-07 | 2008-07-09 | 华为技术有限公司 | Method for obtaining page key words and page information processing apparatus |
CN101887432A (en) * | 2009-05-13 | 2010-11-17 | 北京博越世纪科技有限公司 | Technology for analyzing and purifying internet webpage for mobile terminal to display |
US20100333201A1 (en) * | 2009-06-30 | 2010-12-30 | International Business Machines Corporation | System, method, and program for determining validity of string |
Non-Patent Citations (1)
Title |
---|
Liu Qiang (刘强): "Java regular expressions for removing all tags and special HTML characters (those beginning with &) from html", 《HTTPS://WWW.ANDROIDDEV.NET》 * |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20130103385A1 (en) | Performing sentiment analysis | |
CN105005568B (en) | Web page front end processing device and method based on template | |
CN110059273A (en) | A kind of method showing Rich Media on mobile terminals and mobile terminal | |
CN112580339B (en) | Model training method and device, electronic equipment and storage medium | |
CN104182430A (en) | Method and device for displaying image in text message | |
US9330075B2 (en) | Method and apparatus for identifying garbage template article | |
CN105094824B (en) | A kind of notification message methods of exhibiting on smartwatch and a kind of smartwatch | |
CN104915359A (en) | Theme label recommending method and device | |
CN106681598A (en) | Information input method and device | |
CN112084342A (en) | Test question generation method and device, computer equipment and storage medium | |
CN110705271A (en) | System and method for providing natural language processing service | |
CN102497580B (en) | Video information synthesizing method based on audio feature information | |
CN112905753A (en) | Method and device for distinguishing text information | |
CN109271247A (en) | Memory Optimize Method, device, computer installation and storage medium | |
JP2022091686A (en) | Data annotation method, device, electronic apparatus and storage medium | |
CN104376304B (en) | A kind of recognition methods of text advertisements image and device | |
US10963690B2 (en) | Method for identifying main picture in web page | |
CN117992569A (en) | Method, device, equipment and medium for generating document based on generation type large model | |
CN113904943A (en) | Account detection method and device, electronic equipment and storage medium | |
CN113378025A (en) | Data processing method and device, electronic equipment and storage medium | |
CN105868280A (en) | Internet information data processing system and method | |
CN102779076B (en) | The method and apparatus of interprocess communication in a kind of browser | |
CN109919324B (en) | Transfer learning classification method, system and equipment based on label proportion learning | |
CN109408647B (en) | Method and apparatus for processing information | |
CN111191242A (en) | Vulnerability information determination method and device, computer readable storage medium and equipment |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | Application publication date: 20160817 |