CN109376319A - A method of purification browser page - Google Patents

A method of purification browser page Download PDF

Info

Publication number
CN109376319A
CN109376319A CN201811122985.XA CN201811122985A CN109376319A CN 109376319 A CN109376319 A CN 109376319A CN 201811122985 A CN201811122985 A CN 201811122985A CN 109376319 A CN109376319 A CN 109376319A
Authority
CN
China
Prior art keywords
text
browser
purification
page
source code
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201811122985.XA
Other languages
Chinese (zh)
Inventor
曾三平
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhuoeer Purchase Information Technology (Wuhan) Co., Ltd.
Original Assignee
Wuhan Yun Media Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wuhan Yun Media Technology Co Ltd filed Critical Wuhan Yun Media Technology Co Ltd
Priority to CN201811122985.XA priority Critical patent/CN109376319A/en
Publication of CN109376319A publication Critical patent/CN109376319A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/166Editing, e.g. inserting or deleting
    • G06F40/186Templates
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/189Automatic justification

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

The invention belongs to page processing technology field, in particular to a kind of method for purifying browser page.After the present invention is by purifying third party's browser page, it is shown in the application program of client again, browser reading service is provided for user, effectively filter out advertisement and the other content of third party website, browser body text content is only shown in specified purification frame by the browser for providing each browser website, user is improved to the attention rate of browser content, improves the visual experience for being used for reading and browsing device.

Description

A method of purification browser page
Technical field
The invention belongs to page processing technology field, in particular to a kind of method for purifying browser page.
Background technique
With the development of computer technology and network technology, network electronic information resources become normal in people's daily life With resource, such as browser, network picture etc..And the third party website for providing browser is even more innumerable, but each third party The style and features of website are even more to let a hundred schools contend, is different, and while bringing user's more visual enjoyments, also band dim eyesight is entangled Random application experience, causes a variety of different user experiences, and the visual experience for making user read Chinese character generates fluctuation, influences user Browsing mood.
Summary of the invention
The present invention to solve the above-mentioned problems, provides a kind of method for purifying browser page, improves user to browser The attention rate of content improves the visual experience for being used for reading and browsing device.
The present invention is realized using following technical scheme:
A method of purification browser page, comprising:
Read the source code of website locating for browser;
The purification frame of browser page, the compiling according to the source code are matched according to the Compiler Structure of the source code The step of purification frame of structure matching browser page, comprising: identify Chinese character start of text and end of text in the source code Symbol;
According to website locating for the feature of the start of text and the symbol of end of text difference browser;
Purification frame corresponding with website locating for the browser is selected, browser body text content is shown in institute It states in purification frame, step browser body text content being shown in the purification frame, comprising: described in extraction The text of browser body text;According to specified typesetting mode to text typesetting again.
Preferably, the step of text for extracting the browser body text, comprising: according to the start of text and The layout scope of the symbol location Chinese character body text of end of text;Extract the text paragraph in the layout scope.
Preferably, it is described according to specified typesetting mode to the text again typesetting the step of, comprising: by the text paragraph In each text set be embedded into it is described purification frame specified typesetting mode in.
The beneficial effects of the present invention are:
It after the present invention is by purifying third party's browser page, then is shown in the application program of client, is user Browser reading service is provided, advertisement and the other content of third party website are effectively filtered out, each browser website is provided Browser body text content is only shown in specified purification frame by browser, improves concern of the user to browser content Degree improves the visual experience for being used for reading and browsing device.
Specific embodiment
The technical solution in embodiment will be clearly and completely described below.Obviously, described embodiment is only It is a part of the embodiments of the present invention, instead of all the embodiments.Based on the embodiment of the present invention, ordinary skill people Member's every other embodiment obtained without creative efforts, shall fall within the protection scope of the present invention.
A method of purification browser page, comprising:
Read the source code of website locating for browser;
The purification frame of browser page, the compiling according to the source code are matched according to the Compiler Structure of the source code The step of purification frame of structure matching browser page, comprising: identify Chinese character start of text and end of text in the source code Symbol;
According to website locating for the feature of the start of text and the symbol of end of text difference browser;
Purification frame corresponding with website locating for the browser is selected, browser body text content is shown in institute It states in purification frame, step browser body text content being shown in the purification frame, comprising: described in extraction The text of browser body text;According to specified typesetting mode to text typesetting again.
Preferably, the step of text for extracting the browser body text, comprising: according to the start of text and The layout scope of the symbol location Chinese character body text of end of text;Extract the text paragraph in the layout scope.
Preferably, it is described according to specified typesetting mode to the text again typesetting the step of, comprising: by the text paragraph In each text set be embedded into it is described purification frame specified typesetting mode in.
The present invention is not limited to examples detailed above, in claims of the present invention limited range, art technology The various deformations or amendments that personnel can make without creative work are protected by this patent.

Claims (3)

1. a kind of method for purifying browser page characterized by comprising
Read the source code of website locating for browser;
The purification frame of browser page, the Compiler Structure according to the source code are matched according to the Compiler Structure of the source code The step of matching the purification frame of browser page, comprising: identify the symbol of Chinese character start of text and end of text in the source code Number;
According to website locating for the feature of the start of text and the symbol of end of text difference browser;
Purification frame corresponding with website locating for the browser is selected, browser body text content is shown in described net Change in frame, step browser body text content being shown in the purification frame, comprising: extract the browsing The text of device body text;According to specified typesetting mode to text typesetting again.
2. the method for purification browser page according to claim 1, which is characterized in that described to extract the browser just The step of text of text, comprising: according to the cloth of the start of text and the symbol location Chinese character body text of end of text Office's scope;Extract the text paragraph in the layout scope.
3. the method for purification browser page according to claim 2, which is characterized in that described according to specified typesetting mode To the text again typesetting the step of, comprising: by each text set in the text paragraph be embedded into it is described purification frame finger Determine in typesetting mode.
CN201811122985.XA 2018-09-26 2018-09-26 A method of purification browser page Pending CN109376319A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811122985.XA CN109376319A (en) 2018-09-26 2018-09-26 A method of purification browser page

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811122985.XA CN109376319A (en) 2018-09-26 2018-09-26 A method of purification browser page

Publications (1)

Publication Number Publication Date
CN109376319A true CN109376319A (en) 2019-02-22

Family

ID=65401859

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811122985.XA Pending CN109376319A (en) 2018-09-26 2018-09-26 A method of purification browser page

Country Status (1)

Country Link
CN (1) CN109376319A (en)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170024423A1 (en) * 2015-07-20 2017-01-26 Guangzhou Ucweb Computer Technology Co., Ltd Webpage pre-reading method, apparatus and smart terminal
CN108090123A (en) * 2017-11-10 2018-05-29 深圳市华阅文化传媒有限公司 Purify the method and apparatus of the network novel page

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170024423A1 (en) * 2015-07-20 2017-01-26 Guangzhou Ucweb Computer Technology Co., Ltd Webpage pre-reading method, apparatus and smart terminal
CN108090123A (en) * 2017-11-10 2018-05-29 深圳市华阅文化传媒有限公司 Purify the method and apparatus of the network novel page

Similar Documents

Publication Publication Date Title
CN106528894B (en) The method and device of label information is set
CN102298638A (en) Method and system for extracting news webpage contents by clustering webpage labels
CN103336690A (en) HTML (Hypertext Markup Language) 5-based text-element drawing method and device
CN103514171B (en) Optically-based character recognition and the self-defined reptile method of vertical search
CN104252532A (en) Website information statistic method and device
CN108090123A (en) Purify the method and apparatus of the network novel page
CN109376319A (en) A method of purification browser page
CN109101665A (en) A method of purification browser news pages
CN108153872A (en) A kind of method and apparatus of the Internet web page information filtering
CN106815249B (en) Vertical text advertisement filtering method and device
CN106802841A (en) Data extract analytic method, device and server
CN102890630B (en) The minimizing technology of swf file peripheral link
WO2019090738A1 (en) Method and device for purifying web fiction page
CN106776493B (en) Information filtering method and information filtering device
CN105446644B (en) A kind of character deleting method and terminal
CN103020202A (en) Complicated dynamic data relation solution method based on character string
Subuhan Software-Based Attendance System Using Web Technology
Rabe The Savior of 6th Street
Chunfa Poetics and Memory: The Writings on Ruins in Pamuk’s Literary Works
CN104657881A (en) Method and device for establishing popularization and presentation effects
CN115017310A (en) Public opinion popularity determination method, device, storage medium and electronic equipment
Jin et al. A Study of Mobile Content Generation System using 2-Dimensional bar code in Smart Device Environment
Hyman Half-aquatic mist-dweller.
Lopez-Acevedo Wages and Productivity in Mexican Manufacturing. Policy Research Working Paper.
Randall History of Computers

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right
TA01 Transfer of patent application right

Effective date of registration: 20190924

Address after: 432200 Hankou North E-Commerce Building, 88 Hankou North Avenue, Panlongcheng, Huangpi District, Wuhan City, Hubei Province, 13th Floor

Applicant after: Zhuoeer Purchase Information Technology (Wuhan) Co., Ltd.

Address before: 430073 Huazhong Dawning Software Park, No. 1 Guanshan Road, Donghu New Technology Development Zone, Wuhan City, Hubei Province

Applicant before: Wuhan Yun Media Technology Co., Ltd.

WD01 Invention patent application deemed withdrawn after publication
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20190222