TW448377B - Browser with automatic filtering web pages and its browsing method - Google Patents

Info

Publication number
TW448377B
Authority
TW
Taiwan
Prior art keywords
webpage
visited
browser
status line
character set
Prior art date
Application number
TW088110307A
Other languages
Chinese (zh)
Inventor
Sheng-Hong Yang
Jen-Shing Lai
Original Assignee
Inventec Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
1999-06-21
Filing date
1999-06-21
Publication date
2001-08-01
Application filed by Inventec Corp
Priority to TW088110307A
Application granted
Publication of TW448377B

Landscapes

  • Information Transfer Between Computers (AREA)

Abstract

A browser that automatically filters web pages, and a corresponding browsing method, that can save users substantial download time and cost. According to the method, when a web page is downloaded over HTTP, a browser is first provided and used to connect to the visited web page and obtain its response data, which includes a status line, an entity header, and an entity body. The first three characters of the status line are then analyzed to confirm receipt of the response data, and the entity header and entity body are analyzed to identify the character set used by the visited web page. The user can then determine whether the current browser supports that character set and decide whether to continue downloading the visited web page.
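As a rough illustration of the three parts of the response data named above, the following minimal sketch splits a raw HTTP response into a status line, entity headers, and an entity body. The function name `split_response`, the lower-casing of header names, and the sample response are assumptions made for this example; the patent does not specify an implementation.

```python
def split_response(raw: bytes) -> tuple[str, dict[str, str], bytes]:
    """Split a raw HTTP response into (status line, entity headers, entity body).

    Hypothetical helper for illustration only.
    """
    # The header block ends at the first empty line (CRLF CRLF).
    head, _, body = raw.partition(b"\r\n\r\n")
    lines = head.decode("iso-8859-1").split("\r\n")
    status_line = lines[0]
    headers: dict[str, str] = {}
    for line in lines[1:]:
        name, sep, value = line.partition(":")
        if sep:
            # Header names are case-insensitive, so store them lower-cased.
            headers[name.strip().lower()] = value.strip()
    return status_line, headers, body


# Made-up sample response, not taken from the patent.
raw = (
    b"HTTP/1.0 200 OK\r\n"
    b"Content-Type: text/html; charset=big5\r\n"
    b"\r\n"
    b"<html>...</html>"
)
status_line, headers, body = split_response(raw)
```

Because the header block arrives before the body, a browser can inspect it and decide whether to go on reading the rest of the page.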

Description

V. Description of the Invention

The present invention relates to a browser, and in particular to a browser that automatically filters web pages and a browsing method thereof. When a web page is opened over HTTP, web pages that use a character set the current browser cannot support are filtered out automatically, saving substantial download time and money.

In today's society, the Internet has become one of the most important tools for conveying information. Through a browser, users can obtain all kinds of information they want from the Internet. The speed of the Internet, however, is limited by several factors. For example, when a conventional browser opens a web page, it downloads the page regardless of which character set (Chinese, English, ...) the visited web page uses. If the browser cannot support that character set, the visited web page is displayed as garbled text after it has been downloaded. This not only wastes the user's download time and fees but also causes considerable inconvenience. Although more and more browsers can support web pages in a variety of character sets, conventional browsers built on telephone platforms still support only the English character set and no others. Because many web pages are written in non-English character sets, users of these browsers still have a considerable chance of downloading such a page only to see garbled text.

In view of this, the main object of the present invention is to provide a browser that automatically filters web pages and a browsing method thereof, which, when a web page is opened over HTTP, automatically filter out web pages that use a character set the current browser cannot support, saving substantial download time and telephone charges.

To achieve the above and other objects, the present invention provides a browser that automatically filters web pages and a browsing method thereof, which can save users substantial download time and money. According to this browsing method, when a web page is downloaded over HTTP, a browser is first provided and used to connect to the visited web page so as to obtain its response data, which includes a status line, an entity header, and an entity body. The first three characters of the status line are then analyzed to confirm receipt of the response data, and the entity header and entity body are analyzed to identify the character set used by the visited web page. The user can then determine whether the current browser supports the character set used by the visited web page, and thereby decide whether to continue downloading the visited web page.

In addition, to achieve the above and other objects, the present invention also provides a browser that automatically filters web pages, characterized by comprising a main control module, a status line analysis module, and an entity header analysis module. When a web page is downloaded over HTTP, the main control module connects to the visited web page and obtains its response data, which includes a status line, an entity header, and an entity body. The status line analysis module analyzes the status line of the visited web page and confirms receipt of the response data. The entity header analysis module analyzes the entity header of the visited web page and identifies the character set used by the visited web page; if the current browser cannot support that character set, the user decides whether to proceed with the subsequent download.

To make the above and other objects, features, and advantages of the present invention easier to understand, a preferred embodiment is described in detail below with reference to the accompanying drawings.

Brief description of the drawings

FIG. 1 is a flowchart of the main control module in the browsing method for automatically filtering web pages according to the present invention;

FIG. 2 is a flowchart of the status line analysis module in the browsing method for automatically filtering web pages according to the present invention; and

FIG. 3 is a flowchart of the entity header analysis module in the browsing method for automatically filtering web pages according to the present invention.

Embodiment

According to HTTP, when a user wants to open a visited web page on a web server, a "GET" or "POST" command is usually sent to that web server to obtain the content of the visited web page. The web server then returns the response data of the visited web page to the client according to the source of the "GET" or "POST" command. The response data usually contains three parts: a status line, an entity header, and an entity body. The entity header and the entity body may contain information indicating the character set used by the visited web page.

The present invention therefore identifies the character set used by the visited web page by analyzing the entity header and the entity body. When the character set used by the visited web page is supported by the current browser, downloading of the visited web page continues. Otherwise, the question of whether to continue downloading the visited web page is left to the user.

The embodiment of the present invention is described in detail below.

In this embodiment, to achieve automatic filtering of web pages, three modules are provided in the browser: a main control module, a status line analysis module, and an entity header analysis module. When a web page is opened over HTTP, the main control module connects to the visited web page to obtain its response data, including the status line, entity header, and entity body. The status line analysis module analyzes the status line of the visited web page to confirm that the response data has been received correctly. The entity header analysis module analyzes the entity header and entity body of the visited web page to identify the character set used by the visited web page and, if the current browser cannot support it, hands the decision of whether to continue downloading to the user. In addition, three Boolean variables are provided in the browser to record the analysis state of the status line and of the entity header and entity body: bStatusLineGot, bEntityHeaderGot, and bContinueVisit.

bStatusLineGot records whether the status line has been obtained, bEntityHeaderGot records whether the entity header has been obtained, and bContinueVisit records whether the user continues to visit the web page.

FIG. 1 is a flowchart of the main control module in the browsing method for automatically filtering web pages according to the present invention. When a web page is opened over HTTP, the main control module connects to the visited web page to obtain its status line, entity header, and entity body.

In step S1-1, when the user opens a web page over HTTP, the main control module initializes the status-line-obtained state (bStatusLineGot), the entity-header-obtained state (bEntityHeaderGot), and the continue-visit state (bContinueVisit) to false (0), false (0), and true (1) respectively, and then executes step S1-2.

In step S1-2, the main control module receives the URL of the visited web page entered by the user, and then executes step S1-3.

In step S1-3, the main control module obtains, from the entered URL, the IP address of the web server hosting the visited web page, and then executes step S1-4.

In step S1-4, the main control module issues a connection request to the web server hosting the visited web page according to the obtained IP address, and then executes step S1-5.

In step S1-5, the main control module determines whether the connection to the web server succeeded. If it succeeded, step S1-6 is executed; otherwise step S1-14 is executed to end the main control module.

In step S1-6, the main control module constructs a "GET" or "POST" command according to HTTP to obtain the content of the visited web page, and then executes step S1-7.

In step S1-7, the main control module sends the "GET" or "POST" command to the web server hosting the visited web page, and then executes step S1-8.

In step S1-8, the main control module determines whether the web server has returned response data. The response data usually includes a status line, an entity header, and an entity body. If response data has been returned, step S1-9 is executed; otherwise step S1-13 is executed to close the connection to the web server.

In step S1-9, the main control module receives the response data returned by the web server, and then executes step S1-10.

In step S1-10, the main control module passes the response data to the status line analysis module (described below) and the entity header analysis module to analyze the character set used by the visited web page. If the current browser cannot support that character set, a dialog box is presented to hand the decision of whether to continue downloading the web page back to the user. Step S1-11 is then executed.

In step S1-11, the main control module determines whether the user wants to continue downloading the web page. If so, step S1-12 is executed; otherwise step S1-13 is executed to close the connection to the web server.

In step S1-12, with the user's consent, the main control module downloads and displays this part of the web page, and after this part has been downloaded returns to step S1-8 to determine whether the web server has more data to return.

In step S1-13, the main control module closes the connection to the web server, and then executes step S1-14 to end the main control module.

In step S1-14, the main control module ends.

FIG. 2 is a flowchart of the status line analysis module in the browsing method for automatically filtering web pages according to the present invention. The status line analysis module analyzes the status line of the visited web page and confirms receipt of the response data.

In step S2-1, the status line analysis module first obtains the data to be analyzed, that is, the response data returned by the web server, and then executes step S2-2.

In step S2-2, the status line analysis module determines whether the entire status line has been obtained. If so, step S2-3 is executed; otherwise step S2-9 is executed to end the status line analysis module.

In step S2-3, the status line analysis module sets the status-line-obtained state to true (bStatusLineGot = 1), and then executes step S2-4.

In step S2-4, the status line analysis module obtains the first three characters of the status line, and then executes step S2-5.

In step S2-5, the status line analysis module converts these three characters into a decimal integer, namely the status code, and then executes step S2-6.

In step S2-6, the status line analysis module determines whether the status code is correct (whether it is 200). If the status code is correct, the status line was received without problems, and step S2-7 is executed to perform the entity header analysis; otherwise a problem occurred in receiving the status line, and step S2-8 is executed to issue an error message.

In step S2-7, the status line analysis module enters the entity header analysis module (described below) to analyze the entity header and entity body, and then executes step S2-9 to end the status line analysis module.

In step S2-8, the status line analysis module displays an error message and sets the continue-visit state to false (bContinueVisit = 0), and then executes step S2-9 to end the status line analysis module.

In step S2-9, the status line analysis module ends and control returns to the main control module.

FIG. 3 is a flowchart of the entity header analysis module in the browsing method for automatically filtering web pages according to the present invention. The entity header analysis module analyzes the entity header and entity body of the visited web page to identify the character set used by the visited web page and, if the current browser cannot support it, hands the decision of whether to continue downloading to the user.

In step S3-1, the entity header analysis module first obtains the data to be analyzed, that is, the response data returned by the web server, and then executes step S3-2.

In step S3-2, the entity header analysis module determines whether the entire entity header has already been obtained. If the entire entity header has not yet been obtained, step S3-3 is executed; otherwise step S3-13 is executed to analyze the entity body (as in a conventional browser).

In step S3-3, the entity header analysis module determines whether the response data contains an entity header. If it does, step S3-4 is executed to obtain the entity header; otherwise step S3-14 is executed to end the entity header analysis module.

In step S3-4, the entity header analysis module obtains the entity header from the response data, and then executes step S3-5.

In step S3-5, the entity header analysis module determines whether the entity header contains information indicating the character set used by the visited web page. If it does, step S3-6 is executed; otherwise step S3-11 is executed to determine whether all of the entity header data has been obtained.

In step S3-6, the entity header analysis module determines whether the character set used by the visited web page is supported by the browser. If the browser cannot support that character set, step S3-7 is executed to prompt the user to decide whether to continue downloading; otherwise step S3-11 is executed to determine whether all of the entity header data has been obtained.

In step S3-7, the entity header analysis module notifies the user that the character set used by the visited web page is not one the browser can support and asks whether to continue downloading the web page, and then executes step S3-8.

In step S3-8, the entity header analysis module obtains the user's decision on whether to stop downloading the web page. If the user decides to stop, step S3-9 is executed to set the continue-visit state (bContinueVisit) to false, and step S3-14 is executed to end the entity header analysis module; otherwise step S3-10 is executed to set the continue-visit state to true.

In step S3-9, the entity header analysis module sets the continue-visit state to false.

In step S3-10, the entity header analysis module sets the continue-visit state to true.

In step S3-11, the entity header analysis module determines whether all of the entity header data has been obtained. If so, step S3-12 is executed to set the entity-header-obtained state (bEntityHeaderGot) to true; otherwise step S3-14 is executed to end the entity header analysis module.

In step S3-13, the entity header analysis module analyzes the entity body to identify the character set used by the visited web page, and then executes step S3-14 to end the entity header analysis module. The operation of step S3-13 is the same as that of a conventional browser and is therefore not described in detail.

In step S3-14, the entity header analysis module ends and control returns to the main control module.

In summary, the browser that automatically filters web pages and its browsing method according to the present invention can, when a web page is opened over HTTP, automatically filter out web pages that use a character set the current browser cannot support, saving substantial download time and telephone charges.

In this browser, when a web page is downloaded over HTTP, the main control module connects to the visited web page and obtains its response data, which includes a status line, an entity header, and an entity body. The status line analysis module analyzes the status line of the visited web page and confirms receipt of the response data. The entity header analysis module analyzes the entity header of the visited web page and identifies the character set used by the visited web page; if the current browser cannot support that character set, the user decides whether to proceed with the subsequent download.

Although the present invention has been disclosed above by way of a preferred embodiment, the embodiment is not intended to limit the invention. Anyone skilled in the art may make changes and refinements without departing from the spirit and scope of the invention; the scope of protection of the invention is therefore defined by the appended claims.
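The entity header analysis of FIG. 3 can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patent's implementation: the names `find_charset`, `continue_visit`, and `ask_user` are hypothetical, headers are assumed to be held in a dictionary with lower-cased names, the charset label is assumed to appear as a `charset=` parameter (in the Content-Type header, or in an HTML `<meta>` tag in the body), and only the first 1024 bytes of the body are scanned.

```python
import re
from typing import Callable, Optional

# Matches "charset=big5", "charset = 'utf-8'", and similar labels.
_CHARSET_RE = re.compile(r"""charset\s*=\s*["']?([A-Za-z0-9._:-]+)""", re.IGNORECASE)


def find_charset(headers: dict[str, str], body: bytes) -> Optional[str]:
    """Look for a charset label in the entity header first, then in the entity body."""
    match = _CHARSET_RE.search(headers.get("content-type", ""))
    if match is None:
        # Fallback (assumption): a <meta ... charset=...> tag near the top of the body.
        match = _CHARSET_RE.search(body[:1024].decode("ascii", errors="ignore"))
    return match.group(1).lower() if match else None


def continue_visit(charset: Optional[str], supported: set[str],
                   ask_user: Callable[[str], bool]) -> bool:
    """Return the value that bContinueVisit would take for this response."""
    if charset is None or charset in supported:
        # No charset information, or a supported charset: keep downloading.
        return True
    # Unsupported charset: hand the decision back to the user (steps S3-7, S3-8).
    return ask_user(charset)
```

A caller would pass its own list of supported character sets and a dialog callback, for example `continue_visit(find_charset(headers, body), {"us-ascii"}, prompt)`, where `prompt` is whatever function shows the query dialog.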

Claims (1)

44837 六、申請專利範圍 1. 一種可自動過滤網頁的ί劉覽方法,包括: 提供一瀏覽器; 利用該瀏覽器連結一受訪網頁,藉以取得該受訪網頁 的回應資料; 分析該受訪網頁的回應資料,藉以判讀該受訪網頁所 使用的字符集;以及 根據該受訪網頁所使用的字符集以決定是否繼續下載 動作。 2. 如申請專利範圍第1項所述可自動過濾網頁的瀏覽 方法,其中,該受訪網頁的回應資料包括一狀態行,一實 體頭及一實體正文。 3. 如申請專利範圍第2項所述可自動過濾網頁的瀏覽 方法,其中,分析該受訪網頁的回應資料以判讀該受訪網 頁所使用的字符集的步驟包括: 分析該受訪網頁的狀態行以確認該回應資料的接收; 以及 分析該受訪網頁的實體頭及實體正文以判讀該受訪網 頁所使用的字符集。 4. 如申請專利範圍第3項所述可自動過濾網頁的瀏覽 方法,其中,分析該受訪網頁的狀態行以確認該回應資料 的接收的步驟包括: 判斷是否已經取得該受訪網頁的回應資料的狀態行; 取得該狀態行的前三個字符; 將該狀態行的前三個字符轉換為一狀態碼;以及44837 VI. Scope of patent application 1. A method for automatically filtering web pages, including: providing a browser; using the browser to link to an interviewed webpage to obtain response data of the interviewed webpage; analyzing the interviewed data The response data of the webpage, so as to determine the character set used by the interviewed webpage; and decide whether to continue the download action based on the character set used by the interviewed webpage. 2. As described in item 1 of the scope of patent application, the webpage browsing method can be automatically filtered, wherein the response data of the interviewed webpage includes a status line, an entity header, and an entity body. 3. The browsing method for automatically filtering webpages as described in item 2 of the scope of patent application, wherein the steps of analyzing the response data of the interviewed webpage to determine the character set used by the interviewed webpage include: The status line confirms the receipt of the response data; and analyzes the entity header and entity body of the interviewed web page to determine the character set used by the interviewed web page. 4. 
The method for automatically filtering webpage browsing as described in item 3 of the scope of patent application, wherein the step of analyzing the status line of the visited webpage to confirm receipt of the response data includes: determining whether a response has been obtained for the visited webpage The status line of the data; obtaining the first three characters of the status line; converting the first three characters of the status line into a status code; and 第14頁 448377Page 14 448377 比對該狀態碼與一既定數值,藉以確認該回應資料的 接收。 。5.如申請專利範圍第4項所述可自動過濾網頁的瀏覽 器’其中,該狀態碼係將該狀態行的前三個字集轉換為十 進位數字以得到。 6. 如申請專利範圍第4項所述可自動過濾網頁的瀏覽 器’其中’該既定數值係2 0 0。 7. 如申請專利範圍第3項所述可自動過濾網頁的瀏覽 器’其中’分析該受訪網頁的實體頭及實體正文以判讀該 受訪網頁所使用的字符集的步驟包括: 判斷是否已經取得該受訪網頁的回應資料的實體頭; 取得該實體頭; 判斷該實體頭是否具有訊息以表承該受訪網頁所使用 的字符集; 分析該受訪網頁的回應資料的實體正文中是否具有訊 息以表示該受訪網頁所使用的字符集; 判斷該受訪網頁所使用的字符集是否為現行瀏覽器所 支援;以及 在該受訪網頁所使用的字符集為現行瀏覽器所無法支 援的字符集時,產生—詢問對話框以將是否繼續下載網頁 的選擇權交還使用者。 8. 一種可自動過滤網頁的劉覽器,其特徵在於: —主控模組,連結一受訪網頁以取得該受訪網頁的回 應資料,包括一狀態行’一實體頭及實體正文;The status code is compared with a predetermined value to confirm the reception of the response data. . 5. The browser capable of automatically filtering the webpage as described in item 4 of the scope of the patent application, wherein the status code is obtained by converting the first three character sets of the status line into decimal digits. 6. The browser that can automatically filter the webpage as described in item 4 of the scope of patent application, wherein the predetermined value is 2000. 7. As described in item 3 of the scope of the patent application, a browser that can automatically filter the webpage 'where' analyzes the physical header and the body text of the interviewed webpage to determine the character set used by the interviewed webpage. 
The steps include: obtaining the entity header of the response data of the visited web page; determining whether the entity header contains information indicating the character set used by the visited web page; analyzing whether the entity body of the response data of the visited web page contains information indicating the character set used by the visited web page; determining whether the character set used by the visited web page is supported by the current browser; and, when the character set used by the visited web page is not supported by the current browser, generating a query dialog box that returns to the user the choice of whether to continue downloading the web page.
8. A browser capable of automatically filtering web pages, characterized by comprising: a main control module that connects to a visited web page to obtain response data of the visited web page, the response data including a status line, an entity header, and an entity body; a status line analysis module that analyzes the status line of the visited web page to confirm receipt of the response data; and an entity header analysis module that analyzes the entity header of the visited web page to determine the character set used by the visited web page.
9. The browser capable of automatically filtering web pages as described in item 8 of the scope of patent application, wherein the execution steps of the main control module include: obtaining the IP address of the visited web page and sending a connection request to the web server of the visited web page; determining whether the connection to the web server of the visited web page succeeded; submitting a download request to the web server of the visited web page; and obtaining the response data of the visited web page, including the status line, the entity header, and the entity body.
10. The browser capable of automatically filtering web pages as described in item 8 of the scope of patent application, wherein the execution steps of the status line analysis module include: determining whether the status line of the response data of the visited web page has been obtained; obtaining the first three characters of the status line; converting the first three characters of the status line into a status code; and comparing the status code with a predetermined value to confirm receipt of the response data.
11. The browser capable of automatically filtering web pages as described in item 10 of the scope of patent application, wherein the status code is obtained by converting the first three characters of the status line into a decimal number.
12. The browser capable of automatically filtering web pages as described in item 10 of the scope of patent application, wherein the predetermined value is 200.
13. The browser capable of automatically filtering web pages as described in item 8 of the scope of patent application, wherein the execution steps of the entity header analysis module include: determining whether the entity header of the response data of the visited web page has been obtained; obtaining the entity header; determining whether the entity header contains information indicating the character set used by the visited web page; determining whether the character set used by the visited web page is supported by the current browser; and, when the character set used by the visited web page is not supported by the current browser, generating a query dialog box that returns to the user the choice of whether to continue downloading the web page.
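The status line check and the character set check recited in the claims can be illustrated with a short Python sketch. It is not the patented implementation. The names parse_status_code, find_charset, should_ask_user and the SUPPORTED_CHARSETS set are hypothetical and chosen only for illustration. In a raw HTTP/1.x status line the three-digit code follows the protocol version, so the sketch drops a leading version token before taking the first three characters; that step is an assumption and is not stated in the claims. Reading the character set from a Content-Type header or from a charset declaration in the body is likewise an assumed way of finding the information the claims refer to.

```python
import re

# Hypothetical set of character sets the browser supports (illustrative assumption).
SUPPORTED_CHARSETS = {"us-ascii", "iso-8859-1", "utf-8", "big5"}
EXPECTED_STATUS = 200  # the predetermined value named in claim 12

# Matches "charset=<name>" with optional whitespace and an optional opening quote.
_CHARSET_RE = re.compile(r"""charset\s*=\s*["']?([A-Za-z0-9._:-]+)""", re.IGNORECASE)


def parse_status_code(status_line):
    """Convert the first three characters of the status text into a decimal code."""
    text = status_line.strip()
    # Assumption: drop a leading "HTTP/x.y" version token if one is present.
    if text.upper().startswith("HTTP/"):
        parts = text.split(None, 1)
        text = parts[1] if len(parts) > 1 else ""
    digits = text[:3]
    return int(digits) if re.fullmatch(r"[0-9]{3}", digits) else None


def find_charset(entity_headers, entity_body):
    """Look for a charset declaration in the entity header, then in the entity body."""
    for name, value in entity_headers.items():
        if name.lower() == "content-type":
            match = _CHARSET_RE.search(value)
            if match:
                return match.group(1).lower()
    # Fall back to a declaration near the start of the body, such as a meta tag.
    match = _CHARSET_RE.search(entity_body[:2048])
    return match.group(1).lower() if match else None


def should_ask_user(status_line, entity_headers, entity_body):
    """Return True when the user should be asked whether to continue downloading."""
    if parse_status_code(status_line) != EXPECTED_STATUS:
        return False  # receipt of the response is not confirmed in this sketch
    charset = find_charset(entity_headers, entity_body)
    return charset is not None and charset not in SUPPORTED_CHARSETS
```

In this sketch a page whose character set cannot be determined is downloaded without prompting; a stricter browser could choose to ask the user in that case as well.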
TW088110307A 1999-06-21 1999-06-21 Browser with automatic filtering web pages and its browsing method TW448377B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW088110307A TW448377B (en) 1999-06-21 1999-06-21 Browser with automatic filtering web pages and its browsing method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW088110307A TW448377B (en) 1999-06-21 1999-06-21 Browser with automatic filtering web pages and its browsing method

Publications (1)

Publication Number Publication Date
TW448377B true TW448377B (en) 2001-08-01

Family

ID=21641202

Family Applications (1)

Application Number Title Priority Date Filing Date
TW088110307A TW448377B (en) 1999-06-21 1999-06-21 Browser with automatic filtering web pages and its browsing method

Country Status (1)

Country Link
TW (1) TW448377B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI381280B (en) * 2005-04-07 2013-01-01 Microsoft Corp System and method for selecting a tab within a tabbed browser
US8631341B2 (en) 2005-04-07 2014-01-14 Microsoft Corporation System and method for selecting a tab within a tabbed browser

Similar Documents

Publication Publication Date Title
CN103593354B (en) A kind of method, apparatus, server and the system of screen page ad
JP6093449B2 (en) Homepage forming method, peripheral device, and homepage forming system
CN105122760B (en) Page operation processing method, device and terminal
US8825756B2 (en) Server apparatus, information processing method, information processing program, and recording medium
CN103810268B (en) Search result recommendation information loading method, device and system and URL detection method, device and system
CN108156121A (en) The alarm method and device that the monitoring method and device of flow abduction, flow are kidnapped
CN109862074B (en) Data acquisition method and device, readable medium and electronic equipment
CN104954363A (en) Method and device for generating interface document
TW448377B (en) Browser with automatic filtering web pages and its browsing method
CN108234391B (en) Login information prompting method and device
CN112468550A (en) File downloading method and device and electronic equipment
CN112256545A (en) Method and device for acquiring user operation information
CN111209325A (en) Service system interface identification method, device and storage medium
JP2000028617A (en) Analysis system
JP2002149478A (en) Method for automatically displaying update information and device for the same, and medium and program
EP1887476A1 (en) Menu bar providing method and information read screen configuration file creation program
CN112416500B (en) Information processing method and electronic equipment
CN101772196A (en) Method and system for processing message sent by mobile terminal and acting server
JP2006107524A (en) Www server and system having user terminal connected to www server via communication line
JP2000222329A (en) Information communication system and information providing device and user attribute information collecting method and record medium
CN107239534B (en) Bar code scanning method, bar code scanning device, mobile terminal and computer readable storage medium
CN108959325B (en) Uniform resource locator display method, information display method and related products thereof
CN107204958A (en) The detection method and device of web page resources element, terminal device
JP3482111B2 (en) Recording medium recording Internet address translation program, Internet address translation device, and method of specifying information file on Internet
JP2001195329A (en) Data input supporting device and recording medium

Legal Events

Date Code Title Description
GD4A Issue of patent certificate for granted invention patent
MM4A Annulment or lapse of patent due to non-payment of fees