TW200951737A - Method for selecting an object from a web page - Google Patents

Method for selecting an object from a web page Download PDF

Info

Publication number
TW200951737A
TW200951737A TW097121874A TW97121874A TW200951737A TW 200951737 A TW200951737 A TW 200951737A TW 097121874 A TW097121874 A TW 097121874A TW 97121874 A TW97121874 A TW 97121874A TW 200951737 A TW200951737 A TW 200951737A
Authority
TW
Taiwan
Prior art keywords
webpage
information
sub
objects
page
Prior art date
Application number
TW097121874A
Other languages
Chinese (zh)
Inventor
hong-yong Wang
Ming-Hua Chen
Original Assignee
Mobile Action Technology Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Mobile Action Technology Inc filed Critical Mobile Action Technology Inc
Priority to TW097121874A priority Critical patent/TW200951737A/en
Publication of TW200951737A publication Critical patent/TW200951737A/en

Links

Landscapes

  • Information Transfer Between Computers (AREA)

Abstract

This invention relates to a method for selecting an object from a web page, capable of facilitating a user to select an interested object from an existing web page. By using this method, a specific object in a complex web page can be selected and extracted through a simple operation procedure; the user does not need to learn and understand the programming language used for the development of the web page, nor does the user necessarily write the program code just for the purpose of selecting and extracting the objects.

Description

•200951737 九、發明說明: 【發明所屬之技術領域】 本發明係關於一種選取網頁中物件的方法,尤指— 可令使用者方便地直接讀取所需資訊的選取網頁中=件= 方法。 、 【先前技術】 隨著網際網路的普及與科技的發展,個人電腦、筆吃 ©型電腦或行動電話等電子裝置已可輕易地連接網際網路°己 令使用者可透過網際網路使用許多不同的網際網路服務, 例如瀏覽網頁、連線即時通服務或讀取電子郵件。 而為了包裝上述網頁中的文字、圖片、影像、音樂等 等=容,並提供與用戶之間互動性,業界制定了所謂的超 文字標記語言(HyPerText Markup Language,簡稱 html) 通訊協定,其係為一種可供網頁編製的語言(makeup 讀丨anguage f0rweb pages)。透過HTML的編碼㈣⑽丨叫, 可將文字、圖片、影像及/或音樂,甚至如javascrjpt之 類的程式碼等内容先行包裝為一個個的網頁檔案,當一網 站伺服器存有該網頁檔案時,使用者可以透過—可連線網 路之電子裝置自該網站伺服器中取回該網頁檔案,並由該 可連線網路之電子裝置中内建的瀏覽器解譯該網頁檔案 後’將原本的内容呈現給使用者。 "月參閱第十二圖所示,由於HTML支援多樣性的設計, 且要將各種不同格式的文字⑷)、圖片(42)、影像、音樂、 .200951737 互動JavaScript甚至是FIash播放器(43)等外掛物件皆嵌 入同:網頁Θ ’是相當簡易的事情,因此為讓使用者有更 力豐田的使用感又,以吸引使用者繼續劉覽與點閱網站中 所提供的其他内容,目前大多數網站的網頁中均夾雜有大 量的不同形式的資訊。 然而,若使用者僅針對網頁中的特定資訊感興趣,則 由於網頁的瀏覽往往必猪县敕 、 頁瀏覽,而無法讓使用者選 ❹ ❹ 擇僅顯不所需的資訊,會 實極為不方便。若非要篩選過濾、 tit Μ轉發所需之特定資訊,就只能藉由了 解HTML的語法、總宜古斗' 寫式、物件所在位置等技術細節, 才此有效師選出所需的特定資訊。 更甚者’若所需的特定資訊係屬於 訊,例如:股市走勢圖、即時交通狀離…:更新的資 到必須即所需的即時更新資訊,更是牽涉 得該特定資訊\ 站、以該特定資訊之代碼要求取 在既^ ㈣特定f訊並解碼等繁複過程。 設計師針對單_特 64由程式 方能取回^之卜、屬目標網站之特定程式, 特叱資訊的即時更新肉六 進—步檢討Hi 更新内因此’仍有待 亚課求可行的解決方案。 【發明内容】 訊的缺點,本發明之主 的方法,其可令使用者 有鑒於前述篩選網頁中特定資 要目的在提供—錄 、 種選取網頁中物件 5 .200951737 毋須學習、理解網頁所使用的語言,亦毋須為了筛選所需 資訊而自行撰寫、開發應用程式。 為達成前述目的所採取之主要技術手段係令前述方法 應用於一可上網之電子裝置上,該方法係包括下列步驟: 接收一包含有目標網頁之網址的網頁連線請求; 依據該網頁連線請求透過網際網路取得目標網頁之完 整編碼内容; 70 〇 分析目標網頁,解析出目標網頁中之複數物件並將之 轉化為複數個資訊物件且加以編號; 產生一包含有該複數個資訊物件的圖形化物件選取畫 面,其中該物件選取畫面係模擬目標網頁之晝面排版; 接收一包含被選取物件之編號的物件選取請求; 將該目標網頁的網址及在該物件選取晝面中被選取之 資訊物件的編號儲存為一網頁萃取規則。 利用上述技術手段,使用者可容易地挑選出目標網頁 〇 中之特疋物件,並儲存為一網頁萃取規則,爾後欲取得所 需資訊時,可方便地利用設定好的網頁萃取規則,直接讀 取所需且即時的資訊。 【實施方式】 «月參閱第一圖所示,本發明係應用於一可上網之本地 端或遠端電子裝置上,其中可上網之本地端電子裝置可為 使用者之個人電腦(10)或手持裝置,而可上網之遠端電子 裝置則可為一網頁萃取伺服器(2〇),該網頁萃取伺服器(20) 6 200951737 可供使用者以其個人電腦(10)或手捭奘要 v 于符裝置透過網際網路連 接;又該可上網之電子裝置可連線至一 巳存在於目前網際 網路上的非特定網站伺服器(web server)(3〇)利用本發明之 方法篩選所需的資訊,其中該網站伺服器(3〇)係提供有複 數個供使用者連線瀏覽的網頁,每個網頁的内容可包含伸 不限定有文字、影像、ffl片、音樂等内容,而建立該網頁 的技術則可包含但不限定為下列數種·· WAp、η丁ml、CSS、BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a method for selecting objects in a web page, and more particularly to a method for selecting a web page in a selected web page that allows a user to directly read desired information. [Prior Art] With the popularity of the Internet and the development of technology, electronic devices such as personal computers, pen-type computers, or mobile phones can be easily connected to the Internet. Users can use the Internet. Many different Internet services, such as browsing the web, connecting to instant messaging services, or reading emails. In order to package the text, pictures, images, music, etc. in the above webpages, and provide interactivity with the users, the industry has developed a so-called Hyper-Text Markup Language (HyperText Markup Language, html) communication protocol. For a language that can be used for web pages (makeup reads 丨anguage f0rweb pages). Through HTML coding (4) (10) squeaking, text, pictures, images and/or music, and even code such as javascrjpt can be packaged first into individual web files, when a web server stores the web file. The user can retrieve the webpage file from the web server through the electronic device that can be connected to the network, and the webpage file is interpreted by a built-in browser in the electronic device of the connectable network. Present the original content to the user. "Monthly, as shown in the twelfth figure, because HTML supports a variety of design, and to have a variety of different formats of text (4)), pictures (42), images, music, .200951737 interactive JavaScript or even FIash player (43 ) and other external objects are embedded in the same page: 'Web page' is quite simple, so in order to let users have more sense of Toyota's use, to attract users to continue to browse and browse other content provided on the website, currently Most websites have a large variety of different forms of information. However, if the user is only interested in the specific information in the webpage, since the browsing of the webpage often requires the pigs to browse the pages and browse the pages, the user cannot be selected to select only the information that is not needed, which is extremely undesirable. Convenience. If you want to filter the specific information required for filtering and tit forwarding, you can only find out the specific information required by understanding the syntax of the HTML, the general description of the writing, the location of the object, and so on. What's more, 'if the specific information required is part of the news, for example: stock market charts, instant traffic, etc.: The updated information must be the immediate update information required, and it is related to the specific information\ The code of the specific information requires a complicated process such as the specific (c) specific f-signal and decoding. The designer is able to retrieve the ^b, which is a specific program of the target website, and the special update of the information is in the form of a step-by-step review of the Hi update. Therefore, the solution still needs to be feasible in the Asian class. . SUMMARY OF THE INVENTION The shortcomings of the present invention are the main methods of the present invention, which enable the user to provide an object in the webpage in view of the specific purpose of filtering the webpage. 5.200951737 No need to learn or understand the webpage The language does not require writing and developing applications for screening the information you need. The main technical means for achieving the foregoing objective is to apply the foregoing method to an electronic device capable of accessing the Internet, the method comprising the steps of: receiving a webpage connection request including a webpage of the target webpage; and connecting according to the webpage Requesting to obtain the complete encoded content of the target webpage through the Internet; 70 analysing the target webpage, parsing the plurality of objects in the target webpage and converting them into a plurality of information objects and numbering them; generating a plurality of information objects including the plurality of information objects a graphic material selection screen, wherein the object selection screen simulates a face layout of the target webpage; receiving an object selection request including the number of the selected object; selecting the webpage of the target webpage and selecting the object in the object selection surface The number of the information object is stored as a web page extraction rule. By using the above technical means, the user can easily select the special objects in the target webpage and store them as a webpage extraction rule. When the desired information is obtained, the set webpage extraction rule can be conveniently used to directly read. Take the required and immediate information. [Embodiment] «Monday Referring to the first figure, the present invention is applied to a local or remote electronic device that can access the Internet, wherein the local electronic device that can access the Internet can be a user's personal computer (10) or The handheld device, and the remote electronic device that can access the Internet, can be a webpage extraction server (2〇), and the webpage extraction server (20) 6 200951737 can be used by the user to use his personal computer (10) or handcuffs. v The device is connected via the Internet; the electronic device that can be connected to the Internet can be connected to a non-specific web server (3) existing on the current Internet to filter the required method using the method of the present invention. Information, wherein the website server (3〇) provides a plurality of web pages for users to browse, and the content of each webpage may include content such as text, video, ffl, music, etc. The technology of this web page may include, but is not limited to, the following types: WAp, η丁ml, CSS,

JavaScript、PHP、JSP、XHTML、XML、DHTML、JASP、 ASP、PERL、Flash 等。 請進一步參閱第二圖所示,本發明之第一實施例係包 括下列步驟: 接收一包含有目標網頁之網址的網頁連線請求(2〇彳), 可由使用者操作其個人電腦(10)而對該個人電腦(1〇)發出 網頁連線請求’或是由使用者操作其個人電腦(彳〇)連線至 該網頁萃取伺服器(20),以對網頁萃取伺服器(20)發出網 頁連線請求; 依據該網頁連線請求透過網際網路查詢目標網頁 (202); 擷取目標網頁之完整編碼内容(203); 分析目標網頁(2〇4),係解析出目標網頁中之複數物件 並將之轉化為複數個資訊物件且加以編號;例如若目標網 頁係以HTML語法編輯而成,則此步驟之具體實施方式可 於該可上網之電子裝置擷取目標網頁後,先透過HTML的 語法分析器解析目標網頁的HTML標籤,以將目標網頁中 .200951737 的資訊’包含文字及其超連結(hype「link)設定或圖片,物 件化為複數個資訊物件,並賦予每個資訊物件一個獨立的 編號; 產生一包含有該複數個資訊物件的圖形化物件選取晝 面(205),請參閱第三圖所示’具體實施方式可仿造該目標 網頁的編排方式,產生包含有該複數個資訊物件的物件選 取晝面(11),並由使用者之個人電腦(1〇)的瀏覽器顯示之, ❹該物件選取畫面(11)並配合嵌入程式邏輯,如JavaScript 語法等,以於滑鼠游標移至資訊物件之範圍内時,即可於 滑鼠滑過之資訊物件周圍出現一選取外框(111);另請參閱 第四圖左上角所示,當使用者以滑鼠點選某一資訊物件 後即會進步產生一預覽區域(12),將使用者所選定之 資訊物件顯示於該預覽區域(12)中,供使用者確認所選之 資訊物件無誤,若資訊物件中包含有超連結,則預覽區域(12) 所顯示之内容亦會包含超連結; 〇 接收一包含被選取物件之編號的物件選取請求(21 2), 即代表使用者已選定所需之資訊物件; 將該目標網頁的網址及在該物件選取晝面中被選取之 資訊物件的編號儲存為一網頁萃取規則(213)。 以上步驟即是本發明之選取網頁中物件的方法。藉由 本方法’當使用者需要再次取得此物件時,可透過執行該 網頁萃取規則來取得物件的县如_由a, 干的最新内容(如第五圖所示),而 具體實施步驟請參閱第六圖所 ' Μ ητ不,包括下列步驟: 當使用者欲檢視先前韩遗ψ 签 師選出之資訊時,該可上網之電 8 200951737 子裝置即會接收一包含指定網頁萃取規則之指令的連線請 求(214); 自所指定之網頁萃取規則中的目標網頁篩選出對應之 編號的資訊物件(21 5)(如第五圖所示);於本實施例中,此 步驟係進一步包括下列子步驟: 擷取目標網頁並將目標網頁的資訊加以物件化為複數 個檢視用資訊物件(21 5a),其中各檢視用資訊物件係被如 該「分析目標網頁」(204)步驟般設定對應的獨立編號; 依所指定之網頁萃取規則中的編號篩選出對應的檢視 用資訊物件(21 5b),係依所指定之網頁萃取規則中的編號, 於目標網頁中取出具有對應之檢視用編號的檢視用資訊物 件; 產生一包含篩選出之檢視用資訊物件的資訊顯示晝面 (21 5c),係如第五圖所示,將篩選出之檢視用資訊物件獨 立地顯不於一資訊顯示晝面(彳3)上,若資訊物件中包含有 ❹ 超連結,則資訊顯示晝面(13)所顯示之内容亦會包含超連 結。 此外,若上述的目標網頁是一個總覽性質的網頁,則 其所顯示的資訊往往僅是複數個詳細資訊的大綱’而每個 詳細資訊的大綱則往往都會設定超連結,以供使用者進一 步連結至具有詳細資訊的網頁。此時可以將本發明之第一 實施例擴展為階層式的遞迴筛選流程而成為本發明之第二 實施例。請進一步參閱第七圖所示,若於該物件選取畫面Ο” 之預覽區域(12)中的資訊物件内容中包含有超連結,則使 9 200951737 用者可進—步於「產生一包含有該複數個資訊物件的圖形 化物件選取晝面」(205)步驟後,先在預覽區域(12)中點選 超連結,此時本發明之第二實施例相較於第一實施例係= 一步包括下列步驟: 接收—包含有超連結網頁之網址的超連結連線往 (206); 依據該超連結連線請求透過網際網路取得超連結網頁 之完整編碼内容(2〇7); 分析超連結網頁(208),係解析出超連結網頁中之複數 物件並將之轉化為複數個子資訊物件且加以編號; 產生一包含有該複數個子資訊物件的圖形化子物件選 取晝面(209) ’請參閱第人圖所*,係如「產生-包含有該 複數個資訊物件的圖形化物件選取晝面」(205)步驟般產生 該子物件選取晝面(14),且滑鼠滑過該子物件選取畫面(14) 上的子資訊物件時,該子資訊物件周圍亦會出現一選取外 ❹框(141 ),另凊參閱第九圖左上角所示,當使用者以滑鼠點 選某子資訊物件後,即會進一步產生一預覽區域(15), 將使用者所選定之子資訊物件顯示於該預覽區域(15)中; 接收一包含被選取子資訊物件之編號的子物件選取請 求(210); 將該超連結網頁的網址及在子物件選取晝面被選取之 子資訊物件的編號儲存為一網頁萃取子規則(211); 爾後於「將該目標網頁的網址及在該物件選取畫面中 被選取之資訊物件的編號儲存為一網頁萃取規則」(21 3)步 200951737 驟中將此網頁萃取子規則一同存入對應的網頁萃取規 中。 次=爾後在進行「產生一包含篩選出之檢視用資訊物件的 =貝訊顯不晝面」(215c)步驟時,由於當資訊物件中包含有 超連結’則資訊顯示晝面(13)所顯示之内容亦會包含超連 '°因此右使用者點選了該資訊顯示晝面(1 3)中的超連結, 則對應網頁萃取規則的子規則會被套用,請參閱第十圖所 D示,本發明之方法即於「產生一包含篩選出之檢視用資訊 物件的資訊顯示畫面」(215c)步驟後進一步包括下列步驟: 接收一包含有超連結網頁之網址的超連結連線請 (215d); 操取超連結網頁並將超連結網頁的資訊加以物件化為 稷數個檢視用子資訊物件(215e),其中各檢視用子資訊物 系被如及刀析超連結網頁」(2〇8)步驟般設定對應的獨 立編號; ❹ I對應之子網頁萃取規則中的編號篩選出對應的檢視 =子資訊物件(215f),係依對應之子網頁萃取規射的編 號’於超連結網頁巾取出具有對應編號的檢視用子資訊物 產生-包含篩選出之檢視用子資訊物件的子資訊顯示 :5g)’係如第十—圖所示,將篩選出之檢視用子資 a . . ^ 子資汛顯不畫面(16)上;藉此完成 階層式的網頁萃取規則套用。 、本發月尤其適合用於新聞類、股票類等提供即 11 .200951737 時資訊的網站上,主因在於這類網站為方便即時更新内 容,令網路管理員毋須為了更新網站内容而必須不斷地製 作新的網頁,常見的做法是先設計出一標準樣板,供網路 管理員藉由輸入即時内容的文字,並選取欲一同顯示的圖 片及廣告,即可於該標準樣板上自動地將文字、圖片及廣 告帶入而呈現出不同内容的網頁。因此,本發明藉由設定 一網頁萃取規則,即可共同適用於上述使用標準樣板更新 網頁的網站,令使用者可容易地與目標網頁般獲得即時更 新的訊息。 由上述可知,本發明之優點具有: 1_筛選過程中’使用者毋須理解HTML語法等技術 層面的、’’田卽,使用者僅需透過視覺化的選取過程,即可指 定所需之資訊物件,並可將其儲存為一網頁萃取規則,或 疋針對不Π的網頁、不同的資訊設定多個不同的網頁萃取 規則,爾後使用者即可直接操作該可上網之電子裝置,或 〇 疋以個人電腦/手持裝置連接該可上網之電子裝置,藉由點 選所欲使用的網頁萃取規則,令該可上網之電子裝置依據 該網頁萃取規則取得所需資訊物件的即時内容,供使用者 於該可上網之電子褒置上劉覽觀看或於個人電腦或手持裝 置上瀏覽觀看,使用起來相當簡易。 2·由於目標網頁十所設的超連結多是連結到以標準樣 板做法產生的網頁,因此藉由階層式網頁萃取規則,可令 使用者於δ又疋一目標網頁中資訊物件的網頁萃取規則後, 進-步記錄該資訊物件中超連結的子資訊物件萃取流程, 12 .200951737 如此-來’本發明只要是針對類似版面的内容,一 一針對每個網頁重複設定其網頁萃取規則。 ❹ 3·藉由過溘篩選所需的資訊物件,在特定的劉覽環境 下,例”限的頻寬、受限的記憶體容量、受限的運算速 度、電池哥命等q吏用者可操作其個人電腦或手持裝置, 直接或透過一網頁萃取伺服器間接地篩選目標網頁中不必 要的資訊物件,而容易地取得真正感興趣的資訊物件。惟 本發明雖已於前述實施财所揭露,但並不僅限於前述實 施例中所提及之内纟,在不脫離本發明之精神和範圍内所 作之任何變化與修改,均屬於本發明之保護範圍。 綜上所述,本發明已具備顯著功效增進,並符合發明 專利要件’爰依法提起申請。 【圖式簡單說明】 第一圖:係實現本發明之一系統架構圖。 第二圖·係本發明之第一實施例的流程圖。 第三圖:係本發明所產生之一物件選取晝面。 第四圖:係本發明中具有一預覽區域的物件選取畫 面0 第五圖:係本發明所產生之一資訊顯示畫面。 第六圖:係執行本發明第一實施例所得之網頁萃取規 則的流程圖。 第七圖:係本發明第二實施例之部分流程圖。 第八圖:係本發明所產生之一子物件選取晝面。 13 200951737 面。第九圖.係本發明中具有一預覽區域的子物件選取畫 第十圖.係執行本發明第 規則的流程圖。 例所得 第十圖.係本發明所產生之一 兼具文字、圖片、 之子網頁萃取 第十二圖:係“ 子資訊顯示畫面 媒體型態的網頁JavaScript, PHP, JSP, XHTML, XML, DHTML, JASP, ASP, PERL, Flash, etc. Please refer to the second figure. The first embodiment of the present invention includes the following steps: receiving a webpage connection request (2〇彳) containing a webpage of a target webpage, and the user can operate the personal computer (10) And the personal computer (1〇) sends a web connection request' or the user operates his personal computer (彳〇) to connect to the webpage extraction server (20) to send the webpage extraction server (20) Web page connection request; querying the target webpage through the Internet according to the webpage connection request (202); capturing the complete encoded content of the target webpage (203); analyzing the target webpage (2〇4), parsing out the target webpage The plurality of objects are converted into a plurality of information objects and numbered; for example, if the target webpage is edited in HTML syntax, the specific implementation of this step can be obtained after the webpage of the Internet-enabled electronic device captures the target webpage The HTML parser parses the HTML tag of the target webpage to include the information of the .200951737 in the target webpage, including the text and its hyperlink (hype "link" setting or image, into a complex object. Information objects, and each information object is given a separate number; a graphic material selection surface (205) containing the plurality of information objects is generated, as shown in the third figure, the specific embodiment can imitate the target The web page is arranged to generate an object selection surface (11) including the plurality of information objects, and is displayed by a browser of the user's personal computer (1〇), and the object selection screen (11) is embedded with the object Program logic, such as JavaScript syntax, so that when the mouse cursor is moved within the scope of the information object, a selection frame (111) appears around the information object that the mouse slid over; see also the upper left corner of the fourth image. As shown, when the user selects a certain information object by clicking the mouse, a preview area (12) is generated, and the information object selected by the user is displayed in the preview area (12) for the user to confirm. The selected information object is correct. If the information object contains a hyperlink, the content displayed in the preview area (12) will also contain a hyperlink; 〇 Receive an object containing the number of the selected object The request (21 2) represents that the user has selected the desired information object; the URL of the target web page and the number of the selected information object selected in the object selection surface are stored as a web page extraction rule (213). The above steps are the method for selecting an object in a webpage according to the present invention. By the method, when the user needs to obtain the object again, the latest content of the county, such as a, can be obtained by executing the webpage extraction rule ( As shown in the fifth figure, please refer to the figure 第六 ητ in the sixth figure. The following steps are included: When the user wants to view the information selected by the previous Korean testator, the Internet can be connected to the Internet 8 200951737 The child device receives a connection request (214) containing an instruction for specifying a webpage extraction rule; and extracts a corresponding number of information objects from the target webpage in the specified webpage extraction rule (21 5) (as shown in the fifth figure) In this embodiment, the step further includes the following sub-steps: capturing the target webpage and objectifying the information of the target webpage into a plurality of information objects for viewing (2) 1 5a), wherein each of the viewing information objects is set to a corresponding independent number as in the "analysis target webpage" (204) step; the corresponding viewing information object is filtered according to the number in the specified webpage extraction rule (21) 5b), according to the number in the webpage extraction rule specified, the visual information object having the corresponding viewing number is taken out from the target webpage; and a information display surface containing the selected visual information object is generated (21 5c) ), as shown in the fifth figure, the selected information items for viewing are independently displayed on a display page (彳3). If the information object contains a hyperlink, the information is displayed ( 13) The content displayed will also contain hyperlinks. In addition, if the above-mentioned target webpage is a webpage of an overview nature, the information displayed is often only an outline of a plurality of detailed information' and the outline of each detailed information is often provided with a hyperlink for further linking by the user. To a web page with detailed information. At this time, the first embodiment of the present invention can be extended to a hierarchical recursive screening process as a second embodiment of the present invention. Please refer to the seventh figure. If the content of the information object in the preview area (12) of the object selection screen (") contains a hyperlink, then 9 200951737 users can proceed to "generate an included After the step of step (205) of the plurality of information objects, the second embodiment of the present invention is compared with the first embodiment. The step includes the following steps: Receiving - a hyperlink containing the hyperlinked webpage URL (206); obtaining the complete encoded content of the hyperlinked webpage through the Internet according to the hyperlinked connection request (2〇7); The hyperlinked webpage (208) parses and converts the plurality of objects in the hyperlinked webpage into a plurality of sub-information objects and numbers them; generates a graphical sub-object selection surface (209) including the plurality of sub-information objects 'Please refer to the figure of the person's figure*, which is generated by the step of "generating - containing the graphic material of the plurality of information objects" (205), and the mouse is selected to pass through the face (14), and the mouse is slid over The child When the sub-information object on the object selection screen (14), a selection outer frame (141) appears around the sub-information object, and as shown in the upper left corner of the ninth figure, when the user selects a certain mouse After the sub-information object, a preview area (15) is further generated, and the sub-information object selected by the user is displayed in the preview area (15); and a sub-object selection request including the number of the selected sub-information object is received ( 210); storing the URL of the hyperlinked webpage and the number of the sub-information object selected in the sub-object selection as a webpage extraction sub-rule (211); then "putting the webpage of the target webpage and the object selection screen" The number of the selected information object is stored as a web page extraction rule" (21 3) step 200951737. The web page extraction sub-rule is stored in the corresponding webpage extraction rule. After the time = "Steps to generate a filtered information object containing the selected information" (215c), because the information object contains a hyperlink, then the information display page (13) The displayed content will also contain a hyperlink. Therefore, if the right user clicks on the hyperlink in the information display page (1 3), the sub-rules corresponding to the page extraction rule will be applied. Please refer to the tenth figure. The method of the present invention further comprises the following steps after the step of: generating a information display screen including the filtered information object for viewing (215c): receiving a hyperlink connection including a hyperlinked webpage ( 215d); Take the Hyperlink page and object the information of the hyperlinked webpage into a number of viewing sub-information objects (215e), wherein each of the viewing sub-information items is like a Knockout Hyperlink webpage" (2 〇8) Step to set the corresponding independent number; ❹ I corresponds to the number in the sub-page extraction rule to filter out the corresponding view = sub-information object (215f), according to the corresponding sub-page extracting the number of the report 'in Hyperlink The towel is taken out of the viewing sub-information object with the corresponding number - the sub-information display including the selected sub-information object for viewing is displayed: 5g) 'as shown in the tenth-picture, the screening sub-investment a will be selected. ^ The sub-investment is not displayed on the screen (16); this completes the hierarchical web page extraction rule application. This month is especially suitable for news, stocks and other websites that provide information on 11.200951737. The main reason is that such websites are convenient for updating content so that network administrators do not have to constantly update the content of the website. To create a new web page, it is common practice to design a standard template for the webmaster to automatically place the text on the standard template by entering the text of the instant content and selecting the image and advertisement to be displayed together. , images, and advertisements that bring in different content. Therefore, the present invention can be applied to the above-mentioned website using the standard template update webpage by setting a webpage extraction rule, so that the user can easily obtain an instant update message like the target webpage. It can be seen from the above that the advantages of the present invention are as follows: 1_ During the screening process, the user does not need to understand the technical level of the HTML grammar, ''Tian Hao, the user only needs to specify the required process through the visual selection process. Information objects can be stored as a web page extraction rule, or multiple different web page extraction rules can be set for different web pages and different information, and then the user can directly operate the electronic device capable of accessing the Internet, or The personal computer/handheld device is connected to the electronic device capable of accessing the Internet, and by clicking on the webpage extraction rule to be used, the electronic device capable of accessing the Internet obtains the instant content of the desired information object according to the webpage extraction rule for use. It is easy to use on the Internet-enabled electronic device for viewing or viewing on a personal computer or handheld device. 2. Since the hyperlinks on the target page 10 are mostly linked to the web pages generated by the standard template method, the hierarchical web page extraction rules can enable users to extract the rules of the web pages of the information objects in the target webpage. After that, the sub-information object extraction process of the hyperlink in the information object is further recorded, 12.200951737. Thus, the present invention is directed to the content of the similar layout, and the webpage extraction rule is repeatedly set for each webpage. ❹ 3· By screening the required information objects, in a specific environment, for example, “limited bandwidth, limited memory capacity, limited computing speed, battery life, etc.” It can operate its personal computer or handheld device to indirectly filter unnecessary information items in the target web page directly or through a webpage extraction server, and easily obtain information items of genuine interest. However, the present invention has been implemented in the aforementioned financial assets. It is to be understood that the invention is not limited thereto, and any changes and modifications made without departing from the spirit and scope of the invention are intended to be included in the scope of the invention. Significant power improvement, and in line with the invention patent requirements '爰Apply according to law. [Simplified description of the drawings] The first figure: is a system architecture diagram of the present invention. The second figure is the flow of the first embodiment of the present invention Fig. 3 is a selection of objects selected by the present invention. The fourth figure is an object selection screen with a preview area in the present invention. A sixth embodiment is a flowchart of a webpage extraction rule obtained by the first embodiment of the present invention. The seventh diagram is a partial flowchart of the second embodiment of the present invention. The resulting sub-objects are selected as the facets. 13 200951737 face. The ninth figure is a tenth figure of the sub-objects with a preview area in the present invention. The flow chart of the first rule of the present invention is executed. According to the invention, one of the texts, pictures, and sub-page extractions is the twelfth picture: a web page of the sub-information display screen media type.

Flash播放器等多 【主要元件符號說明】 (1 〇)個人電腦 (11) 物件選取晝面 (1 1 1)選取外框 (12) 預覽區域 (13) 資訊顯示畫面 (14) 子物件選取晝面 (141)選取外框 (1 5)預覽區域 (16)子資訊顯示畫面 (20)網頁萃取伺服器 (30)網站伺服器 (41) 文字 (42) 圖片 (43) Flash播放器Flash player, etc. [Main component symbol description] (1 〇) Personal computer (11) Object selection face (1 1 1) Select frame (12) Preview area (13) Information display screen (14) Sub object selection 昼Face (141) Select the outer frame (1 5) Preview area (16) Sub-information display screen (20) Web page extraction server (30) Website server (41) Text (42) Picture (43) Flash player

Claims (1)

200951737 十、申請專利範圍: 1 . 一種選取網頁中物件的方法,係應用於一可上網 之電子裝置上,該方法係包括下列步驟: 接收一包含有目標網頁之網址的網頁連線請求; 依據該網頁連線請求透過網際網路取得目標網頁之完 整編碼内容; 70 分析目標網頁’解析出目標網頁中之複數物件並將之 轉化為複數個資訊物件且加以編號; ® I生—包含有該複數個資訊物件的圖形化物件選取畫 面’其中該物件選取晝面係模擬目標網頁之畫面排版; 接收包含被選取物件之編號的物件選取請求; 將該目標網頁的網址及在該物件選取晝面中被選取之 資訊物件的編號儲存為一網頁萃取規則。 2 .如申請專利範圍第i項所述選取網頁中物件的方 法,當目標網頁之物件中具有超連結網頁之網址時,該物 &件選取晝面上的資訊物件中亦會具有相同之超連結網頁的 網址,若被選取物件的内容中包含有超連結網頁之㈣ 時,則於「接收-包含被選取物件之編號的物件選取請求 步驟後進一步包括下列步驟: J 接收一包含有超連結網頁之網址的超連結連線請求; 依據該超連結連線請求透過網際網路取得超連結網 之完整編碼内容; ^ κ 分析超連結網頁’係解析出超連結網頁中之複數物件 並將之轉化為複數個子資訊物件且加以編號; 15 200951737 產生一包含有該複數個子資訊物件的圖形化子物件選 取畫面,其t該子物件選取畫面係模擬超連結網頁之晝面 排版; 接收一包含被選取子資訊物件之編號的子物件選取請 求; 將該超連結網頁的網址及在子物件選取晝面被選取之 子資訊物件的編號儲存為一網頁萃取子規則,其中該網頁 萃取子規則係包含在對應的網頁萃取規則之中。 Ο Η*一、圖式: 如次頁 ❹ 16200951737 X. Patent application scope: 1. A method for selecting an object in a webpage is applied to an electronic device capable of accessing the Internet, the method comprising the steps of: receiving a webpage connection request including a webpage of the target webpage; The webpage connection request obtains the complete encoded content of the target webpage through the Internet; 70 Analyze the target webpage' to parse out the plurality of objects in the target webpage and convert them into a plurality of information objects and number them; ® I--includes a graphic material selection screen of a plurality of information objects, wherein the object selection screen is a screen layout for simulating a target webpage; receiving an object selection request including a number of the selected object; and selecting a URL of the target webpage and selecting the object webpage The number of the selected information object is stored as a web page extraction rule. 2. If the object of the webpage is selected as described in item i of the patent application scope, when the webpage of the hyperlink webpage is included in the object of the target webpage, the information object in the object& The URL of the hyperlinked webpage, if the content of the selected object contains the hyperlinked webpage (4), then further includes the following steps after receiving the object selection request step including the number of the selected object: J receives one containing super Hyperlink connection request for the URL of the link page; obtain the complete encoded content of the hyperlink network through the Internet according to the hyperlink connection request; ^ κ Analysis Hyperlink page 'is parsing the plurality of objects in the hyperlinked webpage and Converting into a plurality of sub-information objects and numbering them; 15 200951737 generating a graphical sub-object selection screen containing the plurality of sub-information objects, wherein the sub-object selection screen is a mock-up of the simulated hyperlinked webpage; receiving an inclusion The sub-object selection request of the number of the selected sub-information object; the URL of the hyperlinked webpage and Select the sub-object information of the selected object plane day stored as a page number is extracted sub-rules, wherein the extracted sub-page contained in the rule-based web page corresponding to an extraction rule Ο Η *, FIG formula: such as hypophosphorous Page ❹ 16
TW097121874A 2008-06-12 2008-06-12 Method for selecting an object from a web page TW200951737A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW097121874A TW200951737A (en) 2008-06-12 2008-06-12 Method for selecting an object from a web page

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW097121874A TW200951737A (en) 2008-06-12 2008-06-12 Method for selecting an object from a web page

Publications (1)

Publication Number Publication Date
TW200951737A true TW200951737A (en) 2009-12-16

Family

ID=44871825

Family Applications (1)

Application Number Title Priority Date Filing Date
TW097121874A TW200951737A (en) 2008-06-12 2008-06-12 Method for selecting an object from a web page

Country Status (1)

Country Link
TW (1) TW200951737A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9774551B2 (en) 2014-12-01 2017-09-26 Institute For Information Industry User device, cloud server and share link identification method

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9774551B2 (en) 2014-12-01 2017-09-26 Institute For Information Industry User device, cloud server and share link identification method
TWI611347B (en) * 2014-12-01 2018-01-11 財團法人資訊工業策進會 User device, cloud server and share link identification method

Similar Documents

Publication Publication Date Title
US11256848B2 (en) Automated augmentation of text, web and physical environments using multimedia content
JP6748071B2 (en) Web content generation method and system
US20190272313A1 (en) Dynamic generation of mobile web experience
US9529780B2 (en) Displaying content on a mobile device
CN102915308B (en) A kind of method of page rendering and device
US20130326333A1 (en) Mobile Content Management System
US20150227276A1 (en) Method and system for providing an interactive user guide on a webpage
US9887941B1 (en) In-message applications in a messaging platform
CN101765979A (en) Document processing for mobile devices
CN103336794B (en) For providing the corresponding method and apparatus that information is presented in target pages
US10439965B1 (en) In-message applications in a messaging platform
CN104142923A (en) Method and device for obtaining and sharing partial contents of webpage
US20130132823A1 (en) Metadata augmentation of web pages
CN106951405B (en) Data processing method and device based on typesetting engine
TW201214161A (en) Integration method for really simple syndication (RSS) document
CN113127776A (en) Breadcrumb path generation method and device and terminal equipment
TW200951737A (en) Method for selecting an object from a web page
CN116433997A (en) Image labeling method, device and medium
Joshi HTML5 programming for ASP. NET developers
JP4921570B2 (en) Blog service providing system, method and program
JP5330169B2 (en) Content data providing device
CN105224571A (en) Terminal uploaded data processing method and device and data uploading processing method and device
US20090327233A1 (en) Method of selecting objects in web pages
KR20090124873A (en) System and method for processing keyword(or context) advertisement and program recording medium
JP5674704B2 (en) Information processing apparatus, method, computer program, and system