TW201020989A - Method of capturing and analyzing web page contents - Google Patents

Method of capturing and analyzing web page contents Download PDF

Info

Publication number
TW201020989A
TW201020989A TW97146147A TW97146147A TW201020989A TW 201020989 A TW201020989 A TW 201020989A TW 97146147 A TW97146147 A TW 97146147A TW 97146147 A TW97146147 A TW 97146147A TW 201020989 A TW201020989 A TW 201020989A
Authority
TW
Taiwan
Prior art keywords
content
webpage
information
page
electronic map
Prior art date
Application number
TW97146147A
Other languages
Chinese (zh)
Inventor
wei-ren Zhang
Original Assignee
wei-ren Zhang
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by wei-ren Zhang filed Critical wei-ren Zhang
Priority to TW97146147A priority Critical patent/TW201020989A/en
Publication of TW201020989A publication Critical patent/TW201020989A/en

Links

Landscapes

  • Information Transfer Between Computers (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

This invention relates to a method of capturing and analyzing webpage contents, including steps of: defining the method of searching web pages of a web page searching program; starting executing web page searching and capturing related web page information; capturing and analyzing address information of the web page content; transforming the address information into a geographical coordinate; and presenting the web page information on an electronic map according to the geographical coordinate.

Description

201020989 九、發明說明: 【發明所屬之技術領域】 本發明是錢-翻頁魄触及分析方法,尤指— 種可以快速_路上相_胃之超賴與舰座標連結 :起,以方便使用者峨之網肋錢取及分析方 【先前技術】 普及以消費者通常峨機或σ耳相傳的方式 •J一餐廳、商店或飯店消費,若是遇到口味合適、 ❹ Ο 算是運氣好,若是口味J合適服 到且&位,Ρ貝,則只好自認倒楣,列為拒絕往來戶。 路Ϊ術之發達’消f者可於餐廳、飯店或旅遊景 i心得Ϊ表於如個人部落格(b_之網頁上、 關、ΐπ。二家也可透過美食卿或店家首頁報導相 遠二/者已經知道該部落格或店家首頁之網路超 ^位^可透過難器戦該網頁中所報導之餐靡評價 資訊或旅遊心得等;若使用者不知道網路 it Ϊΐ 侧麟料鱗侧餐顧飯店之部 ί ΐ^ί;路超連結,,然後連結網路超連結位址 店或景點。上述之行為已大幅改變 行為已大幅改變傳統消費者之消費習慣,消費 3需再像往昔般,以隨機或口耳相傳以以費 =那算是運氣好,若是Dj:適服 P貝,則只好自認倒楣,列為拒絕往來戶。 然而,目前網路上之網頁或部落格文章缺乏整合性, 201020989 逐一輸入部落格或網頁之網路位址, 如消費者需先得才例 進 並 ❹ 者S覽以地理位置顯示超連結’以方便使用 【發明内容】 本發明之目的係提供一種網頁内容擷取及分析方法, 其可以快速將網路上相關網頁之超連結及地理座標連結 一起’以方便使用者瀏覽之目的。 根據本發明之一個不受限制之實施例,該網頁内容擷 ❿取及分析方法,其包括下列步驟:定義網頁搜尋程式之網 頁搜尋方法;開始執行網頁搜尋並擷取相關網頁資訊;擷 取及分析該網頁内容中之地址資訊;將該地址資訊轉換成 地理座標;以及根據該地理座標,將該網頁資氣呈現於一 電子地圖上。 為使貴審查委員能進一步瞭解本發明之結構、特徵 及其目的,茲附以圖式及較佳具體實施例之詳細說明如后。 較佳具體實施例說明。 【實施方式】 請一併參照圖1〜圖4’其中圖1緣示根據本發明一較 201020989 佳^施例之網頁内容揭取及分析方法之流程示意圖;圖2 緣示根據本發明一較佳實施例之網頁内容擷取及分析方法 將網頁資訊呈現於一電子地圖上之示意圖;圖3繪示使用 者於如圖2所示之電子地圖上點選第一筆部落格網頁位 ,,以劉覽其文章之示意圖;圖4繪示使用者於如圖2所 示之電子地圖上點選第二筆部落格網頁位址,以瀏覽其文 章之示意圖。 ❹ Ο 如圖所示’本發明之網頁内容擷取及分析方法,其包 括下列步驟:定義網頁搜尋程式之網頁搜尋方法(步驟1); ,始^行網頁搜尋並擷取相關網頁資訊(步驟2 );擷取及 分析該網頁内容中之地址資訊(步驟3);將該地址資訊轉 ,,地理座標(步驟4);以及根據該地理座標,將該網頁 資訊呈現於一電子地圖上(步驟5)。 於步驟1 + ’使时於定義網碰尋程紅網頁搜尋 ^法’·其中,該網頁搜尋程式即俗稱網路蜘蛛或網路爬行 被=㈣使用者透過職騎定義之網際 ,路網頁誠’例如但不限於為部落格網頁,或主動依程 序内定義之網際網路網頁範圍執行搜尋。 士 ·於步驟2中,開始執行網頁搜尋並擷取相關網頁資 ^ ’其中,该網頁資訊為來源網址及網頁内容,透過 ΐί 程式將自動擷取至少包括來源網址、網 頁内容及地址等資訊。 j 於步驟3中’擷取及分析該網頁内容中之地址 、$ ’該搜尋程式依照所擷取之網頁内容執行至少包^ , 訊丁項以上之分析,若網肋容中有符合分析條^ 2,=該ίί擷取下來,該資料例如但不限於為Λ、 抬碩(title)或連結資料等。 m朋 7 201020989 总步驟4中,將該地址資訊轉換成地理座標;盆中, 但秘於°。。9祕所提供之Ge。加丨叩應用 ”執行该地址資訊轉換成地理座標之轉換。該Geo =====地字理^ ❹ Ο -雷中,根據該地理座標,將鞠頁資訊呈現於 示)f·,y、,其中,該電子地_位於—資料庫(圖未 1.'、可以線上即時轉換或背景(background)轉換 4 地圖例如但不限於為U_P或、 ,昭上内ϊ擷取及分析方法’其電子地圖除依 在該電子地圖上外,若不同 館之網頁目前共有兩筆有關忠南飯 -筆位用電,地圖上點選第 常料理之部落格網頁夺該網二!北餐廳」f ㈣,201020989 IX. INSTRUCTIONS: [Technical field of invention] The present invention is a money-turning page touch analysis method, in particular, a kind of can be quickly _ road phase _ stomach super affair and ship coordinates link: from, to facilitate users峨 网 肋 钱 取 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 【 J is suitable to serve and & position, mussel, then had to admit that he was down, listed as a refusal of the household. The development of the roads can be seen in restaurants, restaurants or tourist attractions, such as personal blogs (b_ on the web, off, ΐ π. The two can also report through the food court The second person has already known that the blog or the homepage of the store's homepage can pass through the information reported on the website or the travel experience; if the user does not know the network it Ϊΐ side lining scales Side restaurant Gu 饭店 ί ^ί; Road hyperlink, and then link to the network hyperlink site or attractions. The above behavior has greatly changed the behavior has greatly changed the consumption habits of traditional consumers, consumption 3 needs to be like In the past, random or word of mouth to pass the fee = that is good luck, if it is Dj: suitable for P, then had to admit that it is rejected as a refusal of the current household. However, the current web pages or blog articles lack integration Sex, 201020989 Enter the web address of the blog or webpage one by one, if the consumer needs to get in the first place and then the user's view to display the hyperlink in a geographical location' for convenient use. [Invention] The object of the present invention is to provide a Web page A method for capturing and analyzing, which can quickly link the hyperlinks and geographic coordinates of related web pages on the network together for the convenience of the user. According to an unrestricted embodiment of the present invention, the content of the webpage is captured. And an analysis method comprising the steps of: defining a webpage search method of a webpage search program; starting a webpage search and extracting relevant webpage information; capturing and analyzing address information in the webpage content; and converting the address information into a geographic coordinate; And presenting the webpage assets on an electronic map according to the geographic coordinates. To enable the reviewing committee to further understand the structure, features and objects of the present invention, detailed descriptions of the drawings and preferred embodiments are provided. The following is a description of a preferred embodiment. [Embodiment] Referring to FIG. 1 to FIG. 4 ′′, FIG. 1 is a flow chart showing a method for extracting and analyzing a webpage content according to a preferred embodiment of 201020989 according to the present invention. FIG. 2 illustrates a method for capturing and analyzing web content according to a preferred embodiment of the present invention. Figure 3 is a schematic diagram showing the user's first page of the blog on the electronic map shown in Figure 2, with a view of Liu's article; Figure 4 shows the user in Figure 2. Click on the second blog page address on the electronic map shown to view the schematic diagram of the article. ❹ Ο As shown in the following figure, the method for capturing and analyzing the content of the webpage of the present invention includes the following steps: defining a webpage search The webpage search method of the program (step 1); the webpage search and retrieve relevant webpage information (step 2); extract and analyze the address information in the webpage content (step 3); a geographic coordinate (step 4); and according to the geographic coordinates, the webpage information is presented on an electronic map (step 5). In step 1 + 'the time is defined in the definition web search for the red web search ^ method' The webpage search program is commonly known as web spider or web crawling. (4) The webpage defined by the user through the mount, the webpage is honest, such as but not limited to being a blog webpage, or actively defining the webpage range defined by the program. Perform a searchIn step 2, you will start a web search and retrieve relevant webpages. The webpage information is the source URL and webpage content. The ΐί program will automatically capture at least the source URL, webpage content and address. j In step 3, 'capture and analyze the address in the content of the webpage, $' the search program performs at least the analysis of the above-mentioned information according to the content of the captured webpage, and if there is an analysis bar in the network ^ 2,=The ίί撷, the information such as, but not limited to, Λ, title, or link information. m朋 7 201020989 In the general step 4, the address information is converted into a geographical coordinate; in the basin, but secretly in °. . 9 secrets provided by Ge. The coronation application "performs the conversion of the address information into a geographic coordinate. The Geo ===== geography ^ ❹ Ο - Leizhong, according to the geographic coordinates, the page information is presented in the display) f,, y , where, the electronic location - located in the database (Figure not 1.., can be converted on-line or background (background) conversion 4 maps such as but not limited to U_P or,, and the method of extraction and analysis] In addition to the electronic map on the electronic map, if there are two copies of the website of the different museums, there is a total of two pieces of information about the Zhongnan rice-pen position. On the map, click on the blog page of the first cooking to win the second! f (four),

咖網頁,該網頁中即記载另-部落格ki對L 201020989 之'^文早’且其上亦具有店家地址資訊及地圖等 法進之網頁内容娜及分析方 標記之示意圖。如圖所示於二 可於該電子地圖上ί械立广删 鲁 ΐ )-,讓使用者可於該電子地圖上自行 •j 占、屋又及緯度資訊,並於其下相關攔位中輸入註 之畫當潔“」,以刪除該標記’以使電子地圖 6 ’其繪示本發明之網肋容娜及分析方 ίίΐί有—使用者可於該電子地圖上自行新增文章連 =、隹!如圖所不,本發明之網肋容擷取及分析方 使用者可於該電子地圖上自行新增文章連 i站d步驟ρ ’讓使用者可於該電子地圖上之「來源 ί站ϊ頁’址」中輸人新增文章連結之網址,並於其下「來 網1說明」欄位中輸人說明註記,以便於該電子地 增文章連結;或者使用者可於該電子地圖上點選「刪 除“己」’以刪除該標記,以使電子地@之晝雜為簡潔。 八p此’由上述之結果可得知,本發明之網頁内容搁取 析方法,其可以快速將網路上相關網頁之位址及内容 連結在一起’以方便使用者瀏覽之等優點,因此,確較習 知方法具進步性。 ^ 〜雖然本發明已以較佳實施例揭露如上,然其並非用以 限^本發明,任何熟習此技藝者,在不脫離本發明之精神 和範圍内,當可作少許之更動與潤飾,因此本發明之保護 201020989 範圍當視後附之申請專利範圍所界定者為準。 【圖式簡單說明】 ~ 圖1為-示意圖,其繪示根據本發明一較 網頁内容擷取及分析方法之流程示意圖。缝貫施例之 圖2為-示意圖’其繪示根據本發明— J頁,擷取及分析方法將網頁資訊呈現於一電:地圖1上 之不意圖·。 圖3為一示意圖,其繪示使用者於如圖2 =圖上點選第一筆部落格網頁位址,簡覽其文章』 ㈣ΐ4為一示意圖’其緣示使用者於如圖2所示之電子 圖圖上點選第二筆部落格網頁位址,以戰其文章之示意 析方\ ίί #發明之網頁内容及分 刪除者可於該電子地圖上自行建立/ 鲁 析方之網頁内容擷取及分 章連結之示_有w者了於該電子地®上自行新增文 【主要元件符號說明】 ⑵丨=頁搜尋程式之網頁搜尋方法; 逆訊; ί!5: 步卿··使用者可於該電子地圖上自行建立/刪除標記;以 201020989 及 步驟7 :使用者可於該電子地圖上自行新增文章連結The coffee page, which is recorded in the other page, is a blog of the company's address information and maps. As shown in the figure, on the electronic map, the user can use the information on the electronic map to occupy the house, the house and the latitude information, and in the relevant barriers. Enter the note "Don't delete the mark" to make the electronic map 6' which shows the mesh of the present invention and the analysis party ίίΐί - the user can add a new article on the electronic map = Oh! As shown in the figure, the user of the ribbed device of the present invention can add a new article on the electronic map and connect the i station to the step ρ 'to enable the user to click on the electronic map. In the page 'Address', the user enters the URL of the article link, and enters the description note in the "Description 1" field to facilitate the electronic link to the article; or the user can view the electronic map. Click "Delete" to remove the tag so that the electronic address is noisy. As can be seen from the above results, the webpage content depreciation method of the present invention can quickly link the addresses and contents of related webpages on the network together to facilitate user browsing, and therefore, It is indeed more advanced than the conventional method. Although the present invention has been disclosed in the above preferred embodiments, it is not intended to limit the invention, and those skilled in the art can make a few changes and refinements without departing from the spirit and scope of the invention. Therefore, the scope of the protection of the present invention 201020989 is subject to the definition of the scope of the patent application. BRIEF DESCRIPTION OF THE DRAWINGS FIG. 1 is a schematic diagram showing a flow chart of a method for capturing and analyzing web content according to the present invention. Figure 2 is a schematic view showing the method of drawing and analyzing the web page information on a map 1 according to the present invention. FIG. 3 is a schematic diagram showing the user selecting the first blog page address on the map as shown in FIG. 2 and browsing the article. (4) ΐ 4 is a schematic diagram, and the user is shown in FIG. 2 Click on the second blog page address on the electronic map to warn the article of the article. \ ίί # The web content of the invention and the deleter can create the website content on the electronic map.撷 及 分 分 分 _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ · The user can create/delete the mark on the electronic map; for 201020989 and step 7: the user can add a new article link on the electronic map.

Claims (1)

201020989 十、申請專利範圍·· 址ί; 2:如;請現於-電子地圖上 方法,其中於該定義網頁搜尋内容娜及分^ 方法,其中於i定義頁内容擷取及分析 步驟中如^^搜尋朗之定義方法之定義 方法t 1項_之㈣内容娜及分析 驟中,該網頁;=====相關網頁資訊步 m 方法範圍第1項騎之师内容娜及分析 中n、擷取及分析該網頁内容中之地址資訊步驟 網頁内容中之日期、抬頭及連結等資訊。 *、土 ·ί1δ月專利乾圍1項所述之網頁内容擷取及分析 =’其巾__地址資鱗換雜理座標步驟中,係 ^由Google®所提供之Geo codjng應用程式介面執行轉 換0 、如申請專利範圍第1項所述之網頁内容擷取及分析 方法,其中於該根據該地理座標,將該網頁資訊呈現於一 電子地圖上步驟中,該電子地圖係位於一資料庫上,其可 以線上即時轉換或背景轉換執行。 ’、 12 201020989 8.=申請專利範圍第7項所述之網頁内容擷取及分析 方法,其中於該根據該地理座標,將該網頁資訊呈現於一 中,該電子地圖為UR Map或貝G〇: _ 方法97項所述之網頁内容擷取及分析 子地圖上進—步可顯示該網頁資 部落格文章:表;間;後座?j去細 10如申社皇排序,攻新者排在最前面。 方法,其進二月步具有'-使用者容擷取及f斤, 刪除標記之步驟。 、該電子地圖上自行建立/ ,.1.1 ^ ^ 文章連結之步驟。 可於該電子地圖上自行新增201020989 X. Patent application scope·· Address ί; 2: 如; 请在在-电子地图方法, in which the search page searches for content Na and points ^ method, where i defines page content capture and analysis steps ^^ Search for the definition method of Lang's definition method t 1 item _ (4) Content Na and analysis step, the page; ===== Related web page information step m Method range 1st rider content Na and analysis n And extract and analyze the date, header and link information in the content of the page information in the content of the webpage. *, · ί δ δ 专利 专利 专利 专利 专利 专利 专利 专利 专利 专利 专利 专利 = 专利 专利 = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = = The method for extracting and analyzing the content of the webpage as described in claim 1, wherein the webpage information is presented in an electronic map according to the geographic coordinates, and the electronic map is located in a database. On, it can be performed on-line instant conversion or background conversion. ', 12 201020989 8. = The method for extracting and analyzing the content of the webpage described in claim 7 of the patent application, wherein the webpage information is presented in one according to the geographic coordinates, and the electronic map is UR Map or Bei G 〇: _ Method 97 of the web content capture and analysis sub-map on the step can display the page blog article: table; room; back seat? j to fine 10, such as Shen Shehuang sort, attack new At the top. The method, which has a step of deleting the mark, is entered in the second step. On the electronic map, create the /, .1.1 ^ ^ article link step. Can be added on the map itself
TW97146147A 2008-11-28 2008-11-28 Method of capturing and analyzing web page contents TW201020989A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW97146147A TW201020989A (en) 2008-11-28 2008-11-28 Method of capturing and analyzing web page contents

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW97146147A TW201020989A (en) 2008-11-28 2008-11-28 Method of capturing and analyzing web page contents

Publications (1)

Publication Number Publication Date
TW201020989A true TW201020989A (en) 2010-06-01

Family

ID=44832490

Family Applications (1)

Application Number Title Priority Date Filing Date
TW97146147A TW201020989A (en) 2008-11-28 2008-11-28 Method of capturing and analyzing web page contents

Country Status (1)

Country Link
TW (1) TW201020989A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103020247A (en) * 2012-12-18 2013-04-03 北京奇虎科技有限公司 Page presentation method and device
TWI474202B (en) * 2012-01-20 2015-02-21 Htc Corp Methods for parsing content of document, handheld electronic apparatus and computer program product thereof
TWI661351B (en) * 2017-11-15 2019-06-01 湛天創新科技股份有限公司 System of digital content as in combination with map service and method for producing the digital content

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI474202B (en) * 2012-01-20 2015-02-21 Htc Corp Methods for parsing content of document, handheld electronic apparatus and computer program product thereof
US9218083B2 (en) 2012-01-20 2015-12-22 Htc Corporation Methods for parsing content of document, handheld electronic apparatus and computer-readable medium thereof
CN103020247A (en) * 2012-12-18 2013-04-03 北京奇虎科技有限公司 Page presentation method and device
CN103020247B (en) * 2012-12-18 2016-08-31 北京奇虎科技有限公司 page display method and device
TWI661351B (en) * 2017-11-15 2019-06-01 湛天創新科技股份有限公司 System of digital content as in combination with map service and method for producing the digital content

Similar Documents

Publication Publication Date Title
US11627178B2 (en) Providing geocoded targeted web content
JP5956263B2 (en) Evaluation apparatus, evaluation system, evaluation method for evaluating information provided by user, and computer program
US20080104055A1 (en) Restaurant review search system and method for automatically providing links to relevant reviews of selected restaurants by use of the internet
TW201020989A (en) Method of capturing and analyzing web page contents
Singh et al. Evaluation of official tourism websites of world’s leading tourist destinations using the balanced score-card approach
TW200945068A (en) Method and system for information corresponding to geographical position
Almodaresi et al. Investigation of fluoride concentration in rural drinking water resources of bardaskan county using geographic information system (GIS) in 2014
CN102467714A (en) Construction method of network business district and system thereof
JP5613701B2 (en) Related document collection apparatus, method and program
Ahlers Towards Geospatial Search for Honduras
CN201780582U (en) Electronic map system capable of offering street scenes
JP4498892B2 (en) Information browsing apparatus and information browsing method
Kanehira et al. CURAP: CURating geo-related information on a mAP
HASANI et al. Location and prioritize the capable sites to construct health villages (Case study: Qeshm Island)
TW201118618A (en) Method and system for providing geo-position-based information
SHAYKH Identifying deprived regions of iran by composite ranking
Quintá Website Production in Galicia and its Visibility on the Net: Moving Towards the Knowledge Society
Okuno Aggregation and application of community tourism information contents by using Linked Open Data
Lemmelä et al. Finding communication hot spots of location-based postings
SARVAR et al. The analysis of spatial distribution and positioning medical care by multi criterion multi phase decision making model: a case study of city Miandoaab
HOSSEINI et al. Comparative Analysis of Relative Advantages and Inequality of Employment between Urban Regions of Khorasan Razavi and Urban Regions of Iran
Chen et al. Fotowiki: distributed map enhancement service
Kim et al. Designing Region-specific information provided utilizing crowdsourcing service
Nazarian et al. An Investigation into the Functions of Small Cities in Urban System and Regional Development (The Case of the City of Nain)
Xiao Mina Art Village: A Year in Caochangdi