WO2014207941A1 - 判定装置、判定方法、及びプログラム - Google Patents
判定装置、判定方法、及びプログラム Download PDFInfo
- Publication number
- WO2014207941A1 WO2014207941A1 PCT/JP2013/067942 JP2013067942W WO2014207941A1 WO 2014207941 A1 WO2014207941 A1 WO 2014207941A1 JP 2013067942 W JP2013067942 W JP 2013067942W WO 2014207941 A1 WO2014207941 A1 WO 2014207941A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- link
- content
- area
- acquired
- uri
- Prior art date
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/955—Retrieval from the web using information identifiers, e.g. uniform resource locators [URL]
- G06F16/9566—URL specific, e.g. using aliases, detecting broken or misspelled links
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/958—Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/12—Use of codes for handling textual entities
- G06F40/134—Hyperlinking
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q30/00—Commerce
- G06Q30/06—Buying, selling or leasing transactions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q30/00—Commerce
- G06Q30/06—Buying, selling or leasing transactions
- G06Q30/0601—Electronic shopping [e-shopping]
- G06Q30/0603—Catalogue ordering
Definitions
- the present invention relates to a determination device, a determination method, and a program.
- a system that displays content such as a URL (Uniform Resource Locator) web page linked to the text or the photo has become widespread.
- a web catalog used in web shopping lists a plurality of products, and when a user selects a desired product code or photo, a product page (contents) containing information on the product is displayed.
- a product code and a photo are set in a clickable area, and a URL of a product page is linked to each clickable area.
- Patent Document 1 describes a technique for adding link information to texts, photos, etc. when converting data for creating printed matter such as magazines and catalogs into PDF data.
- the present invention has been made in view of the above problems, and a purpose thereof is a determination device that can easily determine whether correspondence between information included in a web page and a URL linked to the information is correct or incorrect. It is in providing a determination method and a program.
- a determination apparatus includes an information acquisition unit that acquires link source information obtained in or around a link area associated with a URI, and a storage unit that stores content Content acquisition means for acquiring content specified by the URI associated with the link area, the link source information acquired by the information acquisition means, and the content acquired by the content acquisition means And determining means for determining whether the association between the link area and the URI is correct or not.
- the determination device may further include a character recognition unit that recognizes the link source information acquired by the information acquisition unit as a character.
- the determination unit determines whether the link area and the URI are correctly associated based on the character recognized by the character recognition unit and the content acquired by the content acquisition unit. Also good.
- the determination unit may determine whether the correspondence is correct or not depending on whether or not the character recognized by the character recognition unit is included in the content acquired by the content acquisition unit.
- the determination unit determines whether the correspondence is determined based on whether or not a plurality of the characters recognized by the character recognition unit are included in the content acquired by the content acquisition unit. The correctness of the attachment may be determined.
- the determination device may further include a coordinate acquisition unit that acquires the coordinates of the link area in the page. Further, the information acquisition unit may acquire the link source information based on the coordinates of the link area acquired by the coordinate acquisition unit.
- the determination device may further include notification means for notifying an error message when the association is incorrect.
- the determination device sets the link area in a page, acquires the URI corresponding to the set link area from a table in which the URI is registered in advance, and the link area, the URI, It may further include generating means for associating.
- the URIs may be registered in correspondence with the arrangement order of the link areas.
- the determination method includes an information acquisition step for acquiring link source information obtained from a link area associated with a URI or the periphery of the link area, and a storage unit for storing content, in the link area. Based on the content acquisition step of acquiring the content specified by the associated URI, the link source information acquired by the information acquisition step, and the content acquired by the content acquisition step, the link A determination step of determining whether or not the correspondence between the area and the URI is correct.
- the program according to the present invention associates the link source information with the link area from the link area associated with the URI or the link area information obtained from the link area and the storage means for storing the content.
- the link area Is a program for causing a computer to function as a determination means for determining whether or not the correspondence between the URI and the URI is correct.
- This program may be stored in a computer-readable information storage medium such as a CD-ROM or DVD-ROM.
- the correctness / incorrectness of the association between the link area and the URL is determined based on the characters in the link area and the content corresponding to the URL associated with the link area. Therefore, it is possible to easily determine whether the correspondence between the information included in the web page and the URL to which the information is linked is correct.
- FIG. 6 is an operation flowchart of the determination apparatus according to the first embodiment. It is a figure which shows an example of a goods page. It is a figure which shows an example of an HTML file. It is a figure which shows an example of a web catalog and a goods page. It is a figure which shows an example of an HTML file. It is an operation
- FIG. 1 is a diagram showing an example of a web catalog.
- a web catalog web page
- the web catalog lists a plurality of items of information such as soy sauce, apples and bread.
- the product code, photo, text, and the like of each product are associated with the URL of a product page (content) on which detailed information about the corresponding product is posted.
- the product name, product code, and photograph of the product “soy sauce” are associated with the URL of the product page “soy sauce” shown in FIG.
- a product page of “soy sauce” see FIG.
- the determination apparatus for example, in the form of Internet shopping as described above, the product posted in the web catalog (see FIG. 1) and the URL of the product page (see FIG. 2) on which information on the product is posted. It is possible to easily determine the correctness of the association with.
- the determination process is executed based on the operation of the site operator.
- FIG. 3 is a hardware configuration diagram of the determination apparatus according to the present embodiment.
- the determination device 10 includes a communication unit 1, a CPU 2, a memory 3, and a storage unit 4.
- the hardware elements constituting the determination device 10 are connected to each other by a bus so as to be able to exchange data.
- the communication unit 1 transmits / receives information to / from the user terminal from the Internet, for example.
- the CPU 2 controls each part of the apparatus and executes various types of information processing.
- the memory 3 holds various programs and data.
- the memory 3 also has a work area for the CPU 2.
- the storage unit 4 includes a page DB 4a.
- the page DB 4a stores a plurality of HTML files corresponding to a plurality of web catalogs.
- the HTML file is created by the site operator. Specifically, the site operator obtains PDF format image data (catalog data) that is the basis of the web catalog, for example, from a printing company, and based on the obtained catalog data, a clickable area and a link corresponding thereto. An HTML file in which the destination URL is set is created. By displaying the HTML file and the image file specified in the HTML file with the web browser of the user terminal, the web catalog shown in FIG. 1 is displayed.
- the designation of the clickable area and the designation of the link destination may be designated by, for example, a JavaScript (registered trademark) file different from HTML.
- the linked product page associated with the product listed in the web catalog is created by the site operator and uploaded to the web server (storage means).
- the storage unit 4 may be connected to the communication unit 1, the CPU 2, and the memory 3 via the Internet.
- FIG. 4 is an example of an HTML file corresponding to the web catalog shown in FIG.
- FIG. 5 is an example of a clickable area (link area) in the web catalog shown in FIG.
- the HTML file includes an element (map data) for setting a clickable area and a URL associated with the clickable area.
- the URL “http://aaa.co.jp/soy sauce.html” of the product page of “soy sauce” is set in the rectangular link area at the coordinates (x1, x2, x3, x4), and the coordinates (
- the URL “http://aaa.co.jp/apple.html” of the product page of “apple” is set in the rectangular link area at the position of y1, y2, y3, y4), and the coordinates (z1, z2, z3) , Z4)
- the URL “http://aaa.co.jp/pan.html” of the product page “pan” is set in the rectangular link area at the position of z4).
- Each link area corresponds to a clickable area. As shown in FIG.
- each clickable area corresponds to a product code field
- the link area of “soy sauce” is clickable area 1
- the link area of “apple” is clickable area 2
- the link area of “pan” is clickable. This is indicated by area 3.
- the clickable area 4 will be described later.
- FIG. 6 is a functional block diagram of the determination apparatus 10.
- the determination apparatus 10 includes a page acquisition unit 11, a coordinate acquisition unit 12 (coordinate acquisition unit), an image acquisition unit 13 (information acquisition unit), a character recognition unit 14 (character recognition unit), and a URL acquisition unit. 15, a linked page acquisition unit 16 (content acquisition unit), a character determination unit 17 (determination unit), a notification unit 18 (notification unit), and a page generation unit 19 (generation unit).
- This program may be installed in the determination apparatus 10 from a computer-readable information storage medium such as a CD-ROM, DVD-ROM, or memory card, or may be downloaded from a communication network such as the Internet.
- the page acquisition unit 11 acquires an HTML file corresponding to the web catalog from the page DB 4a.
- the page acquisition unit 11 acquires, for example, an HTML file shown in FIG.
- the coordinate acquisition unit 12 acquires the coordinates of the clickable area from the HTML file acquired by the page acquisition unit 11.
- the coordinate acquisition unit 12 may, for example, from the map data of the HTML file shown in FIG. 4, the coordinates of the clickable area 1 (x1, x2, x3, x4), the coordinates of the clickable area 2 (y1, y2, y3, y4), or The coordinates (z1, z2, z3, z4) of the clickable area 3 are acquired.
- the image acquisition unit 13 acquires an image (area image) (link source information) in the clickable area corresponding to the coordinates acquired by the coordinate acquisition unit 12 in the image data (catalog data) of the web catalog shown in FIG. .
- the image acquisition unit 13 for example, the area image “ ⁇ 1234” of the clickable area 1 corresponding to the coordinates (x1, x2, x3, x4) and the area image of the clickable area 2 corresponding to the coordinates (y1, y2, y3, y4).
- the area image “ ⁇ 3456” of the clickable area 3 corresponding to “ ⁇ 2345” or the coordinates (z1, z2, z3, z4) is acquired.
- the image acquisition unit 13 When a predetermined mark (here, “ ⁇ ” (star mark)) is added to the catalog data, the image acquisition unit 13 recognizes the catalog data and displays a predetermined area including the mark as an area image. You may get as The image acquisition unit 13 acquires the text display as an area image when the clickable area is a text display field (clickable area 4 in FIG. 5), and acquires the photograph as an area image when the clickable area is a photo field. Note that the image acquisition unit 13 may acquire the peripheral image instead of the image in the clickable area. For example, for the clickable area 1 in FIG. 5, an area located in the posting area of the corresponding product may be acquired as an area image. If the size of the posting area of the corresponding product and the position of the clickable area in the posting area are known, the area in the posting area can be specified.
- the character recognition unit 14 performs character recognition on the area image acquired by the image acquisition unit 13.
- the character recognition unit 14 performs character recognition using, for example, an optical character recognition (OCR) method.
- OCR optical character recognition
- the character recognition unit 14 recognizes a product code when the clickable area is a product code column, and recognizes text when the clickable area is a text display column. If the character code can be acquired, the character recognition process may not be performed.
- the URL acquisition unit 15 acquires the URL of the product page corresponding to the coordinates acquired by the coordinate acquisition unit 12 from the HTML file acquired by the page acquisition unit 11. For example, the URL acquisition unit 15 obtains the URL “http://aaa.co.jp/soy sauce” corresponding to the coordinates (x1, x2, x3, x4) of the clickable area 1 from the map data of the HTML file shown in FIG. “html”, URL “http://aaa.co.jp/apple.html” corresponding to the coordinates (y1, y2, y3, y4) of the clickable area 2, or the coordinates (z1, z2, z3) of the clickable area 3 , Z4), the URL “http://aaa.co.jp/pan.html” is acquired.
- the link destination page acquisition unit 16 acquires the product page of the URL acquired by the URL acquisition unit 15 from the web server. For example, the linked page acquisition unit 16 receives a product page of URL “http://aaa.co.jp/soy sauce.html”, URL “http://aaa.co.jp” shown in FIG. A product page of “/apple.html” or a product page of URL “http://aaa.co.jp/bread.html” is acquired.
- the character determination unit 17 determines whether or not the product page acquired by the linked page acquisition unit 16 includes a character recognized by the character recognition unit 14. Note that the character determination unit 17 may determine whether or not the product page includes a character that matches the recognized character. If there are a plurality of recognized characters, the number of characters greater than or equal to a predetermined ratio is determined. You may determine whether it is contained in the page. When the clickable area includes a photo column, the character determination unit 17 further calculates the similarity between the area image acquired by the image acquisition unit 13 and the image of the photo posted on the product page. The determination may be performed based on the similarity. In this case, the character determination unit 17 functions as an image processing determination unit, and calculates similarity by extracting and comparing feature points of images, for example.
- the notification unit 18 notifies a message based on the determination result of the character determination unit 17. Specifically, when the recognized character is not included in the product page, the notification unit 18 notifies the error message because the link association is incorrect. When the recognized character is included in the product page, the notification unit 18 may notify a message that the link association is correct.
- the page generation unit 19 performs processing of generating an HTML file (see FIG. 4) by associating the coordinates of the clickable area with the URL of the product page.
- the HTML file generated by the page generation unit 19 is stored in the page DB 4a.
- FIG. 7 is an operation flowchart of the determination apparatus 10 according to the first embodiment.
- the web catalog shown in FIG. 1 and the product page shown in FIG. 2 will be described as examples.
- the site operator creates an HTML file (see FIG. 4) corresponding to the web catalog shown in FIG. 1, and the created HTML file is stored in the page DB 4a. Further, it is assumed that a plurality of product pages shown in FIG. 2 created by the site operator are uploaded to the web server. In the web catalog shown in FIG. 1, the product code column of each product is set in the clickable areas 1 to 3 as shown in FIG. The site operator creates an HTML file based on a list in which the product code of each product and the URL of the product page are associated in advance.
- the site operator selects an inspection mode for inspecting the correctness of the link association in the determination apparatus 10. As a result, the following inspection process is executed.
- the page acquisition unit 11 acquires the HTML file shown in FIG. 4 from the page DB 4a (S101).
- the coordinate acquisition unit 12 acquires the coordinates (x1, x2, x3, x4) of the clickable area 1 (see FIG. 5) from the first data of the map data in the HTML file acquired in S101 ( S102).
- the image acquisition unit 13 is an image (area image) of the clickable area 1 corresponding to the coordinates (x1, x2, x3, x4) acquired in S102. 1) is acquired (S103).
- “ ⁇ 1234” in the product code column is acquired as the area image 1.
- the character recognition unit 14 performs character recognition on the area image 1 acquired in S103 (S104).
- “ ⁇ 1234” in the area image 1 is recognized as a character.
- the URL acquisition unit 15 includes the URL “http://aaa.co.jp/soy sauce.com” corresponding to the coordinates (x1, x2, x3, x4) acquired in S102 in the HTML file acquired in S101. html "is acquired (S105).
- the linked page acquisition unit 16 acquires the product page (see FIG. 2) of the URL “http://aaa.co.jp/soy sauce.html” acquired in S105 from the web server (S106). .
- the character determination unit 17 determines whether or not the character “ ⁇ 1234” recognized in S104 is included in the product page of “soy sauce” acquired in S106 (S107). In the example shown in FIG. 2, since the product code “ ⁇ 1234” is included in the product page of “soy sauce”, it is determined that the link association is correct, and the process proceeds to S109.
- FIG. 9 shows an HTML file corresponding to the product page shown in FIG. As shown in the figure, an incorrect URL “http://aaa.co.jp/dressing.html” is set in the first data of the map data.
- S109 it is determined whether or not there is a clickable area (uninspected area) that has not been subjected to the inspection process. Specifically, the determination process is performed with reference to the map data in the HTML file acquired in S101. Thereby, the inspection process can be executed for all the clickable areas.
- the process returns to S102, and the coordinate acquisition unit 12 acquires the coordinates of the next clickable area from the map data of the HTML file acquired in S101.
- the coordinate acquisition unit 12 acquires the coordinates (b1, b2, b3, b4) of the clickable area 2 (S102). Thereafter, the same processing as described above is performed.
- the determination apparatus 10 ends the inspection process.
- the determination device 10 it is possible to easily determine whether the correspondence between the product information of the web catalog and the URL of the product page linked to the product information is correct or incorrect. Even if the clickable area and the URL are correctly associated with each other, there is a possibility that the product page (content) specified by the URL is incorrect and does not correspond to the information on the clickable area. An error can be detected.
- Example 2 Although the clickable area is set in the product code column in the first embodiment, the present invention is not limited to this. In the second embodiment, the clickable area is set in the text display field (clickable area 4 in FIG. 5). Below, it demonstrates centering on difference with Example 1.
- FIG. FIG. 10 shows information on the product “A soy sauce” of the product code “ ⁇ 5678” posted in the web catalog shown in FIG. 1 and the product page of the product “A soy sauce”.
- the coordinate acquisition unit 12 acquires the coordinates (s1, s2, s3, s4) (see FIG. 5) of the clickable area 4 from the map data in the HTML file (see FIG. 11) acquired by the page acquisition unit 11. For the clickable areas 1 to 3, the inspection process shown in the first embodiment may be executed.
- the image acquisition unit 13 is an image (area image) of the clickable area 4 corresponding to the coordinates (s1, s2, s3, s4) acquired by the coordinate acquisition unit 12 in the catalog data (image data) of the web catalog shown in FIG. 4) is acquired.
- a text display field is acquired as the area image 4.
- the character recognition unit 14 performs character recognition on the area image 4 acquired by the image acquisition unit 13.
- the text of the area image 4 is recognized as a character (word).
- the character recognizing unit 14 performs, for example, “soy sauce”, “100 selections”, “A soy sauce”, “500 ml”, “bonito”, “mirin”, “mellow”, “taste” by morphological analysis. recognize.
- the URL acquisition unit 15 includes, in the HTML file acquired by the page acquisition unit 11 (see FIG. 11), the URL “http: // // corresponding to the coordinates (s1, s2, s3, s4) acquired by the coordinate acquisition unit 12. “aaa.co.jp/A soy sauce.html”.
- the linked page acquisition unit 16 acquires the product page (see FIG. 10) of the URL “http://aaa.co.jp/A soy sauce.html” acquired by the URL acquisition unit 15 from the web server.
- the character determination unit 17 includes a product page of “A soy sauce” acquired by the linked page acquisition unit 16 that includes more than a predetermined percentage of characters among the plurality of characters recognized by the character recognition unit 14. Determine whether or not. For example, it is determined whether or not 80% (seven words) or more of the above eight words are included in the product page “A soy sauce”. In the example of FIG. 10, the product page “A soy sauce” does not include “100 selections”, but includes other 7 words. Therefore, it is determined that the link association is correct.
- the photo column of the product may be added as a clickable area.
- the determination process using image recognition may be executed when it is determined that there is an error in the determination process using character recognition (NO in S107).
- the determination device 10 can determine the correctness / incorrectness of link association for various clickable areas. For one product, all of the product code field, text display field, and photo field may be set in the clickable area. In this case, the determination process corresponding to the clickable area among the above-described determination processes may be performed for each clickable area.
- this invention demonstrated the web catalog produced using PDF, this invention is applicable to the web page in general containing a link.
- FIG. 12 is an operation flowchart of the page generation unit 19.
- the page generation unit 19 sets a clickable area and a link destination URL corresponding to the clickable area based on the image data in the PDF format that is the basis of the web catalog, and performs processing for creating an HTML file. That is, the page generation unit 19 automatically performs the setting of the clickable area and the association of the link destination URL.
- image data (catalog data) of the web catalog shown in FIG. 1 is taken as an example.
- the page generation unit 19 extracts a mark (keyword) for specifying the clickable area from the catalog data (S202). For example, “ ⁇ ” is added as a mark.
- the page generation unit 19 arranges the product codes attached to the marks in the arrangement order based on the coordinates of the marks (S203).
- FIG. 13 shows a product code list table arranged in the order of arrangement.
- the page generation unit 19 determines the coordinates (x1, x2, x3, x4) of the first clickable area 1 based on the coordinates of the mark of the first product code (S204).
- the page generation unit 19 acquires the first URL “http://aaa.co.jp/soy sauce.html” from the URL list table (see FIG. 14) created by the site operator (see FIG. 14). S205).
- the page generation unit 19 uses the coordinates (x1, x2, x3, x4) of the first clickable area 1 determined in S204 and the first URL “http://aaa.co” acquired in S205. .Jp / soy sauce.html "is associated and registered in the HTML file (see FIG. 15) (S206).
- the page generation unit 19 determines the coordinates (y1, y2, y3, y4) of the second clickable area 2 based on the coordinates of the mark of the second product code. Thereafter, the same processing as described above is performed, and the coordinates (y1, y2, y3, y4) of the second clickable area 2 and the second URL “http://aaa.co.jp/apple” are stored in the HTML file. .Html "is registered in association with each other.
- the page generation unit 19 repeats the above process for all clickable areas, and the coordinates of each clickable area and each URL are registered in the HTML file in association with each other. Thereby, the HTML file shown in FIG. 4 is generated.
- the correspondence between the coordinates of the clickable area and each URL can be automatically performed, so that errors in the association can be reduced.
- the link destination page acquisition unit 16 converts the URL acquired by the URL acquisition unit 15 into a local address, and acquires a product page based on the converted address.
- the link destination page acquisition unit 16 converts the URL into “local storage path + file name”. For example, “URL: http://aaa.co.jp/soy sauce.html” is converted to “C: ⁇ temp ⁇ soy sauce.html”.
- the link destination page DB 4 b in the storage unit 4 of the determination apparatus 10 can be used. Thereby, the link destination page acquisition part 16 can acquire a goods page reliably.
- a web catalog having only one page has been described.
- link association and determination of link association errors can be performed by applying the present invention. Can be performed efficiently.
- the present invention is not limited to the web page. For example, it may be displayed on a smartphone or tablet application and applied to a screen (page) including a link.
- the link destination is specified by a URI (Uniform Resource Identifier).
- 10 determination device 1 communication unit, 2 CPU, 3 memory, 4 storage unit, 4a page DB, 4b linked page DB, 11 page acquisition unit, 12 coordinate acquisition unit, 13 image acquisition unit, 14 character recognition unit, 15 URL Acquisition unit, 16 linked page acquisition unit, 17 character determination unit, 18 notification unit, 19 page generation unit.
Abstract
Description
図7は、実施例1に係る判定装置10の動作フロー図である。ここでは、図1に示すウェブカタログと、図2に示す商品ページを例に挙げて説明する。
実施例1ではクリッカブルエリアが商品コード欄に設定されているが、本発明はこれに限定されない。実施例2では、クリッカブルエリアがテキスト表示欄(図5のクリッカブルエリア4)に設定されている。以下では、実施例1との相違点を中心に説明する。図10は、図1に示すウェブカタログに掲載されている商品コード「★5678」の商品「Aしょうゆ」の情報と、該商品「Aしょうゆ」の商品ページを示している。
ページ生成部19の詳細について説明する。図12は、ページ生成部19の動作フロー図である。
図4に示すHTMLファイルでは、クリッカブルエリアに対応付けられる商品ページはURLで特定されている。しかし、商品ページがウェブサーバにアップロードされる前の段階では、商品ページをURLで特定することができない。そこで、リンク先ページ取得部16は、URL取得部15により取得されたURLをローカルアドレスに変換し、変換されたアドレスに基づいて商品ページを取得する。
Claims (10)
- URIが対応付けられたリンクエリア内又は該リンクエリア周辺から得られるリンク元情報を取得する情報取得手段と、
コンテンツを記憶する記憶手段から、前記リンクエリアに対応付けられた前記URIによって特定されたコンテンツを取得するコンテンツ取得手段と、
前記情報取得手段により取得された前記リンク元情報と、前記コンテンツ取得手段により取得された前記コンテンツとに基づいて、前記リンクエリアと前記URIとの対応付けの正誤を判定する判定手段と、
を含むことを特徴とする判定装置。 - 前記情報取得手段により取得された前記リンク元情報を文字として認識する文字認識手段をさらに含み、
前記判定手段は、前記文字認識手段により認識された前記文字と、前記コンテンツ取得手段により取得された前記コンテンツとに基づいて、前記リンクエリアと前記URIとの対応付けの正誤を判定することを特徴とする請求項1に記載の判定装置。 - 前記判定手段は、前記文字認識手段により認識された前記文字が、前記コンテンツ取得手段により取得された前記コンテンツに含まれるか否かにより、前記対応付けの正誤を判定することを特徴とする請求項2に記載の判定装置。
- 前記判定手段は、前記文字認識手段により認識された複数の前記文字のうち所定割合以上の数の文字が、前記コンテンツ取得手段により取得された前記コンテンツに含まれるか否かにより、前記対応付けの正誤を判定することを特徴とする請求項2に記載の判定装置。
- ページ内における前記リンクエリアの座標を取得する座標取得手段をさらに含み、
前記情報取得手段は、前記座標取得手段により取得された前記リンクエリアの座標に基づいて、前記リンク元情報を取得することを特徴とする請求項1から4の何れか1項に記載の判定装置。 - 前記対応付けが誤っている場合にエラーメッセージを報知する報知手段をさらに含んでいることを特徴とする請求項1から5の何れか1項に記載の判定装置。
- ページ内の前記リンクエリアを設定するとともに、予め前記URIが登録されたテーブルから、設定された前記リンクエリアに対応する前記URIを取得し、前記リンクエリアと前記URIとを対応付ける生成手段をさらに含んでいることを特徴とする請求項1から6の何れか1項に記載の判定装置。
- 前記テーブルにおいて、前記URIは、前記リンクエリアの配置順に対応して登録されていることを特徴とする請求項7に記載の判定装置。
- URIが対応付けられたリンクエリア内又は該リンクエリア周辺から得られるリンク元情報を取得する情報取得ステップと、
コンテンツを記憶する記憶手段から、前記リンクエリアに対応付けられた前記URIによって特定されたコンテンツを取得するコンテンツ取得ステップと、
前記情報取得ステップにより取得された前記リンク元情報と、前記コンテンツ取得ステップにより取得された前記コンテンツとに基づいて、前記リンクエリアと前記URIとの対応付けの正誤を判定する判定ステップと、
を含むことを特徴とする判定方法。 - URIが対応付けられたリンクエリア内又は該リンクエリア周辺から得られるリンク元情報を取得する情報取得手段、
コンテンツを記憶する記憶手段から、前記リンクエリアに対応付けられた前記URIによって特定されたコンテンツを取得するコンテンツ取得手段、及び、
前記情報取得手段により取得された前記リンク元情報と、前記コンテンツ取得手段により取得された前記コンテンツとに基づいて、前記リンクエリアと前記URIとの対応付けの正誤を判定する判定手段、
としてコンピュータを機能させるためのプログラム。
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/JP2013/067942 WO2014207941A1 (ja) | 2013-06-28 | 2013-06-28 | 判定装置、判定方法、及びプログラム |
JP2015523817A JP5886477B2 (ja) | 2013-06-28 | 2013-06-28 | 判定装置、判定方法、及びプログラム |
US14/901,081 US10585965B2 (en) | 2013-06-28 | 2013-06-28 | Determination device, determination method, and program |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/JP2013/067942 WO2014207941A1 (ja) | 2013-06-28 | 2013-06-28 | 判定装置、判定方法、及びプログラム |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2014207941A1 true WO2014207941A1 (ja) | 2014-12-31 |
Family
ID=52141331
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2013/067942 WO2014207941A1 (ja) | 2013-06-28 | 2013-06-28 | 判定装置、判定方法、及びプログラム |
Country Status (3)
Country | Link |
---|---|
US (1) | US10585965B2 (ja) |
JP (1) | JP5886477B2 (ja) |
WO (1) | WO2014207941A1 (ja) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP6144861B1 (ja) * | 2016-03-10 | 2017-06-07 | 楽天株式会社 | チェック装置、チェック方法、プログラム、ならびに、非一時的なコンピュータ読取可能な情報記録媒体 |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9558510B2 (en) * | 2009-02-24 | 2017-01-31 | Ebay Inc. | System and method to create listings using image and voice recognition |
CN107589893A (zh) * | 2017-09-21 | 2018-01-16 | 上海联影医疗科技有限公司 | 一种数据加载方法、装置及终端 |
US11200294B2 (en) * | 2019-03-20 | 2021-12-14 | Hisense Visual Technology Co., Ltd. | Page updating method and display device |
US11663193B2 (en) * | 2020-12-17 | 2023-05-30 | International Business Machines Corporation | Identifying incorrect links |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH0944499A (ja) * | 1995-08-03 | 1997-02-14 | Matsushita Electric Ind Co Ltd | マルチメディア文書構造編集装置 |
JP2004139304A (ja) * | 2002-10-17 | 2004-05-13 | Nec Corp | ハイパーテキスト検査装置および方法並びにプログラム |
JP2007188356A (ja) * | 2006-01-13 | 2007-07-26 | Internatl Business Mach Corp <Ibm> | 不正ハイパーリンク検出装置及びその方法 |
Family Cites Families (25)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5544352A (en) * | 1993-06-14 | 1996-08-06 | Libertech, Inc. | Method and apparatus for indexing, searching and displaying data |
US6408331B1 (en) * | 1995-07-27 | 2002-06-18 | Digimarc Corporation | Computer linking methods using encoded graphics |
JP3768743B2 (ja) * | 1999-09-20 | 2006-04-19 | 株式会社東芝 | ドキュメント画像処理装置及びドキュメント画像処理方法 |
US6583792B1 (en) * | 1999-11-09 | 2003-06-24 | Newag Digital, Llc | System and method for accurately displaying superimposed images |
US6601066B1 (en) * | 1999-12-17 | 2003-07-29 | General Electric Company | Method and system for verifying hyperlinks |
US7600183B2 (en) * | 2000-06-16 | 2009-10-06 | Olive Software Inc. | System and method for data publication through web pages |
WO2003001413A1 (en) * | 2001-06-22 | 2003-01-03 | Nosa Omoigui | System and method for knowledge retrieval, management, delivery and presentation |
US7182462B2 (en) * | 2001-12-26 | 2007-02-27 | Infocus Corporation | System and method for updating an image display device from a remote location |
US7257598B2 (en) * | 2002-12-19 | 2007-08-14 | Nokia Corporation | System and method for generating descriptive link names |
JP2004220193A (ja) * | 2003-01-10 | 2004-08-05 | Ricoh Co Ltd | Htmlリンク検査システム |
JP4158567B2 (ja) | 2003-03-20 | 2008-10-01 | 凸版印刷株式会社 | 付加情報付加方法及び付加情報付加装置、並びに付加情報付加プログラム |
JP2006085234A (ja) * | 2004-09-14 | 2006-03-30 | Fuji Xerox Co Ltd | 電子文書作成装置、電子文書作成方法及び電子文書作成プログラム |
US7720436B2 (en) * | 2006-01-09 | 2010-05-18 | Nokia Corporation | Displaying network objects in mobile devices based on geolocation |
US9892196B2 (en) * | 2006-04-21 | 2018-02-13 | Excalibur Ip, Llc | Method and system for entering search queries |
US8489987B2 (en) * | 2006-07-31 | 2013-07-16 | Ricoh Co., Ltd. | Monitoring and analyzing creation and usage of visual content using image and hotspot interaction |
US20080172738A1 (en) * | 2007-01-11 | 2008-07-17 | Cary Lee Bates | Method for Detecting and Remediating Misleading Hyperlinks |
US9665543B2 (en) * | 2007-03-21 | 2017-05-30 | International Business Machines Corporation | System and method for reference validation in word processor documents |
JP4459250B2 (ja) * | 2007-04-20 | 2010-04-28 | 富士通株式会社 | 送信方法、画像送信システム、送信装置及びプログラム |
US8209599B2 (en) * | 2009-04-23 | 2012-06-26 | Xerox Corporation | Method and system for handling references in markup language documents |
JP5575511B2 (ja) * | 2009-07-16 | 2014-08-20 | 富士フイルム株式会社 | ウェブサイト閲覧システム、サーバ及びクライアント端末 |
US8438059B2 (en) * | 2010-01-28 | 2013-05-07 | Mypoints.Com Inc. | Dynamic e-mail |
JP5790345B2 (ja) * | 2011-09-07 | 2015-10-07 | 株式会社リコー | 画像処理装置、画像処理方法、プログラムおよび画像処理システム |
US20140136508A1 (en) * | 2012-11-09 | 2014-05-15 | Palo Alto Research Center Incorporated | Computer-Implemented System And Method For Providing Website Navigation Recommendations |
US9305227B1 (en) * | 2013-12-23 | 2016-04-05 | Amazon Technologies, Inc. | Hybrid optical character recognition |
US20160110319A1 (en) * | 2014-10-21 | 2016-04-21 | Nirnay Bansal | URI Font in print material processing method and apparatus thereof |
-
2013
- 2013-06-28 US US14/901,081 patent/US10585965B2/en active Active
- 2013-06-28 JP JP2015523817A patent/JP5886477B2/ja active Active
- 2013-06-28 WO PCT/JP2013/067942 patent/WO2014207941A1/ja active Application Filing
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH0944499A (ja) * | 1995-08-03 | 1997-02-14 | Matsushita Electric Ind Co Ltd | マルチメディア文書構造編集装置 |
JP2004139304A (ja) * | 2002-10-17 | 2004-05-13 | Nec Corp | ハイパーテキスト検査装置および方法並びにプログラム |
JP2007188356A (ja) * | 2006-01-13 | 2007-07-26 | Internatl Business Mach Corp <Ibm> | 不正ハイパーリンク検出装置及びその方法 |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP6144861B1 (ja) * | 2016-03-10 | 2017-06-07 | 楽天株式会社 | チェック装置、チェック方法、プログラム、ならびに、非一時的なコンピュータ読取可能な情報記録媒体 |
WO2017154169A1 (ja) * | 2016-03-10 | 2017-09-14 | 楽天株式会社 | チェック装置、チェック方法、プログラム、ならびに、非一時的なコンピュータ読取可能な情報記録媒体 |
Also Published As
Publication number | Publication date |
---|---|
JPWO2014207941A1 (ja) | 2017-02-23 |
JP5886477B2 (ja) | 2016-03-16 |
US20160154893A1 (en) | 2016-06-02 |
US10585965B2 (en) | 2020-03-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP5886477B2 (ja) | 判定装置、判定方法、及びプログラム | |
JP2018026168A (ja) | 図形コードを通じてネットワーク情報にアクセスする方法、クライアントデバイス、およびサーバ | |
KR20210100222A (ko) | 복수의 장치 상의 정보의 제공 | |
JP7018714B2 (ja) | モータ、シールおよび情報提供装置 | |
US10606832B2 (en) | Search system, search method, and program | |
JP5767413B1 (ja) | 情報処理システム、情報処理方法、および情報処理プログラム | |
JP5753642B1 (ja) | 入力装置、フォーム入力方法、記録媒体およびプログラム | |
US9213502B2 (en) | Information processing apparatus, information processing method, and non-transitory computer readable medium for recording printing information | |
JP2008186135A (ja) | 原材料原産地適正検査システム | |
JP5471411B2 (ja) | 電子チラシ検索装置および電子チラシ検索システム | |
WO2016095725A1 (zh) | 一种条形码扫描方法及装置 | |
JP2018128883A (ja) | 情報処理装置、方法およびプログラム | |
CN103164411A (zh) | 浏览器的网页加载方法 | |
JP2017102779A (ja) | 管理情報の印刷方法 | |
JP2010266924A (ja) | パッケージ作成支援装置、パッケージ作成支援方法、及びプログラム | |
JP2004078436A (ja) | 入力支援装置 | |
JP2008112377A (ja) | 画像形成システム、画像形成装置、及び画像形成プログラム | |
JP6144861B1 (ja) | チェック装置、チェック方法、プログラム、ならびに、非一時的なコンピュータ読取可能な情報記録媒体 | |
WO2024054606A1 (en) | System and method for triggering countdown on digital user interface | |
JP6056580B2 (ja) | レイアウト管理装置およびコンピュータプログラム | |
TW202145815A (zh) | 資料提供方法 | |
JP2023157558A (ja) | アクセス情報取得方法、コード分割方法及びアクセス情報取得システム | |
JP2007193643A (ja) | 案内・注文書製作装置及び方法 | |
WO2016113887A1 (ja) | 情報処理装置、情報処理方法および情報処理プログラム | |
US20150309975A1 (en) | Non-transitory computer readable medium, information processing apparatus, and information processing method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 13887655 Country of ref document: EP Kind code of ref document: A1 |
|
ENP | Entry into the national phase |
Ref document number: 2015523817 Country of ref document: JP Kind code of ref document: A |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
WWE | Wipo information: entry into national phase |
Ref document number: 14901081 Country of ref document: US |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 13887655 Country of ref document: EP Kind code of ref document: A1 |