WO2020170323A1 - Structured data generation system and program - Google Patents

Structured data generation system and program

Info

Publication number
WO2020170323A1
WO2020170323A1 PCT/JP2019/006004 JP2019006004W WO2020170323A1 WO 2020170323 A1 WO2020170323 A1 WO 2020170323A1 JP 2019006004 W JP2019006004 W JP 2019006004W WO 2020170323 A1 WO2020170323 A1 WO 2020170323A1
Authority
WO
WIPO (PCT)
Prior art keywords
structured data
data
web page
generation system
structured
Prior art date
Application number
PCT/JP2019/006004
Other languages
French (fr)
Japanese (ja)
Inventor
豊志 永田
浩幸 川口
茂治 ▲高▼野
Original Assignee
株式会社ショーケース
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 株式会社ショーケース
Priority to PCT/JP2019/006004
Publication of WO2020170323A1

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/80 Information retrieval; Database structures therefor; File system structures therefor of semi-structured data, e.g. markup language structured data such as SGML, XML or HTML
    • G06F16/81 Indexing, e.g. XML tags; Data structures therefor; Storage structures

Definitions

  • The present invention relates to a system and a program for generating structured data from various kinds of data that form at least a part of a web page, and for generating a web page using such structured data.
  • When a web page is uploaded to a web server and published on the Internet, it becomes a target of search by search engines. A third party using the Internet then searches for and browses a desired web page with a search engine.
  • A search engine, particularly a robot-type search engine, includes a crawler that searches web pages, and the crawler performs a task called crawling, in which it collects information on the text and images that make up each web page.
  • A search engine builds a database from the results of the crawler's crawling and uses it to display search results. Therefore, for a web page published on the Internet to be browsed by many people, it is important to construct the page in a form that is included in the results of the crawler's crawling.
  • Search engines and crawlers, however, merely recognize HTML-formatted text data and the like as symbols.
  • A technique called structured data is known for making a web page more likely to be collected by a crawler. It adds one or more items of metadata indicating semantic content to the text data or image data that make up the web page, for each unit of meaning the data conveys, so that the search engine and the crawler can easily interpret the contents of the text and images.
  • Conventionally, a method of constructing a web page using structured data and publishing it on the Internet is known (for example, see Patent Document 1).
  • However, with the approach of Patent Document 1, there is a problem that it is difficult for a person who publishes a web page on the Internet to have a search engine efficiently search and collect information on the page so that many people browse it.
  • The present invention has been made in view of the above problem, and its object is to provide a structured data generation system that makes it possible to easily create a web page using structured data and to easily create and publish a web page that many people can readily browse.
  • A structured data generation system for generating structured data by adding additional data to various data for forming at least a part of a web page is characterized by comprising: various data input processing means for inputting the various data; structured data generation means for automatically generating, using a predetermined generation criterion, structured data including the input various data; and structured data recording means for recording the generated structured data.
  • When a request to call the structured data is issued from a web page, structured data transmitting means transmits the structured data recorded in the structured data recording means to the web page that issued the call.
  • The structured data generation means configures the structured data to be generated as information that a crawler can acquire when the crawler, which acquires web page information, crawls the destination web page to collect information.
  • The structured data generation means issues a unique tag as the additional data for each item of structured data. When at least a part of the structured data is modified after the structured data has been generated, the structured data generation means issues the same tag as the unique tag and adds the issued tag to the modified various data to generate the modified structured data.
  • The tag is a script tag that serves as information for dynamically expanding, in the web page, the various data forming the structured data when the structured data is used for the web page.
  • The invention according to claim 6 is characterized in that, in addition to the configuration according to any one of claims 1 to 5, the structured data includes job offer information as the various data.
  • The invention according to claim 7 is a computer-readable program that causes a computer to function as the structured data generation system according to any one of claims 1 to 6.
  • Various data forming at least a part of a web page is input, and structured data including the input data is automatically generated using a predetermined generation criterion and recorded.
  • FIG. 1 is a network configuration diagram and a functional block diagram conceptually showing the configuration of a structured data providing system 1A according to this embodiment.
  • The structured data providing system 1A includes a structured data generation system 1 according to the present invention, an input terminal 2, a Web server 3, a search engine 4, and a display terminal 5, all of which are communicably connected via the Internet 6.
  • The structured data generation system 1, the input terminal 2, the Web server 3, the search engine 4, and the display terminal 5 are all computer devices or computer systems. Each has arithmetic means such as a CPU and storage means such as RAM, ROM, and EEPROM, and performs various processes by executing programs and the like.
  • The structured data generation system 1 is a server system having functions such as a Web server and a database server, and also has a function of generating structured data.
  • As functional means formed by various hardware and the execution of programs, the structured data generation system 1 includes a data input processing unit 11 as the "various data input processing means", a structured data generation unit 12 as the "structured data generation means", and a structured data transmission unit 13 as the "structured data transmitting means".
  • The structured data generation system 1 also includes a server 14 as the "structured data recording means".
  • The data input processing unit 11 displays the input screen 23 on the display unit 22 of the input terminal 2 via the Internet 6, and acquires, via the Internet 6, the "various data" entered on the input screen 23 through operation by the creator 100 (described later).
  • The "various data" acquired by the data input processing unit 11 may be, for example, text data or image data displayed on the web page 33 (described later), but is not limited to this. It may be any data, including data that is not visibly displayed, such as text or image data written on the web page in white or transparent characters so that it is not visible at first glance, or text or image data that forms part of the web page 33 but is not displayed on it. For simplicity of explanation, the "various data" are assumed below to be text data and image data displayed on the web page 33.
  • The structured data generation unit 12 generates structured data 16 (described later) using the text data and image data acquired by the data input processing unit 11. This structured data 16 is used in generating the web page 33 (described later).
  • The structured data generation unit 12 generates the structured data 16 by adding predetermined data to each item of data entered in the input fields 231, 232, 233 (described later) of the input screen 23 displayed on the input terminal 2.
  • For example, if the input field 231 is an input field for the job type being offered, then based on the job type entered in the input field 231, such as the text "transportation industry", structured data 16 such as "title": "Transportation" is generated.
  • The data to be added does not necessarily have to be determined in advance; it may be changed according to various conditions, or arbitrary data may be added.
  • When generating the structured data 16, the structured data generation unit 12 desirably has functions that help recognize the semantic content of the text data or image data: for example, a function of extracting words from a sentence, a function of extracting synonyms and antonyms of an extracted word from a database in which many words are recorded and associating them with it, a function of recognizing a person or object included in an image, and a function of extracting a word indicating the recognized person or object from a database in which many words are recorded and associating it with the image.
  • The structured data transmission unit 13 transmits the structured data generated by the structured data generation unit 12 to the Web server 3 or the like.
  • The server 14 has various server functions, such as a database server for recording various data and a web server that transmits data requested by the Web server 3 or the like, via the Internet 6, to the Web server 3 or the like that made the request.
  • The server 14 records additional information 15 as "additional data" and structured data 16.
  • The additional information 15 is text data or the like that is added to the original text data or image data when the structured data 16 is generated from the text data or image data acquired by the data input processing unit 11. Specifically, it is, for example, other text data including a predetermined tag (such as the tag 161 described later).
  • The input terminal 2 is a terminal such as a personal computer, a smartphone, or a tablet used by the creator 100 who creates the web page 33 (described later).
  • The input terminal 2 includes an operation unit 21 such as a keyboard and a mouse, and a display unit 22 such as an LCD (liquid crystal display) that displays characters and images. The operation unit 21 and the display unit 22 may be integrated in the form of a touch panel.
  • An input screen 23 is displayed on the display unit 22 of the input terminal 2.
  • The input screen 23 is displayed on the display unit 22 through data transmitted by the data input processing unit 11 of the structured data generation system 1, and is the screen the creator 100 uses to generate the Web page 33 (described later).
  • The input screen 23 is provided with a plurality of input fields 231, 232, 233 for entering text data and image data by operating the operation unit 21. Specifically, for example, the input fields 231, 232, 233 can be configured as a "title input field", a "text input field", and an "image data input field", corresponding to items such as the "title", "text", and "attached image" on the Web page 33.
  • An execute button 234 is provided on the input screen 23.
  • When the execute button 234 is clicked using the operation unit 21, the text data and image data entered in the input fields 231, 232, 233 are transmitted to the structured data generation system 1.
  • The Web server 3 is a server computer, a server system, or the like.
  • The Web server 3 includes a web page generation unit 31 and a database 32.
  • The web page generation unit 31 is a functional unit formed by the hardware and the program of the Web server 3.
  • The web page generation unit 31 generates a web page 33 that can be viewed by a terminal such as the display terminal 5 via a network such as the Internet 6.
  • The web page generation unit 31 generates the web page 33 based on operations from the input terminal 2 of the creator 100 or from an operation terminal (not shown) of an administrator (not shown) of the structured data generation system 1 connected to the Internet 6.
  • In the following, the web page generation unit 31 is described as generating the Web page 33 based on the operation of the creator 100.
  • The database 32 records the web page 33 generated by the web page generation unit 31. Further, the Web server 3 has a function of transmitting the Web page 33 recorded in a database (not shown) via the Internet 6 in response to a request from the display terminal 5 or the like.
  • The Web page 33 recorded in the database 32 of the Web server 3 is described in a markup language suitable for publishing information on the Internet 6, such as HTML or XML (Extensible Markup Language). Based on "various data" such as text data (character information written in a language such as Japanese or English) and image data (including still image data and moving image data), it displays characters and images as content on the Web browser 53 displayed on the display terminal 5.
  • The Web page 33 may be described in any markup language other than HTML or XML, or in a description format other than a markup language, as long as it can supply content such as characters and images via the Internet 6.
  • The display terminal 5 may also output content other than characters and images, such as audio.
  • In this embodiment, the web page generation unit 31 is provided in the Web server 3. However, the present invention is not limited to this, and the web page generation unit 31 may be provided somewhere other than the Web server 3, for example in the structured data generation system 1 or the input terminal 2.
  • In that case, the web page 33 generated by the web page generation unit 31 provided outside the Web server 3 is transmitted to the Web server 3 via the Internet 6 or the like, recorded in the database 32 of the Web server 3, and transmitted from the Web server 3 to the display terminal 5 or the like.
  • The search engine 4 is a system having a function of searching various information (web pages, websites, image files, net news, etc.) existing on the Internet 6 or on a closed network such as a LAN.
  • When the viewer 200 performs a search such as a keyword search using the display terminal 5, the search engine 4 provides the display terminal 5, as a search result, with information on one or more candidate web pages (including the Web page 33) from among the plurality of web pages that make up the hypertext system (WWW: World Wide Web) on the Internet 6. (Hereinafter, "one or more web pages (including the Web page 33)" is simply referred to as "Web page 33" unless a distinction is needed.)
  • The search engine 4 of this embodiment is a robot-type search engine.
  • The search engine 4 includes a crawler 41 and a database 42.
  • The crawler 41 is a functional unit formed by the hardware and programs of the search engine 4.
  • The crawler 41 has a function of automatically collecting information, such as the web page 33, that makes up the hypertext system on the Internet 6 and recording the collected data in the database 42.
  • The search engine 4 ranks the pages registered in the database based on the search keyword input by the user and, using this ranking, transmits a search result to the display terminal 5 in response to the search request received from the display terminal 5.
  • The search engine 4 has a function of displaying information specialized for a specific item together with the search result, based on the result of crawling by the crawler 41. For example, when a viewer 200 seeking job offer information accesses the search engine 4 using the web browser 53 (described later) of the display terminal 5 and enters a keyword related to web work as a search word, the search engine 4 collectively displays on the Web browser 53 the information of a plurality of recruitment sites recorded in the database 42 as a result of crawling, so that job offers likely to match the wishes of each viewer 200 can be suggested.
  • The search engine 4 is not limited to such job information. For various search words input from the display terminal 5, it can likewise collectively display on the Web browser 53 information from a plurality of sites for various kinds of items (for example, hotel accommodation information, product information for books and daily necessities, concert information, and movie information).
  • The display terminal 5 is a terminal such as a personal computer, a smartphone, or a tablet used by a viewer 200 who browses a web page.
  • The display terminal 5 includes an operation unit 51 such as a keyboard and a mouse, and a display unit 52 such as an LCD (liquid crystal display) that displays characters and images. The operation unit 51 and the display unit 52 may be integrated in the form of a touch panel.
  • A web browser 53 that displays the web page 33 published on the Internet 6 is displayed on the display unit 52.
  • When the viewer 200 operates the operation unit 51, the display terminal 5 requests the search engine 4 to search for the Web page 33 published on the Internet 6 (for example, by requesting a keyword search), and displays the search result from the search engine 4 on the Web browser 53 displayed on the display unit 52.
  • The Internet 6 is an information communication network that interconnects multiple computer networks.
  • The structured data generation system 1, the input terminal 2, the Web server 3, the search engine 4, and the display terminal 5 of this embodiment are communicably connected to the Internet 6 via a wired or wireless network.
  • A LAN, an intranet, or the like may be used instead of the Internet 6.
  • The structured data generation unit 12 of the structured data generation system 1 generates the structured data 16. As described above, the structured data 16 is generated by adding the additional information 15 to the text data or image data serving as the "various data".
  • The structured data 16 including the additional information 15 is generated so that it is highly likely to be included in the information collected by crawling (described later) by the crawler 41 of the search engine 4.
  • For example, the tags 161 added to the original text data or image data input to the data input processing unit 11 may be given a predetermined regularity so that the original data is hierarchized. By adding upper, middle, and lower tags 161 for each unit of semantic content, the original text data or image data is divided into superordinate, intermediate, and subordinate concepts, giving a data structure that is easily collected by crawling (described later) by the crawler 41.
  • It is also conceivable that, when the text data input to the data input processing unit 11 is described in a markup language, a part of a tag includes a related term. For example, for text data including words such as "part-time job" and "employment", the tag may include words such as "work" or "job" that are related to the words in the text data and are likely to be used as search words on the search engine 4, or terms that include those words.
  • Any configuration other than a tag may be used as long as it is likely to be included in the information collected by crawling by the crawler 41 (described later).
  • The tag 161 used for the structured data 16 of this embodiment may be, for example, a script tag of a scripting language such as JavaScript (registered trademark).
  • The structured data 16 of this embodiment is formed so that each item of tag information is unique and does not overlap with other tag information.
  • For example, the structured data 16 is generated by producing the tag information as a JavaScript script tag and combining it with the "body" of the text data serving as the "various data" (the portion displayed on the web page 33), so that the structured data 16 includes unique tag information.
  • The unique tag information described above is only an example, and the unique tag information may be composed of any characters, symbols, or the like.
  • Because the structured data generation unit 12 generates the structured data 16 by adding the unique tag 161, the unique tag 161 functions as a file name of the structured data 16. That is, the structured data 16 is given a unique file name by the unique tag 161.
  • Even if the "text" in the structured data 16 is corrected, the same unique tag 161 as before the correction is added to the corrected "text", so the display in the web page 33 can be corrected. That is, the web page 33 can be easily modified without modifying the markup language that describes it.
  • The creator 100 who creates a Web page operates the operation unit 21 of the input terminal 2 to access the structured data generation system 1 via the Internet 6.
  • The data input processing unit 11 of the structured data generation system 1 causes the input terminal 2 to display the input screen 23.
  • The creator 100 enters the character information to be displayed on the web page 33 as text in the input fields 231, 232, 233 of the input screen 23 and attaches image information (photographs, videos, etc.).
  • When the creator 100 clicks the execute button 234 in this state, the text data and image data entered in the input fields 231, 232, 233 are sent to the structured data generation system 1 via the Internet 6.
  • The data input processing unit 11 of the structured data generation system 1 sends the text data and image data sent from the input terminal 2 to the structured data generation unit 12.
  • Based on a predetermined condition set in advance, the structured data generation unit 12 applies the above [Method 1] or the like to the acquired text data or image data and adds a tag 161 (for example, a tag including a related word, or a JavaScript (registered trademark) script tag) to generate the structured data 16.
  • The structured data generation unit 12 records the generated structured data 16 in the server 14.
  • The creator 100 operates the operation unit 21 of the input terminal 2 to access the Web server 3 and causes the Web page generation unit 31 to generate the Web page 33.
  • The creator 100 causes the structured data transmission unit 13 of the structured data generation system 1 to send the structured data 16 recorded in the server 14 to the Web server 3 via the Internet 6.
  • The structured data generation system 1 receives a structured data call request from the Web page 33, and the structured data transmission unit 13 sends the structured data 16 designated in the call request to the Web page generation unit 31 of the Web server 3.
  • The Web page generation unit 31 uses the data including the structured data 16 to generate the Web page 33 described in the markup language.
  • The tag 161 of the structured data 16 is written into the markup language description. That is, the tag 161 is recorded in the web page 33.
  • The web page generation unit 31 records the generated web page 33 in the database 32.
  • The crawler 41 of the search engine 4 crawls hypertext on the Internet 6.
  • The database 32 of the Web server 3 also becomes a target of crawling.
  • The web page 33 is a target of information collection by this crawling, and the information of the web page 33 is recorded in the database 42 of the search engine 4 and used in subsequent search word searches using the search engine 4.
  • If the structured data 16 forming the web page 33 is also collected by the crawling of the crawler 41, the information of the structured data 16 will be used in subsequent search word searches using the search engine 4.
  • If the search engine 4 has a function of displaying information specialized for a specific item together with a search result (for example, a function of collectively displaying information of a plurality of recruitment sites as a search result), the information of the structured data 16 will be used as information that makes up the information specialized for the specific item.
  • The web page 33 including the structured data 16 can be dynamically expanded on the web browser 53.
  • When the execute button 234 is clicked, the information entered in the input fields 231, 232, 233 is sent from the data input processing unit 11 to the structured data generation unit 12, and the structured data 16 is generated.
  • Based on a predetermined condition (for example, the character information entered in the input field 231), the structured data generation unit 12 adds to the newly entered character information the same tag 161 as the tag 161 of the structured data 16 first generated from the information entered in the predetermined input field (for example, the input field 231), and generates the new structured data 16.
  • The newly generated structured data 16 is sent to the Web server 3 by the structured data transmission unit 13.
  • The web page generation unit 31 of the Web server 3 replaces the existing structured data 16 with the newly transmitted structured data 16 that carries the same tag 161. This completes the correction of the web page 33. That is, the creator 100 can add to or modify the web page 33 without directly modifying the markup language description of the web page 33.
  • Various data forming at least a part of the Web page 33 is input, and the structured data 16 including the input data is automatically generated using a predetermined generation criterion.
  • The structured data 16 can thus be generated automatically from the various data input to the structured data generation system 1. Therefore, even a user who has no knowledge of the structured data 16 can efficiently create a Web page 33 that the search engine 4 can search and collect information from. This makes it possible to easily create the web page 33 using the structured data 16 and to easily create and publish a web page 33 that many people can readily browse.
  • The recorded structured data 16 is transmitted to the calling web page 33, so that structured data 16 generated and recorded in advance can be sent to the web page 33 when it is needed on the web page 33 side. Therefore, the user of the structured data generation system 1 can use structured data 16 generated and recorded in advance in whichever Web page 33 needs it, and can easily create and publish a Web page 33 that many people can readily browse.
  • By configuring the generated structured data 16 as information that the crawler 41 can acquire when it crawls to collect information, the web page 33 using the structured data 16 can be made in a form that is readily reflected in the search results of the search engine 4. This makes it possible to easily create and publish a Web page 33 that many people can readily browse.
  • A unique tag 161 is issued for each item of structured data 16, and when at least a part of the structured data 16 is modified after it has been generated, the same tag as the unique tag 161 is issued. Therefore, even when the structured data 16 forming the Web page 33 is corrected after the fact, the creator 100 can easily make additions or corrections without adding to or modifying the data written in a markup language, such as the HTML data of the Web page 33. This provides a web page 33 that is highly convenient for the creator 100.
  • Because the tag 161 is generated as a script tag such as JavaScript (registered trademark), a Web page 33 that can be easily added to or modified and that is reflected in the search results of the search engine 4 can be configured to expand dynamically on the Web browser 53. This makes it possible to generate a Web page 33 that is highly convenient for the creator and has a strong visual effect.
  • The present invention can be realized on various computer hardware.
  • The above embodiment is an example of the present invention, and the present invention is not limited to it. That is, the specific configuration of the present invention is not limited to the above embodiment, and various modifications can be made without departing from the spirit of the present invention.
  • 1: structured data generation system
  • 4: search engine
  • 11: data input processing unit (various data input processing means)
  • 12: structured data generation unit (structured data generation means)
  • 13: structured data transmission unit (structured data transmission means)
  • 14: server (structured data recording means)
  • 15: additional information (additional data)
  • 16: structured data
  • 33: web page
  • 41: crawler
  • 161: tag (additional data, script tag)
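
The unique-tag mechanism described above, in which modified structured data is reissued under the same tag 161 so that the web page itself need not be edited, can be sketched as follows. This is a minimal illustration under stated assumptions, not the implementation disclosed in the application: the names `StructuredData`, `StructuredDataStore`, `generate`, `modify`, and `fetch` are hypothetical, and using a random UUID as the unique tag is an assumption made for the example.

```python
import uuid
from dataclasses import dataclass, field


@dataclass
class StructuredData:
    tag: str    # stands in for the unique tag 161 (hypothetical representation)
    body: dict  # the "various data" entered on the input screen


@dataclass
class StructuredDataStore:
    """Stands in for the server 14 (structured data recording means)."""
    records: dict = field(default_factory=dict)

    def generate(self, body: dict) -> StructuredData:
        # Issue a fresh unique tag for newly generated structured data.
        tag = uuid.uuid4().hex  # assumption: any collision-free identifier would do
        record = StructuredData(tag=tag, body=dict(body))
        self.records[tag] = record
        return record

    def modify(self, tag: str, changes: dict) -> StructuredData:
        # Reissue the SAME tag for the modified data, so a page that
        # references the tag picks up the change without being edited.
        old = self.records[tag]
        record = StructuredData(tag=tag, body={**old.body, **changes})
        self.records[tag] = record
        return record

    def fetch(self, tag: str) -> StructuredData:
        # Answers a call request from a web page that embeds the tag.
        return self.records[tag]


store = StructuredDataStore()
first = store.generate({"title": "Transportation"})
revised = store.modify(first.tag, {"title": "Logistics"})
print(revised.tag == first.tag)
```

Keeping the tag stable across modifications is what lets the page reference act like a file name: the page keeps pointing at the same identifier while the recorded content behind it changes.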

Abstract

[Problem] To provide a structured data generation system which makes it possible to easily create a web page that uses structured data, and also readily create and publish a web page that can be easily browsed by many people. [Solution] A structured data generation system 1 for generating structured data by adding additional data to various types of data for forming at least a portion of a web page 33, said structured data generation system 1 comprising: a data input processing unit 11 for allowing various types of data to be input; a structured data generation unit 12 which uses prescribed generation criteria to automatically generate structured data 16 including the input various types of data; and a server 14 for recording the generated structured data 16.

Description

Structured data generation system and program
The present invention relates to a system and a program for generating structured data from various kinds of data that make up at least part of a Web page, and for generating a Web page using such structured data.
To generate a Web page from data such as text and images, the text data forming the sentences and the image data forming the images of the page are marked up in a predetermined manner, for example by conversion to HTML (HyperText Markup Language, hereinafter "HTML"). Once the Web page is uploaded to a Web server and published on the Internet, it becomes a target of searches by search engines. Third parties using the Internet then find and browse the desired Web page through a search engine.
A search engine, in particular a robot-type search engine, has a crawler that searches Web pages. The crawler performs crawling, that is, it collects information on the text and images that make up Web pages. The search engine builds a database from the crawling results and uses it to display search results. To have many people view a Web page published on the Internet, it is therefore important to build the page in a form that will be included in the crawling results.
Here, search engines and crawlers merely recognize HTML-formatted text data and the like as symbols. A technique known as structured data is used to make it more likely that one's own Web page is collected by a crawler. In this technique, one or more items of metadata indicating the semantic content are added to the text data, image data, and so on that make up a Web page, for each unit of meaning, so that the data is organized in a way that lets search engines and crawlers easily interpret the content of the text and images.
Conventionally, a method of building a Web page from structured data and publishing it on the Internet is known (see, for example, Patent Document 1).
JP 2018-77630 A
However, building structured data, and in particular building structured data from which search engines and their crawlers readily collect information, requires specialized knowledge. With the invention described in Patent Document 1, it is therefore difficult for a person publishing a Web page on the Internet to have search engines efficiently search and collect information from that page so that many people view it.
The present invention has been made in view of the above problem. Its object is to provide a structured data generation system that makes it easy to create a Web page using structured data, and to create and publish a Web page that many people can readily browse.
To solve this problem, the structured data generation system according to the invention of claim 1 generates structured data by adding additional data to various data that make up at least part of a Web page, and comprises: various data input processing means for having the various data input; structured data generation means for automatically generating, using a predetermined generation criterion, the structured data including the input various data; and structured data recording means for recording the generated structured data.
In the invention according to claim 2, in addition to the configuration according to claim 1, the system comprises structured data transmission means that, when the Web page requests the structured data, transmits the structured data recorded in the structured data recording means to the requesting Web page.
In the invention according to claim 3, the structured data generation means configures the structured data it generates as information that a crawler, which acquires information on Web pages, can acquire when it crawls the destination Web page to collect information.
In the invention according to claim 4, in addition to the configuration according to any one of claims 1 to 3, the structured data generation means issues, for each item of structured data, a unique tag as the additional data. When at least part of the structured data is modified after it has been generated, the structured data generation means issues a tag identical to that unique tag and adds the issued tag to the modified various data, thereby generating the modified structured data.
In the invention according to claim 5, in addition to the configuration according to claim 4, the tag is a script tag that, when the structured data is used in the Web page, serves as information for dynamically expanding, on the Web page, the various data that make up the structured data.
In the invention according to claim 6, in addition to the configuration according to any one of claims 1 to 5, the structured data includes job posting information as the various data.
The invention according to claim 7 is a computer-readable program that causes a computer to function as the structured data generation system according to any one of claims 1 to 6.
According to the present invention, the various data that make up at least part of a Web page are input, and structured data including that data is automatically generated using a predetermined generation criterion and recorded. Structured data can thus be generated automatically from the page data entered into the structured data generation system. As a result, even a user with no knowledge of structured data can create a Web page that search engines can search and collect information from efficiently. This makes it possible to easily create a Web page that uses structured data, and to easily create and publish a Web page that many people can readily browse.
FIG. 1 is a network configuration diagram and functional block diagram conceptually showing the configuration of a structured data providing system 1A according to this embodiment.
Embodiments of the present invention are described below with reference to the drawings.
[Basic configuration]
FIG. 1 is a network configuration diagram and functional block diagram conceptually showing the configuration of a structured data providing system 1A according to an embodiment of the present invention.
As shown in FIG. 1, the structured data providing system 1A includes the structured data generation system 1 according to the present invention, an input terminal 2, a Web server 3, a search engine 4, and a display terminal 5, which are communicably connected via the Internet 6. The structured data generation system 1, the input terminal 2, the Web server 3, the search engine 4, and the display terminal 5 are each a computer device or computer system that has a computing function such as a CPU and storage means such as RAM, ROM, and EEPROM, and that performs various kinds of processing under program control.
The structured data generation system 1 is a server system that provides the functions of a Web server, a database server, and the like, as well as the function of generating structured data.
As functional means implemented by hardware and the execution of programs, the structured data generation system 1 includes a data input processing unit 11 serving as the "various data input processing means", a structured data generation unit 12 serving as the "structured data generation means", and a structured data transmission unit 13 serving as the "structured data transmission means". The structured data generation system 1 also includes a server 14 serving as the "structured data recording means".
The data input processing unit 11 causes the display unit 22 of the input terminal 2 to display an input screen 23 via the Internet 6, and acquires, via the Internet 6, the data ("various data") entered on the input screen 23 through operations by the creator 100 (described later).
In this embodiment, the "various data" acquired by the data input processing unit 11 are, for example, text data and image data displayed on the Web page 33 (described later), but are not limited to these. They may be any data, including data not displayed on the Web page 33: for example, text data or image data rendered in white or transparent characters so that it is not visible at first glance, or text data or image data that forms part of the Web page 33 but is not displayed on it. For simplicity, the following description assumes that the "various data" are text data and image data displayed on the Web page 33.
The structured data generation unit 12 generates structured data 16 (described later) from the text data and image data acquired by the data input processing unit 11. The structured data 16 is used in generating the Web page 33 (described later); details are given below.
The structured data generation unit 12 may generate the structured data 16 by, for example, the methods described in [Method 1] and [Method 2] below.
[Method 1]
The structured data generation unit 12 generates the structured data 16 by adding predetermined data to each item of data entered in the input fields 231, 232, 233 (described later) of the input screen 23 displayed on the input terminal 2. For example, when the input field 231 is the field for the job type being offered, the job type name entered in the input field 231, for example the text
Transportation
yields the structured data 16
"title": "Transportation"
However, the data to be added need not be determined in advance; the data to be added may vary according to various conditions, or arbitrary data may be added.
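The mapping described in [Method 1] can be sketched as follows. This is only an illustration under stated assumptions: the pairing of input field 231 with the key "title" comes from the example in the text, while the keys assumed for fields 232 and 233 and the function name `generate_structured_data` are hypothetical.

```python
import json

# Hypothetical mapping from input-field numbers on the input screen 23 to the
# predetermined key that is added to each value. Only 231 -> "title" appears
# in the text; the other two keys are assumptions for illustration.
FIELD_KEYS = {
    "231": "title",
    "232": "description",
    "233": "image",
}


def generate_structured_data(inputs: dict[str, str]) -> dict[str, str]:
    """Pair each non-empty input value with the key predetermined for its field."""
    return {
        FIELD_KEYS[field_id]: value
        for field_id, value in inputs.items()
        if field_id in FIELD_KEYS and value
    }


if __name__ == "__main__":
    data = generate_structured_data({"231": "Transportation"})
    print(json.dumps(data))  # prints {"title": "Transportation"}
```

Keeping the field-to-key table separate from the function is one way to accommodate the variation the text allows, where the added data changes with conditions rather than being fixed in advance.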
[Method 2]
In generating the structured data 16, the structured data generation unit 12 preferably has functions that help recognize the semantic content of the text data. Examples include a function that extracts words from sentences; a function that extracts synonyms and antonyms of the extracted words from a database holding many words and associates them; a function that recognizes persons and objects in an image; and a function that extracts, from a database holding many words, words denoting the recognized persons and objects and associates them.
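As a rough sketch of the word-extraction and synonym-association functions mentioned in [Method 2], the following uses a tiny in-memory dictionary in place of the word database. The dictionary contents and the function names are assumptions made for illustration; the text does not specify how these functions are implemented.

```python
import re

# Toy stand-in for "a database holding many words" (contents are hypothetical).
SYNONYMS = {
    "job": ["work", "employment"],
    "driver": ["chauffeur"],
}


def extract_words(sentence: str) -> list[str]:
    """Lower-case the sentence and return its alphabetic words in order."""
    return re.findall(r"[a-z]+", sentence.lower())


def related_terms(sentence: str) -> dict[str, list[str]]:
    """Associate each extracted word found in the database with its synonyms."""
    return {word: SYNONYMS[word] for word in extract_words(sentence) if word in SYNONYMS}


if __name__ == "__main__":
    print(related_terms("Driver job in Tokyo"))
```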
The following description is based on [Method 1] above.
The structured data transmission unit 13 transmits the structured data generated by the structured data generation unit 12 to the Web server 3 or the like.
The server 14 provides various server functions, including that of a database server that records various data and that of a Web server that transmits data requested by the Web server 3 or the like to the requester via the Internet 6.
The server 14 records additional information 15, serving as the "additional data", and the structured data 16.
The additional information 15 is text data or the like that is added to the original text data or image data when the structured data 16 is generated from the text data or image data acquired by the data input processing unit 11. Specifically, it is, for example, other text data, including predetermined tags (such as the tag 161 described later).
The structured data 16 is described in detail in the [Structured data] section below.
The input terminal 2 is a terminal such as a personal computer, smartphone, or tablet used by the creator 100 who creates the Web page 33 (described later). The input terminal 2 includes an operation unit 21 such as a keyboard and mouse, and a display unit 22, such as an LCD (liquid crystal display), that displays text and images. The operation unit 21 and the display unit 22 may be integrated, as in a touch panel.
As shown in FIG. 1, the input screen 23 is displayed on the display unit 22 of the input terminal 2. The input screen 23 is displayed on the display unit 22 in response to data sent by the data input processing unit 11 of the structured data generation system 1, and serves as the platform on which the creator 100 generates the Web page 33 (described later).
As shown in FIG. 1, the input screen 23 has a plurality of input fields 231, 232, 233 for entering text data and image data by operating the operation unit 21. Specifically, the input fields 231, 232, 233 may, for example, be a "title input field", a "body text input field", and an "image data input field", corresponding to items of the Web page 33 such as "title", "body text", and "attached image".
The input screen 23 also has an execute button 234. When the execute button 234 is clicked using the operation unit 21, the text data and image data entered in the input fields 231, 232, 233 are transmitted to the structured data generation system 1.
The Web server 3 is a server computer, a server system, or the like. The Web server 3 includes a Web page generation unit 31 and a database 32.
The Web page generation unit 31 is a functional means implemented by the hardware and programs of the Web server 3. It generates a Web page 33 that can be viewed by terminals such as the display terminal 5 via a network such as the Internet 6. The Web page generation unit 31 generates the Web page 33 based on, for example, operations on the input terminal 2 by the creator 100, or operations by an administrator (not shown) of the structured data generation system 1 on an operation terminal (not shown) connected to the Internet 6. The following description assumes that the Web page generation unit 31 generates the Web page 33 based on operations by the creator 100.
The database 32 records the Web page 33 generated by the Web page generation unit 31. The Web server 3 also has a function of transmitting the Web page 33 recorded in a database (not shown) via the Internet 6 in response to a request from the display terminal 5 or the like.
The Web page 33 recorded in the database 32 of the Web server 3 is written in a markup language suited to publishing information on the Internet 6, such as HTML or XML (Extensible Markup Language). In the Web browser 53 displayed on the display terminal 5, it displays text and images as content based on "various data" such as text data (for example, character information written in a language such as Japanese or English) and image data (both still images and video). However, as long as the Web page 33 can supply content such as text and images via the Internet 6, it may be written in any markup language other than HTML or XML, or in a format other than a markup language, and it may cause the display terminal 5 to output content other than text and images, such as audio.
In this embodiment the Web page generation unit 31 is part of the Web server 3, but this is not a limitation: the Web page generation unit 31 may be provided elsewhere, for example in the structured data generation system 1 or the input terminal 2. In that case, the Web page 33 generated by the Web page generation unit 31 is sent to the Web server 3 via the Internet 6 or the like, recorded in the database 32 of the Web server 3, and transmitted from the Web server 3 to the display terminal 5 or the like.
The search engine 4 is a system that searches various kinds of information (Web pages, Web sites, image files, net news, and so on) on the Internet 6 or on a closed network such as a LAN. In response to a search, such as a keyword search, performed by the viewer 200 using the display terminal 5, the search engine 4 supplies the display terminal 5, as search results, with information on one or more candidate Web pages (including the Web page 33) selected from the many Web pages (including the Web page 33) that make up the hypertext system (WWW: World Wide Web) on the Internet 6. For simplicity, "one or more Web pages (including the Web page 33)" is hereinafter written simply as "Web page 33" unless a distinction is needed.
The search engine 4 of this embodiment is a robot-type search engine. It includes a crawler 41 and a database 42.
The crawler 41 is a functional means implemented by the hardware and programs of the search engine 4. It automatically collects information on the Web page 33 and other pages that make up the hypertext system on the Internet 6, and records the collected data in the database 42. The search engine 4 ranks the pages registered in the database based on the search keyword entered by the user and, using this ranking, sends search results to the display terminal 5 in response to the search request received from it.
The search engine 4 also has a function of displaying, together with the search results, information specialized for a particular category, based on the results of crawling by the crawler 41. For example, when a viewer 200 looking for job postings accesses the search engine 4 using the Web browser 53 (described later) of the display terminal 5 and enters a keyword related to Web work as a search word, the search engine 4 collects the information from multiple job sites recorded in the database 42 as a result of crawling and displays it together in the Web browser 53, thereby suggesting job postings likely to match what each viewer 200 wants. This is not limited to job postings: for the various search words entered from the display terminal 5, the search engine 4 can likewise gather and display in the Web browser 53 information from multiple sites in various categories (for example, accommodation such as hotels, products such as books and daily necessities, concerts, and films).
The display terminal 5 is a terminal such as a personal computer, smartphone, or tablet used by a viewer 200 who browses Web pages. The display terminal 5 includes an operation unit 51 such as a keyboard and mouse, and a display unit 52, such as an LCD (liquid crystal display), that displays text and images. The operation unit 51 and the display unit 52 may be integrated, as in a touch panel. The display unit 52 displays a Web browser 53, which displays the Web page 33 published on the Internet 6.
When the viewer 200 operates the operation unit 51 (for example, to request a keyword search from the search engine 4), the display terminal 5 requests the search engine 4 to search for the Web page 33 published on the Internet 6, and displays the search results from the search engine 4 in the Web browser 53 shown on the display unit 52.
The Internet 6 is an information and communication network that interconnects multiple computer networks. The structured data generation system 1, the input terminal 2, the Web server 3, the search engine 4, and the display terminal 5 of this embodiment are communicably connected to the Internet 6 over wired or wireless networks. In this embodiment, a LAN, an intranet, or the like may be used in place of the Internet 6.
[Structured data]
As described above, in this embodiment the structured data generation unit 12 of the structured data generation system 1 generates the structured data 16. Also as described above, the structured data 16 is generated by adding the additional information 15 to the text data or image data serving as the "various data".
The structured data 16, including the additional information 15, is preferably generated so that it is highly likely to be included in the information collected in crawling (described later) by the crawler 41 of the search engine 4.
Specifically, the tags 161 added to the original text data or image data input to the data input processing unit 11 may, for example, follow a predetermined regularity so that the original data is organized hierarchically. One possibility is to add upper, middle, and lower tags 161 according to the semantic content of the original text data or image data, dividing it into superordinate, intermediate, and subordinate concepts. This gives a data structure that is easily collected in crawling (described later) by the crawler 41.
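One way to picture the three-level tagging described above is as a nested mapping keyed by upper, middle, and lower tags. The sketch below is only an illustration: the tag names, the sample values, and the function `build_hierarchy` are hypothetical and do not come from the specification.

```python
from typing import Iterable

# Each row is (upper tag, middle tag, lower tag, original text).
Row = tuple[str, str, str, str]


def build_hierarchy(rows: Iterable[Row]) -> dict[str, dict[str, dict[str, str]]]:
    """Group original data under upper, middle, and lower tags."""
    tree: dict[str, dict[str, dict[str, str]]] = {}
    for upper, middle, lower, text in rows:
        tree.setdefault(upper, {}).setdefault(middle, {})[lower] = text
    return tree


if __name__ == "__main__":
    # Hypothetical tags and placeholder values for illustration only.
    rows = [
        ("job", "conditions", "type", "Transportation"),
        ("job", "conditions", "place", "(work location)"),
        ("job", "contact", "name", "(company name)"),
    ]
    print(build_hierarchy(rows))
```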
As another example, part of the tag (for example, the tag 161) used when the text data input to the data input processing unit 11 is written in a markup language may include a term related to the semantic content of that text data. For text data containing words about "work", "part-time jobs", or "employment", such a term could be a related word that is likely to be used as a search word in the search engine 4, such as "job offer", or a term containing that word. However, any configuration other than tags (for example, the tag 161) may be used, as long as it makes the data more likely to be included in the information collected in crawling (described later) by the crawler 41.
The tag 161 used in the structured data 16 of this embodiment may, for example, be a script tag of a scripting language such as javascript (registered trademark). This makes it easy to configure the structured data 16 as dynamic data that is expanded dynamically in the Web browser 53.
In the structured data 16 of this embodiment, each item of tag information is unique and does not duplicate any other tag information. Specifically, when the structured data is generated by producing the tag information as a javascript script tag, for example, the tag 161 serving as the "additional data", <script src="xxx.js"></script> as shown in FIG. 1, is added to the "body text" of the text data serving as the "various data" (this is the part displayed on the Web page 33), giving
<script src="xxx.js">body text</script>
as the structured data 16.
As long as the src="xxx.js" part of the tag 161 is unique, the structured data 16 is generated with unique tag information. The unique tag information described above is only an example; it may consist of any characters, symbols, and so on.
 In this embodiment, because the structured data generation unit 12 generates the structured data 16 by adding the unique tag 161, the unique tag 161 functions as the file name of the structured data 16. In other words, the structured data 16 is given a unique file name by the unique tag 161.
 Once the tag 161 has been added to the Web page 33, the display of the Web page 33 can be corrected even if the "body" part of the structured data 16 is later revised. The same unique tag 161 as before the revision is added to the revised "body" to generate new structured data 16, which then replaces the structured data 16 in the Web page 33 that has the same tag 161. In other words, the Web page 33 can easily be corrected without modifying the markup that describes it.
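The role of the unique tag as a file name, and its reuse after a revision, can be sketched as below. The use of a UUID as the unique value and the function names are assumptions made for illustration; the text only requires that the value be unique.

```python
import uuid


def issue_tag_id() -> str:
    """Return a value that is unique per piece of structured data (assumed here: a UUID)."""
    return uuid.uuid4().hex


def build_structured_data(tag_id: str, body: str) -> str:
    """Wrap the body in a script tag whose src doubles as a unique file name."""
    return f'<script src="{tag_id}.js">{body}</script>'


tag_id = issue_tag_id()
original = build_structured_data(tag_id, "Cafe staff wanted")
# A later revision reuses the same tag_id, so only the body differs.
revised = build_structured_data(tag_id, "Cafe staff wanted (weekends only)")
```

Because `original` and `revised` share the same tag, a page that references the tag needs no change to its own markup when the body is revised.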
 [Processing procedure]
Next, the processing procedure of this embodiment will be described.
 [Generation of structured data]
First, the creator 100, who creates the Web page, operates the operation unit 21 of the input terminal 2 to access the structured data generation system 1 via the Internet 6. The data input processing unit 11 of the structured data generation system 1 causes the input terminal 2 to display the input screen 23.
 The creator 100 enters the character information to be displayed on the Web page 33 as text in the input fields 231, 232, and 233 of the input screen 23, and attaches image information (photographs, videos, and the like). When the creator 100 clicks the execute button 234 in this state, the text data and image data entered in the input fields 231, 232, and 233 are sent to the structured data generation system 1 via the Internet 6.
 The data input processing unit 11 of the structured data generation system 1 passes the text data and image data received from the input terminal 2 to the structured data generation unit 12.
 Based on predetermined conditions set in advance, the structured data generation unit 12 adds a tag 161 containing related words and the like (for example, a JavaScript (registered trademark) script tag) to the acquired text data and image data, using a method such as [Method 1] described above, and thereby generates the structured data 16.
 The structured data generation unit 12 records the generated structured data 16 in the server 14.
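The flow from submitted input fields to recorded structured data can be sketched as follows. The `StructuredDataStore` class stands in for the server 14, and the rule "one tag per input field" as well as the field names are assumptions chosen for illustration, not details given in the text.

```python
from dataclasses import dataclass, field
from typing import Dict


@dataclass
class StructuredDataStore:
    """Stands in for the server 14: keeps generated structured data keyed by tag."""
    records: Dict[str, str] = field(default_factory=dict)

    def record(self, tag: str, data: str) -> None:
        self.records[tag] = data


def generate_structured_data(fields: Dict[str, str], store: StructuredDataStore) -> Dict[str, str]:
    """Turn each submitted input field into structured data and record it."""
    generated = {}
    for field_name, text in fields.items():
        tag = f"{field_name}.js"  # hypothetical rule: one tag per input field
        data = f'<script src="{tag}">{text}</script>'
        store.record(tag, data)
        generated[tag] = data
    return generated


store = StructuredDataStore()
submitted = {
    "field231": "Cafe staff wanted",
    "field232": "Tokyo",
    "field233": "1,200 yen/hour",
}
generated = generate_structured_data(submitted, store)
```

Keying the store by tag means a later submission for the same field simply overwrites the earlier record.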
 [Generation of a Web page using structured data]
The creator 100 operates the operation unit 21 of the input terminal 2 to access the Web server 3 and causes the Web page generation unit 31 to generate the Web page 33.
 Specifically, the creator 100 causes the structured data transmission unit 13 of the structured data generation system 1 to send the structured data 16 recorded in the server 14 to the Web server 3 via the Internet 6. The structured data generation system 1 thus receives a call request for structured data from the Web page 33, and the structured data transmission unit 13 transmits the structured data 16 designated in that call request to the Web page generation unit 31 of the Web server 3.
 At the destination Web server 3, the Web page generation unit 31 uses data including the structured data 16 to generate the Web page 33 written in a markup language. The tag 161 of the structured data 16 appears in the markup of this Web page 33. That is, the tag 161 is recorded in the Web page 33.
 The Web page generation unit 31 records the generated Web page 33 in the database 32.
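The call request and the page generation step can be sketched together as below. The function names, the dictionary standing in for recorded structured data, and the page layout are all hypothetical; the text does not specify a transport or a page template.

```python
from typing import Dict, List


def handle_call_request(recorded: Dict[str, str], tag: str) -> str:
    """Return the structured data designated by a call request."""
    if tag not in recorded:
        raise KeyError(f"no structured data recorded for tag {tag!r}")
    return recorded[tag]


def generate_web_page(title: str, pieces: List[str]) -> str:
    """Compose a minimal HTML page whose body contains the structured data."""
    body = "\n".join(f"    {piece}" for piece in pieces)
    return (
        "<!DOCTYPE html>\n"
        "<html>\n"
        f"  <head><title>{title}</title></head>\n"
        "  <body>\n"
        f"{body}\n"
        "  </body>\n"
        "</html>"
    )


recorded = {"job-001.js": '<script src="job-001.js">Cafe staff wanted</script>'}
page = generate_web_page("Job offers", [handle_call_request(recorded, "job-001.js")])
```

The resulting page carries the tag in its markup, which is the state described above as the tag being recorded in the Web page.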
 [Crawling by the search engine]
The crawler 41 of the search engine 4 crawls hypertext on the Internet 6, so the database 32 of the Web server 3 is also subject to crawling. The Web page 33 is therefore a target of information collection by this crawling. Its information is recorded in the database 42 of the search engine 4 and is used in subsequent search-word searches with the search engine 4.
 Because the structured data 16 making up the Web page 33 is also collected by the crawling of the crawler 41, the information in the structured data 16 is likewise used in subsequent search-word searches with the search engine 4.
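How a crawler might pick up the tag and its body from a page can be sketched with Python's standard `html.parser` module. This is only an illustration of collecting script elements from markup; the class name is hypothetical, and how the crawler 41 of an actual search engine treats such elements is not stated in the text.

```python
from html.parser import HTMLParser
from typing import List, Optional, Tuple


class ScriptTagCollector(HTMLParser):
    """Collects (src, inline text) pairs for every script element in a page."""

    def __init__(self) -> None:
        super().__init__()
        self.found: List[Tuple[str, str]] = []
        self._current_src: Optional[str] = None
        self._buffer: List[str] = []

    def handle_starttag(self, tag, attrs):
        if tag == "script":
            # attrs is a list of (name, value) pairs.
            self._current_src = dict(attrs).get("src") or ""
            self._buffer = []

    def handle_data(self, data):
        # Only keep text that appears inside a script element.
        if self._current_src is not None:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._current_src is not None:
            self.found.append((self._current_src, "".join(self._buffer)))
            self._current_src = None


html_page = '<html><body><script src="job-001.js">Cafe staff wanted</script><p>Other text</p></body></html>'
collector = ScriptTagCollector()
collector.feed(html_page)
```

After `feed`, `collector.found` holds one pair per script element, which a crawler could store alongside the rest of the page's information.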
 If the search engine 4 has a function for displaying information specialized for a specific item together with the search results (for example, a function for displaying information from multiple job-offer sites together as a search result), the information in the structured data 16 is also used as part of that specialized information.
 [Search from the display terminal]
When the viewer 200 uses the operation unit 51 of the display terminal 5 to start the Web browser 53 on the display unit 52 and performs a search with a search word in the Web browser 53, the Web browser 53 accesses the search engine 4 and has it perform the search. The search engine 4 uses the information recorded in the database 42 as a result of the crawling by the crawler 41, among other information, to display the search results in the Web browser 53.
 Generating the structured data 16 that makes up the Web page 33 with a script tag, such as a JavaScript (registered trademark) script tag, allows the Web page 33 containing the structured data 16 to be expanded dynamically on the Web browser 53.
 [Modification of structured data]
When the creator 100 wants to correct or add to the content of the Web page 33, the creator 100 accesses the structured data generation system 1 from the input terminal 2 and has the input screen 23 displayed on the display unit 22 of the input terminal 2. The creator 100 then enters the character information or image information for the correction or addition in at least one of the input fields 231, 232, and 233, and clicks the execute button 234.
 When the execute button 234 is clicked, the information entered in the input fields 231, 232, and 233 is sent from the data input processing unit 11 to the structured data generation unit 12, and the structured data 16 is generated.
 Here, the structured data generation unit 12 adds the same tag 161 to data that meets the same predetermined condition (for example, character information entered in the input field 231). The newly entered character information therefore receives the same tag 161 as the structured data 16 previously generated from information entered in the same input field (for example, the input field 231), and is generated as new structured data 16.
 The newly generated structured data 16 is then sent to the Web server 3 by the structured data transmission unit 13. The Web page generation unit 31 of the Web server 3 replaces the existing structured data 16 that carries the same tag 161 with the newly received structured data 16. This completes the correction of the Web page 33. In other words, the creator 100 can add to or correct the Web page 33 without directly editing its markup.
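The replacement step can be sketched as a lookup by tag followed by a swap of the body. This is a minimal sketch under assumptions: the tag is written exactly as `<script src="...">`, the function name is hypothetical, and a regular expression is used only for brevity where a production system would more likely use an HTML parser.

```python
import re


def replace_structured_data(page_html: str, tag: str, new_body: str) -> str:
    """Swap the body of the script element whose src equals `tag`; leave the rest untouched."""
    pattern = re.compile(
        r'(<script src="' + re.escape(tag) + r'">)(.*?)(</script>)',
        re.DOTALL,
    )
    # A function replacement avoids backslash handling in new_body.
    return pattern.sub(lambda m: m.group(1) + new_body + m.group(3), page_html, count=1)


old_page = (
    '<body><script src="field231.js">Cafe staff wanted</script>'
    '<script src="field232.js">Tokyo</script></body>'
)
new_page = replace_structured_data(old_page, "field231.js", "Cafe staff wanted (weekends only)")
```

Only the element with the matching tag changes; the element tagged `field232.js` and the surrounding markup are left as they were.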
 [Effects]
As described above, in this embodiment, the various data making up at least part of the Web page 33 is input, and the structured data 16 containing that data is automatically generated according to a predetermined generation criterion and recorded. The structured data 16 is thus generated automatically from the data input to the structured data generation system 1. Even a user with no knowledge of structured data can therefore create a Web page 33 that the search engine 4 can search and collect information from efficiently. This makes it easy to create a Web page 33 that uses the structured data 16, and to create and publish a Web page 33 that many people can readily view.
 In this embodiment, when a call request is received from the Web page 33, the recorded structured data 16 is transmitted to the calling Web page 33. Structured data 16 generated and recorded in advance can thus be sent to the Web page 33 when the Web page 33 needs it, and used to compose the Web page 33. A user of the structured data generation system 1 can therefore use previously generated and recorded structured data 16 in whichever Web page 33 needs it, when it is needed, and can easily create and publish a Web page 33 that many people can readily view.
 In this embodiment, the generated structured data 16 is configured as information that the crawler 41 can acquire when it crawls to collect information, so a Web page 33 using the generated structured data 16 can be configured in a form that is readily reflected in the search results of the search engine 4. This makes it easy to create and publish a Web page 33 that many people can readily view.
 In this embodiment, a unique tag 161 is issued for each piece of structured data 16. When at least part of the structured data 16 is modified after it has been generated, a tag 161 identical to that unique tag 161 is issued and added to the modified data to generate the modified structured data 16. Even after the structured data 16 making up the Web page 33 has been modified, the creator 100 can therefore make additions and corrections easily, without directly writing or editing data written in a markup language, such as the HTML data of the Web page 33. This makes it possible to provide a Web page 33 that is highly convenient for the creator 100.
 In this embodiment, generating the tag 161 as a script tag, such as a JavaScript (registered trademark) script tag, makes it possible to form a Web page 33 that is easy to add to and correct, is easily reflected in the search results of the search engine 4, and is expanded dynamically on the Web browser 53. A Web page 33 that is highly convenient for the creator and has strong visual and other effects can thus be generated.
 In this embodiment, it is possible to provide, for job offer information, a Web page 33 that is readily reflected in the search results of the search engine 4 and is highly convenient for the creator 100.
 In this embodiment, configuring some or all of the functional means of the structured data generation system 1 of the present invention as a computer-readable program allows the present invention to be realized on a wide variety of computer hardware.
 The above embodiment is an example of the present invention, and it goes without saying that the present invention is not limited to it. That is, the specific configuration of the present invention is not limited to the above embodiment, and various modifications are possible without departing from the spirit of the present invention.
1 Structured data generation system
4 Search engine
11 Data input processing unit (various data input processing means)
12 Structured data generation unit (structured data generation means)
13 Structured data transmission unit (structured data transmission means)
14 Server (structured data recording means)
15 Additional information (additional data)
16 Structured data
33 Web page
41 Crawler
161 Tag (additional data, script tag)

Claims (7)

  1. A structured data generation system that generates structured data by adding additional data to various data for constituting at least part of a Web page, the system comprising:
     various data input processing means for receiving input of the various data;
     structured data generation means for automatically generating, using a predetermined generation criterion, the structured data including the input various data; and
     structured data recording means for recording the generated structured data.
  2. The structured data generation system according to claim 1, further comprising structured data transmission means for transmitting, when a call request for the structured data is received from the Web page, the structured data recorded in the structured data recording means to the calling Web page.
  3. The structured data generation system according to claim 1 or 2, wherein the structured data generation means configures the structured data to be generated as information that a crawler, which acquires information on the Web page, can acquire when it crawls the destination Web page to collect information.
  4. The structured data generation system according to any one of claims 1 to 3, wherein the structured data generation means issues a unique tag as the additional data for each piece of the structured data, and
     when at least part of the structured data is modified after the structured data has been generated, the structured data generation means issues a tag identical to the unique tag and adds the issued tag to the modified various data to generate the modified structured data.
  5. The structured data generation system according to claim 4, wherein the tag is a script tag serving as information for dynamically expanding, on the Web page, the various data constituting the structured data when the structured data is used in the Web page.
  6. The structured data generation system according to any one of claims 1 to 5, wherein the structured data includes job offer information as the various data.
  7. A computer-readable program that causes a computer to function as the structured data generation system according to any one of claims 1 to 6.

PCT/JP2019/006004 2019-02-19 2019-02-19 Structured data generation system and program WO2020170323A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/JP2019/006004 WO2020170323A1 (en) 2019-02-19 2019-02-19 Structured data generation system and program

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/JP2019/006004 WO2020170323A1 (en) 2019-02-19 2019-02-19 Structured data generation system and program

Publications (1)

Publication Number Publication Date
WO2020170323A1 true WO2020170323A1 (en) 2020-08-27

Family

ID=72143760

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2019/006004 WO2020170323A1 (en) 2019-02-19 2019-02-19 Structured data generation system and program

Country Status (1)

Country Link
WO (1) WO2020170323A1 (en)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2003085165A (en) * 2001-09-10 2003-03-20 Toshiba Corp Forming method for information forming means, program and device, and information forming method, program and device
JP2008158980A (en) * 2006-12-26 2008-07-10 Nippon Shoken Technology Kk Information provision device and information provision method


Legal Events

Date Code Title Description
NENP Non-entry into the national phase

Ref country code: DE

NENP Non-entry into the national phase

Ref country code: JP

122 Ep: pct application non-entry in european phase

Ref document number: 19915798

Country of ref document: EP

Kind code of ref document: A1