JP6800478B2

JP6800478B2 - Evaluation program for component keywords that make up a Web page

Info

Publication number: JP6800478B2
Application number: JP2016246906A
Authority: JP
Inventors: 武郡司; 実菜心加茂; 努北口
Original assignee: 株式会社エフティー
Priority date: 2016-12-20
Filing date: 2016-12-20
Publication date: 2020-12-16
Anticipated expiration: 2036-12-20
Also published as: JP2018101283A

Description

本発明は、Ｗｅｂページを構成する成分キーワードの評価プログラムに関し、具体的には、人工知能によって進化する検索エンジンが判断するユーザー嗜好性と単語間の関連性を知るシステムを構築するためのコンピュータプログラムに関するものである。 The present invention relates to an evaluation program for component keywords constituting a Web page, and specifically, a computer program for constructing a system for knowing user preference and the relationship between words determined by a search engine evolving by artificial intelligence. It is about.

Ｗｅｂページ（インターネットホームページ等）を検索するときに、検索クエリ（検索時に入力する単語やフレーズ）を所定の入力エリアに入力する。検索すると、検索されたＷｅｂページが上位から順に表示される。時には何十ページ以上にもわたって検索結果が表示される。ここでＷｅｂページを制作する側にとって重要なことは、出来るだけ上位に自分の制作したＷｅｂページが表示されるように、Ｗｅｂページを制作することである。１ページ目の上位に来れば来るほど、アクセスされる機会が多いと考えられるからである。 When searching a Web page (Internet homepage, etc.), a search query (words and phrases to be input at the time of search) is input in a predetermined input area. When you search, the searched Web pages are displayed in order from the top. Sometimes search results are displayed over dozens of pages. Here, what is important for the side that creates the Web page is to create the Web page so that the Web page created by oneself is displayed as high as possible. This is because it is considered that the higher the page is, the more chances it will be accessed.

従前は、検索クエリに含まれる単語が多く含まれるＷｅｂページが上位の検索結果に表示されていた。従って、Ｗｅｂページ制作者も、目的とするキーワードを設定して、その目的キーワードが多く含むような文章を作成するなどの方法でＷｅｂページを制作していた。 Previously, Web pages containing many words included in search queries were displayed in the top search results. Therefore, the Web page creator also creates a Web page by setting a target keyword and creating a sentence containing a large number of the target keywords.

しかしながら、近年の検索エンジン（例えば、Ｇｏｏｇｌｅ）は、下記非特許文献１に示すように、人工知能を取り入れており、上記のような単純な方法では対応しきれなくなっている。この点は、概ね次の２点に集約することができる。 However, recent search engines (for example, Google) have adopted artificial intelligence as shown in Non-Patent Document 1 below, and the simple method as described above cannot cope with it. This point can be roughly summarized in the following two points.

（１）１つめは、ユーザーの操作情報（閲覧履歴）から、良いＷｅｂページの特徴を学習する精度が向上した結果、検索クエリ（キーワード）ごとにアルゴリズムが異なるようになったことである。 (1) First, as a result of improving the accuracy of learning the features of a good Web page from the user's operation information (browsing history), the algorithm has become different for each search query (keyword).

例えば、「弁理士東京」というキーワードでＧｏｏｇｌｅ検索すると、図１のような画面が表示される。その検索結果において、符号Ａのように地理的な情報（Ｇｏｏｇｌｅサービスの１つ）が含まれている。これは、検索エンジンの人工知能が働き、検索クエリ「弁理士東京」の場合、地理的情報が重要であると学習した結果である。 For example, if you do a Google search with the keyword "patent attorney Tokyo", the screen shown in FIG. 1 will be displayed. In the search result, geographical information (one of the Google services) is included like reference numeral A. This is the result of learning that geographical information is important in the case of the search query "Patent Attorney Tokyo" due to the artificial intelligence of the search engine.

また、図１の符号Ｂで示される検索オプションメニューには、ニュース、地図、画像、ショッピングのリンクが表示されている。この表示も検索エンジンの人工知能が働き、これらのニーズがあると判断した結果なされたものである。 In addition, news, maps, images, and shopping links are displayed in the search option menu indicated by reference numeral B in FIG. This display is also made as a result of judging that there are these needs due to the artificial intelligence of the search engine.

従って、検索クエリ「弁理士東京」でＷｅｂページを上位に表示したい場合は、地図（地理的な情報）を必ず含むように記載し、さらに、ニュース（＝フレッシュな情報＝更新性）、画像（＝顔写真等）、ショッピング（＝書籍等）に関する情報を記載したコンテンツを制作することが、検索エンジンの人工知能に対応するためのＷｅｂページ制作ということができる。 Therefore, if you want to display the Web page at the top of the search query "Attorney Tokyo", be sure to include the map (geographical information), and also the news (= fresh information = updateability) and images ( Creating content that describes information related to (= face photo, etc.) and shopping (= books, etc.) can be said to be the creation of a Web page that corresponds to the artificial intelligence of a search engine.

つぎに、「弁理士大阪」というキーワードでＧｏｏｇｌｅ検索すると、図２のような画面が表示される。ここで符号Ｃにより示されるのがナレッジグラフカードである。ナレッジグラフとは、情報を単なる文字列ではなく、現実世界の関係性で捉えようとする検索エンジンの取り組みであり、キーワードに対して意味づけをすることである。上記の場合、検索エンジンの人工知能は「弁理士大阪」＝「日本弁理士会近畿支部」のことである、と意味づけをして認識したことが分かる。 Next, when a Google search is performed using the keyword "patent attorney Osaka", a screen as shown in FIG. 2 is displayed. Here, the knowledge graph card is indicated by the reference numeral C. The Knowledge Graph is a search engine's effort to capture information not just as a character string but as a relationship in the real world, and to give meaning to keywords. In the above case, it can be seen that the artificial intelligence of the search engine was recognized by meaning that "Patent Attorneys Osaka" = "Japan Patent Attorneys Association Kinki Branch".

従って、検索クエリ「弁理士大阪」でＷｅｂページを上位に表示したい場合は、日本弁理士会近畿支部について大きく言及し、ニュース、ショッピング、画像に関する情報を記載したコンテンツを制作することが、検索エンジンの人工知能に対応するためのＷｅｂページ制作ということになる。 Therefore, if you want to display a Web page at the top of the search query "Patent Attorneys Osaka", you can refer to the Japan Patent Attorneys Association Kinki Branch and create content that describes news, shopping, and images. This means creating a Web page to support the artificial intelligence of.

つぎに、「通天閣」というキーワードでＧｏｏｇｌｅ検索すると、図３のような画面が表示される。ここで、符号Ｄにより示されるのが、画像情報（Ｇｏｏｇｌｅサービスの１つ）が含まれている。符号Ｂで示される検索オプションメニューには、地図、画像、ニュース、動画のリンクが表示されている。符号Ｃで示されるナレッジグラフカードには、観光名所であること、所在地、営業時間等を意味づけして認識している。 Next, if you do a Google search with the keyword "Tsutenkaku", the screen shown in FIG. 3 will be displayed. Here, what is indicated by the reference numeral D includes image information (one of the Google services). Links to maps, images, news, and videos are displayed in the search option menu indicated by reference numeral B. The Knowledge Graph card indicated by the symbol C means that it is a tourist attraction, its location, business hours, and the like.

従って、検索クエリ「通天閣」でＷｅｂページを上位に表示したい場合は、画像情報を豊富に含み、地図、ニュースについて言及し、動画を載せ、意味として認識されている観光情報にも言及したコンテンツを制作することが、検索エンジンの人工知能に対応するためのＷｅｂページ制作ということになる。 Therefore, if you want to display a web page at the top of the search query "Tsutenkaku", include a wealth of image information, mention maps and news, post videos, and include content that also mentions tourist information that is recognized as meaning. Producing is the creation of a Web page to support the artificial intelligence of search engines.

以上の結果から、検索クエリ（キーワード）ごとにアルゴリズムが異なるようになったため、その点に対応してＷｅｂページを制作する必要がある。 From the above results, the algorithm is different for each search query (keyword), and it is necessary to create a Web page corresponding to that point.

（２）２つめは、Ｗｅｂページに記載された文章の文脈や意味を学習するようになった結果、未知の言葉に関する学習をするようになった点である。 (2) The second point is that as a result of learning the context and meaning of sentences described on Web pages, we have come to learn about unknown words.

人工知能のアルゴリズムに対応するためには、検索エンジンによって言葉の意味がどのように認識されているかを把握する必要がある。従前の人工知能を取り入れる前の検索エンジンは、ユーザーによって入力される検索クエリに含まれる単語を多く含むＷｅｂページを検索結果として表示していた。 In order to support artificial intelligence algorithms, it is necessary to understand how search engines recognize the meaning of words. Prior to incorporating conventional artificial intelligence, search engines displayed Web pages containing many words contained in search queries entered by users as search results.

しかしながら、人工知能を取り入れた検索エンジンは、ベクトル空間や特異値分解という数理的なアプローチで、Ｗｅｂページの文章を黙示的に類推している。そして、検索エンジンがＷｅｂページの文章の意味を解析するということは、文章に含まれる単語の成分から表現したいであろうキーワード（単語）の意味を類推することである。 However, search engines that incorporate artificial intelligence implicitly infer the text of Web pages using a mathematical approach such as vector space and singular value decomposition. Then, when a search engine analyzes the meaning of a sentence on a Web page, it infers the meaning of a keyword (word) that one would like to express from the components of the word contained in the sentence.

一方、Ｗｅｂページを制作する人間側が人工知能検索エンジンに対応するということは、Ｗｅｂページの文章で表現しようとしているテーマ（＝キーワード、単語）を因数分解し、その成分となる単語（以下、「成分キーワード」と定義する。）を含む文章をライティングすることである。従って、単語（成分キーワード）の選択が最も重要な要素になる。その事例を以下説明する。 On the other hand, the fact that the human side that creates a Web page is compatible with an artificial intelligence search engine means that the theme (= keyword, word) that is being expressed in the text of the Web page is factorized and the word that is the component (hereinafter, "" It is to write a sentence containing "component keyword". Therefore, the selection of words (component keywords) is the most important factor. The case will be described below.

図４は、検索クエリ「小麦粉卵豚肉ソース鉄板」での検索結果を表示する画面である。すなわち、「お好み焼き」をタイトルに含むＷｅｂページが検索されている。また図５は、検索クエリ「小麦粉卵豚肉ニラ」での検索結果を表示する画面である。すなわち、「チヂミ」をタイトルに含むＷｅｂページが検索されている。 FIG. 4 is a screen for displaying the search results of the search query “flour egg pork sauce iron plate”. That is, a Web page containing "Okonomiyaki" in the title is being searched. Further, FIG. 5 is a screen for displaying the search results of the search query “flour, egg, pork, leek”. That is, a Web page containing "Chijimi" in the title is being searched.

以上のことから、人工知能検索エンジンは、検索クエリに含まれる単語と、検索結果が指し示す目的キーワード（上記の場合、「お好み焼き」「チヂミ」）との相関が高いと判断している。すなわち、目的キーワードがテーマとなるＷｅｂページの文章に含まれる単語の成分から表現したいであろうキーワード（単語）の意味を類推した結果、「小麦粉卵豚肉ソース鉄板」は「お好み焼き」のことを示しており、「小麦粉卵豚肉ニラ」は「チヂミ」を示していると意味づけて認識しようとしていることが分かる。 From the above, the artificial intelligence search engine determines that the word included in the search query has a high correlation with the target keyword (in the above case, "okonomiyaki" and "chijimi") pointed to by the search result. In other words, as a result of inferring the meaning of the keyword (word) that you would like to express from the components of the word contained in the text of the Web page with the target keyword as the theme, "flour egg pork sauce iron plate" indicates "okonomiyaki". It can be seen that "flour, egg, pork, and leek" are meant to indicate "chijimi" and are being recognized.

従って、目的キーワード「お好み焼き」でＷｅｂページを上位に表示したい場合は、相関が高い成分キーワード群「小麦粉卵豚肉ソース鉄板」を、目的キーワード「チヂミ」で上位に表示したい場合は、同様に成分キーワード群「小麦粉卵豚肉ニラ」を使用して文章のライティングをすることが重要である。ここで「小麦粉卵豚肉」はお好み焼きとチヂミで共通するが、「ソース」「鉄板」は目的キーワード「お好み焼き」を決定づける特徴的な言葉であり、「ニラ」は目的キーワード「チヂミ」を決定づける特徴的な言葉である。成分キーワードのうち、特に、これらの言葉を「想起語」と定義する。 Therefore, if you want to display the Web page at the top with the target keyword "Okonomiyaki", the component keyword group "flour egg pork sauce iron plate" with high correlation is displayed at the top with the target keyword "Chijimi". It is important to write the text using the group "flour egg pork leek". Here, "flour egg pork" is common to okonomiyaki and chijimi, but "source" and "iron plate" are characteristic words that determine the target keyword "okonomiyaki", and "leek" is a characteristic word that determines the target keyword "chijimi". Words. Of the component keywords, these words are defined as "recollection words".

例えば、「ソース」「鉄板」の２つだけのキーワードを用いてＧｏｏｇｌｅ検索した場合にも、図４Ａに示すように「お好み焼き」に関するＷｅｂサイトが検索結果に表示され、「ニラ」だけで検索した場合も図５Ａに示すように、「チヂミ」に関するＷｅｂサイトが検索結果に表示される。 For example, even when a Google search is performed using only two keywords, "source" and "iron plate", a website related to "okonomiyaki" is displayed in the search results as shown in FIG. 4A, and the search is performed using only "leek". In this case as well, as shown in FIG. 5A, the website related to "Chijimi" is displayed in the search results.

日本経済新聞、「グーグル、数カ月前から検索エンジンに人工知能を導入」［２０１５年１０月２８日付電子版］［平成２８年１１月２９日検索］、インターネット（URL: http://www.nikkei.com/article/DGXMZO93305300X21C15A0000000/）Nihon Keizai Shimbun, "Google introduces artificial intelligence into search engines for several months" [electronic version dated October 28, 2015] [searched November 29, 2016], Internet (URL: http://www.nikkei) .com / article / DGXMZO93305300X21C15A0000000 /)

このような成分キーワードの選択は、Ｗｅｂページ制作の熟練者であれば、直感的に選択することも可能であるが、そうでない者が制作する場合は、キーワードの選択が難しく、試行錯誤で行うにも相当の時間を要する。また、仮に熟練者であったとしても、必ずしも選択した成分キーワードが最適かどうかを判断することは困難である。 If you are an expert in Web page production, you can intuitively select such component keywords, but if you are not, it is difficult to select keywords, and you should do it by trial and error. It also takes a considerable amount of time. Moreover, even if the person is an expert, it is difficult to determine whether or not the selected component keyword is optimal.

本発明は上記実情に鑑みてなされたものであり、その課題は、人工知能によって進化する検索エンジンに対応したＷｅｂページを制作するに際しての成分キーワードを効率よく選択して評価を行い、Ｗｅｂページ制作の品質向上に資することが可能な成分キーワードの評価プログラムを提供することである。 The present invention has been made in view of the above circumstances, and the problem is to efficiently select and evaluate component keywords when creating a Web page corresponding to a search engine that evolves by artificial intelligence, and to create a Web page. It is to provide an evaluation program of ingredient keywords that can contribute to the quality improvement of.

上記課題を解決するため本発明に係る成分キーワードの評価プログラムは、
Ｗｅｂページを検索するための目的キーワードの入力を受け付けるステップと、
入力された目的キーワードに基づいて、Ｗｅｂページを構成すべき言葉を成分キーワードとして自動抽出するステップと、
抽出された個々の成分キーワードと目的キーワードとの相関度を第１評価値として演算するステップと、
抽出された成分キーワードと前記第１評価値とを一覧表示させるステップと、
前記一覧表示に基づいて特定の成分キーワードを選択するステップと、
選択された全ての成分キーワードと目的キーワードとの相関度を第２評価値として演算するステップと、
演算された第２評価値を画面表示させるステップと、
をコンピュータに実行させることを特徴とするものである。 In order to solve the above problems, the component keyword evaluation program according to the present invention is
Steps to accept input of target keywords for searching web pages,
A step to automatically extract words that should compose a Web page as component keywords based on the entered target keyword,
The step of calculating the degree of correlation between the extracted individual component keywords and the target keyword as the first evaluation value, and
A step of displaying a list of the extracted component keywords and the first evaluation value, and
Steps to select a specific ingredient keyword based on the list display, and
The step of calculating the degree of correlation between all the selected component keywords and the target keyword as the second evaluation value, and
The step of displaying the calculated second evaluation value on the screen,
Is characterized by having a computer execute the above.

かかる構成による評価プログラムの作用・効果を以下説明する。まず、入力された目的キーワードに基づいて、Ｗｅｂページを構成すべき言葉を成分キーワードとして自動抽出するステップを有する。目的キーワードとは、検索エンジンに認識させたいキーワードであり、ユーザーがホームページを検索するときに検索クエリに入力されるキーワードである。次いで、入力された目的キーワードに基づいて、Ｗｅｂページを構成すべき言葉を成分キーワードとして自動抽出するステップを有する。成分キーワードとは、Ｗｅｂページを制作するときにタイトルやコンテンツ（説明文などの文章等）で使用される言葉であり、複数（多数）の成分キーワードが自動抽出される。なお、「言葉」の概念には「単語」のみならず単語と単語の組み合わせ等による文章等が含まれるものとする。 The operation and effect of the evaluation program with such a configuration will be described below. First, it has a step of automatically extracting words that should constitute a Web page as component keywords based on the input target keyword. The target keyword is a keyword that a search engine wants to recognize, and is a keyword that is entered in a search query when a user searches a home page. Next, it has a step of automatically extracting words that should constitute a Web page as component keywords based on the input target keyword. The component keyword is a word used in a title or content (sentence such as a description) when creating a Web page, and a plurality (many) component keywords are automatically extracted. It should be noted that the concept of "word" includes not only "word" but also sentences and the like based on words and word combinations.

次に、抽出された個々の成分キーワードと目的キーワードとの相関度を第１評価値として演算し、それぞれの成分キーワードに対する前記第１評価値とを一覧表示させる。相関度が高ければ高いほど、その成分キーワードが含まれるようにＷｅｂページを制作すればユーザーが検索したときに上位に表示される可能性が高くなる。相関度は第１評価値として演算され、例えば、数値が高いほど相関度が高くなるような演算式を設定することができる。ユーザーは、一覧表示をみて、どの成分キーワードの評価値が高いかを確認することができる。ユーザーは、一覧表示に基づいて特定の成分キーワードの選択が可能である。例えば、第１評価値の高い成分キーワードを適宜選択することができる。あるいは、ユーザーが選択するのではなく、ソフトウェアに分析機能を持たせ、人工知能の採用等により、自動的にソフトウェアにより選択するように構成してもよい。次に、選択された全ての成分キーワードと目的キーワードとの相関度を第２評価値として演算し、画面表示させる。この第２評価値ができるだけ高くなるように成分キーワードを選択することで、その選択された成分キーワードを使用してＷｅｂページの制作を行う。その結果、人工知能によって進化する検索エンジンに対応したＷｅｂページを制作するに際しての成分キーワードを効率よく選択して評価を行い、Ｗｅｂページ制作の品質向上に資することが可能になる。 Next, the degree of correlation between the extracted individual component keywords and the target keyword is calculated as the first evaluation value, and the first evaluation value for each component keyword is displayed in a list. The higher the degree of correlation, the higher the possibility that the Web page will be displayed at the top when the user searches if the Web page is created so that the component keyword is included. The degree of correlation is calculated as the first evaluation value, and for example, an arithmetic expression can be set such that the higher the numerical value, the higher the degree of correlation. The user can check which component keyword has a high evaluation value by looking at the list display. The user can select a specific component keyword based on the list display. For example, a component keyword having a high first evaluation value can be appropriately selected. Alternatively, the software may be provided with an analysis function instead of being selected by the user, and may be automatically selected by the software by adopting artificial intelligence or the like. Next, the degree of correlation between all the selected component keywords and the target keyword is calculated as the second evaluation value and displayed on the screen. By selecting a component keyword so that the second evaluation value is as high as possible, a Web page is created using the selected component keyword. As a result, it becomes possible to efficiently select and evaluate component keywords when creating a Web page corresponding to a search engine that evolves by artificial intelligence, and to contribute to improving the quality of Web page production.

本発明に係る、前記第１評価値を演算するステップにおいて、
前記成分キーワードを用いてＷｅｂページを検索するステップと、
検索されたＷｅｂページのタイトルと説明文に目的キーワードが存在するか否かをチェックするステップと、を実行することが好ましい。 In the step of calculating the first evaluation value according to the present invention.
A step of searching a Web page using the component keywords and
It is preferable to execute the step of checking whether or not the target keyword exists in the title and description of the searched Web page.

第１評価値を演算するに際して、成分キーワードを用いてＷｅｂページを検索する。Ｗｅｂページを検索する場合に、人工知能に対応した検索エンジンが用いられる。検索されたＷｅｂページのタイトルと、その説明文がリスト的に表示されるが、そのタイトルと説明文の中に目的キーワードがあるか否かにより判断する。例えば、タイトルに存在するか否か、説明文に存在するか否か、両方に存在するか否か、により評価値を定めることができる。 When calculating the first evaluation value, the Web page is searched using the component keywords. When searching a Web page, a search engine compatible with artificial intelligence is used. The title of the searched Web page and its description are displayed in a list, and it is determined by whether or not there is a target keyword in the title and description. For example, the evaluation value can be determined by whether or not it exists in the title, whether or not it exists in the description, and whether or not it exists in both.

本発明に係る前記第１評価値を演算するステップにおいて、
検索されたＷｅｂページが上位にあるほど評価点数が高くなるような評価テーブルが予め記憶されており、この評価テーブルを用いて演算するステップを有することが好ましい。 In the step of calculating the first evaluation value according to the present invention
An evaluation table is stored in advance so that the higher the searched Web page is, the higher the evaluation score is, and it is preferable to have a step of calculating using this evaluation table.

かかる評価テーブルを用いることで、演算処理を簡単に行うことができる。 By using such an evaluation table, arithmetic processing can be easily performed.

本発明に係る前記第２評価値を演算するステップにおいて、
選択された前記成分キーワード群を用いてＷｅｂページを検索するステップと、
検索されたＷｅｂページのタイトルと説明文に目的キーワードが存在するか否かをチェックするステップと、を実行することが好ましい。 In the step of calculating the second evaluation value according to the present invention
A step of searching a Web page using the selected component keyword group, and
It is preferable to execute the step of checking whether or not the target keyword exists in the title and description of the searched Web page.

選択された成分キーワード群を用いてＷｅｂページを検索する。Ｗｅｂページを検索する場合に、人工知能に対応した検索エンジンが用いられる。検索されたＷｅｂページのタイトルと、その説明文がリスト的に表示されるが、そのタイトルと説明文の中に目的キーワードがあるか否かにより判断する。例えば、タイトルに存在するか否か、説明文に存在するか否か、両方に存在するか否か、により第２評価値を定めることができる。この第２評価値に基づいて、選択された成分キーワードが適格であるか否かを判断することができる。 A Web page is searched using the selected component keyword group. When searching a Web page, a search engine compatible with artificial intelligence is used. The title of the searched Web page and its description are displayed in a list, and it is determined by whether or not there is a target keyword in the title and description. For example, the second evaluation value can be determined depending on whether or not it exists in the title, whether or not it exists in the description, and whether or not it exists in both. Based on this second evaluation value, it can be determined whether or not the selected component keyword is eligible.

本発明に係る前記成分キーワードを自動抽出するステップは、
辞書データベースから前記目的キーワードを検索し、目的キーワードを説明する文章から言葉を抽出するステップを有することが好ましい。 The step of automatically extracting the component keyword according to the present invention is
It is preferable to have a step of searching the target keyword from the dictionary database and extracting words from the sentences explaining the target keyword.

辞書データベースとして、例えば、Ｗｉｋｉｐｅｄｉａを使用することができる。この辞書データベースを利用して、成分キーワードを検索し、成分キーワードの説明文から言葉を抽出する。これにより、成分キーワードを自動抽出することができる。またユーザーの方で成分キーワードを何にするかについて考える必要がない。辞書データベースは上記に限定されるものではなく、他のデータベースを使用してもよい。なお、検索する場合には、人工知能に対応した検索エンジンが用いられる。 As the dictionary database, for example, Wikipedia can be used. This dictionary database is used to search for component keywords and extract words from the description of the component keywords. This makes it possible to automatically extract component keywords. Also, the user does not have to think about what the ingredient keyword should be. The dictionary database is not limited to the above, and other databases may be used. When searching, a search engine compatible with artificial intelligence is used.

本発明に係る前記成分キーワードを自動抽出するステップは、
検索クエリの入力エリアに目的キーワードを入力したときに、サジェストされる言葉を抽出するステップを有することが好ましい。 The step of automatically extracting the component keyword according to the present invention is
It is preferable to have a step of extracting the suggested words when the target keyword is entered in the input area of the search query.

例えば、「通天閣」という用語を入力エリアに入力すると「アクセス」「たこ焼き」「串カツ」・・・などの言葉がサジェストされる。これらの言葉は目的キーワードとの関係性があると考えられ、成分キーワードとして自動抽出することができる。これにより、ユーザーの方で成分キーワードを何にするかについて考える必要がない。 For example, if you enter the word "Tsutenkaku" in the input area, words such as "access," "takoyaki," "kushikatsu," etc. will be suggested. These words are considered to be related to the target keyword and can be automatically extracted as component keywords. This eliminates the need for the user to think about what the ingredient keywords should be.

本発明に係る前記成分キーワードを自動抽出するステップは、
目的キーワード用いてＷｅｂページを検索するステップと、
検索された上位のＷｅｂページをクロールして、Ｗｅｂページを構成する文章から言葉を抽出するステップを有することが好ましい。 The step of automatically extracting the component keyword according to the present invention is
Steps to search a web page using the target keyword,
It is preferable to have a step of crawling the searched upper Web page and extracting words from the sentences constituting the Web page.

目的キーワードを用いてＷｅｂページを検索すると、検索されたＷｅｂページが順に表示される。検索するに際して、人工知能に対応した検索エンジンが用いられる。次いで検索された上位、例えば、１位〜１０位のＷｅｂページをクロールして、Ｗｅｂページを構成する文章から言葉を抽出する。かかる言葉は、目的キーワードとの関係性があると考えられ、成分キーワードとして自動抽出することができる。これにより、ユーザーの方で成分キーワードを何にするかについて考える必要がない。 When a Web page is searched using the target keyword, the searched Web pages are displayed in order. When searching, a search engine that supports artificial intelligence is used. Next, the searched high-ranking Web pages, for example, the 1st to 10th Web pages are crawled, and words are extracted from the sentences constituting the Web page. Such words are considered to be related to the target keyword and can be automatically extracted as component keywords. This eliminates the need for the user to think about what the ingredient keywords should be.

なお、成分キーワードを自動抽出するステップとして、３通りを挙げたが、これらをすべて使用してもよく、１つまたは適宜の２つを用いてもよく、さらに別のステップを付け加えて自動抽出するようにしてもよい。 Although three steps are given as the steps for automatically extracting the component keywords, all of them may be used, one or two as appropriate may be used, and another step is added for automatic extraction. You may do so.

検索クエリ「弁理士東京」での検索結果を表示する画面を示す図A diagram showing a screen showing search results for the search query "Patent Attorney Tokyo" 検索クエリ「弁理士大阪」での検索結果を表示する画面を示す図A diagram showing a screen showing search results for the search query "Patent Attorney Osaka" 検索クエリ「通天閣」での検索結果を表示する画面を示す図A diagram showing a screen showing the search results of the search query "Tsutenkaku" 検索クエリ「小麦粉卵豚肉ソース鉄板」での検索結果を表示する画面を示す図The figure which shows the screen which displays the search result in the search query "flour egg pork sauce iron plate" 検索クエリ「ソース鉄板」での検索結果を表示する画面を示す図The figure which shows the screen which displays the search result in the search query "source iron plate" 検索クエリ「小麦粉卵豚肉ニラ」での検索結果を表示する画面を示す図Diagram showing a screen showing search results for the search query "flour, egg, pork, leek" 検索クエリ「ニラ」での検索結果を表示する画面を示す図Diagram showing a screen showing the search results for the search query "Chive" 評価プログラムを用いたシステム全体の構成を示すブロック図Block diagram showing the overall system configuration using the evaluation program 評価プログラムを使用するときの手順を示すフローチャートFlowchart showing the procedure when using the evaluation program 評価プログラムを使用するときの手順を示すフローチャートFlowchart showing the procedure when using the evaluation program 評価プログラムを使用するときの手順を示すフローチャートFlowchart showing the procedure when using the evaluation program 目的キーワードの入力画面を示す図Diagram showing the input screen of the target keyword サジェストの例を示す図Diagram showing an example of a suggestion 点数マスタテーブル（評価テーブル）の一例を示す図The figure which shows an example of the score master table (evaluation table) 点数マスタテーブルによる演算例を示す図Diagram showing an example of calculation using the score master table 点数マスタテーブルによる演算例を示す図Diagram showing an example of calculation using the score master table 成分キーワード候補の判定結果の画面表示例を示す図The figure which shows the screen display example of the judgment result of a component keyword candidate 選択された成分キーワードの入力画面を示す図Diagram showing the input screen of the selected component keyword 成分キーワードの中から想起語候補を自動判定して識別表示した例を示す図The figure which shows the example which automatically judged the recall word candidate from the component keywords and identified and displayed it. 選択された成分キーワード候補と目的キーワードの各要素の画面表示例を示す図A diagram showing a screen display example of each element of the selected component keyword candidate and the target keyword.

本発明に係る成分キーワードの評価プログラムの好適な実施形態を図面を用いて説明する。図６は、本発明に係る評価プログラムを用いたシステム全体の構成を示すブロック図である。 A preferred embodiment of the component keyword evaluation program according to the present invention will be described with reference to the drawings. FIG. 6 is a block diagram showing a configuration of the entire system using the evaluation program according to the present invention.

＜システム全体の構成＞
図６において、Ｗｅｂサーバー１００には、種々のプログラム（コンピュータソフトウェア）が格納されている。キーワードエクスプローラー１０１（本発明に係る評価プログラムに相当）は、Ｇｏｏｇｌｅ情報取得システム１０１ａ、Ｗｉｋｉｐｅｄｉａ取得システム１０１ｂ、サイト解析システム１０１ｃ、成分キーワード判定システム１０１ｄを少なくとも備えている。各システムの機能については後述する。 <Overall system configuration>
In FIG. 6, various programs (computer software) are stored in the Web server 100. The keyword explorer 101 (corresponding to the evaluation program according to the present invention) includes at least a Google information acquisition system 101a, a Wikipedia acquisition system 101b, a site analysis system 101c, and a component keyword determination system 101d. The functions of each system will be described later.

また、Ｗｅｂサーバー１００には、会員管理システム１０２が格納されており、認証システム１０２ａと外部接続管理システム１０２ｂを少なくとも備えている。本発明に係る評価プログラムを利用できるのは、会員登録をした者に限定されるので、会員であるか否かを確認するための認証システム１０２ａが設けられている。外部接続管理システム１０２ｂは、会員データ等を外部サーバーに保存・管理させるためのシステムである。 Further, the Web server 100 stores a member management system 102, and at least includes an authentication system 102a and an external connection management system 102b. Since the evaluation program according to the present invention can be used only by a person who has registered as a member, an authentication system 102a for confirming whether or not the member is a member is provided. The external connection management system 102b is a system for storing and managing member data and the like in an external server.

ＤＢサーバー２００は、種々のデータを格納するためのデータベースを備えたサーバーであり、Ｗｅｂサーバー１００と連携して作動する。クロールサーバー２１０には、クロールシステムが格納されている。インターネット上には、種々のＷｅｂサイトが存在する。会員登録されたユーザーは、例えば、ＰＣ（パーソナルコンピュータ）に格納されたクライアント・ブラウザ（例えば、インターネットエクスプローラ）を用いて、インターネットを介して、Ｗｅｂサーバー１００にアクセスして、本発明に係る評価プログラムを利用することができる。 The DB server 200 is a server provided with a database for storing various data, and operates in cooperation with the Web server 100. The crawl server 210 stores the crawl system. There are various websites on the Internet. The user registered as a member accesses the Web server 100 via the Internet using, for example, a client browser (for example, Internet Explorer) stored in a PC (personal computer), and the evaluation program according to the present invention. Can be used.

ＧｏｏｇｌｅＡＰＩ検索サービスは、サーバー２４０に格納されており、Ｇｏｏｇｌｅが提供する検索エンジンを用いた検索サービスを提供する。なお、本実施形態では、検索エンジンとしてＧｏｏｇｌｅを例に挙げて説明するが、本発明としては、検索エンジンはＧｏｏｇｌｅに限定されるものではなく、人工知能に対応した他の検索エンジンを用いる場合にも適用される。 The Google API search service is stored in the server 240, and provides a search service using a search engine provided by Google. In the present embodiment, Google will be used as an example as a search engine, but the present invention is not limited to Google, and when another search engine compatible with artificial intelligence is used, the search engine is not limited to Google. Also applies.

Ｗｉｋｉｐｅｄｉａは、辞書データベースであり、サーバー２５０に格納されている。辞書データベースについても、Ｗｉｋｉｐｅｄｉａに限定されるものではなく、他のデータベースを用いてもよい。また、複数の辞書データベースを用いてもよい。 Wikipedia is a dictionary database and is stored in the server 250. The dictionary database is not limited to Wikipedia, and other databases may be used. Moreover, you may use a plurality of dictionary databases.

＜評価プログラムの使用手順＞
つぎに、本発明に係る評価プログラムを使用するときの手順を図７Ａ，７Ｂ，７Ｃのフローチャートにより説明する。 <Procedure for using the evaluation program>
Next, the procedure when using the evaluation program according to the present invention will be described with reference to the flowcharts of FIGS. 7A, 7B, and 7C.

この評価プログラムを用いてＷｅｂページの制作を使用しようとするユーザーは、ＰＣ等を用いてインターネットを介してＷｅｂサーバー１００にアクセスし、キーワードエクスプローラー１０１を開く。利用するに際して、会員管理システム１０２による会員認証が行われるが、公知のシステムであるので説明は省略する。 A user who intends to use the production of a Web page using this evaluation program accesses the Web server 100 via the Internet using a PC or the like and opens the keyword explorer 101. When using it, member authentication is performed by the member management system 102, but since it is a known system, the description thereof will be omitted.

図８に示すような画面がユーザーのコンピュータ画面に表示される。なお、説明の便宜のため、画面構成の一部のみを概略化して示す。以下、他の画面表示も同様である。 A screen as shown in FIG. 8 is displayed on the user's computer screen. For convenience of explanation, only a part of the screen configuration is outlined. Hereinafter, the same applies to other screen displays.

入力エリア１０に、制作するＷｅｂサイトの目的キーワードを入力する（Ｓ１）。目的キーワードとは、Ｗｅｂサイトのテーマとなるものであり、検索エンジンに認識させたいキーワードになる。入力された目的キーワードは、記憶装置にデータベースとして保存・登録される（図７Ｂ）。 Enter the target keyword of the website to be produced in the input area 10 (S1). The target keyword is the theme of the website and is the keyword that the search engine wants to recognize. The input target keyword is stored and registered in the storage device as a database (FIG. 7B).

別途キーワードリストがある場合は読み込むようにする（Ｓ２）。キーワードリストは、クリックボタン１１を押すことでＤＢサーバー２００にアクセスして、別途キーワードリストを読み込む（Ｓ２ａ）。なお、このステップＳ２は、なくてもよい。 If there is a separate keyword list, read it (S2). The keyword list accesses the DB server 200 by pressing the click button 11 and reads the keyword list separately (S2a). Note that this step S2 is not necessary.

つぎに、Ｇｏｏｇｌｅによる検索結果に含まれるＧｏｏｇｌｅサービス（地図やニュース等）を自動取得する（Ｓ３）。取得された場合は、「重要コンテンツ要素」として定義され、データベースに保存される。これは図１のＡでは地図が例示される。このＧｏｏｇｌｅサービスは、Ｇｏｏｇｌｅの人工知能が、入力された検索クエリからユーザーのニーズを反映させたものであり、本来は、順位１位のＷｅｂサイトの表示エリアを削ってまで表示している。従って、検索エンジンが重要視しているコンテンツ要素と判断することができる。このステップＳ３は、Ｇｏｏｇｌｅ情報取得システム１０１ａによって実行される。 Next, the Google service (map, news, etc.) included in the search result by Google is automatically acquired (S3). When retrieved, it is defined as an "important content element" and stored in the database. This is illustrated by A in FIG. In this Google service, Google's artificial intelligence reflects the needs of users from the input search query, and originally, the display area of the website ranked first is cut off and displayed. Therefore, it can be determined that the content element is regarded as important by the search engine. This step S3 is executed by the Google information acquisition system 101a.

つぎに、Ｇｏｏｇｌｅによる検索結果の検索オプションメニュー項目を自動取得する（Ｓ４）。取得された項目は「コンテンツ要素」として定義され、これに即したコンテンツを作成すべき指標として、データベースに保存される。検索オプションメニュー項目は、図１の符号Ｂで示すエリアに表示される項目である。これも検索エンジンの人工知能が働き、これらのニーズがあると判断した結果が表示されている。このステップＳ４は、Ｇｏｏｇｌｅ情報取得システム１０１ａによって実行される。 Next, the search option menu item of the search result by Google is automatically acquired (S4). The acquired item is defined as a "content element" and is stored in the database as an index for creating content according to this. The search option menu item is an item displayed in the area indicated by reference numeral B in FIG. The artificial intelligence of the search engine works here as well, and the results of determining that there are these needs are displayed. This step S4 is executed by the Google information acquisition system 101a.

つぎに、Ｇｏｏｇｌｅによる検索結果のナレッジグラフカードを自動取得する（Ｓ５）。これは図２の符号Ｃに例示される。Ｇｏｏｇｌｅの検索結果にナレッジグラフカードが表示される場合、入力された検索クエリを指し示す意味情報として機能していることを示す。この情報は、「意味要素」として区分し、これに即したコンテンツを作成すべき指標として、データベースに保存する。このステップＳ５は、最終的にＧｏｏｇｌｅ情報取得システム１０１ａによって実行される。 Next, the knowledge graph card of the search result by Google is automatically acquired (S5). This is illustrated by reference numeral C in FIG. When the Knowledge Graph card is displayed in the Google search results, it indicates that it is functioning as semantic information pointing to the entered search query. This information is classified as "semantic elements" and stored in the database as an index for creating content corresponding to this. This step S5 is finally executed by the Google information acquisition system 101a.

つぎに目的キーワードのＷｉｋｉｐｅｄｉａから単語を自動抽出する（Ｓ６）。Ｇｏｏｇｌｅは、Ｗｉｋｉｐｅｄｉａのデータベースを使用していることを公表しており、Ｗｉｋｉｐｅｄｉａから抽出された単語は、高い確率で成分キーワードの候補になりうる。これらの単語は、成分キーワード候補としてデータベースに保存される（Ｓ６ａ）。このステップＳ６は、Ｗｉｋｉｐｅｄｉａ取得システム１０１ｂによって実行される。Ｗｉｋｉｐｅｄｉａ取得システム１０１ｂはデータベース２５０にアクセスして単語を取得する。 Next, words are automatically extracted from Wikipedia, which is the target keyword (S6). Google has announced that it uses Wikipedia's database, and words extracted from Wikipedia can be candidates for component keywords with high probability. These words are stored in the database as component keyword candidates (S6a). This step S6 is executed by the Wikipedia acquisition system 101b. The Wikipedia acquisition system 101b accesses database 250 to acquire words.

単語の抽出を行う場合、英語の文章であれば、単語ごとに分かれているので容易であるが、日本語文章の場合は、単語ごとに分かれていないため、形態素解析の技術を用いて文章から単語を分解する。なお、形態素解析は公知の解析方法である。 When extracting words, it is easy if it is an English sentence because it is divided by word, but if it is a Japanese sentence, it is not divided by word, so it is easy to extract from the sentence using morphological analysis technology. Break down words. The morphological analysis is a known analysis method.

つぎに、目的キーワードを含む検索クエリ（Ｇｏｏｇｌｅサジェスト）から単語を自動抽出する（Ｓ７）。図９は、検索クエリに「新大阪」と入力した例を示すが、その下に一覧的表示で「駅」「ランチ」「ホテル」・・・という単語がサジェストされている。Ｇｏｏｇｌｅサジェストとは、Ｇｏｏｇｌｅ検索画面にて単語を入力した際に候補が表示される機能のことをいう。これらの単語は、ユーザーのニーズそのものであり、高い確率で成分キーワードの候補になりうるので、これらの単語を成分キーワード候補としてデータベースに保存する（Ｓ６ａ）。このステップＳ７は、Ｇｏｏｇｌｅ情報取得システム１０１ａによって実行される。 Next, a word is automatically extracted from a search query (Google suggestion) including a target keyword (S7). FIG. 9 shows an example in which "Shin-Osaka" is entered in the search query, and the words "station", "lunch", "hotel", etc. are suggested below it in a list display. Google suggestion is a function that displays candidates when a word is entered on the Google search screen. Since these words are the user's needs themselves and can be candidates for component keywords with high probability, these words are stored in the database as component keyword candidates (S6a). This step S7 is executed by the Google information acquisition system 101a.

つぎに、目的キーワードのＧｏｏｇｌｅ検索結果の上位のＷｅｂサイトをクロールし、単語と特徴語を自動抽出する（Ｓ８）。上位のＷｅｂサイトで使用されている単語は、Ｇｏｏｇｌｅが成分キーワードとして一定の評価をしている単語が使用されていると考えられ、高い確率で成分キーワードの候補になりうる。従って、この情報をインターネット上の当該Ｗｅｂサイトから成分キーワード候補として取得し、データベースに保存する（Ｓ６ａ）。文章から単語を分解する手法は前述の形態素解析の技術を使用する。このステップＳ８は、クロールサーバー２１０上のクロールシステムによって実行される。 Next, the websites at the top of the Google search results of the target keyword are crawled, and words and feature words are automatically extracted (S8). It is considered that the words used on the higher-ranking websites are words that Google has evaluated to a certain degree as component keywords, and can be candidates for component keywords with high probability. Therefore, this information is acquired as a component keyword candidate from the website on the Internet and stored in the database (S6a). The method of decomposing words from sentences uses the above-mentioned morphological analysis technique. This step S8 is performed by the crawl system on the crawl server 210.

一方、Ｇｏｏｇｌｅは、Ｗｅｂサイトの制作に独創性を求めており、一概に上位に表示されたＷｅｂサイトの単語ばかりを使用することは好ましくない。そこで、上位表示された各Ｗｅｂサイト内のページごとに特徴語を取得し、そのＷｅｂサイト内で単語がどのような特徴づけをされているか自動計算する。同じような特徴語の構成にせず、作成するＷｅｂサイトが独自の特徴語として単語を構成するための指標として活用する。特徴語の取得はＴＦ−ＩＤＦ法を使用して自動取得する。取得した特徴語はデータベースに保存する（Ｓ８ａ）。ＴＦ−ＩＤＦ法は、公知の手法であり、情報検索や文書推薦などで幅広く利用される特徴量の指標である。なお、この特徴語の取得は、サイト解析システム１０１ｃの機能により実行される。 On the other hand, Google demands originality in the production of websites, and it is not preferable to use only the words of websites that are generally displayed at the top. Therefore, a feature word is acquired for each page in each of the top-displayed websites, and what kind of characterization of the word in the website is automatically calculated. Instead of constructing similar feature words, the website to be created uses it as an index for composing words as unique feature words. Feature words are automatically acquired using the TF-IDF method. The acquired feature words are stored in the database (S8a). The TF-IDF method is a known method and is an index of features widely used in information retrieval and document recommendation. The acquisition of this feature word is executed by the function of the site analysis system 101c.

つぎに、ステップＳ６ａにてデータベースに保存された成分キーワード候補を単体で自動判定する（Ｓ９）。このステップ９は、キーワードエクスプローラー１０１の成分キーワード判定システム１０１ｄにより実行される。この判定時に用いられるロジックを具体的に説明すると、成分キーワードを検索クエリとしてＧｏｏｇｌｅ検索を行う。検索結果あげられたＷｅｂサイトのタイトル及び説明文に目的キーワードが含まれるか否かで判定する。 Next, the component keyword candidates saved in the database in step S6a are automatically determined by themselves (S9). This step 9 is executed by the component keyword determination system 101d of the keyword explorer 101. To specifically explain the logic used at the time of this determination, a Google search is performed using the component keyword as a search query. It is determined whether or not the target keyword is included in the title and description of the website given as the search result.

検索により得られたＷｅｂサイトのタイトルと説明文は、ユーザーが視認できるダイレクトな情報である。ここに目的キーワードが含まれているか否かは、ユーザーが得たい情報かどうかを判別する上で非常に重要な要素である。Ｇｏｏｇｌｅの人工知能もタイトルと説明文に表示される情報を重要視していると考えられる。 The title and description of the website obtained by the search are direct information that can be visually recognized by the user. Whether or not the target keyword is included here is a very important factor in determining whether or not the information is the information that the user wants to obtain. It is thought that Google's artificial intelligence also attaches great importance to the information displayed in the title and description.

図１０は、判定ロジックに用いられる点数マスタテーブル（評価テーブルの一例）を示す図である。検索結果の順位は１位から１０位までを評価対象とする。ただし、何位までを評価対象にするかは適宜設定することができる。目的キーワードがタイトルに存在する場合、説明文に存在する場合、両方に存在する場合の点数が示されている。両方に存在する場合は、タイトルに存在する場合の点数と説明文に存在する場合の点数を加算するのではなく、別途設定した点数を適用する。 FIG. 10 is a diagram showing a score master table (an example of an evaluation table) used in the determination logic. The ranking of the search results is evaluated from 1st to 10th. However, the number of evaluation targets can be set as appropriate. The score is shown when the target keyword is present in the title, in the description, and in both. If it exists in both, the score set separately is applied instead of adding the score if it exists in the title and the score if it exists in the description.

当然に、上位のＷｅｂサイトに存在するほど、点数は高くなるように設定されている。なお、点数は単純に加算するのではなく、最高点を１００点とするため、１位〜３位までの合計点数の満点（上限値）は４０点、４位〜７位までの合計点数の満点は３５点、８位〜１０位の合計点数の満点は２５点に設定する。なお、具体的な点数設定は、これに限定されるものではなく、適宜設定することができる。 As a matter of course, the higher the website, the higher the score. In addition, the points are not simply added, but the highest point is 100 points, so the maximum score (upper limit) of the total points from 1st to 3rd place is 40 points, and the total points from 4th to 7th place. The maximum score is 35 points, and the total score of 8th to 10th is set to 25 points. The specific score setting is not limited to this, and can be set as appropriate.

図１１は、図１０の点数マスタテーブルを用いた演算例を示す図である。左側の表は、検索結果の例を示し、目的キーワードが５位のＷｅｂサイトのタイトルに存在し、説明文が３位、５位、１０位に存在する場合である。具体的な点数例は右側の表に示される。Ｗｅｂサイトの５位にはタイトルと説明文の両方に存在するので２５点となる。１位〜３位までの合計は１８点、４位〜７位までの合計は２５点、８位〜１０位までの合計は９点であり、合計は５２点である。これらの合計点はいずれも満点（上限値）には届いていないので、それぞれの点数が適用され、最終評価点（第１評価値に相当）も５２点となる。 FIG. 11 is a diagram showing an example of calculation using the score master table of FIG. The table on the left shows an example of search results, in which the target keyword exists in the title of the website in the 5th place and the description exists in the 3rd, 5th, and 10th places. Specific examples of points are shown in the table on the right. The 5th place on the website is 25 points because it exists in both the title and the description. The total of 1st to 3rd place is 18 points, the total of 4th to 7th place is 25 points, the total of 8th to 10th place is 9 points, and the total is 52 points. Since none of these total points have reached the perfect score (upper limit value), each score is applied, and the final evaluation score (corresponding to the first evaluation value) is also 52 points.

図１２は、別の演算例を示す図である。成分キーワード候補を同じようにＧｏｏｇｌｅ検索を行い、１位、２位、３位、５位、７位、９位のＷｅｂサイトのタイトルに目的キーワードが含まれ、１位、３位、５位、６位、９位、１０位の説明文に目的キーワードが存在する場合である。１位〜３位の合計点は８３点であるが、満点を超えているので４０点となる。４位〜７位の合計点は５８点であるが、満点を超えているので５８点となる。８位〜１０位の合計点は２９点であるが満点を超えているので２５点となる。最終的な第１評価値は１００点となる。 FIG. 12 is a diagram showing another calculation example. Google search for component keyword candidates in the same way, and the target keywords are included in the titles of the 1st, 2nd, 3rd, 5th, 7th, and 9th websites, and the 1st, 3rd, 5th, and so on. This is the case when the target keyword exists in the explanations of the 6th, 9th, and 10th places. The total score of the 1st to 3rd places is 83 points, but since it exceeds the perfect score, it is 40 points. The total score of the 4th to 7th places is 58 points, but since it exceeds the perfect score, it is 58 points. The total score of 8th to 10th place is 29 points, but since it exceeds the perfect score, it is 25 points. The final first evaluation value is 100 points.

第１評価値が１００点に近いほど、成分キーワードと目的キーワードの相関度が高いということがいえる。成分キーワード単体の判定結果については、データベースに保存される（Ｓ９ａ）。 It can be said that the closer the first evaluation value is to 100 points, the higher the degree of correlation between the component keyword and the target keyword. The determination result of the component keyword alone is stored in the database (S9a).

つぎに、それぞれの成分キーワード候補の判定結果を画面表示させる（Ｓ１０）。その画面表示例を図１３に示す。この例は、目的キーワードが「新宿」である。関連キーワードという項目の下に、成分キーワードの候補として「映画」「ランチ」「ラーメン」・・・が示されている。ブレインスコアという項目の下に第１評価値が点数として示されている。これが図１０の点数マスタテーブルを用いた判定結果である。一覧表示の一番右側に、個々の成分キーワードに対する選択ボタンが設けられている。 Next, the determination result of each component keyword candidate is displayed on the screen (S10). An example of the screen display is shown in FIG. In this example, the target keyword is "Shinjuku". Under the item of related keywords, "movie", "lunch", "ramen", etc. are shown as candidates for component keywords. The first evaluation value is shown as a score under the item called Brain Score. This is the determination result using the score master table of FIG. On the far right of the list display, there is a selection button for each component keyword.

ユーザーは、この判定結果を見ながら、第１評価値が高いものを選択ボタンをクリックすることで選択することができる。例えば、評価値の高い「ビアガーデン」「映画」「カフェ」を選択できる。あるいは、ユーザーが選択するのではなく、ソフトウェアに分析機能を持たせ、人工知能の採用等により、自動的にソフトウェアにより選択するように構成してもよい。 The user can select the one with the highest first evaluation value by clicking the selection button while looking at the determination result. For example, "beer garden", "movie", and "cafe" with high evaluation values can be selected. Alternatively, the software may be provided with an analysis function instead of being selected by the user, and may be automatically selected by the software by adopting artificial intelligence or the like.

なお、図７Ｂのフローチャートでは、特徴語についてステップＳ９の自動判定を行わないようになっているが、自動抽出された特徴語についてもステップＳ９の自動判定を行うようにしてもよい。 Although the flowchart of FIG. 7B does not automatically determine the feature word in step S9, the automatically extracted feature word may also be automatically determined in step S9.

次に、選択された成分キーワード候補をＧｏｏｇｌｅ検索して自動判定する（Ｓ１１）。図１４は選択された成分キーワードの入力画面である。ここでは成分キーワード（群）として「女子会」「ルミネ」の２つか選択された状態である。目的キーワードは「新宿」である。図１３の選択ボタンをクリックすることで自動的に入力される。あるいは、ユーザーが自ら入力できるようにしてもよい。自動判定のロジックはステップＳ１０の場合と同じであり、同じ点数マスタテーブルが使用される。ステップＳ１０では、単体（１つ）の成分キーワードについての評価であったが、ステップＳ１１では、複数の成分キーワード（複合ワード）についての評価である。図１４において「調べる」ボタンをクリックすると、判定ロジックが動作し、関連性評価が演算・表示される（Ｓ１２）。図１４では、評価値（第２評価値に相当）として１００点（満点）が示されている。また判定結果はデータベースに保存される（Ｓ１１ａ）。 Next, the selected component keyword candidate is Google-searched and automatically determined (S11). FIG. 14 is an input screen for the selected component keyword. Here, two of "women's association" and "LUMINE" are selected as the component keywords (group). The target keyword is "Shinjuku". It is automatically input by clicking the selection button in FIG. Alternatively, the user may be able to input by himself / herself. The logic of automatic determination is the same as in step S10, and the same score master table is used. In step S10, the evaluation was for a single component keyword (one), but in step S11, the evaluation is for a plurality of component keywords (composite words). When the "examine" button is clicked in FIG. 14, the determination logic operates, and the relevance evaluation is calculated and displayed (S12). In FIG. 14, 100 points (perfect score) are shown as the evaluation value (corresponding to the second evaluation value). The determination result is saved in the database (S11a).

図１４Ａは、成分キーワードの中から、目的キーワードを決定づける特徴的な言葉（想起語）の候補を自動判定して識別表示した例を示す。識別表示させる方法としては、有色のマーカーを付する方法、字体やサイズを変える方法等、種々考えられる。この事例では、目的キーワード「チヂミ」に対して、「ニラ」が想起語であると自動判定されている。想起語の判定については、例えば、ステップＳ８で抽出された特徴語を自動的に想起語とすることができる。あるいは、ステップＳ９で自動判定された結果、第１評価値が高い成分キーワードを想起語とすることができる。 FIG. 14A shows an example in which candidates for characteristic words (remembering words) that determine the target keyword are automatically determined and identified from the component keywords. As a method for identifying and displaying, various methods such as attaching a colored marker, changing the font and size, and the like can be considered. In this case, it is automatically determined that "Chive" is a recall word for the target keyword "Chijimi". Regarding the determination of the recall word, for example, the feature word extracted in step S8 can be automatically used as the recall word. Alternatively, as a result of the automatic determination in step S9, the component keyword having a high first evaluation value can be used as the recall word.

成分キーワードの選択を変えながら何度でも第２評価値の演算をすることもでき、これにより、適切な成分キーワードの選択を行うことができる。 The second evaluation value can be calculated as many times as necessary while changing the selection of the component keyword, whereby an appropriate component keyword can be selected.

第２評価値が低い成分キーワード候補は自動又は選択して除外される。例えば、図１４において評価値が低くなるような成分キーワードについては、図１３の一覧表示からユーザーが選択的に除外するか、自動的に除外される（表示されなくなる処理）ようにすることができる。また、図１３の一覧表示をする時にブレインスコア（第１評価値）が０点のものは、最初から表示されないような処理を行ってもよい。 Component keyword candidates with a low second evaluation value are automatically or selected and excluded. For example, a component keyword having a low evaluation value in FIG. 14 can be selectively excluded from the list display in FIG. 13 by the user, or automatically excluded (process to be hidden). .. Further, when displaying the list in FIG. 13, if the brain score (first evaluation value) is 0 points, processing may be performed so that the brain score is not displayed from the beginning.

Ｇｏｏｇｌｅに搭載された人工知能による評価を上げるためには、成分キーワードとして認識され、かつ特徴語として、ＴＦ−ＩＤＦ法が示す数値が高い単語を採用することが重要である。従って、それを満たす情報を表示し、選択または除外する。 In order to raise the evaluation by the artificial intelligence installed in Google, it is important to adopt a word that is recognized as a component keyword and has a high numerical value indicated by the TF-IDF method as a characteristic word. Therefore, the information that satisfies it is displayed and selected or excluded.

つぎに、選択された成分キーワード候補と目的キーワードの各要素を表示する（Ｓ１３）。その画面表示例を図１５に示す。目的キーワードの各要素とは、図１５の上に表示される「重要コンテンツ要素」「コンテンツ要素」「ナレッジグラフカード」のことである。また、その結果はデータベースに保存される（Ｓ１３ａ）。 Next, each element of the selected component keyword candidate and the target keyword is displayed (S13). An example of the screen display is shown in FIG. Each element of the target keyword is an "important content element", a "content element", and a "knowledge graph card" displayed on FIG. The result is saved in the database (S13a).

最終判定結果として、目的キーワードのユーザー嗜好性を反映するコンテンツの作成方法の要素、「重要コンテンツ」「コンテンツ」「意味要素（ナレッジグラフカード有）」が表示される。また、各成分キーワードの自動判定結果（第１評価値）と「重要コンテンツ」「コンテンツ」「意味要素（ナレッジグラフカード有）」が表示され、その結果リストが保存できる。 As the final judgment result, the elements of the content creation method reflecting the user preference of the target keyword, "important content", "content", and "semantic element (with Knowledge Graph card)" are displayed. In addition, the automatic determination result (first evaluation value) of each component keyword, "important content", "content", and "semantic element (with knowledge graph card)" are displayed, and the result list can be saved.

過去に行った評価結果は表示させて比較することができる。また結果リストはダウンロードすることができ、Ｗｅｂサイトの制作に活用することができる。 The evaluation results performed in the past can be displayed and compared. In addition, the result list can be downloaded and used for creating a website.

Ｗｅｂサイト制作者は、このシステムから得られた情報に基づいてＷｅｂサイト制作を進めていくことで、Ｇｏｏｇｌｅの人工知能に対応するためのＷｅｂサイト制作ができるようになる。 The website creator will be able to create a website to support Google's artificial intelligence by proceeding with the website creation based on the information obtained from this system.

＜別実施形態＞
本発明における言葉（単語）は名詞に限定されるものではなく、形容詞、動詞なども含まれるものである。また、本発明における「言葉」は、複数の単語や助詞等が組み合わせられたフレーズ、文章等も含むものとする。 <Another Embodiment>
Words in the present invention are not limited to nouns, but also include adjectives and verbs. Further, the "word" in the present invention includes a phrase, a sentence, etc. in which a plurality of words, particles, etc. are combined.

本発明における成分キーワードは、最終的にユーザーに選択される成分キーワードと、選択されるまでの候補（成分キーワード候補）も含むもの概念とする。 The component keyword in the present invention is a concept including a component keyword finally selected by the user and a candidate (component keyword candidate) until the selection.

データベースが実際に構築されるハードウェアは、ハードディスク、半導体メモリ、外部記憶媒体（ＤＶＤ等）等、適宜の記憶装置を用いることができる。 As the hardware on which the database is actually constructed, an appropriate storage device such as a hard disk, a semiconductor memory, or an external storage medium (DVD or the like) can be used.

本実施形態において、第１評価値、第２評価値は、数値の大小による評価を行っているが、これに限定されるものではなく、Ａ，Ｂ，Ｃのような評価を用いてもよい。また、数値を用いる場合、数値が小さいほど評価が高くなるように設定してもよい。 In the present embodiment, the first evaluation value and the second evaluation value are evaluated according to the magnitude of the numerical values, but the evaluation is not limited to this, and evaluations such as A, B, and C may be used. .. Further, when a numerical value is used, the smaller the numerical value, the higher the evaluation may be set.

本実施形態において、評価プログラムはＷｅｂサーバーにインストールされる構成であるが、ユーザーの各ＰＣにそれぞれインストールされる構成を採用してもよい。 In the present embodiment, the evaluation program is installed on the Web server, but it may be installed on each user's PC.

１０入力エリア
１１クリックボタン
１００Ｗｅｂサーバー
１０１キーワードエクスプローラー
１０１ａＧｏｏｇｌｅ情報取得システム
１０１ｂＷｉｋｉｐｅｄｉａ取得システム
１０１ｃサイト解析システム
１０１ｄ成分キーワード判定システム
１０２会員管理システム
２００ＤＢサーバー
２１０クロールサーバー 10 Input area 11 Click button 100 Web server 101 Keyword explorer 101a Google information acquisition system 101b Wikipedia acquisition system 101c Site analysis system 101d Component keyword judgment system 102 Member management system 200 DB server 210 Crawl server

Claims

Steps to accept input of target keywords for searching web pages,
A step to automatically extract component keywords as words that should compose a Web page based on the entered target keywords,
The step of calculating the degree of correlation between the extracted individual component keywords and the target keyword as the first evaluation value, and
A step of displaying a list of the extracted component keywords and the first evaluation value, and
Steps to select a specific ingredient keyword based on the list display, and
The step of calculating the degree of correlation between all the selected component keywords and the target keyword as the second evaluation value, and
The step of displaying the calculated second evaluation value on the screen,
An evaluation program for component keywords that make up a Web page, which is characterized by having a computer execute the above.

In the step of calculating the first evaluation value,
A step of searching a Web page using the component keywords and
The evaluation program according to claim 1, wherein the step of checking whether or not the target keyword exists in the title and description of the searched Web page and the execution of the step.

In the step of calculating the first evaluation value,
The evaluation program according to claim 2, wherein an evaluation table is stored in advance so that the higher the searched Web page is, the higher the evaluation score is, and the evaluation table has a step of calculating using the evaluation table. ..

In the step of calculating the second evaluation value,
A step of searching a Web page using the selected component keyword group, and
The evaluation program according to claim 2, wherein the step of checking whether or not the target keyword exists in the title and description of the searched Web page and the execution of the step.

The step of automatically extracting the component keywords is
The evaluation program according to any one of claims 1 to 4, wherein the evaluation program includes a step of searching the target keyword from a dictionary database and extracting words from a sentence explaining the target keyword.

The step of automatically extracting the component keywords is
The evaluation program according to any one of claims 1 to 5, wherein the evaluation program includes a step of extracting words to be suggested when a target keyword is input in an input area of a search query.

The step of automatically extracting the component keywords is
Steps to search a web page using the target keyword,
The evaluation program according to any one of claims 1 to 6, wherein the evaluation program includes a step of crawling a searched upper Web page and extracting words from sentences constituting the Web page.