JP2020086996A

JP2020086996A - Publication information retrieval system

Info

Publication number: JP2020086996A
Application number: JP2018221220A
Authority: JP
Inventors: 香深金子; Komi Kaneko; 俊彦愛澤; Toshihiko Aizawa; 伸一山竹; Shinichi Yamatake; 隆弘藤村; Takahiro Fujimura
Original assignee: Create Co Ltd
Current assignee: Create Co Ltd
Priority date: 2018-11-27
Filing date: 2018-11-27
Publication date: 2020-06-04
Anticipated expiration: 2038-11-27
Also published as: JP7018202B2

Abstract

PROBLEM TO BE SOLVED: To provide a publication information retrieval system that can efficiently carry out everything from retrieval of publication information to its display for browsing.

SOLUTION: A publication information retrieval system 1 comprises: a company server 2 that manages information resources collected by a user H; a company database 3 that stores the collected information resources; and a user terminal 7 that retrieves the information resources required by the user and displays the retrieval results. The company server 2 has retrieval range limitation means that, when the information resources required by the user H are retrieved, limits the retrieval range by specifying a category relevant to the required information resources from websites A to C of other companies, and stores the information resources that match the retrieval conditions in the company database 3.

SELECTED DRAWING: Figure 1

Description

The present invention relates to a posted information search system that searches for and collects information posted on websites and compiles it into lists for use in sales activities and the like.

Sites are known that pick up accommodation information, rental property information, job information, and the like matching search conditions from websites of the same industry published on the Internet. By using such a site, a user can obtain the necessary information at once without accessing each company's website individually.

For example, the method for automatically collecting job information described in Patent Document 1 below comprises a server that includes job information collection means, job-offering company servers owned by the respective recruiting companies, and job seeker terminals used by the respective job seekers.

The collection means accesses a job-related information page, receives the page's description information, and temporarily stores it in memory on the server. The analysis means then analyzes the page description information acquired by the collection means based on the specification of a markup language such as HTML, and extracts the text and its layout information.

As a result, the server's job information database stores the latest job information that each company publishes on its homepage, organized by job-listing component, so job seekers can immediately browse job information from the website (Patent Document 1, paragraphs 0020 to 0027).

Patent Document 1: Japanese Unexamined Patent Application Publication No. 2004-326712 (JP 2004-326712 A)

However, because the method of Patent Document 1 collects information by a technique known as crawling, it also acquires information that is not actually needed. This takes time, and effort is also required to organize the output information for browsing.

The present invention has been made in view of these circumstances, and its object is to provide a posted information search system that can efficiently carry out everything from searching posted information to displaying it for browsing.

The present invention is a posted information search system that searches a plurality of websites, on which information resources viewable by anyone are posted, for information resources required by a user and collects them. The system comprises a server that manages the collected information resources, a database that stores the collected information resources, and a user terminal that is network-connected to the server and that searches for the information resources required by the user and displays the search results. The server has search range limiting means that, when searching for the information resources required by the user, limits the search range by specifying, from the web server of each of the websites, a category related to those information resources.

In the posted information search system of the present invention, the user searches for and collects the necessary information resources from websites using a user terminal network-connected to the server. Because the search range limiting means of the server can restrict the search to categories related to those information resources, unnecessary information resources are not picked up during the search, and the search time can be shortened. Information resources that match the search conditions are stored in the user-side database connected to the server. The user can thus efficiently collect the necessary information resources and browse the search results on the user terminal.

In the posted information search system of the present invention, it is preferable that the search range limiting means can designate, within each of the websites, a dynamically generated page that is generated upon access by the user.

According to this configuration, the search range limiting means of the user terminal can designate a dynamically generated page. The user can therefore also hit information resources posted on dynamically generated pages in a search and make use of them.

Further, in the posted information search system of the present invention, it is preferable that the server has information resource inspection means that inspects the collected information resources when storing them in the database, and that the information resource inspection means corrects notation variations in the collected information resources and then checks whether there is duplicate information.

According to this configuration, the information resource inspection means of the server inspects the searched and collected information resources when storing them in the database. Because the same information resource may be posted in different forms on multiple websites, the information resource inspection means corrects notation variations and then determines whether the information is a duplicate. Duplicate information resources can thus be found efficiently.

Further, in the posted information search system of the present invention, it is preferable that the information resource inspection means integrates a plurality of the collected information resources determined to be duplicate information into a single information resource.

According to this configuration, the information resource inspection means of the user terminal can integrate a plurality of information resources determined to be duplicate information into a single information resource. This makes it possible to eliminate duplicate information resources from the final list.

FIG. 1: Schematic diagram of the posted information search system according to an embodiment of the present invention.

FIG. 2A: Diagram explaining the range limiting function used during information search.

FIG. 2B: Diagram explaining the data cleansing of hit information.

FIG. 3: Diagram explaining information browsing in the posted information search system.

FIG. 4: Display example of the information browsing screen (web screen) of FIG. 3.

FIG. 5: Display example of the information browsing screen (spreadsheet) of FIG. 3.

Embodiments of the posted information search system of the present invention are described below with reference to the drawings.

FIG. 1 is a schematic diagram of a posted information search system 1 according to an embodiment of the present invention. The posted information search system 1 comprises an in-house server 2, which is a server of company X to which an employee H who uses the system belongs; an in-house database 3 connected to the in-house server 2; and a user terminal 7 that is network-connected to the in-house server 2 via an in-house LAN or the like and on which a posted information search tool (application) is installed.

Possible search targets include rental listings, accommodation facilities, and the like, but here the targets are job listings for full-time, part-time, and other positions that anyone can view. That is, company X is a company that handles job information, and employee H (corresponding to the "user" of the present invention) can use the user terminal 7 to search for and collect the required job information from the websites of other companies in the same industry.

As illustrated, there are three companies, company A, company B, and company C, that post job information and job advertisements for various occupations on their websites. The web servers of company A, company B, and company C are referred to as management server 5A, management server 5B, and management server 5C, respectively, and their websites as website A, website B, and website C, respectively.

When employee H of company X enters search conditions via text (keywords), check boxes, and the like in the posted information search tool on the user terminal 7, the in-house server 2 accesses management server 5A, management server 5B, and management server 5C in turn and performs the search. If any information matches the search conditions (hereinafter, "hit information"), the in-house server 2 acquires the hit information and stores it in the in-house database 3.

The in-house database 3 is a storage device such as a hard disk and consists of a registered data area 3A and a normalized data area 3B. Area 3A stores information such as company names and job types collected according to traversal rules for visiting the websites to be searched. Area 3B stores normalized data produced by editing the collected information according to normalization rules that normalize telephone numbers and addresses into a predetermined format.

The in-house server 2 then performs processing such as sorting, and the hit information is compiled into a list. The search result list (the list screen and spreadsheet described later) is displayed on the display screen 7a of the user terminal 7, so employee H can carry out sales activities based on the search result list.

Next, the features of information search by the posted information search system 1, and in particular by the in-house server 2, are described with reference to FIGS. 2A and 2B.

First, the range limiting function used by the in-house server 2 during information search is described. FIG. 2A shows the page structure (links) of a website to be searched.

Employee H enters keywords and addresses in the posted information search tool on the user terminal 7 to search for job information. In response, the in-house server 2 visits the management servers 5A, 5B, and 5C of companies A, B, and C in a round. Hit information matching the search conditions is then acquired from websites A to C (see FIG. 1).

Unlike crawler-type information acquisition, which targets every page that makes up a website, the information search of the present invention can specify the categories related to the information employee H needs and search only within a predetermined, limited range.

Specifically, conventional crawler-type information acquisition searches all linked pages, including the top page Ptop, the dynamically generated page Pdyn, and the other pages P1 to P8.

In contrast, the in-house server 2 of the posted information search system 1 can designate pages of a specific category as the search target. For example, the job information pages of websites A to C are dynamically generated pages that consist of largely fixed fields such as company name, telephone number, and URL, and in which only the text inside the frame changes when the page is switched. The in-house server 2 can search only the dynamically generated page Pdyn, or that page plus a page P1 on which predetermined information is posted.

Methods for doing so include specifying the type of request method, a unique attribute (such as a region name), a predictable element (such as a page number), and the conditions and timing of other changing values. For example, the in-house server 2 can perform traversal search processing in which it specifies the attribute "tokyo" (Tokyo) and searches the target pages 1 to 500, and then specifies the attribute "kanagawa" (Kanagawa) and searches the same pages 1 to 500.
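
The region-by-region, page-by-page traversal described above can be sketched as a simple URL plan. This is a minimal illustration only: the function name `build_crawl_plan`, the host `jobs.example.com`, and the URL layout are assumptions made for the example and do not come from the patent.

```python
from typing import Iterator


def build_crawl_plan(base_url: str, regions: list[str],
                     first_page: int, last_page: int) -> Iterator[str]:
    """Yield list-page URLs, one region at a time, pages in ascending order."""
    for region in regions:                      # unique attribute, e.g. "tokyo"
        for page in range(first_page, last_page + 1):  # predictable element
            # Hypothetical URL layout; a real site would need its own pattern.
            yield f"{base_url}/{region}/?page={page}"


plan = list(build_crawl_plan("https://jobs.example.com/list",
                             ["tokyo", "kanagawa"], 1, 500))
print(len(plan), plan[0], plan[500])
```

Only URLs in the plan are requested, so pages outside the designated category are never fetched.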

The in-house server 2 can also be configured to skip the traversal search processing under specific conditions. For example, while traversing information list pages whose URLs contain a unique attribute or a predictable element, the traversal search processing is skipped when no newly obtainable links are found for a predetermined number of consecutive pages.
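
The skip condition can be sketched as a counter of consecutive list pages that yield no new links. The names `crawl_list_pages`, `fetch_links`, and `max_empty_streak` are hypothetical, and the fetch function is a stand-in for whatever actually downloads and parses a list page.

```python
from typing import Callable, Iterable


def crawl_list_pages(pages: Iterable[int],
                     fetch_links: Callable[[int], list[str]],
                     max_empty_streak: int = 3) -> set[str]:
    """Collect links page by page; stop after N consecutive pages with nothing new."""
    seen: set[str] = set()
    empty_streak = 0
    for page in pages:
        new_links = set(fetch_links(page)) - seen
        if new_links:
            seen |= new_links
            empty_streak = 0          # any new link resets the streak
        else:
            empty_streak += 1
            if empty_streak >= max_empty_streak:
                break                 # skip the remaining pages
    return seen


fetched: list[int] = []


def fake_fetch(page: int) -> list[str]:
    """Stand-in site: only pages 1 and 2 carry links not seen before."""
    fetched.append(page)
    return [f"/job/{page}"] if page <= 2 else ["/job/2"]


links = crawl_list_pages(range(1, 501), fake_fetch, max_empty_streak=3)
print(sorted(links), fetched)
```

In this stand-in run, the traversal stops after page 5 instead of requesting all 500 pages.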

This prevents time from being spent searching unnecessary pages during information search and prevents unnecessary information from being acquired and consuming the capacity of the in-house database 3. As a result, the running cost of the posted information search system 1 can be kept down while reducing the burden on each employee when using the system.

Next, the data cleansing performed on hit information is described with reference to FIG. 2B.

Because the same job information may be posted on websites A to C, an information search may return job information that appears to have been hit more than once. If the notation of the duplicate-hit job information (for example, company name, address, and telephone number) is exactly the same, the in-house server 2 (not shown) judges it to be a single item and stores it in the in-house database 3.

However, notation may vary between websites. Taking company names as an example, website A may write "（株）エース" (the abbreviated form of "Ace Co., Ltd."), while website B writes "株式会社エース" (the full form).

Taking addresses as an example, website A may write "東京都千代田区〇丁目・・・" (Chiyoda-ku, Tokyo, ...), while website B writes "千代田区〇丁目・・・", omitting the prefecture name.

In such cases, before storing the job information in the in-house database 3, the in-house server 2 fills in missing parts and corrects notation variations, that is, performs data cleansing. It then matches the job information that appears to be duplicate hits again and, if the items match, judges them to be a single item and stores it in the in-house database 3.
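
The two corrections mentioned here (expanding an abbreviated company form and restoring an omitted prefecture) can be sketched as follows. This is an illustration under stated assumptions: the function names and the tiny `WARD_TO_PREFECTURE` table are hypothetical, and a real system would need a far more complete rule set.

```python
import unicodedata

# Hypothetical lookup used only for this example.
WARD_TO_PREFECTURE = {"千代田区": "東京都", "世田谷区": "東京都"}


def normalize_company(name: str) -> str:
    """Unify width variants (NFKC), then expand "(株)" to "株式会社"."""
    name = unicodedata.normalize("NFKC", name).strip()
    return name.replace("(株)", "株式会社")


def normalize_address(address: str) -> str:
    """Prepend the prefecture when the address starts with a known ward."""
    address = unicodedata.normalize("NFKC", address).strip()
    for ward, prefecture in WARD_TO_PREFECTURE.items():
        if address.startswith(ward):
            return prefecture + address
    return address


same_company = normalize_company("（株）エース") == normalize_company("株式会社エース")
print(same_company, normalize_address("千代田区丸の内1丁目"))
```

After both records pass through the same normalization, an exact comparison is enough to treat them as one item.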

For telephone numbers, there are differences such as whether hyphens ("−") are present and whether parentheses are used. The work of unifying telephone number notation according to such rules is called normalization and is performed as one form of data cleansing. Once this normalization has been performed, the in-house server 2 can easily determine whether multiple pieces of job information are identical.

The in-house server 2 can also specify, for each data classification, which part of an acquired page is extracted and how data cleansing is performed. Specifically, the in-house server 2 can individually specify an "extraction method", a "normalization method", and a "verification method" for each label (index) such as telephone number or address.
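
One way to picture per-label rules is a small table that maps each label to three callables. The sketch below is an assumption-laden illustration: `LabelRule`, `after_marker`, `RULES`, the markers "住所" and "TEL", and the 10-to-11-digit check are all invented for the example and are not taken from the patent.

```python
import re
import unicodedata
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass(frozen=True)
class LabelRule:
    extract: Callable[[str], Optional[str]]   # "extraction method"
    normalize: Callable[[str], str]           # "normalization method"
    verify: Callable[[str], bool]             # "verification method"


def after_marker(marker: str) -> Callable[[str], Optional[str]]:
    """Build an extractor that returns the rest of the line after a marker."""
    pattern = re.compile(re.escape(marker) + r"\s*[:：]?\s*(.+)")

    def extract(page_text: str) -> Optional[str]:
        match = pattern.search(page_text)
        return match.group(1).strip() if match else None

    return extract


RULES = {
    "address": LabelRule(
        after_marker("住所"),
        lambda s: unicodedata.normalize("NFKC", s),
        lambda s: len(s) > 0,
    ),
    "phone": LabelRule(
        after_marker("TEL"),
        lambda s: re.sub(r"\D", "", unicodedata.normalize("NFKC", s)),
        lambda s: 10 <= len(s) <= 11,   # assumed length check
    ),
}


def apply_rules(page_text: str) -> dict[str, str]:
    """Run extract, normalize, verify for every label; keep verified values."""
    record: dict[str, str] = {}
    for label, rule in RULES.items():
        raw = rule.extract(page_text)
        if raw is None:
            continue
        value = rule.normalize(raw)
        if rule.verify(value):
            record[label] = value
    return record


page = "住所：東京都千代田区１丁目\nTEL: ０３−１２３４−５６７８"
print(apply_rules(page))
```

Keeping the three steps separate per label means a new field can be added by registering one more rule, without touching the pipeline.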

In the telephone number example, full-width or half-width digits separated by hyphens ("−") that appear immediately after a predetermined character string (index) are extracted, and any full-width digits are converted to half-width digits. Only the digits are then extracted, and if the result is not an empty string and the same telephone number does not exist in the comparison target (telephone numbers already in the list), the number is judged to be new.
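
The steps in this paragraph can be sketched as below. The marker "電話番号", the set of hyphen variants accepted, and the function names are assumptions for illustration; the patent does not specify them.

```python
import re
import unicodedata
from typing import Optional


def extract_phone(page_text: str, marker: str = "電話番号") -> Optional[str]:
    """Extract the digits that follow the marker and return them in half-width form."""
    # \d also matches full-width digits in Python 3; three hyphen variants are accepted.
    pattern = re.compile(re.escape(marker) + r"\s*[:：]?\s*([\d\-−－]+)")
    match = pattern.search(page_text)
    if match is None:
        return None
    half_width = unicodedata.normalize("NFKC", match.group(1))  # ０３ -> 03
    digits = re.sub(r"\D", "", half_width)                      # drop hyphens
    return digits or None


def is_new_phone(digits: Optional[str], known: set[str]) -> bool:
    """New means: not empty and not already present in the comparison list."""
    return bool(digits) and digits not in known


known_numbers = {"0312345678"}
first = extract_phone("電話番号：０３−１２３４−５６７８")
second = extract_phone("電話番号: 090-1234-1234")
print(first, is_new_phone(first, known_numbers))
print(second, is_new_phone(second, known_numbers))
```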

For a telephone number (for example, the 11 digits "09012341234"), three-digit sequences are taken out ("090", "901", "012", and so on) and compared with the three-digit sequences obtained by applying the same processing to the comparison target, and a score is assigned. A match is given a high score, and if all scores are at or below a specified value, the number is regarded as new.
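
The text does not define the scoring formula, so the sketch below substitutes one plausible choice: the share of three-digit sequences two numbers have in common (Jaccard similarity over trigrams). The formula, the threshold of 0.8, and all function names are assumptions made for illustration.

```python
def trigrams(digits: str) -> set[str]:
    """All three-digit sequences in the number, e.g. "090", "901", "012"."""
    return {digits[i:i + 3] for i in range(len(digits) - 2)}


def similarity(a: str, b: str) -> float:
    """Assumed score: shared trigrams divided by all distinct trigrams."""
    ta, tb = trigrams(a), trigrams(b)
    if not ta or not tb:
        return 0.0
    return len(ta & tb) / len(ta | tb)


def is_new_number(candidate: str, known: list[str], threshold: float = 0.8) -> bool:
    """Treat the candidate as new when every score is at or below the threshold."""
    return all(similarity(candidate, k) <= threshold for k in known)


known = ["0312345678"]
print(sorted(trigrams("09012341234")))
print(is_new_number("09012341234", known))           # different number
print(is_new_number("0312345678", known))            # identical number
```

A fuzzy score of this kind tolerates a stray or missing digit, which an exact string comparison would not.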

This prevents duplicate-hit job information arising from a search of websites A to C from being stored as is in the in-house database 3. If listed job information contains duplicates, employee H may end up making sales approaches to the same company repeatedly; using the posted information search system 1 improves sales efficiency.

Next, browsing of hit information with the posted information search system 1 is described with reference to FIGS. 3 to 5.

First, as shown in FIG. 3, employee H uses the user terminal 7 (posted information search tool) to enter search conditions as text or the like. FIG. 4 shows the display screen 7a of the user terminal 7; the upper part of the display screen 7a is an example of the search specification screen.

Because searching on the search specification screen is basically by "keyword", employee H enters keywords such as "manufacturing" and "accounting" and selects whether to perform an AND search or an OR search on the keywords. The date on which the job information was posted can be specified under "acquisition date range", and the work location in the job information can be specified under "address search".
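
How the three conditions might combine can be sketched as a predicate over a listing record. The `Listing` fields, the `matches` function, and the choice of case-insensitive substring matching are assumptions for illustration; the patent does not describe the matching logic at this level.

```python
from dataclasses import dataclass
from datetime import date


@dataclass
class Listing:
    company: str
    text: str
    address: str
    acquired: date


def matches(listing: Listing, keywords: list[str], mode: str,
            start: date, end: date, address_part: str) -> bool:
    """Apply keyword (AND/OR), acquisition date range, and address conditions."""
    combine = all if mode == "AND" else any
    body = listing.text.casefold()
    # With no keywords, the keyword condition is treated as satisfied.
    keyword_ok = combine(k.casefold() in body for k in keywords) if keywords else True
    date_ok = start <= listing.acquired <= end
    address_ok = address_part in listing.address
    return keyword_ok and date_ok and address_ok


a = Listing("Ace", "Accounting staff for a manufacturing company",
            "Chiyoda-ku, Tokyo", date(2018, 11, 1))
b = Listing("Bee", "Accounting assistant", "Yokohama, Kanagawa", date(2018, 11, 10))
window = (date(2018, 10, 1), date(2018, 11, 30))
print(matches(a, ["manufacturing", "accounting"], "AND", *window, "Tokyo"))
print(matches(b, ["manufacturing", "accounting"], "OR", *window, "Kanagawa"))
```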

When employee H enters search conditions under "keyword", "acquisition date range", and "address search" and clicks the search button 7b, the in-house server 2 (not shown) accesses management server 5A of company A and acquires hit information matching the search conditions from website A. Although not illustrated here, the in-house server 2 also accesses management server 5B of company B and management server 5C of company C in turn.

Next, after performing the data cleansing described above, the in-house server 2 stores the hit information in the in-house database 3. The hit information stored in the in-house database 3 is then displayed on the list screen in the lower part of the display screen 7a of the user terminal 7.

On the list screen, hit information can be sorted (in ascending or descending order) by multiple fields such as "acquisition date", "media name", "company name", and "address" and displayed. As described above, duplicate-hit job information is merged into a single item, so the same job information is never listed twice on the list screen. In addition, a company that previously posted job information may post it again, in which case only the single latest item is displayed.
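
The list-screen behavior (one latest row per company, sortable by any field) can be sketched as follows. The row layout and the choice of (company, phone) as the identity key are assumptions for illustration.

```python
from datetime import date
from operator import itemgetter


def latest_per_company(rows: list[dict]) -> list[dict]:
    """Keep only the most recently acquired row for each (company, phone) pair."""
    latest: dict[tuple, dict] = {}
    for row in rows:
        key = (row["company"], row["phone"])   # assumed identity key
        if key not in latest or row["acquired"] > latest[key]["acquired"]:
            latest[key] = row
    return list(latest.values())


def sort_rows(rows: list[dict], field: str, descending: bool = False) -> list[dict]:
    """Sort by a single field in ascending or descending order."""
    return sorted(rows, key=itemgetter(field), reverse=descending)


r1 = {"company": "Ace", "phone": "0312345678", "acquired": date(2018, 10, 1)}
r2 = {"company": "Ace", "phone": "0312345678", "acquired": date(2018, 11, 1)}
r3 = {"company": "Bee", "phone": "0451234567", "acquired": date(2018, 10, 15)}

listed = latest_per_company([r1, r2, r3])
newest_first = sort_rows(listed, "acquired", descending=True)
print([(r["company"], r["acquired"].isoformat()) for r in newest_first])
```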

On the list screen, employee H can click the URL in the "LINK" field to jump to the website where the job information is posted (for example, website A of company A). Employee H can also click the details button 7c in the "details" field to jump to a detail page describing the job information in detail. This detail page is job information created by company X, employee H's company, and stored in the in-house database 3.

In addition to the information displayed on the list screen, the job information on the detail page includes "E-mail", "business description", "job description", "working hours", "employment type", "benefits", "salary", and so on. Employee H can check the details of the job information of particular interest among the job information displayed on the list screen.

Employee H can also click the export button 7d on the search specification screen of FIG. 4 to output the hit information stored in the in-house database 3 to a spreadsheet. FIG. 5 is an example of a spreadsheet displayed on the display screen 7a of the user terminal 7 or printed out.

In the spreadsheet, the first row shows field names such as "company name", "telephone number", and "LINK", and the hit information from the current search is listed from the second row onward. The spreadsheet basically contains the same fields as the list screen of FIG. 4, but employee H may be allowed to select the required fields at output time.
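
An export with a header row and user-selected fields can be sketched with Python's standard `csv` module. CSV is used here only as a stand-in for the spreadsheet format, which the patent does not specify; the field names and the `export_csv` function are likewise assumptions.

```python
import csv
import io


def export_csv(rows: list[dict], columns: list[str]) -> str:
    """Write a header row followed by one row per hit, limited to the chosen columns."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=columns,
                            extrasaction="ignore",   # drop fields not selected
                            lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


rows = [{"company": "株式会社エース", "phone": "0312345678",
         "link": "https://a.example/job/1", "address": "東京都千代田区"}]
sheet = export_csv(rows, ["company", "phone", "link"])
print(sheet)
```

Passing a different `columns` list is all it takes to let the user pick which fields appear in the output.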

When the spreadsheet is output, hit information can be sorted and duplicate information is excluded, as on the list screen of FIG. 4. When the spreadsheet is displayed on the display screen 7a of the user terminal 7, it is also possible to jump from the URL in the "LINK" field to the website where the job information is posted.

Employee H can also customize the spreadsheet for personal use. For example, a "contacted" field for sales use or a "notes" field may be added. This allows employee H to carry out sales activities efficiently, by telephone or other means, with contacts narrowed down by job type, address, and so on.

The above description covers only some embodiments of the present invention, and various other embodiments are conceivable. The number of companies searched (see FIG. 1) and the website structure (see FIG. 2A) shown in the embodiment are merely examples. The user terminal 7 is not limited to an in-house desktop PC or notebook PC and may be a mobile device that can connect to the Internet, such as a tablet or smartphone.

The information to be searched is likewise not limited to job information. For example, if the information to be searched is restaurants, the user specifies search conditions such as "Korean cuisine" and "Setagaya Ward" in the posted information search tool on the user terminal 7. The in-house server 2 then accesses the management servers of multiple websites that introduce restaurants (with a limited search range) and acquires hit information matching the search conditions.

Furthermore, the in-house server 2 performs data cleansing on the hit information and stores it in the in-house database 3. The hit information is then sorted in a predetermined manner and displayed on the list screen of the user terminal 7. This allows the user to browse restaurant search results without accessing multiple websites individually and with duplicate hits excluded.

In the above embodiment, the user is assumed to be able to access the in-house server 2, but the user may also be an external company or person. In that case, the external company or person can be granted specific access rights to the in-house server 2 so that they can search for information using the above system on the in-house server 2. The company that owns the in-house server 2 can then run a business that charges usage fees and the like by providing this service to external companies and people.

1: posted information search system; 2: in-house server; 3: in-house database; 5A to 5C: management servers; 7: user terminal; 7a: display screen; 7b: search button; 7c: details button; 7d: export button.

Claims

1. A posted information search system that searches a plurality of websites, on which information resources viewable by anyone are posted, for information resources required by a user and collects them, the system comprising:
a server that manages the collected information resources;
a database that stores the collected information resources; and
a user terminal that is network-connected to the server and that searches for the information resources required by the user and displays the search results,
wherein the server has search range limiting means for limiting a search range by specifying, from the web server of each of the websites, a category related to the information resources when searching for the information resources required by the user.

2. The posted information search system according to claim 1, wherein the search range limiting means can designate, within each of the websites, a dynamically generated page that is generated upon access by the user.

3. The posted information search system according to claim 1 or 2, wherein the server has information resource inspection means for inspecting the collected information resources when storing them in the database, and
the information resource inspection means corrects notation variations in the collected information resources and then checks whether there is duplicate information.

4. The posted information search system according to claim 3, wherein the information resource inspection means integrates a plurality of the collected information resources determined to be duplicate information into a single information resource.