JP6937707B2

JP6937707B2 - Information processing equipment, information processing methods, and programs

Info

Publication number: JP6937707B2
Application number: JP2018026741A
Authority: JP
Inventors: 隼人小林; 一真村尾; 毅司増山; 義宗田渕; 雅志沼田; 正幸仲辻; 拓実後藤
Original assignee: Yahoo Japan Corp
Current assignee: Yahoo Japan Corp
Priority date: 2018-02-19
Filing date: 2018-02-19
Publication date: 2021-09-22
Anticipated expiration: 2038-02-19
Also published as: JP7231682B2; JP2021193582A; JP2019144722A

Description

本発明は、情報処理装置、情報処理方法、およびプログラムに関する。 The present invention relates to an information processing device, an information processing method, and a program.

従来、各種コンテンツをユーザに提供する際に、第三者により投稿されたコンテンツに対する意見、感想等を示すコメントを、コンテンツに合わせて提供するサービスが行われている（例えば、特許文献１参照）。 Conventionally, when providing various contents to users, a service has been provided in which comments indicating opinions, impressions, etc. regarding the contents posted by a third party are provided according to the contents (see, for example, Patent Document 1). ..

特開２０１４−４９０９４号公報Japanese Unexamined Patent Publication No. 2014-49094

上記のような第三者により投稿されたコメントには、特定の人物に対する誹謗、中傷といった、一般に公開されることが適切でない内容が含まれる場合がある。このような特定の人物を誹謗、中傷するコメントは、公開されるべきものではなく、速やかに削除することが好ましい。このため、コンテンツの提供者は、このような不適切なコメントが含まれていないかを人手により監視し、削除する場合があった。 Comments posted by third parties, such as those mentioned above, may include content that is not appropriate for public disclosure, such as slander or slander against a particular person. Such comments that slander or slander a specific person should not be made public, and it is preferable to delete them promptly. For this reason, the content provider may manually monitor and delete such inappropriate comments.

また、不適切な単語や表現が予め定義された固定のモデルを用いて、不適切なコメントを自動的に検知する方法が行われている。しかしながら、不適切なコメントに利用される表現は多様化、複雑化しており、単純なモデルでは検知できないものも多く存在している。例えば、ポジティブなコメント表現の一部に不適切な表現が組み込まれている場合や、隠語等により不適切な表現がなされている場合は、上記のようなモデルでは検知できない場合があった。また、このような不適切なコメントに利用される表現は次々と生み出されるものであるため、人手によりこれらすべてを監視することは容易ではなく、また、固定のモデルを用いた方法では対応が遅れてしまう場合があった。 In addition, a method of automatically detecting inappropriate comments is performed using a fixed model in which inappropriate words and expressions are defined in advance. However, the expressions used for inappropriate comments are diversified and complicated, and there are many that cannot be detected by a simple model. For example, if an inappropriate expression is incorporated as part of a positive comment expression, or if an inappropriate expression is made due to jargon or the like, it may not be detected by the above model. In addition, since the expressions used for such inappropriate comments are generated one after another, it is not easy to manually monitor all of them, and the method using a fixed model is delayed in responding. There was a case that it ended up.

本発明は、このような事情を考慮してなされたものであり、不適切コメントの検知を迅速且つ高精度で行うことが可能な情報処理装置、情報処理方法、およびプログラムを提供することを目的の一つとする。 The present invention has been made in consideration of such circumstances, and an object of the present invention is to provide an information processing device, an information processing method, and a program capable of detecting inappropriate comments quickly and with high accuracy. It is one of.

本発明の一態様は、第１コンテンツに対して投稿されたコメントである第２コンテンツを閲覧したユーザによる、前記第２コンテンツが不適切であることを示す報告を受け付ける受付部と、前記受付部により受け付けられた前記報告に基づいて、前記第２コンテンツが不適切であるか否かを判定する第１判定部とを備える情報処理装置である。 One aspect of the present invention is a reception unit that receives a report indicating that the second content is inappropriate by a user who has viewed the second content, which is a comment posted on the first content, and the reception unit. It is an information processing apparatus including a first determination unit that determines whether or not the second content is inappropriate based on the report received by.

本発明の一態様は、このような事情を考慮してなされたものであり、不適切コメントの検知を迅速且つ高精度で行うことが可能である。 One aspect of the present invention has been made in consideration of such circumstances, and it is possible to detect inappropriate comments quickly and with high accuracy.

実施形態におけるサービス提供装置１の利用環境を示す図である。It is a figure which shows the use environment of the service providing apparatus 1 in embodiment. 実施形態における第２判定部２５によって行われる判定処理の一例を説明する図である。It is a figure explaining an example of the determination process performed by the 2nd determination unit 25 in embodiment. 実施形態におけるサービス提供装置１により生成されるニュースページＰ１の一例を示す図である。It is a figure which shows an example of the news page P1 generated by the service providing apparatus 1 in embodiment. 実施形態におけるサービス提供装置１により生成される報告ページＰ２の一例を示す図である。It is a figure which shows an example of the report page P2 generated by the service providing apparatus 1 in embodiment. 実施形態における報告情報記憶部２８に記憶された報告情報２８Ａの一例を示す図である。It is a figure which shows an example of the report information 28A stored in the report information storage unit 28 in embodiment. 実施形態におけるサービス提供装置１による不適切コメントの第１判定処理の流れを示すフローチャートである。It is a flowchart which shows the flow of the 1st determination process of the inappropriate comment by the service providing apparatus 1 in embodiment. 実施形態におけるコンテンツ情報記憶部２６に記憶されたコメント情報２６Ｂの一例を示す図である。It is a figure which shows an example of the comment information 26B stored in the content information storage unit 26 in embodiment. 実施形態におけるユーザ情報記憶部３０に記憶されたユーザ情報３０Ａの一例を示す図である。It is a figure which shows an example of the user information 30A stored in the user information storage unit 30 in embodiment. 実施形態におけるコンテンツ情報記憶部２６に記憶されたコメント情報２６Ｂの一例を示す図である。It is a figure which shows an example of the comment information 26B stored in the content information storage unit 26 in embodiment. 実施形態における学習データの一例を示す図である。It is a figure which shows an example of the learning data in an embodiment. 実施形態におけるサービス提供装置１による不適切コメントの第２判定処理の流れを示すフローチャートである。It is a flowchart which shows the flow of the 2nd determination process of the inappropriate comment by the service providing apparatus 1 in embodiment. 実施形態におけるサービス提供装置１によるニュースページの生成処理の流れを示すフローチャートである。It is a flowchart which shows the flow of the news page generation processing by the service providing apparatus 1 in embodiment. 実施形態におけるサービス提供装置１により生成されるニュースページＰ１の一例を示す図である。It is a figure which shows an example of the news page P1 generated by the service providing apparatus 1 in embodiment.

以下、図面を参照し、本発明の情報処理装置、情報処理方法、およびプログラムの実施形態について説明する。情報処理装置は、一以上のプロセッサ（コンピュータ）により実現される。情報処理装置は、コンテンツに対するコメントを閲覧したユーザにより報告された内容に基づいて、このコメントが不適切なコメントであるか否かを判定する。情報処理装置は、このコメントが不適切なコメントであると判定した場合、このコメントの掲載順位を下げるか、あるいは掲載対象から除外する。 Hereinafter, the information processing apparatus, the information processing method, and the embodiment of the program of the present invention will be described with reference to the drawings. The information processing device is realized by one or more processors (computers). The information processing device determines whether or not the comment is an inappropriate comment based on the content reported by the user who viewed the comment on the content. If the information processing device determines that this comment is an inappropriate comment, it lowers the ranking of this comment or excludes it from the posting target.

図１は、サービス提供装置１（情報処理装置）の利用環境を示す図である。サービス提供装置１は、ネットワークNWを介して、一以上の端末装置Ｔと接続され、このネットワークＮＷを介して互いに通信する。ネットワークＮＷは、例えば、ＷＡＮ（Wide Area Network）やＬＡＮ（Local Area Network）、インターネット、専用回線、無線基地局、プロバイダなどを含む。 FIG. 1 is a diagram showing a usage environment of the service providing device 1 (information processing device). The service providing device 1 is connected to one or more terminal devices T via the network NW, and communicates with each other via the network NW. The network NW includes, for example, WAN (Wide Area Network), LAN (Local Area Network), the Internet, a dedicated line, a wireless base station, a provider, and the like.

サービス提供装置１は、例えば、ウェブサーバまたはアプリサーバなどと称される。サービス提供装置１は、端末装置Ｔからの要求に応じて、ブラウザやアプリケーションプログラムによって再生されるコンテンツを提供する。 The service providing device 1 is referred to as, for example, a web server or an application server. The service providing device 1 provides content to be reproduced by a browser or an application program in response to a request from the terminal device T.

サービス提供装置１により提供されるコンテンツには、例えば、ブラウザによって参照されるウェブページの他、アプリケーションプログラムによって参照されるアプリページが含まれる。このようなウェブページまたはアプリページには、コメント投稿機能が実装された任意のページが含まれる。例えば、ウェブページには、ニュースサイト、オークションサイト、質問サイト等のページが含まれる。その他、コンテンツには、ミニブログ等のソーシャルネットワークサービス（ＳＮＳ）において投稿されたメッセージ等も含まれる。以下においては、サービス提供装置１により提供されるコンテンツが、ブラウザによって参照されるウェブページであり、このウェブページがニュースサイトのページである場合を例に挙げて説明する。 The content provided by the service providing device 1 includes, for example, a web page referenced by a browser and an application page referenced by an application program. Such web pages or app pages include any page that implements the comment posting feature. For example, a web page includes pages such as a news site, an auction site, and a question site. In addition, the content includes messages posted on social network services (SNS) such as miniblogs. In the following, a case where the content provided by the service providing device 1 is a web page referenced by a browser and the web page is a page of a news site will be described as an example.

サービス提供装置１により提供されるコンテンツは、メインのコンテンツ（第１コンテンツ）と共に、第三者により投稿されたこの第１コンテンツに対する意見、感想等を示すコメントが（第２コンテンツ）とを含むものである。サービス提供装置１は、コメントを閲覧したユーザにより報告された内容に基づいて、そのコメントが、特定の人物に対する誹謗、中傷といった一般に公開されることが適切でない内容が含まれるコメント（以下、「不適切コメント」とも言う）であるか否かを判定する。サービス提供装置１は、あるコンテンツに対して行われたコメントのうち、不適切コメントの掲載順位を下げるか、あるいは削除した上で、ユーザに提供する。 The content provided by the service providing device 1 includes the main content (first content) and comments (second content) indicating opinions, impressions, etc. regarding the first content posted by a third party. .. Based on the content reported by the user who viewed the comment, the service providing device 1 includes a comment whose comment is not appropriate to be disclosed to the public, such as slander or slander against a specific person (hereinafter, "non-existent"). It is also called "appropriate comment"). The service providing device 1 lowers or deletes the posting order of inappropriate comments among the comments made on a certain content, and then provides the comments to the user.

端末装置Ｔは、サービス提供装置１により提供されるニュースサイトのページを閲覧するユーザによって操作される。端末装置Ｔは、例えば、パーソナルコンピュータ、スマートフォンなどの携帯電話やタブレット端末、ＰＤＡ（Personal Digital Assistant）などのコンピュータ装置である。端末装置Ｔは、ブラウザやアプリケーションプログラムなどのＵＡ（User Agent）が動作して以下の機能を実現する。端末装置Ｔは、ユーザの操作に基づいて、サービス提供装置１により提供されるニュースサイトのページを要求するリクエストを送信し、リクエストに応じたニュースサイトのページに関する情報をサービス提供装置１から受信して表示する。 The terminal device T is operated by a user who browses a page of a news site provided by the service providing device 1. The terminal device T is, for example, a personal computer, a mobile phone such as a smartphone, a tablet terminal, or a computer device such as a PDA (Personal Digital Assistant). The terminal device T realizes the following functions by operating a UA (User Agent) such as a browser or an application program. The terminal device T transmits a request for a news site page provided by the service providing device 1 based on the user's operation, and receives information about the news site page in response to the request from the service providing device 1. To display.

［サービス提供装置の構成］
サービス提供装置１は、例えば、通信部１０（受付部）と、コンテンツ取得部１２と、順位決定部１４と、ページ生成部１６と、報告情報処理部１８（受付部）、第１判定部２０と、ラベル付与部２２と、学習部２４と、第２判定部２５と、コンテンツ情報記憶部２６と、報告情報記憶部２８と、ユーザ情報記憶部３０と、モデル記憶部３２とを備える。サービス提供装置１に含まれる各機能部は、複数の装置に分散されてもよい。例えば、第１判定部２０、ラベル付与部２２、および学習部２４と、他の機能部とは別体の装置によって実現されてもよい。また、例えば、第１判定部２０と、学習部２４とは別体の装置によって実現されてもよい。 [Configuration of service providing device]
The service providing device 1 includes, for example, a communication unit 10 (reception unit), a content acquisition unit 12, a ranking determination unit 14, a page generation unit 16, a report information processing unit 18 (reception unit), and a first determination unit 20. A label giving unit 22, a learning unit 24, a second determination unit 25, a content information storage unit 26, a report information storage unit 28, a user information storage unit 30, and a model storage unit 32 are provided. Each functional unit included in the service providing device 1 may be distributed to a plurality of devices. For example, it may be realized by a device separate from the first determination unit 20, the labeling unit 22, the learning unit 24, and the other functional units. Further, for example, the first determination unit 20 and the learning unit 24 may be realized by a separate device.

通信部１０は、ネットワークＮＷを介して、端末装置Ｔ等と通信する。通信部１０は、例えば、ＮＩＣ等の通信インターフェースを含む。 The communication unit 10 communicates with the terminal device T or the like via the network NW. The communication unit 10 includes, for example, a communication interface such as a NIC.

コンテンツ取得部１２は、ユーザにより要求されたニュースページの記事情報２６Ａ（第１コンテンツ）と、この記事情報２６Ａに対して投稿された１以上のコメントを含むコメント情報２６Ｂ（第２コンテンツ）とを、コンテンツ情報記憶部２６から取得する。 The content acquisition unit 12 obtains the article information 26A (first content) of the news page requested by the user and the comment information 26B (second content) including one or more comments posted to the article information 26A. , Obtained from the content information storage unit 26.

順位決定部１４は、コンテンツ取得部１２により取得された１以上のコメントの掲載順位を決定する。順位決定部１４は、例えば、コメントのクリック率や、コメントに対する評価、コメントに含まれる表現等に基づいて、コメントの掲載順位を決定する。例えば、順位決定部１４は、コメントのクリック率が高い順に、コメントの掲載順位を決定する。順位決定部１４は、不適切コメントであると判定されたコメントについては、掲載順位を下げるか、あるいは掲載対象から除外するように、コメントの掲載順位を決定する。 The ranking determination unit 14 determines the posting order of one or more comments acquired by the content acquisition unit 12. The ranking determination unit 14 determines the ranking of comments based on, for example, the click rate of comments, the evaluation of comments, the expressions included in comments, and the like. For example, the ranking determination unit 14 determines the ranking of comments in descending order of click rate of comments. The ranking determination unit 14 determines the ranking of comments so as to lower the ranking or exclude the comments that are determined to be inappropriate comments.

ページ生成部１６は、コンテンツ取得部１２により取得された記事情報２６Ａと、コメント情報２６Ｂに含まれる１以上のコメントとを含むニュースページを生成し、端末装置Ｔに送信する。このニュースページ内において、コメント情報２６Ｂに含まれるコメントは、順位決定部１４により決定された掲載順位で掲載される。 The page generation unit 16 generates a news page including the article information 26A acquired by the content acquisition unit 12 and one or more comments included in the comment information 26B, and transmits the news page to the terminal device T. In this news page, the comments included in the comment information 26B are posted in the ranking determined by the ranking determination unit 14.

また、ページ生成部１６は、ニュースページに含まれるコメントを閲覧したユーザによる不適切コメントを報告するリクエストを、通信部１０を介して受信した場合、報告内容の入力を受け付ける報告ページを生成し、端末装置Ｔに送信する。 Further, when the page generation unit 16 receives a request for reporting an inappropriate comment by a user who has viewed the comment included in the news page via the communication unit 10, the page generation unit 16 generates a report page that accepts the input of the report content. It is transmitted to the terminal device T.

報告情報処理部１８は、上記の報告ページに対してユーザにより入力された報告内容を取得し、報告情報記憶部２８に記憶させる。 The report information processing unit 18 acquires the report content input by the user for the above report page and stores it in the report information storage unit 28.

第１判定部２０は、報告情報記憶部２８に記憶された報告情報２８Ａに基づいて、コメント情報２６Ｂに含まれるコメントの各々が不適切コメントであるか否かを判定する。 The first determination unit 20 determines whether or not each of the comments included in the comment information 26B is an inappropriate comment based on the report information 28A stored in the report information storage unit 28.

ラベル付与部２２は、第１判定部２０による判定結果に基づいて、コメント情報２６Ｂに含まれるコメントの各々に対して、不適切なコメントを示すラベル（以下、「不適切ラベル」と呼ぶ）、不適切なコメントである可能性があることを示すラベル（以下、「準不適切ラベル」と呼ぶ）、および不適切なコメントではないことを示すラベル（以下、「適切ラベル」と呼ぶ）のいずれかを付与する。このラベルが付与されたコメントが、学習部２４の学習に利用される学習データとなる。 Based on the determination result by the first determination unit 20, the label giving unit 22 indicates a label indicating an inappropriate comment for each of the comments included in the comment information 26B (hereinafter, referred to as an “inappropriate label”). Either a label indicating that the comment may be inappropriate (hereinafter referred to as "quasi-inappropriate label") or a label indicating that the comment is not inappropriate (hereinafter referred to as "appropriate label"). Label. The comment to which this label is attached becomes the learning data used for the learning of the learning unit 24.

学習部２４は、ラベル付与部２２によってラベルが付与されたデータ（コメントと、ラベルとの組）を学習データとして機械学習を行い、コメントが不適切コメントであるか否かを判定する判定モデル３２Ａを生成する。学習部２４は、生成した判定モデル３２Ａを、モデル記憶部３２に記憶させる。 The learning unit 24 performs machine learning using the data labeled by the labeling unit 22 (a set of a comment and a label) as learning data, and determines whether or not the comment is an inappropriate comment. To generate. The learning unit 24 stores the generated determination model 32A in the model storage unit 32.

第２判定部２５は、学習部２４により生成された判定モデル３２Ａを用いて、コメントが不適切コメントであるか否かを判定する。第２判定部２５は、例えば、ニュースサイトのページのコメント入力欄に対してユーザにより新たなコメントが入力されると、このコメントが不適切コメントであるか否かを判定することができる。図２は、第２判定部２５によって行われる判定処理の一例を説明する図である。第２判定部２５は、入力情報としてユーザにより入力されたコメント情報を受け取る。次に、第２判定部２５は、モデル記憶部３２に記憶された判定モデル３２Ａを用いて、入力されたコメント情報に含まれるコメントが不適切コメントであるか否かを判定する。次に、第２判定部２５は、出力情報として不適切コメントか否かに関する情報を出力する。出力情報は、例えば確率値（不適切な確率０．８等）でもよいし、正例（適切）、負例（不適切）などを示す情報であってもよい。 The second determination unit 25 determines whether or not the comment is an inappropriate comment by using the determination model 32A generated by the learning unit 24. For example, when a new comment is input by the user to the comment input field of the page of the news site, the second determination unit 25 can determine whether or not this comment is an inappropriate comment. FIG. 2 is a diagram illustrating an example of determination processing performed by the second determination unit 25. The second determination unit 25 receives the comment information input by the user as the input information. Next, the second determination unit 25 determines whether or not the comment included in the input comment information is an inappropriate comment by using the determination model 32A stored in the model storage unit 32. Next, the second determination unit 25 outputs information regarding whether or not the comment is inappropriate as output information. The output information may be, for example, a probability value (inappropriate probability 0.8, etc.), or information indicating a positive example (appropriate), a negative example (inappropriate), or the like.

学習部２４および第２判定部２５は、例えば、サポートベクターマシンを用いて処理を行ってよい。また、学習部２４および第２判定部２５は、再帰型ニューラルネットワーク（Recurrent Neural Network：ＲＮＮ）多層構造のニューラルネットワーク（Deep Neural Network：ＤＮＮ）、畳み込みニューラルネットワーク（Convolutional Neural Network：ＣＮＮ）などを用いたディープラーニング技術を採用してもよい。 The learning unit 24 and the second determination unit 25 may perform processing using, for example, a support vector machine. Further, the learning unit 24 and the second determination unit 25 use a recurrent neural network (RNN), a multi-layered neural network (DNN), a convolutional neural network (CNN), and the like. The deep learning technology that was used may be adopted.

コンテンツ情報記憶部２６は、例えば、ニュース記事を示す記事情報２６Ａと、ニュース記事に対して投稿されたコメントを示すコメント情報２６Ｂとを記憶する。記事情報２６Ａは、ニュース記事の配信担当者等の操作に基づいて適宜更新されるか、あるいは、日次等の所定のタイミングで実施されるバッチ処理により更新されてよい。コメント情報２６Ｂは、ニュースページに対するユーザによるコメントの投稿処理に基づいて適宜更新されてよい。 The content information storage unit 26 stores, for example, article information 26A indicating a news article and comment information 26B indicating a comment posted on a news article. The article information 26A may be updated as appropriate based on the operation of the person in charge of distributing the news article, or may be updated by batch processing performed at a predetermined timing such as daily. The comment information 26B may be appropriately updated based on the processing of posting a comment by the user on the news page.

報告情報記憶部２８は、例えば、報告ページにおいてユーザにより入力された報告内容を示す報告情報２８Ａを記憶する。 The report information storage unit 28 stores, for example, report information 28A indicating the report content input by the user on the report page.

ユーザ情報記憶部３０は、例えば、ユーザごとの属性を示す情報を含むユーザ情報３０Ａを記憶する。モデル記憶部３２は、学習部２４により生成された判定モデル３２Ａを記憶する。 The user information storage unit 30 stores, for example, user information 30A including information indicating attributes for each user. The model storage unit 32 stores the determination model 32A generated by the learning unit 24.

サービス提供装置１の各機能部は、例えば、ＣＰＵ（Central Processing Unit）等のハードウェアプロセッサがプログラム（ソフトウェア）を実行することにより実現される。サービス提供装置１は、各機能部を実現するための複数のプロセッサを備えてもよい。また、これらの各機能部のうち一部または全部は、ＬＳＩ（Large Scale Integration）やＡＳＩＣ（Application Specific Integrated Circuit）、ＦＰＧＡ（Field-Programmable Gate Array）、ＧＰＵ（Graphics Processing Unit）等のハードウェア（回路部；circuitryを含む）によって実現されてもよいし、ソフトウェアとハードウェアの協働によって実現されてもよい。 Each functional unit of the service providing device 1 is realized, for example, by executing a program (software) by a hardware processor such as a CPU (Central Processing Unit). The service providing device 1 may include a plurality of processors for realizing each functional unit. In addition, some or all of these functional units are hardware such as LSI (Large Scale Integration), ASIC (Application Specific Integrated Circuit), FPGA (Field-Programmable Gate Array), GPU (Graphics Processing Unit), etc. It may be realized by the circuit unit (including circuitry), or it may be realized by the cooperation of software and hardware.

サービス提供装置１の各記憶部は、例えば、ＲＡＭ（Random Access Memory）、ＲＯＭ（Read Only Memory）、ＨＤＤ（Hard Disk Drive）、フラッシュメモリ、またはこれらのうち複数が組み合わされたハイブリッド型記憶装置等により実現される。サービス提供装置１の各記憶部は、単一の装置により実現されてもよい。また、サービス提供装置１の各記憶部の一部または全部は、ＮＡＳや外部のストレージサーバ等、サービス提供装置１がアクセス可能な外部装置であってもよい。 Each storage unit of the service providing device 1 is, for example, a RAM (Random Access Memory), a ROM (Read Only Memory), an HDD (Hard Disk Drive), a flash memory, or a hybrid storage device in which a plurality of these is combined. Is realized by. Each storage unit of the service providing device 1 may be realized by a single device. Further, a part or all of each storage unit of the service providing device 1 may be an external device such as NAS or an external storage server that can be accessed by the service providing device 1.

［報告情報の取得処理］
以下、サービス提供装置１の処理について説明する。まず、サービス提供装置１により実施される不適切コメントの報告情報の取得処理について説明する。図３は、サービス提供装置１により生成され、端末装置Ｔに表示されるニュースページＰ１の一例を示す図である。ニュースページＰ１は、例えば、ニュース記事が表示される記事領域Ｒ１と、この記事に対して投稿された第１のコメントが表示される第１コメント領域Ｒ２と、第２のコメントが表示される第２コメント領域Ｒ３とを含む。尚、ニュースページＰ１は、３つ以上のコメントを表示するための領域が設定されてもよい。 [Acquisition process of report information]
Hereinafter, the processing of the service providing device 1 will be described. First, the process of acquiring the report information of inappropriate comments carried out by the service providing device 1 will be described. FIG. 3 is a diagram showing an example of a news page P1 generated by the service providing device 1 and displayed on the terminal device T. On the news page P1, for example, an article area R1 in which a news article is displayed, a first comment area R2 in which a first comment posted for this article is displayed, and a second comment are displayed. 2 Includes comment area R3. The news page P1 may be set with an area for displaying three or more comments.

この第１コメント領域Ｒ２および第２コメント領域Ｒ３の各々には、コメントを閲覧したユーザによるコメントが不適切であることを示す報告を受け付けるボタンＢ１と、コメントに対するポジティブな評価の入力を受け付けるボタンＢ２（「良い」ボタン）と、コメントに対するネガティブな評価の入力を受け付けるボタンＢ３（「悪い」ボタン）とが含まれる。ボタンＢ２およびＢ３の各々には、コメントを閲覧した全ユーザがボタンを押下した合計回数があわせて表示されている。ユーザによるボタンＢ２またはＢ３の押下に基づいて、この合計回数が１つ増大される。 In each of the first comment area R2 and the second comment area R3, a button B1 for receiving a report indicating that the comment by the user who viewed the comment is inappropriate and a button B2 for receiving a positive evaluation for the comment are received. ("Good" button) and button B3 ("bad" button) that accepts input of negative evaluation for comments are included. Each of the buttons B2 and B3 also displays the total number of times all users who viewed the comment pressed the button. This total number is incremented by 1 based on the user pressing button B2 or B3.

ユーザが端末装置Ｔを操作することにより、ボタンＢ１が押下されると、ユーザによる報告内容の入力を受け付ける報告ページが端末装置Ｔに表示される。図４は、図３における第２コメント領域Ｒ３のボタンＢ１が押下された場合に端末装置Ｔに表示される報告ページＰ２の一例を示す図である。報告ページＰ２は、不適切コメントの種類の指定を受け付ける。不適切コメントの種類には、例えば、「性差別」、「国籍差別」、「人種差別」等が含まれる。図４に示す例では、ユーザが不適切コメントの種類ごとに設定されたラジオボタンを選択することにより、種類の指定を受け付けるようになっている。図４に示す例では、「誹謗・中傷」がユーザにより指定された場合を示している。 When the button B1 is pressed by the user operating the terminal device T, a report page for accepting the input of the report content by the user is displayed on the terminal device T. FIG. 4 is a diagram showing an example of a report page P2 displayed on the terminal device T when the button B1 of the second comment area R3 in FIG. 3 is pressed. The report page P2 accepts the specification of the type of inappropriate comment. Types of inappropriate comments include, for example, "sexism," "nationality discrimination," "racism," and the like. In the example shown in FIG. 4, the user selects a radio button set for each type of inappropriate comment to accept the designation of the type. In the example shown in FIG. 4, the case where "slandering / slander" is specified by the user is shown.

また、報告ページＰ２は、コメント内における不適切な表現のユーザによる指定を受け付ける。図４に示す例では、コメント「Ｂの声が嫌い。二度と出るな。」における「声が嫌い」との表現がユーザにより指定された場合を示している。また、報告ページＰ２は、不適切な内容の詳細として、コメントに対するユーザの意見等のフリー入力（テキスト入力）を受け付ける。 In addition, the report page P2 accepts the user's designation of inappropriate expressions in the comment. The example shown in FIG. 4 shows a case where the expression "I hate the voice" in the comment "I hate the voice of B. Never come out again." Is specified by the user. In addition, the report page P2 accepts free input (text input) such as a user's opinion on a comment as details of inappropriate contents.

ユーザが端末装置Ｔを操作することにより、上記のような報告内容を入力した後、「送信」ボタンを押下することで、サービス提供装置１に不適切コメントに関する情報を報告することが出来る。サービス提供装置１は、端末装置Ｔから受信した報告内容を報告情報記憶部２８に記憶させる。 By operating the terminal device T, the user inputs the above-mentioned report contents, and then presses the "send" button, so that the information regarding the inappropriate comment can be reported to the service providing device 1. The service providing device 1 stores the report content received from the terminal device T in the report information storage unit 28.

図５は、報告情報記憶部２８に記憶された報告情報２８Ａの一例を示す図である。この報告情報２８Ａには、ニュース記事を識別する「記事ＩＤ」と、コメントを識別する「コメントＩＤ」と、不適切コメントの報告を行ったユーザ（報告者）を識別する「ユーザＩＤ」と、上記の報告ページＰ２においてユーザにより指定された不適切コメントの種類を示す「種類」と、ユーザにより指定された不適切な表現を示す「不適切な表現」と、ユーザにより入力された不適切な内容の詳細を示す「不適切な内容の詳細」とが関連付けされた１以上のデータが含まれる。以上により、サービス提供装置１は、不適切コメントに関する報告情報を収集することができる。 FIG. 5 is a diagram showing an example of the report information 28A stored in the report information storage unit 28. The report information 28A includes an "article ID" that identifies a news article, a "comment ID" that identifies a comment, and a "user ID" that identifies a user (reporter) who has reported an inappropriate comment. "Type" indicating the type of inappropriate comment specified by the user on page P2 of the above report, "inappropriate expression" indicating the inappropriate expression specified by the user, and inappropriate expression entered by the user. Contains one or more data associated with "inappropriate content details" indicating content details. As described above, the service providing device 1 can collect the report information regarding the inappropriate comment.

［不適切コメントの判定処理（第１例）］
以下、サービス提供装置１により実施される不適切コメントの判定処理の一例について説明する。図６は、サービス提供装置１による不適切コメントの判定処理の第１例である第１判定処理の流れを示すフローチャートである。この第１判定処理は、例えば、日ごと、週ごと等の所定のタイミングで実施されるバッチ処理により実施されてよい。 [Judgment processing of inappropriate comments (1st example)]
Hereinafter, an example of the inappropriate comment determination process performed by the service providing device 1 will be described. FIG. 6 is a flowchart showing the flow of the first determination process, which is a first example of the determination process of inappropriate comments by the service providing device 1. This first determination process may be performed by a batch process performed at a predetermined timing such as daily or weekly.

まず、サービス提供装置１は、コンテンツ情報記憶部２６に記憶されたコメント情報２６Ｂから１件のコメントを抽出し、抽出したコメントに対する報告情報を報告情報記憶部２８から抽出する（Ｓ１０１）。 First, the service providing device 1 extracts one comment from the comment information 26B stored in the content information storage unit 26, and extracts the report information for the extracted comment from the report information storage unit 28 (S101).

図７は、コンテンツ情報記憶部２６に記憶されたコメント情報２６Ｂの一例を示す図である。サービス提供装置１は、例えば、コメント情報２６Ｂから、コメントＩＤ「Ｃ２」のコメント「Ｂの声が嫌い。二度と出るな。」を抽出し、このコメントＩＤ「Ｃ２」を検索キーとして、図５に示される報告情報２８Ａの中から報告情報を抽出する。 FIG. 7 is a diagram showing an example of comment information 26B stored in the content information storage unit 26. For example, the service providing device 1 extracts the comment "I hate the voice of B. Never come out again" of the comment ID "C2" from the comment information 26B, and uses this comment ID "C2" as a search key in FIG. Report information is extracted from the indicated report information 28A.

次に、サービス提供装置１は、抽出したコメントに対する報告件数が、所定の閾値以上であるか否かを判定する（Ｓ１０３）。ある程度の件数の報告がなされたコメントは、不適切コメントである可能性が高いことが想定される。このため、サービス提供装置１は、報告件数に基づいて、コメントが不適切コメントであるか否かの一次判断を行う。例えば、抽出したコメントに対する報告件数が、閾値（例えば、１０件）以上であるか否かを判定する。 Next, the service providing device 1 determines whether or not the number of reports for the extracted comments is equal to or greater than a predetermined threshold value (S103). It is assumed that comments for which a certain number of reports have been made are likely to be inappropriate comments. Therefore, the service providing device 1 makes a primary determination as to whether or not the comment is an inappropriate comment based on the number of reports. For example, it is determined whether or not the number of reports for the extracted comments is equal to or greater than the threshold value (for example, 10).

サービス提供装置１は、抽出したコメントに対する報告件数が所定の閾値以上ではないと判定した場合、このコメントが不適切コメントと断定することはできないため、このコメントに対して「適切ラベル」を付与する（Ｓ１０５）。 If the service providing device 1 determines that the number of reports for the extracted comment is not equal to or more than a predetermined threshold value, it cannot conclude that this comment is an inappropriate comment, and therefore assigns an "appropriate label" to this comment. (S105).

一方、サービス提供装置１は、抽出したコメントに対する報告件数が所定の閾値以上であると判定した場合、報告内容の信頼性を判定するための以下の処理を行う。あるコメントに対する報告が特定の偏った思想、見解等を持つ一部のユーザにより行われているような場合には、この報告の信頼性が高くない場合がある。例えば、政治的なニュース記事に対するコメントに対して、このコメントとは異なる思想、見解等を持つユーザにより報告がなされている場合には、報告件数は十分であっても、世間一般の判断では必ずしも不適切コメントとは言えない場合がある。このため、サービス提供装置１は、報告を行ったユーザの属性の分布を参照し、この属性に偏りがあるか否かを判定し（Ｓ１０７）、この判定結果（偏り度合い）に基づいて、抽出したコメントが不適切であるか否かを判定する。 On the other hand, when it is determined that the number of reports for the extracted comments is equal to or greater than a predetermined threshold value, the service providing device 1 performs the following processing for determining the reliability of the report contents. If the report for a comment is made by some users who have a specific biased idea, opinion, etc., the reliability of this report may not be high. For example, if a comment on a political news article is reported by a user who has a different idea, opinion, etc. from this comment, even if the number of reports is sufficient, it is not always judged by the general public. It may not be an inappropriate comment. Therefore, the service providing device 1 refers to the distribution of the attributes of the user who made the report, determines whether or not the attributes are biased (S107), and extracts based on the determination result (degree of bias). Determine if the comment you made is inappropriate.

サービス提供装置１は、ユーザの属性の分布に偏りが無いと判定した場合、コメントに対する報告内容の信頼性が高いと判定して、このコメントに対して「不適切ラベル」を付与する（Ｓ１０９）。一方、サービス提供装置１は、ユーザの属性の分布に偏りがあると判定した場合、コメントに対する報告内容の信頼性が低いと判定して、このコメントに対して「準不適切ラベル」を付与する（Ｓ１１１）。 When the service providing device 1 determines that the distribution of the user's attributes is not biased, it determines that the report content for the comment is highly reliable, and assigns an "inappropriate label" to the comment (S109). .. On the other hand, when the service providing device 1 determines that the distribution of user attributes is biased, it determines that the reliability of the report content for the comment is low, and assigns a "quasi-inappropriate label" to this comment. (S111).

図８は、ユーザ情報記憶部３０に記憶されたユーザ情報３０Ａの一例を示す図である。サービス提供装置１は、例えば、抽出したコメントに対して報告を行ったユーザのユーザＩＤを図５に示される報告情報２８Ａから取得し、この取得したユーザＩＤを検索キーとして、このユーザの属性に関する情報をユーザ情報３０Ａから取得する。そして、サービス提供装置１は、取得したユーザの属性の分布に偏りがあるか否かを判定する。サービス提供装置１は、例えば、ユーザの属性が、「男性」、「３０代」にのみ偏っているような場合にはユーザの属性の分布に偏りがあると判定し、コメントに対する報告内容の信頼性が低いと判定して、このコメントに対して「準不適切ラベル」を付与する。 FIG. 8 is a diagram showing an example of the user information 30A stored in the user information storage unit 30. For example, the service providing device 1 acquires the user ID of the user who reported the extracted comment from the report information 28A shown in FIG. 5, and uses the acquired user ID as a search key to relate to the attribute of this user. Information is acquired from the user information 30A. Then, the service providing device 1 determines whether or not the distribution of the acquired user attributes is biased. For example, when the user's attributes are biased only to "male" and "30's", the service providing device 1 determines that the distribution of the user's attributes is biased, and trusts the content of the report for the comment. Judging that the character is low, a "quasi-inappropriate label" is given to this comment.

尚、サービス提供装置１は、上記のユーザの属性の分布の偏りの判定を行う場合には、例えば、ユーザの属性に基づいてユーザベクトル（特徴情報）を生成するようにしてよい。ユーザベクトルは、ユーザの属性を要素に置き替えた疎ベクトルで表されるベクトルデータであってよい。あるいは、ユーザベクトルは、ウェブ上でのユーザの行動履歴や、ユーザの属性情報などをもとに生成されてもよい。サービス提供装置１は、このユーザベクトルのベクトル間距離に基づいて、取得したユーザの属性の分布に偏りがあるか否かを判定してよい。 When determining the bias of the distribution of the user attributes, the service providing device 1 may generate a user vector (feature information) based on the user attributes, for example. The user vector may be vector data represented by a sparse vector in which the user's attributes are replaced with elements. Alternatively, the user vector may be generated based on the user's behavior history on the Web, the user's attribute information, and the like. The service providing device 1 may determine whether or not the distribution of the acquired user attributes is biased based on the inter-vector distance of the user vector.

尚、サービス提供装置１は、上記のようなユーザの属性の偏り度合いに基づいて、「準不適切ラベル」をさらに細かく設定してもよい。例えば、サービス提供装置１は、ユーザの属性の偏り度合いに基づいて、「準不適切ラベル」を０から１の数値で表すように設定してもよい。この数値で表された「準不適切ラベル」は、例えば、１に近いほど、不適切である可能性が高いことを示すように設定されてよい。 The service providing device 1 may set the "quasi-inappropriate label" in more detail based on the degree of bias of the user's attributes as described above. For example, the service providing device 1 may be set so that the "quasi-inappropriate label" is represented by a numerical value from 0 to 1 based on the degree of bias of the user's attributes. The "quasi-inappropriate label" represented by this numerical value may be set to indicate, for example, that the closer it is to 1, the higher the possibility of being inappropriate.

尚、サービス提供装置１は、コメント情報２６Ｂに含まれる「良い」ボタンＢ２が押下された回数、「悪い」ボタンＢ３が押下された回数の情報を用いて、上記のラベルの判定を行ってもよい。例えば、サービス提供装置１は、「悪い」ボタンＢ３が押下された回数が所定の閾値以上であるコメントについては、「不適切ラベル」を付与してもよい。 Even if the service providing device 1 determines the above label by using the information on the number of times the "good" button B2 is pressed and the number of times the "bad" button B3 is pressed included in the comment information 26B. good. For example, the service providing device 1 may give an "inappropriate label" to a comment in which the number of times the "bad" button B3 is pressed is equal to or greater than a predetermined threshold value.

サービス提供装置１は、上記の判定結果に関する情報をコメント情報２６Ｂに記憶させてもよい。図９は、判定結果に関する情報が追加されたコメント情報２６Ｂの一例を示す図である。図９に示す例では、ラベル「適切」、「不適切」、「準不適切」が各データに追加されている。 The service providing device 1 may store the information regarding the above determination result in the comment information 26B. FIG. 9 is a diagram showing an example of comment information 26B to which information regarding the determination result is added. In the example shown in FIG. 9, the labels "appropriate", "inappropriate", and "quasi-inappropriate" are added to each data.

サービス提供装置１は、以上の処理（Ｓ１０１からＳ１１１）を、コメント情報２６Ｂに含まれるコメントの各々に対して繰り返し実施する。 The service providing device 1 repeatedly performs the above processes (S101 to S111) for each of the comments included in the comment information 26B.

次に、サービス提供装置１は、ラベルが付与されたコメント（コメントと、ラベルとの組）を学習データとして機械学習を行い、判定モデル３２Ａを生成する（Ｓ１１３）。図１０は、学習データの一例を示す図である。図１０では、図９に示すコメント情報２６Ｂから抽出された「コメント」と「ラベル」との組を学習データとする例を示している。サービス提供装置１は、このような学習データを用いて機械学習を行うことにより生成した判定モデル３２Ａを、モデル記憶部３２に記憶させる。サービス提供装置１の第２判定部２５は、ユーザにより新たなコメントが入力されると、上記において生成されてモデル記憶部３２に記憶された判定モデル３２Ａを用いて、入力されたコメントが不適切コメントであるか否かを判定することができる。以上により、サービス提供装置１は本フローチャートの処理を終了する。 Next, the service providing device 1 performs machine learning using the labeled comment (a set of the comment and the label) as training data, and generates the determination model 32A (S113). FIG. 10 is a diagram showing an example of learning data. FIG. 10 shows an example in which a set of a “comment” and a “label” extracted from the comment information 26B shown in FIG. 9 is used as learning data. The service providing device 1 stores the determination model 32A generated by performing machine learning using such learning data in the model storage unit 32. When a new comment is input by the user, the second determination unit 25 of the service providing device 1 uses the determination model 32A generated above and stored in the model storage unit 32, and the input comment is inappropriate. It can be determined whether or not it is a comment. As described above, the service providing device 1 ends the processing of this flowchart.

尚、サービス提供装置１は、不適切コメントと、不適切コメントの種類との関係を学習するようにしてもよい。また、サービス提供装置１は、コメント全文を学習データとするのではなく、ユーザにより指定された「不適切な表現」の箇所のみを学習データとするようにしてもよい。また、サービス提供装置１は、全てのコメントを学習データとするのではなく、報告内容の信頼性が高いコメントのみを学習データとするようにしてもよい。また、サービス提供装置１は、ニュース記事の本文も学習データに組み入れるようにしてもよい。 The service providing device 1 may learn the relationship between the inappropriate comment and the type of the inappropriate comment. Further, the service providing device 1 may use only the part of the "inappropriate expression" specified by the user as the learning data, instead of using the entire comment as the learning data. Further, the service providing device 1 may use only the comments with high reliability of the report content as the learning data instead of using all the comments as the learning data. Further, the service providing device 1 may incorporate the text of the news article into the learning data.

［不適切コメントの判定処理（第２例）］
以下、サービス提供装置１により実施される不適切コメントの判定処理の他の例について説明する。図１１は、サービス提供装置１による不適切コメントの判定処理の第２例である第２判定処理の流れを示すフローチャートである。この第２判定処理は、例えば、日ごと、週ごと等の所定のタイミングで実施されるバッチ処理により実施されてよい。 [Judgment processing of inappropriate comments (second example)]
Hereinafter, another example of the inappropriate comment determination process performed by the service providing device 1 will be described. FIG. 11 is a flowchart showing the flow of the second determination process, which is the second example of the determination process of the inappropriate comment by the service providing device 1. This second determination process may be performed by a batch process performed at a predetermined timing such as daily or weekly.

まず、サービス提供装置１は、コンテンツ情報記憶部２６に記憶されたコメント情報２６Ｂから１件のコメントを抽出し、抽出したコメントに対する報告情報を報告情報記憶部２８から抽出する（Ｓ２０１）。 First, the service providing device 1 extracts one comment from the comment information 26B stored in the content information storage unit 26, and extracts the report information for the extracted comment from the report information storage unit 28 (S201).

次に、サービス提供装置１は、抽出したコメントに対する報告件数が、所定の閾値以上であるか否かを判定する（Ｓ２０３）。 Next, the service providing device 1 determines whether or not the number of reports for the extracted comments is equal to or greater than a predetermined threshold value (S203).

サービス提供装置１は、抽出したコメントに対する報告件数が所定の閾値以上ではないと判定した場合、不適切コメントと断定することはできないため、このコメントに対して「適切ラベル」を付与する（Ｓ２０５）。 If the service providing device 1 determines that the number of reports for the extracted comment is not equal to or more than a predetermined threshold value, it cannot conclude that the comment is inappropriate, and therefore assigns an "appropriate label" to this comment (S205). ..

一方、サービス提供装置１は、抽出したコメントに対する報告件数が所定の閾値以上であると判定した場合、報告内容の信頼性を判定するための以下の処理を行う。不適切コメントはその種類に応じて、報告を行うユーザの属性の分布に傾向が表れることが想定される。例えば、「性差別（女性差別）」に関する不適切コメントに対する報告は、このコメントに対して嫌悪感を抱きやすい「女性」のユーザによりなされる傾向がある。このため、「性差別（女性差別）」に関する不適切コメントに対する報告を行ったユーザの属性の多くが「女性」である場合には、その報告の信頼性が高いことが想定される。一方、報告を行ったユーザの属性の多くが「男性」である場合には、その報告の信頼性が低いことが想定される。このため、サービス提供装置１は、不適切コメントの種類ごとに基準となる参照分布を予め定義し、実際に報告を行ったユーザの属性の分布がこの参照分布に類似しているか否かを判定し（Ｓ２０７）、この判定結果（類似性）に基づいて、抽出したコメントが不適切であるか否かを判定する。 On the other hand, when it is determined that the number of reports for the extracted comments is equal to or greater than a predetermined threshold value, the service providing device 1 performs the following processing for determining the reliability of the report contents. It is assumed that inappropriate comments tend to appear in the distribution of attributes of users who report, depending on the type. For example, reports on inappropriate comments regarding "sexism (discrimination against women)" tend to be made by "female" users who are prone to dislike this comment. Therefore, if many of the attributes of the user who made a report on inappropriate comments regarding "sex discrimination (discrimination against women)" are "female", it is assumed that the report is highly reliable. On the other hand, if many of the attributes of the user who made the report are "male", it is assumed that the reliability of the report is low. Therefore, the service providing device 1 defines a reference distribution as a reference for each type of inappropriate comment in advance, and determines whether or not the distribution of the attributes of the user who actually made the report is similar to this reference distribution. (S207), and based on this determination result (similarity), it is determined whether or not the extracted comment is inappropriate.

尚、サービス提供装置１は、上記のユーザの属性の分布の類似性の判定を行う場合には、例えば、ユーザの属性に基づいてユーザベクトル（特徴情報）を生成し、このユーザベクトルのベクトル間距離に基づいて、取得したユーザの属性の分布が参照分布に類似しているか否かを判定してよい。 When the service providing device 1 determines the similarity of the distribution of the user attributes, for example, the service providing device 1 generates a user vector (feature information) based on the user attributes, and the vector between the user vectors. Based on the distance, it may be determined whether or not the acquired distribution of the user's attributes is similar to the reference distribution.

サービス提供装置１は、報告を行ったユーザの属性の分布が参照分布に類似していると判定した場合、コメントに対する報告内容の信頼性が高いと判定して、このコメントに対して「不適切ラベル」を付与する（Ｓ２０９）。一方、サービス提供装置１は、報告を行ったユーザの属性の分布がこの参照分布に類似していないと判定した場合、コメントに対する報告内容の信頼性が低いと判定して、このコメントに対して「準不適切ラベル」を付与する（Ｓ２１１）。 When the service providing device 1 determines that the distribution of the attributes of the user who made the report is similar to the reference distribution, it determines that the content of the report is highly reliable for the comment, and "inappropriate" for this comment. "Label" is given (S209). On the other hand, when the service providing device 1 determines that the distribution of the attributes of the user who made the report does not resemble this reference distribution, the service providing device 1 determines that the reliability of the report content for the comment is low, and the service providing device 1 determines that the reliability of the report content is low. A "quasi-inappropriate label" is given (S211).

尚、サービス提供装置１は、上記のようなユーザの属性の分布の類似性に基づいて、「準不適切ラベル」をさらに細かく設定してもよい。例えば、サービス提供装置１は、ユーザの属性の分布の類似性に基づいて、「準不適切ラベル」を０から１の数値で表すように設定してもよい。この数値で表された「準不適切ラベル」は、例えば、１に近いほど、不適切である可能性が高いことを示すように設定されてよい。 The service providing device 1 may set the "quasi-inappropriate label" in more detail based on the similarity of the distribution of the user's attributes as described above. For example, the service providing device 1 may be set so that the "quasi-inappropriate label" is represented by a numerical value from 0 to 1 based on the similarity of the distribution of the user's attributes. The "quasi-inappropriate label" represented by this numerical value may be set to indicate, for example, that the closer it is to 1, the higher the possibility of being inappropriate.

サービス提供装置１は、以上の処理（Ｓ２０１からＳ２１１）を、コメント情報２６Ｂに含まれるコメントの各々に対して繰り返し実施する。 The service providing device 1 repeatedly executes the above processes (S201 to S211) for each of the comments included in the comment information 26B.

次に、サービス提供装置１は、ラベルが付与されたコメント（コメントと、ラベルとの組）を学習データとして機械学習を行い、判定モデル３２Ａを生成する（Ｓ２１３）。サービス提供装置１は、生成した判定モデル３２Ａを、モデル記憶部３２に記憶させる。以上により、サービス提供装置１は本フローチャートの処理を終了する。 Next, the service providing device 1 performs machine learning using a comment with a label (a set of the comment and the label) as training data, and generates a determination model 32A (S213). The service providing device 1 stores the generated determination model 32A in the model storage unit 32. As described above, the service providing device 1 ends the processing of this flowchart.

［ニュースページの生成処理］
以下、サービス提供装置１により実施されるニュースページの生成処理の一例について説明する。図１２は、サービス提供装置１によるニュースページの生成処理の流れを示すフローチャートである。このニュースページの生成処理は、端末装置Ｔからのニュースページのリクエストを受信するたびに実施される。 [News page generation process]
Hereinafter, an example of the news page generation process performed by the service providing device 1 will be described. FIG. 12 is a flowchart showing a flow of a news page generation process by the service providing device 1. This news page generation process is executed every time a news page request from the terminal device T is received.

まず、サービス提供装置１は、ユーザの操作に基づいて端末装置Ｔにより送信されたニュースページを要求するリクエストを受信する（Ｓ３０１）。 First, the service providing device 1 receives a request for a news page transmitted by the terminal device T based on the operation of the user (S301).

次に、サービス提供装置１は、要求されたニュースページの記事情報と、この記事情報に対して投稿されたコメント情報とを、コンテンツ情報記憶部２６から取得する（Ｓ３０３）。 Next, the service providing device 1 acquires the article information of the requested news page and the comment information posted for this article information from the content information storage unit 26 (S303).

次に、サービス提供装置１は、取得したコメントの掲載順位を仮決定する（Ｓ３０５）。サービス提供装置１は、例えば、コメントのクリック率や、コメントに対する評価、コメントに含まれる表現等に基づいて、コメントの掲載順位を仮決定する。例えば、サービス提供装置１は、コメントのクリック率が高い順に、コメントの掲載順位を仮決定する。 Next, the service providing device 1 tentatively determines the posting order of the acquired comments (S305). The service providing device 1 tentatively determines the posting order of comments based on, for example, the click rate of comments, the evaluation of comments, the expressions included in comments, and the like. For example, the service providing device 1 tentatively determines the posting order of comments in descending order of the click rate of comments.

次に、サービス提供装置１は、掲載順位が仮決定されたコメントの中から１件のコメントを抽出し、抽出したコメントに不適切ラベルあるいは準不適切ラベルが付与されているか否かを判定する（Ｓ３０７）。サービス提供装置１は、抽出したコメントに不適切ラベルあるいは準不適切ラベルが付与されていると判定した場合、このコメントの掲載順位を下げるか、あるいは掲載対象から除外するように、コメントの掲載順位を決定する（Ｓ３０９）。一方、サービス提供装置１は、抽出したコメントに不適切ラベルあるいは準不適切ラベルが付与されていないと判定した場合、すなわち、抽出したコメントが適切コメントであると判定した場合、上記の掲載順位を下げる等の処理は行わない。 Next, the service providing device 1 extracts one comment from the comments whose publication order is tentatively determined, and determines whether or not the extracted comment is given an inappropriate label or a quasi-inappropriate label. (S307). When the service providing device 1 determines that the extracted comment is given an inappropriate label or a quasi-inappropriate label, the posting order of the comment is lowered or excluded from the posting target. Is determined (S309). On the other hand, when the service providing device 1 determines that the extracted comment is not given an inappropriate label or a quasi-inappropriate label, that is, when it determines that the extracted comment is an appropriate comment, the service providing device 1 determines the above-mentioned ranking. No processing such as lowering is performed.

次に、サービス提供装置１は、全コメントに対する処理が完了した否かを判定する（Ｓ３１１）。サービス提供装置１は、全コメントに対する処理が完了していないと判定した場合、未処理のコメントを抽出し、上記の処理を繰り返す。一方、サービス提供装置１は、全コメントに対する処理が完了したと判定した場合、取得した記事情報と、コメント情報に含まれるコメントとを含むニュースページを生成し、端末装置Ｔに送信する（Ｓ３１３）。このニュースページにおいて、コメントは上記の掲載順位決定処理により決定された掲載順位で掲載される。 Next, the service providing device 1 determines whether or not the processing for all comments is completed (S311). When the service providing device 1 determines that the processing for all the comments has not been completed, the service providing device 1 extracts the unprocessed comments and repeats the above processing. On the other hand, when it is determined that the processing for all the comments is completed, the service providing device 1 generates a news page including the acquired article information and the comment included in the comment information, and transmits the news page to the terminal device T (S313). .. On this news page, comments will be posted in the ranking determined by the above ranking determination process.

図１３は、サービス提供装置１により生成されたニュースページＰ１の一例を示す図である。ニュースページＰ１に含まれるコメントに関して、不適切ラベルあるいは準不適切ラベルが付与されたコメントは、掲載順位が下位に設定されているか、あるいは掲載対象から除外されている。以上により、本フローチャートの処理を終了する。 FIG. 13 is a diagram showing an example of the news page P1 generated by the service providing device 1. Regarding the comments included in the news page P1, comments with inappropriate labels or quasi-inappropriate labels are either set in the lower rank or excluded from the publication target. This completes the processing of this flowchart.

［新たに投稿されたコメントの不適切判定処理］
以下、サービス提供装置１により実施される新たに投稿されたコメントの不適切判定処理の一例について説明する。サービス提供装置１は、例えば、ユーザによる端末装置Ｔの操作に基づいてニュースサイトのページのコメント入力欄に対して新たなコメントが投稿され、この新たなコメントを投稿するリクエストを受信した場合、モデル記憶部３２に記憶された判定モデル３２Ａを用いて、この新たなコメントが不適切であるか否かを判定する。このような判定により、新たに投稿されたコメントが不適切であるか否かをリアルタイムで判定し、不適切であると判定されたコメントの掲載順位を下位に設定するか、あるいは掲載対象から除外するように制御することができる。 [Inappropriate judgment processing for newly posted comments]
Hereinafter, an example of the inappropriateness determination process of the newly posted comment performed by the service providing device 1 will be described. The service providing device 1 is a model when, for example, a new comment is posted to the comment input field of the news site page based on the operation of the terminal device T by the user, and a request to post the new comment is received. Using the determination model 32A stored in the storage unit 32, it is determined whether or not this new comment is inappropriate. Based on such a judgment, it is judged in real time whether or not the newly posted comment is inappropriate, and the ranking of the comment judged to be inappropriate is set to the lower rank or excluded from the posting target. Can be controlled to do so.

例えば、サービス提供装置１は、新たなコメントが不適切ではないと判定した場合、このコメントに対して「適切ラベル」を付与する。一方、サービス提供装置１は、投稿された新たなコメントが不適切であると判定した場合、このコメントに対して「不適切ラベル」を付与する。サービス提供装置１は、ラベルを付与したコメントを、コンテンツ情報記憶部２６のコメント情報２６Ｂに記憶させる。 For example, when the service providing device 1 determines that the new comment is not inappropriate, it assigns an "appropriate label" to this comment. On the other hand, when the service providing device 1 determines that the new comment posted is inappropriate, it assigns an "inappropriate label" to this comment. The service providing device 1 stores the labeled comment in the comment information 26B of the content information storage unit 26.

尚、サービス提供装置１は、不適切であるか否かについて断定できないコメントに対しては、「準不適切ラベル」を付与してもよい。また、サービス提供装置１は、判定モデル３２Ａを用いた判定結果に基づいて、「準不適切ラベル」をさらに細かく設定してもよい。例えば、サービス提供装置１は、「準不適切ラベル」を０から１の数値で表すように設定してもよい。この数値で表された「準不適切ラベル」は、例えば、１に近いほど、不適切である可能性が高いことを示すように設定されてよい。 The service providing device 1 may give a "quasi-inappropriate label" to a comment for which it cannot be determined whether or not it is inappropriate. Further, the service providing device 1 may set the "quasi-inappropriate label" in more detail based on the determination result using the determination model 32A. For example, the service providing device 1 may be set so that the "quasi-inappropriate label" is represented by a numerical value from 0 to 1. The "quasi-inappropriate label" represented by this numerical value may be set to indicate, for example, that the closer it is to 1, the higher the possibility of being inappropriate.

以上説明した実施形態のサービス提供装置１によれば、不適切コメントの検知を迅速且つ高精度で行うことが可能である。また、実施形態のサービス提供装置１によれば、不適切なコメントの掲載順位を下げる、あるいは掲載対象から除外することで、不適切なコメントが閲覧される機会を自動的に低減させることが可能である。 According to the service providing device 1 of the embodiment described above, it is possible to detect inappropriate comments quickly and with high accuracy. Further, according to the service providing device 1 of the embodiment, it is possible to automatically reduce the chance of viewing inappropriate comments by lowering the posting order of inappropriate comments or excluding them from the posting target. Is.

尚、サービス提供装置１は、コメントに基づいてコメントベクトル（特徴情報）を生成するようにしてよい。コメントベクトルは、分散表現化されたでベクトルデータであってよい。サービス提供装置１は、既知の不適切コメントのコメントベクトルと、新たなコメントのコメントベクトルとのベクトル間距離に基づいて、新たなコメントが、不適切コメントであるか否かを判定してよい。 The service providing device 1 may generate a comment vector (feature information) based on the comment. The comment vector may be vector data in a distributed representation. The service providing device 1 may determine whether or not the new comment is an inappropriate comment based on the distance between the vector of the comment vector of the known inappropriate comment and the comment vector of the new comment.

以上、本発明を実施するための形態について実施形態を用いて説明したが、本発明はこうした実施形態に何等限定されるものではなく、本発明の要旨を逸脱しない範囲内において種々の変形及び置換を加えることができる。 Although the embodiments for carrying out the present invention have been described above using the embodiments, the present invention is not limited to these embodiments, and various modifications and substitutions are made without departing from the gist of the present invention. Can be added.

１‥サービス提供装置、１０‥通信部、１２‥コンテンツ取得部、１４‥順位決定部、１６‥ページ生成部、１８‥報告情報処理部、２０‥第１判定部、２２‥ラベル付与部、２４‥学習部、２５‥第２判定部、２６‥コンテンツ情報記憶部、２８‥報告情報記憶部、３０‥ユーザ情報記憶部、３２‥モデル記憶部、Ｔ‥端末装置、ＮＷ‥ネットワーク 1 ... Service providing device, 10 ... Communication unit, 12 ... Content acquisition unit, 14 ... Ranking determination unit, 16 ... Page generation unit, 18 ... Report information processing unit, 20 ... First judgment unit, 22 ... Labeling unit, 24 Learning unit, 25 Second judgment unit, 26 Content information storage unit, 28 Report information storage unit, 30 User information storage unit, 32 Model storage unit, T Terminal device, NW network

Claims

A reception unit that accepts reports indicating that the second content is inappropriate by a user who has viewed the second content, which is a comment posted on the first content.
Based on the report received by the reception unit, the first determination unit that determines whether or not the second content is inappropriate, and the first determination unit.
With
The first determination unit determines whether or not the second content is inappropriate based on the degree of bias in the distribution of the attributes of the user who made the report.
Information processing device.

When the first determination unit determines that the second content is inappropriate, the order of determining the ranking of the second content so as to lower the ranking of the second content or exclude it from the posting target. The decision department and
A page generation unit that generates a web page including the first content and the second content based on the placement order determined by the ranking determination unit.
The information processing apparatus according to claim 1.

When the number of reports received by the reception unit is equal to or greater than the threshold value, the first determination unit determines that the second content is inappropriate.
The information processing device according to claim 1 or 2.

The first determination unit determines that the lower the degree of bias, the higher the reliability of the report by the user.
The information processing device according to any one of claims 1 to 3.

A reception unit that accepts reports indicating that the second content is inappropriate by a user who has viewed the second content, which is a comment posted on the first content.
Based on the report received by the reception unit, the first determination unit that determines whether or not the second content is inappropriate, and the first determination unit.
With
The reception unit receives a report from the user indicating the type of inappropriate content included in the second content, and receives a report.
The first determination unit does not have the second content based on the similarity between the distribution bias of the attribute of the user who made the report and the reference distribution defined in advance for each type of inappropriate content. Determine if it is appropriate,
Information processing device.

A learning unit that learns the relationship between the second content and a label indicating a determination result indicating whether or not the second content by the first determination unit is inappropriate and generates a determination model is further provided. ,
The information processing device according to any one of claims 1 to 5.

A second determination unit for determining whether or not the second content is inappropriate is further provided by using the determination model generated by the learning unit.
The information processing device according to claim 6.

The computer
We accept reports from users who have viewed the second content, which is a comment posted for the first content, indicating that the second content is inappropriate.
Based on the received report, it is determined whether or not the second content is inappropriate.
Information processing method
It is determined whether or not the second content is inappropriate based on the degree of bias in the distribution of the attributes of the user who made the report.
Information processing method.

On the computer
A user who browses the second content, which is a comment posted on the first content, is allowed to accept a report indicating that the second content is inappropriate.
Based on the received report, it is determined whether or not the second content is inappropriate.
It ’s a program
Based on the degree of bias in the distribution of the attributes of the user who made the report, it is determined whether or not the second content is inappropriate.
program.