JP6802332B1

JP6802332B1 - Information processing method and information processing equipment

Info

Publication number: JP6802332B1
Application number: JP2019131133A
Authority: JP
Inventors: 真也綿貫; 伸瀧澤
Original assignee: Macromill Inc
Current assignee: Macromill Inc
Priority date: 2019-07-16
Filing date: 2019-07-16
Publication date: 2020-12-16
Anticipated expiration: 2039-07-16
Also published as: JP2021015549A

Abstract

【課題】ネットリサーチにおいて、自由回答式のアンケートに対するモニタの回答を、簡易にかつ精度良く分類するための技術を提供する。【解決手段】本発明に係る情報処理方法は、アンケートに含まれる自由回答式の質問に対する回答を複数のカテゴリのいずれかに分類する情報処理方法であって、回答者により作成された、質問に対するテキスト情報からなる回答を取得するステップと、回答に含まれる文字列を１文字ずつに分割してそれぞれ符号化するステップと、符号化された回答に、文字レベル畳み込みニューラルネットワークにより生成された学習済みモデルを適用して、回答者による回答が複数のカテゴリのいずれに分類されるかを判定するステップを有する。【選択図】図１PROBLEM TO BE SOLVED: To provide a technique for easily and accurately classifying a monitor's response to a free-answer questionnaire in online research. An information processing method according to the present invention is an information processing method for classifying answers to free-answer questions included in a questionnaire into any of a plurality of categories, and is for questions created by respondents. A step to obtain an answer consisting of text information, a step to divide the character string included in the answer into characters and encode each, and a trained answer generated by a character-level convolutional neural network in the encoded answer. It has the steps of applying a model to determine which of the multiple categories the respondent's response falls into. [Selection diagram] Fig. 1

Description

本発明は、情報処理方法および情報処理装置に関する。 The present invention relates to an information processing method and an information processing device.

従来から、多数の回答者（モニタ）に質問を送ってアンケートを行う調査が行われている。アンケートにより得られた回答を集計、解析することで、マーケティングや商品開発などを行う際の意思決定支援、統計資料の作成、学術研究等に有益な情報を得ることができる。特に近年、ネットワークを介してアンケートを実施するネットリサーチと呼ばれる調査方法が盛んに利用されている。ネットリサーチには、回答者の負担が比較的軽い、回答結果の集計が容易であるなどのメリットがある。 Traditionally, surveys have been conducted in which questions are sent to a large number of respondents (monitors) to conduct a questionnaire. By aggregating and analyzing the answers obtained from the questionnaire, it is possible to obtain useful information for decision support, preparation of statistical data, academic research, etc. when conducting marketing and product development. In particular, in recent years, a survey method called net research, in which a questionnaire is conducted via a network, has been actively used. Internet research has the advantages that the burden on respondents is relatively light and that it is easy to aggregate the response results.

ネットリサーチを含むアンケート調査における質問の種類として、選択式の質問と、自由回答式の質問とがある。選択式の質問とは、事前に用意された選択肢の中から回答者が回答を選択するものであり、択一選択方式や複数回答方式などがある。一方、自由回答式とは、自由記述式やフリーアンサー（ＦＡ）式とも呼ばれる方式であり、回答者が質問への回答を自由に記述する方式である。自由回答式の質問には、数値や単語を記入するだけの比較的単純な場合あれば、文章による記入を求める場合もある。また、選択式の質問で「その他」を選択した場合のみ文章入力を要求するなど、選択式と自由回答式を組み合わせてもよい。 There are two types of questions in questionnaire surveys, including online research: multiple-choice questions and open-answer questions. The multiple-choice question is one in which the respondent selects an answer from the options prepared in advance, and there are an alternative-choice method and a multiple-answer method. On the other hand, the free answer type is also called a free description type or a free answer (FA) type, and is a method in which the respondent freely describes the answer to the question. Free-answer questions may be relatively simple, with numbers or words, or may require written questions. In addition, a combination of a selection type and a free answer type may be used, such as requesting text input only when "Other" is selected in the selection type question.

自由回答式の質問には、選択式の質問と比べて以下のような特徴がある。メリットとしては、選択式では既存の選択肢に含まれる回答しか得られないのに比べて、自由回答式では、調査者も想定していなかった自由な発想に基づく回答を得られることがある。また、文章による詳細な回答が得られる利点もある。一方デメリットとして、選択式は自動的な集計に適しているのに比べて、自由回答式では回答内容の定量的な分析や分類が困難な点がある。また、回答者による誤記のおそれもある。また、同じ物事を指し示す場合でも、回答者によって使う用語が異なる場合もある。 The open-ended questions have the following characteristics compared to the multiple-choice questions. The merit is that the free-answer type can obtain answers based on free ideas that the surveyor did not anticipate, while the multiple-choice type can only obtain the answers included in the existing options. It also has the advantage of providing detailed written answers. On the other hand, as a demerit, the selection formula is suitable for automatic aggregation, whereas the free response formula is difficult to quantitatively analyze and classify the response contents. There is also a risk of clerical errors by the respondents. Also, even when referring to the same thing, the terms used may differ depending on the respondent.

そこで、自由回答の結果を分析するために、コーディングという作業が行われる。これは、回答者から送信された自由回答をカテゴリごとに分類する作業であり、カテゴリに応じた符号（コード）を分類情報として付与する。これにより、自由回答を選択肢化し、定性情報を定量化して分析しやすくできる。コーディングは、自由回答後に行われることから「アフターコーディング（ＡＣ）」とも呼ばれる。 Therefore, in order to analyze the result of the free answer, a work called coding is performed. This is the work of classifying the free responses sent from the respondents by category, and assigns a code corresponding to the category as classification information. This makes it possible to make free answers optional and to quantify and analyze qualitative information. Coding is also called "after-coding (AC)" because it is done after free answer.

コーディングを自然人が行う場合、調査者が自由回答を目視して一つずつ分類を付与する。しかし、このような人手によるコーディングを行うと、時間的および費用的でのコストが増大する。また、判定手法が属人化しがちになる。そのため調査者が複数人いる場合、熟練度や考え方の違いにより判定結果にばらつきが生じる。また、調査者が交代した場合には作業に習熟するまで時間が必要となり、判定結果の一貫性が保たれないおそれがある。そこで、コーディングを自動的に行う試みがなされてきた。 When coding is done by a natural person, the investigator visually checks the free answers and assigns classifications one by one. However, such manual coding increases time and cost. In addition, the judgment method tends to be personalized. Therefore, when there are a plurality of investigators, the judgment result varies depending on the skill level and the difference in the way of thinking. In addition, if the investigator is changed, it takes time to become proficient in the work, and there is a risk that the judgment results will not be consistent. Therefore, attempts have been made to perform coding automatically.

特許文献１には、自由回答式のアンケートの回答を取得して、機械学習モデルを用いて内容を判定する分類装置が記載されている。
また、特許文献２にも、自由回答式のアンケートの回答に機械学習モデルを適用する、潜在的ニーズ導出装置が記載されている。 Patent Document 1 describes a classification device that obtains answers to a free-answer questionnaire and determines the contents using a machine learning model.
In addition, Patent Document 2 also describes a potential needs deriving device that applies a machine learning model to the answers to open-ended questionnaires.

特開２００５−０９２４９２号公報Japanese Unexamined Patent Publication No. 2005-092492 特開２０１６−１１０１８８号公報Japanese Unexamined Patent Publication No. 2016-10188

特許文献１では、日本語の自由回答テキストに対して形態素解析を行い、特定の表現を検出した上でサポートベクタマシンを用いた機械学習モデルを作成し、分類に利用する。
また、特許文献２でも、自由回答欄の文章を形態素解析して単語を抽出し、回答結果を複数の分類に分ける。 In Patent Document 1, a morphological analysis is performed on a free answer text in Japanese, a machine learning model using a support vector machine is created after detecting a specific expression, and the machine learning model is used for classification.
Further, also in Patent Document 2, the sentences in the free answer column are morphologically analyzed to extract words, and the answer results are divided into a plurality of categories.

しかしながら、これら従来技術では、自由回答テキストに対して前処理として形態素解析等の情報処理を行う必要があるため、工数が増加し、相応の演算資源や処理時間が必要となる。また、自由回答テキストに対して形態素解析を行う場合、分かち書きの無い日本語においては単語や句の単位での分離に困難が伴う。また、形態素解析には専門的な技術や知識が必要であり、人的コストの増大や、前処理に時間を取られて分析に集中できないなどの問題がある。また、辞書に含まれない単語がある場合（未登録の単語が出現した場合や、誤記を含む場合など）には解析精度が低下するため、辞書を手動により更新し続ける必要があり、継続的なコスト増加要因となる。また、形態素解析の手法は言語に応じて異なる。そのため、言語ごとに学習モデルを作成する必要が生じる。 However, in these conventional techniques, since it is necessary to perform information processing such as morphological analysis on the free answer text as preprocessing, man-hours increase and appropriate computational resources and processing time are required. In addition, when performing morphological analysis on open-ended text, it is difficult to separate words and phrases in Japanese without word-separation. In addition, morphological analysis requires specialized skills and knowledge, and has problems such as an increase in human cost and the time required for preprocessing, which makes it impossible to concentrate on analysis. In addition, if there are words that are not included in the dictionary (such as when unregistered words appear or when errors are included), the analysis accuracy will decrease, so it is necessary to keep updating the dictionary manually, which is continuous. It becomes a factor of cost increase. In addition, the method of morphological analysis differs depending on the language. Therefore, it becomes necessary to create a learning model for each language.

本発明は上記の課題に鑑みてなされたものであり、その目的は、ネットリサーチにおいて、自由回答式のアンケートに対するモニタの回答を、簡易にかつ精度良く分類するための技術を提供することにある。 The present invention has been made in view of the above problems, and an object of the present invention is to provide a technique for easily and accurately classifying a monitor's response to a free-answer questionnaire in online research. ..

上記の目的を達成するため、本発明では、以下のような構成を採用する。すなわち、
アンケートに含まれる自由回答式の質問に対する回答を複数のカテゴリのいずれかに分類する情報処理方法であって、
回答者により作成された、前記質問に対するテキスト情報からなる前記回答を取得するステップと、
前記回答に含まれる文字列を１文字ずつに分割してそれぞれ符号化するステップと、
符号化された前記回答に、文字レベル畳み込みニューラルネットワークにより生成された学習済みモデルを適用して、前記回答者による前記回答が前記複数のカテゴリのいずれに分類されるかを判定するステップと、
を有することを特徴とする情報処理方法である。 In order to achieve the above object, the following configuration is adopted in the present invention. That is,
It is an information processing method that classifies the answers to the open-ended questions included in the questionnaire into one of multiple categories.
The steps to obtain the answer, which consists of textual information for the question, created by the respondent,
The step of dividing the character string included in the answer into characters and encoding each character,
A step of applying a trained model generated by a character-level convolutional neural network to the encoded answer to determine which of the plurality of categories the answer by the respondent falls into.
It is an information processing method characterized by having.

本発明は、また、以下の構成を採用する。すなわち、
アンケートに対する回答を複数のカテゴリにいずれかに分類する情報処理方法であって、
前記アンケートには、回答者に対して、写真と、前記写真についてのコメントの送信を要求する質問が含まれており、
回答者より送信された前記回答から、前記写真の画像情報と、前記コメントのテキスト情報を取得するステップと、
前記テキスト情報に含まれる文字列を１文字ずつに分割してそれぞれ符号化するステップと、
前記画像情報を画素ごとに符号化するステップと、
符号化された前記テキスト情報および前記画像情報に、畳み込みニューラルネットワークにより生成された学習済みモデルを適用して、前記回答者による前記回答が前記複数のカテゴリのいずれに分類されるかを判定するステップと、
を有することを特徴とする情報処理方法である。 The present invention also employs the following configurations. That is,
It is an information processing method that classifies the answers to the questionnaire into multiple categories.
The survey includes a photo and a question requesting respondents to send comments about the photo.
A step of acquiring the image information of the photograph and the text information of the comment from the answer sent from the respondent, and
A step of dividing the character string included in the text information into characters and encoding each character, and
The step of encoding the image information for each pixel and
A step of applying a trained model generated by a convolutional neural network to the encoded text information and the image information to determine which of the plurality of categories the answer by the respondent falls into. When,
It is an information processing method characterized by having.

本発明は、また、以下の構成を採用する。すなわち、
アンケートに含まれる自由回答式の質問に対する回答を複数のカテゴリのいずれかに分類する情報処理装置であって、
回答者により作成された、前記質問に対するテキスト情報からなる前記回答を取得する回答受信部と、
前記回答に含まれる文字列を１文字ずつに分割してそれぞれ符号化する回答内容取得部と、
符号化された前記回答に、文字レベル畳み込みニューラルネットワークにより生成された学習済みモデルを適用して、前記回答者による前記回答が前記複数のカテゴリのいずれに分類されるかを判定する分類判定部と、
を有することを特徴とする情報処理装置である。 The present invention also employs the following configurations. That is,
An information processing device that classifies the answers to free-answer questions included in a questionnaire into one of multiple categories.
An answer receiver that acquires the answer, which consists of text information for the question, created by the respondent.
An answer content acquisition unit that divides the character string included in the answer into characters and encodes each character.
A classification determination unit that applies a trained model generated by a character-level convolutional neural network to the encoded answer to determine which of the plurality of categories the answer by the respondent falls into. ,
It is an information processing apparatus characterized by having.

本発明は、また、以下の構成を採用する。すなわち、
アンケートに対する回答を複数のカテゴリにいずれかに分類する情報処理装置であって、
前記アンケートには、回答者に対して、写真と、前記写真についてのコメントの送信を要求する質問が含まれており、
回答者より送信された前記回答から、前記写真の画像情報と、前記コメントのテキスト情報を取得する回答受信部と、
前記テキスト情報に含まれる文字列を１文字ずつに分割してそれぞれ符号化し、前記画像情報を画素ごとに符号化する、回答内容取得部と、
符号化された前記テキスト情報および前記画像情報に、畳み込みニューラルネットワークにより生成された学習済みモデルを適用することにより、前記テキスト情報と前記画像情報から同じアルゴリズムで、前記回答者による前記回答が前記複数のカテゴリのいずれに分類されるかを判定する分類判定部と、
を有することを特徴とする情報処理装置である。 The present invention also employs the following configurations. That is,
An information processing device that classifies the answers to questionnaires into multiple categories.
The survey includes a photo and a question requesting respondents to send comments about the photo.
An answer receiving unit that acquires the image information of the photograph and the text information of the comment from the answer sent from the respondent.
An answer content acquisition unit that divides the character string included in the text information into characters and encodes each character, and encodes the image information pixel by pixel.
By applying a trained model generated by a convolutional neural network to the coded text information and the image information, the text information and the image information can be subjected to the plurality of answers by the respondent using the same algorithm. A classification judgment unit that determines which of the categories is classified into
It is an information processing apparatus characterized by having.

本発明によれば、ネットリサーチにおいて、自由回答式のアンケートに対するモニタの回答を、簡易にかつ精度良く分類するための技術を提供することができる。 According to the present invention, it is possible to provide a technique for easily and accurately classifying a monitor's response to an open-ended questionnaire in net research.

情報処理システムの全体的な機能構成を説明する図である。It is a figure explaining the overall functional structure of an information processing system. 情報処理装置および回答者端末の構成を説明する図である。It is a figure explaining the structure of an information processing apparatus and a respondent terminal. 自由回答を分類する処理を説明するフローチャートである。It is a flowchart explaining the process of classifying free answer. 回答者端末を用いて回答する様子を説明する図である。It is a figure explaining the state of answering using a respondent terminal. 分類情報の設定について説明する図である。It is a figure explaining setting of classification information. 実施形態２の処理を説明するフローチャートである。It is a flowchart explaining the process of Embodiment 2. 実施形態２における回答の様子を説明する図である。It is a figure explaining the state of the answer in Embodiment 2.

以下に図面を参照しつつ、本発明の好適な実施の形態を説明する。ただし、以下に記載されている構成ブロックやそれらの相対配置などは、発明が適用されるシステムの各種条件により適宜変更されるべきものであり、この発明の範囲を以下の記載に限定する趣旨のものではない。 A preferred embodiment of the present invention will be described below with reference to the drawings. However, the constituent blocks described below and their relative arrangements should be appropriately changed depending on various conditions of the system to which the invention is applied, and the scope of the invention is limited to the following description. It's not a thing.

本発明は、自由回答式のアンケートに対する回答を分類する技術に関する。本発明は、このような技術を用いた情報処理装置または情報処理方法として捉えることができる。本
発明はまた、上記の情報処理装置の演算資源を利用して動作し、情報処理方法の各工程を実行させるプログラムや、かかるプログラムが格納された記憶媒体としても捉えることができる。記憶媒体は、コンピュータにより読み取り可能な非一時的な記憶媒体であってもよい。本発明はまた、上記のような情報処理装置と、ユーザがアンケートに回答する端末を含めた情報処理システムとして捉えることもできる。 The present invention relates to a technique for classifying answers to open-ended questionnaires. The present invention can be regarded as an information processing device or an information processing method using such a technique. The present invention can also be regarded as a program that operates by utilizing the arithmetic resources of the above-mentioned information processing apparatus and executes each process of the information processing method, or a storage medium in which such a program is stored. The storage medium may be a non-temporary storage medium that can be read by a computer. The present invention can also be regarded as an information processing system including the above-mentioned information processing device and a terminal in which a user answers a questionnaire.

以下の実施形態において、調査者とは、回答者（モニタ）に対してネットを介してアンケートを行い、回答を取得して解析する者である。調査者は、企業、研究機関または個人など、多数のモニタにアンケートを行う者であれば、だれでも構わない。調査者は、企業等自身であってもよいし、企業等からの依頼を受けたリサーチ会社であってもよい。
また回答者とは、ネットを介してアンケートに回答する者である。リサーチ会社は、多数のモニタを会員として組織化しており、調査行動を依頼されるとモニタと連絡を取ってアンケートを行う。かかるリサーチ会社が調査者を兼ねることにより、調査からアンケート実施までの処理をスムーズに行うことができる。ネットリサーチ会社には、比較的低コストかつ短期間で多人数のモニタを対象とする調査を実施可能という利点がある。 In the following embodiments, the surveyor is a person who conducts a questionnaire to respondents (monitors) via the Internet, acquires answers, and analyzes them. The investigator can be any person who surveys a large number of monitors, such as companies, research institutes, or individuals. The researcher may be a company or the like itself, or may be a research company that has received a request from the company or the like.
Respondents are those who respond to questionnaires via the Internet. The research company organizes a large number of monitors as members, and when asked to conduct a survey, it contacts the monitors and conducts a questionnaire. By having such a research company also serve as a researcher, it is possible to smoothly carry out the process from the survey to the implementation of the questionnaire. Internet research companies have the advantage of being able to conduct surveys of a large number of monitors at a relatively low cost and in a short period of time.

［第１の実施形態］
＜全体構成＞
本実施形態のシステムの全体的な構成について、図１、図２を参照しながら説明する。
情報処理システム１は概略、調査者１５が使用する情報処理装置１０と、複数のモニタ（回答者２５ａ，ｂ，…）それぞれが使用する回答者端末（回答者端末２０ａ，２０ｂ，…）を有している。情報処理装置１０と各々の回答者端末２０の間は、Ｗｅｂや専用回線等のネットワークを介して互いに通信可能に接続されている。 [First Embodiment]
<Overall configuration>
The overall configuration of the system of the present embodiment will be described with reference to FIGS. 1 and 2.
The information processing system 1 generally includes an information processing device 10 used by the surveyor 15 and respondent terminals (respondent terminals 20a, 20b, ...) Used by each of a plurality of monitors (respondents 25a, b, ...). doing. The information processing device 10 and each of the respondent terminals 20 are connected to each other so as to be able to communicate with each other via a network such as the Web or a dedicated line.

情報処理装置１０は、調査者が利用する装置であり、アンケートを作成し、そのアンケートに対する回答者２５からの回答を取得し、自由回答を分類する機能を有する。
図１の紙面下部には、情報処理装置１０の機能ブロックを示している。詳しくは後述するが、情報処理装置１０は、調査者１５がアンケートを作成するアンケート作成部１０１、アンケートを回答者端末２０に送信するアンケート送信部１０２、回答者端末２０から回答を受信する回答受信部１１１、回答中から判別に必要な情報（テキストや画像）を取得する回答内容取得部１１２、テキストから特徴量を取得する特徴抽出部１１３、回答に分類情報を付与する分類判定部１１４、判定結果を調査者１５に提示する結果出力部１１５、調査者１５が学習データを作成するデータ作成部１２１、学習データから学習済みモデルを作成するモデル作成部１２２を備える。各機能ブロックの詳細な動作については後述する。各機能ブロックは、必ずしも物理的な実体である必要はなく、それぞれがプログラムモジュールとして仮想的に実現されてもよい。また、機能ブロックの区分は必ずしも図示したとおりでなくても良く、情報処理装置１０が全体として本実施形態の処理を実現できれば良い。 The information processing device 10 is a device used by a researcher, and has a function of creating a questionnaire, acquiring answers from respondents 25 to the questionnaire, and classifying free responses.
The functional block of the information processing device 10 is shown at the bottom of the page of FIG. As will be described in detail later, the information processing apparatus 10 includes a questionnaire creation unit 101 in which the surveyor 15 creates a questionnaire, a questionnaire transmission unit 102 that transmits a questionnaire to the respondent terminal 20, and an answer reception unit that receives an answer from the respondent terminal 20. Unit 111, answer content acquisition unit 112 that acquires information (text or image) necessary for discrimination from the answers, feature extraction unit 113 that acquires the feature amount from the text, classification judgment unit 114 that gives classification information to the answer, judgment It includes a result output unit 115 that presents the result to the investigator 15, a data creation unit 121 that the investigator 15 creates training data, and a model creation unit 122 that creates a trained model from the training data. The detailed operation of each functional block will be described later. Each functional block does not necessarily have to be a physical entity, and each may be virtually realized as a program module. Further, the division of the functional blocks does not necessarily have to be as shown in the drawing, and it is sufficient that the information processing apparatus 10 can realize the processing of the present embodiment as a whole.

図２（ａ）に、情報処理装置１０の構成例を示す。
情報処理装置１０としては、演算資源を備え、プログラムの指示やインタフェースを介した指示によって動作する、ＰＣ、サーバ、ワークステーションなどを利用できる。なお、情報処理装置１０として、クラウド上の演算資源を利用するクラウドサーバを用いてもよいし、ネット回線または直接的に接続された複数のＰＣ等を組み合わせてもよい。 FIG. 2A shows a configuration example of the information processing device 10.
As the information processing device 10, a PC, a server, a workstation, or the like, which has computational resources and operates according to a program instruction or an instruction via an interface, can be used. As the information processing device 10, a cloud server that uses computing resources on the cloud may be used, or a net line or a plurality of directly connected PCs may be combined.

図示例では、情報処理装置１０は、情報処理を行うＣＰＵ１００１、画像処理を行うＧＰＵ１００２、データを一時的または永続的に保持するメモリ１００３、Ｗｅｂ等を介して外部との通信を行う通信モジュール１００４、マウス１００５、キーボード１００６、ディスプレイ１００７を含む。メモリ１００３に加えて、またはメモリ１００３の代わりに、情報処理装置１０にＨＤＤやＳＳＤ等の外部記憶装置を接続しても良い。 In the illustrated example, the information processing device 10 includes a CPU 1001 that performs information processing, a GPU 1002 that performs image processing, a memory 1003 that temporarily or permanently holds data, and a communication module 1004 that communicates with the outside via the Web or the like. Includes mouse 1005, keyboard 1006, display 1007. An external storage device such as an HDD or SSD may be connected to the information processing device 10 in addition to the memory 1003 or instead of the memory 1003.

図１に示したアンケート作成部１０１、アンケート送信部１０２、回答受信部１１１、回答内容取得部１１２、特徴抽出部１１３、分類判定部１１４、結果出力部１１５、データ作成部１２１、モデル作成部１２２などの機能ブロックは、図２（ａ）に示した情報処理装置１０の各構成要素が協同して動作することにより実現される。 Questionnaire creation unit 101, questionnaire transmission unit 102, response reception unit 111, response content acquisition unit 112, feature extraction unit 113, classification determination unit 114, result output unit 115, data creation unit 121, model creation unit 122 shown in FIG. Such a functional block is realized by operating each component of the information processing apparatus 10 shown in FIG. 2A in cooperation with each other.

回答者端末２０は、回答者２５がアンケートの質問に回答するための端末装置である。
図１の紙面上部には、回答者端末２０の機能ブロックを示している。詳しくは後述するが、回答者端末２０は、情報処理装置１０からアンケートを受信するアンケート受信部２０１、回答者２５が回答を入力する回答入力部２０２、回答を情報処理装置１０に送信する回答送信部２１１を備える。各機能ブロックの詳細な動作については後述する。各機能ブロックは、必ずしも物理的な実体である必要はなく、それぞれがプログラムモジュールとして仮想的に実現されてもよい。また、機能ブロックの区分は必ずしも図示したとおりでなくても良く、回答者端末２０が全体として本実施形態の処理を実現できれば良い。 The respondent terminal 20 is a terminal device for the respondent 25 to answer a question in a questionnaire.
The functional block of the respondent terminal 20 is shown in the upper part of the page of FIG. As will be described in detail later, the respondent terminal 20 includes a questionnaire receiving unit 201 that receives a questionnaire from the information processing device 10, an answer input unit 202 that the respondent 25 inputs an answer, and an answer transmission that transmits the answer to the information processing device 10. A unit 211 is provided. The detailed operation of each functional block will be described later. Each functional block does not necessarily have to be a physical entity, and each may be virtually realized as a program module. Further, the division of the functional blocks does not necessarily have to be as shown in the drawing, and it is sufficient that the respondent terminal 20 can realize the processing of the present embodiment as a whole.

図２（ｂ）に、回答者端末２０の構成例を示す。
回答者端末２０としては、演算資源を備え、プログラムの指示やインタフェースを介した指示によって動作する、ＰＣ、サーバ、ワークステーションなどを利用できる。また近年のモバイル装置の発達に伴い、スマートフォンやタブレット端末なども好適に利用できる。
図では、回答者端末２０がスマートフォンである場合を示している。回答者端末２０は、情報処理や画像処理を行うＳｏＣ（ＳｙｓｔｅｍｏｎＣｈｉｐ）２００１、メモリ２００２、通信モジュール２００３、タッチパネル２００４、マイク２００５を含む。 FIG. 2B shows a configuration example of the respondent terminal 20.
As the respondent terminal 20, a PC, a server, a workstation, or the like, which has computational resources and operates according to a program instruction or an instruction via an interface, can be used. In addition, with the development of mobile devices in recent years, smartphones and tablet terminals can also be suitably used.
The figure shows a case where the respondent terminal 20 is a smartphone. The respondent terminal 20 includes a System (System on Chip) 2001 that performs information processing and image processing, a memory 2002, a communication module 2003, a touch panel 2004, and a microphone 2005.

図１に示したアンケート受信部２０１、回答入力部２０２、回答送信部２１１などの機能ブロックは、図２（ｂ）に示した回答者端末２０の各構成要素が協同して動作することにより実現される。 The functional blocks such as the questionnaire receiving unit 201, the answer input unit 202, and the answer transmitting unit 211 shown in FIG. 1 are realized by the cooperative operation of each component of the respondent terminal 20 shown in FIG. 2 (b). Will be done.

＜処理フロー＞
（前提）
図３を参照して、本実施形態の処理フローについて説明する。本フローは、調査者１５がアンケート実施を決定した時点から開始される。なお、ここでは日本語のアンケートを行っているが、他の言語であっても同様に処理できる。本フローについては、主として図１の機能ブロックを用いて説明を行う。 <Processing flow>
(Premise)
The processing flow of the present embodiment will be described with reference to FIG. This flow starts when the investigator 15 decides to conduct the questionnaire. In addition, although the questionnaire is conducted in Japanese here, it can be processed in the same way even in other languages. This flow will be described mainly using the functional blocks of FIG.

本実施形態では、回答者２５に対してすぐに調査目的に沿ったアンケート（本アンケートとも呼ぶ）を送信し、回答を収集している。しかし、本アンケートの前にスクリーニングのためのアンケート（予備アンケートとも呼ぶ）を行い、属性条件や履歴条件を満たした回答者２５を抽出してもよい。本発明の技術は、予備アンケートに対する回答と本アンケートに対する回答のどちらにも適用できる。 In the present embodiment, a questionnaire (also referred to as this questionnaire) according to the purpose of the survey is immediately sent to the respondent 25, and the responses are collected. However, a questionnaire for screening (also referred to as a preliminary questionnaire) may be conducted before the main questionnaire to extract the respondents 25 who satisfy the attribute conditions and the history conditions. The technique of the present invention can be applied to both the answer to the preliminary questionnaire and the answer to the present questionnaire.

本フローでは、アンケート中に含まれる質問の数は一つであり、その内容は、回答者２５が購入したい車名を答えさせるというものである。ただし、質問の数に特に制限はない。また、自由回答式の質問だけなく、選択式の質問を含んでいても良い。その場合、自由回答式の質問への回答が処理される。
本フローでは、回答者２５が自由回答した車名を、学習済みモデルにより、予め設定された分類のいずれであるかを自動的に判別する。
本フローでは、単語による回答に対して、その単語に対応するコーディングを行っているが、回答の種類はこれに限られず、文章や数字を含む場合など、コーディングする可能性のある回答であれば、何であれ本発明の対象となる。 In this flow, the number of questions included in the questionnaire is one, and the content is to have the respondent 25 answer the name of the car he / she wants to purchase. However, there is no particular limit to the number of questions. In addition, not only open-ended questions but also multiple-choice questions may be included. In that case, the answers to the open-ended questions are processed.
In this flow, the vehicle name freely answered by the respondent 25 is automatically determined by the trained model as to which of the preset classifications.
In this flow, a word-based answer is coded corresponding to that word, but the type of answer is not limited to this, and any answer that may be coded, such as when a sentence or number is included, is used. Anything is the subject of the present invention.

（フロー開始）
本フローは概略、学習済みモデルを作成する準備工程と、作成された学習済みモデルを用いて自由回答を分類判別する実施工程に大別される。
ステップＳ１０１〜Ｓ１０３は、コーディング済みのサンプルデータを学習データとして学習済みモデルを作成する、準備工程である。 (Start flow)
This flow is roughly divided into a preparatory process for creating a trained model and an implementation process for classifying and discriminating free answers using the created trained model.
Steps S101 to S103 are preparatory steps for creating a trained model using the coded sample data as training data.

ステップＳ１０１にて、調査者１５は、情報処理装置１０のデータ作成部１２１を用いてサンプルデータをコーディングして、学習データを作成する。図５（ａ）は、分類の区分を示すテーブルであり、コーディングを行う際の車名とコード符号の対応関係が示されている。この対応関係はデータベースのテーブルとして保存されている。なお、ここでは便宜上、車名を３つだけ用いているが、カテゴリはこれに限定されない。また、同じ車種名でも型式や年式などに応じて細分化しても良い。また、自動車の場合であれば「車名」「色」「年式」といったように、一つの調査対象について、観点が異なる複数のカテゴリを設定しても良い。これにより、調査結果を様々な切り口でタグ検索することが可能となり、多様な観点からの分析が可能になる。 In step S101, the investigator 15 codes the sample data using the data creation unit 121 of the information processing apparatus 10 to create the learning data. FIG. 5A is a table showing the classifications, and shows the correspondence between the vehicle name and the code code when coding. This correspondence is stored as a database table. For convenience, only three vehicle names are used here, but the category is not limited to this. In addition, the same model name may be subdivided according to the model, year, and the like. Further, in the case of an automobile, a plurality of categories having different viewpoints may be set for one survey target, such as "vehicle name", "color", and "model year". This makes it possible to search for tags from various perspectives on the survey results, and to analyze from various perspectives.

図５（ｂ）は、調査者１５がサンプルデータ（サンプル回答）を人手によりコード化して作成した、学習データである。サンプルデータの取得方法は任意であり、過去の同種の質問に対する回答を取得してもよいし、調査者１５が新規に作成しても良い。サンプル数は求める精度などにもよるが、例えば数百以上とする。続いて、調査者１５は、個々のサンプルデータがどの分類に当たるかを判定して適切な分類情報を付与する。調査者１５は、マニュアル、辞書、ノウハウなどを基準として分類を行うので、図５（ｂ）に示すように、必ずしも正確でない車名に対しても適切な分類情報が付与される。
なお、調査者１５は、分類情報付与の際に、図５（ｄ）に示すように、正しい車名をそのまま入力しても構わない。 FIG. 5B is learning data created by the researcher 15 by manually coding the sample data (sample response). The method of acquiring the sample data is arbitrary, and answers to the same type of questions in the past may be acquired, or the investigator 15 may newly create the data. The number of samples depends on the required accuracy and the like, but is set to several hundreds or more, for example. Subsequently, the investigator 15 determines which classification the individual sample data corresponds to, and assigns appropriate classification information. Since the investigator 15 classifies vehicles based on manuals, dictionaries, know-how, etc., as shown in FIG. 5B, appropriate classification information is given even to vehicle names that are not always accurate.
The investigator 15 may enter the correct vehicle name as it is, as shown in FIG. 5D, when assigning the classification information.

ステップＳ１０２にて、調査者１５は、情報処理装置１０のモデル作成部１２２を用いて、テキスト情報を畳み込みニューラルネットワーク（ＣＮＮ）用に数値化する。そしてステップＳ１０３にて、モデル作成部１２２は、数値化されたテキスト情報から学習済みモデルを作成する。 In step S102, the investigator 15 digitizes the text information for the convolutional neural network (CNN) by using the model creation unit 122 of the information processing apparatus 10. Then, in step S103, the model creation unit 122 creates a trained model from the digitized text information.

ここで、テキスト情報を対象としたＣＮＮの一例に関して説明する。ＣＮＮは、階層型ニューラルネットワークを用いた機械学習の一種であり、畳み込み層とプーリング層とが多重に繰り返される積層構造を用いて高精度な情報処理を可能としている。本発明の特徴として、ＣＮＮでの学習に用いるテキスト情報を単語や句で区切るのではなく、文字が連続するデータとして捉えて、１文字ごとに符号化を行うことがある。符号化の方法としては、例えば、Ｕｎｉｃｏｄｅ規格の文字集合に含まれる文字に割り当てられたコードポイントを利用可能である。ただし、対象とするテキスト情報を符号化できるのであれば、文字集合や符号化方式は限定されない。例えば、英語のように文字集合に含まれるキャラクタの種類が比較的少ない場合は、符号化時のバイト数がより少なくて済む方法でもよい。なお、回答ごとにテキスト量が異なる場合、テキストを固定長化してＣＮＮでの処理を容易にするために、文章の先頭から所定の文字数を取得したり、ゼロパディングを行ったりしてもよい。このようにＵｎｉｃｏｄｅ文字集合に対する符号化を行うことで、日本語を含めた様々な言語を同じやり方で処理できる。 Here, an example of CNN for text information will be described. CNN is a kind of machine learning using a hierarchical neural network, and enables highly accurate information processing by using a laminated structure in which a convolutional layer and a pooling layer are repeated multiple times. A feature of the present invention is that the text information used for learning in CNN is not separated by words or phrases, but is regarded as continuous data of characters and coded for each character. As a coding method, for example, a code point assigned to a character included in a character set of the Unicode standard can be used. However, the character set and the encoding method are not limited as long as the target text information can be encoded. For example, when the number of types of characters included in the character set is relatively small as in English, a method that requires a smaller number of bytes at the time of encoding may be used. If the amount of text differs for each answer, a predetermined number of characters may be obtained from the beginning of the sentence or zero padding may be performed in order to make the text a fixed length and facilitate processing by CNN. By encoding the Unicode character set in this way, various languages including Japanese can be processed in the same way.

また上述のように、１文字ごとに符号化が行われる、文字レベル畳み込みニューラルネットワーク（Ｃｈａｒａｃｔｅｒ−ｌｅｖｅｌＣｏｎｖｏｌｕｔｉｏｎａｌｎｅｕｒａｌｎｅｔｗｏｒｋ、以下「文字レベルＣＮＮ」とも呼ぶ）を採用している。文字レベルＣＮＮを採用することで、文章を単なる文字列として取り扱えばよくなる。言い換える
と、文章の意味や文法構造を考慮する必要がなくなり、文字の並びだけを考慮すればよいため、特許文献１で行われていた形態素解析などの前処理が不要となり、１文字ずつ分割して符号化するだけの前処理で良くなる。さらに、文章解析用の辞書を用いる必要がないため、辞書の作成やメンテナンスのコストが不要になる。さらに、図５（ｃ）に示したように誤字や誤記、表記ゆれなどを含む場合でも判定精度が向上する。 Further, as described above, a character-level convolutional neural network (Character-level Convolutional neural network, hereinafter also referred to as "character-level CNN") in which coding is performed for each character is adopted. By adopting the character level CNN, the sentence can be treated as a simple character string. In other words, it is not necessary to consider the meaning and grammatical structure of the sentence, and only the arrangement of characters needs to be considered. Therefore, preprocessing such as morphological analysis performed in Patent Document 1 becomes unnecessary, and the characters are divided one by one. Pre-processing that only encodes is sufficient. Further, since it is not necessary to use a dictionary for sentence analysis, the cost of creating and maintaining the dictionary is not required. Further, as shown in FIG. 5C, the determination accuracy is improved even when typographical errors, typographical errors, and notational fluctuations are included.

ＣＮＮにおける入力データのサイズやチャネル数、フィルタサイズやフィルタ数やフィルタ移動間隔、畳み込みやプーリングの回数、その他学習に必要な種々のパラメータは、入力データの特性や必要な判別精度に応じて適宜定めることができる。なお、学習済みモデルを作成するときには、コーディング済みのサンプルデータを学習データと検証データに分けてもよいし、交差検定法によって精度を向上させてもよい。 The size and number of channels of input data in CNN, the size of filters, the number of filters, the filter movement interval, the number of convolutions and poolings, and various other parameters required for learning are appropriately determined according to the characteristics of the input data and the required discrimination accuracy. be able to. When creating the trained model, the coded sample data may be divided into training data and validation data, or the accuracy may be improved by a cross-validation method.

ステップＳ１０４以後は、アンケートを実施し、回答に対して学習済みモデルを適用する、アンケート実施工程である。
ステップＳ１０４にて、調査者１５は、情報処理装置１０のアンケート作成部１０１を用いてアンケートを作成する。ここでは、上述した質問文を作成するほか、アンケートに含まれる質問の流れや表示形式など、様々な条件を設定する。そして、アンケート送信部１０２を用いてアンケートを回答者端末２０に送信する。 After step S104, it is a questionnaire implementation process in which a questionnaire is conducted and the trained model is applied to the answers.
In step S104, the investigator 15 creates a questionnaire using the questionnaire creation unit 101 of the information processing device 10. Here, in addition to creating the above-mentioned question text, various conditions such as the flow of questions included in the questionnaire and the display format are set. Then, the questionnaire is transmitted to the respondent terminal 20 using the questionnaire transmission unit 102.

ステップＳ１０５にて、回答者２５は、回答者端末２０のアンケート受信部２０１が受信したアンケートに対して、回答入力部２０２を用いて回答する。図４は、回答者端末２０のタッチパネル２００４に表示されるアンケートと回答の様子を示す。図４（ａ）は本フローの回答者端末２０であり、ここではスマートフォンを用いる。回答者２５は、回答入力部２０２としてタッチパネル２００４を使用する。 In step S105, the respondent 25 responds to the questionnaire received by the questionnaire receiving unit 201 of the respondent terminal 20 by using the answer input unit 202. FIG. 4 shows a questionnaire and an answer displayed on the touch panel 2004 of the respondent terminal 20. FIG. 4A shows the respondent terminal 20 of this flow, and a smartphone is used here. The respondent 25 uses the touch panel 2004 as the answer input unit 202.

図４（ｂ）は、回答者２５が回答を入力するときの画面を示す。タッチパネル２００４には、質問表示欄２０４１、回答表示欄２０４２、回答送信ボタン２０４３が表示されている。この例での質問は、回答者２５が購入したい車名を答えさせるというものである。回答者２５は、スマートフォンにインストールされたＩＭＥを用いて回答を回答表示欄２０４２に入力すると、図５（ｃ）に示すように、回答送信ボタン２０４３を押下する。これにより、回答者端末２０の回答送信部２１１が、ネットワークを介して情報処理装置１０の回答受信部１１１に回答を送信する。 FIG. 4B shows a screen when the respondent 25 inputs an answer. On the touch panel 2004, a question display field 2041, an answer display field 2042, and an answer transmission button 2043 are displayed. The question in this example is to have respondent 25 answer the name of the car he wants to buy. When the respondent 25 inputs the answer in the answer display field 2042 using the IME installed on the smartphone, the respondent 25 presses the answer transmission button 2043 as shown in FIG. 5 (c). As a result, the response transmission unit 211 of the respondent terminal 20 transmits the response to the response reception unit 111 of the information processing device 10 via the network.

ステップＳ１０６にて、調査者１５は、情報処理装置１０の回答内容取得部１１２を用いて、取得された回答から、判別に必要な情報を取得する。ここでは、自由回答のテキスト情報を取得する。そして、ＵＴＦ−８による数値化などの処理を行う。続いて特徴抽出部１１３は、テキストに学習済みのモデルを適用して特徴量を抽出する。続いて分類判定部１１４は、自由回答にいずれの分類に当たるかを判定して、分類情報を付与する。分類判定部１１４は、ＣＮＮにおける全結合層から出力層に対応し、本実施形態では分類の数が３個であるため、出力先も３ノードに設定される。そして、抽出された特徴量に基づき、隠れ層における重み付けと出力層でのソフトマックス関数等を用いた演算を行い、分類それぞれに属する確率を算出する。 In step S106, the investigator 15 uses the response content acquisition unit 112 of the information processing device 10 to acquire information necessary for discrimination from the acquired response. Here, the text information of the free answer is acquired. Then, processing such as digitization by UTF-8 is performed. Subsequently, the feature extraction unit 113 applies the trained model to the text to extract the feature amount. Subsequently, the classification determination unit 114 determines which classification corresponds to the free response, and adds classification information. The classification determination unit 114 corresponds to the fully connected layer to the output layer in the CNN, and since the number of classifications is three in this embodiment, the output destination is also set to three nodes. Then, based on the extracted features, weighting in the hidden layer and calculation using the softmax function in the output layer are performed to calculate the probability of belonging to each classification.

ステップＳ１０７にて、情報処理装置１０の結果出力部１１５が判定結果を調査者１５に提示する。図５（ｃ）は、分類判定の結果として出力された、車名を示す自由回答中のテキストとコードの関係と、調査者１５に提示される結果を説明するための図である。各自由回答データには、図示したように不正確な車名が含まれる。しかし、本発明の学習済みモデルを適用することにより適切なコーディングが行われるために、調査者に対して正確な情報を提示できる。なお、判定結果の提供方法として、判定済みのデータをメモリに保存したり、オンラインで提供したりしても良い。また、判定済みのデータと、当該デー
タの解析機能を持つソフトウェアを、クラウド上で動作するＳａａＳのような形で提供してもよい。 In step S107, the result output unit 115 of the information processing device 10 presents the determination result to the investigator 15. FIG. 5C is a diagram for explaining the relationship between the text and the code in the free answer indicating the vehicle name output as a result of the classification determination and the result presented to the investigator 15. Each open-ended response data contains an inaccurate vehicle name as shown. However, accurate coding can be provided by applying the trained model of the present invention to the investigator. As a method of providing the determination result, the determined data may be saved in the memory or provided online. Further, the determined data and software having an analysis function of the data may be provided in the form of SaaS operating on the cloud.

このように、本実施形態のフローでは、情報処理装置１０が機械学習の結果に基づいて自由回答のテキストを分類する。特に、形態素解析などの前処理が必要なく、自由回答を一様に処理できる。そのため、表記揺れや誤記などを含む多様な自由回答を対象とした場合でも、時間および人手に関するコストを低減しつつ、調査者１５が理解しやすい形式で判定結果を提示できる。したがって、調査者１５のコーディングへの習熟度や機械学習に関する知識に関わらずに、アンケート結果の分析に集中できるようになる。さらに、日本語だけでなく複数の言語を同様に処理できる。 As described above, in the flow of the present embodiment, the information processing device 10 classifies the text of the free answer based on the result of machine learning. In particular, free answers can be processed uniformly without the need for preprocessing such as morphological analysis. Therefore, even when various free responses including notation fluctuations and typographical errors are targeted, the determination result can be presented in a format that is easy for the investigator 15 to understand while reducing the cost related to time and manpower. Therefore, regardless of the investigator 15's coding proficiency and machine learning knowledge, he / she can concentrate on the analysis of the questionnaire results. Furthermore, not only Japanese but also multiple languages can be processed in the same way.

［第２の実施形態］
第２の実施形態について、第１の実施形態との違いを中心に説明を行い、共通部分についての説明は簡略化する。
本実施形態にかかる情報処理システム１の構成は、第１の実施形態と基本的に同じである。本実施形態の回答者端末２０は、第１の実施形態での機能に加えて、アンケートに回答する際に、情報処理装置１０に画像を送信する機能を有する。回答者端末２０がスマートフォンの場合、回答者２５がスマートフォンのカメラで撮影した画像を送信できるようにすると良い。画像は、動画像であっても良い。 [Second Embodiment]
The second embodiment will be described focusing on the differences from the first embodiment, and the description of the common parts will be simplified.
The configuration of the information processing system 1 according to the present embodiment is basically the same as that of the first embodiment. The respondent terminal 20 of the present embodiment has a function of transmitting an image to the information processing device 10 when answering a questionnaire, in addition to the function of the first embodiment. When the respondent terminal 20 is a smartphone, it is preferable to enable the respondent 25 to transmit an image taken by the camera of the smartphone. The image may be a moving image.

＜処理フロー＞
（前提）
図６を参照して、本実施形態の処理フローについて、図３のフローと異なる部分を中心に説明する。本フローでのアンケートは、回答者２５から写真を送信してもらう写真調査アンケートである。このような写真調査には様々な利点がある。例えばテキストのみの回答と比べて、回答者２５の生活実態や行動実態を直感的に把握できる。また、家庭内での実際の商品等の使用状況を確認できる。また、回答者２５が言葉では回答できない内容も、写真を見ることで理解可能となる。 <Processing flow>
(Premise)
The processing flow of the present embodiment will be described with reference to FIG. 6, focusing on a portion different from the flow of FIG. The questionnaire in this flow is a photo survey questionnaire in which respondents 25 send photos. Such a photo survey has various advantages. For example, it is possible to intuitively grasp the living conditions and behavioral conditions of the respondents 25 as compared with the text-only answers. In addition, it is possible to check the actual usage status of products in the home. In addition, the contents that the respondent 25 cannot answer in words can be understood by looking at the photograph.

本実施形態では、図７（ａ）に示すように、回答者２５の自宅で使われている家具（ソファー）を撮影して写真の送信を要求するようなアンケートを行う。本実施形態において、送られてきたソファーの写真を、学習済みモデルにより、「布張り」か「革張り」の２つのカテゴリのいずれかに分類する。 In the present embodiment, as shown in FIG. 7A, a questionnaire is conducted in which the furniture (sofa) used at the home of the respondent 25 is photographed and a photograph is requested to be transmitted. In the present embodiment, the sent sofa photograph is classified into either "upholstered" or "leathered" according to the trained model.

（フロー開始）
本実施形態のフローも、学習済みモデルを作成する準備工程と、写真を分類判別するアンケート実施工程に大別される。ステップＳ１０１〜Ｓ１０３は、コーディング済みのサンプルデータを学習データとして学習済みモデルを作成する、準備工程である。 (Start flow)
The flow of this embodiment is also roughly divided into a preparatory process for creating a trained model and a questionnaire implementation process for classifying and discriminating photographs. Steps S101 to S103 are preparatory steps for creating a trained model using the coded sample data as training data.

ステップＳ２０１にて、調査者１５は、情報処理装置１０のデータ作成部１２１を用いてサンプルデータをコーディングして学習データを作成し、データとして保存する。サンプルデータとしては、様々なソファーの写真の画像情報を用いる。調査者は、それぞれの写真に写るソファーが布張りか革張りかを判断してコードとしての分類情報を付与する。カテゴリ数は２つに限定されず、より多くても良い。また、色や形状、家具メーカー名など、様々な切り口のカテゴリを設けても良い。調査者１５が持つノウハウにより、各写真に適切な分類情報が付与される。 In step S201, the investigator 15 codes the sample data using the data creation unit 121 of the information processing apparatus 10 to create learning data, and saves it as data. As sample data, image information of photographs of various sofas is used. The investigator determines whether the sofa in each photo is upholstered or leather upholstered and gives classification information as a code. The number of categories is not limited to two and may be larger. In addition, various cut categories such as color, shape, furniture manufacturer name, etc. may be provided. Appropriate classification information is given to each photograph by the know-how possessed by the investigator 15.

ステップＳ２０２にて、調査者１５は、情報処理装置１０のモデル作成部１２２を用いて、写真の画像情報を畳み込みニューラルネットワーク（ＣＮＮ）用に数値化する。そしてステップＳ２０３にて、モデル作成部１２２は、数値化された画像情報から学習済みモ
デルを作成する。 In step S202, the investigator 15 digitizes the image information of the photograph for the convolutional neural network (CNN) by using the model creation unit 122 of the information processing apparatus 10. Then, in step S203, the model creation unit 122 creates a trained model from the digitized image information.

ここで、画像情報を対象としたＣＮＮの一例に関して説明する。ここでの入力画像は、ピクセルごとに画素値が数値化される方式で符号化された画像データとする。このような画像データを処理する場合、畳み込み層は、入力層への入力画像に対して、互いに異なる特徴を検出する複数のフィルタを用いた畳み込み演算を行う。これにより、画像の局所性を利用して特徴量を算出できる。ＣＮＮにおいては、フィルタのパラメータは画像認識の対象に応じて自動的に学習される。
プーリング層は、畳み込み層の出力を、画像中の所定領域内の最大値を取るマックスプーリングなどの方法でサブサンプリングして画像解像度を低下させ、特徴量を圧縮する。プーリング層での処理によって、画像中の位置への依存性を低減したり、計算コストを下げたりする効果が得られる。最後に出力層を含む全結合層において、特徴量を結合してパターン分類が行われて結果が出力される。このような学習済みモデルの生成においては、ＣＮＮの分類結果が正解に近づくようにフィルタ重みの計算が行われて最適化される。その際は、誤差逆伝播法など既知の任意の手法を用いてよい。 Here, an example of CNN targeting image information will be described. The input image here is image data encoded by a method in which the pixel value is quantified for each pixel. When processing such image data, the convolution layer performs a convolution operation on the input image to the input layer using a plurality of filters that detect features different from each other. As a result, the feature amount can be calculated by using the locality of the image. In CNN, the filter parameters are automatically learned according to the image recognition target.
The pooling layer subsamples the output of the convolution layer by a method such as max pooling that takes the maximum value within a predetermined region in the image to reduce the image resolution and compress the feature amount. The processing in the pooling layer has the effect of reducing the dependence on the position in the image and reducing the calculation cost. Finally, in the fully connected layer including the output layer, the features are combined and pattern classification is performed, and the result is output. In the generation of such a trained model, the filter weight is calculated and optimized so that the classification result of CNN approaches the correct answer. In that case, any known method such as backpropagation method may be used.

ステップＳ２０４以後は、アンケートを実施し、回答に対して学習済みモデルを適用する、アンケート実施工程である。
ステップＳ２０４にて、調査者１５は、情報処理装置１０のアンケート作成部１０１を用いてアンケートを作成する。そして、アンケート送信部１０２を用いてアンケートを回答者端末２０に送信する。 After step S204, it is a questionnaire implementation process in which a questionnaire is conducted and the trained model is applied to the answers.
In step S204, the investigator 15 creates a questionnaire using the questionnaire creation unit 101 of the information processing device 10. Then, the questionnaire is transmitted to the respondent terminal 20 using the questionnaire transmission unit 102.

ステップＳ２０５にて、回答者２５は、回答入力部２０２を用いてアンケートに回答する。図７（ａ）は回答者端末２０であるスマートフォンのタッチパネル２００４に表示される質問である。回答者２５は、スマートフォンに保存されている写真を選択して回答表示欄２０４２に表示させる。そして、図７（ｂ）に示すように、回答として写真を送信する。 In step S205, the respondent 25 answers the questionnaire using the answer input unit 202. FIG. 7A is a question displayed on the touch panel 2004 of the smartphone which is the respondent terminal 20. The respondent 25 selects a photo stored in the smartphone and displays it in the answer display field 2042. Then, as shown in FIG. 7B, a photograph is transmitted as an answer.

ステップＳ２０６にて、調査者１５は、回答内容取得部１１２を用いて、取得された回答から、判別に必要な情報としての画像情報を取得する。そして、上述したように画像データを符号化する。続いて、特徴抽出部１１３による学習済みのモデルを利用した特徴量抽出、分類判定部１１４による分類判定とカテゴリの決定が行われる。そしてステップＳ２０７にて、結果出力部１１５からの判定結果提示が行われる。 In step S206, the surveyor 15 uses the response content acquisition unit 112 to acquire image information as information necessary for discrimination from the acquired response. Then, the image data is encoded as described above. Subsequently, the feature extraction unit 113 extracts the feature amount using the trained model, and the classification determination unit 114 performs the classification determination and the category determination. Then, in step S207, the determination result is presented from the result output unit 115.

このように本実施形態では、写真調査アンケートで得られた画像情報についても、機械学習の結果に基づいて適切な分類が付与される。その結果、時間および人手に関するコストを低減しつつ、調査者１５が理解しやすい形式で判定結果を提示できる。 As described above, in the present embodiment, the image information obtained in the photo survey questionnaire is also given an appropriate classification based on the result of machine learning. As a result, the determination result can be presented in a format that is easy for the investigator 15 to understand while reducing the time and labor costs.

＜変形例＞
実施形態１、２では、それぞれ自由回答のテキスト情報と、写真調査の画像情報を処理して自動的にコーディングを行った。実施形態１ではテキスト情報を文字レベルで符号化することで文字列を数値情報として処理しており、実施形態２では画像情報を画素ごとの画素値に基づく数値情報として処理している。そのため本発明では、テキスト情報と画像情報を同じアルゴリズムで処理可能となっている。よって本発明は、回答者２５が写真にコメントを付けて回答する場合のように、テキスト情報と画像情報を組み合わせたマルチモーダルな回答についても適用できる。この場合、コメント部分のテキスト情報については実施形態１と同様に文字レベルで符号化処理し、写真部分の画像情報については実施形態２と同様にピクセル処理をすれば良い。そして、テキスト情報に基づく符号と画像情報に基づく符号を併せてニューラルネットワークに入力して分類を行う。
このようにマルチモーダル化を行うことにより、画像情報の特徴量だけでは区別が難し
い場合にも、回答者のコメントに基づいて分類精度を向上させることができる。 <Modification example>
In the first and second embodiments, the text information of the free answer and the image information of the photo survey were processed and automatically coded. In the first embodiment, the text information is encoded at the character level to process the character string as numerical information, and in the second embodiment, the image information is processed as numerical information based on the pixel value for each pixel. Therefore, in the present invention, text information and image information can be processed by the same algorithm. Therefore, the present invention can also be applied to a multimodal answer that combines text information and image information, such as when the respondent 25 responds by adding a comment to a photograph. In this case, the text information of the comment portion may be encoded at the character level as in the first embodiment, and the image information of the photographic portion may be pixel-processed as in the second embodiment. Then, the code based on the text information and the code based on the image information are input to the neural network and classified.
By performing multimodalization in this way, it is possible to improve the classification accuracy based on the comments of the respondents even when it is difficult to distinguish only by the feature amount of the image information.

１０：情報処理装置、１０１：アンケート作成部、１０２：アンケート送信部、１１１：回答受信部、１１４：分類判定部、１２１：データ作成部、１２２：モデル作成部
２０：回答者端末、２０１：アンケート受信部、２０２：回答入力部、２１１：回答送信部 10: Information processing device, 101: Questionnaire creation unit, 102: Questionnaire transmission unit, 111: Answer reception unit, 114: Classification judgment unit, 121: Data creation unit, 122: Model creation unit 20: Respondent terminal, 201: Questionnaire Receiver, 202: Answer input, 211: Answer transmitter

Claims

Information that classifies the answers to the open-ended questions included in the questionnaire and the photos sent to the questions included in the questionnaire that require the respondents to send photos into one of multiple categories. It is an information processing method executed by the processing device .
A step of acquiring the reply consisting of text information regarding which have been, the free reply type interrogator created by the respondent,
The step of encoding by dividing the character string included in the answer into characters one by one and digitizing each character, and
A step of acquiring image information of the photograph transmitted from the respondent and digitizing it for each pixel to encode it.
A step of applying a trained model generated by a character-level convolutional neural network to the quantified answer to determine which of the plurality of categories the answer by the respondent falls into.
A step of applying a trained model generated by a convolutional neural network to the digitized image information to determine which of the plurality of categories the photograph transmitted from the respondent falls into. ,
An information processing method characterized by having.

And creating a plurality of the learning data consisting of the open-ended type classification information assigned to the sample answers and the sample answers to questions,
A step of dividing the character strings included in the sample answers of the plurality of learning data into characters and encoding them, respectively.
A step of creating the trained model for classifying the answers of the respondents using the plurality of training data, and
The information processing method according to claim 1, further comprising a preparatory step.

The information processing method according to claim 1 or 2, wherein the character string included in the answer is encoded by a coding method corresponding to the Unicode standard.

The image information is encoded into a numerical value based on the pixel value for each pixel.
The information processing method according to any one of claims 1 to 3, wherein the information processing method is characterized.

It is an information processing method executed by an information processing device that classifies the answers to the questionnaire into one of multiple categories.
The survey includes a photo and a question requesting respondents to send comments about the photo.
A step of acquiring the image information of the photograph and the text information of the comment from the answer sent from the respondent, and
A step of dividing the character string included in the text information into characters and digitizing each character, and a step of encoding the characters.
A step of encoding the image information by digitizing it for each pixel,
A step of applying a trained model generated by a convolutional neural network to the digitized text information and the image information to determine which of the plurality of categories the answer by the respondent falls into. When,
An information processing method characterized by having.

A program for causing an information processing device to execute the information processing method according to any one of claims 1 to 5.

Information that classifies the answers to the open-ended questions included in the questionnaire and the photos sent to the questions included in the questionnaire that require the respondents to send photos into one of multiple categories. It ’s a processing device,
The respondents created by the response receiver that acquires the reply consisting of text information regarding the open-ended type interrogator,
The first answer content acquisition unit, which encodes the character string included in the answer by dividing it into characters one by one and digitizing each character,
A second response content acquisition unit that acquires image information of the photograph transmitted from the respondent and encodes it by digitizing each pixel, and
The digitized said answer, by applying the learned model generated by the character level convolutional neural network, the first classification determines the answer by the respondent is classified into the plurality of categories Judgment unit and
A second method of applying a trained model generated by a convolutional neural network to the digitized image information to determine which of the plurality of categories the photograph transmitted from the respondent falls into. Classification judgment unit and
An information processing device characterized by having.

An information processing device that classifies the answers to questionnaires into multiple categories.
The survey includes a photo and a question requesting respondents to send comments about the photo.
An answer receiving unit that acquires the image information of the photograph and the text information of the comment from the answer sent from the respondent.
The answer content acquisition unit, which encodes the character string included in the text information by dividing it into characters one by one and digitizing each character, and digitizes the image information for each pixel.
The digitized said text information and the image information, by applying the learned model generated by the neural network convolution, the same algorithm from the image information and the text information, the reply by the respondents said plurality A classification judgment unit that determines which of the categories is classified into
An information processing device characterized by having.