JP7443649B2

JP7443649B2 - Model update method, device, electronic device and storage medium

Info

Publication number: JP7443649B2
Application number: JP2023037758A
Authority: JP
Inventors: ワン、ハイフェン; ティアン、ハオ; ウ、フア; ウ、ティアン; リウ、ジン; ディン、ユチェン; キュ、インキ; ツオ、ミンシアン
Original assignee: Beijing Baidu Netcom Science and Technology Co Ltd
Current assignee: Beijing Baidu Netcom Science and Technology Co Ltd
Priority date: 2022-06-30
Filing date: 2023-03-10
Publication date: 2024-03-06
Anticipated expiration: 2043-03-10
Also published as: CN115168577A; JP2024006945A; CN115168577B

Description

本開示は、人工知能技術の分野に関し、特にディープラーニングおよび自然言語処理などの分野におけるモデル更新方法、装置、電子デバイスおよび記憶媒体に関する。 The present disclosure relates to the field of artificial intelligence technology, and particularly to model updating methods, apparatuses, electronic devices, and storage media in fields such as deep learning and natural language processing.

現在、訓練によって得られたディープラーニングモデルは、さまざまなシナリオで活用されている。例えば、検索シナリオでは、大量のコンテンツから候補結果をリコールするためのセマンティック検索モデルと、リコールされた候補結果をソートするためのセマンティックマッチングモデルとを使用することができる。しかし、これらのモデルはいずれも性能を向上させる余地がある。 Currently, deep learning models obtained through training are being used in a variety of scenarios. For example, a search scenario may use a semantic search model to recall candidate results from a large amount of content and a semantic matching model to sort the recalled candidate results. However, there is room for performance improvements in all of these models.

本開示は、モデル更新方法、装置、電子デバイス及び記憶媒体を提供する。 The present disclosure provides model updating methods, apparatus, electronic devices, and storage media.

評価データセットにおける処理すべきデータであって、少なくとも２つの異なるカテゴリに属する処理すべきデータについて、最適化すべきモデルおよびツールセットにおけるツールをそれぞれ利用して対応する処理結果を取得し、前記処理結果に応じて前記最適化すべきモデルおよび前記ツールの処理効果の優劣を比較し、比較結果に基づいて最適化すべきカテゴリと対応するアドバンテージツールとを特定することであって、前記最適化すべきカテゴリはツールの処理効果が前記最適化すべきモデルよりも優れているカテゴリであり、前記アドバンテージツールは処理効果が前記最適化すべきモデルよりも優れているツールであり、対応するアドバンテージツールに基づいて前記最適化すべきカテゴリの訓練データを生成し、前記訓練データを用いて前記最適化すべきモデルを更新することを含むモデル更新方法が提供される。 For the data to be processed in the evaluation data set that belongs to at least two different categories, the corresponding processing results are obtained using the model to be optimized and the tools in the toolset, and the processing results are The method includes comparing the processing effects of the model to be optimized and the tool, and identifying a category to be optimized and a corresponding advantage tool based on the comparison result, wherein the category to be optimized is a tool. is a category in which the processing effect is better than the model to be optimized, and the advantage tool is a category in which the processing effect is better than the model to be optimized, and the advantage tool is a category in which the processing effect is better than the model to be optimized, and the advantage tool is a category in which the processing effect is better than the model to be optimized. A model updating method is provided that includes generating categorical training data and updating the model to be optimized using the training data.

効果比較モジュールと、情報特定モジュールと、サンプル生成モジュールと、モデル更新モジュールとを備え、前記効果比較モジュールは、評価データセットにおける処理すべきデータであって、少なくとも２つの異なるカテゴリに属する処理すべきデータについて、最適化すべきモデルおよびツールセットにおけるツールをそれぞれ利用して対応する処理結果を取得し、前記処理結果に基づいて、前記最適化すべきモデルおよび前記ツールの処理効果の優劣を比較し、前記情報特定モジュールは、比較結果に基づいて最適化すべきカテゴリと対応するアドバンテージツールとを特定し、前記最適化すべきカテゴリはツールの処理効果が前記最適化すべきモデルよりも優れているカテゴリであり、前記アドバンテージツールは処理効果が前記最適化すべきモデルよりも優れているツールであり、前記サンプル生成モジュールは、対応するアドバンテージツールに基づいて前記最適化すべきカテゴリの訓練データを生成し、前記モデル更新モジュールは、前記訓練データを用いて前記最適化すべきモデルを更新するモデル更新装置が提供される。 The effect comparison module includes an effect comparison module, an information identification module, a sample generation module, and a model update module; For the data, the model to be optimized and the tools in the toolset are used to obtain corresponding processing results, and based on the processing results, the superiority or inferiority of the processing effects of the model to be optimized and the tools are compared; The information identification module identifies a category to be optimized and a corresponding advantage tool based on the comparison result, and the category to be optimized is a category in which the processing effect of the tool is superior to the model to be optimized; The advantage tool is a tool whose processing effect is better than the model to be optimized, the sample generation module generates training data for the category to be optimized based on the corresponding advantage tool, and the model update module generates training data for the category to be optimized based on the corresponding advantage tool. , a model updating device is provided that updates the model to be optimized using the training data.

少なくとも１つのプロセッサと、前記少なくとも１つのプロセッサと通信可能に接続されたメモリとを備え、前記メモリに前記少なくとも１つのプロセッサにより実行可能なコマンドが記憶されており、前記コマンドが前記少なくとも１つのプロセッサにより実行されると、前記少なくとも１つのプロセッサに前記のような方法を実行させる電子デバイスが提供される。 at least one processor; and a memory communicatively connected to the at least one processor, wherein the memory stores a command executable by the at least one processor, and the command is executed by the at least one processor. An electronic device is provided which causes said at least one processor to perform a method as described above.

コンピュータに前記のような方法を実行させるためのコンピュータコマンドを記憶した非一時的なコンピュータ可読記憶媒体が提供される。 A non-transitory computer-readable storage medium is provided having computer commands stored thereon for causing a computer to perform a method as described above.

プロセッサにより実行されると、前記のような方法を実現するコンピュータプログラム／コマンドを含むコンピュータプログラム製品が提供される。 A computer program product is provided that includes computer programs/commands that, when executed by a processor, implement a method as described above.

理解すべきなのは、本セクションで説明される内容は、本開示の実施形態の重要なまたは肝心な特徴を標識することでもなく、本開示の範囲を制限することでもない。本開示の他の特徴は、以下の明細書により容易に理解されるであろう。 It should be understood that what is described in this section does not label important or essential features of the embodiments of the disclosure or limit the scope of the disclosure. Other features of the disclosure will be readily understood from the following specification.

図面は、本技術案をより良く理解するためのものであり、本願に制限されない。図面において、
本開示に記載されたモデル更新方法の実施形態のフローチャートである。本開示に記載されたモデル更新方法の全体的な実施プロセスの概略図である。本開示に記載されたモデル更新装置の実施形態３００の構成の概略図である。本開示の実施形態を実施可能な電子デバイス４００の概略的なブロック図を示す。 The drawings are for better understanding of the technical solution and are not limited to the present application. In the drawing,
2 is a flowchart of an embodiment of a model update method described in this disclosure. 1 is a schematic diagram of the overall implementation process of the model update method described in this disclosure; FIG. 3 is a schematic diagram of a configuration of an embodiment 300 of a model update apparatus described in the present disclosure. FIG. 1 shows a schematic block diagram of an electronic device 400 capable of implementing embodiments of the present disclosure.

以下、図面に基づいて、本出願の例示的な実施例を説明する。理解を容易にするために、本出願の実施例の様々な詳細が含まれており、それらは単なる例示と見なされるべきである。従って、当業者は、本出願の範囲及び精神から逸脱することなく、本明細書に記載の実施形態に対して様々な変更及び修正を行うことができることを認識するはずである。同様に、簡明のために、以下の説明では、よく知られた機能と構造の説明は省略される。 Hereinafter, exemplary embodiments of the present application will be described based on the drawings. Various details of the embodiments of the present application are included to facilitate understanding and are to be considered as illustrative only. Accordingly, those skilled in the art will appreciate that various changes and modifications can be made to the embodiments described herein without departing from the scope and spirit of the present application. Similarly, in the interest of clarity, descriptions of well-known functions and structures are omitted in the following description.

さらに、本明細書における用語「および／または」は、単に関連オブジェクトを記述する関連関係であり、３つの関係が存在し得ると意味する。例えば、Ａおよび／またはＢは、Ａが単独で存在し、ＡとＢが同時に存在し、Ｂが単独で存在するという三つの状況を意味することができる。また、本明細書における文字「／」は、一般的に前後の関連オブジェクトが「または」の関係にあることを意味する。 Additionally, the term "and/or" herein is an association relationship that simply describes related objects, meaning that three relationships may exist. For example, A and/or B can mean three situations: A is present alone, A and B are present simultaneously, and B is present alone. Furthermore, the character "/" in this specification generally means that related objects before and after are in an "or" relationship.

図１は、本開示に記載されたモデル更新方法の実施形態のフローチャートである。図１に示すように、以下の具体的な実現方式が含まれる。 FIG. 1 is a flowchart of an embodiment of the model update method described in this disclosure. As shown in FIG. 1, the following specific implementation methods are included.

ステップ１０１では、評価データセットにおける処理すべきデータであって、少なくとも２つの異なるカテゴリに属する処理すべきデータについて、最適化すべきモデルとツールセットにおけるツールをそれぞれ用いて対応する処理結果を取得し、処理結果に基づいて最適化すべきモデルとツールの処理効果の優劣を比較する。 In step 101, for data to be processed in the evaluation data set and belonging to at least two different categories, corresponding processing results are obtained using the model to be optimized and the tools in the toolset, respectively; Compare the processing effects of the model and tool to be optimized based on the processing results.

ステップ１０２では、比較結果に基づいて、ツールの処理効果が最適化すべきモデルよりも優れているカテゴリである最適化すべきカテゴリと、それに対応する、処理効果が最適化すべきモデルよりも優れているツールであるアドバンテージツールとを特定する。 In step 102, based on the comparison results, a category to be optimized, which is a category in which the processing effect of the tool is better than the model to be optimized, and a corresponding tool, which is a category in which the processing effect of the tool is better than the model to be optimized, are determined. Identify the advantage tools that are.

ステップ１０３では、対応するアドバンテージツールに基づいて最適化すべきカテゴリの訓練データを生成する。 In step 103, training data for the category to be optimized is generated based on the corresponding advantage tool.

ステップ１０４では、生成された訓練データを用いて最適化すべきモデルを更新する。 In step 104, the generated training data is used to update the model to be optimized.

上記の方法の実施形態で説明された技術案によれば、ツールセットにおけるツールを介して最適化すべきモデルにおける問題、即ち最適化すべきカテゴリを特定し、そしてそれに応じて最適化すべきカテゴリの訓練データを生成し、さらに、生成された訓練データに基づいて最適化すべきモデルを更新することができる。すなわち、最適化すべきモデルにおける問題を指向的に解決し、さらにモデルの性能を向上可能なデータ駆動の指向性モデル最適化方式を提案した。 According to the technical proposal described in the above method embodiment, the problem in the model to be optimized, i.e. the category to be optimized, is identified through the tools in the toolset, and the training data of the category to be optimized accordingly. The model to be optimized can be updated based on the generated training data. That is, we proposed a data-driven directional model optimization method that can directionally solve problems in the model to be optimized and further improve model performance.

前記方法の実施形態で説明した技術案は、ディープラーニングや自然言語処理などの人工知能の分野に適用することができる。具体的には、例えば、検索シナリオに適用する場合に、前記最適化すべきモデルはセマンティックマッチングモデルであり、前記処理すべきデータはユーザにより入力されたクエリ（ｑｕｅｒｙ）であり、前記処理結果はｑｕｅｒｙによりリコールされた候補結果をソートした結果であって良い。前記異なるカテゴリとは、異なるｑｕｅｒｙカテゴリを指し、例えば、質問系、キーワード検索系、ウェブアドレス検索系及び音声・動画再生系等のカテゴリに分けても良く、ゲーム系、自動車系、生活常識系及び医療系等のカテゴリに分けても良く、どのように異なるカテゴリに分けるかは、実際の必要に応じて決定することができる。評価データセットにおけるｑｕｅｒｙ、例えばテキストフォーマットのｑｕｅｒｙについては、セマンティックマッチングモデルおよびツールセットにおけるツールをそれぞれ利用して対応する処理結果を取得し、そして、処理結果に基づいてセマンティックマッチングモデルおよびツールの処理効果の優劣を比較し、さらに、比較結果に基づいて、最適化すべきカテゴリおよび対応するアドバンテージツールを特定し、さらに、対応するアドバンテージツールに基づいて最適化すべきカテゴリの訓練データを生成し、生成された訓練データを用いてセマンティックマッチングモデルを更新することができる。以上のように最適化されたセマンティックマッチングモデルが得られた後、これを実用化することができる。つまり、ユーザが入力したｑｕｅｒｙに対してリコールの候補結果を得、最適化されたセマンティックマッチングモデルを用いてリコールの候補結果をソートすることができ、それに応じてソート結果の正確性を向上させることができる。 The technical solution described in the method embodiment can be applied to the field of artificial intelligence such as deep learning and natural language processing. Specifically, for example, when applied to a search scenario, the model to be optimized is a semantic matching model, the data to be processed is a query input by a user, and the processing result is a query It may be the result of sorting the candidate results recalled by. The different categories refer to different query categories, and may be divided into categories such as questions, keyword searches, web address searches, audio/video playback, etc., as well as games, automobiles, common knowledge, and They may be divided into categories such as medical, and how they are divided into different categories can be determined according to actual needs. For queries in the evaluation data set, for example queries in text format, the corresponding processing results are obtained using the semantic matching model and the tools in the toolset, respectively, and the processing effects of the semantic matching model and tools are evaluated based on the processing results. Furthermore, based on the comparison results, the category to be optimized and the corresponding advantage tool are identified, and training data for the category to be optimized is generated based on the corresponding advantage tool. The training data can be used to update the semantic matching model. After the optimized semantic matching model is obtained as described above, it can be put into practical use. In other words, it is possible to obtain recall candidate results for a query input by the user, sort the recall candidate results using an optimized semantic matching model, and improve the accuracy of the sort results accordingly. I can do it.

本開示の一実施形態では、ツールセットは、最適化すべきモデルの履歴バージョンモデル、最適化すべきモデルの同種モデル、最適化すべきモデルの同種システム、単語文法ツール、構文ツール、知識ベース、というツールの１つまたは任意の組み合わせを含むことができる。前記同種は同じ機能を有することを示す。 In one embodiment of the present disclosure, the toolset includes the following tools: a historical version model of the model to be optimized, a homogeneous model of the model to be optimized, a homogeneous system of the model to be optimized, a word grammar tool, a syntax tool, a knowledge base. Can include one or any combination. The same species indicates that they have the same function.

実際の応用では、「沢山の長所を広く取得する」という考えに基づいて、できるだけ多くのツールをツールセットに加えることができる。それに応じて、ツールセットに上記のすべての内容を含めることで、後続で構築される訓練データの数やタイプなどを向上させ、モデルの最適化効果を高めることができる。 In practical applications, as many tools as possible can be added to the toolset, based on the idea of ``capturing many advantages broadly''. Accordingly, by including all of the above in the toolset, it is possible to improve the number and type of training data that is subsequently constructed, increasing the effectiveness of model optimization.

ここで、最適化すべきモデルがセマンティックマッチングモデルであることを例にすると、最適化すべきモデルの履歴バージョンモデルは、過去に得られた各バージョンのセマンティックマッチングモデルを指す。履歴バージョンモデルは、最適化すべきモデルに比べて、全体的な機能が劣るが、あるカテゴリまたは特定のカテゴリでは、より優れたパフォーマンスを発揮する場合がある。このため、最適化すべきモデルのすべての履歴バージョンをツールセットに追加して良い。 Here, assuming that the model to be optimized is a semantic matching model, the history version model of the model to be optimized refers to each version of the semantic matching model obtained in the past. The historical version model has poor overall functionality compared to the model to be optimized, but may perform better in certain or specific categories. For this reason, all historical versions of the model to be optimized may be added to the toolset.

さらに、最適化すべきモデルがあるルールに基づくセマンティックマッチングモデルであると仮定すると、同種モデルは他のルールに基づくセマンティックマッチングモデルであってよい。さらに、様々な検索エンジンを最適化すべきモデルの同種システムとして良い。 Further, assuming that the model to be optimized is a certain rule-based semantic matching model, the homogeneous model may be another rule-based semantic matching model. Furthermore, it may be a homogeneous system of models to be optimized for various search engines.

更に、ある処理すべきデータがユーザが入力したｑｕｅｒｙ「＊＊（ある都市名）の平均住宅価格はいくらですか」であるとすると、単語文法ツールを用いてｑｕｅｒｙ中のキーワードの１つが「＊＊」であることを特定することができる。それに応じて、候補結果に「＊＊」が出現しなければ、あまり関連性のない回答である可能性が高く、下位に順位付けすべきであるという原則に基づいて、リコールの候補結果をソートすることができる。構文ツールや知識ベースなども同様の機能を実現することができる。 Furthermore, if the data to be processed is the query input by the user, ``What is the average housing price in ** (a certain city name)?'', then one of the keywords in the query can be calculated using a word grammar tool, such as ``* *" can be specified. Accordingly, the candidate results for recall are sorted based on the principle that if "**" does not appear in the candidate result, it is likely that the answer is not very relevant and should be ranked lower. can do. Syntax tools, knowledge bases, etc. can also provide similar functionality.

本開示の一実施形態では、評価データセットは、各処理すべきデータにそれぞれ対応する正しい処理結果をさらに含むことができる。 In an embodiment of the present disclosure, the evaluation data set may further include correct processing results corresponding to each piece of data to be processed.

例えば、探索シナリオにおいて、処理すべきデータがユーザが入力したｑｕｅｒｙであり、対応する処理結果がｑｕｅｒｙに基づいてリコールされた各候補結果に対する正しいソート順であって良い。 For example, in a search scenario, the data to be processed may be a user-entered query, and the corresponding processing result may be the correct sort order for each candidate result recalled based on the query.

評価データセットの生成方法に制限がなく、評価データセットに含まれる処理すべきデータの具体的な数に制限もなく、実際の必要に応じて決定して良い。さらに、評価データセットにおける処理すべきデータは、少なくとも２つの異なるカテゴリに属する必要がある。前記少なくとも２つの異なるカテゴリがそれぞれどのようなカテゴリであるかは、同様に実際の必要に応じて決定して良い。例えば、処理すべきデータがユーザにより入力されたｑｕｅｒｙである場合に、あるカテゴリは、ｑｕｅｒｙにめったに見ない単語が含まれることであってよい。 There is no limit to the method of generating the evaluation data set, and there is no limit to the specific number of data to be processed included in the evaluation data set, which may be determined according to actual needs. Furthermore, the data to be processed in the evaluation data set must belong to at least two different categories. The categories of the at least two different categories may also be determined depending on actual needs. For example, if the data to be processed is a query input by a user, one category may be that the query contains rarely seen words.

さらに、評価データセットに１００（列挙であるが、実際にはこれよりもはるかに大きい場合がある）個の処理すべきデータが含まれていると仮定する。これらのデータは、それぞれ１０個の異なるカテゴリに属して良く、各カテゴリには、それぞれ１０個の処理すべきデータが含まれて良い。 Further assume that the evaluation data set contains 100 (an enumeration, but in practice it may be much larger) pieces of data to be processed. Each of these data may belong to ten different categories, and each category may contain ten pieces of data to be processed.

評価データセットにおける処理すべきデータに対して、最適化すべきモデル及びツールセットにおけるツールの処理効果の優劣を比較することができる。 It is possible to compare the processing effects of the model to be optimized and the tools in the toolset on the data to be processed in the evaluation data set.

ここで、ツールセットにおける何れか一つのツールは、個別動作可能なツールであっても良く、個別動作不可能なツールであっても良い。それに応じて、最適化すべきモデル及びツールの処理効果の優劣を比較する場合に、以下で説明するように異なる処理方法を採用することができる。 Here, any one tool in the tool set may be a tool that can be operated individually, or a tool that cannot be operated individually. Accordingly, when comparing the processing effects of models and tools to be optimized, different processing methods can be adopted as described below.

１）個別動作可能なツール
本開示の一実施形態では、最適化すべきモデルを用いて評価データセットにおける各処理すべきデータに対応する第１処理結果を取得することができる。そして、ツールセットにおける何れか一つの個別動作可能なツールについて、ツールを用いて評価データセットにおける各処理すべきデータに対応する第２処理結果を取得し、前記正しい処理結果、第１処理結果および第２処理結果に基づいて、異なるカテゴリに対する最適化すべきモデルとツールの処理効果の優劣を判定する処理をそれぞれ行うことができる。 1) Tools that can operate individually In an embodiment of the present disclosure, a model to be optimized can be used to obtain a first processing result corresponding to each data to be processed in the evaluation data set. Then, for any one of the tools in the tool set that can operate individually, the second processing result corresponding to each data to be processed in the evaluation data set is obtained using the tool, and the correct processing result, the first processing result, and Based on the second processing results, processing can be performed to determine the superiority or inferiority of the processing effects of the model to be optimized and the tool for different categories.

最適化すべきモデルがセマンティックマッチングモデルであることを例にすると、ツールセットにおけるほとんどのツールは、履歴バージョンモデル、同種モデル、単語文法ツールなどのような個別動作可能なツールである。個別動作可能なツールとは、直接にツールを使用して評価データセットにおける各処理すべきデータに対応する候補結果をソートすることができるツールのことである。逆に、ある検索エンジンに対して、直接にこのツールを用いて評価データセットにおける各処理すべきデータに対する候補結果をソートすることができない場合、このツールは個別動作不可能なツールである。 For example, if the model to be optimized is a semantic matching model, most of the tools in the toolset are independently operable tools, such as historical version models, homogeneous models, word grammar tools, etc. A separately operable tool is one that can be used directly to sort candidate results corresponding to each piece of data to be processed in the evaluation data set. Conversely, if the tool cannot be used directly for a search engine to sort candidate results for each data item in the evaluation data set, then the tool is an inoperable tool.

これに応じて、最適化すべきモデルを用いて評価データセットにおける各処理すべきデータに対応する第１処理結果を取得することは、最適化すべきモデルを使用して評価データセットにおける各処理すべきデータに対応する候補結果をソートすることを意味し、得られたソート結果が第１処理結果である。同様に、ツールを用いて評価データセットにおける各処理すべきデータに対応する第２処理結果を取得することは、ツールを用いて評価データセットにおける各処理すべきデータに対応する候補結果をソートすることを意味し、得られたソート結果が第２処理結果である。 Accordingly, obtaining a first processing result corresponding to each data to be processed in the evaluation data set using the model to be optimized means This means sorting the candidate results corresponding to the data, and the obtained sorting result is the first processing result. Similarly, obtaining a second processing result corresponding to each data to be processed in the evaluation data set using the tool includes sorting candidate results corresponding to each data to be processed in the evaluation data set using the tool. This means that the obtained sorting result is the second processing result.

このような処理により、正しい処理結果と、最適化すべきモデルとツールのそれぞれに対応する処理結果とを組み合わせて、最適化すべきモデルとツールの処理効果の優劣を効率的かつ正確に判定することができる。 Through such processing, it is possible to efficiently and accurately determine the superiority or inferiority of the processing effects of the model to be optimized and the tool by combining the correct processing result and the processing result corresponding to each of the model and tool to be optimized. can.

具体的には、本開示の一実施形態では、いずれか一つのカテゴリについて、それぞれ評価データセットにおける当該カテゴリに属する処理すべきデータに対応する正しい処理結果、第１処理結果、および第２処理結果に基づいて、最適化すべきモデルとツールの処理効果の優劣を判定することができる。それに応じて、ツールの処理効果が最適化すべきモデルよりも優れていると判定されたことに応じて、そのカテゴリを最適化すべきカテゴリとし、ツールをアドバンテージツールとすることができる。 Specifically, in one embodiment of the present disclosure, for any one category, correct processing results, first processing results, and second processing results corresponding to data to be processed belonging to the category in the evaluation data set, respectively. Based on this, it is possible to judge the processing effectiveness of the model to be optimized and the tool. Accordingly, in response to determining that the processing effect of the tool is superior to the model to be optimized, the category can be set as the category to be optimized and the tool can be set as an advantage tool.

たとえば、カテゴリａについて、１０個の処理すべきデータが含まれていると仮定する。表現しやすいため、それぞれ処理すべきデータ１～処理すべきデータ１０と呼ばれる場合に、正しい処理結果と比較することにより、処理すべきデータ１～処理すべきデータ１０のそれぞれに対応する第１処理結果のうち、正しい第１処理結果の数を決定し、そして、処理すべきデータ１～処理すべきデータ１０のそれぞれに対応する第２処理結果（ツールｂを用いて得られたものとする）のうち、正しい第２処理結果の数を決定し、そして、それぞれの正確率を取得して最適化すべきモデルおよびツールｂの評価指標とすることができる。ツールｂの評価指標が最適化すべきモデルの評価指標よりも優れていると判定されれば、ツールｂの処理効果が最適化すべきモデルよりも優れていると判定することができる。逆の場合に、最適化すべきモデルの処理効果がツールｂよりも優れていると判定することができる。 For example, assume that category a contains 10 pieces of data to be processed. For ease of expression, when they are called data to be processed 1 to data to be processed 10, the first processing corresponding to each of data to be processed 1 to data to be processed 10 is determined by comparing with the correct processing result. Among the results, determine the number of correct first processing results, and then determine the second processing results corresponding to each of data to be processed 1 to data to be processed 10 (assumed to be obtained using tool b). Among them, the number of correct second processing results can be determined, and the accuracy rate of each can be obtained and used as an evaluation index of the model to be optimized and tool b. If it is determined that the evaluation index of tool b is better than the evaluation index of the model to be optimized, it can be determined that the processing effect of tool b is superior to that of the model to be optimized. In the opposite case, it can be determined that the processing effect of the model to be optimized is superior to tool b.

説明すべきなのは、以上の評価指標の取得方法は例示的な説明に過ぎず、本開示の技術案を限定するためのものではなく、具体的にどのように評価指標を設定するかは実際の必要に応じて決定することができる。 What should be explained is that the above evaluation index acquisition method is only an exemplary explanation and is not intended to limit the technical proposal of the present disclosure, and how to specifically set the evaluation index depends on the actual situation. It can be determined as necessary.

以上の処理により、異なるカテゴリに対して、最適化すべきモデルとツールの処理効果の優劣をそれぞれ判定し、それに応じて最適化すべきカテゴリと対応するアドバンテージツールを特定することができ、さらに後続の処理に良好な基礎を築くことができる。 Through the above processing, it is possible to judge the processing effects of models and tools to be optimized for different categories, and to identify categories to be optimized and corresponding advantage tools accordingly. can build a good foundation.

２）個別動作不可能なツール
ツールセットにおける何れか一つの個別動作不可能なツールに対して、ツールを用いて評価データセットにおける各処理すべきデータに対応する第３処理結果を取得し、そして、いずれか一つの処理すべきデータについて、最適化すべきモデルを用いてその第３処理結果が正しいか否かをそれぞれ検証し、検証結果が不正確であると判定されたことに応じて、人工による判定結果を取得し、人工による判定結果に基づいて、当該処理すべきデータに対する最適化すべきモデルとツールとの処理効果の優劣を判定する、という処理をそれぞれ実行することができる。 2) Tools that cannot be operated individually For any one tool that cannot be operated individually in the tool set, use the tool to obtain a third processing result corresponding to each data to be processed in the evaluation data set, and , for any one of the data to be processed, use the model to be optimized to verify whether the third processing result is correct, and if the verification result is determined to be inaccurate, artificially It is possible to perform a process of obtaining the judgment results obtained by the method, and determining the processing effects of the model to be optimized and the tool on the data to be processed, based on the judgment results obtained by the artificial judgment.

最適化すべきモデルがセマンティックマッチングモデルであり、処理すべきデータがユーザが入力したｑｕｅｒｙであることを例にする。検索エンジンなどのツールについて、このようなツールを直接に用いて評価データセットにおける各処理すべきデータに対応する候補結果をソートすることができないため、このようなツールが個別動作不可能なツールであることに応じて逆方向に動作するしかない。 As an example, the model to be optimized is a semantic matching model, and the data to be processed is a query input by a user. For tools such as search engines, it is not possible to directly use such tools to sort candidate results corresponding to each piece of data to be processed in the evaluation data set, so such tools are incapable of operating independently. We have no choice but to work in the opposite direction depending on what happens.

具体的には、評価データセットにおけるいずれか一つのｑｕｅｒｙについて、検索エンジンに入力して検索エンジンからの検索結果であるリコールの各候補結果および各候補結果のソート順位、即ち上記の第３処理結果を得ることができる。その後、最適化すべきモデルを用いて各処理すべきデータに対する第３処理結果が正しいかどうかをそれぞれ検証することができる。例えば、第３処理結果ごとに、最適化すべきモデルを使用して各候補結果をそれぞれ再ソートし、再ソート結果が第３処理結果と一致するかどうかを比較し、一致しなければ、検証結果が不正確であると判定することができる。それに応じて、どちらのソート結果がより正確であるかを関係者が判定するという人工による判定結果を得、さらに、この人工による判定結果に基づいて、各処理すべきデータに対する最適化すべきモデルとツールの処理効果の優劣を判定することができる。 Specifically, for any one query in the evaluation data set, each candidate result of recall which is the search result from the search engine by inputting it into a search engine, and the sorting rank of each candidate result, that is, the third processing result described above. can be obtained. Thereafter, it is possible to verify whether the third processing results for each data to be processed are correct using the model to be optimized. For example, for each third processing result, each candidate result is re-sorted using the model to be optimized, compared whether the re-sorted result matches the third processing result, and if not, the verification result is can be determined to be inaccurate. Accordingly, the parties involved determine which sorting result is more accurate, obtaining an artificial judgment result, and then determining the model to be optimized for each data to be processed based on this artificial judgment result. It is possible to determine the superiority or inferiority of the processing effect of the tool.

本開示の一実施形態では、いずれか一つの処理すべきデータに対して、人工による判定結果に基づいて、当該処理すべきデータに対する最適化すべきモデルとツールの処理効果の優劣を判定することは、人工による判定結果は当該処理すべきデータに対応する第３処理結果が正しいであることに応じて、最適化すべきモデルよりもツールの方が優れていることを判定し、それ以外の場合に最適化すべきモデルの処理効果がツールよりも優れていると判定し、それに応じて何れか一つのカテゴリについて、評価データセットにおける当該カテゴリに属する処理すべきデータのうち、ツールの処理効果が最適化すべきモデルよりも優れている処理すべきデータの数が、最適化すべきモデルの処理効果がツールよりも優れている処理すべきデータの数よりも多いと判定されたことに応して、当該カテゴリを最適化すべきカテゴリとし、ツールをアドバンテージツールとすることを含んでよい。 In an embodiment of the present disclosure, it is possible to determine the superiority or inferiority of the processing effects of a model to be optimized and a tool for any one data to be processed based on an artificial determination result. , the artificial judgment result determines that the tool is better than the model to be optimized depending on whether the third processing result corresponding to the data to be processed is correct, and in other cases It is determined that the processing effect of the model to be optimized is better than that of the tool, and accordingly, for any one category, the processing effect of the tool is optimized among the data to be processed belonging to that category in the evaluation dataset. In response to determining that the number of data to be processed that is better than the model to be optimized is greater than the number of data to be processed for which the processing effect of the model to be optimized is better than the tool, the corresponding category This may include determining the category to be optimized and the tool as an advantage tool.

カテゴリｂに処理すべきデータが１０個含まれており、このうち５つの処理すべきデータはツール（ツールｃと仮定）の処理効果が最適化すべきモデルの処理効果よりも優れており、２つの処理すべきデータは最適化すべきモデルの処理効果がツールの処理効果よりも優れており、残りの３つの処理すべきデータは最適化すべきモデルとツールの処理効果が同じであると仮定すると、カテゴリｂを最適化すべきカテゴリとし、ツールｃをアドバンテージツールとすることができる。 Category b contains 10 pieces of data to be processed, and for 5 of these data, the processing effect of the tool (assumed to be tool c) is superior to that of the model to be optimized, and two For the data to be processed, assuming that the processing effect of the model to be optimized is better than the processing effect of the tool, and for the remaining three data to be processed, the processing effect of the model to be optimized and the processing effect of the tool are the same, then the category b can be a category to be optimized, and tool c can be an advantage tool.

上記の説明から分かるように、本開示に記載された態様は、個別動作可能なツールであっても、個別動作不可能なツールであっても、適用可能であり、すなわち、汎用性を有する。 As can be seen from the above description, the aspects described in this disclosure are applicable to both individually operable and non-individually operable tools, i.e., have versatility.

最適化すべきカテゴリとは、問題が存在するカテゴリである。その後、データ駆動型の方法で問題を解決することができる。つまり、対応するアドバンテージツールに基づいて最適化すべきカテゴリの訓練データを生成し、生成された訓練データを用いて最適化すべきモデルを更新することができる。 The category to be optimized is the category in which the problem exists. The problem can then be solved in a data-driven manner. That is, training data for a category to be optimized can be generated based on the corresponding advantage tool, and the model to be optimized can be updated using the generated training data.

本開示の一実施形態では、何れか一つの最適化すべきカテゴリについて、最適化すべきカテゴリに属する処理すべきデータをデータベースから取得し、最適化すべきカテゴリに対応するアドバンテージツールの数が１であることに応答して、取得された処理すべきデータに基づいて、アドバンテージツールに基づいて最適化すべきカテゴリの訓練データを生成し、最適化すべきカテゴリに対応するアドバンテージツールの数が１よりも大きいであることに応じて、取得された処理すべきデータに基づいて、各アドバンテージツールに基づいて最適化すべきカテゴリの訓練データをそれぞれ生成する、という処理をそれぞれ実行することができる。 In one embodiment of the present disclosure, for any one category to be optimized, data to be processed belonging to the category to be optimized is obtained from the database, and the number of advantage tools corresponding to the category to be optimized is one. In response, based on the obtained data to be processed, generate training data for the category to be optimized based on the advantage tool, and the number of advantage tools corresponding to the category to be optimized is greater than 1. Depending on the situation, it is possible to perform a process of respectively generating training data for categories to be optimized based on each advantage tool based on the acquired data to be processed.

ツールセットにツールａ、ツールｂ、ツールｃという３つのツールがあると仮定する。また、評価データセットにおける処理すべきデータがそれぞれカテゴリ１～カテゴリ１０という１０個の異なるカテゴリに属し、最適化すべきモデルに比べて、ツールａの処理効果がカテゴリ３～５において優れており、ツールｂの処理効果がカテゴリ４～６において優れており、ツールｃの処理効果がカテゴリ５～７において優れていると仮定すると、カテゴリ３、カテゴリ４、カテゴリ５、カテゴリ６、およびカテゴリ７は、最適化すべきカテゴリである。また、カテゴリ３については、対応するアドバンテージツールであるツールａの数は１つであるため、それに応じてツールａに基づいてカテゴリ３の訓練データを生成することができる。カテゴリ５については、ツールａ、ツールｂ、およびツールｃを含む対応するアドバンテージツールの数が１より多いため、それに応じてツールａに基づいてカテゴリ５の訓練データを生成し、ツールｂに基づいてカテゴリ５の訓練データを生成し、ツールｃに基づいてカテゴリ５の訓練データを生成することができる。他の説明は省略する。 Assume that there are three tools in the toolset: tool a, tool b, and tool c. In addition, the data to be processed in the evaluation dataset belong to 10 different categories, category 1 to category 10, and the processing effect of tool a is superior in categories 3 to 5 compared to the model to be optimized. Assuming that the processing effect of tool b is better in categories 4 to 6 and the processing effect of tool c is better in categories 5 to 7, categories 3, 4, 5, 6, and 7 are optimal This is a category that should be developed. Regarding category 3, since the number of tool a, which is the corresponding advantage tool, is one, training data for category 3 can be generated based on tool a accordingly. For category 5, the number of corresponding advantage tools including tool a, tool b, and tool c is more than 1, so we generate training data for category 5 based on tool a and based on tool b accordingly. Category 5 training data can be generated, and category 5 training data can be generated based on tool c. Other explanations will be omitted.

どのようにデータベースから最適化すべきカテゴリの処理すべきデータを取得するかは制限されず、例えば、データベースにおける最適化すべきカテゴリに属するｑｕｅｒｙから一部のｑｕｅｒｙをランダムに抽出することができる。最適化すべきカテゴリはｑｕｅｒｙにめったに見ない単語が含まれることである場合に、対応するカテゴリのｑｕｅｒｙから一部のｑｕｅｒｙをランダムに抽出して良く、抽出されたｑｕｅｒｙが何れもめったに見ない単語を含む。 There are no restrictions on how the data to be processed of the category to be optimized is acquired from the database; for example, some queries may be randomly extracted from the queries belonging to the category to be optimized in the database. If the category to be optimized is that queries include words that are rarely seen, some queries may be randomly extracted from the queries of the corresponding category, and all of the extracted queries may contain words that are rarely seen. include.

以上の処理により、異なるアドバンテージツールに基づいて生成された訓練データを得ることができるため、訓練データの内容を豊かにし、モデルの最適化効果を向上させることができる。 Through the above processing, it is possible to obtain training data generated based on different advantage tools, thereby enriching the content of the training data and improving the optimization effect of the model.

実際の応用では、上記のカテゴリ５のように各ツールのアドバンテージが特定のカテゴリや幾つかのカテゴリに集中している場合がある。それに応じて、他のカテゴリに比べて。カテゴリ５に対応する訓練データが多くなる。カテゴリによっては対応する訓練データの数の割合のばらつきが大きくなるという問題を回避するために、生成された訓練データを用いて最適化モデルを更新する前に、生成された各カテゴリに対応する訓練データに重複除去やサンプリングなどの処理を行うことで、訓練データセット全体における異なるカテゴリの訓練データの割合が比較的に均等になり、さらにモデルの最適化効果などを一層に向上させることができる。 In actual applications, the advantages of each tool may be concentrated in a specific category or several categories, as in category 5 above. Compared to other categories accordingly. The amount of training data corresponding to category 5 increases. In order to avoid the problem that the proportion of the number of corresponding training data varies greatly depending on the category, before updating the optimization model using the generated training data, perform training corresponding to each generated category. By performing processes such as deduplication and sampling on the data, the ratio of training data of different categories in the entire training data set becomes relatively equal, which further improves the optimization effect of the model.

本開示の一実施形態では、いずれか一つのアドバンテージツールに対して、アドバンテージツールに基づいて最適化すべきカテゴリの訓練データを生成する際に、取得された各処理すべきデータについて、アドバンテージツールを使用して対応する第４処理結果を取得し、そして最適化すべきモデルを用いて対応する第５処理結果を取得した後、取得された各処理すべきデータの中から、第４処理結果と第５処理結果とが一致しない条件を満たす処理すべきデータを選択し、さらに、選択された処理すべきデータに基づいて訓練データを生成するようにしてもよい。 In an embodiment of the present disclosure, when generating training data of a category to be optimized based on the advantage tool for any one advantage tool, the advantage tool is used for each acquired data to be processed. After obtaining the corresponding fourth processing result using the model to be optimized and obtaining the corresponding fifth processing result using the model to be optimized, the fourth processing result and the fifth processing result are selected from among the obtained data to be processed. It is also possible to select data to be processed that satisfies a condition that the processing result does not match, and to generate training data based on the selected data to be processed.

たとえば、取得された各ｑｕｅｒｙに対して、それぞれに対応する候補結果を取得して良い。どのように取得するかに制限はない。その後、それぞれアドバンテージツールおよび最適化すべきモデルを用いて各ｑｕｅｒｙに対応する候補結果をソートして各ｑｕｅｒｙに対応する第４処理結果および第５処理結果をそれぞれ得られる。いずれか一つのｑｕｅｒｙに対応する第４処理結果と第５処理結果とが一致しなければ、そのｑｕｅｒｙを選択された処理すべきデータとし、さらに選択された処理すべきデータに基づいて所望の訓練データを生成することができる。 For example, for each acquired query, corresponding candidate results may be acquired. There are no restrictions on how you can obtain it. Thereafter, the candidate results corresponding to each query are sorted using the advantage tool and the model to be optimized, respectively, to obtain a fourth processing result and a fifth processing result corresponding to each query. If the fourth processing result and the fifth processing result corresponding to any one query do not match, that query is treated as the selected data to be processed, and the desired training is performed based on the selected data to be processed. Data can be generated.

以上の処理により、所望の処理すべきデータを正確かつ効果的に選択することができる。さらに、少ない処理すべきデータを用いて訓練データを生成するだけで、より良いモデル最適化効果が得られることが実験的に証明された。 Through the above processing, desired data to be processed can be selected accurately and effectively. Furthermore, it has been experimentally proven that better model optimization effects can be obtained by simply generating training data using less data to be processed.

本発明の一実施形態において、選択された処理すべきデータに基づいて訓練データを生成する場合に、選択された処理すべきデータと、選択された処理すべきデータに対する第４処理結果および／または第５処理結果の人工注釈結果とを組み合わせて訓練データを生成することができる。 In an embodiment of the present invention, when generating training data based on the selected data to be processed, the selected data to be processed and the fourth processing result for the selected data to be processed and/or Training data can be generated by combining the fifth processing result with the artificial annotation result.

例えば、選択された１つの処理すべきデータに対して、対応する第４処理結果と第５処理結果を人工で判定することができる。第４処理結果が正しく、第５処理結果が正しくないと判定された場合に、当該処理すべきデータおよび第４処理結果を用いてラベルが１であり、即ち正のサンプルである訓練データを生成し、そして当該処理すべきデータと第５処理結果とを用いてラベルが０であり、即ち負のサンプルである訓練データを生成し、或いは、第４処理結果および第５処理結果の両方が正しくないと判定された場合に、処理結果の一方を手動で修正して正のサンプルを取得し、修正されていない処理結果に基づいて負のサンプルを生成したりすることができる。 For example, for one selected piece of data to be processed, the corresponding fourth processing result and fifth processing result can be determined manually. When it is determined that the fourth processing result is correct and the fifth processing result is incorrect, training data with a label of 1, that is, a positive sample, is generated using the data to be processed and the fourth processing result. Then, using the data to be processed and the fifth processing result, generate training data whose label is 0, that is, a negative sample, or if both the fourth processing result and the fifth processing result are correct. If it is determined that there is no such difference, one of the processing results can be manually corrected to obtain a positive sample, and a negative sample can be generated based on the uncorrected processing result.

人工注釈により、生成された訓練データの正確性を向上し、ひいてはモデルの最適化効率や最適化効果などを向上させることができる。 Artificial annotations can improve the accuracy of the generated training data and, in turn, improve the optimization efficiency and effectiveness of the model.

本開示の一実施形態では、上記のようにアドバンテージツールを用いて最適化すべきカテゴリの訓練データを生成することに加えて、取得された、最適化すべきモデルのユーザ行動ログに基づいて生成された評価結果である人工評価結果に基づいて訓練データを生成し、生成された訓練データを用いて最適化すべきモデルを更新することもできる。 In one embodiment of the present disclosure, in addition to generating training data for the category to be optimized using the advantage tool as described above, training data for the category to be optimized is generated based on the obtained user behavior log of the model to be optimized. It is also possible to generate training data based on artificial evaluation results that are evaluation results, and to update the model to be optimized using the generated training data.

最適化すべきモデルは使用が開始された後、一般的にユーザ行動ログを記録することになる。一方、モデルの効果を担うエンジニアは、定期的に人工で評価することでモデルの効果の良否を判断する。例えば、あるｑｕｅｒｙについて、ユーザが第４位の候補結果をクリックし且つそれを読むのに時間を費やしたことが判明された場合、当該候補結果が上位３位の候補結果よりも優れていると考えられる。それに応じて、候補結果のソート順序を修正して正のサンプルを生成し、修正前のソート順序に基づいて負のサンプルを生成することができる。 Once the model to be optimized is put into use, it will generally record user behavior logs. On the other hand, the engineers responsible for the effectiveness of the model determine whether the model is effective or not by periodically performing artificial evaluations. For example, if it turns out that for a certain query, the user clicked on the fourth candidate result and spent time reading it, then that candidate result is better than the top three candidate results. Conceivable. Accordingly, the sort order of the candidate results can be modified to generate positive samples, and negative samples can be generated based on the unmodified sort order.

以上の処理により、直接にユーザ行動ログに基づいて最適化すべきモデルに存在する問題を発見すると共に、それに応じて訓練データを生成し、モデルの最適化効果をさらに向上させることができる。 Through the above processing, it is possible to directly discover problems existing in the model to be optimized based on user behavior logs, generate training data accordingly, and further improve the optimization effect of the model.

前記説明のように、図２は、本開示に記載されたモデル更新方法の全体的な実現プロセスの概略図である。図２に示すように、ツールセットにおけるツールを使用して最適化すべきモデルに存在する問題を発見し、ツールセットにおけるツールを用いて発見された問題に対して問題を解決し、即ち対応する訓練データを生成し、生成された訓練データに基づいて最適化すべきモデルを更新し、さらに上述のプロセスを繰り返すことができる。具体的な実現は前述の関連説明を参照することができ、ここでは詳しく説明しない。 As mentioned above, FIG. 2 is a schematic diagram of the overall implementation process of the model update method described in this disclosure. As shown in Figure 2, the tools in the toolset are used to discover problems that exist in the model to be optimized, and the tools in the toolset are used to solve problems for the discovered problems, i.e., the corresponding training The data can be generated, the model to be optimized can be updated based on the generated training data, and the process described above can be repeated. The specific implementation can refer to the related explanation above, and will not be described in detail here.

なお、前述の方法の実施形態について、説明の簡素化のため、実施例を一連の動作の組み合わせとして表現したが、当業者であればわかるように、本出願による幾らかのステップは他の順序に従って又は同時に実行することができるため、本出願は記述された動作の順序に制限されない。次に、当業者は、明細書に記述された実施例はいずれも好ましい実施例に該当し、関連する動作及びモジュールは必ずしも本出願によって必要とされないことを理解すべきである。 It should be noted that while the method embodiments described above are expressed as a series of combinations of operations for ease of explanation, those skilled in the art will recognize that some steps in accordance with the present application may be performed in other orders. This application is not limited to the order of the operations described, as they may be performed in accordance with or concurrently. Next, those skilled in the art should understand that any embodiments described in the specification constitute preferred embodiments, and that the associated acts and modules are not necessarily required by the present application.

要約すると、本開示の方法の実施形態で説明された方法によれば、さまざまなツールを使用して最適化すべきモデルに存在する問題を発見し、更にツールの種類を豊富にすることで問題のカバー範囲を向上させることができる。また、発見された問題はすべてモデルでは解決できない真の問題であり、問題ごとに訓練データを指向的に生成することができるため、モデルの最適化効果を向上させ、モデルの性能を向上させた。 In summary, the method described in the method embodiments of the present disclosure uses various tools to discover problems that exist in the model to be optimized, and further improves the problem by increasing the variety of tools. Coverage can be improved. In addition, all discovered problems are real problems that cannot be solved by the model, and training data can be generated directionally for each problem, improving the optimization effect of the model and improving the performance of the model. .

以上は、方法の実施形態についての説明であり、以下では、装置の実施形態を介して、本開示に記載された態様についてさらに説明する。 The above is a description of method embodiments, and the aspects described in this disclosure are further described below through apparatus embodiments.

図３は、本開示に記載されたモデル更新装置の実施形態３００の構成の概略図である。図３に示すように、効果比較モジュール３０１と、情報特定モジュール３０２と、サンプル生成モジュール３０３と、モデル更新モジュール３０４とを備える。 FIG. 3 is a schematic diagram of a configuration of an embodiment 300 of a model update apparatus described in this disclosure. As shown in FIG. 3, it includes an effect comparison module 301, an information identification module 302, a sample generation module 303, and a model update module 304.

効果比較モジュール３０１は、評価データセットにおける処理すべきデータであって、少なくとも２つの異なるカテゴリに属する処理すべきデータについて、最適化すべきモデルとツールセットにおけるツールをそれぞれ用いて対応する処理結果を取得し、前記処理結果に基づいて最適化すべきモデルとツールとの処理効果の優劣を比較する。 The effect comparison module 301 obtains corresponding processing results for data to be processed in the evaluation data set and belonging to at least two different categories using the model to be optimized and the tools in the toolset, respectively. Then, based on the processing results, the processing effects of the model to be optimized and the tool are compared.

情報特定モジュール３０２は、比較結果に基づいて、ツールの処理効果が最適化すべきモデルよりも優れているカテゴリである最適化すべきカテゴリと、対応する、処理効果が最適化すべきモデルよりも優れているツールであるアドバンテージツールとを特定する。 Based on the comparison results, the information identification module 302 identifies a category to be optimized, which is a category in which the processing effect of the tool is better than the model to be optimized, and a corresponding category in which the processing effect is better than the model to be optimized. Identify the tool, Advantage Tool.

サンプル生成モジュール３０３は、対応するアドバンテージツールに基づいて最適化すべきカテゴリの訓練データを生成する。 The sample generation module 303 generates training data for the category to be optimized based on the corresponding advantage tool.

モデル更新モジュール３０４は、生成された訓練データを用いて最適化すべきモデルを更新する。 The model update module 304 updates the model to be optimized using the generated training data.

上記の装置の実施形態で説明された態様によれば、ツールセットにおけるツールにより、最適化すべきモデルに存在する問題、即ち最適化すべきカテゴリを特定し、それに応じて最適化すべきカテゴリの訓練データを生成し、さらに生成された訓練データに基づいて最適化すべきモデルを更新することができる。つまり、データ駆動の指向性モデル最適化方式を提案し、最適化すべきモデルに存在する問題を指向的に解決することができるため、モデルの性能を向上させた。 According to the aspects described in the above embodiments of the apparatus, the tools in the toolset identify the problem that exists in the model to be optimized, i.e. the category to be optimized, and the training data for the category to be optimized accordingly. The model to be optimized can be updated based on the generated training data. In other words, we proposed a data-driven directional model optimization method that can directionally solve problems existing in the model to be optimized, thereby improving the performance of the model.

本開示の一実施形態では、ツールセットは、最適化すべきモデルの履歴バージョンモデル、最適化すべきモデルの同種モデル、最適化すべきモデルの同種システム、単語文法ツール、構文ツール、知識ベースのうちの１つまたは任意の組み合わせを含むことができる。前記同種は同じ機能を有することを示す。 In one embodiment of the present disclosure, the toolset includes one of a historical version model of the model to be optimized, a homogeneous model of the model to be optimized, a homogeneous system of the model to be optimized, a word grammar tool, a syntax tool, a knowledge base. or any combination thereof. The same species indicates that they have the same function.

たとえば、探索シナリオでは、処理すべきデータはユーザが入力したｑｕｅｒｙであり、対応する処理結果はｑｕｅｒｙに基づいてリコールされた各候補結果に対する正しいソート順となって良い。 For example, in a search scenario, the data to be processed may be a user-entered query, and the corresponding processing result may be the correct sort order for each candidate result recalled based on the query.

評価データセットにおける処理すべきデータに対して、最適化すべきモデルとツールセットにおけるツールの処理効果の優劣を比較することができる。 It is possible to compare the processing effects of the model to be optimized and the tools in the toolset on the data to be processed in the evaluation data set.

本開示の一実施形態では、効果比較モジュール３０１は、最適化すべきモデルを用いて、評価データセットにおける各処理すべきデータに対応する第１処理結果を取得し、ツールセットにおける何れか一つの個別動作可能なツールに対して、評価データセットにおける各処理すべきデータに対応する第２処理結果をツールで取得し、正しい処理結果と第１処理結果と第２処理結果とに基づいて、異なるカテゴリに対する最適化すべきモデルとツールの処理効果の優劣を判定する処理をそれぞれ行うことができる。 In one embodiment of the present disclosure, the effectiveness comparison module 301 uses the model to be optimized to obtain a first processing result corresponding to each data to be processed in the evaluation data set, and obtains a first processing result corresponding to each data to be processed in the evaluation data set, and For an operable tool, the tool obtains a second processing result corresponding to each data to be processed in the evaluation data set, and assigns different categories based on the correct processing result, the first processing result, and the second processing result. It is possible to perform processing to determine the superiority or inferiority of the processing effects of the model to be optimized and the tool.

最適化すべきモデルがセマンティックマッチングモデルであることを例にすると、最適化すべきモデルを用いて評価データセットにおける各処理すべきデータに対応する第１処理結果を取得することは、最適化すべきモデルを使用して、評価データセットにおける各処理すべきデータに対応する候補結果をソートすることを意味する。得られたソート結果は第１処理結果である。同様に、ツールを用いて評価データセットにおける各処理すべきデータに対応する第２処理結果を取得することは、ツールを用いて評価データセットにおける各処理すべきデータに対応する候補結果をソートすることを意味する。得られたソート結果は第２処理結果である。 For example, if the model to be optimized is a semantic matching model, obtaining the first processing result corresponding to each data to be processed in the evaluation data set using the model to be optimized means that the model to be optimized is is used to sort candidate results corresponding to each data to be processed in the evaluation data set. The obtained sorting result is the first processing result. Similarly, obtaining a second processing result corresponding to each data to be processed in the evaluation data set using the tool includes sorting candidate results corresponding to each data to be processed in the evaluation data set using the tool. It means that. The obtained sorting result is the second processing result.

本開示の一実施形態では、いずれか一つのカテゴリについて、効果比較モジュール３０１は、それぞれ評価データセットにおける当該カテゴリに属する処理すべきデータに対応する正しい処理結果、第１処理結果、および第２処理結果に基づいて、最適化すべきモデルとツールの処理効果の優劣を判定することができる。それに応じて、情報特定モジュール３０２は、ツールの処理効果が最適化すべきモデルよりも優れていると判定されたことに応じて、このカテゴリを最適化すべきカテゴリとし、ツールをアドバンテージツールとすることができる。 In an embodiment of the present disclosure, for any one category, the effect comparison module 301 respectively calculates a correct processing result, a first processing result, and a second processing result corresponding to data to be processed belonging to the category in the evaluation data set. Based on the results, it is possible to judge the processing effectiveness of the model to be optimized and the tool. Accordingly, the information identification module 302 determines that the processing effect of the tool is superior to the model to be optimized, and sets this category to be the category to be optimized and the tool to be an advantage tool. can.

本開示の一実施形態では、効果比較モジュール３０１は、ツールセットにおける何れか一つの個別動作不可能なツールに対して、ツールを用いて評価データセットにおける各処理すべきデータに対応する第３処理結果を取得し、いずれか一つの処理すべきデータに対して、最適化すべきモデルを用いてこの第３処理結果が正しいか否かをそれぞれ検証し、検証結果が不正確であると判定されたことに応じて、人工判定結果を取得し、人工判定結果に基づいて当該処理すべきデータに対する最適化すべきモデルとツールとの処理効果の優劣を判定する、という処理をそれぞれ行うことができる。 In one embodiment of the present disclosure, the effectiveness comparison module 301 performs a third process corresponding to each data to be processed in the evaluation data set using the tool for any one of the tools in the tool set that cannot be individually operated. Obtain the results, use the model to be optimized for any one of the data to be processed, and verify whether or not this third processing result is correct, and it is determined that the verification result is inaccurate. Depending on the situation, it is possible to perform a process of acquiring an artificial determination result and determining the processing effect of the model to be optimized and the tool on the data to be processed based on the artificial determination result.

本開示の一実施形態では、効果比較モジュール３０１は、人工判定結果は第３処理結果が正しいであることに応じて、ツールが最適化するモデルよりも優れていることを判定し、それ以外の場合は最適化すべきモデルの処理効果がツールよりも優れていると判定する。それに応じて、情報特定モジュール３０２は、いずれか一つのカテゴリについて、評価データセットにおける当該カテゴリに属する処理すべきデータのうち、ツールの処理効果が最適化すべきモデルよりも優れている処理すべきデータの数が、最適化すべきモデルの処理効果がツールよりも優れている処理すべきデータの数よりも多いと判定されたことに応じて、当該カテゴリを最適化すべきカテゴリとし、ツールをアドバンテージツールとする。 In one embodiment of the present disclosure, the effectiveness comparison module 301 determines that the artificial determination result is better than the model that the tool optimizes depending on the third processing result being correct; In this case, it is determined that the processing effect of the model to be optimized is superior to that of the tool. Accordingly, the information identification module 302 selects, for any one category, data to be processed for which the processing effect of the tool is superior to that of the model to be optimized, among the data to be processed belonging to that category in the evaluation data set. is determined to be greater than the number of data to be processed for which the processing effect of the model to be optimized is better than that of the tool, the category is set as the category to be optimized, and the tool is set as an advantage tool. do.

本開示の一実施形態では、何れか一つの最適化すべきカテゴリについて、サンプル生成モジュール３０３は、最適化すべきカテゴリに属する処理すべきデータをデータベースから取得し、最適化すべきカテゴリに対応するアドバンテージツールの数が１であることに応じて、取得された処理すべきデータに基づいて、アドバンテージツールに基づいて最適化すべきカテゴリの訓練データを生成し、最適化すべきカテゴリに対応するアドバンテージツールの数が１より大きいことに応じて、取得された処理すべきデータに基づいて、各アドバンテージツールに基づいて最適化すべきカテゴリの訓練データをそれぞれ生成する、という処理をそれぞれ行うことができる。 In one embodiment of the present disclosure, for any one category to be optimized, the sample generation module 303 acquires data to be processed belonging to the category to be optimized from the database, and generates an advantage tool corresponding to the category to be optimized. In response to the number being 1, training data for the category to be optimized is generated based on the advantage tool based on the acquired data to be processed, and the number of advantage tools corresponding to the category to be optimized is 1. Depending on the size of the data, training data for categories to be optimized based on each advantage tool can be generated based on the acquired data to be processed.

データベースから最適化すべきカテゴリの処理すべきデータをどのように取得するかは制限がない。例えば、データベースにおける最適化すべきカテゴリに属するｑｕｅｒｙから一部のｑｕｅｒｙをランダムに抽出することができる。 There are no restrictions on how the data to be processed for the category to be optimized is obtained from the database. For example, some of the queries belonging to the category to be optimized in the database can be randomly extracted.

本開示の一実施形態では、サンプル生成モジュール３０３は、いずれか一つのアドバンテージツールに基づいて最適化すべきカテゴリの訓練データを生成する際に、取得された各処理すべきデータについて、アドバンテージツールを使用して対応する第４処理結果を取得し、そして、最適化すべきモデルを用いて対応する第５処理結果を取得した後、取得された各処理すべきデータの中から、第４処理結果と第５処理結果とが一致しない条件を満たす処理すべきデータを選択し、さらに、選択された処理すべきデータに基づいて訓練データを生成することができる。 In one embodiment of the present disclosure, the sample generation module 303 uses an advantage tool for each acquired data to be processed when generating training data for a category to be optimized based on any one advantage tool. After obtaining the corresponding fourth processing result using the model to be optimized, and obtaining the corresponding fifth processing result using the model to be optimized, the fourth processing result and the fourth processing result are selected from among the obtained data to be processed. It is possible to select data to be processed that satisfies conditions that do not match the results of the 5 processing, and to generate training data based on the selected data to be processed.

本開示の一実施形態では、サンプル生成モジュール３０３は、選択された処理すべきデータに基づいて訓練データを生成する際に、選択された処理すべきデータと、選択された処理すべきデータに対する第４処理結果および／または第５処理結果の人工注釈結果とを組み合わせて訓練データを生成することができる。 In one embodiment of the present disclosure, when generating training data based on the selected data to be processed, the sample generation module 303 includes the selected data to be processed and a Training data can be generated by combining the fourth processing result and/or the fifth processing result with the artificial annotation result.

本開示の一実施形態では、上記のようにアドバンテージツールを用いて最適化すべきカテゴリの訓練データを生成することに加えて、サンプル生成モジュール３０３は、取得された、最適化すべきモデルのユーザ行動ログに基づいて生成された評価結果である人工評価結果に基づいて訓練データを生成することもできる。それに応じて、モデル更新モジュール３０４は、生成された訓練データを用いて最適化すべきモデルを更新することができる。 In one embodiment of the present disclosure, in addition to generating training data for the category to be optimized using the advantage tool as described above, the sample generation module 303 also generates a user behavior log of the acquired model to be optimized. Training data can also be generated based on artificial evaluation results that are evaluation results generated based on. Accordingly, model update module 304 may update the model to be optimized using the generated training data.

図３に示される装置の実施形態の具体的なワークフローは、前述の方法の実施形態における関連説明を参照することができ、ここでは詳しく説明しない。 The specific workflow of the apparatus embodiment shown in FIG. 3 can refer to the related description in the method embodiment above and will not be described in detail here.

要約すると、本開示の装置の実施形態で説明された態様によれば、さまざまなツールを使用して最適化すべきモデルに存在する問題を特定し、また、ツールの種類を豊富にすることで問題のカバー範囲を向上させることができる。更に、発見された問題はすべてモデルでは解決できない真の問題であり、問題ごとに訓練データを指向的に生成することができるため、モデルの最適化効果を向上させ、モデルの性能を向上させた。 In summary, according to aspects described in embodiments of the presently disclosed apparatus, a variety of tools are used to identify problems that exist in a model to be optimized, and a wide variety of tools can be used to solve problems. coverage can be improved. Furthermore, all the problems discovered are real problems that cannot be solved by the model, and training data can be generated directionally for each problem, improving the optimization effect of the model and improving the performance of the model. .

本出願に記載された技術案は、人工知能の分野に適用することができ、特にディープラーニングおよび自然言語処理等の分野に関する。人工知能は、コンピュータが人間の思考過程と知能行為（例えば学習、推理、思考、計画など）をシミュレートすることを研究する学科であり、ハードウェアレベルの技術とソフトウェアレベルの技術の両方がある。人工知能のハードウェア技術には、一般的に、例えばセンサ、専用人工知能チップ、クラウドコンピューティング、分散ストレージ、ビッグデータ処理等の技術が含まれる。人工知能のソフトウェア技術は、主にコンピュータビジョン技術、音声認識技術、自然言語処理技術及び機械学習／ディープラーニング、ビッグデータ処理技術、知識マップ技術等のいくつかの方向を含む。 The technical solution described in this application can be applied in the field of artificial intelligence, especially related to fields such as deep learning and natural language processing. Artificial intelligence is the study of computers simulating human thought processes and intelligent acts (e.g. learning, reasoning, thinking, planning, etc.), and includes both hardware-level and software-level techniques. . Artificial intelligence hardware technologies generally include technologies such as sensors, dedicated artificial intelligence chips, cloud computing, distributed storage, big data processing, etc. Artificial intelligence software technology mainly includes computer vision technology, speech recognition technology, natural language processing technology and machine learning/deep learning, big data processing technology, knowledge map technology and other several directions.

本開示の実施形態における処理すべきデータ等は、特定のユーザを対象としたものではなく、特定のユーザの個人情報を反映したものではない。また、本開示の方法の実施主体は、様々な開示、法令に準拠した方法で前記処理すべきデータを取得することができる。 The data to be processed in the embodiment of the present disclosure is not targeted at a specific user and does not reflect the personal information of a specific user. Further, the entity implementing the method of the present disclosure can obtain the data to be processed through various disclosures and methods compliant with laws and regulations.

本開示の技術案において、関わるユーザの個人情報の取得、記憶、応用、加工、伝送、提供及び配信等は、いずれも関連法律法規の規定に適合しており、公序良俗に反するものではない。 In the technical proposal of the present disclosure, the acquisition, storage, application, processing, transmission, provision, distribution, etc. of the personal information of the users involved comply with the provisions of relevant laws and regulations, and do not violate public order and morals.

本開示の実施形態によれば、本開示は更に電子デバイス、可読記憶媒体、およびコンピュータプログラム製品を提供する。 According to embodiments of the disclosure, the disclosure further provides electronic devices, readable storage media, and computer program products.

図４は、本開示の実施形態を実施するために使用され得る電子デバイス４００の模式的なブロック図である。電子デバイスは、ラップトップ、デスクトップコンピュータ、ワークベンチ、サーバ、ブレードサーバ、大型コンピュータ、および他の適切なコンピュータのような、様々な形態のデジタルコンピュータを表す。電子デバイスは更に、ＰＤＡ、携帯電話、スマートフォン、ウェアラブルデバイス、および他の同様のコンピューティングデバイスなどの様々な形態のモバイルデバイスを表すことができる。本明細書に示す構成要素、それらの接続および関係、ならびにそれらの機能は、単なる一例であり、本明細書に記載および／または要求された本開示の実現を制限することではない。 FIG. 4 is a schematic block diagram of an electronic device 400 that may be used to implement embodiments of the present disclosure. Electronic devices represent various forms of digital computers, such as laptops, desktop computers, workbenches, servers, blade servers, large format computers, and other suitable computers. Electronic devices can further represent various forms of mobile devices such as PDAs, cell phones, smart phones, wearable devices, and other similar computing devices. The components, their connections and relationships, and their functionality depicted herein are merely examples and are not limitations on the implementation of the present disclosure as described and/or required herein.

図４に示すように、デバイス４００は、読み取り専用メモリ（ＲＯＭ）４０２に記憶されたコンピュータプログラム、または記憶手段４０８からランダムアクセスメモリ（ＲＡＭ）４０３にロードされたコンピュータプログラムに従って、様々な適切な動作および処理を実行することができる演算手段４０１を含む。ＲＡＭ４０３には、デバイス４００の動作に必要な各種のプログラムやデータが記憶されてもよい。演算手段４０１、ＲＯＭ４０２およびＲＡＭ４０３は、バス４０４を介して接続されている。入出力（Ｉ／Ｏ）インターフェース４０５もバス４０４に接続されている。 As shown in FIG. 4, device 400 performs various suitable operations in accordance with a computer program stored in read-only memory (ROM) 402 or loaded into random access memory (RAM) 403 from storage means 408. and calculation means 401 capable of executing processing. The RAM 403 may store various programs and data necessary for the operation of the device 400. Arithmetic means 401, ROM 402 and RAM 403 are connected via bus 404. An input/output (I/O) interface 405 is also connected to bus 404.

例えばキーボード、マウス等の入力手段４０６と、例えば様々なタイプのディスプレイ、スピーカ等の出力手段４０７と、例えば磁気ディスク、光ディスク等の記憶手段４０８と、例えばネットワークカード、モデム、無線通信トランシーバなどの通信手段４０９を含むデバイス４００の複数の構成要素は、Ｉ／Ｏインターフェース４０５に接続される。通信手段４０９は、デバイス４００が例えばインターネットのコンピュータネットワークおよび／または様々な電気通信ネットワークを介して他のデバイスと情報／データを交換することを可能にする。 Communication between input means 406 such as a keyboard and mouse, output means 407 such as various types of displays and speakers, storage means 408 such as a magnetic disk or optical disk, and communication such as a network card, modem, wireless communication transceiver, etc. Multiple components of device 400, including means 409, are connected to I/O interface 405. The communication means 409 enable the device 400 to exchange information/data with other devices, for example via computer networks of the Internet and/or various telecommunication networks.

演算手段４０１は、処理能力および演算能力を有する様々な汎用および／または専用の処理コンポーネントであってよい。演算手段４０１のいくつかの例は、中央処理ユニット（ＣＰＵ）、グラフィック処理ユニット（ＧＰＵ）、様々な専用の人工知能（ＡＩ）演算チップ、機械学習モデルアルゴリズムを実行する様々な演算ユニット、デジタル信号プロセッサ（ＤＳＰ）、および任意の適切なプロセッサ、コントローラ、マイクロコントローラなどを含むが、これらに限定されない。演算手段４０１は、上述した様々な方法および処理、例えば本開示に記載の方法を実行する。例えば、幾つかの実施形態では、本開示に記載の方法は、例えば記憶手段４０８のような機械可読媒体に物理的に組み込まれたコンピュータソフトウェアプログラムとして実装されてもよい。幾つかの実施形態では、コンピュータプログラムの一部または全部は、ＲＯＭ４０２および／または通信手段４０９を介してデバイス４００にロードおよび／またはインストールすることができる。コンピュータプログラムがＲＡＭ４０３にロードされ、演算手段４０１により実行されると、本開示に記載の方法の１つまたは複数のステップを実行することができる。代替的に、他の実施形態では、演算手段４０１は、本開示に記載の方法を実行するように、他の任意の適切な方法で（例えば、ファームウェアを介する）構成されてもよい。 The computing means 401 may be various general purpose and/or special purpose processing components having processing and computing capabilities. Some examples of computing means 401 are a central processing unit (CPU), a graphics processing unit (GPU), various dedicated artificial intelligence (AI) computing chips, various computing units that execute machine learning model algorithms, digital signals. including, but not limited to, a processor (DSP), and any suitable processor, controller, microcontroller, etc. The computing means 401 performs the various methods and processes described above, such as those described in this disclosure. For example, in some embodiments, the methods described in this disclosure may be implemented as a computer software program physically embodied in a machine-readable medium, such as storage means 408. In some embodiments, part or all of the computer program may be loaded and/or installed on device 400 via ROM 402 and/or communication means 409. When the computer program is loaded into RAM 403 and executed by computing means 401, one or more steps of the method described in this disclosure can be performed. Alternatively, in other embodiments, the computing means 401 may be configured in any other suitable manner (eg, via firmware) to perform the methods described in this disclosure.

本明細書で前述したシステムおよび技術の様々な実施形態は、デジタル電子回路システム、集積回路システム、フィールドプログラマブルゲートアレイ（ＦＰＧＡ）、専用集積回路（ＡＳＩＣ）、専用標準製品（ＡＳＳＰ）、システムオンチップシステム（ＳＯＣ）、ロードプログラマブル論理デバイス（ＣＰＬＤ）、コンピュータハードウェア、ファームウェア、ソフトウェア、および／またはこれらの組み合わせにおいて実装されてもよい。これらの様々な実施形態は、１つまたは複数のコンピュータプログラムで実施されることを含んで良い。当該１つまたは複数のコンピュータプログラムは、少なくとも１つのプログラマブルプロセッサを含むプログラマブルシステム上で実行および／または解釈することができる。当該プログラマブルプロセッサは、専用または汎用のプログラマブルプロセッサであって、記憶システム、少なくとも１つの入力装置、および少なくとも１つの出力装置からデータおよび命令を受信し、当該記憶システム、当該少なくとも１つの入力装置、および当該少なくとも１つの出力装置にデータおよび命令を転送することができる。 Various embodiments of the systems and techniques described herein above include digital electronic circuit systems, integrated circuit systems, field programmable gate arrays (FPGAs), special purpose integrated circuits (ASICs), special purpose standard products (ASSPs), systems on chips. It may be implemented in a system (SOC), load programmable logic device (CPLD), computer hardware, firmware, software, and/or combinations thereof. These various embodiments may include being implemented in one or more computer programs. The one or more computer programs can be executed and/or interpreted on a programmable system that includes at least one programmable processor. The programmable processor is a special purpose or general purpose programmable processor that receives data and instructions from a storage system, at least one input device, and at least one output device; Data and instructions can be transferred to the at least one output device.

本開示の方法を実施するためのプログラムコードは、１つまたは複数のプログラミング言語の任意の組み合わせを用いて記述することができる。これらのプログラムコードは、汎用コンピュータ、専用コンピュータ、または他のプログラマブルデータ処理装置のプロセッサまたはコントローラに提供することにより、プログラムコードがプロセッサまたはコントローラにより実行されると、フローチャートおよび／またはブロック図に指定された機能／動作を実行するようにすることができる。プログラムコードは、全てがマシン上で実行されても良く、一部がマシン上で実行されても良く、スタンドアロンパッケージとして一部的にマシン上で実行され且つ一部的にリモートマシン上で実行され、或いは全てがリモートマシンまたはサーバ上で実行されても良い。 Program code for implementing the methods of this disclosure may be written using any combination of one or more programming languages. These program codes may be provided to a processor or controller of a general purpose computer, special purpose computer, or other programmable data processing device so that when executed by the processor or controller, the program codes are specified in flowcharts and/or block diagrams. can be configured to perform functions/operations. The program code may run entirely on a machine, partially on a machine, or as a standalone package, partially on a machine and partially on a remote machine. , or all may be executed on a remote machine or server.

本開示の文脈では、機械可読媒体は、有形の媒体であって、命令実行システム、装置またはデバイスにより使用され、或いは命令実行システム、装置またはデバイスと合わせて使用されるプログラムを含むか記憶することができる。機械可読媒体は、機械可読信号媒体または機械可読記憶媒体であってよい。機械可読媒体は、電子的、磁気的、光学的、電磁気的、赤外線的、または半導体的なシステム、装置またはデバイス、あるいはこれらの任意の適切な組み合わせを含んで良いが、これらに限定されない。機械可読記憶媒体のより具体的な例は、１つまたは複数のラインに基づく電気的接続、ポータブルコンピュータディスク、ハードディスク、ランダムアクセスメモリ（ＲＡＭ）、読み取り専用メモリ（ＲＯＭ）、消去可能プログラマブル読み取り専用メモリ（ＥＰＲＯＭまたはフラッシュメモリ）、光ファイバ、携帯型コンパクトディスク読み取り専用メモリ（ＣＤ－ＲＯＭ）、光学記憶装置、磁気記憶装置、又はこれらの任意の適切な組み合わせを含む。 In the context of this disclosure, a machine-readable medium is a tangible medium that includes or stores a program for use by or in conjunction with an instruction-execution system, apparatus, or device. I can do it. A machine-readable medium may be a machine-readable signal medium or a machine-readable storage medium. Machine-readable media may include, but are not limited to, electronic, magnetic, optical, electromagnetic, infrared, or semiconductor systems, apparatus, or devices, or any suitable combination thereof. More specific examples of machine-readable storage media are electrical connections based on one or more lines, portable computer disks, hard disks, random access memory (RAM), read-only memory (ROM), erasable programmable read-only memory. (EPROM or flash memory), fiber optics, portable compact disc read only memory (CD-ROM), optical storage, magnetic storage, or any suitable combination thereof.

ユーザとのインタラクションを提供するために、本明細書に記載されたシステムおよび技術は、ユーザに情報を表示するための表示装置（例えば、ＣＲＴ（陰極線管）またはＬＣＤ（液晶ディスプレイ）モニタ）と、ユーザにより入力をコンピュータに提供するキーボードおよびポインティングデバイス（例えば、マウスまたはトラックボール）と備えるコンピュータ上に実施されてよい。他の種類の装置は、ユーザとのインタラクションを提供するためにも使用され得る。例えば、ユーザに提供されるフィードバックは、任意の形態のセンシングフィードバック（例えば、視覚フィードバック、聴覚フィードバック、または触覚フィードバック）であって良く、ユーザからの入力を任意の形式（音入力、音声入力、または触覚入力を含む）で受信して良い。 To provide user interaction, the systems and techniques described herein include a display device (e.g., a CRT (cathode ray tube) or LCD (liquid crystal display) monitor) for displaying information to the user; It may be implemented on a computer with a keyboard and pointing device (eg, a mouse or trackball) for providing input to the computer by a user. Other types of devices may also be used to provide user interaction. For example, the feedback provided to the user may be any form of sensing feedback (e.g., visual, auditory, or haptic feedback), and any form of input from the user (e.g., acoustic, audio, or (including tactile input).

本明細書に記載されたシステムおよび技術は、バックエンド構成要素を含むコンピューティングシステム（例えば、データサーバとする）、又はミドルウェア構成要素を含むコンピューティングシステム（例えば、アプリケーションサーバ）、又はフロントエンド構成要素を含むコンピューティングシステム（例えば、グラフィカルユーザインターフェースもしくはウェブブラウザを有するクライアントコンピュータであり、ユーザは、当該グラフィカルユーザインターフェースもしくは当該ウェブブラウザを通じて本明細書で説明されるシステムと技術の実施形態とインタラクションすることができる）、そのようなバックエンド構成要素、ミドルウェア構成要素、もしくはフロントエンド構成要素の任意の組合せを含むコンピューティングシステムに実施されることが可能である。システムの構成要素は、任意の形態又は媒体のデジタルデータ通信（例えば、通信ネットワーク）によって相互に接続されることが可能である。通信ネットワークの例は、ローカルエリアネットワーク（「ＬＡＮ」）、ワイド・エリア・ネットワーク（「ＷＡＮ」）、インターネットワークを含む。 The systems and techniques described herein may be used in computing systems that include back-end components (e.g., data servers), or that include middleware components (e.g., application servers), or front-end configurations. A computing system (e.g., a client computer having a graphical user interface or web browser) that includes an element through which a user interacts with embodiments of the systems and techniques described herein. may be implemented in a computing system that includes any combination of such back-end components, middleware components, or front-end components. The components of the system may be interconnected by any form or medium of digital data communication (eg, a communication network). Examples of communication networks include local area networks (“LANs”), wide area networks (“WANs”), and internetworks.

コンピュータシステムは、クライアントとサーバーを含み得る。クライアントとサーバーは、一般的に互いから遠く離れており、通常は、通信ネットワークを通じてインタラクトする。クライアントとサーバとの関係は、相応するコンピュータ上で実行され、互いにクライアント－サーバの関係を有するコンピュータプログラムによって生じる。サーバはクラウドサーバであっても良く、分散システムのサーバであっても良く、ブロックチェーンを組み合わせたサーバであってもよい。 A computer system may include clients and servers. Clients and servers are typically remote from each other and typically interact through a communications network. The relationship between client and server is created by computer programs running on corresponding computers and having a client-server relationship with each other. The server may be a cloud server, a distributed system server, or a server combined with a blockchain.

以上で示された様々な形式のフローを使用して、ステップを並べ替え、追加、又は削除できることを理解されたい。例えば、本出願に説明される各ステップは、並列の順序又は順次的な順序で実施されてもよいし、又は異なる順序で実行されてもよく、本出願で開示された技術案の望ましい結果が達成できる限り、ここで制限されない。 It should be appreciated that steps can be reordered, added, or deleted using the various types of flows illustrated above. For example, each step described in this application may be performed in a parallel order or a sequential order, or may be performed in a different order, and the desired results of the technical solutions disclosed in this application may be achieved. There are no limitations here as long as it is achievable.

上記の具体的な実施形態は本出願の保護範囲に対する制限を構成しない。設計要件及び他の要因に従って、様々な修正、組み合わせ、部分的組み合わせ及び置換を行うことができることを当業者は理解するべきである。本出願の精神及び原則の範囲内で行われる修正、同等の置換、改善は、何れも本出願の保護範囲内に含まれるべきである。 The above specific embodiments do not constitute a limitation on the protection scope of the present application. Those skilled in the art should understand that various modifications, combinations, subcombinations, and substitutions can be made according to design requirements and other factors. Any modification, equivalent substitution, or improvement made within the spirit and principles of this application shall be included within the protection scope of this application.

Claims

A model update method, the method comprising:
For the data to be processed in the evaluation data set that belongs to at least two different categories, the corresponding processing results are obtained using the model to be optimized and the tools in the toolset, and the processing results are comparing the processing effects of the model to be optimized and the tool based on;
The method is to identify a category to be optimized and a corresponding advantage tool based on a comparison result, the category to be optimized is a category in which the processing effect of the tool is superior to the model to be optimized, and the advantage tool is to identify a category to be optimized based on a comparison result. The tool is a tool whose processing effect is superior to the model to be optimized;
generating training data for the category to be optimized based on a corresponding advantage tool;
updating the model to be optimized using the training data;
Model update methods, including:

The toolset includes one or more of a historical version model of the model to be optimized, a homogeneous model of the model to be optimized, a homogeneous system of the model to be optimized, a word grammar tool, a syntax tool, a knowledge base. including a combination, indicating that the same type has the same function;
The model updating method according to claim 1.

The evaluation data set further includes correct processing results corresponding to each data to be processed,
Obtaining corresponding processing results using the model to be optimized and tools in the toolset, and comparing the processing effects of the model to be optimized and the tools based on the processing results,
Using the model to be optimized, obtaining a first processing result corresponding to each data to be processed in the evaluation data set;
For any one of the tools in the tool set that can operate individually, use the tool to obtain a second processing result corresponding to each data to be processed in the evaluation data set, and combine the correct processing result with the first processing result. performing a process of determining the superiority or inferiority of the processing effects of the model to be optimized and the tool for different categories based on the result and the second processing result;
The model updating method according to claim 1, comprising:

Determining the superiority or inferiority of the processing effects of the model to be optimized and the tool for the different categories includes determining the correct processing corresponding to the data to be processed belonging to the category in the evaluation data set, respectively, for any one category. determining the superiority or inferiority of a processing effect between the model to be optimized and the tool based on the first processing result and the second processing result;
Identifying the category to be optimized and the corresponding advantage tool based on the comparison result means that the processing effect of the tool is determined to be superior to the model to be optimized for any one category. In accordance with the above, the category is the category to be optimized and the tool is the advantage tool.
The model updating method according to claim 3.

Obtaining corresponding processing results using the model to be optimized and tools in the toolset, and comparing the processing effects of the model to be optimized and the tools based on the processing results,
For any one of the tools in the tool set that cannot be operated individually, use the tool to obtain a third processing result corresponding to each data to be processed in the evaluation data set, and Regarding the data, use the model to be optimized to verify whether or not the third processing result is correct, obtain an artificial judgment result in response to the verification result being determined to be inaccurate, and obtain the artificial judgment result using the model to be optimized. performing a process of determining the superiority or inferiority of the processing effects of the model to be optimized and the tool on the data to be processed based on the determination results;
The model updating method according to claim 1, comprising:

Determining the superiority or inferiority of the processing effects of the model to be optimized and the tool on the data to be processed based on the artificial determination result is based on the fact that the artificial determination result is correct in accordance with the third processing result. , determining that the processing effect of the tool is better than the model to be optimized, and if not, determining that the processing effect of the model to be optimized is better than the tool,
Identifying the category to be optimized and the corresponding advantage tool based on the comparison result is to determine the processing effect of the tool among the data to be processed belonging to the category in the evaluation data set for any one category. The number of data to be processed for which the model to be optimized is superior to the model to be optimized is determined to be greater than the number of data to be processed for which the processing effect of the model to be optimized is superior to the tool. Accordingly, the category is the category to be optimized and the tool is the advantage tool.
The model updating method according to claim 5.

Generating training data for the category to be optimized based on the corresponding advantage tool;
For any one category to be optimized,
Obtaining data to be processed belonging to the category to be optimized from the database,
In response to the number of the advantage tools corresponding to the category to be optimized being 1, training data for the category to be optimized is generated based on the advantage tool based on the obtained data to be processed. ,
In response to the number of the advantage tools corresponding to the category to be optimized being greater than 1, training data of the category to be optimized is determined based on each advantage tool based on the obtained data to be processed. generate each,
Perform each process,
The model updating method according to any one of claims 1 to 6, comprising:

Generating training data for the category to be optimized based on the advantage tool comprises:
For each obtained data to be processed, obtain a corresponding fourth processing result using the advantage tool, obtain a corresponding fifth processing result using the model to be optimized,
Selecting data to be processed from among the acquired data to be processed that satisfies a condition that the fourth processing result and the fifth processing result do not match;
generating the training data based on the selected data to be processed;
8. The model updating method according to claim 7.

Generating the training data based on the selected data to be processed includes:
generating the training data by combining the selected data to be processed and an artificial annotation result for a fourth processing result and/or a fifth processing result of the selected data to be processed;
9. The model updating method according to claim 8.

Generating training data based on the obtained artificial evaluation result, which is an evaluation result generated based on a user behavior log of the model to be optimized, and updating the model to be optimized using the training data.
The model updating method according to any one of claims 1 to 6, further comprising:

A model update device comprising an effect comparison module, an information identification module, a sample generation module, and a model update module,
The effect comparison module calculates corresponding processing results for data to be processed in the evaluation data set and belonging to at least two different categories by using the model to be optimized and the tools in the toolset, respectively. and comparing the processing effects of the model to be optimized and the tool based on the processing results,
The information identification module identifies a category to be optimized and a corresponding advantage tool based on the comparison result, and the category to be optimized is a category in which the processing effect of the tool is superior to the model to be optimized; The advantage tool is a tool whose processing effect is superior to the model to be optimized,
The sample generation module generates training data for the category to be optimized based on a corresponding advantage tool;
The model update module updates the model to be optimized using the training data.
Model update device.

The toolset includes one or more of a historical version model of the model to be optimized, a homogeneous model of the model to be optimized, a homogeneous system of the model to be optimized, a word grammar tool, a syntax tool, a knowledge base. including a combination, indicating that the same type has the same function;
The model updating device according to claim 11.

The evaluation data set further includes correct processing results corresponding to each data to be processed,
The effect comparison module uses the model to be optimized to obtain a first processing result corresponding to each data to be processed in the evaluation data set, and obtains a first processing result corresponding to each data to be processed in the evaluation data set, and calculates a first processing result for any one of the tools in the toolset that can operate individually. , using the tool to obtain a second processing result corresponding to each data to be processed in the evaluation data set, and classifying the data into different categories based on the correct processing result, the first processing result, and the second processing result. performing a process of determining the superiority or inferiority of the processing effects of the model to be optimized and the tool for each;
The model updating device according to claim 11.

The effect comparison module, for any one category, based on the correct processing result, the first processing result, and the second processing result that respectively correspond to data to be processed belonging to the category in the evaluation data set, determining the superiority or inferiority of the processing effect between the model to be optimized and the tool;
In response to determining that the processing effect of the tool is superior to the model to be optimized for any one category, the information specifying module sets the category to the category to be optimized, and the tool be the said advantage tool,
The model update device according to claim 13.

The effect comparison module uses the tool to obtain a third processing result corresponding to each data to be processed in the evaluation data set for any one of the tools in the tool set that cannot be operated individually, and For each data to be processed, the model to be optimized is used to verify whether or not the third processing result is correct, and if the verification result is determined to be inaccurate, an artificial judgment result is determined. and perform processing to determine the superiority or inferiority of the processing effects of the model to be optimized and the tool on the data to be processed based on the artificial determination results, respectively.
The model updating device according to claim 11.

The effect comparison module determines that the processing effect of the tool is superior to the model to be optimized in response to the artificial determination result that the third processing result is correct, and if not, determining that the processing effect of the model to be optimized is superior to that of the tool;
The information identification module identifies, for any one category, among the data to be processed belonging to the category in the evaluation data set, the processing effect of the tool is superior to that of the model to be optimized. in response to determining that the number of data to be optimized is greater than the number of data to be processed for which the processing effect of the model to be optimized is superior to the tool, the category to be optimized is set as the category to be optimized, and the tool is The said advantage tool,
The model updating device according to claim 15.

The sample generation module acquires data to be processed belonging to the category to be optimized from a database for any one category to be optimized, and the number of the advantage tools corresponding to the category to be optimized is 1. Optionally, based on the obtained data to be processed, training data for the category to be optimized is generated based on the advantage tool, and the number of the advantage tools corresponding to the category to be optimized is less than 1. Based on the obtained data to be processed, each process generates training data of the category to be optimized based on each advantage tool, depending on the size of the data to be processed.
The model updating device according to any one of claims 11 to 16.

Regarding any one of the advantage tools, the sample generation module acquires a corresponding fourth processing result using the advantage tool for each acquired data to be processed, and handles the obtained data using the model to be optimized. and select data to be processed that satisfies the condition that the fourth processing result and the fifth processing result do not match from among the obtained data to be processed, and generating the training data based on data to be processed;
The model update device according to claim 17.

The sample generation module generates the training data by combining the selected data to be processed and an artificial annotation result for a fourth processing result and/or a fifth processing result of the selected data to be processed. ,
The model update device according to claim 18.

The sample generation module further generates training data based on the obtained artificial evaluation result, which is an evaluation result generated based on a user behavior log of the model to be optimized.
The model updating device according to any one of claims 11 to 16.

at least one processor;
a memory communicatively connected to the at least one processor;
A command executable by the at least one processor is stored in the memory, and when the command is executed by the at least one processor, the at least one processor is An electronic device that performs the described model update method.

A non-transitory computer-readable storage medium storing computer commands for causing a computer to execute the model updating method according to any one of claims 1 to 6.

A computer program that, when executed by a processor, implements the model updating method according to any one of claims 1 to 6.