JP5333175B2

JP5333175B2 - Operator support device, operator support program, and operator support method

Info

Publication number: JP5333175B2
Application number: JP2009268945A
Authority: JP
Inventors: 佐知子小野寺
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2009-11-26
Filing date: 2009-11-26
Publication date: 2013-11-06
Anticipated expiration: 2029-11-26
Also published as: JP2011112854A

Abstract

<P>PROBLEM TO BE SOLVED: To extract the supplemental phrases matching the procedure manual explanation phrases. <P>SOLUTION: The operator support 1 has a correspondence relation memory 2, an operation time obtaining section 3, a section estimating section 4, and a supplemental phrase extracting section 5. The correspondence relation memory 2 stores the procedure manual explanation phrases and their operating positions by correlating them. The operation time obtaining section 3 obtains the operation time showing the time point that the predetermined operating section has been changed from the PC operation log storage storing the operated time points and positions by correlating them for every operated time point. The section estimating section 4 extracts the procedure explanation phrases relating to the operation after switching from the correspondence relating memory 2. From the recorded voice, the section estimating section 4 presumes the section to the start of the speaking time, longer than predetermined and appearing next to the speaking time starting after the operation time before the speaking section showing the procedure explanation phrase, as the voice section of the procedure manual explanation phrases. The supplemental phrase extracting section 5 extracts the supplemental phrases for the procedure manual explanation phrase from the voice section already estimated. <P>COPYRIGHT: (C)2011,JPO&INPIT

Description

本願の開示する技術は、オペレータ支援装置、オペレータ支援プログラム及びオペレータ支援方法に関する。 The technology disclosed in the present application relates to an operator support apparatus, an operator support program, and an operator support method.

多くの企業には、顧客からの電話問合せに応対するためにコールセンタが設置される。このコールセンタでは、応対業務専門のエージェント（以下、オペレータと表記）が顧客からの問合せに応対する。オペレータは、顧客から問合せを受け付けた場合に、問合せ内容に対応する電子マニュアルを端末装置を用いて検索し、その検索結果を参照しながら回答を顧客に説明する。 Many companies have a call center to respond to customer phone inquiries. In this call center, an agent specialized in customer service (hereinafter referred to as an operator) responds to inquiries from customers. When the operator receives an inquiry from the customer, the operator searches the electronic manual corresponding to the inquiry content using the terminal device, and explains the answer to the customer while referring to the search result.

この電子マニュアルとしては、例えば、ＱＡ（Question and Answer）事例が使用される。かかるＱＡ事例は、問合せ事項（Ｑ）と、その問合せ事項が解決に至るまでの手順を示す回答事項（Ａ）とが対応付けられたものである。この回答事項の手順には、１つの手順を説明する場合にキー（重要）となる語句（以下、手順説明語句と表記）が含まれる。オペレータは手順説明語句を中心に一つ一つの手順を説明することにより応対業務を行う。 As this electronic manual, for example, QA (Question and Answer) cases are used. Such a QA example is a query item (Q) associated with an answer item (A) indicating a procedure until the query item is resolved. This answer procedure includes words (hereinafter referred to as procedure explanation words) that are key (important) when one procedure is described. The operator performs the response work by explaining each procedure centering on the procedure explanation words.

ところで、コールセンタに問い合わせてくる顧客の知識レベルは様々である。このため、オペレータが端末装置に表示されるＱＡ事例の回答事項を読み上げたとしても、顧客が必ずしもその内容を理解できるとは限らない。そして、顧客の理解が円滑に進まない場合には応対時間が長時間となり、顧客満足度が低下してしまうおそれがある。このため、オペレータには、顧客の知識レベルに応じて回答事項の説明の仕方を変えるなど臨機応変な応対が求められる。しかし、初心者のオペレータにはかかる応対を行うことが難しい。 By the way, the level of knowledge of customers who make inquiries to the call center varies. For this reason, even if an operator reads out the answer matter of the QA example displayed on the terminal device, the customer cannot always understand the content. If the customer's understanding does not proceed smoothly, the response time becomes long, and the customer satisfaction may be reduced. For this reason, the operator is required to respond flexibly such as changing the method of explaining the answer items according to the knowledge level of the customer. However, it is difficult for a beginner operator to perform such a response.

このことから、手順説明語句を補足する語句（以下、補足語句と表記）を支援キーワードとして提示するオペレータ支援装置が提案されている。このオペレータ支援装置は、同一の事例であって異なるオペレータによって応対された事例の音声データを対象に補足語句を抽出する。すなわち、オペレータ支援装置は、熟練のオペレータが発話したキーワードと初心者のオペレータが発話したキーワードとを突き合わせる。このとき、オペレータ支援装置は、手順説明語句が発話された時刻の近傍で発話されたキーワードであり、かつ熟練のオペレータ及び初心者のオペレータのうち熟練のオペレータだけが発話したキーワードを補足語句として抽出する。このように、オペレータ支援装置は、ＱＡ事例を表示する場合に、手順説明語句とともに補足語句を表示させることで、オペレータの応対業務を支援する。 For this reason, there has been proposed an operator support apparatus that presents words that supplement the procedure explanation words (hereinafter referred to as supplement words) as support keywords. This operator support device extracts supplemental phrases from voice data of cases that are the same case and are handled by different operators. That is, the operator support apparatus matches a keyword uttered by a skilled operator with a keyword uttered by a novice operator. At this time, the operator support device extracts keywords that are spoken in the vicinity of the time when the procedural explanation words are uttered and that are spoken only by skilled operators among skilled operators and novice operators as supplementary phrases. . As described above, when displaying the QA case, the operator support device displays the supplemental words and phrases along with the procedure explanation words and thereby supports the operator's reception work.

特開２００８−３３８３５号公報JP 2008-33835 A 特開２００１−５１８７６号公報JP 2001-51876 A 特開平１０−６９３８１号公報JP 10-69381 A 特開２００５−２９３１８０号公報JP-A-2005-293180

しかしながら、上述した従来の技術は、必ずしも手順説明語句に対応した補足語句を抽出することができないという問題がある。 However, the above-described conventional technique has a problem that it is not always possible to extract a supplemental phrase corresponding to the procedure explanation phrase.

この点を図２１及び図２２を用いて説明する。図２１及び図２２は、従来技術における補足語句の抽出要領を説明するための説明図である。図２１及び図２２の例では、コールセンタでパーソナルコンピュータ（以下、ＰＣと表記）の操作に関する問合せを受け付ける例を想定する。これら図２１及び図２２は、いずれも顧客からの問合せ事項「ブラウザの設定方法を教えて」に対して、手順Ｘ「インターネットオプションの接続タブを開く」を説明する対話の一例を示す。このうち、図２１の例では、インターネットオプションが中心となる対話に分岐した場合の対話を示す。また、図２２の例では、接続タブが中心となる対話に分岐した場合の対話を示す。ここで、手順Ｘの手順説明語句は「インターネットオプション」「接続タブ」となる。なお、図２１及び図２２における横軸は、時間軸を指し、右方向に進むにしたがって時間が進むことを示す。また、図中の「ｕ」は、「utterance」の略記であり、「ｕ１」から「ｕ１１」までのブロックはオペレータ又は顧客の発話を示し、これらブロックの横方向の大きさは発話区間の長さを示す。さらに、横軸の上方に図示するブロックは顧客の発話を示し、横軸の下方に図示するブロックはオペレータの発話を示すものとする。 This point will be described with reference to FIGS. FIG. 21 and FIG. 22 are explanatory diagrams for explaining a supplement word extraction procedure in the prior art. In the examples of FIGS. 21 and 22, it is assumed that an inquiry about an operation of a personal computer (hereinafter referred to as a PC) is received at a call center. FIG. 21 and FIG. 22 both show an example of the dialog explaining the procedure X “Open the connection tab of the Internet option” in response to the inquiry item “Tell me how to set the browser” from the customer. Among these, the example of FIG. 21 shows a dialogue when the Internet option branches to a dialogue. Further, in the example of FIG. 22, the dialogue is shown when branching to a dialogue centered on the connection tab. Here, the procedure explanation words of the procedure X are “Internet option” and “Connection tab”. Note that the horizontal axis in FIGS. 21 and 22 indicates the time axis, and indicates that time advances as it advances in the right direction. Also, “u” in the figure is an abbreviation for “utterance”, and the blocks from “u1” to “u11” indicate the speech of the operator or the customer, and the horizontal size of these blocks is the length of the speech section. It shows. Furthermore, the block illustrated above the horizontal axis indicates the customer's speech, and the block illustrated below the horizontal axis indicates the operator's speech.

図２１に示す例では、以下の会話が行われたものとする。オペレータは「では、インターネットオプションの接続タブを開いて下さい」（ｕ１）と発話する。顧客は「何それ？」（ｕ２）と発話する。オペレータは「メニューバーの右から二番目の・・・」（ｕ３）と発話する。顧客は「メニューバー？」（ｕ４）と発話する。オペレータは「一番上に、ファイル、編集などと並んでいる・・・」（ｕ５）と発話する。顧客は「あ、はいはい」（ｕ６）と発話する。オペレータは「その右から二番目に、ツールとあるのでクリックして下さい」（ｕ７）と発話する。顧客は「二番目・・・はい」（ｕ８）と発話する。オペレータは「すると、一番下にインターネットオプションというものがあります」（ｕ９）と発話する。顧客は「あ、これね」（ｕ１０）と発話する。オペレータは「そこの接続タブを選択して下さい」（ｕ１１）と発話する。 In the example shown in FIG. 21, it is assumed that the following conversation is performed. The operator speaks, "Please open the Internet Options connection tab" (u1). The customer speaks “What?” (U2). The operator speaks “second from the right of the menu bar” (u3). The customer speaks “Menu bar?” (U4). The operator utters "It is lined up with files, edits, etc. at the top ..." (u5). The customer speaks “Ah yes yes” (u6). The operator speaks, “Click on the second tool from the right, click here” (u7). The customer speaks “second ... yes” (u8). The operator speaks, "There is an internet option at the bottom" (u9). The customer speaks “Oh, this is it” (u10). The operator speaks “Please select the connection tab there” (u11).

図２１に示す例で言えば、オペレータ支援装置は、ｕ１からｕ１１までの対話を含む音声データの中から、「予め登録されたＱＡ事例の手順説明語句」を抽出する音声認識を行う。図２１の例では、オペレータ支援装置は、ｕ１の発話から「インターネットオプション」及び「接続タブ」を手順説明語句として抽出する。また、オペレータ支援装置は、ｕ９の発話から「インターネットオプション」を手順説明語句として抽出し、ｕ１１の発話から「接続タブ」を手順説明語句として抽出する。ここで、仮に熟練のオペレータだけが行った発話としてｕ３の発話における「メニューバー」及びｕ７の発話における「２番目」を補足語句として抽出したとする。従来技術より、「メニューバー」および「２番目」は、手順Ｘの手順説明語句である「インターネットオプション」および「接続タブ」の近傍に出現しているため、手順Ｘの補足語句として抽出される。実際には、ｕ２からｕ９までの発話では、手順説明語句「インターネットオプション」に関する対話が行われており、手順説明語句「接続タブ」の補足語句ではない。しかし、従来技術では、一手順に対して手順説明語句が複数ある場合、どの手順説明語句の補足語句となるかを特定することができない。 In the example shown in FIG. 21, the operator support apparatus performs voice recognition that extracts “procedure explanation words of QA examples registered in advance” from voice data including dialogues from u1 to u11. In the example of FIG. 21, the operator support apparatus extracts “Internet option” and “connection tab” from the utterance of u1 as procedure explanation words. Further, the operator support apparatus extracts “Internet option” from the utterance of u9 as a procedure explanation word and extracts “connection tab” from the utterance of u11 as a procedure explanation word. Here, it is assumed that “menu bar” in the utterance of u3 and “second” in the utterance of u7 are extracted as supplemental phrases as utterances performed only by a skilled operator. According to the conventional technique, “menu bar” and “second” appear in the vicinity of “Internet option” and “connection tab” which are procedure explanation words of procedure X, and therefore are extracted as supplementary words of procedure X. . Actually, in the utterances from u2 to u9, a dialog regarding the procedure explanation phrase “Internet option” is performed, and it is not a supplementary phrase of the procedure explanation phrase “connection tab”. However, in the prior art, when there are a plurality of procedure explanation words for one procedure, it is not possible to specify which procedure explanation words are supplementary words.

図２２に示す例では、以下の会話が行われたものとする。オペレータは「では、インターネットオプションの接続タブを開いて下さい」（ｕ１）と発話する。顧客は「接続タブって何？」（ｕ２）と発話する。オペレータは「インターネットオプションは開きましたか？」（ｕ３）と発話する。顧客は「開いたよ」（ｕ４）と発話する。オペレータは「一番上に、全般、セキュリティなどと並んでいる・・・」（ｕ５）と発話する。顧客は「あ、これか」（ｕ６）と発話する。オペレータは「はい、そこの接続タブを開いてください」（ｕ７）と発話する。顧客は「はい」（ｕ８）と発話する。 In the example shown in FIG. 22, it is assumed that the following conversation has been performed. The operator speaks, "Please open the Internet Options connection tab" (u1). The customer speaks “What is the connection tab?” (U2). The operator speaks “Is the Internet option open?” (U3). The customer speaks "I'm open" (u4). The operator utters, “It is lined up with General, Security, etc. at the top ...” (u5). The customer speaks “Oh, is this” (u6). The operator speaks “Yes, please open the connection tab there” (u7). The customer speaks “Yes” (u8).

図２２に示す例で言えば、オペレータ支援装置は、ｕ１の発話から「インターネットオプション」及び「接続タブ」を手順説明語句として抽出する。また、オペレータ支援装置は、ｕ３の発話から「インターネットオプション」を手順説明語句として抽出する。さらに、オペレータ支援装置は、ｕ７の発話から「接続タブ」を手順説明語句として抽出する。ここで、仮に熟練のオペレータだけが行った発話としてｕ５の発話における「１番上」を補足語句として抽出したとする。従来技術より、「１番上」は、手順Ｘの手順説明語句である「インターネットオプション」および「接続タブ」の近傍に出現しているため、手順Ｘの補足語句として抽出される。実際には、接続タブの画面上の表示位置を説明するための語句であるため、「インターネットオプション」の補足語句ではないのだが、従来技術では「接続タブ」のみの補足語句であると特定することができない。 In the example shown in FIG. 22, the operator support apparatus extracts “Internet option” and “connection tab” from the utterance of u1 as procedure explanation words. Further, the operator support apparatus extracts “Internet option” from the u3 utterance as a procedure explanation word / phrase. Further, the operator support apparatus extracts “connection tab” from the u7 utterance as a procedure explanation word. Here, it is assumed that “first” in the u5 utterance is extracted as a supplemental phrase as an utterance performed only by a skilled operator. According to the prior art, “top” appears in the vicinity of “Internet option” and “connection tab”, which are procedure explanation words of procedure X, and is therefore extracted as a supplementary word of procedure X. Actually, it is a phrase to explain the display position of the connection tab on the screen, so it is not a supplemental phrase for "Internet Options", but in the conventional technology, it is identified as a supplemental phrase for only "Connection tab" I can't.

このように、上記の従来技術では、１つの手順の中に複数の手順説明語句が存在する場合には、それら複数の手順説明語句の補足語句であるとしか対応づけることができず、補足語句をどの手順説明語句と対応付けるのかを特定できない。 As described above, in the above-described conventional technique, when a plurality of procedure explanation words / phrases exist in one procedure, they can be associated only as supplement words / phrases of the plurality of procedure explanation words / phrases. It is not possible to specify which procedure explanation word is associated with.

開示の技術は、上記に鑑みてなされたものであって、手順説明語句に対応した補足語句を抽出することができるオペレータ支援装置、オペレータ支援プログラム及びオペレータ支援方法を提供することを目的とする。 The disclosed technology has been made in view of the above, and an object thereof is to provide an operator support device, an operator support program, and an operator support method that can extract supplemental words corresponding to procedure explanation words.

本願の開示する技術は、一つの態様において、所定の手順を説明するための語句である手順説明語句と、端末に表示される操作部位のうち該手順説明語句に関係して操作される操作部位とを対応付けて記憶する対応関係記憶部を有する。さらに、本願の開示する技術は、前記端末に表示される操作部位のうち所定の操作部位を切り替える操作が行われた時点を示す操作時間を取得する操作時間取得部を有する。前記操作時間取得部は、操作部位が他の操作部位に切替操作された切替操作時刻ごとにその時刻から操作対象となった操作部位を対応付けて記憶するＰＣ操作ログ記憶部から操作時間を取得する。さらに、本願の開示する技術は、区間推定部を有する。前記区間推定部は、前記対応関係記憶部から前記所定の操作部位から切り替えられた操作部位に関係する手順説明語句を抽出する。前記区間推定部は、音声記録から、前記操作時間取得部によって取得された操作時間より以降に発話が開始された時点を抽出する。前記区間推定部は、手順説明語句が出現した発話区間から当該抽出した時点の発話区間の次に現れる、予め定めた期間よりも長い発話区間の開始時刻に至るまでの音声区間を、前記対応関係記憶部から抽出した手順説明語句に関する音声区間として推定する。さらに、本願の開示する技術は、前記音声記録のうち前記区間推定部によって推定された音声区間から、前記手順説明語句を補足するための語句である補足語句を抽出する補足語句抽出部を有する。 In one aspect, the technology disclosed in the present application is a procedure explanation word that is a word for explaining a predetermined procedure, and an operation part that is operated in relation to the procedure explanation word among the operation parts displayed on the terminal. Are stored in association with each other. Furthermore, the technology disclosed in the present application includes an operation time acquisition unit that acquires an operation time indicating a time point when an operation of switching a predetermined operation part among operation parts displayed on the terminal is performed. The operation time acquisition unit acquires an operation time from a PC operation log storage unit that stores an operation part that is an operation target from that time in association with each switching operation time when the operation part is switched to another operation part. To do. Furthermore, the technology disclosed in the present application includes a section estimation unit. The section estimation unit extracts a procedure explanation word related to an operation part switched from the predetermined operation part from the correspondence storage unit. The section estimation unit extracts a time point when the utterance is started after the operation time acquired by the operation time acquisition unit from the voice recording. The section estimation unit displays a speech section that appears next to an utterance section at the time of extraction from an utterance section in which a procedure explanation word appears, up to a start time of an utterance section longer than a predetermined period, and the correspondence relationship It is estimated as a speech section related to the procedure explanation phrase extracted from the storage unit. Furthermore, the technology disclosed in the present application includes a supplemental phrase extracting unit that extracts a supplemental phrase that is a phrase for supplementing the procedure explanation phrase from the speech segment estimated by the segment estimation unit in the speech recording.

本願の開示する技術の一つの態様によれば、手順説明語句に対応した補足語句を抽出することができるという効果を奏する。 According to one aspect of the technology disclosed in the present application, it is possible to extract a supplemental phrase corresponding to the procedure explanation phrase.

図１は、実施例１に係るオペレータ支援装置の構成を示すブロック図である。FIG. 1 is a block diagram illustrating the configuration of the operator support apparatus according to the first embodiment. 図２は、実施例２に係るオペレータ支援装置の構成を示すブロック図である。FIG. 2 is a block diagram illustrating the configuration of the operator support apparatus according to the second embodiment. 図３は、対応関係記憶部に記憶される情報の構成例を示す図である。FIG. 3 is a diagram illustrating a configuration example of information stored in the correspondence relationship storage unit. 図４は、応対履歴リストの一例を示す図である。FIG. 4 is a diagram illustrating an example of a response history list. 図５は、オペレータに係る韻律データリストの一例を示す図である。FIG. 5 is a diagram showing an example of the prosodic data list related to the operator. 図６は、顧客に係る韻律データリストの一例を示す図である。FIG. 6 is a diagram illustrating an example of a prosodic data list related to a customer. 図７は、オペレータに係る発話区間情報の一例を示す図である。FIG. 7 is a diagram illustrating an example of utterance section information related to an operator. 図８は、顧客に係る発話区間情報の一例を示す図である。FIG. 8 is a diagram illustrating an example of utterance section information related to a customer. 図９は、認識キーワードリストの一例を示す図である。FIG. 9 is a diagram illustrating an example of the recognized keyword list. 図１０は、操作時間リストの一例を示す図である。FIG. 10 is a diagram illustrating an example of the operation time list. 図１１は、手順対応区間の推定結果の一例を示す図である。FIG. 11 is a diagram illustrating an example of a procedure-corresponding section estimation result. 図１２は、手順説明語句の対応区間の推定結果の一例を示す図である。FIG. 12 is a diagram illustrating an example of the estimation result of the corresponding section of the procedure explanation word / phrase. 図１３は、補足語句リストの一例を示す図である。FIG. 13 is a diagram illustrating an example of a supplemental phrase list. 図１４は、実施例２に係るオペレータ支援装置における処理の流れを示すフローチャートである。FIG. 14 is a flowchart illustrating a flow of processing in the operator support apparatus according to the second embodiment. 図１５は、実施例２に係る手順対応区間の推定処理の手順を示すフローチャートである。FIG. 15 is a flowchart illustrating a procedure of a procedure corresponding section estimation process according to the second embodiment. 図１６は、実施例２に係る手順説明語句の対応区間の推定処理の手順を示すフローチャートである。FIG. 16 is a flowchart illustrating a procedure of a process for estimating a corresponding section of a procedure explanatory word according to the second embodiment. 図１７は、実施例２に係る補足語句抽出処理の手順を示すフローチャートである。FIG. 17 is a flowchart illustrating a procedure of supplemental phrase extraction processing according to the second embodiment. 図１８は、実施例２に係る補足語句の抽出要領を説明するための説明図である。FIG. 18 is an explanatory diagram for explaining the supplemental word extraction method according to the second embodiment. 図１９は、実施例２に係る補足語句の抽出要領を説明するための説明図である。FIG. 19 is an explanatory diagram for explaining the supplemental phrase extraction procedure according to the second embodiment. 図２０は、オペレータ支援プログラムを実行するコンピュータを示す図である。FIG. 20 is a diagram illustrating a computer that executes an operator support program. 図２１は、従来技術における補足語句の抽出要領を説明するための説明図である。FIG. 21 is an explanatory diagram for explaining a procedure for extracting supplemental phrases in the prior art. 図２２は、従来技術における補足語句の抽出要領を説明するための説明図である。FIG. 22 is an explanatory diagram for explaining a supplementary word extraction procedure in the prior art.

以下に、図面を参照しつつ、本願の開示するオペレータ支援装置、オペレータ支援プログラム及びオペレータ支援方法の一実施形態について詳細に説明する。なお、本願の開示するオペレータ支援装置、オペレータ支援プログラム及びオペレータ支援方法の一実施形態として以下に説明する実施例により、本願の開示する技術が限定されるものではない。 Hereinafter, an embodiment of an operator support device, an operator support program, and an operator support method disclosed in the present application will be described in detail with reference to the drawings. Note that the technology disclosed in the present application is not limited by the examples described below as an embodiment of the operator support device, the operator support program, and the operator support method disclosed in the present application.

図１は、実施例１に係るオペレータ支援装置の構成を示すブロック図である。同図に示すように、実施例１に係るオペレータ支援装置１は、記憶部２と、操作時間取得部３と、区間推定部４と、補足語句抽出部５とを有する。 FIG. 1 is a block diagram illustrating the configuration of the operator support apparatus according to the first embodiment. As illustrated in FIG. 1, the operator support apparatus 1 according to the first embodiment includes a storage unit 2, an operation time acquisition unit 3, a section estimation unit 4, and a supplemental phrase extraction unit 5.

対応関係記憶部２は、所定の手順を説明するための語句である手順説明語句と、端末に表示される操作部位のうち手順説明語句に関係して操作される操作部位とを対応付けて記憶する。操作時間取得部３は、端末に表示される操作部位のうち所定の操作部位を切り替える操作が行われた時点を示す操作時間を取得する。この操作時間取得部３は、操作部位が他の操作部位に切替操作された切替操作時刻ごとにその時刻から操作対象となった操作部位を対応付けて記憶するＰＣ操作ログ記憶部から、操作時間を取得する。 The correspondence storage unit 2 associates and stores a procedure explanation phrase that is a phrase for explaining a predetermined procedure and an operation part that is operated in relation to the procedure explanation phrase among the operation parts displayed on the terminal. To do. The operation time acquisition unit 3 acquires an operation time indicating a point in time when an operation for switching a predetermined operation part among the operation parts displayed on the terminal is performed. The operation time acquisition unit 3 stores the operation time from the PC operation log storage unit that stores the operation part that is the operation target from that time in association with each other when the operation part is switched to another operation part. To get.

区間推定部４は、対応関係記憶部２から、所定の操作部位から切り替えられた操作部位に関係する手順説明語句を抽出するとともに、音声記録から、操作時間取得部３によって取得された操作時間より以降に発話が開始された時点を抽出する。そして、区間推定部４は、手順説明語句が出現した発話区間から抽出した時点の発話区間の次に現れる、予め定めた期間よりも長い発話区間の開始時刻に至るまでの音声区間を、対応関係記憶部２から抽出した手順説明語句に関する音声区間として推定する。補足語句抽出部５は、音声記録のうち区間推定部４によって推定された音声区間から、手順説明語句を補足するための語句である補足語句を抽出する。 The section estimation unit 4 extracts the procedure explanation words related to the operation part switched from the predetermined operation part from the correspondence storage part 2, and from the operation time acquired by the operation time acquisition part 3 from the voice recording. Thereafter, the time point when the utterance is started is extracted. Then, the section estimation unit 4 corresponds to the speech section that appears next to the utterance section at the time of extraction from the utterance section in which the procedure explanation word appears, and reaches the start time of the utterance section longer than the predetermined period. Estimated as a speech section related to the procedure explanation phrase extracted from the storage unit 2. The supplemental phrase extraction unit 5 extracts a supplemental phrase that is a phrase for supplementing the procedure explanation phrase from the speech segment estimated by the segment estimation unit 4 in the speech recording.

このように、操作時間より以降に発話が開始された時点に至るまでの音声区間を手順説明語句に関する音声区間として推定するのは、オペレータが行う端末の操作と、オペレータ及び顧客の対話の間に関連性があるからである。 As described above, the estimation of the voice section from the operation time to the point when the utterance is started as the voice section related to the procedure explanation phrase is performed between the terminal operation performed by the operator and the dialogue between the operator and the customer. This is because they are related.

つまり、オペレータは、回答事項の手順を説明する場合に、１つの手順で到達目標とする操作部位、すなわち手順説明語句に対応する操作部位に至るまで操作部位を操作しながら説明を行う。そして、到達目標とする操作部位へ至った段階で、オペレータは、顧客を到達目標とする操作部位まで誘導できたか否かを確認するために、操作部位（手順説明語句）に関する確認発話を行うのが応対の定石である。かかる確認発話が行われた場合には、その確認発話で登場した操作部位に関する説明をそれ以前にオペレータが行っていたのは明らかであり、それ以前の対話を確認発話で登場した手順説明語句に関する対話とみなすことができる。 That is, when explaining the procedure of the answer items, the operator performs an explanation while operating the operation part up to the operation part to be reached in one procedure, that is, the operation part corresponding to the procedure explanation word / phrase. Then, at the stage where the operation part to be reached is reached, the operator performs a confirmation utterance regarding the operation part (procedure explanation phrase) in order to confirm whether or not the customer has been guided to the operation part to be the arrival target. Is the answer. When such a confirmation utterance is made, it is clear that the operator had previously explained the operation part that appeared in the confirmation utterance, and the procedure explanation words that appeared in the confirmation utterance before that It can be regarded as a dialogue.

このように、本実施例に係るオペレータ支援装置１では、かかる知見に基づく区間推定を行うので、手順説明語句に対応する音声区間を推定できる。このため、本実施例に係るオペレータ支援装置１によれば、手順説明語句に対応した補足語句を抽出することが可能になる。 Thus, in the operator assistance apparatus 1 which concerns on a present Example, since the area estimation based on this knowledge is performed, the audio | voice area corresponding to a procedure explanatory note can be estimated. For this reason, according to the operator assistance apparatus 1 which concerns on a present Example, it becomes possible to extract the supplemental phrase corresponding to a procedure description word / phrase.

［オペレータ支援装置の構成］
図２は、実施例２に係るオペレータ支援装置の構成を示すブロック図である。なお、本実施例では、コールセンタでパーソナルコンピュータの操作に関する問合せを受け付ける場合を想定したオペレータ支援装置を説明する。なお、以下では、パーソナルコンピュータをＰＣと略記することとする。 [Configuration of operator support device]
FIG. 2 is a block diagram illustrating the configuration of the operator support apparatus according to the second embodiment. In the present embodiment, an operator support apparatus that is assumed to receive an inquiry regarding the operation of a personal computer at a call center will be described. Hereinafter, the personal computer is abbreviated as PC.

図２に示すように、オペレータ支援装置１００は、ＰＣ操作ログ記憶部１１０と、事例データ記憶部１２０と、対応関係記憶部１２５と、音声データ記憶部１３０と、対応関係設定部１４０と、韻律情報抽出部１５０と、発話区間抽出部１５５とを有する。さらに、オペレータ支援装置１００は、音声認識部１６０と、操作時間取得部１７０と、区間推定部１８０と、補足語句抽出部１９０とを有する。 As shown in FIG. 2, the operator support apparatus 100 includes a PC operation log storage unit 110, a case data storage unit 120, a correspondence relationship storage unit 125, a voice data storage unit 130, a correspondence relationship setting unit 140, a prosody. An information extraction unit 150 and an utterance section extraction unit 155 are included. Further, the operator support apparatus 100 includes a voice recognition unit 160, an operation time acquisition unit 170, a section estimation unit 180, and a supplemental phrase extraction unit 190.

ＰＣ操作ログ記憶部１１０は、オペレータが使用するＰＣにおけるＰＣ操作ログを記憶する。ＰＣ操作ログ記憶部１１０は、操作部位が他の操作部位に切替操作された切替操作時刻ごとにその時刻から操作対象となった操作部位を対応付けて記憶する。ここで言う「操作部位」とは、ＯＳ（Operating System）やアプリケーションによって生成される画面そのものに加え、画面内でのマウス操作やキーボード操作といった操作に応答して表示されるタブやプルダウンメニューなども含む。なお、かかるＰＣ操作ログは、既存の操作ログ取得ソフト又はＯＳの機能を用いることにより取得できる。 The PC operation log storage unit 110 stores a PC operation log in the PC used by the operator. The PC operation log storage unit 110 stores, in association with each switching operation time when the operation site is switched to another operation site, the operation site that has been an operation target from that time. The “operation part” mentioned here includes not only a screen generated by an OS (Operating System) or an application, but also a tab or a pull-down menu displayed in response to an operation such as a mouse operation or a keyboard operation on the screen. Including. The PC operation log can be acquired by using existing operation log acquisition software or an OS function.

事例データ記憶部１２０は、オペレータが応対時に参照する事例データを記憶する。この事例データ記憶部１２０は、事例ＩＤ（Identification）ごとにＱＡ（Question and Answer）事例の問合せ事項及び回答事項を対応付けて記憶する。一例を挙げれば、事例ＩＤ「ＱＡ１２３」には、問合せ事項「ブラウザの設定方法を教えて」、回答事項に含まれる各手順が対応付けられる。この回答事項は、手順１「インターネットオプションの接続タブを開く」、手順２「ＬＡＮ（Local Area Network）の設定を開く」、手順３「プロキシサーバーの設定を確認する」を含む。なお、回答事項の各手順には、それぞれの手順番号に対応する数字を枝番とした事例手順ＩＤが付与されるものとする。 The case data storage unit 120 stores case data that the operator refers to when responding. The case data storage unit 120 stores an inquiry item and an answer item of a QA (Question and Answer) case in association with each case ID (Identification). For example, the case ID “QA123” is associated with the inquiry item “Tell me how to set the browser” and each procedure included in the answer item. This answer includes procedure 1 “open connection tab of Internet option”, procedure 2 “open LAN (Local Area Network) setting”, and procedure 3 “check proxy server setting”. Note that each procedure of answer items is given a case procedure ID having a branch number as a number corresponding to each procedure number.

対応関係記憶部１２５は、回答事項に含まれる１つの手順を説明する場合にキー（重要）となる手順説明語句と、その手順説明語句に関係して操作される操作部位との対応関係を記憶する。対応関係記憶部１２５は、手順説明語句、事例手順ＩＤ及び操作部位を対応付けて記憶する。なお、手順説明語句及び操作部位の対応関係は、後述の対応関係設定部１４０によって設定される。 The correspondence relationship storage unit 125 stores the correspondence relationship between a procedure explanation word which is a key (important) when explaining one procedure included in the answer items and an operation part operated in relation to the procedure explanation word. To do. The correspondence relationship storage unit 125 stores a procedure explanation word, a case procedure ID, and an operation part in association with each other. Note that the correspondence between the procedure explanation word and the operation part is set by the correspondence setting unit 140 described later.

図３は、対応関係記憶部に記憶される情報の構成例を示す図である。図３に示すように、対応関係記憶部１２５は、手順説明語句「インターネットオプション」に事例手順ＩＤ「ＱＡ１２３−１」及び操作部位「インターネットオプションwindow−全般タブ画面」を対応付けて記憶する。また、対応関係記憶部１２５は、手順説明語句「接続タブ」に事例手順ＩＤ「ＱＡ１２３−１」及び操作部位「インターネットオプションwindow−接続タブ画面」を対応付けて記憶する。さらに、対応関係記憶部１２５は、手順説明語句「ＬＡＮ」に事例手順ＩＤ「ＱＡ１２３−２」及び操作部位「ローカルエリアネットワーク（ＬＡＮ）の設定window」を対応付けて記憶する。この図３に示す例では、「インターネットオプション」及び「接続タブ」が同一の事例であり、かつ同じ手順に含まれる手順説明語句である状況を示す。 FIG. 3 is a diagram illustrating a configuration example of information stored in the correspondence relationship storage unit. As illustrated in FIG. 3, the correspondence relationship storage unit 125 stores the procedure instruction word “Internet option” in association with the case procedure ID “QA 123-1” and the operation part “Internet option window—general tab screen”. In addition, the correspondence relationship storage unit 125 stores the procedure instruction word “connection tab” in association with the case procedure ID “QA 123-1” and the operation part “Internet option window—connection tab screen”. Further, the correspondence relationship storage unit 125 stores the procedure instruction word “LAN” in association with the case procedure ID “QA123-2” and the operation part “local area network (LAN) setting window”. The example shown in FIG. 3 shows a situation where “Internet option” and “connection tab” are the same case and are procedure explanation words included in the same procedure.

音声データ記憶部１３０は、過去の応対業務において録音された音声データを記憶する。例えば、音声データ記憶部１３０には、１人の顧客の応対で顧客とオペレータの電話の呼が接続されてからその接続が切られるまでの１つのコールが音声データとして録音される。音声データ記憶部１３０は、後述する機能部から任意の音声データの呼び出しが可能となるように、コールを一意に識別するためのコールＩＤに対応付けて音声データを記憶する。また、音声データ記憶部１３０は、後述の機能部で時刻換算可能なように、コールＩＤに対応付けて録音開始時点の時刻を記憶する。なお、音声データは、例えば、オペレータの音声データ及び顧客の音声データそれぞれを異なる系統から入力されたステレオデータであるものとするが、単一系統から入力されるモノラルデータであってもかまわない。また、音声データの録音区間は、呼の接続から切断までに限定されるものではなく、開始時刻が呼の接続から前後してもよく、また、終了時刻が呼の接続から前後してもかまわない。 The voice data storage unit 130 stores voice data recorded in the past reception work. For example, the voice data storage unit 130 records one call as voice data from when a customer's telephone call is connected to one customer's reception until the connection is disconnected. The voice data storage unit 130 stores voice data in association with a call ID for uniquely identifying a call so that arbitrary voice data can be called from a function unit described later. Further, the voice data storage unit 130 stores the time at which recording is started in association with the call ID so that the time can be converted by a function unit described later. The audio data is, for example, stereo data in which operator audio data and customer audio data are input from different systems, but may be monaural data input from a single system. The voice data recording period is not limited from call connection to disconnection, the start time may be before or after the call connection, and the end time may be before or after the call connection. Absent.

また、音声データ記憶部１３０は、応対履歴リストを記憶する。この応対履歴リストは、コールＩＤごとにそのコールに係る応対業務でオペレータが参照した事例データを対応付けたリストである。図４は、応対履歴リストの一例を示す図である。図４に示す例で言えば、コールＩＤ「ｃａｌｌ００１」の応対では、事例ＩＤ「ＱＡ１２３」に関する問合せを受け付けたことを示す。コールＩＤ「ｃａｌｌ００２」の応対では、事例ＩＤ「ＱＡ４５６」に関する問合せを受け付けたことを示す。また、コールＩＤ「ｃａｌｌ００３」の応対では、事例ＩＤ「ＱＡ１３５」に関する問合せを受け付けたことを示す。なお、これらコールＩＤ及び事例ＩＤは、応対業務終了後にオペレータが使用する端末で両者が対応付けて入力されるものとする。また、ここでは、１つのコールで問合せを受け付ける事例が１つである場合を例示したが、１つのコールで複数の事例に関する問合せを受け付けることもできる。 The voice data storage unit 130 stores a response history list. The response history list is a list in which case data referred to by the operator in the service related to the call is associated with each call ID. FIG. 4 is a diagram illustrating an example of a response history list. In the example illustrated in FIG. 4, the response with the call ID “call 001” indicates that an inquiry regarding the case ID “QA123” has been received. The reception of the call ID “call 002” indicates that an inquiry regarding the case ID “QA456” has been received. Further, the reception of the call ID “call 003” indicates that an inquiry regarding the case ID “QA135” has been received. Note that these call ID and case ID are input in association with each other at the terminal used by the operator after the service is completed. In addition, here, the case where there is one case in which an inquiry is received by one call is illustrated, but an inquiry regarding a plurality of cases can also be received by one call.

対応関係設定部１４０は、事例データ記憶部１２０に記憶されたＱＡ事例のうち回答事項の各手順に含まれる手順説明語句と、ＰＣの操作全般に関する操作部位との対応関係を設定する。かかる設定の一例として、対応関係設定部１４０は、事例データ記憶部１２０に記憶された事例手順ＩＤの昇順または降順にその手順に含まれる事例データを読み出す。このとき、対応関係設定部１４０は、事例データのうち回答事項に含まれる各手順の文書に対して形態素解析技術などを用いて解析することで、所定の条件、例えば文書中に網かけを施した語などを満たすキーワードを手順説明語句として抽出する。そして、対応関係設定部１４０は、コールセンタの管理者や熟練のオペレータなどから図示しない入力デバイス等を介して、先に抽出した手順説明語句への対応付け先とする操作部位を選択させる。その後、対応関係設定部１４０は、このようにして対応付け先として選択された操作部位及び手順説明語句とともにその手順説明語句が含まれる手順に対応する事例手順ＩＤを対応付けて対応関係記憶部１２５に設定登録する。なお、ここでは、コールセンタの関係者による入力操作によって対応関係を設定する場合を説明したが、ネットワークを介して対応関係を取得してもよい。また、ここでは、回答事項に含まれる各手順の手順説明語句を形態素解析技術を用いて抽出することとしたが、コールセンタ関係者による指定や同一事例で発話頻度が閾値以上のものなど、任意の基準で抽出できる。 Correspondence setting section 140 sets a correspondence relation between procedure explanation words included in each procedure of the answer items in the QA cases stored in case data storage section 120 and operation parts related to the overall operation of the PC. As an example of such setting, the correspondence setting unit 140 reads out case data included in the procedure in ascending or descending order of the case procedure IDs stored in the case data storage unit 120. At this time, the correspondence setting unit 140 analyzes the document of each procedure included in the answer items in the case data by using a morphological analysis technique or the like, thereby applying a predetermined condition, for example, shading in the document. Keywords that satisfy the selected word are extracted as procedure explanation words. Then, the correspondence setting unit 140 causes the call center administrator or a skilled operator to select an operation part to be associated with the previously extracted procedure explanation word / phrase via an input device (not shown). After that, the correspondence setting unit 140 associates the case procedure ID corresponding to the procedure including the procedure explanation word together with the operation part and the procedure explanation word selected as the correspondence destination in this manner, and the correspondence relation storage unit 125. Register settings to. Here, a case has been described in which the correspondence relationship is set by an input operation by a call center person, but the correspondence relationship may be acquired via a network. In addition, here, the procedure explanation words of each procedure included in the answer items are extracted using the morphological analysis technology. However, any specification such as a specification by a call center person or an utterance frequency exceeding the threshold in the same case is used. Can be extracted by criteria.

韻律情報抽出部１５０は、音声データ記憶部１３０に記憶された音声データから韻律データリストを生成する。例えば、韻律情報抽出部１５０は、韻律情報が未抽出のコールＩＤが存在する場合に、音声データ記憶部１３０から音声データを読み出す。そして、韻律情報抽出部１４０は、ステレオデータである音声データのうちオペレータの音声データから所定の時間間隔でパワー値をサンプリングすることでオペレータの韻律データを抽出する。そして、韻律情報抽出部１５０は、抽出したオペレータの韻律データからオペレータの韻律データリストを生成する。また、韻律情報抽出部１５０は、顧客の音声データに対してもオペレータの場合と同様にパワー値をサンプリングする処理を実行し、顧客の韻律データリストを生成する。なお、パワー値は、音声データから取得される信号波形の振幅の大きさから算出される。 The prosody information extraction unit 150 generates a prosody data list from the speech data stored in the speech data storage unit 130. For example, the prosody information extraction unit 150 reads voice data from the voice data storage unit 130 when there is a call ID from which prosody information has not been extracted. Then, the prosodic information extraction unit 140 extracts the operator's prosodic data by sampling the power value at predetermined time intervals from the operator's audio data among the audio data that is stereo data. Then, the prosodic information extraction unit 150 generates an operator prosodic data list from the extracted operator prosodic data. The prosody information extraction unit 150 also performs processing for sampling the power value on the customer's voice data as in the case of the operator to generate a customer's prosody data list. The power value is calculated from the amplitude of the signal waveform acquired from the audio data.

図５は、オペレータに係る韻律データリストの一例を示す図である。図５に示すように、オペレータの韻律データリストは、一定時間ごとにパワー値が対応付けられた情報である。図５では、１２．８［ｍｓｅｃ］ごとにサンプリングした場合を例示する図５に示すオペレータの韻律データリストによれば、時間「０」のパワー値が−６０．００８１５２であることを示す。時間「１２．８」のパワー値が−５１．８０１００６であることを示す。時間「２５．６」のパワー値が−４９．１７１９２１であることを示す。時間「３８．４」のパワー値が−５０．００５３４４であることを示す。時間「５１．２」のパワー値が−５２．４２２９７７であることを示す。時間「６４．０」のパワー値が−５７．１８０４７３であることを示す。時間「７６．８」のパワー値が−６２．３０３６３８であることを示す。時間「８９．６」のパワー値が−５７．４６８８２６であることを示す。時間「１０２．４」のパワー値が−５３．０６８７１４であることを示す。また、時間「１１５．２」のパワー値が−５１．６５５３９６であることを示す。なお、ここで言う時間とは、音声データの録音の開始時刻をゼロとした場合の経過時間を指すが、標準時刻のタイムスタンプなどの他の時刻を用いることもできる。 FIG. 5 is a diagram showing an example of the prosodic data list related to the operator. As shown in FIG. 5, the operator's prosodic data list is information in which power values are associated with each fixed time. In FIG. 5, according to the prosodic data list of the operator illustrated in FIG. 5 exemplifying the case where sampling is performed every 12.8 [msec], it is indicated that the power value at time “0” is −60.008152. This indicates that the power value at time “12.8” is −51.801006. It indicates that the power value at time “25.6” is −49.1711921. It indicates that the power value at time “38.4” is −50.005344. It shows that the power value at time “51.2” is −52.422977. It shows that the power value at time “64.0” is −57.180473. This indicates that the power value at time “76.8” is −62.3303638. It indicates that the power value at time “89.6” is −57.468826. This indicates that the power value at time “102.4” is −53.068714. In addition, the power value at time “115.2” is −51.655396. The time mentioned here refers to the elapsed time when the start time of recording of audio data is zero, but other times such as a time stamp of standard time can also be used.

図６は、顧客に係る韻律データリストの一例を示す図である。図６に示すように、顧客の韻律データリストは、一定時間ごとにパワー値が対応付けられた情報である。図６では、１２．８［ｍｓｅｃ］ごとにサンプリングした場合を例示する図６に示す顧客の韻律データリストによれば、時間「０」のパワー値が−５７．１８０４７３であることを示す。時間「１２．８」のパワー値が−６２．３０３６３８であることを示す。顧客の韻律データリストには、時間「２５．６」のパワー値が−５７．４６８８２６であることを示す。時間「３８．４」のパワー値が−５３．０６８７１４であることを示す。時間「５１．２」のパワー値が−５１．６５５３９６であることを示す。時間「６４．０」のパワー値が−６０．００８１５２であることを示す。時間「７６．８」のパワー値が−５１．８０１００６であることを示す。時間「８９．６」のパワー値が−４９．１７１９２１であることを示す。時間「１０２．４」のパワー値が−５０．００５３４４であることを示す。また、時間「１１５．２」のパワー値が−５２．４２２９７７であることを示す。 FIG. 6 is a diagram illustrating an example of a prosodic data list related to a customer. As shown in FIG. 6, the customer's prosodic data list is information in which power values are associated with each other at regular intervals. FIG. 6 shows that the power value at time “0” is −57.180473 according to the customer prosody data list shown in FIG. 6 exemplifying the case where sampling is performed every 12.8 [msec]. This indicates that the power value at time “12.8” is −62.3303638. The customer's prosodic data list indicates that the power value at time “25.6” is −57.468826. This indicates that the power value at time “38.4” is −53.068714. It shows that the power value at time “51.2” is −51.655396. It indicates that the power value at time “64.0” is −60.080152. This indicates that the power value at time “76.8” is −51.801006. The power value at time “89.6” is −49.1711921. This indicates that the power value at time “102.4” is −50.005344. In addition, the power value at the time “115.2” is −52.422977.

発話区間抽出部１５５は、韻律情報抽出部１５０より抽出された韻律データから発話区間情報を抽出する。例えば、発話区間抽出部１５５は、オペレータの韻律データリストから所定の閾値以上のパワー値が連続する区間を検出し、検出した区間の期間が所定の期間以上持続する区間の開始時刻及び終了時刻をオペレータの発話区間として抽出する。そして、発話区間抽出部１５５は、このようにして抽出したオペレータの発話区間に発話区間を識別するための発話区間ＩＤを採番し、採番した発話区間ＩＤに発話区間を対応付けたオペレータの発話区間情報を生成する。 The utterance section extraction unit 155 extracts utterance section information from the prosodic data extracted by the prosody information extraction unit 150. For example, the utterance section extraction unit 155 detects a section in which a power value equal to or greater than a predetermined threshold value continues from the operator's prosodic data list, and determines the start time and end time of the section in which the period of the detected section continues for a predetermined period or more. Extracted as the operator's utterance section. Then, the utterance section extraction unit 155 assigns an utterance section ID for identifying the utterance section to the utterance section of the operator extracted in this way, and the operator who associates the utterance section with the numbered utterance section ID. The utterance section information is generated.

図７は、オペレータに係る発話区間情報の一例を示す図である。オペレータの発話区間情報は、図７に示すように、発話区間ＩＤごとに開始時刻及び終了時刻が対応付けられた情報である。図７に示すように、発話区間ＩＤ「ｃａｌｌ００１−１」の例では、オペレータが開始時刻「１０ｍｓｅｃ」から終了時刻「１４３２０ｍｓｅｃ」まで発話したことを示す。発話区間ＩＤ「ｃａｌｌ００１−３」の例では、オペレータが開始時刻「・・・ｍｓｅｃ」から終了時刻「・・・ｍｓｅｃ」まで発話したことを示す。なお、これら開始時刻及び終了時刻は、音声データの録音の開始時刻をゼロとした場合の経過時間を指すが、標準時刻のタイムスタンプなどの他の時刻を用いることもできる。 FIG. 7 is a diagram illustrating an example of utterance section information related to an operator. As shown in FIG. 7, the utterance section information of the operator is information in which the start time and the end time are associated with each utterance section ID. As illustrated in FIG. 7, the example of the utterance section ID “call001-1” indicates that the operator uttered from the start time “10 msec” to the end time “14320 msec”. The example of the utterance section ID “call001-3” indicates that the operator uttered from the start time “... Msec” to the end time “. Note that these start time and end time indicate elapsed time when the start time of recording of audio data is zero, but other times such as a time stamp of standard time can also be used.

また、発話区間抽出部１５５は、顧客の韻律データリストについてもオペレータの韻律データリストと同様にして顧客の発話区間情報を抽出する。図８は、顧客に係る発話区間情報の一例を示す図である。図８に示すように、発話区間ＩＤ「ｃａｌｌ００１−２」の例では、顧客が開始時刻「１４３５０ｍｓｅｃ」から終了時刻「１７８００ｍｓｅｃ」まで発話したことを示す。発話区間ＩＤ「ｃａｌｌ００１−４」の例では、顧客が開始時刻「・・・ｍｓｅｃ」から終了時刻「・・・ｍｓｅｃ」まで発話したことを示す。 In addition, the utterance section extraction unit 155 extracts customer utterance section information for the customer's prosodic data list in the same manner as the operator's prosodic data list. FIG. 8 is a diagram illustrating an example of utterance section information related to a customer. As shown in FIG. 8, the example of the utterance section ID “call001-2” indicates that the customer uttered from the start time “14350 msec” to the end time “17800 msec”. The example of the utterance section ID “call001-4” indicates that the customer uttered from the start time “... Msec” to the end time “.

音声認識部１６０は、音声データ記憶部１３０に記憶された音声データに音声認識を行って認識キーワードリストを作成する。例えば、音声認識部１６０は、音声データ記憶部１３０に記憶された音声データのうち処理対象とするコールＩＤに対応する音声データを読み出すとともに応対履歴リストのうちコールＩＤに対応する事例ＩＤを読み出す。さらに、音声認識部１６０は、先に読み出した事例ＩＤに対応する手順説明語句を対応関係記憶部１２５から読み出す。このようにして、音声認識部１６０は、オペレータが実際に応対を行ったＱＡ事例の回答事項に含まれる手順説明語句とその発話を含む音声データとを読み出す。その後、音声認識部１６０は、手順説明語句を認識対象とする認識キーワードとし、認識キーワードに基づく音声認識を音声データに行って手順説明語句の発話が開始された時刻（以下、出現時刻とする）を特定する。このようにして手順説明語句の出現時刻を特定すると、音声認識部１６０は、コールＩＤ、認識キーワード（手順説明語句）、事例手順ＩＤ及び出現時刻を含むレコードを追加することにより認識キーワードリストを生成する。なお、音声認識部１６０は、補足語句が未抽出である音声データが音声データ記憶部１３０に存在し、かつその音声データに対応する応対履歴リストが音声データ記憶部１３０に登録されているコールＩＤを音声認識の処理対象とすることができる。 The voice recognition unit 160 performs voice recognition on the voice data stored in the voice data storage unit 130 to create a recognition keyword list. For example, the voice recognition unit 160 reads the voice data corresponding to the call ID to be processed among the voice data stored in the voice data storage unit 130 and reads the case ID corresponding to the call ID in the answering history list. Further, the voice recognition unit 160 reads out the procedure explanation words / phrases corresponding to the case ID read out from the correspondence relationship storage unit 125. In this manner, the voice recognition unit 160 reads out the procedure explanation words and the voice data including the utterances included in the answer items of the QA cases actually handled by the operator. After that, the speech recognition unit 160 uses the procedure explanation word / phrase as a recognition keyword, and the time when speech of the procedure explanation word / phrase is started by performing speech recognition based on the recognition keyword on the voice data (hereinafter referred to as an appearance time). Is identified. When the appearance time of the procedure explanation word / phrase is specified in this way, the voice recognition unit 160 generates a recognition keyword list by adding a record including the call ID, the recognition keyword (procedure explanation word / phrase), the case procedure ID and the appearance time. To do. Note that the voice recognition unit 160 has a call ID in which voice data for which no supplemental phrase has been extracted exists in the voice data storage unit 130 and a response history list corresponding to the voice data is registered in the voice data storage unit 130. Can be the target of speech recognition processing.

図９は、認識キーワードリストの一例を示す図である。図９に示すように、「ｃａｌｌ００１」の音声データの録音開始時刻から「１０５５０ｍｓｅｃ」が経過した時点でＱＡ事例１２３の手順１に含まれる手順説明語句「インターネットオプション」の発話が開始されたことを示す。「ｃａｌｌ００１」の音声データの録音開始時刻から「１２８７０ｍｓｅｃ」が経過した時点でＱＡ事例１２３の手順１に含まれる「接続タブ」の発話が開始されたことを示す。「ｃａｌｌ００１」の音声データの録音開始時刻から「９９１８０ｍｓｅｃ」が経過した時点でＱＡ事例１２３の手順１に含まれる「インターネット」の発話が開始されたことを示す。また、「ｃａｌｌ００１」の音声データの録音開始時刻から「１１０３００ｍｓｅｃ」が経過した時点でＱＡ事例１２３の手順２に含まれる「ＬＡＮ」の発話が開始されたことを示す。 FIG. 9 is a diagram illustrating an example of the recognized keyword list. As shown in FIG. 9, when “10550 msec” has elapsed from the recording start time of the voice data “call 001”, the utterance of the procedure explanation phrase “Internet option” included in the procedure 1 of the QA case 123 is started. Indicates. This indicates that the utterance of the “connection tab” included in the procedure 1 of the QA example 123 is started when “12870 msec” has elapsed from the recording start time of the voice data “call 001”. This indicates that the utterance of “Internet” included in the procedure 1 of the QA case 123 is started when “99180 msec” has elapsed from the recording start time of the voice data “call 001”. In addition, it indicates that the utterance of “LAN” included in the procedure 2 of the QA case 123 is started when “110300 msec” has elapsed from the recording start time of the voice data “call 001”.

操作時間取得部１７０は、ＰＣ操作ログ記憶部１１０に記憶されるＰＣ操作ログから、オペレータが使用するＰＣで表示される操作部位のうち所定の操作部位を切り替える操作が行われた時点を示す操作時間を取得する。 The operation time acquisition unit 170 is an operation indicating a point in time when an operation for switching a predetermined operation part among operation parts displayed on the PC used by the operator is performed from the PC operation log stored in the PC operation log storage unit 110. Get time.

ここで、ＰＣ操作ログとして記憶された切替操作時刻は、操作部位の切替操作が行われた時刻である。この切替操作時刻は、音声データの録音時間と対比されるが、音声データとは別系統で操作ログ取得ソフトを用いて収集されたログであるため、音声データの経過時間とは同一視することはできない。このままでは、後述の区間推定部１８０で操作部位の切替操作時刻と音声データの経過時間を対比できないので、操作時間取得部１７０は、両者の対比が可能となるように、時刻を経過時間へ換算するか、或いは経過時間を時刻へ換算する。 Here, the switching operation time stored as the PC operation log is the time when the operation part switching operation is performed. This switching operation time is compared with the recording time of the audio data, but since it is a log collected using operation log acquisition software in a separate system from the audio data, it should be regarded as the elapsed time of the audio data. I can't. In this state, the operation time switching operation time and the elapsed time of the voice data cannot be compared by the section estimation unit 180 described later, so the operation time acquisition unit 170 converts the time into the elapsed time so that the two can be compared. Or convert elapsed time to time.

このような換算を成立させるには、ＰＣ操作ログ記憶部１１０又は音声データ記憶部１３０のいずれかに音声データの経過時間及び操作部位の切替操作時刻の対応関係を持たせておくのが好ましい。一例としては、音声データの記録を行う録音ソフトは、音声データの録音時に、音声データの録音開始時点の時刻を音声データのコールＩＤに対応付けて音声データ記憶部１３０に登録する。なお、ここでは、音声データとともに録音開始時点の時刻を音声データ記憶部１３０へ登録させることとしたが、コールＩＤに対応付けてそのコールの録音開始時点からの切替操作時刻までの時間差をＰＣ操作のログ取得ソフトに登録させるようにしてもよい。 In order to establish such conversion, it is preferable that either the PC operation log storage unit 110 or the audio data storage unit 130 has a correspondence relationship between the elapsed time of the audio data and the switching operation time of the operation part. As an example, recording software for recording audio data registers the time at the start of recording audio data in the audio data storage unit 130 in association with the call ID of the audio data when recording audio data. Here, the time at which the recording is started is registered in the voice data storage unit 130 together with the voice data, but the time difference from the recording start time of the call to the switching operation time is associated with the call ID by the PC operation. You may make it register with this log acquisition software.

このようにして予め登録された録音開始時点の時刻を用いて、操作時間取得部１７０は、音声データ記憶部１３０に記憶された音声データの録音区間を切出し範囲とし、ＰＣ操作ログ記憶部１１０からＰＣ操作ログを切り出す。すなわち、操作時間取得部１７０は、音声データの録音開始時刻から録音終了時刻までの区間に切替操作時刻が含まれるＰＣ操作ログを切り出す。そして、操作時間取得部１７０は、各々の切替操作時刻と録音開始時点の時刻との間で差分を求めることにより、切替操作時刻ごとにその切替操作の録音開始時点からの経過時間（操作時間）を算出する。その上で、操作時間取得部１７０は、ＰＣ操作ログの切出しに使用された音声データのコールＩＤ、切替後の操作部位及びその切替操作が行われた時点を示す操作時間を対応付けて操作時間リストを生成する。なお、操作時間取得部１７０は、補足語句が未抽出である音声データのコールＩＤであれば任意のコールＩＤを操作時間の取得対象とすることができる。 Using the time of the recording start time registered in advance as described above, the operation time acquisition unit 170 uses the recording period of the audio data stored in the audio data storage unit 130 as a cutout range, and from the PC operation log storage unit 110. Cut out the PC operation log. That is, the operation time acquisition unit 170 cuts out a PC operation log in which the switching operation time is included in the section from the recording start time to the recording end time of the audio data. And the operation time acquisition part 170 calculates | requires the difference between each switching operation time and the time of a recording start time, The elapsed time (operation time) from the recording start time of the switching operation for every switching operation time Is calculated. In addition, the operation time acquisition unit 170 associates the call ID of the audio data used for cutting out the PC operation log, the operation part after switching, and the operation time indicating the time when the switching operation is performed, to thereby match the operation time. Generate a list. Note that the operation time acquisition unit 170 can use any call ID as a target for acquiring the operation time as long as it is a call ID of voice data from which a supplemental phrase has not been extracted.

図１０は、操作時間リストの一例を示す図である。操作時間リストは、コールＩＤごとに操作部位及び操作時間が対応付けられた情報である。図１０に示す例では、コールＩＤ「ｃａｌｌ００１」の音声データの録音開始時刻からの経過時間が「１７５６０ｍｓｅｃ」になった時点で操作部位が「ブラウザｗｉｎｄｏｗ」に切り替えられたことを示す。図１０に示す例では、コールＩＤ「ｃａｌｌ００１」の音声データの録音開始時刻からの経過時間が「１００２３０ｍｓｅｃ」になった時点で操作部位が「インターネットオプションｗｉｎｄｏｗ−全般タブ画面」に切り替えられたことを示す。 FIG. 10 is a diagram illustrating an example of the operation time list. The operation time list is information in which an operation part and an operation time are associated with each call ID. The example illustrated in FIG. 10 indicates that the operation part has been switched to “browser window” when the elapsed time from the recording start time of the voice data of the call ID “call 001” reaches “17560 msec”. In the example shown in FIG. 10, when the elapsed time from the recording start time of the voice data of the call ID “call 001” reaches “100230 msec”, the operation part is switched to the “Internet option window-general tab screen”. Indicates.

区間推定部１８０は、手順説明語句に関する音声区間を推定する。区間推定部１８０は、補足語句が未抽出である音声データのコールＩＤを対象に、音声データに含まれるオペレータの発話を時系列に追従することにより、その音声データに含まれる手順説明語句に関する音声区間の開始時刻及び終了時刻を推定する。 The section estimation unit 180 estimates a voice section related to the procedure explanation word / phrase. The section estimation unit 180 targets the call ID of the voice data from which the supplemental phrase has not been extracted, and follows the utterance of the operator included in the voice data in time series, thereby obtaining the voice related to the procedure explanation phrase included in the voice data. Estimate the start time and end time of the section.

この区間推定部１８０は、手順説明語句に関する音声区間を推定するに際して、音声認識部１６０による認識キーワードリストを用いて、回答事項に含まれる１つの手順が説明された手順対応区間を推定する。その上で、区間推定部１８０は、操作時間取得部１７０によって取得された操作時間を用いて、手順説明語句の対応区間を推定する。なお、ここでは、（１）手順対応区間の推定を説明してから、（２）手順説明語句の対応区間の推定を説明する。 The section estimation unit 180 estimates a procedure-corresponding section in which one procedure included in an answer item is described using a recognition keyword list by the speech recognition unit 160 when estimating a speech section related to a procedure explanation word / phrase. In addition, the section estimation unit 180 estimates the corresponding section of the procedure explanation phrase using the operation time acquired by the operation time acquisition unit 170. Here, (1) estimation of the procedure-corresponding section is described, and (2) estimation of the corresponding section of the procedure explanation phrase is described.

（１）手順対応区間の推定
ここでは、区間推定部１８０による手順対応区間の推定を説明する。区間推定部１８０は、音声認識部１６０によって出力された認識キーワードリスト及び発話区間抽出部１５５によって出力されたオペレータの発話区間情報を用いて、ＱＡ事例の回答事項に含まれる手順ごとにその手順に関する説明の開始時刻を推定する。つまり、区間推定部１８０は、手順説明語句が出現した時点をその手順説明語句が含まれる手順の説明が開始された時点と推定する。 (1) Estimation of Procedure Corresponding Section Here, the estimation of the procedure corresponding section by the section estimation unit 180 will be described. The section estimation unit 180 uses the recognition keyword list output by the speech recognition unit 160 and the utterance section information of the operator output by the utterance section extraction unit 155 to relate each procedure included in the answer items of the QA case. Estimate the start time of the description. That is, the section estimation unit 180 estimates the time point when the procedure explanation word appears as the time point when the explanation of the procedure including the procedure explanation word is started.

これを順を追って説明すると、区間推定部１８０は、認識キーワードリストから事例手順ＩＤ及び出現時刻（図９参照）を取得するとともに、その出現時刻が含まれるオペレータの発話区間情報（図７参照）を取得する。 To explain this step by step, the section estimation unit 180 obtains the case procedure ID and the appearance time (see FIG. 9) from the recognition keyword list, and the operator's utterance section information including the appearance time (see FIG. 7). To get.

このとき、区間推定部１８０は、認識キーワードリストから事例手順ＩＤ及び出現時刻を最初に取得した場合には、出現時刻を含む発話区間の開始時刻を、その事例手順ＩＤに対応する音声区間の開始時刻として推定する。例えば、図７及び図９に示す例で言えば、区間推定部１８０は、図９に示す認識キーワードリストから事例手順ＩＤ「ＱＡ１２３−１」及び出現時刻「１０５５０ｍｓｅｃ」を最初に取得する。このため、区間推定部１８０は、この出現時刻「１０５５０ｍｓｅｃ」を発話区間に含む発話の開始時刻「１０ｍｓｅｃ」（図７参照）を事例手順ＩＤ「ＱＡ１２３−１」に対応する音声区間の開始時刻とそのまま推定する。 At this time, when the section estimation unit 180 first acquires the case procedure ID and the appearance time from the recognition keyword list, the section estimation unit 180 sets the start time of the utterance section including the appearance time to the start of the voice section corresponding to the case procedure ID. Estimated as time. For example, in the example illustrated in FIGS. 7 and 9, the section estimation unit 180 first acquires the case procedure ID “QA 123-1” and the appearance time “10550 msec” from the recognition keyword list illustrated in FIG. 9. Therefore, the section estimation unit 180 uses the speech start time “10 msec” (see FIG. 7) including the appearance time “10550 msec” in the speech section as the voice section start time corresponding to the case procedure ID “QA123-1.” Estimate as is.

また、区間推定部１８０は、認識キーワードリストから取得する事例手順ＩＤ及び出現時刻が最初でない場合には、今回に取得する事例手順ＩＤが前回に取得した事例手順ＩＤと同一であるか否かを判定する。このとき、区間推定部１８０は、今回に取得した事例手順ＩＤが前回に取得した事例手順ＩＤと同じである場合には、同じ事例手順ＩＤの説明が継続しているものとみなし、認識キーワードリストから次の事例手順ＩＤ及び出現時刻を取得する。例えば、図７及び図９に示す例で言えば、区間推定部１８０は、図９に示す認識キーワードリストにおける上から１番目から３番目の事例手順ＩＤ及び出現時刻を取得し終えるまでは、事例手順ＩＤ「ＱＡ１２３−１」の説明が継続しているとみなす。 Further, if the case procedure ID and the appearance time acquired from the recognition keyword list are not the first, the section estimation unit 180 determines whether or not the case procedure ID acquired this time is the same as the case procedure ID acquired last time. judge. At this time, if the case procedure ID acquired this time is the same as the case procedure ID acquired last time, the section estimation unit 180 considers that the description of the same case procedure ID is continuing, and the recognition keyword list To obtain the next case procedure ID and appearance time. For example, in the example shown in FIG. 7 and FIG. 9, the section estimation unit 180 does not acquire the case examples until the first to third case procedure IDs and appearance times from the top in the recognition keyword list shown in FIG. It is assumed that the description of the procedure ID “QA 123-1” is continuing.

一方、区間推定部１８０は、今回に取得した事例手順ＩＤが前回に取得した事例手順ＩＤと異なる場合には、今回に取得した出現時刻を含む発話区間の開始時刻を、今回に取得した事例手順ＩＤに対応する音声区間の開始時刻として推定する。例えば、区間推定部１８０は、図９に示す認識キーワードリストから事例手順ＩＤ「ＱＡ１２３−２」及び出現時刻「１１０３００ｍｓｅｃ」を取得した場合に、次の手順の発話に移ったものとみなす。つまり、区間推定部１８０は、事例手順ＩＤ「ＱＡ１２３−１」の説明が終了して事例手順ＩＤ「ＱＡ１２３−２」が開始されたものとみなす。なお、この場合には、事例手順ＩＤ「ＱＡ１２３−２」に対応する音声区間の開始時刻とみなすので、この開始時刻が事例手順ＩＤ「ＱＡ１２３−１」に対応する音声区間の終了時刻と仮に推定されることになる。 On the other hand, when the case procedure ID acquired this time is different from the case procedure ID acquired last time, the section estimation unit 180 obtains the start time of the utterance section including the appearance time acquired this time as the case procedure acquired this time. Estimated as the start time of the speech segment corresponding to the ID. For example, when the case procedure ID “QA123-2” and the appearance time “110300 msec” are acquired from the recognition keyword list illustrated in FIG. 9, the section estimation unit 180 considers that the utterance has moved to the next procedure. That is, the section estimation unit 180 considers that the case procedure ID “QA123-2” has been started and the case procedure ID “QA123-2” has been started. In this case, since it is regarded as the start time of the voice section corresponding to the case procedure ID “QA123-2”, this start time is temporarily estimated as the end time of the voice section corresponding to the case procedure ID “QA123-1”. Will be.

このように、区間推定部１８０は、１つのコールＩＤの音声データの音声認識結果として得られた認識キーワードリストのレコード全てに上述の処理を行うことにより、手順対応区間の推定結果を得る。つまり、区間推定部１８０は、１つのコールＩＤの音声データを手順説明語句の出現時刻で区切ることにより、回答事項に含まれる１つの手順が説明されている区間ごとに区切る処理を行う。なお、ここでは、手順説明語句の出現時刻で区切っただけであり、実際の音声区間の終了時刻が次の手順説明の開始時刻とは限らない。 As described above, the section estimation unit 180 obtains the estimation result of the procedure-corresponding section by performing the above-described processing on all the recognition keyword list records obtained as the voice recognition result of the voice data of one call ID. That is, the section estimation unit 180 performs a process of partitioning each procedure in which one procedure included in the answer item is described by separating the voice data of one call ID by the appearance time of the procedure explanation word / phrase. It should be noted that here, the procedure explanation word / phrase is only separated by the appearance time, and the actual end time of the speech section is not necessarily the start time of the next procedure explanation.

図１１は、手順対応区間の推定結果の一例を示す図である。例えば、図１１に示すように、手順対応区間の推定結果は、コールＩＤごとに事例手順ＩＤ、手順対応区間の開始時刻［ｍｓｅｃ］及び終了時刻［ｍｓｅｃ］が対応付けられた情報である。図１１に示す例で言えば、事例手順ＩＤ「ＱＡ１２３−１」に関する対話が開始時刻「１０」から終了時刻「１３１０５０」まで行われたことを示す。また、事例手順ＩＤ「ＱＡ１２３−２」に関する対話が開始時刻「１３１０５０」から終了時刻「・・・」まで行われたことを示す。なお、ここでは、コール「ｃａｌｌ００１」に関する手順対応区間だけを例示したが、補足語句が未抽出である音声データのコールＩＤすべてに同様の処理が行われる。 FIG. 11 is a diagram illustrating an example of a procedure-corresponding section estimation result. For example, as illustrated in FIG. 11, the estimation result of the procedure corresponding section is information in which the case procedure ID, the start time [msec] and the end time [msec] of the procedure corresponding section are associated with each call ID. In the example shown in FIG. 11, it is shown that the conversation related to the case procedure ID “QA 123-1” was performed from the start time “10” to the end time “131050”. In addition, it shows that the conversation related to the case procedure ID “QA123-2” was performed from the start time “131050” to the end time “...”. Here, only the procedure-corresponding section related to the call “call 001” is illustrated, but the same processing is performed for all call IDs of voice data from which supplemental words / phrases have not been extracted.

（２）手順説明語句の対応区間の推定
次に、区間推定部１８０による手順説明語句の対応区間の推定を説明する。区間推定部１８０は、操作時間取得部１７０によって出力された操作時間リストを用いて、手順説明語句の対応区間を推定する。なお、区間推定部１８０は、ＱＡ事例の回答事項に含まれる１つの手順を処理単位とし、先に推定した手順対応区間全てに対して手順説明語句の対応区間を推定し終えるまで繰り返し処理を行う。 (2) Estimation of Corresponding Section of Procedure Explanation Words Next, estimation of a corresponding section of procedure explanation words by the section estimation unit 180 will be described. The section estimation unit 180 uses the operation time list output by the operation time acquisition unit 170 to estimate the corresponding section of the procedure explanation phrase. The section estimation unit 180 uses one procedure included in the answer item of the QA example as a processing unit, and repeats the process until the corresponding sections of the procedure explanation words are estimated for all the procedure corresponding sections estimated previously. .

この点を順を追って説明すると、区間推定部１８０は、先に推定した手順対応区間の開始時刻と終了時刻を取得する。その後、区間推定部１８０は、操作時間取得部１７０によって出力された操作時間リストのうち、操作時間（録音開始時刻からの経過時間）が最も短い操作時間から順に操作時間及び切替後の操作部位を取得する。そして、区間推定部１８０は、切替後の操作部位との間で対応関係を有する手順説明語句を対応関係記憶部１２５から抽出する。 If this point is explained in order, the section estimation unit 180 acquires the start time and end time of the procedure-corresponding section estimated previously. Thereafter, the section estimation unit 180 selects the operation time and the operation part after switching in order from the operation time with the shortest operation time (elapsed time from the recording start time) in the operation time list output by the operation time acquisition unit 170. get. Then, the section estimation unit 180 extracts a procedure explanation word / phrase having a correspondence relationship with the operation part after switching from the correspondence relationship storage unit 125.

さらに、区間推定部１８０は、オペレータの発話区間情報を参照して、切替操作が行われた操作時間に対応するオペレータの発話（Ｎ）を取得するとともに取得したオペレータの発話の次のオペレータの発話（Ｎ＋１）を取得する。ここで、区間推定部１８０は、操作時間を含むオペレータの発話が存在する場合には、そのオペレータの発話（Ｎ）を取得するとともに、取得したオペレータの発話の次のオペレータの発話（Ｎ＋１）を取得する。一方、区間推定部１８０は、操作時間を含むオペレータの発話が存在しない場合には、その直後のオペレータの発話（Ｎ）を取得するとともに、取得したオペレータの発話の次のオペレータの発話（Ｎ＋１）を取得する。 Further, the section estimation unit 180 refers to the utterance section information of the operator, acquires the utterance (N) of the operator corresponding to the operation time when the switching operation is performed, and utters the operator next to the acquired utterance of the operator. (N + 1) is acquired. Here, when there is an operator utterance including the operation time, the section estimation unit 180 acquires the utterance (N) of the operator, and the utterance (N + 1) of the operator next to the acquired operator utterance. get. On the other hand, when there is no operator utterance including the operation time, the section estimation unit 180 acquires the operator utterance (N) immediately after that and the operator utterance (N + 1) next to the acquired operator utterance. To get.

このとき、区間推定部１８０は、次のオペレータの発話区間（Ｎ＋１の発話区間）が所定の期間よりも長い場合には、次のオペレータの発話（Ｎ＋１）の開始時刻を手順説明語句に関する音声区間の終了時刻と推定する。また、次のオペレータの発話区間（Ｎ＋１の発話区間）が所定の期間よりも短い場合には、次々回のオペレータの発話（Ｎ＋２）を取得し、次々回のオペレータの発話（Ｎ＋２）が所定の期間よりも短いか否かをさらに判定する。このようにして、区間推定部１８０は、所定の期間より長いオペレータの発話が見つかるまでオペレータの発話をサーチする。なお、ここで言う所定期間は、オペレータが顧客の発話に対して「はい」や「うん」などの応答発話を行う期間として予想される期間をコールセンタ関係者が任意に設定することができる。 At this time, when the next operator's utterance section (N + 1 utterance section) is longer than the predetermined period, the section estimation unit 180 uses the start time of the next operator's utterance (N + 1) as the voice section related to the procedure explanation phrase. Is estimated to be the end time of If the next operator's utterance section (N + 1 utterance section) is shorter than the predetermined period, the next operator's utterance (N + 2) is acquired, and the next operator's utterance (N + 2) is acquired from the predetermined period. It is further determined whether or not it is too short. In this manner, the section estimation unit 180 searches for an operator's utterance until an operator's utterance longer than a predetermined period is found. Note that the predetermined period mentioned here can be arbitrarily set by the call center personnel as a period during which the operator is expected to perform a response utterance such as “Yes” or “Yes” in response to the customer's utterance.

このように、区間推定部１８０は、操作時間取得部１７０によって出力された操作時間リストレコード全てに上述の処理を行うことにより、手順説明語句の対応区間の推定結果を得る。つまり、区間推定部１８０は、操作部位が切り替えられた操作時間を用いて、先に推定した手順対応区間の中を手順説明語句の対応区間で区切る。 As described above, the section estimation unit 180 performs the above-described processing on all the operation time list records output by the operation time acquisition unit 170, thereby obtaining the estimation result of the corresponding section of the procedure explanation phrase. That is, the section estimation unit 180 uses the operation time when the operation part is switched to divide the previously estimated procedure corresponding section into the corresponding sections of the procedure explanatory words.

図１２は、手順説明語句の対応区間の推定結果の一例を示す図である。例えば、図１２に示すように、手順説明語句の対応区間の推定結果は、コールＩＤごとに事例手順ＩＤ、手順説明語句及び終了時刻が対応付けられた情報である。図１２に示す例では、事例手順ＩＤ「ＱＡ１２３−１」に含まれる手順説明語句「インターネットオプション」の対話が終了時刻「１１０２８０（ｍｓｅｃ）」に終了したことを示す。また、事例手順ＩＤ「ＱＡ１２３−１」に含まれる手順説明語句の対話が終了時刻「・・・（ｍｓｅｃ）」に終了したことを示す。 FIG. 12 is a diagram illustrating an example of the estimation result of the corresponding section of the procedure explanation word / phrase. For example, as shown in FIG. 12, the estimation result of the corresponding section of the procedure explanation phrase is information in which the case procedure ID, the procedure explanation phrase, and the end time are associated with each call ID. The example shown in FIG. 12 indicates that the conversation of the procedure explanation phrase “Internet option” included in the case procedure ID “QA 123-1” has ended at the end time “110280 (msec)”. Further, it indicates that the dialogue of the procedure explanation word / phrase included in the case procedure ID “QA 123-1” is ended at the end time “... (Msec)”.

図２の説明に戻り、補足語句抽出部１９０は、区間推定部１８０により推定された手順説明語句の音声区間の推定結果と、発話区間抽出部１５５により抽出されたオペレータの発話区間情報とを用いて、手順説明語句を補足するための補足語句を抽出する。この補足語句抽出部１９０は、補足語句が未抽出であるコールＩＤの音声データであり、区間推定部１８０によって推定された手順説明語句の音声区間が存在する音声データのコールＩＤを補足語句の抽出対象とすることができる。 Returning to the explanation of FIG. 2, the supplemental phrase extraction unit 190 uses the estimation result of the speech section of the procedure explanation phrase estimated by the section estimation unit 180 and the utterance section information of the operator extracted by the utterance section extraction unit 155. Thus, a supplemental phrase for supplementing the procedure explanation phrase is extracted. The supplemental phrase extraction unit 190 extracts the supplemental phrase from the call ID of the speech data in which the supplementary phrase has not been extracted and the speech data of the procedure explanation phrase estimated by the section estimation unit 180 exists. Can be targeted.

具体的に説明すると、補足語句抽出部１９０は、発話区間抽出部１５５により抽出されたオペレータの発話区間情報のうち、区間推定部１８０により推定された手順説明語句の音声区間に含まれるオペレータの発話の発話区間を取得する。例えば、図１１及び図１２に示す例で言えば、補足語句抽出部１９０は、手順説明語句「インターネットオプション」に関する音声区間「１０（ｍｓｅｃ）」〜「１１０２８０（ｍｓｅｃ）」に含まれるオペレータの各発話の発話区間を取得する。 Specifically, the supplemental phrase extraction unit 190 includes the operator's utterance included in the speech section of the procedure explanation phrase estimated by the section estimation unit 180 out of the utterance section information of the operator extracted by the utterance section extraction unit 155. Get the utterance interval. For example, in the example shown in FIGS. 11 and 12, the supplemental phrase extracting unit 190 performs each of the operators included in the speech sections “10 (msec)” to “110280 (msec)” related to the procedure explanation phrase “Internet option”. Get the utterance section of the utterance.

その上で、補足語句抽出部１９０は、先に取得したオペレータの発話の発話区間の音声データを補足語句の抽出範囲とし、その補足語句の抽出範囲の音声データに音声認識を行って補足語句を抽出する。例えば、音声データから補足語句となり得る語句を抽出する音素認識技術としては、韻律データから未知語を検索して抽出する技術やその他の公知の技術を任意に選択して使用できる。また、補足語句となり得る語句から補足語句とする語句を抽出する技術としては、特開２００８−３３８３５号公報に開示される技術やその他の公知の技術を任意に選択して使用できる。 Then, the supplemental phrase extraction unit 190 sets the speech data of the utterance section of the operator's utterance acquired previously as the supplemental phrase extraction range, performs speech recognition on the speech data in the supplemental phrase extraction range, and extracts the supplemental phrase. Extract. For example, as a phoneme recognition technique for extracting a phrase that can be a supplemental phrase from speech data, a technique for searching and extracting an unknown word from prosodic data and other known techniques can be arbitrarily selected and used. In addition, as a technique for extracting a phrase to be a supplemental phrase from a phrase that can be a supplemental phrase, a technique disclosed in Japanese Patent Application Laid-Open No. 2008-33835 and other known techniques can be arbitrarily selected and used.

その後、補足語句抽出部１９０は、このようにして抽出した補足語句をその補足語句の抽出範囲を定めるのに使用した音声区間の手順説明語句及びその事例手順ＩＤに対応付けることにより、補足語句リストを生成する。 Thereafter, the supplemental phrase extraction unit 190 associates the supplemental phrase extracted in this way with the procedure explanation phrase and the example procedure ID of the speech section used to determine the extraction range of the supplemental phrase, thereby obtaining the supplemental phrase list. Generate.

図１３は、補足語句リストの一例を示す図である。例えば、図１３に示すように、補足語句リストは、事例手順ＩＤごとに手順説明語句及び補足語句を対応付けられた情報である。図１３に示す例で言えば、事例手順ＩＤ「ＱＡ１２３−１」の手順説明語句「インターネットオプション」として補足語句「メニューバー」が対応付けたことを示す。また、手順ＩＤ「ＱＡ１２３−１」の手順説明語句「インターネットオプション」として補足語句「２番目」が対応付けられたことを示す。また、手順ＩＤ「ＱＡ１２３−１」の手順説明語句「接続タブ」として補足語句「１番上」が対応付けられたことを示す。また、手順ＩＤ「ＱＡ１２３−２」の手順説明語句「ＬＡＮ」として補足語句「１番下」が対応付けられたことを示す。 FIG. 13 is a diagram illustrating an example of a supplemental phrase list. For example, as illustrated in FIG. 13, the supplemental phrase list is information in which a procedure explanation phrase and a supplemental phrase are associated with each case procedure ID. In the example illustrated in FIG. 13, the supplemental phrase “menu bar” is associated with the procedure explanation phrase “Internet option” of the case procedure ID “QA 123-1”. Further, it indicates that the supplementary phrase “second” is associated as the procedure explanation phrase “Internet option” of the procedure ID “QA 123-1”. Further, it is shown that the supplemental phrase “first” is associated with the procedure explanation phrase “connection tab” of the procedure ID “QA 123-1”. Further, it indicates that the supplementary phrase “bottom 1” is associated with the procedure explanation phrase “LAN” of the procedure ID “QA123-2”.

なお、図２に示したオペレータ支援装置１００は、上記の韻律情報抽出部１５０、発話区間抽出部１５５、音声認識部１６０、操作時間取得部１７０、区間推定部１８０及び補足語句抽出部１９０の各機能を情報処理装置に搭載することで実現することもできる。この情報処理装置としては、既知のパーソナルコンピュータ、ワークステーション、ＡＳＰ（Application Service Provider）サーバ装置、携帯電話、ＰＨＳ端末、移動体通信端末またはＰＤＡなどが挙げられる。 The operator support apparatus 100 shown in FIG. 2 includes each of the prosodic information extraction unit 150, the speech segment extraction unit 155, the speech recognition unit 160, the operation time acquisition unit 170, the segment estimation unit 180, and the supplemental phrase extraction unit 190. It can also be realized by installing the function in the information processing apparatus. Examples of the information processing apparatus include a known personal computer, workstation, ASP (Application Service Provider) server apparatus, mobile phone, PHS terminal, mobile communication terminal, or PDA.

また、ＰＣ操作ログ記憶部１１０、事例データ記憶部１２０、対応関係記憶部１２５及び音声データ記憶部１３０は、以下に示すような記憶装置である。例えば、記憶装置とは、ＲＡＭ（Random Access Memory）、ＲＯＭ（Read Only Memory）、フラッシュメモリ（Flash Memory）などの半導体メモリ素子、ハードディスクや光ディスクなどである。また、対応関係設定部１４０、韻律情報抽出部１５０、発話区間抽出部１５５、音声認識部１６０、操作時間取得部１７０、区間推定部１８０及び補足語句抽出部１９０は、集積回路や電子回路により実現される。集積回路としては、例えば、ＡＳＩＣ（Application Specific Integrated Circuit）やＦＰＧＡ（Field Programmable Gate Array）などを適用できる。また、電子回路としては、ＣＰＵ（Central Processing Unit）やＭＰＵ（Micro Processing Unit）などを適用できる。 The PC operation log storage unit 110, the case data storage unit 120, the correspondence relationship storage unit 125, and the voice data storage unit 130 are storage devices as described below. For example, the storage device is a semiconductor memory device such as a random access memory (RAM), a read only memory (ROM), or a flash memory, a hard disk, or an optical disk. The correspondence setting unit 140, prosodic information extraction unit 150, speech segment extraction unit 155, speech recognition unit 160, operation time acquisition unit 170, segment estimation unit 180, and supplemental phrase extraction unit 190 are realized by an integrated circuit or an electronic circuit. Is done. As the integrated circuit, for example, an application specific integrated circuit (ASIC) or a field programmable gate array (FPGA) can be applied. Further, as an electronic circuit, a CPU (Central Processing Unit), an MPU (Micro Processing Unit), or the like can be applied.

［オペレータ支援装置による処理］
次に、本実施例に係るオペレータ支援装置の処理の流れを説明する。なお、ここでは、（１）オペレータ支援装置１００における処理の流れを説明しながら、区間推定部１８０によって行われる（２）手順対応区間の推定処理を説明し、（３）手順説明語句の対応区間の推定処理を説明する。その後、（１）オペレータ支援装置１００における処理の流れを説明しながら、補足語句抽出部１９０によって行われる（４）補足語句抽出処理を説明する。 [Processing by operator support device]
Next, a processing flow of the operator support apparatus according to the present embodiment will be described. Here, (1) the process corresponding to the procedure explanation word / phrase is explained by (2) the procedure corresponding section estimation process performed by the section estimation unit 180 while explaining the flow of processing in the operator support apparatus 100. The estimation process will be described. Thereafter, (1) the supplemental phrase extraction process performed by the supplemental phrase extraction unit 190 will be described while explaining the flow of processing in the operator support apparatus 100.

（１）オペレータ支援装置における処理の流れ
図１４を用いて、本実施例に係るオペレータ支援装置における処理の流れについて説明する。図１４は、実施例２に係るオペレータ支援装置における処理の流れを示すフローチャートである。なお、オペレータ支援装置１００は、音声データ記憶部１３０に記憶された応対履歴リストに補足語句が未抽出であるコールＩＤ及び事例ＩＤが存在する場合に処理を起動する。 (1) Process Flow in Operator Support Device The process flow in the operator support device according to the present embodiment will be described with reference to FIG. FIG. 14 is a flowchart illustrating a flow of processing in the operator support apparatus according to the second embodiment. Note that the operator support apparatus 100 activates the process when there is a call ID and case ID from which no supplemental phrase has been extracted in the response history list stored in the voice data storage unit 130.

図１４に示すように、オペレータ支援装置１００は、補足語句が未抽出であるコールＩＤの音声データを対象に、ステップＳ１０１Ａ〜Ｓ１０４Ａの音声データ系の処理と、ステップＳ１０１Ｂ〜Ｓ１０２Ｂの操作履歴系の処理とを並列して行うことができる。 As shown in FIG. 14, the operator support apparatus 100 targets the voice data of the call ID from which the supplemental phrase has not been extracted, the processing of the voice data system of steps S101A to S104A, and the operation history system of steps S101B to S102B. Processing can be performed in parallel.

音声データ系の処理について説明すると、韻律情報抽出部１５０は、音声データ記憶部１３０に記憶された音声データからオペレータの韻律データリスト（図５参照）を生成する（ステップＳ１０１Ａ）。そして、発話区間抽出部１５５は、韻律情報抽出部１５０より抽出されたオペレータの韻律データリストから発話区間情報（図７参照）を生成する（ステップＳ１０２Ａ）。その後、音声認識部１６０は、音声データ記憶部１３０に記憶された音声データに音声認識を行って手順説明語句の出現時刻を含む認識キーワードリスト（図９参照）を生成する（ステップＳ１０３Ａ）。なお、ここでは、韻律情報の抽出、発話区間の抽出を行った後に音声認識を行う場合を説明したが、音声認識については、韻律情報の抽出及び発話区間の抽出と並列して行うこともできる。 Explaining the processing of the speech data system, the prosody information extraction unit 150 generates an operator's prosody data list (see FIG. 5) from the speech data stored in the speech data storage unit 130 (step S101A). Then, the utterance section extraction unit 155 generates utterance section information (see FIG. 7) from the operator's prosody data list extracted by the prosody information extraction unit 150 (step S102A). Thereafter, the voice recognition unit 160 performs voice recognition on the voice data stored in the voice data storage unit 130 to generate a recognition keyword list (see FIG. 9) including the appearance times of the procedure explanation words (step S103A). Here, the case where speech recognition is performed after extracting prosodic information and utterance intervals has been described, but speech recognition can also be performed in parallel with the extraction of prosodic information and the extraction of utterance intervals. .

このようにしてステップＳ１０１Ａ〜Ｓ１０３Ａの音声データ系の処理が終了した後に、区間推定部１８０は、発話区間情報及び認識キーワードリストを用いて、音声区間の開始時刻を推定する処理を行う（ステップＳ１０４Ａ）。 After the processing of the speech data system in steps S101A to S103A is thus completed, the section estimation unit 180 performs processing for estimating the start time of the speech section using the speech section information and the recognition keyword list (step S104A). ).

（２）手順対応区間の推定処理
ここで、図１５を用いて、図１４のステップＳ１０４Ａで示した手順対応区間の推定処理について説明する。図１５は、実施例２に係る手順対応区間の推定処理の手順を示すフローチャートである。 (2) Procedure Corresponding Section Estimation Process Here, the procedure corresponding section estimation process shown in step S104A of FIG. 14 will be described with reference to FIG. FIG. 15 is a flowchart illustrating a procedure of a procedure corresponding section estimation process according to the second embodiment.

図１５に示すように、区間推定部１８０は、認識キーワードリストから事例手順ＩＤ及び出現時刻を順次取得する（ステップＳ２０１）。そして、区間推定部１８０は、認識キーワードリストから取得した出現時刻を含むオペレータの発話の発話区間情報を取得する（ステップＳ２０２）。 As illustrated in FIG. 15, the section estimation unit 180 sequentially acquires the case procedure ID and the appearance time from the recognition keyword list (Step S201). Then, the section estimation unit 180 acquires the utterance section information of the operator's utterance including the appearance time acquired from the recognition keyword list (step S202).

このとき、区間推定部１８０は、今回に取得する事例手順ＩＤが前回に取得した事例手順ＩＤと同一であるか否かを判定する（ステップＳ２０３）。そして、今回及び前回の事例手順ＩＤが同じである場合（ステップＳ２０３Ｎｏ）には、区間推定部１８０は、今回に取得した出現時刻を含む発話区間の開始時刻を、今回に取得した事例手順ＩＤに対応する手順対応区間の開始時刻として推定する（ステップＳ２０４）。なお、認識キーワードリストから事例手順ＩＤ及び出現時刻を最初に取得した場合にも、区間推定部１８０は、出現時刻を含む発話区間の開始時刻を、その事例手順ＩＤに対応する手順対応区間の開始時刻として推定する。また、今回及び前回の事例手順ＩＤが同じである場合にだけ、出現時刻を含むオペレータの発話の発話区間情報を取得するステップＳ２０２の処理を行うこととしてもかまわない。 At this time, the section estimation unit 180 determines whether or not the case procedure ID acquired this time is the same as the case procedure ID acquired last time (step S203). If the current and previous case procedure IDs are the same (No in step S203), the section estimation unit 180 sets the start time of the utterance section including the appearance time acquired this time as the case procedure ID acquired this time. Estimated as the start time of the corresponding procedure-corresponding section (step S204). Even when the case procedure ID and the appearance time are first acquired from the recognition keyword list, the section estimation unit 180 uses the start time of the utterance section including the appearance time as the start of the procedure corresponding section corresponding to the case procedure ID. Estimated as time. Further, only when the current and previous case procedure IDs are the same, the process of step S202 for acquiring the utterance section information of the utterance of the operator including the appearance time may be performed.

一方、今回及び前回の事例手順ＩＤが同じである場合（ステップＳ２０３Ｙｅｓ）には、区間推定部１８０は、同じ事例手順ＩＤの説明が継続しているものとみなし、手順対応い区間の開始時刻を推定することなく、ステップＳ２０５へ移行する。 On the other hand, when the current case procedure ID and the previous case procedure ID are the same (Yes in step S203), the section estimation unit 180 considers that the description of the same case procedure ID is continuing, and sets the start time of the section corresponding to the procedure. The process proceeds to step S205 without estimation.

そして、区間推定部１８０は、認識キーワードリストとして登録された全てのレコードに対して上記のステップＳ２０１〜ステップＳ２０４までの処理を行うまで（ステップＳ２０５Ｎｏ）、上記のステップＳ２０１〜ステップＳ２０４までの処理を繰り返し行う。その後、認識キーワードリストとして登録された全てのレコードに対する処理が終了すると（ステップＳ２０５Ｙｅｓ）、区間推定部１８０は、手順対応区間の推定を終了する。 Then, the section estimation unit 180 performs the processes from step S201 to step S204 until the processes from step S201 to step S204 are performed on all records registered as the recognition keyword list (No in step S205). Repeat. Thereafter, when the processing for all the records registered as the recognition keyword list is completed (step S205 Yes), the section estimation unit 180 ends the estimation of the procedure corresponding section.

図１４の説明に戻り、音声データ系の処理を行う一方で、操作時間取得部１７０は、補足語句が未抽出であるコールＩＤの音声データの録音区間をＰＣ操作ログの切出し範囲とし、ＰＣ操作ログ記憶部１１０から操作時間リストを生成する（ステップＳ１０１Ｂ）。 Returning to the description of FIG. 14, while processing the voice data system, the operation time acquisition unit 170 sets the recording period of the voice data of the call ID for which the supplemental word / phrase has not been extracted as the extraction range of the PC operation log, and performs the PC An operation time list is generated from the log storage unit 110 (step S101B).

このようにしてステップＳ１０１Ａ〜Ｓ１０４Ａ及びステップＳ１０１Ｂが終了した後に、区間推定部１８０は、操作時間リストを用いて、手順説明語句の対応区間を推定する処理を行う（ステップＳ１０５）。 After steps S101A to S104A and step S101B are completed in this way, the section estimation unit 180 performs processing for estimating the corresponding section of the procedure explanation phrase using the operation time list (step S105).

（３）手順説明語句の対応区間の推定処理
ここで、図１６を用いて、図１４のステップＳ１０５で示した手順説明語句の対応区間の推定処理について説明する。図１６は、実施例２に係る手順説明語句の対応区間の推定処理の手順を示すフローチャートである。なお、区間推定部１８０は、ＱＡ事例の回答事項に含まれる１つの手順を処理単位として以下の処理を実行するものとし、ステップＳ１０４Ａで推定された手順対応区間を全て処理するまで繰り返し処理を実行する。 (3) Process for Estimating Corresponding Section of Procedure Explanation Words Here, the process of estimating the corresponding section of the procedure explanation words shown in step S105 of FIG. 14 will be described using FIG. FIG. 16 is a flowchart illustrating a procedure of a process for estimating a corresponding section of a procedure explanatory word according to the second embodiment. The section estimation unit 180 executes the following process with one procedure included in the answer item of the QA example as a processing unit, and repeats the process until all the procedure corresponding sections estimated in step S104A are processed. To do.

図１６に示すように、区間推定部１８０は、先の手順対応区間の推定で得た手順対応区間の開始時刻と終了時刻を取得する（ステップＳ３０１）。その後、区間推定部１８０は、操作時間取得部１７０によって出力された操作時間リストのうち、操作時間（録音開始時刻からの経過時間）が最も短い操作時間から順に操作時間及び切替後の操作部位を取得する（ステップＳ３０２）。 As illustrated in FIG. 16, the section estimation unit 180 acquires the start time and end time of the procedure corresponding section obtained by the previous procedure corresponding section estimation (step S301). Thereafter, the section estimation unit 180 selects the operation time and the operation part after switching in order from the operation time with the shortest operation time (elapsed time from the recording start time) in the operation time list output by the operation time acquisition unit 170. Obtain (step S302).

そして、区間推定部１８０は、切替後の操作部位との間で対応関係を有する手順説明語句を対応関係記憶部１２５から抽出する（ステップＳ３０３）。このとき、手順説明語句が抽出された場合（ステップＳ３０４Ｙｅｓ）には、区間推定部１８０は、オペレータの発話区間情報を参照して、切替操作が行われた操作時間に対応するオペレータの発話（Ｎ）を取得する（ステップＳ３０５）。さらに、区間推定部１８０は、取得したオペレータの発話の次のオペレータの発話（Ｎ＋１）を取得する（ステップＳ３０６）。なお、ステップＳ３０５では、操作時間を含むオペレータの発話が存在する場合には、そのオペレータの発話（Ｎ）を取得し、また、操作時間を含むオペレータの発話が存在しない場合には、その直後のオペレータの発話（Ｎ）を取得する。 Then, the section estimation unit 180 extracts a procedure explanation word / phrase having a correspondence relationship with the operation part after switching from the correspondence relationship storage unit 125 (step S303). At this time, when the procedure explanation word is extracted (step S304 Yes), the section estimation unit 180 refers to the utterance section information of the operator and utters the operator's utterance corresponding to the operation time when the switching operation is performed (N ) Is acquired (step S305). Furthermore, the section estimation unit 180 acquires the utterance (N + 1) of the operator next to the acquired utterance of the operator (step S306). In step S305, if the operator's utterance including the operation time exists, the utterance (N) of the operator is acquired. If the operator's utterance including the operation time does not exist, the operator's utterance (N) is acquired. The operator's utterance (N) is acquired.

このとき、次のオペレータの発話区間（Ｎ＋１の発話区間）が所定の期間よりも長い場合（ステップＳ３０７Ｙｅｓ）には、区間推定部１８０は、次のオペレータの発話（Ｎ＋１）の開始時刻を手順説明語句の音声区間の終了時刻と推定する（ステップＳ３０８）。 At this time, if the next operator's utterance section (N + 1 utterance section) is longer than the predetermined period (step S307 Yes), the section estimation unit 180 describes the start time of the next operator's utterance (N + 1) as a procedure. The end time of the speech section of the phrase is estimated (step S308).

一方、次のオペレータの発話区間（Ｎ＋１の発話区間）が所定の期間よりも短い場合（ステップＳ３０７Ｎｏ）には、区間推定部１８０は、次々回のオペレータの発話（Ｎ＋２）を取得する（ステップＳ３０６）。そして、区間推定部１８０は、次々回のオペレータの発話（Ｎ＋２）が所定の期間よりも短いか否かをさらに判定する。このようにして、区間推定部１８０は、所定の期間より長いオペレータの発話が見つかるまで（ステップＳ３０７Ｙｅｓ）オペレータの発話をサーチする。 On the other hand, when the next operator's utterance section (N + 1 utterance section) is shorter than the predetermined period (No in step S307), the section estimation unit 180 acquires the next operator's utterance (N + 2) (step S306). . Then, the section estimation unit 180 further determines whether or not the next utterance (N + 2) of the operator is shorter than a predetermined period. In this manner, the section estimation unit 180 searches for an operator utterance until an operator utterance longer than a predetermined period is found (step S307 Yes).

また、手順説明語句が抽出されなければ（ステップＳ３０４Ｎｏ）、区間推定部１８０は、切替後の操作部位の遷移が手順説明語句に関係のない操作部位であるため、手順説明語句の対応区間を推定することなく、ステップＳ３０９へ移行する。 If the procedure explanation word is not extracted (No in step S304), the section estimation unit 180 estimates the corresponding section of the procedure explanation word because the transition of the operation part after switching is an operation part not related to the procedure explanation word. Without moving to step S309.

そして、操作時間リストとして登録された全てのレコードに対して上記のステップＳ３０２〜ステップＳ３０８までの処理を行うまで（ステップＳ３０９Ｎｏ）、上記のステップＳ３０２〜ステップＳ３０８までの処理を繰り返し行う。 Then, the processing from step S302 to step S308 is repeated until the processing from step S302 to step S308 is performed on all the records registered as the operation time list (No in step S309).

その後、操作時間リストとして登録された全てのレコードに対する処理が終了すると（ステップＳ３０９Ｙｅｓ）、区間推定部１８０は、手順説明語句に関する対応区間の推定結果を補足語句抽出部１９０へ出力する（ステップＳ３１０）。なお、上記のステップＳ３０１〜ステップＳ３１０の処理は、全ての手順対応区間に対して処理が行われるまで繰り返し行われる。 Thereafter, when the processing for all the records registered as the operation time list is completed (step S309 Yes), the section estimation unit 180 outputs the estimation result of the corresponding section related to the procedure explanation word to the supplemental word extraction unit 190 (step S310). . Note that the processes in steps S301 to S310 are repeated until all the procedure corresponding sections are processed.

図１４の説明に戻り、補足語句抽出部１９０は、区間推定部１８０により推定された手順説明語句の音声区間の推定結果と、発話区間抽出部１５５により抽出されたオペレータの発話区間情報とを用いて、補足語句を抽出する（ステップＳ１０６）。 Returning to the description of FIG. 14, the supplemental phrase extraction unit 190 uses the estimation result of the speech section of the procedure explanation phrase estimated by the section estimation unit 180 and the utterance section information of the operator extracted by the utterance section extraction unit 155. Then, supplemental phrases are extracted (step S106).

（４）補足語句抽出処理
ここで、図１７を用いて、図１４のステップＳ１０６で示した手順説明語句の対応区間の推定処理について説明する。図１７は、実施例２に係る補足語句抽出処理の手順を示すフローチャートである。なお、補足語句抽出部１９０は、１つの音声区間の推定結果を処理単位として以下の処理を実行するものとし、ステップＳ１０５で推定された音声区間の推定結果を全て処理するまで繰り返し処理を実行する。 (4) Supplemental Phrase Extraction Processing Here, the processing for estimating the corresponding section of the procedure explanatory words shown in step S106 of FIG. 14 will be described using FIG. FIG. 17 is a flowchart illustrating a procedure of supplemental phrase extraction processing according to the second embodiment. Note that the supplemental phrase extraction unit 190 executes the following processing using the estimation result of one speech segment as a processing unit, and repeatedly performs the processing until all the estimation results of the speech segment estimated in step S105 are processed. .

図１７に示すように、補足語句抽出部１９０は、発話区間抽出部１５５により抽出されたオペレータの発話区間情報のうち、区間推定部１８０により推定された手順説明語句の音声区間に含まれるオペレータの発話の発話区間を取得する（ステップＳ４０１）。 As shown in FIG. 17, the supplemental phrase extraction unit 190 includes the operator's utterance section information extracted by the utterance section extraction unit 155, and includes the operator's speech section of the procedure explanation phrase estimated by the section estimation unit 180. The utterance section of the utterance is acquired (step S401).

そして、補足語句抽出部１９０は、先に取得したオペレータの発話の発話区間の音声データを補足語句の抽出範囲とし、その補足語句の抽出範囲の音声データに音声認識を行って補足語句を抽出する（ステップＳ４０２）。 Then, the supplemental phrase extracting unit 190 sets the speech data of the utterance section of the operator's utterance acquired previously as the supplemental phrase extraction range, and performs speech recognition on the speech data in the supplemental phrase extraction range to extract the supplemental phrase. (Step S402).

その後、補足語句抽出部１９０は、このようにして抽出した補足語句をその補足語句の抽出範囲を定めるのに使用した音声区間の手順説明語句及びその事例手順ＩＤに対応付けることにより、補足語句リストを生成する（ステップＳ４０３）。なお、上記のステップＳ４０１〜ステップＳ４０３の処理は、全ての音声区間の推定結果に対して処理が行われるまで繰り返し行われる。 Thereafter, the supplemental phrase extraction unit 190 associates the supplemental phrase extracted in this way with the procedure explanation phrase and the example procedure ID of the speech section used to determine the extraction range of the supplemental phrase, thereby obtaining the supplemental phrase list. Generate (step S403). Note that the processing in steps S401 to S403 is repeated until processing is performed on the estimation results of all speech sections.

［補足語句の抽出例］
上述してきたオペレータ支援装置１００が音声データから補足語句を抽出する例について、図１８及び図１９を用いて説明する。図１８及び図１９は、実施例２に係る補足語句の抽出例を説明するための説明図である。図１８及び図１９の例では、コールセンタでＰＣの操作に関する問合せを受け付ける場合を想定する。これら図１８及び図１９は、いずれも顧客からの問合せ事項「ブラウザの設定方法を教えて」に対して、手順「インターネットオプションの接続タブを開く」を説明する対話の一例を示す。このうち、図１８の例では、インターネットオプションが中心となる対話に分岐した場合の対話を示す。また、図１９の例では、接続タブが中心となる対話に分岐した場合の対話を示す。 [Example of supplemental word extraction]
An example in which the operator support apparatus 100 described above extracts supplemental phrases from voice data will be described with reference to FIGS. 18 and 19. FIG. 18 and FIG. 19 are explanatory diagrams for explaining an extraction example of supplemental phrases according to the second embodiment. In the example of FIGS. 18 and 19, it is assumed that an inquiry regarding the operation of the PC is received at the call center. FIG. 18 and FIG. 19 both show an example of the dialog explaining the procedure “open the connection tab of the Internet option” for the inquiry item “Tell me how to set the browser” from the customer. Among these, the example in FIG. 18 shows a dialog when the Internet option branches to a dialog centered on. Further, in the example of FIG. 19, the dialog when the connection tab is branched to the center is shown.

また、図１８及び図１９における横軸は、時間軸を指し、右方向に進むにしたがって時間が経過することを示す。また、図中の「ｕ」は、「utterance」の略記であり、「ｕ１」から「ｕ１２」までのブロックはオペレータ又は顧客の発話を示し、これらブロックの横方向の大きさは発話区間の長さを示す。また、時間軸の上下のブロックのうち横軸の上方に図示するブロックは顧客の発話を示し、横軸の下方に図示するブロックはオペレータの発話を示すものとする。また、図１８及び図１９の時間軸の下方の「操作部位」には、ＰＣ操作ログから取得された切替後の操作部位を示し、矢印が示す範囲の横方向の大きさは操作部位が操作の対象となった時間の長さを示す。また、図１８及び図１９の時間軸のさらに下方の「対応区間」には、区間推定部１８０により推定された手順説明語句に関する音声区間を示し、矢印が示す範囲の横方向の大きさは推定された手順説明語句に関する音声区間の長さを示す。 Also, the horizontal axis in FIGS. 18 and 19 indicates the time axis, and indicates that time elapses as it proceeds in the right direction. Also, “u” in the figure is an abbreviation for “utterance”, and blocks from “u1” to “u12” indicate an utterance of an operator or a customer, and the horizontal size of these blocks is the length of the utterance section. It shows. Of the upper and lower blocks on the time axis, the block shown above the horizontal axis indicates the customer's utterance, and the block shown below the horizontal axis indicates the operator's utterance. 18 and FIG. 19, the “operation part” below the time axis indicates the operation part after switching acquired from the PC operation log, and the horizontal size of the range indicated by the arrow indicates that the operation part is operated by the operation part. Indicates the length of time covered by. 18 and FIG. 19, the “corresponding section” further below the time axis indicates a speech section related to the procedure explanation word estimated by the section estimation unit 180, and the horizontal size of the range indicated by the arrow is estimated. The length of the voice section related to the procedure explanation word is shown.

図１８に示す例では、以下の会話が行われたものとする。オペレータは「では、インターネットオプションの接続タブを開いて下さい」（ｕ１）と発話する。顧客は「何それ？」（ｕ２）と発話する。オペレータは「メニューバーの右から二番目の・・・」（ｕ３）と発話する。顧客は「メニューバー？」（ｕ４）と発話する。オペレータは「一番上に、ファイル、編集などと並んでいる・・・」（ｕ５）と発話する。顧客は「あ、はいはい」（ｕ６）と発話する。オペレータは「その右から二番目に、ツールとあるのでクリックして下さい」（ｕ７）と発話する。顧客は「二番目・・・はい」（ｕ８）と発話する。オペレータは「すると、一番下にインターネットオプションというのがあります」（ｕ９）と発話する。顧客は「あ、これね」（ｕ１０）と発話する。オペレータは「はい」（ｕ’）と発話する。オペレータは「そこの接続タブを選択して下さい」（ｕ１１）と発話する。オペレータは「次に、ＬＡＮの設定を開いて下さい」（ｕ１２）と発話する。 In the example shown in FIG. 18, it is assumed that the following conversation has been performed. The operator speaks, "Please open the Internet Options connection tab" (u1). The customer speaks “What?” (U2). The operator speaks “second from the right of the menu bar” (u3). The customer speaks “Menu bar?” (U4). The operator utters "It is lined up with files, edits, etc. at the top ..." (u5). The customer speaks “Ah yes yes” (u6). The operator speaks, “Click on the second tool from the right, click here” (u7). The customer speaks “second ... yes” (u8). The operator speaks, "There is an internet option at the bottom" (u9). The customer speaks “Oh, this is it” (u10). The operator speaks “yes” (u ′). The operator speaks “Please select the connection tab there” (u11). The operator speaks “Next, open the LAN settings” (u12).

また、図１８に示す例では、ｕ１からｕ１２までの対話を含む音声データの中から、以下の手順説明語句が抽出されるものとする。ｕ１の発話から「インターネットオプション」及び「接続タブ」が手順説明語句として抽出される。ｕ９の発話から「インターネットオプション」が手順説明語句として抽出される。ｕ１１の発話から「接続タブ」が手順説明語句として抽出される。ｕ１２の発話から「ＬＡＮ」が手順説明語句として抽出される。 In the example shown in FIG. 18, the following procedure explanation words are extracted from the audio data including the dialogues from u1 to u12. From the utterance of u1, “Internet option” and “connection tab” are extracted as procedure explanation words. “Internet option” is extracted from the utterance of u9 as a procedure explanation phrase. The “connection tab” is extracted from the utterance of u11 as a procedure explanation word. “LAN” is extracted from the utterance of u12 as a procedure explanation word.

また、図１８に示す例では、時間経過に伴って操作部位が「接続タブ」、「ブラウザウィンドウ」、「インターネットオプション」、「接続タブ」の順に切り替えられたことを示す。また、図１８に示す例では、時刻ｔ１はｕ３の発話開始時刻を指し、時刻ｔ２はインターネットオプション画面に遷移した時刻を指し、時刻ｔ３はｕ１１の発話開始時刻を指し、また、時刻ｔ４はｕ１２の発話開始時刻を指す。 Further, the example shown in FIG. 18 indicates that the operation site is switched in the order of “connection tab”, “browser window”, “Internet option”, and “connection tab” with the passage of time. In the example shown in FIG. 18, time t1 indicates the utterance start time of u3, time t2 indicates the time when the screen transitions to the Internet option screen, time t3 indicates the utterance start time of u11, and time t4 is u12. The utterance start time of

図１８に示す例を用いて、手順説明語句「インターネットオプション」に関する音声区間を推定する場合の推定要領を説明する。まず、オペレータ支援装置１００は、操作部位が「ブラウザウィンドウ」から「インターネットオプション」へ変化した時刻「ｔ２」を取得する。そして、オペレータ支援装置１００は、「ｔ２」を含むオペレータの発話、すなわち「ｕ９」を取得する。そして、オペレータ支援装置１００は、さらに次のオペレータの発話、すなわち「ｕ’」を取得する。ここで、オペレータ支援装置１００は、取得した発話の期間が所定の期間よりも短い場合には、さらに次の発話を取得する。これは、次の音声区間の開始時刻が相槌などの意味の無い発話から始まることを防止するためである。ここで、「ｕ’」が所定の期間より短いと仮定すると、オペレータ支援装置１００は、次の発話「ｕ１１」を取得する。ここで、「ｕ１１」は所定の期間より長いものとする。そして、オペレータ支援装置１００は、「ｕ１１」の開始時刻、すなわち「ｔ３」を取得して、手順説明語句「インターネットオプション」の説明に関する音声区間の終了時刻として推定する。このようにして、オペレータ支援装置１００は、「インターネットオプション」の説明に関する音声区間を「ｔ１からｔ３まで」と推定する。 With reference to the example shown in FIG. 18, an estimation procedure for estimating a speech section related to the procedure explanation word “Internet option” will be described. First, the operator support apparatus 100 acquires the time “t2” when the operation part changes from the “browser window” to the “Internet option”. Then, the operator support apparatus 100 acquires the utterance of the operator including “t2”, that is, “u9”. Then, the operator support apparatus 100 further acquires the next operator utterance, that is, “u ′”. Here, the operator support apparatus 100 further acquires the next utterance when the acquired utterance period is shorter than the predetermined period. This is to prevent the start time of the next speech section from starting from a meaningless utterance such as a conflict. Here, assuming that “u ′” is shorter than the predetermined period, the operator support apparatus 100 acquires the next utterance “u11”. Here, “u11” is longer than a predetermined period. Then, the operator support apparatus 100 acquires the start time of “u11”, that is, “t3”, and estimates it as the end time of the voice section related to the explanation of the procedure explanation phrase “Internet option”. In this manner, the operator support apparatus 100 estimates the voice section related to the description of “Internet option” as “from t1 to t3”.

図１９に示す例では、以下の会話が行われたものとする。オペレータは「では、インターネットオプションの接続タブを開いて下さい」（ｕ１）と発話する。顧客は「接続タブって何？」（ｕ２）と発話する。オペレータは「インターネットオプションは開きましたか？」（ｕ３）と発話する。顧客は「開いたよ」（ｕ４）と発話する。オペレータは「一番上に、全般、セキュリティなどと並んでいる・・・」（ｕ５）と発話する。顧客は「あ、これか」（ｕ６）と発話する。オペレータは「はい、そこの接続タブを開いてください」（ｕ７）と発話する。顧客は「はい」（ｕ８）と発話する。オペレータは「次に、ＬＡＮの設定を開いて下さい」（ｕ９）と発話する。 In the example shown in FIG. 19, it is assumed that the following conversation is performed. The operator speaks, "Please open the Internet Options connection tab" (u1). The customer speaks “What is the connection tab?” (U2). The operator speaks “Is the Internet option open?” (U3). The customer speaks "I'm open" (u4). The operator utters, “It is lined up with General, Security, etc. at the top ...” (u5). The customer speaks “Oh, is this” (u6). The operator speaks “Yes, please open the connection tab there” (u7). The customer speaks “Yes” (u8). The operator speaks “Next, open the LAN settings” (u9).

また、図１９に示す例では、ｕ１からｕ９までの対話を含む音声データの中から、以下の手順説明語句が抽出されるものとする。ｕ１の発話から「インターネットオプション」及び「接続タブ」が手順説明語句として抽出される。ｕ３の発話から「インターネットオプション」が手順説明語句として抽出される。ｕ７の発話から「接続タブ」が手順説明語句として抽出される。ｕ９の発話から「ＬＡＮ」が手順説明語句として抽出される。 In the example shown in FIG. 19, it is assumed that the following procedure explanation words / phrases are extracted from audio data including dialogues from u1 to u9. From the utterance of u1, “Internet option” and “connection tab” are extracted as procedure explanation words. “Internet option” is extracted from the utterance of u3 as a procedure explanation phrase. The “connection tab” is extracted from the utterance of u7 as a procedure explanation word. “LAN” is extracted from the utterance of u9 as a procedure explanation word.

また、図１９に示す例では、時間経過に伴って操作部位が「接続タブ」、「インターネットオプション」、「接続タブ」の順に切り替えられたことを示す。また、図１９に示す例には、時刻ｔ１はｕ５の発話開始時刻を指し、時刻ｔ２は接続タブ画面に遷移した時刻を指し、時刻ｔ３はｕ９の発話開始時刻を指す。 Further, the example shown in FIG. 19 indicates that the operation site is switched in the order of “connection tab”, “Internet option”, and “connection tab” with the passage of time. In the example shown in FIG. 19, the time t1 indicates the utterance start time of u5, the time t2 indicates the time of transition to the connection tab screen, and the time t3 indicates the utterance start time of u9.

図１９に示す例を用いて、手順説明語句「接続タブ」に関する音声区間を推定する場合の推定要領を説明する。まず、オペレータ支援装置１００は、操作部位が「インターネットオプション」から「接続タブ」へ変化した時刻「ｔ２」を取得する。そして、オペレータ支援装置１００は、時刻「ｔ２」を含むオペレータの発話の取得を試みるが、対応する発話が存在しない場合には、さらに次の発話「ｕ７」を取得する。そして、オペレータ支援装置１００は、さらに次のオペレータの発話「ｕ９」を取得する。ここで、「ｕ９」は所定の期間より長いと仮定する。そして、オペレータ支援装置１００は、「ｕ９」の開始時刻「ｔ３」を取得して、手順説明語句「接続タブ」の説明に関する音声区間の終了時刻として推定する。このように、「ｕ７」における発話の終了時刻ではなく次の発話「ｕ９」の開始時刻を終端とするのは、次の音声区間の開始時刻が無音区間から開始されてしまうことを防止するためである。このようにして、オペレータ支援装置１００は、「接続タブ」の説明に関する音声区間を「ｔ１からｔ３まで」と推定する。 With reference to the example illustrated in FIG. 19, an estimation procedure for estimating a speech section related to the procedure explanation word “connection tab” will be described. First, the operator support apparatus 100 acquires the time “t2” when the operation part changes from “Internet option” to “connection tab”. Then, the operator support apparatus 100 tries to acquire the utterance of the operator including the time “t2”, but if there is no corresponding utterance, the operator support apparatus 100 further acquires the next utterance “u7”. Then, the operator support apparatus 100 further acquires the next operator utterance “u9”. Here, it is assumed that “u9” is longer than a predetermined period. Then, the operator support apparatus 100 acquires the start time “t3” of “u9” and estimates it as the end time of the voice section related to the description of the procedure explanation phrase “connection tab”. As described above, the start time of the next utterance “u9” rather than the utterance end time of “u7” is terminated to prevent the start time of the next speech section from being started from the silent section. It is. In this way, the operator support apparatus 100 estimates the voice section related to the description of the “connection tab” as “from t1 to t3”.

［実施例２による効果］
上述してきたように、実施例２に係るオペレータ支援装置１００は、画面に表示される操作部位のうち所定の操作部位の操作によって遷移した画面に関係する手順説明語句を抽出する。また、所定の操作部位を操作した時刻より以降に発話が開始された時刻を抽出し、抽出した時刻に至るまでの音声区間を、抽出した手順説明語句の説明に関係する音声区間として推定する。このため、オペレータ支援装置１００は、手順説明語句に対応した補足語句を適切に抽出することができる。 [Effects of Example 2]
As described above, the operator support apparatus 100 according to the second embodiment extracts the procedure explanation words related to the screen that has been changed by the operation of the predetermined operation part among the operation parts displayed on the screen. In addition, the time when the utterance is started after the time when the predetermined operation part is operated is extracted, and the speech section up to the extracted time is estimated as the speech section related to the explanation of the extracted procedure explanation phrase. For this reason, the operator assistance apparatus 100 can appropriately extract the supplemental words / phrases corresponding to the procedure explanation words / phrases.

また、実施例２に係るオペレータ支援装置１００によれば、相槌などの意味の無い発話を手順説明語句の音声区間の末尾に含めることができ、次に推定する手順説明語句の音声区間の開始時刻を実質的な手順説明語句の説明から開始させることが可能になる。 Further, according to the operator support apparatus 100 according to the second embodiment, a meaningless speech such as a conflict can be included at the end of the speech section of the procedure explanation phrase, and the start time of the speech section of the procedure explanation phrase to be estimated next Can be started from the explanation of the substantial procedure explanation words.

なお、本実施例に係るオペレータ支援装置１００では、区間推定部１８０によって推定された音声区間を用いて補足語句を抽出する場合を説明したが、開示の装置の構成がこれに限定されるものではない。例えば、区間推定部１８０によって推定された音声区間の長さは、１つの手順に要した所要時間と等価である。このため、応対業務の所要時間を統計する場合には、本実施例に係るオペレータ支援装置１００では、従来のように１つの応対業務の単位で統計するのではなく応対業務を回答事項の手順単位に細分化し、手順単位で所要時間を統計することが可能となる。さらに、本実施例に係るオペレータ支援装置１００では、区間推定部１８０によって推定された音声区間の長さをオペレータのスキルの推定・評価をはじめ、マニュアルとするＱＡ事例の見直しなどにも使用することもできる。この結果、本実施例に係るオペレータ支援装置１００によれば、オペレータによる応対業務の改善方針を分析することも可能になる。 In addition, in the operator assistance apparatus 100 according to the present embodiment, the case where the supplemental phrase is extracted using the speech section estimated by the section estimation unit 180 has been described. However, the configuration of the disclosed apparatus is not limited thereto. Absent. For example, the length of the speech section estimated by the section estimation unit 180 is equivalent to the time required for one procedure. For this reason, when the time required for the service is statistically calculated, the operator support apparatus 100 according to this embodiment does not perform the statistics for each service as in the conventional case, but the response service is a procedural unit of answer items. It is possible to divide the time into statistics and to calculate the time required for each procedure. Further, in the operator support apparatus 100 according to the present embodiment, the length of the voice section estimated by the section estimation unit 180 is used for the estimation and evaluation of the operator skill and the review of the QA case as a manual. You can also. As a result, according to the operator support apparatus 100 according to the present embodiment, it becomes possible to analyze the improvement policy of the response work by the operator.

さて、これまで開示の装置に関する実施例について説明したが、開示の装置は上述した実施例以外にも、種々の異なる形態にて実施されてよいものである。そこで、以下では、開示の装置に含まれる他の実施例を説明する。 Although the embodiments relating to the disclosed apparatus have been described above, the disclosed apparatus may be implemented in various different forms other than the above-described embodiments. Therefore, other embodiments included in the disclosed apparatus will be described below.

（１）ＰＣの操作説明以外への適用
上記の実施例２では、ＰＣ操作に関する問合せを受け付けるコールセンタでの応対を想定して説明したが、本願の開示する技術はこれに限定されない。例えば、所定の機器やソフトウェアに関する問合せをコールセンタで受け付け、その機器やソフトウェアを操作して操作ログを残しつつ回答できる場合には、本願の開示する技術を同様に適用できる。 (1) Application to other than PC operation description In the above-described second embodiment, the description has been made assuming that the call center accepts an inquiry related to the PC operation. However, the technology disclosed in the present application is not limited to this. For example, when an inquiry about a predetermined device or software is received at a call center and the device or software can be operated and answered while leaving an operation log, the technology disclosed in the present application can be similarly applied.

（２）装置構成等
また、図２に示した各装置の各構成要素は、必ずしも物理的に図示の如く構成されていることを要しない。すなわち、各装置の分散・統合の具体的形態は図示のものに限られず、その全部または一部を、各種の負荷や使用状況などに応じて、任意の単位で機能的または物理的に分散・統合して構成することができる。例えば、例えば、韻律情報抽出部１５０、発話区間抽出部１５５の各機能を集約して統合できる。また、対応関係設定部１４０、韻律情報抽出部１５０、発話区間抽出部１５５、音声認識部１６０、操作時間取得部１７０、区間推定部１８０又は補足語句抽出部１９０の一部又は全部をオペレータ支援装置の外部装置としてネットワーク経由で接続してもよい。また、対応関係設定部１４０、韻律情報抽出部１５０、発話区間抽出部１５５、音声認識部１６０、操作時間取得部１７０、区間推定部１８０又は補足語句抽出部１９０の一部又は全部を別の装置が有する構成を採用する。その上で、別の装置がネットワーク接続されて協働することで、上記のオペレータ支援装置１００の機能を実現するようにしてもよい。また、ＰＣ操作ログ記憶部１１０、事例データ記憶部１２０、対応関係記憶部１２５、音声データ記憶部１３０の一部又は全部を別の装置がそれぞれ有する構成を採用する。その上で、別の装置がネットワーク接続されて協働することで、上記のオペレータ支援装置１００の機能を実現するようにしてもかまわない。 (2) Device Configuration, etc. Each component of each device shown in FIG. 2 does not necessarily have to be physically configured as illustrated. In other words, the specific form of distribution / integration of each device is not limited to that shown in the figure, and all or a part thereof may be functionally or physically distributed or arbitrarily distributed in arbitrary units according to various loads or usage conditions. Can be integrated and configured. For example, for example, the functions of the prosodic information extraction unit 150 and the speech segment extraction unit 155 can be integrated and integrated. In addition, the operator support apparatus may include a part or all of the correspondence setting unit 140, the prosody information extraction unit 150, the speech segment extraction unit 155, the speech recognition unit 160, the operation time acquisition unit 170, the segment estimation unit 180, or the supplemental phrase extraction unit 190. You may connect via a network as an external device. In addition, a part or all of the correspondence setting unit 140, the prosody information extraction unit 150, the speech segment extraction unit 155, the speech recognition unit 160, the operation time acquisition unit 170, the segment estimation unit 180, or the supplemental phrase extraction unit 190 may be replaced with another device. The structure which has is adopted. In addition, the functions of the operator support apparatus 100 described above may be realized by another apparatus connected to the network and cooperating. Further, a configuration is adopted in which another device has a part or all of the PC operation log storage unit 110, the case data storage unit 120, the correspondence relationship storage unit 125, and the voice data storage unit 130, respectively. In addition, the functions of the operator support apparatus 100 may be realized by another apparatus connected to the network and cooperating.

（３）オペレータ支援プログラム
また、上記の実施例で説明したオペレータ支援装置１００の各種の処理（例えば、図１４参照）は、あらかじめ用意されたプログラムをパーソナルコンピュータやワークステーションなどのコンピュータシステムで実行することによって実現することもできる。そこで、以下では、図２０を用いて、上記の実施例で説明したオペレータ支援装置と同様の機能を有するオペレータ支援プログラムを実行するコンピュータの一例を説明する。図２０は、オペレータ支援プログラムを実行するコンピュータを示す図である。 (3) Operator support program The various processes (for example, see FIG. 14) of the operator support apparatus 100 described in the above embodiment are executed by a computer system such as a personal computer or a workstation. Can also be realized. Therefore, in the following, an example of a computer that executes an operator support program having the same function as the operator support apparatus described in the above embodiment will be described with reference to FIG. FIG. 20 is a diagram illustrating a computer that executes an operator support program.

図２０に示すように、オペレータ支援装置としてコンピュータ２００は、通信制御部２１０、ＨＤＤ２２０、ＲＡＭ２３０及びＣＰＵ２４０をバス３００で接続して構成される。 As shown in FIG. 20, the computer 200 as an operator support apparatus is configured by connecting a communication control unit 210, an HDD 220, a RAM 230, and a CPU 240 via a bus 300.

ここで、通信制御部２１０は、各種情報のやり取りに関する通信を制御する。ＨＤＤ２２０は、ＣＰＵ２４０による各種処理の実行に必要な情報を記憶する。ＲＡＭ２３０は、各種情報を一時的に記憶する。ＣＰＵ２４０は、各種演算処理を実行する。 Here, the communication control unit 210 controls communication related to the exchange of various types of information. The HDD 220 stores information necessary for the CPU 240 to execute various processes. The RAM 230 temporarily stores various information. The CPU 240 executes various arithmetic processes.

そして、ＨＤＤ２２０には、図２０に示すように、上記の実施例で説明したオペレータ支援装置１００の各処理部と同様の機能を発揮するオペレータ支援プログラム２２１と、オペレータ支援用データ２２２とがあらかじめ記憶されている。なお、このオペレータ支援プログラム２２１を適宜分散させて、ネットワークを介して通信可能に接続された他のコンピュータの記憶部に記憶させておくこともできる。 As shown in FIG. 20, the HDD 220 stores in advance an operator support program 221 that exhibits the same functions as the processing units of the operator support apparatus 100 described in the above embodiment, and operator support data 222. Has been. Note that the operator support program 221 may be appropriately distributed and stored in a storage unit of another computer that is communicably connected via a network.

そして、ＣＰＵ２４０が、このオペレータ支援プログラム２２１をＨＤＤ２２０から読み出してＲＡＭ２３０に展開することにより、図２０に示すように、オペレータ支援プログラム２２１はオペレータ支援プロセス２３１として機能するようになる。 Then, the CPU 240 reads out the operator support program 221 from the HDD 220 and expands it in the RAM 230, so that the operator support program 221 functions as an operator support process 231 as shown in FIG.

すなわち、オペレータ支援プロセス２３１は、オペレータ支援用データ２２２等をＨＤＤ２２０から読み出して、ＲＡＭ２３０において自身に割り当てられた領域に展開し、この展開したデータ等に基づいて各種処理を実行する。 That is, the operator support process 231 reads the operator support data 222 and the like from the HDD 220, expands the data in the area allocated to itself in the RAM 230, and executes various processes based on the expanded data and the like.

なお、オペレータ支援プロセス２３１は、例えば、図２に示したオペレータ支援装置１００の対応関係設定部１４０、韻律情報抽出部１５０、発話区間抽出部情報１５５、音声認識部１６０及び操作時間取得部１７０において実行される処理に対応する。また、オペレータ支援プロセス２３１は、例えば、図２に示したオペレータ支援装置１００の区間推定部１８０及び補足語句抽出部１９０において実行される処理に対応する。 Note that the operator support process 231 is performed by, for example, the correspondence setting unit 140, the prosody information extraction unit 150, the speech segment extraction unit information 155, the voice recognition unit 160, and the operation time acquisition unit 170 of the operator support apparatus 100 illustrated in FIG. Corresponds to the process to be executed. The operator support process 231 corresponds to, for example, processing executed in the section estimation unit 180 and the supplemental phrase extraction unit 190 of the operator support apparatus 100 illustrated in FIG.

なお、上記したオペレータ支援プログラム２２１については、必ずしも最初からＨＤＤ２２０に記憶させておく必要はない。例えば、コンピュータ２００に挿入されるフレキシブルディスク（ＦＤ）、ＣＤ−ＲＯＭ、ＤＶＤディスク、光磁気ディスク、ＩＣカードなどの「可搬用の物理媒体」に各プログラムを記憶させておく。そして、コンピュータ２００がこれらから各プログラムを読み出して実行するようにしてもよい。 Note that the above-described operator support program 221 is not necessarily stored in the HDD 220 from the beginning. For example, each program is stored in a “portable physical medium” such as a flexible disk (FD), a CD-ROM, a DVD disk, a magneto-optical disk, and an IC card inserted into the computer 200. Then, the computer 200 may read and execute each program from these.

さらには、公衆回線、インターネット、ＬＡＮ、ＷＡＮなどを介してコンピュータ２００に接続される「他のコンピュータ（またはサーバ）」などに各プログラムを記憶させておく。そして、コンピュータ２００がこれらから各プログラムを読み出して実行するようにしてもよい。 Further, each program is stored in “another computer (or server)” connected to the computer 200 via a public line, the Internet, a LAN, a WAN, or the like. Then, the computer 200 may read and execute each program from these.

１オペレータ支援装置
２記憶部
３操作時間取得部
４区間推定部
５補足語句抽出部
１００オペレータ支援装置
１１０ＰＣ操作ログ記憶部
１２０事例データ記憶部
１２５対応関係記憶部
１３０音声データ記憶部
１４０対応関係設定部
１５０韻律情報抽出部
１５５発話区間抽出部
１６０音声認識部
１７０操作時間取得部
１８０区間推定部
１９０補足語句抽出部 DESCRIPTION OF SYMBOLS 1 Operator assistance apparatus 2 Memory | storage part 3 Operation time acquisition part 4 Section estimation part 5 Supplement word extraction part 100 Operator assistance apparatus 110 PC operation log memory | storage part 120 Case data memory | storage part 125 Correspondence relation memory | storage part 130 Voice data memory | storage part 140 Correspondence relation setting Unit 150 prosodic information extraction unit 155 speech segment extraction unit 160 speech recognition unit 170 operation time acquisition unit 180 segment estimation unit 190 supplemental phrase extraction unit

Claims

A correspondence storage unit that stores a procedure explanation word that is a word for explaining a predetermined procedure and an operation part that is operated in relation to the procedure explanation word among the operation parts displayed on the terminal in association with each other; ,
Of the operation parts displayed on the terminal, from the PC operation log storage unit that stores the operation part that has been the operation target from that time for each switching operation time when the operation part is switched to another operation part. An operation time acquisition unit for acquiring an operation time indicating a time point when an operation for switching a predetermined operation part is performed;
When the procedure explanation words related to the operation parts switched from the predetermined operation parts are extracted from the correspondence storage unit, and when the procedure explanation words are first extracted from the voice recording , the appearance time of the procedure explanation words is recorded as voice. When extracting a procedure explanation phrase different from the procedure explanation phrase extracted last time from the voice recording, extracted as the start time of the section, extract the utterance start time after the operation time acquired by the operation time acquisition unit, and the procedure When the explanatory words are extracted for the first time, the voice interval from the start time of the voice interval to the utterance start time point, and when the procedure explanatory words different from the previously extracted procedure explanatory words are extracted, the different Opening an utterance section that appears next to the utterance section at the time of extraction from the utterance section in which the procedure explanation phrase appears and is longer than a predetermined period Voice section up to the time, the interval estimation unit that estimates a speech section on the procedures described phrase extracted from the correspondence relation storage unit,
An operator support apparatus, comprising: a supplemental phrase extracting unit that extracts a supplemental phrase that is a phrase for supplementing the procedure explanation phrase from the speech section estimated by the section estimation unit in the voice record.

On the computer,
For each switching operation time when the operation part is switched to another operation part, from the PC operation log storage unit that stores the operation part that has been an operation target from that time in association with the predetermined operation part displayed on the terminal An operation time acquisition procedure for acquiring an operation time indicating the time point when the operation of switching the operation part of
Correspondence storage unit that associates and stores a procedure explanation word that is a word for explaining a predetermined procedure and an operation part that is operated in relation to the procedure explanation word among the operation parts displayed on the terminal When the procedure explanation word / phrase related to the operation part switched from the predetermined operation part is extracted, and the procedure explanation word / phrase is first extracted from the voice recording , the appearance time of the procedure explanation word / phrase is set as the start point of the voice section. When a procedure explanation phrase that is different from the procedure explanation phrase previously extracted from the voice recording is extracted, the utterance start time after the operation time acquired by the operation time acquisition means is extracted, and the procedure explanation phrase is first In the case where the speech segment is extracted from the start point of the speech segment to the start point of the utterance, the procedure explanation different from the previously extracted procedure explanation phrase In case of extracting the clause, the different procedures described phrase appears from the appearance and speech segment to the next speech period of time of the extracted speech segment up to the start time of a long speech period than the predetermined period, and interval estimation procedure for estimating the speech section on the procedures described phrase extracted from the correspondence relation storage unit,
An operator assistance program for executing a supplemental phrase extraction procedure for extracting a supplemental phrase that is a phrase for supplementing the procedure explanation phrase from a speech section estimated by the section estimation procedure in the speech recording .

Computer
For each switching operation time when the operation part is switched to another operation part, from the PC operation log storage unit that stores the operation part that has been an operation target from that time in association with the predetermined operation part displayed on the terminal An operation time acquisition step of acquiring an operation time indicating the time point when the operation of switching the operation part of
Correspondence storage unit that associates and stores a procedure explanation word that is a word for explaining a predetermined procedure and an operation part that is operated in relation to the procedure explanation word among the operation parts displayed on the terminal When the procedure explanation word / phrase related to the operation part switched from the predetermined operation part is extracted, and the procedure explanation word / phrase is first extracted from the voice recording , the appearance time of the procedure explanation word / phrase is set as the start point of the voice section. When a procedure explanation phrase that is different from the procedure explanation phrase previously extracted from the voice recording is extracted, an utterance start time after the operation time acquired by the operation time acquisition step is extracted, and the procedure explanation phrase is first In the case where the speech section is extracted from the start time of the speech section to the utterance start time, a procedure different from the previously extracted procedure explanation word / phrase In case of extracting the light phrase, the different procedures described phrase appears from the appearance and speech segment to the next speech period of time of the extracted speech segment up to the start time of a long speech period than the predetermined time period , A section estimation step for estimating a voice section related to a procedure explanation word extracted from the correspondence storage unit;
An operator assistance method comprising: performing a supplemental phrase extracting step of extracting a supplemental phrase that is a phrase for supplementing the procedure explanation phrase from the speech section estimated by the section estimation step in the speech recording .