JP2020205485A

JP2020205485A - Telephone system and server device

Info

Publication number: JP2020205485A
Application number: JP2019111133A
Authority: JP
Inventors: 陵二鳥居; Ryoji Torii
Original assignee: Toshiba Corp; Toshiba Infrastructure Systems and Solutions Corp
Current assignee: Toshiba Corp; Toshiba Infrastructure Systems and Solutions Corp
Priority date: 2019-06-14
Filing date: 2019-06-14
Publication date: 2020-12-24

Abstract

To suppress special fraud.SOLUTION: A telephone system includes: a voiceprint database where a pre-registered speaker is associated with his/her voiceprint; a voiceprint authentication section; and a notification section. The voiceprint authentication section performs access to the voiceprint database based on voiceprint information acquired by analyzing voice of a caller of an incoming call, so as to authenticate the caller. The notification section notifies a responder of an incoming call of a message when a caller is not registered.SELECTED DRAWING: Figure 2

Description

本発明の実施形態は、電話システムおよびサーバ装置に関する。 Embodiments of the present invention relate to telephone systems and server devices.

「オレオレ詐欺」等と称して知られる特殊詐欺による被害が、近年、社会問題になっている。この種の詐欺が横行する原因の一つは、電話を受けた人（応答者）が通話相手を確実に特定できないことにある。番号非通知での着信を警告する電話機もあるが、そのような機能を持たない電話機もある。電話機に複雑な設定を施すよりも、むしろシステム側での対応を強化することが望まれる。 Damage caused by special fraud known as "Oreore fraud" has become a social problem in recent years. One of the causes of this type of fraud is that the person receiving the call (the responder) cannot reliably identify the person to call. Some phones warn of incoming calls without number notification, while others do not have such a feature. Rather than making complicated settings on the phone, it is desirable to strengthen the support on the system side.

特開２０１１−１３５３２８号公報Japanese Unexamined Patent Publication No. 2011-135328 特開２０１８−１０７７６９号公報JP-A-2018-107769 特開２００６−３２４７１５号公報Japanese Unexamined Patent Publication No. 2006-324715

声紋は、個人を特定するための特徴量として知られている。声紋を分析する技術も日々発展してきている。この技術を応用して特殊詐欺に対処することも検討されているが、例えばＩｏＴ（Internet of Things）との組み合わせなど、未だ多くの議論の余地がある。 Voiceprints are known as features for identifying individuals. The technology for analyzing voiceprints is also developing day by day. It is also being considered to apply this technology to deal with special fraud, but there is still much debate, such as in combination with the IoT (Internet of Things).

目的は、特殊詐欺を抑止することの可能な電話システムおよびサーバ装置を提供することにある。 The purpose is to provide telephone systems and server devices capable of deterring special fraud.

実施形態によれば、電話システムは、予め登録された話者とその声紋とを対応付けた声紋データベース、および、声紋認証部と通知部とを具備する。声紋認証部は、着信呼の通話相手の音声を解析して取得された声紋情報に基づいて声紋データベースにアクセスし、通話相手を認証する。通知部は、通話相手の登録が無い場合に、着信呼の応答者にメッセージを通知する。 According to the embodiment, the telephone system includes a voiceprint database in which a pre-registered speaker is associated with the voiceprint, and a voiceprint authentication unit and a notification unit. The voiceprint authentication unit accesses the voiceprint database based on the voiceprint information obtained by analyzing the voice of the callee of the incoming call, and authenticates the callee. The notification unit notifies the respondent of the incoming call of the message when the other party is not registered.

図１は、第１の実施形態に係わる電話システムの一例を示す図である。FIG. 1 is a diagram showing an example of a telephone system according to the first embodiment. 図２は、図１に示されるサーバ装置２００の一例を示す機能ブロック図である。FIG. 2 is a functional block diagram showing an example of the server device 200 shown in FIG. 図３は、第１の実施形態に係わる処理手順の一例を示すシーケンス図である。FIG. 3 is a sequence diagram showing an example of the processing procedure according to the first embodiment. 図４は、第２の実施形態に係わる電話システムの一例を示す図である。FIG. 4 is a diagram showing an example of a telephone system according to the second embodiment. 図５は、図４に示されるサーバ装置２００の一例を示す機能ブロック図である。FIG. 5 is a functional block diagram showing an example of the server device 200 shown in FIG. 図６は、第２の実施形態に係わる処理手順の一例を示すシーケンス図である。FIG. 6 is a sequence diagram showing an example of the processing procedure according to the second embodiment. 図７は、第２の実施形態の変形例に係わる電話システムの一例を示す図である。FIG. 7 is a diagram showing an example of a telephone system according to a modified example of the second embodiment. 図８は、第３の実施形態に係わるサーバ装置２００の一例を示す機能ブロック図である。FIG. 8 is a functional block diagram showing an example of the server device 200 according to the third embodiment. 図９は、第３の実施形態に係わる処理手順の一例を示すシーケンス図である。FIG. 9 is a sequence diagram showing an example of the processing procedure according to the third embodiment.

声紋は、私たちの声が持つ特徴の１つである。声紋認証は、いわゆる生体認証と称して知られる技術の一つで、例えばＡＩ（Artificial Intelligence）スピーカ等のデバイスを声で操作するときや、デバイスのロック解除などに応用される。声紋認証においては、音響的な特徴と言語的な特徴とを分けて取り扱うことが多い。音響的な特徴は、認証する人の声がどのような周波数特性を持つのかを表し、言語的な特徴は、音素の並び方に関する制約を表わす。
電話を利用した特殊詐欺は、電話に応答した人が、その通話相手の属性（身内かそうでないか、本当に銀行員なのか、など）を確かめられないことが一つの原因となっている。以下では、この種の詐欺への抑止力となり得る技術について開示する。 The voiceprint is one of the characteristics of our voice. Voiceprint authentication is one of the so-called biometric authentication technologies, and is applied to, for example, operating a device such as an AI (Artificial Intelligence) speaker by voice or unlocking the device. In voiceprint authentication, acoustic features and linguistic features are often treated separately. The acoustic characteristics represent the frequency characteristics of the voice of the person who authenticates, and the linguistic characteristics represent the restrictions on the arrangement of phonemes.
One of the causes of special fraud using a telephone is that the person who answers the telephone cannot confirm the attributes of the other party (whether they are relatives or not, whether they are really bank employees, etc.). The following discloses technologies that can be a deterrent to this type of fraud.

［第１の実施形態］
図１は、第１の実施形態に係わる電話システムの一例を示す図である。電話システムは、サーバ装置２００を中核として形成される。サーバ装置２００は、（ＰＳＴＮ：Public Switched Telephone Networks）などの電話網２と、ＷＡＮ（Wide Area Network）５との双方に接続される。ＷＡＮ５は、例えばＩＰ（Internet Protocol）網などであり、データベース１００を備える。データベース１００は、声紋データベース１００ａを記憶する。声紋データベース１００ａは、予め登録された話者と、その声紋とを対応付けたデータベースである。
第１の実施形態では、特殊詐欺の関係者とその声紋とを対応付けて声紋データベース１００ａに登録することを考える。特殊詐欺の関係者とは、例えば「オレオレ詐欺」の犯人として検挙されたことがある者、あるいは、容疑者として当局からマークされている者である。 [First Embodiment]
FIG. 1 is a diagram showing an example of a telephone system according to the first embodiment. The telephone system is formed with the server device 200 as the core. The server device 200 is connected to both a telephone network 2 such as (PSTN: Public Switched Telephone Networks) and a WAN (Wide Area Network) 5. WAN 5 is, for example, an IP (Internet Protocol) network or the like, and includes a database 100. The database 100 stores the voiceprint database 100a. The voiceprint database 100a is a database in which a pre-registered speaker is associated with the voiceprint.
In the first embodiment, it is considered that a person involved in the special fraud is associated with the voiceprint and registered in the voiceprint database 100a. A person involved in special fraud is, for example, a person who has been arrested as a criminal of "Oreore fraud" or a person who has been marked by the authorities as a suspect.

加入者宅内に置かれる電話機４は、例えば宅内に置かれた接続装置３を介して、電話網２とＷＡＮ５とに接続される。電話網２に接続された電話機１から発生した呼は、接続装置３を経由して電話機４に着信する。電話機４の応答者２０が受話器をあげると、電話機１の通話相手１０との間に呼が設定される。ここで、電話機１，４は固定電話機、移動電話機のいずれであってもよい。 The telephone 4 placed in the subscriber's home is connected to the telephone network 2 and WAN 5 via, for example, a connection device 3 placed in the home. The call generated from the telephone 1 connected to the telephone network 2 arrives at the telephone 4 via the connecting device 3. When the respondent 20 of the telephone 4 picks up the handset, a call is set up with the other party 10 of the telephone 1. Here, the telephones 1 and 4 may be either a fixed telephone or a mobile telephone.

図２は、図１に示されるサーバ装置２００の一例を示す機能ブロック図である。サーバ装置２００は、ＣＰＵ（Central Processing Unit）やＭＰＵ（Micro Processing Unit）等のプロセッサ２５０と、ＲＯＭ（Read Only Memory）２２０、ＲＡＭ（Random Access Memory）２３０、記憶部２４０、および通信部２７０を備えるコンピュータである。 FIG. 2 is a functional block diagram showing an example of the server device 200 shown in FIG. The server device 200 includes a processor 250 such as a CPU (Central Processing Unit) and an MPU (Micro Processing Unit), a ROM (Read Only Memory) 220, a RAM (Random Access Memory) 230, a storage unit 240, and a communication unit 270. It is a computer.

ＲＯＭ２２０は、ＢＩＯＳ（Basic Input Output System）やＵＥＦＩ（Unified Extensible Firmware Interface）などの基本プログラム、および各種の設定データ等を記憶する。ＲＡＭ２３０は、記憶部２４０からロードされたプログラムやデータを一時的に記憶する。 The ROM 220 stores basic programs such as BIOS (Basic Input Output System) and UEFI (Unified Extensible Firmware Interface), and various setting data. The RAM 230 temporarily stores programs and data loaded from the storage unit 240.

通信部２７０は、電話網２およびＷＡＮ５に接続され、それぞれの網との通信を制御する。なお、サーバ装置２００で実行される各種プログラムを、例えば通信部２７０を介してクラウドサーバ（図示せず）からダウンロードし、記憶部２４０にインストールすることもできる。通信部２７０を介してクラウドサーバから最新のプログラムをダウンロードし、インストール済みのプログラムをアップデートすることもできる。
記憶部２４０は、例えば、ＨＤＤ（Hard Disk Drive）やＳＳＤ（Solid State Drive）等の記録媒体であり、プロセッサ２５０により実行されるプログラム２４０ａを記憶する。プロセッサ２５０は、ＯＳ（Operating System）および各種のプログラムを実行する。 The communication unit 270 is connected to the telephone network 2 and WAN5 and controls communication with each network. It should be noted that various programs executed by the server device 200 can be downloaded from a cloud server (not shown) via, for example, the communication unit 270 and installed in the storage unit 240. The latest program can be downloaded from the cloud server via the communication unit 270, and the installed program can be updated.
The storage unit 240 is, for example, a recording medium such as an HDD (Hard Disk Drive) or an SSD (Solid State Drive), and stores a program 240a executed by the processor 250. The processor 250 executes an OS (Operating System) and various programs.

ところで、プロセッサ２５０は、実施形態に係る処理機能として声紋認証部２５０ａ、および通知部２５０ｂを備える。声紋認証部２５０ａ、通知部２５０ｂは、記憶部２４０に記憶されたプログラム２４０ａがＲＡＭ２３０にロードされ、当該プログラムの進行に伴ってプロセッサ２５０が演算処理を実行することで生成されるプロセスとして、理解され得る。 By the way, the processor 250 includes a voiceprint authentication unit 250a and a notification unit 250b as processing functions according to the embodiment. The voiceprint authentication unit 250a and the notification unit 250b are understood as a process generated by loading the program 240a stored in the storage unit 240 into the RAM 230 and executing arithmetic processing by the processor 250 as the program progresses. obtain.

声紋認証部２５０ａは、着信呼の通話相手の音声を解析して取得された声紋情報に基づいて、声紋データベース１００ａにアクセスする。声紋データベース１００ａに声紋情報が登録されていれば、声紋認証部２５０ａは、当該声紋情報に対応する話者を通話相手として特定する。これにより通話相手が認証される。 The voiceprint authentication unit 250a accesses the voiceprint database 100a based on the voiceprint information obtained by analyzing the voice of the other party of the incoming call. If the voiceprint information is registered in the voiceprint database 100a, the voiceprint authentication unit 250a identifies the speaker corresponding to the voiceprint information as a call partner. This authenticates the other party.

通知部２５０ｂは、声紋データベース１００ａに通話相手の登録が無い場合に、着信呼の応答者２０にメッセージを通知する。次に、上記における作用を説明する。 The notification unit 250b notifies the responder 20 of the incoming call of a message when the callee is not registered in the voiceprint database 100a. Next, the operation in the above will be described.

図３は、第１の実施形態に係わる処理手順の一例を示すシーケンス図である。図３において、通話相手１０が発呼すると、電話網２を介して電話機４に着呼し、応答者２０が呼び出される（ステップＳ１）。応答者２０が応答すると、通話状態となる（ステップＳ２）。 FIG. 3 is a sequence diagram showing an example of the processing procedure according to the first embodiment. In FIG. 3, when the other party 10 makes a call, the telephone 4 is called via the telephone network 2 and the respondent 20 is called (step S1). When the responder 20 answers, the call is entered (step S2).

通話が確立されると、電話網２からサーバ装置２００（声紋認証部２５０ａ）に通話音声（音声情報）が提供される（ステップＳ３）。ここで、応答者２０の電話機が移動電話機であれば、移動電話機の転送機能やアプリ機能により、サーバ装置２００に音声情報を提供することが可能である。 When the call is established, the telephone network 2 provides the call voice (voice information) to the server device 200 (voiceprint authentication unit 250a) (step S3). Here, if the telephone of the respondent 20 is a mobile telephone, it is possible to provide voice information to the server device 200 by the transfer function or the application function of the mobile telephone.

サーバ装置２００の声紋認証部２５０ａは、音声情報を取得すると声紋認証を行い（ステップＳ４）、声紋データベース１００ａを参照して通話相手１０を特定する。認証の結果は通知部２５０ｂに渡される（ステップＳ５）。 When the voiceprint authentication unit 250a of the server device 200 acquires the voice information, the voiceprint authentication unit 250a performs voiceprint authentication (step S4) and identifies the callee 10 with reference to the voiceprint database 100a. The result of the authentication is passed to the notification unit 250b (step S5).

その後、通話が終了すると呼が切断され、電話網２からの通話終了情報（ビジートーン）がサーバ装置２００に達する（ステップＳ７）。そうすると声紋認証部２５０ａは、声紋認証処理を終了し、通話終了情報を通知部２５０ｂに渡す（ステップＳ８）。 After that, when the call ends, the call is disconnected, and the call end information (busy tone) from the telephone network 2 reaches the server device 200 (step S7). Then, the voiceprint authentication unit 250a ends the voiceprint authentication process and passes the call end information to the notification unit 250b (step S8).

通知部２５０ｂは、声紋認証の認証結果に基づいて、当該呼について詐欺の疑いがあるか否かを判定する（ステップＳ９）。すなわち、声紋認証により、詐欺の前科を有する者が特定されたならば、通知部２５０ｂは詐欺の疑いありを判定する（ＹＥＳ）。そうすると通知部２５０ｂは、直ちに自動発信を行う（ステップＳ１０）。つまり通知部２５０ｂは、着信呼の終了後に、電話機４との間に電話網２を経由する新たな呼を設定して、応答者２０にメッセージを通知する（ステップＳ１１）。ここで、電話機４が移動電話機であれば、ＳＭＳ（ショートメールサービス）やアプリ連携機能により、メッセージを自動通知することも可能である。 The notification unit 250b determines whether or not there is a suspicion of fraud in the call based on the authentication result of the voiceprint authentication (step S9). That is, if a person having a criminal record of fraud is identified by voiceprint authentication, the notification unit 250b determines the suspicion of fraud (YES). Then, the notification unit 250b immediately makes an automatic transmission (step S10). That is, after the incoming call is completed, the notification unit 250b sets a new call with the telephone 4 via the telephone network 2 and notifies the respondent 20 of the message (step S11). Here, if the telephone 4 is a mobile telephone, it is possible to automatically notify a message by SMS (Short Mail Service) or an application linkage function.

メッセージは、例えば「さきほどの電話は詐欺の疑いがあります。ご注意ください」等の音声メッセージや、「詐欺のおそれがありますので、０３−ＸＸＸＸ−ＸＸＸＸにお電話ください」などのガイダンスなどの形態で通知される。 The message may be in the form of a voice message such as "The previous call is suspected of fraud. Please be careful" or guidance such as "There is a risk of fraud, so please call 03-XXXX-XXXX". You will be notified.

以上説明したように第１の実施形態では、特殊詐欺の犯罪に係わりを持つ者の声紋を声紋データベース１００ａに予め登録する。呼が発生すると、通話相手１０の音声を声紋認証部２５０ａに送り、声紋認証を行う。認証により、通話相手１０が声紋データベース１００ａに登録されていることが判明すると、通話終了後に応答者２０にメッセージを通知するようにした。 As described above, in the first embodiment, the voiceprints of persons involved in the crime of special fraud are registered in advance in the voiceprint database 100a. When a call is generated, the voice of the other party 10 is sent to the voiceprint authentication unit 250a to perform voiceprint authentication. When it is found by the authentication that the call partner 10 is registered in the voiceprint database 100a, the respondent 20 is notified of the message after the call is completed.

すなわち第１の実施形態では、声紋認証により通話相手を特定し、応答者２０に注意を促すようにした。これにより、応答者２０に警戒を促し、「オレオレ詐欺」などの特殊詐欺を防止できる。また、犯人に気づかれることなく声紋認証を実施することができる。これらのことから、第１の実施形態によれば、特殊詐欺を抑止することの可能な電話システムおよびサーバ装置を提供することが可能となる。 That is, in the first embodiment, the call partner is specified by voiceprint authentication, and the respondent 20 is alerted. As a result, the respondent 20 can be alerted and special fraud such as "Oreore fraud" can be prevented. In addition, voiceprint authentication can be performed without being noticed by the criminal. From these facts, according to the first embodiment, it is possible to provide a telephone system and a server device capable of deterring special fraud.

［第２の実施形態］
第１の実施形態では、特殊詐欺関係者の声紋を声紋データベースに登録し、声紋認証でマッチすると、そのことを通話後に応答者に通知するようにした。第２の実施形態では、応答者の知人の声紋を声紋データベースに登録する技術について説明する。知人とは、要するに応答者の知っている人あって、いわゆる身内や家族、親族、友人等を含む概念として捉えられる。 [Second Embodiment]
In the first embodiment, the voiceprints of persons involved in special fraud are registered in the voiceprint database, and when a match is made by voiceprint authentication, the respondent is notified after the call. In the second embodiment, the technique of registering the voiceprint of the respondent's acquaintance in the voiceprint database will be described. An acquaintance is, in short, a person known to the respondent, and can be regarded as a concept that includes so-called relatives, family members, relatives, friends, and the like.

図４は、第２の実施形態に係わる電話システムの一例を示す図である。以下では、第１の実施形態の各図と共通する部分には同じ符号を付して示し、第１の実施形態とは異なる部分についてのみ説明する。
図４において、ＡＩスピーカ６が接続装置３に接続される。また、声紋データベース（符号を２４０ｂとする）はサーバ装置２００に記憶される。 FIG. 4 is a diagram showing an example of a telephone system according to the second embodiment. In the following, the parts common to each figure of the first embodiment are designated by the same reference numerals, and only the parts different from those of the first embodiment will be described.
In FIG. 4, the AI speaker 6 is connected to the connecting device 3. Further, the voiceprint database (reference numeral 240b) is stored in the server device 200.

図５は、図４に示されるサーバ装置２００の一例を示す機能ブロック図である。声紋データベース２４０ｂは、記憶部２４０に記憶される。この声紋データベース２４０ｂは、応答者２０の知人とその声紋とを予め対応付けたデータベースである。 FIG. 5 is a functional block diagram showing an example of the server device 200 shown in FIG. The voiceprint database 240b is stored in the storage unit 240. The voiceprint database 240b is a database in which the acquaintances of the respondent 20 and their voiceprints are associated in advance.

プロセッサ２５０は、実施形態に係る処理機能として登録部２５０ｃを備える。登録部２５０ｃは、ＡＩスピーカ６で採取された知人３０（図４）の音声を解析して声紋を抽出し、得られた声紋を知人３０の属性に対応付けて、声紋データベース２４０ｂに登録する。例えば、ＡＩスピーカ６が毎日、同じ人物の声により起動される場合、その人物は応答者２０の家族であるとして声紋データベース２４０ｂに登録することができる。あるいは、ＡＩスピーカ６を使って自分の声を登録しておくことを、予め家族に頼んでおいてもよい。 The processor 250 includes a registration unit 250c as a processing function according to the embodiment. The registration unit 250c analyzes the voice of the acquaintance 30 (FIG. 4) collected by the AI speaker 6, extracts the voiceprint, associates the obtained voiceprint with the attribute of the acquaintance 30, and registers the obtained voiceprint in the voiceprint database 240b. For example, if the AI speaker 6 is activated by the voice of the same person every day, that person can be registered in the voiceprint database 240b as a family member of the respondent 20. Alternatively, you may ask your family in advance to register your voice using the AI speaker 6.

図６は、第２の実施形態に係わる処理手順の一例を示すシーケンス図である。図６において、呼切断ののち通話終了情報がサーバ装置２００に達する（ステップＳ７）と、声紋認証部２５０ａは、声紋認証処理を終了し、通話終了情報を通知部２５０ｂに渡す（ステップＳ８）。 FIG. 6 is a sequence diagram showing an example of the processing procedure according to the second embodiment. In FIG. 6, when the call end information reaches the server device 200 after the call is disconnected (step S7), the voiceprint authentication unit 250a ends the voiceprint authentication process and passes the call end information to the notification unit 250b (step S8).

通知部２５０ｂは、声紋認証の認証結果に基づいて、通話相手１０が認証されたか否かを判定する（ステップＳ１２）。通話相手１０の声紋情報が声紋データベース２４０ｂに存在すれば、通知部２５０ｂは認証ＯＫとする。声紋情報が声紋データベース２４０ｂに無ければ、通知部２５０ｂは認証ＮＧ（No Good）として、登録が無いことを応答者２０に知らせるために自動発信処理を行う（ステップＳ１０）。ステップＳ１０では、通知部２５０ｂは例えば「データベースに登録がありません」などのメッセージを応答者２０に通知する。 The notification unit 250b determines whether or not the other party 10 has been authenticated based on the authentication result of the voiceprint authentication (step S12). If the voiceprint information of the other party 10 exists in the voiceprint database 240b, the notification unit 250b authenticates. If the voiceprint information is not in the voiceprint database 240b, the notification unit 250b performs an automatic transmission process as authentication NG (No Good) in order to notify the respondent 20 that there is no registration (step S10). In step S10, the notification unit 250b notifies the respondent 20 of a message such as "there is no registration in the database".

応答者２０は、通話相手１０が身内を名乗っているにも拘わらず、データベースに登録がないことを知ることができる。従って応答者２０は、この事実から、この電話が特殊詐欺である可能性を感じ取ることができる。 The respondent 20 can know that the other party 10 is not registered in the database even though he / she claims to be a relative. Therefore, the respondent 20 can sense from this fact that the telephone may be a special fraud.

以上のように第２の実施形態では、電話機を使用する人の知人の声紋を声紋データベース１００ａに予め登録する。呼が発生すると、通話相手１０の音声を声紋認証部２５０ａに送り、声紋認証を行う。認証により、通話相手１０が声紋データベース１００ａに登録されていないことが判明すると、通話終了後に応答者２０にメッセージを通知するようにした。 As described above, in the second embodiment, the voiceprints of acquaintances of the person who uses the telephone are registered in advance in the voiceprint database 100a. When a call is generated, the voice of the other party 10 is sent to the voiceprint authentication unit 250a to perform voiceprint authentication. When it is found by the authentication that the call partner 10 is not registered in the voiceprint database 100a, the respondent 20 is notified of the message after the call ends.

すなわち第２の実施形態では、声紋認証により通話相手を特定できなかった場合に、応答者２０に注意を促すようにした。これによっても、応答者２０に警戒を促し、「オレオレ詐欺」などの特殊詐欺を防止できる。また、犯人に気づかれることなく声紋認証を実施することができる。これらのことから、第２の実施形態によっても、特殊詐欺を抑止することが可能となる。 That is, in the second embodiment, when the callee cannot be identified by voiceprint authentication, the respondent 20 is alerted. This also alerts the respondent 20 and prevents special fraud such as "Oreore fraud". In addition, voiceprint authentication can be performed without being noticed by the criminal. From these facts, it is possible to deter special fraud by the second embodiment as well.

［第２の実施形態の変形例］
図７は、第２の実施形態の変形例に係わる電話システムの一例を示す図である。ＡＩスピーカ６に代えて、知人３０の所持するスマートフォン７を利用して声紋情報をサーバ装置２００に登録することもできる。つまり、声紋登録用のアプリケーションをスマートフォン７に予めインストールし、所持者の音声を採取する。得られた音声情報は、移動通信網ＭＮを経由してサーバ装置２００に転送され、登録部２５０ｃに渡される。登録部２５０ｃは、知人３０の音声を解析して声紋を抽出し、得られた声紋を知人３０の属性に対応付けて、声紋データベース２４０ｂに登録する。 [Modified example of the second embodiment]
FIG. 7 is a diagram showing an example of a telephone system according to a modified example of the second embodiment. Instead of the AI speaker 6, the voiceprint information can be registered in the server device 200 by using the smartphone 7 possessed by the acquaintance 30. That is, the application for voiceprint registration is installed in advance on the smartphone 7, and the voice of the owner is collected. The obtained voice information is transferred to the server device 200 via the mobile communication network MN and passed to the registration unit 250c. The registration unit 250c analyzes the voice of the acquaintance 30 to extract the voiceprint, associates the obtained voiceprint with the attribute of the acquaintance 30, and registers the obtained voiceprint in the voiceprint database 240b.

［第３の実施形態］
第２の実施形態では、応答者の知人の声紋を声紋データベースに登録し、声紋認証でマッチしない場合に、そのことを通話後に応答者に通知するようにした。第３の実施形態では、文言判定を取り入れる技術について説明する。 [Third Embodiment]
In the second embodiment, the voiceprints of the respondent's acquaintances are registered in the voiceprint database, and when the voiceprint authentication does not match, the respondent is notified after the call. In the third embodiment, a technique for incorporating wording determination will be described.

図８は、図８は、第３の実施形態に係わるサーバ装置２００の一例を示す機能ブロック図である。プロセッサ２５０は、実施形態に係る処理機能として声紋認証部２５０ａ、通知部２５０ｂに加えて、文言特定部２５０ｄを備える。文言特定部２５０ｄは、通話相手１０と応答者２０との音声通話に含まれる文言を特定する。すなわち文言特定部２５０ｄは、通話状態における通話音声（音声情報）をテキストに変換し、通話内容を特定する。その際、各種通話内容のキーワード情報を保持する専用のデータベースを参照することにより、通話内容を複数のクラスに分類してもよい。クラスとしては「通常会話」、「学校連絡網」、「セールス」、「特殊詐欺」などが考えられる。通話音声情報をテキストに変換する技術としては、クラウドサーバを用いた音声−テキスト変換機能など、サービスとして既に提供されているものもある。 FIG. 8 is a functional block diagram showing an example of the server device 200 according to the third embodiment. The processor 250 includes a wording identification unit 250d in addition to the voiceprint authentication unit 250a and the notification unit 250b as processing functions according to the embodiment. The wording specifying unit 250d specifies the wording included in the voice call between the other party 10 and the respondent 20. That is, the wording specifying unit 250d converts the call voice (voice information) in the call state into text and specifies the call content. At that time, the call contents may be classified into a plurality of classes by referring to a dedicated database that holds keyword information of various call contents. Classes include "normal conversation," "school contact network," "sales," and "special fraud." As a technology for converting call voice information into text, there are some that are already provided as a service such as a voice-text conversion function using a cloud server.

通知部２５０ｂは、文言特定部２５０ｄにより特定された文言に基づいて、音声通話の内容を分類する。分類に当たってはクラスごとのスコアが算出される。そして、通話内容を特殊詐欺に分類するスコアが既定値以上である場合に、通知部２５０ｂは、応答者２０に、特殊詐欺の可能性があることを示唆するメッセージを通知する。 The notification unit 250b classifies the content of the voice call based on the wording specified by the wording specifying unit 250d. For classification, the score for each class is calculated. Then, when the score for classifying the call content into the special fraud is equal to or higher than the default value, the notification unit 250b notifies the responder 20 of a message suggesting that there is a possibility of the special fraud.

図９は、第３の実施形態に係わる処理手順の一例を示すシーケンス図である。図９において、声紋認証部２５０ａは声紋認証を行う（ステップＳ４）。その結果、通話相手１０が注意すべき人物であることが分かれば、声紋認証部２５０ａは、声紋認証の結果と音声情報とを文言特定部２５０ｄに通知する（ステップＳ１３）。 FIG. 9 is a sequence diagram showing an example of the processing procedure according to the third embodiment. In FIG. 9, the voiceprint authentication unit 250a performs voiceprint authentication (step S4). As a result, if it is found that the other party 10 is a person to be noted, the voiceprint authentication unit 250a notifies the wording identification unit 250d of the result of the voiceprint authentication and the voice information (step S13).

文言特定部２５０ｄは、受け取った通話音声情報をテキストに変換し、予め登録されたキーワード情報等に基づいて通話内容を特定する（ステップＳ１４）。特定結果は通知部２５０ｂに渡される（ステップＳ１５）。声紋認証により特定された人物と、特定された通話内容との組み合わせにより、対象通話が「オレオレ詐欺」であるかどうかを判断することができる。 The wording specifying unit 250d converts the received call voice information into text and specifies the call content based on the keyword information or the like registered in advance (step S14). The specific result is passed to the notification unit 250b (step S15). It is possible to determine whether or not the target call is "Oreore fraud" by the combination of the person identified by the voiceprint authentication and the identified call content.

通話が終了すると、電話網２から通話終了情報がサーバ装置２００に送られ、声紋認証部２５０ａ、文言特定部２５０ｄ、通知部２５０ｂに達する（ステップＳ７，Ｓ８，Ｓ１６）。そうすると通知部２５０ｂは、通話内容を分類し（ステップＳ１７）、特殊詐欺への分類スコアが既定の閾値を超えているか否かを判定する（ステップＳ１８）。スコアが閾値以上であれば、通知部２５０ｂは、直ちに自動発信を行って（ステップＳ１０）、応答者２０に詐欺の疑いを示唆するメッセージを通知する（ステップＳ１９）。 When the call ends, the call end information is sent from the telephone network 2 to the server device 200, and reaches the voiceprint authentication unit 250a, the wording identification unit 250d, and the notification unit 250b (steps S7, S8, S16). Then, the notification unit 250b classifies the contents of the call (step S17) and determines whether or not the classification score for special fraud exceeds a predetermined threshold value (step S18). If the score is equal to or higher than the threshold value, the notification unit 250b immediately makes an automatic transmission (step S10) and notifies the respondent 20 of a message suggesting suspicion of fraud (step S19).

以上のように第３の実施形態では、声紋認証により通話相手１０を認証するとともに、文言特定部２５０ｄにより、テキスト化された音声通話の内容を分類する。そして、特殊詐欺への分類スコアが高いと判定された呼については、通話終了後に、応答者２０にメッセージを通知するようにした。 As described above, in the third embodiment, the call partner 10 is authenticated by voiceprint authentication, and the content of the voice call converted into text is classified by the wording specifying unit 250d. Then, for a call determined to have a high classification score for special fraud, a message is notified to the respondent 20 after the call ends.

声紋認証と文言判定とを組み合わせることで、例えば、金銭を要求する内容の電話であっても身内であれば、特殊詐欺と判定しないようにできる。すなわち判定の精度を向上させることができる。従って、第３の実施形態によっても、特殊詐欺を抑止することが可能となる。 By combining voiceprint authentication and wording judgment, for example, even a phone call that requires money can be prevented from being judged as a special fraud if it is a relative. That is, the accuracy of the determination can be improved. Therefore, it is possible to deter special fraud by the third embodiment as well.

なお、この発明は上記実施形態に限定されるものではない。例えば上記各実施形態では、応答者２０にメッセージを発信して応答者２０に注意を促すようにした。これによれば犯人に気づかれずに特殊詐欺を抑止することができる。これとは逆に、メッセージを通話相手１０に通知することで、より積極的に特殊詐欺を抑止することもできる。このような場合、例えば「この通話は音声により認証されています」等のメッセージを通知することが考えられる。
また、証拠として音声通話記録や、通話内容をテキスト化したデータをクラウド上に蓄積してもよい。これらの情報は、犯人検挙の際の証拠として利用できる可能性がある。 The present invention is not limited to the above embodiment. For example, in each of the above embodiments, a message is sent to the respondent 20 to call attention to the respondent 20. According to this, special fraud can be deterred without being noticed by the criminal. On the contrary, by notifying the other party 10 of the message, it is possible to more actively deter special fraud. In such a case, it is conceivable to notify a message such as "This call has been authenticated by voice".
In addition, voice call records and textualized data of call contents may be stored in the cloud as evidence. This information may be used as evidence in the arrest of the criminal.

さらに、声紋認証部の機能、通知部の機能、登録部の機能、および文言特定部の機能をサーバ装置２００に集約することなく、複数のサーバに分散してもよい。例えば、声紋認証部の機能を有する声紋認証システム、通知部の機能を有する自動通知システム、登録部の機能を有する登録システム、および文言特定部の機能を有する文言特定システムをＷＡＮ５に接続し、各システム間で情報やデータを授受することによっても、第１〜第３の実施形態と同様の作用効果を奏することができる。 Further, the functions of the voiceprint authentication unit, the notification unit, the registration unit, and the wording identification unit may be distributed to a plurality of servers without being integrated in the server device 200. For example, a voiceprint authentication system having a voiceprint authentication unit function, an automatic notification system having a notification unit function, a registration system having a registration unit function, and a wording identification system having a wording identification unit function are connected to WAN5, and each of them is connected. By exchanging information and data between the systems, it is possible to obtain the same effects as those of the first to third embodiments.

本発明のいくつかの実施形態を説明したが、これらの実施形態は例として提示するものであり、発明の範囲を限定することは意図していない。これらの新規な実施形態は、その他の様々な形態で実施されることが可能であり、発明の要旨を逸脱しない範囲で、種々の省略、置き換え、変更を行うことができる。これらの実施形態やその変形は、発明の範囲や要旨に含まれるとともに、特許請求の範囲に記載された発明とその均等の範囲に含まれる。 Although some embodiments of the present invention have been described, these embodiments are presented as examples and are not intended to limit the scope of the invention. These novel embodiments can be implemented in various other embodiments, and various omissions, replacements, and changes can be made without departing from the gist of the invention. These embodiments and modifications thereof are included in the scope and gist of the invention, and are also included in the scope of the invention described in the claims and the equivalent scope thereof.

１…電話機、２…電話網、３…接続装置、４…電話機、６…ＡＩスピーカ、７…スマートフォン、１０…通話相手、２０…応答者、３０…知人、１００…データベース、１００ａ…声紋データベース、２００…サーバ装置、２２０…ＲＯＭ、２３０…ＲＡＭ、２４０…記憶部、２４０ａ…プログラム、２４０ｂ…声紋データベース、２５０…プロセッサ、２５０ａ…声紋認証部、２５０ｂ…通知部、２５０ｃ…登録部、２５０ｄ…文言特定部、２７０…通信部。 1 ... Telephone, 2 ... Telephone network, 3 ... Connection device, 4 ... Telephone, 6 ... AI speaker, 7 ... Smartphone, 10 ... Call partner, 20 ... Respondent, 30 ... Acquaintance, 100 ... Database, 100a ... Voice print database, 200 ... server device, 220 ... ROM, 230 ... RAM, 240 ... storage unit, 240a ... program, 240b ... voiceprint database, 250 ... processor, 250a ... voiceprint authentication unit, 250b ... notification unit, 250c ... registration unit, 250d ... wording Specific part, 270 ... Communication part.

Claims

A voiceprint database that associates pre-registered speakers with their voiceprints,
A voiceprint authentication unit that accesses the voiceprint database based on the voiceprint information obtained by analyzing the voice of the other party of the incoming call and authenticates the other party.
A telephone system including a notification unit that notifies a respondent of an incoming call of a message when the other party is not registered.

The voiceprint database is a database in which a person involved in a special fraud is associated with the voiceprint.
The telephone system according to claim 1, wherein the notification unit notifies the responder that the authenticated call partner is a party involved in the special fraud.

The telephone system according to claim 1, wherein the voiceprint database is a database in which an acquaintance of the responder is associated with the voiceprint.

The telephone system according to claim 3, further comprising a registration unit that analyzes the voice of the acquaintance, extracts the voiceprint, associates the voiceprint with the attribute of the acquaintance, and registers the voiceprint in the voiceprint database.

A voiceprint database that associates pre-registered speakers with their voiceprints,
A voiceprint authentication unit that accesses the voiceprint database based on the voiceprint information obtained by analyzing the voice of the other party of the incoming call and authenticates the other party.
A wording specifying unit that specifies the wording included in the voice call between the other party and the respondent of the incoming call,
It is provided with a notification unit that classifies the content of the voice call based on the specified wording and notifies the responder of a message when the score for classifying the content as a special fraud is equal to or higher than a default value. Telephone system.

The voiceprint database is a database in which a person involved in the special fraud is associated with the voiceprint.
The telephone system according to claim 5, wherein the notification unit notifies the responder that the authenticated call partner is a party involved in the special fraud.

The voiceprint database is a database in which a person involved in the special fraud is associated with the voiceprint.
The telephone system according to claim 5, wherein the notification unit notifies the authenticated call partner of the message.

The telephone system according to any one of claims 1 to 7, wherein the notification unit sets a new call after the end of the incoming call and notifies the message.

In a server device that can connect to the telephone network
Based on the voiceprint information obtained by analyzing the voice of the other party of the incoming call generated in the telephone network, the voiceprint database that associates the pre-registered speaker with the voiceprint is accessed to access the voiceprint database of the other party. Voiceprint authentication department that certifies
A server device including a notification unit that notifies a respondent of an incoming call to a message via the telephone network when the other party is not registered.

In a server device that can connect to the telephone network
Based on the voiceprint information obtained by analyzing the voice of the other party of the incoming call generated in the telephone network, the voiceprint database that associates the pre-registered speaker with the voiceprint is accessed to access the voiceprint database of the other party. Voiceprint authentication department that certifies
A wording specifying unit that specifies the wording included in the voice call between the other party and the respondent of the incoming call,
The content of the voice call is classified based on the specified wording, and when the score for classifying the content as a special fraud is equal to or higher than the default value, the respondent is notified of a message via the telephone network. A server device including a notification unit.