JPWO2004039044A1

JPWO2004039044A1 - Communication terminal, voiceprint information search server, personal information display system, personal information display method in communication terminal, personal information display program

Info

Publication number: JPWO2004039044A1
Application number: JP2004546371A
Authority: JP
Inventors: 悟植山
Original assignee: Fujitsu Ltd; Fujitsu Peripherals Ltd
Current assignee: Fujitsu Ltd; Fujitsu Peripherals Ltd
Priority date: 2002-10-23
Filing date: 2002-10-23
Publication date: 2006-02-23
Also published as: WO2004039044A1

Abstract

着信による通話開始時に、発信者の個人情報の表示を行う通信端末において、登録された個人の個人情報と個人の声紋情報とを対応づけて記憶するデータベースとしてのＦＬＡＳＨ−ＲＯＭ１０と、着信による通話開始時に発信者の音声から発信者の声紋情報を抽出する声紋分析部６と、発信者の声紋情報とデータベース内の声紋情報とを比較することによりデータベース内の個人の中から発信者を特定するＭＰＵ７と、特定された発信者の個人情報を表示する表示部としてのＬＣＤ１７とを備えた。FLASH-ROM 10 serving as a database for storing personal information and personal voiceprint information in association with each other in a communication terminal that displays the personal information of the caller at the start of the incoming call, and starting the incoming call Sometimes the voiceprint analysis unit 6 that extracts the voiceprint information of the caller from the voice of the caller, and the MPU 7 that identifies the caller from the individuals in the database by comparing the voiceprint information of the caller with the voiceprint information in the database. And an LCD 17 as a display unit for displaying personal information of the specified caller.

Description

本発明は、着信による通話開始時に発信者の声紋を用いて個人を特定し、その個人情報を画面に表示する通信端末、声紋情報検索サーバ、通信端末における個人情報表示方法、個人情報表示プログラムに関するものである。なお、通信端末には携帯電話機のような携帯端末も含まれる。 The present invention relates to a communication terminal, a voiceprint information search server, a personal information display method in a communication terminal, and a personal information display program for identifying an individual using a caller's voiceprint at the start of an incoming call and displaying the personal information on a screen. Is. The communication terminal includes a mobile terminal such as a mobile phone.

従来、通信端末を用いた通話等において、発信先の通信端末のユーザが発信元の通信端末のユーザ（発信者）を知るためには、通話開始時に発信者の名前を聞いたり、あるいは発信者の声から判断する必要がある。また、着信時に発信元の電話番号または名前を表示する機能を有する通信端末を用いて通話を行う場合は、ユーザは表示される電話番号または名前を視認することにより、発信元を認識することができる。この発信者の名前の表示は、例えば特開２００１−２１８２６７号に知られており、携帯電話より発信される電波を送受信機にて受信し、その電波に含まれる携帯電話の電話番号を個人認証判定部にて個人認証データベースを用いて照合することにより行っている。
しかしながら、このような従来の技術においては、発信元の電話番号または名前を表示する通信端末を用いても、発信元の電話番号が複数の人に使われているような場合、例えば発信元の電話番号が企業や団体の電話番号である場合は、発信者個人を特定することができないという問題がある。また、発信元の通信端末を持ち主以外の個人が使用した場合は、発信元をその通信端末の持ち主と間違えてしまうという問題がある。
本発明は、このような問題を解決するためになされたものであり、着信による通話開始時に発信者の声紋を用いて発信者個人を特定し、発信者の個人情報を画面に表示できる通信端末、声紋情報検索サーバ、通信端末における個人情報表示方法、個人情報表示プログラムを提供することを目的とする。Conventionally, in a call using a communication terminal or the like, in order for the user of the communication terminal of the call destination to know the user (caller) of the communication terminal of the call origin, It is necessary to judge from the voice. In addition, when a call is made using a communication terminal having a function of displaying a caller's telephone number or name when an incoming call is received, the user may recognize the caller by visually recognizing the displayed telephone number or name. it can. This caller name display is known, for example, in Japanese Patent Application Laid-Open No. 2001-218267. A radio wave transmitted from a mobile phone is received by a transmitter / receiver, and the mobile phone number included in the radio wave is personally authenticated. This is done by collating using a personal authentication database in the determination unit.
However, in such a conventional technique, even when a communication terminal that displays a caller's phone number or name is used, if the caller's phone number is used by multiple people, for example, When the telephone number is a telephone number of a company or organization, there is a problem that the individual sender cannot be specified. In addition, when an individual other than the owner uses the communication terminal of the sender, there is a problem that the sender is mistaken for the owner of the communication terminal.
The present invention has been made to solve such a problem, and can identify a caller using a voiceprint of a caller at the start of an incoming call and can display personal information of the caller on a screen. An object of the present invention is to provide a voiceprint information search server, a personal information display method in a communication terminal, and a personal information display program.

本発明の通信端末は、例えば登録された個人の個人情報と前記個人の声紋情報とを対応づけて記憶するデータベースと、着信による通話開始時に発信者の音声から前記発信者の声紋情報を抽出する声紋分析部と、前記発信者の声紋情報と前記データベース内の声紋情報とを比較することにより前記発信者についての前記データベース内の個人情報を特定する演算部と、特定された前記発信者の個人情報を表示する表示部とを備えてなるものである。
このような構成によれば、着信による通話開始時に、例えばユーザは表示部に表示される個人情報を視認等することにより発信者を特定することができる。なお、表示部は音声表示するものであっても良い。ここで、実施の形態１におけるデータベースとはＦＬＡＳＨ−ＲＯＭ１０のことであり、声紋分析部とは声紋分析部６のことであり、演算部とはＭＰＵ７のことであり、表示部とはＬＣＤ１７のことである。
また、本発明に係る通信端末においては、例えばユーザが前記発信者の個人情報を入力するための入力部を備え、前記声紋分析部から得られた前記発信者の声紋情報が前記データベースに登録されていない場合に、前記発信者の声紋情報と前記入力部から得られた前記発信者の個人情報とからなる組を前記データベースへ登録することを特徴とするものである。
このような構成によれば、通信端末内のデータベースに発信者の声紋情報が保存されていない場合、ユーザは入力部を用いて発信者の個人情報を入力し、通信端末内のデータベースに発信者の声紋情報と個人情報をリンクさせて保存することができる。なお、実施の形態１における入力部とはキーパッド１８のことである。
また、本発明は、通信回線を介して通信端末と接続されることができる声紋情報検索サーバであって、例えば登録された個人の個人情報と前記個人の声紋情報とを対応づけて記憶するデータベースと、前記通信端末から声紋情報を含む情報を受信する受信部と、該受信部が受信した声紋情報と前記データベース内の声紋情報とを比較することにより、前記データベース内の個人情報（前記受信された声紋情報を持つ話者）を特定する演算部と、特定された前記データベース内の個人情報を前記通信端末へ送信する送信部とを備えてなるものである。
また、本発明は、通信回線を介して前記声紋情報検索サーバと接続されることができる通信端末であって、着信による通話開始時に発信者の音声から前記発信者の声紋情報を抽出する声紋分析部と、前記発信者の声紋情報を前記声紋情報検索サーバへ送信する送信部と、前記発信者の個人情報を前記声紋情報検索サーバから受信する受信部と、該受信部から得られる前記発信者の個人情報を表示する表示部とを備えてなるものである。
このような構成によれば、着信による通話開始時に、ユーザは表示部に表示される個人情報を視認等することにより発信者を特定することができる。また、声紋情報検索サーバが一括してデータベースを持ち、通信端末が個々にデータベースを持つ必要がないことから、通信端末の回路規模を低減することができる。なお、実施の形態２における声紋情報検索サーバとはサーバ４０のことであり、通信端末とは通信端末１Ａのことであり、データベースとは記憶部４４のことであり、送信部と受信部とは信号処理部４１のことであり、演算部とは制御部４３のことであり、声紋分析部とは声紋分析部６のことであり、送信部と受信部とは信号処理部３のことであり、表示部とはＬＣＤ１７のことである。
また、本発明に係る通信端末においては、発信者の個人情報を入力するための入力部を備え、例えば前記発信者の個人情報が前記声紋情報検索サーバから受信されない場合に、前記発信者の声紋情報と前記入力部から得られた前記発信者の個人情報とからなる組を前記声紋情報検索サーバのデータベースへ登録することを特徴とするものである。
このような構成によれば、声紋情報検索サーバ内のデータベースに発信者の声紋情報が保存されていない場合、ユーザは入力部を用いて個人情報を入力し、声紋情報検索サーバ内のデータベースに発信者の声紋情報と個人情報をリンクさせて保存することができる。なお、実施の形態２における入力部とはキーパッド１８のことである。また、本発明によれば、これら声紋情報検索サーバと通信端末とから個人情報表示システムが構成される。
また、本発明は、通信回線を介して通信端末と接続されることができる声紋情報検索サーバであって、例えば登録された個人の個人情報と前記個人の声紋情報とを対応づけて記憶するデータベースと、通信端末から音声を含む情報を受信する受信部と、該受信部が受信した音声から声紋情報を抽出する声紋分析部と、該声紋分析部が抽出した声紋情報と前記データベース内の声紋情報とを比較することにより前記データベース内の個人情報を特定する演算部と、前記特定された前記データベース内の個人情報を前記通信端末へ送信する送信部とを備えてなるものである。
また、本発明は、通信回線を介して前記声紋情報検索サーバと接続されることができる通信端末であって、着信による通話開始時に発信者の音声を前記声紋情報検索サーバへ転送する送信部と、前記声紋情報検索サーバから前記発信者の個人情報を受信する受信部と、該受信部から得られる前記発信者の個人情報を表示する表示部とを備えてなるものである。
このような構成によれば、着信による通話開始時に、ユーザは表示部に表示される個人情報を視認等することにより発信者を特定することができる。また、通信端末に比べて演算性能の高い声紋情報検索サーバが声紋分析や比較を行うことにより、高速に個人を特定することができるとともに、通信端末が個々に声紋分析部やデータベースを持つ必要がないことから、通信端末の回路規模を低減することができる。なお、実施の形態３における声紋情報検索サーバとはサーバ６０のことであり、通信端末とは通信端末５０のことであり、データベースとは記憶部６５のことであり、声紋分析とは声紋分析部６４のことであり、送信部と受信部とは信号処理部６１のことであり、演算部とは制御部６３のことであり、送信部と受信部とは信号処理部３のことであり、表示部とはＬＣＤ１７のことである。
また、本発明に係る通信端末においては、発信者の個人情報を入力するための入力部を備え、例えば前記発信者の個人情報が前記声紋情報検索サーバから受信されない場合に、前記発信者の声紋情報と前記入力部から得られた前記発信者の個人情報とからなる組を前記声紋情報検索サーバのデータベースへ登録することを特徴とするものである。
このような構成によれば、声紋情報検索サーバ内のデータベースに発信者の声紋情報が保存されていない場合、ユーザは入力部を用いて個人情報を入力し、声紋情報検索サーバ内のデータベースに発信者の声紋情報と個人情報をリンクさせて保存することができる。なお、実施の形態３における入力部とはキーパッド１８のことである。また、本発明によれば、これら声紋情報検索サーバと通信端末により個人情報表示システムが構成される。
また、本発明は、着信による通話開始時に、発信者の個人情報の表示を行う通信端末における個人情報表示方法であって、例えば登録された個人の個人情報と前記個人の声紋情報を対応づけデータベースとして記憶するステップと、前記発信者の音声から前記発信者の声紋情報を抽出するステップと、前記発信者の声紋情報と前記データベース内の声紋情報とを比較することにより前記発信者についての前記データベース内の個人情報を特定するステップと、特定された前記発信者の個人情報を表示するステップとを備えてなる通信端末における個人情報表示方法を提供するものである。
また、本発明は、着信による通話開始時に、発信者の個人情報の表示をコンピュータに実行させるために、コンピュータにより読取可能な媒体に記憶された個人情報表示プログラムであって、例えば登録された個人の個人情報と前記個人の声紋情報を対応づけデータベースとして記憶するステップと、前記発信者の音声から前記発信者の声紋情報を抽出するステップと、前記発信者の声紋情報と前記データベース内の声紋情報とを比較することにより前記発信者についての前記データベース内の個人情報を特定するステップと、特定された前記発信者の個人情報を表示するステップとをコンピュータに実行させる個人情報表示プログラムを提供するものである。なお、このプログラムは、コンピュータにより読み取り可能な記憶媒体に記憶されることができ、記憶媒体としては、ＲＯＭ、ＲＡＭ等の半導体メモリ、ＣＤ−ＲＯＭやフレキシブルディスク、ＤＶＤディスク、光磁気ディスク、ＩＣカード等の可搬型記憶媒体や、コンピュータプログラムを保持するデータベース等がある。The communication terminal of the present invention extracts, for example, the voice print information of the caller from the database that stores the personal information of the registered individual and the personal voice print information in association with each other, and the voice of the caller at the start of the incoming call. A voiceprint analysis unit; a calculation unit that identifies personal information in the database for the sender by comparing the voiceprint information of the sender with voiceprint information in the database; and the identified individual of the sender And a display unit for displaying information.
According to such a configuration, at the start of an incoming call, for example, the user can identify the caller by visually checking the personal information displayed on the display unit. Note that the display unit may display a sound. Here, the database in the first embodiment is the FLASH-ROM 10, the voiceprint analysis unit is the voiceprint analysis unit 6, the calculation unit is the MPU 7, and the display unit is the LCD 17. It is.
In the communication terminal according to the present invention, for example, a user includes an input unit for inputting personal information of the caller, and the voiceprint information of the caller obtained from the voiceprint analysis unit is registered in the database. If not, a set consisting of the voiceprint information of the sender and the personal information of the sender obtained from the input unit is registered in the database.
According to such a configuration, when the voiceprint information of the caller is not stored in the database in the communication terminal, the user inputs the caller's personal information using the input unit, and the caller is stored in the database in the communication terminal. Voiceprint information and personal information can be linked and saved. The input unit in the first embodiment is the keypad 18.
Further, the present invention is a voiceprint information search server that can be connected to a communication terminal via a communication line, and stores, for example, a registered personal information and the personal voiceprint information in association with each other. And a receiving unit for receiving information including voiceprint information from the communication terminal, and comparing the voiceprint information received by the receiving unit with the voiceprint information in the database, thereby storing the personal information in the database (the received information). And a transmitter for transmitting the specified personal information in the database to the communication terminal.
Further, the present invention is a communication terminal that can be connected to the voiceprint information search server via a communication line, and extracts the voiceprint information of the caller from the voice of the caller at the start of the incoming call A transmitter that transmits the voiceprint information of the sender to the voiceprint information search server, a receiver that receives the personal information of the sender from the voiceprint information search server, and the sender obtained from the receiver And a display unit for displaying the personal information.
According to such a configuration, the user can specify the caller by visually recognizing the personal information displayed on the display unit at the start of the incoming call. Further, since the voiceprint information search server has a database in a lump and the communication terminals do not need to have a database individually, the circuit scale of the communication terminals can be reduced. In addition, the voiceprint information search server in Embodiment 2 is the server 40, the communication terminal is the communication terminal 1A, the database is the storage unit 44, and the transmission unit and the reception unit are The signal processing unit 41, the calculation unit is the control unit 43, the voice print analysis unit is the voice print analysis unit 6, and the transmission unit and the reception unit are the signal processing unit 3. The display unit is the LCD 17.
The communication terminal according to the present invention further includes an input unit for inputting the personal information of the caller. For example, when the personal information of the caller is not received from the voiceprint information search server, the voiceprint of the caller is provided. A set of information and personal information of the sender obtained from the input unit is registered in a database of the voiceprint information search server.
According to such a configuration, when the voiceprint information of the caller is not stored in the database in the voiceprint information search server, the user inputs personal information using the input unit, and sends the personal information to the database in the voiceprint information search server. A person's voiceprint information and personal information can be linked and stored. In the second embodiment, the input unit is the keypad 18. Further, according to the present invention, a personal information display system is constituted by the voiceprint information search server and the communication terminal.
Further, the present invention is a voiceprint information search server that can be connected to a communication terminal via a communication line, and stores, for example, a registered personal information and the personal voiceprint information in association with each other. A receiving unit for receiving information including voice from the communication terminal, a voiceprint analyzing unit for extracting voiceprint information from the voice received by the receiving unit, voiceprint information extracted by the voiceprint analyzing unit, and voiceprint information in the database Are provided with a calculation unit that specifies personal information in the database and a transmission unit that transmits the specified personal information in the database to the communication terminal.
Further, the present invention is a communication terminal that can be connected to the voiceprint information search server via a communication line, and transmits a voice of a caller to the voiceprint information search server at the start of a call by incoming call; A receiving unit that receives the sender's personal information from the voiceprint information search server, and a display unit that displays the sender's personal information obtained from the receiving unit.
According to such a configuration, the user can specify the caller by visually recognizing the personal information displayed on the display unit at the start of the incoming call. In addition, a voiceprint information search server with higher computing performance than a communication terminal can identify a person at high speed by performing voiceprint analysis and comparison, and the communication terminal needs to have a voiceprint analysis unit and a database individually. Therefore, the circuit scale of the communication terminal can be reduced. The voiceprint information search server in the third embodiment is the server 60, the communication terminal is the communication terminal 50, the database is the storage unit 65, and the voiceprint analysis is the voiceprint analysis unit. 64, the transmission unit and the reception unit are the signal processing unit 61, the arithmetic unit is the control unit 63, the transmission unit and the reception unit are the signal processing unit 3, The display unit is the LCD 17.
The communication terminal according to the present invention further includes an input unit for inputting the personal information of the caller. For example, when the personal information of the caller is not received from the voiceprint information search server, the voiceprint of the caller is provided. A set of information and personal information of the sender obtained from the input unit is registered in a database of the voiceprint information search server.
According to such a configuration, when the voiceprint information of the caller is not stored in the database in the voiceprint information search server, the user inputs personal information using the input unit, and sends the personal information to the database in the voiceprint information search server. A person's voiceprint information and personal information can be linked and stored. In the third embodiment, the input unit is the keypad 18. Further, according to the present invention, a personal information display system is constituted by these voiceprint information search servers and communication terminals.
The present invention is also a personal information display method in a communication terminal that displays personal information of a caller at the start of a call by incoming call, for example, a database that associates registered personal personal information and personal voiceprint information. Storing the caller's voiceprint information from the caller's voice, comparing the caller's voiceprint information with the voiceprint information in the database to compare the database of the caller The personal information display method in the communication terminal comprising the steps of specifying personal information in the communication terminal and displaying the personal information of the specified sender is provided.
The present invention is also a personal information display program stored on a computer-readable medium for causing a computer to display personal information of a caller at the start of an incoming call, for example, a registered individual Storing the personal information of the person and the voice print information of the person as a database, extracting the voice print information of the caller from the voice of the caller, the voice print information of the caller and the voiceprint information in the database Providing a personal information display program for causing a computer to execute the step of identifying personal information in the database for the sender by comparing the information and the step of displaying the personal information of the identified sender It is. This program can be stored in a computer-readable storage medium, such as a semiconductor memory such as ROM or RAM, a CD-ROM, a flexible disk, a DVD disk, a magneto-optical disk, or an IC card. And a portable storage medium such as a database for holding computer programs.

第１図は、本発明の実施の形態１に係る通信端末の構成の一例を示すブロック図である。
第２図は、本発明の実施の形態１に係る通信端末の発信者特定の動作の一例を示すフローチャートである。
第３図は、通信端末とサーバを備えた通信システムの構成の一例を示すブロック図である。
第４図は、本発明の実施の形態２に係るサーバの構成の一例を示すブロック図である。
第５図は、本発明の実施の形態２に係る通信端末の発信者特定の動作の一例を示すフローチャートである。
第６図は、本発明の実施の形態２に係るサーバの発信者特定の動作の一例を示すフローチャートである。
第７図は、本発明の実施の形態３に係る通信端末の構成の一例を示すブロック図である。
第８図は、本発明の実施の形態３に係るサーバの構成の一例を示すブロック図である。
第９図は、本発明の実施の形態３に係る通信端末の発信者特定の動作の一例を示すフローチャートである。
第１０図は、本発明の実施の形態３に係るサーバの発信者特定の動作の一例を示すフローチャートである。FIG. 1 is a block diagram showing an example of the configuration of a communication terminal according to Embodiment 1 of the present invention.
FIG. 2 is a flowchart showing an example of a caller specifying operation of the communication terminal according to Embodiment 1 of the present invention.
FIG. 3 is a block diagram illustrating an example of a configuration of a communication system including a communication terminal and a server.
FIG. 4 is a block diagram showing an example of the configuration of the server according to Embodiment 2 of the present invention.
FIG. 5 is a flowchart showing an example of a caller specifying operation of the communication terminal according to Embodiment 2 of the present invention.
FIG. 6 is a flowchart showing an example of the caller specifying operation of the server according to Embodiment 2 of the present invention.
FIG. 7 is a block diagram showing an example of the configuration of the communication terminal according to Embodiment 3 of the present invention.
FIG. 8 is a block diagram showing an example of the configuration of the server according to Embodiment 3 of the present invention.
FIG. 9 is a flowchart showing an example of the caller specifying operation of the communication terminal according to Embodiment 3 of the present invention.
FIG. 10 is a flowchart showing an example of the caller specifying operation of the server according to Embodiment 3 of the present invention.

以下、本発明の実施の形態について図面を参照して詳細に説明する。なお、本発明の実施の形態では、通信端末のうち無線通信を行う通信端末を例に挙げて説明する。
実施の形態１．
本実施の形態では、通信端末の着信における発信者特定において、通信端末が声紋分析を行い、声紋情報の検索を行い、個人情報の表示を行う例について説明する。
まず、通信端末の構成と動作について説明する。第１図は、本発明の実施の形態１に係る通信端末の構成の一例を示すブロック図である。第１図に示すように、この通信端末１は、送受信アンテナ２と信号処理部３とデータ処理部４と音声処理部５と声紋分析部６とＭＰＵ（ＭｉｃｒｏｐｒｏｃｅｓｓｉｎｇＵｎｉｔ）７とＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）８とＲＯＭ（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）９とＦＬＡＳＨ−ＲＯＭ１０と音源ＬＳＩ（ＬａｒｇｅＳｃａｌｅＩｎｔｅｇｒａｔｅｄＣｉｒｃｕｉｔ）１１とマイク１２とスピーカ１３と外部入出力部１４とバイブレータ１５とＬＥＤ（ＬｉｇｈｔＥｍｉｔｔｉｎｇＤｉｏｄｅ）１６とＬＣＤ（ＬｉｑｕｉｄＣｒｙｓｔａｌＤｉｓｐｌａｙ）１７とキーパッド１８から構成される。
ＭＰＵ７は、信号処理部３とデータ処理部４と音声処理部５と声紋分析部６とＲＡＭ８とＲＯＭ９とＦＬＡＳＨ−ＲＯＭ１０と音源ＬＳＩ１１と外部入出力部１４とバイブレータ１５とＬＥＤ１６とＬＣＤ１７とキーパッド１８と接続されており、それぞれの制御を行う。
送信動作において、信号処理部３は、データ処理部４からの非音声データと音声処理部５からの音声データを合成し、送受信アンテナ２を介して外部へ送信する。また受信動作において、信号処理部３は、送受信アンテナ２を介して受信した信号が音声以外の非音声データであればデータ処理部４へ出力し、音声データであれば音声処理部５へ出力する。
非音声データは、データ処理部４を介して文字や画像としてＬＣＤ１７へ出力される。音声処理部５は、スピーカ１３を介して音声データを音声として外部へ出力すると共に、声紋分析に必要な音声データを声紋分析部６へ出力する。また、音声処理部５は、マイク１２を介して外部から受信した音声を音声データとして信号処理部３へ出力する。
声紋分析部６は、例えば特許第３２８０８２５号公報に開示されたような処理を用いて、受信した音声データに対して声紋分析を行い、周波数ごとの時間分布、発声時間、ピッチ周波数等からなる声紋情報を算出し、ＲＡＭ８へ出力する。
ＲＯＭ９は、ＭＰＵ７のプログラム等を保存する。ＲＡＭ８は、ＭＰＵ７のプログラムの実行に必要な情報を保存する。また、ＲＡＭ８は、声紋情報を一時的に保存する。ＦＬＡＳＨ−ＲＯＭ１０は、声紋情報と個人情報をリンクさせたデータベースを保存する。ここで個人情報とは、例えば名前、年齢、性別、会社名、電話番号、管理番号等である。
音源ＬＳＩ１１は着信音等を生成する。外部入出力部１４は、外部のＰＣ等とケーブル等を介してデータの入出力を行う。バイブレータ１５は着信等に伴って振動を行う。ＬＥＤ１６は着信等に伴って発光を行う。ＬＣＤ１７は文字や画像の表示を行う。キーパッド１８は、ユーザからの個人情報等の入力を受け付ける。
次に、着信による通話開始時の発信者特定の動作について、第２図のフローチャートを用いて説明する。第２図は、実施の形態１に係る通信端末の発信者特定の動作の一例を示すフローチャートである。ある発信者から通信端末１へ着信があり、通話を開始すると（Ｓ１）、ＭＰＵ７は声紋分析部６において発信者の音声の声紋分析を行い（Ｓ２）、その結果を発信者の声紋情報としてＲＡＭ８へ保存する。
次に、ＭＰＵ７はＲＡＭ８に保存された発信者の声紋情報とＦＬＡＳＨ−ＲＯＭ１０のデータベース内の声紋情報を比較することにより発信者の声紋情報の検索を行い（Ｓ３）、発信者の声紋情報がデータベースに登録済みであるか否かの判断を行う（Ｓ４）。発信者の声紋情報がデータベースに登録済みである場合（Ｓ４，Ｙ）、ＭＰＵ７は発信者の声紋情報にリンクした個人情報をデータベースから読み出してＬＣＤ１７へ表示し（Ｓ５）、通信端末１のフローは終了する。
一方、発信者の声紋情報がデータベースに登録されていない場合（Ｓ４，Ｎ）、ＭＰＵ７は、「この発信者の声紋情報は登録されていません。登録しますか？」のような未登録メッセージをＬＣＤ１７へ表示する（Ｓ６）。この未登録メッセージを見たユーザは、キーパッド１８を用いて発信者の声紋情報を登録するか否かを入力する。
ユーザからの入力が、登録を希望している場合（Ｓ７，Ｙ）、ＭＰＵ７はユーザがキーパッド１８を用いて入力した発信者の個人情報を受け付け（Ｓ８）、発信者の声紋情報と発信者の個人情報をリンクさせ、ＦＬＡＳＨ−ＲＯＭ１０のデータベースへ保存し（Ｓ９）、通信端末１のフローは終了する。また、ユーザからの入力が、登録を希望していない場合（Ｓ７，Ｎ）、通信端末１のフローは終了する。
以上のような処理によれば、着信による通話開始時に、ユーザはＬＣＤ１７に表示される個人情報を視認して発信者を特定することができる。また、通信端末１内のデータベースに発信者の声紋情報が保存されていない場合、ユーザは発信者の個人情報を入力し、通信端末１内のデータベースに発信者の声紋情報と個人情報をリンクさせて保存することができる。
実施の形態２．
本実施の形態では、通信端末の着信における発信者特定において、通信端末が声紋分析を行い、外部のサーバが声紋情報の検索を行い、通信端末が個人情報の表示を行う例について説明する。
第３図は、通信端末とサーバを備えた通信システム（個人情報表示システム）の構成の一例を示すブロック図である。第３図に示すように、この通信システムは、通信端末１Ａと無線基地局２０とサーバ４０から構成される。通信端末１Ａと無線基地局２０は無線で通信を行い、無線基地局２０とサーバ４０は公衆網（通信回線）３０を介して通信を行う。
まず、通信端末１Ａの構成と動作について説明する。通信端末１Ａは第１図に示した通信端末１と同様の構成を持つが、ＦＬＡＳＨ−ＲＯＭ１０はデータベースを持たない。またＭＰＵ７は、声紋分析部６から出力される声紋情報とキーパッド１８から新規に入力された発信者の個人情報を信号処理部３からサーバ４０へ送信し、信号処理部３がサーバ４０から受信した発信者の個人情報をＬＣＤ１７へ表示する。
次に、サーバ４０の構成と動作について説明する。第４図は、本発明の実施の形態２に係るサーバの構成の一例を示すブロック図である。サーバ４０は、信号処理部４１とデータ処理部４２と制御部４３と記憶部４４から構成される。制御部４３は、信号処理部４１とデータ処理部４２と記憶部４４と接続されており、それぞれの制御を行う。送信動作において、信号処理部４１は、データ処理部４からのデータを通信端末１Ａへ送信する。また受信動作において、信号処理部４１は、通信端末１Ａから受信したデータをデータ処理部４２へ出力する。データ処理部４２は、データを記憶部４４へ出力する。記憶部４４は、制御部４３のプログラムやデータ処理部４２からのデータ等を保存する。また、記憶部４４は、声紋情報と個人情報をリンクさせたデータベースを保存する。
次に、着信による通話開始時の発信者特定の動作について、第５図と第６図のフローチャートを用いて説明する。第５図は、実施の形態２に係る通信端末の発信者特定の動作の一例を示すフローチャートである。第６図は、実施の形態２に係るサーバの発信者特定の動作の一例を示すフローチャートである。
通信端末１Ａにおいて、ある発信者から着信があり、通話を開始すると（Ｓ１１）、ＭＰＵ７は声紋分析部６において発信者の音声の声紋分析を行い（Ｓ１２）、その結果を発信者の声紋情報として、信号処理部３からサーバ４０へ送信する（Ｓ１３）。
サーバ４０において、信号処理部４１は通信端末１Ａから発信者の声紋情報を受信し（Ｓ２１）、制御部４３は発信者の声紋情報と記憶部４４のデータベース内の声紋情報を比較することにより発信者の声紋情報の検索を行い（Ｓ２２）、発信者の声紋情報がデータベースに登録済みであるか否かの判断を行う（Ｓ２３）。
サーバ４０において、発信者の声紋情報がデータベースに登録済みである場合（Ｓ２３，Ｙ）、制御部４３は発信者の声紋情報にリンクした個人情報をデータベースから読み出し、信号処理部４１から通信端末１Ａへ送信し（Ｓ２４）、サーバ４０のフローは終了する。
次に通信端末１Ａにおいて、信号処理部３がサーバ４０から個人情報を受信すると（Ｓ１４，Ｙ）、ＭＰＵ７は受信した個人情報をＬＣＤ１７へ表示し（Ｓ１５）、通信端末１Ａのフローは終了する。
サーバ４０において、発信者の声紋情報がデータベースに登録されていない場合（Ｓ２３，Ｎ）、制御部４３は未登録を示す信号を、信号処理部４１から通信端末１Ａへ送信する（Ｓ２５）。
次に通信端末１Ａにおいて、信号処理部３がサーバ４０から未登録を示す信号を受信すると（Ｓ１４，Ｎ）、ＭＰＵ７は「この発信者の声紋情報は登録されていません。登録しますか？」のような未登録メッセージをＬＣＤ１７へ表示する（Ｓ１６）。この未登録メッセージを見たユーザは、キーパッド１８を用いて声紋情報を登録するか否かを入力する。
次に通信端末１Ａにおいて、ユーザからの入力が、登録を希望している場合（Ｓ１７，Ｙ）、ＭＰＵ７はユーザがキーパッド１８を用いて入力した発信者の個人情報を受け付け（Ｓ１８）、発信者の個人情報を信号処理部３からサーバ４０へ送信し（Ｓ１９）、通信端末１Ａのフローは終了する。また、ユーザからの入力が、登録を希望していない場合（Ｓ１７，Ｎ）、通信端末１Ａのフローは終了する。
次にサーバ４０において、信号処理部４１が、ユーザにより入力された発信者の個人情報を通信端末１Ａから受信した場合（Ｓ２６，Ｙ）、制御部４３は発信者の声紋情報と発信者の個人情報をリンクさせ、記憶部４４のデータベースへ保存し（Ｓ２７）、サーバ４０のフローは終了する。また、信号処理部４１が、ユーザにより入力された発信者の個人情報を通信端末１Ａから受信しない場合（Ｓ２６，Ｎ）、サーバ４０のフローは終了する。
以上のような処理によれば、着信による通話開始時に、ユーザはＬＣＤ１７に表示される個人情報を視認して発信者を特定することができる。また、サーバ４０内のデータベースに発信者の声紋情報が保存されていない場合、ユーザは発信者の個人情報を入力して通信端末１Ａから送信し、サーバ４０内のデータベースに発信者の声紋情報と個人情報をリンクさせて保存することができる。本実施の形態では、サーバ４０が一括してデータベースを持ち、通信端末１Ａが個々にデータベースを持つ必要がないことから、通信端末１Ａの回路規模を低減することができる。
なお、本実施の形態では、通信端末１Ａはデータベースを持たないとしたが、通信端末１Ａにもデータベースを備え、通信端末１Ａ内のデータベースに発信者の声紋情報がなければサーバ４０内のデータベースを検索する構成としても良い。
実施の形態３．
本実施の形態では、通信端末の着信における発信者特定において、外部のサーバが声紋分析を行い、外部のサーバが声紋情報の検索を行い、通信端末が個人情報の表示を行う個人情報表示システムの例について説明する。
本実施の形態では、第３図と同様に通信端末とサーバを用いて発信者特定を行うが、通信端末１Ａの代わりに通信端末５０を備え、サーバ４０の代わりにサーバ６０を備える。
まず、通信端末５０の構成と動作について説明する。第７図は、本発明の実施の形態３に係る通信端末の構成の一例を示すブロック図である。第７図において、第１図と同一符号は第１図に示された対象と同一又は相当物を示しており、ここでの説明を省略する。本実施の形態における通信端末５０は、声紋分析部６を削除し、音声処理部５の代わりに音声処理部５５を備える。音声処理部５５は、音声データを音声としてスピーカ１３を介して外部へ出力すると共に、声紋分析に必要な音声データを信号処理部３へ出力する。また、音声処理部５５は、マイク１２を介して外部から受信した音声を音声データとして信号処理部３へ出力する。
またＦＬＡＳＨ−ＲＯＭ１０はデータベースを持たない。またＭＰＵ７は、音声処理部５５から出力される音声データとキーパッド１８から新規に入力された発信者の個人情報を信号処理部３からサーバ６０へ送信し、信号処理部３がサーバ６０から受信した発信者の個人情報をＬＣＤ１７へ表示する。
次に、サーバ６０の構成と動作について説明する。第８図は、本発明の実施の形態３に係るサーバの構成の一例を示すブロック図である。第８図において、第８図と同一符号は第８図に示された対象と同一又は相当物を示しており、ここでの説明を省略する。本実施の形態におけるサーバ６０は、信号処理部４１の代わりに信号処理部６１を備え、制御部４３の代わりに制御部６３を備え、記憶部４４の代わりに記憶部６５を備え、さらに音声処理部６２と声紋分析部６４を備える。
制御部６３は、信号処理部６１とデータ処理部４２と音声処理部６２と声紋分析部６４と記憶部６５と接続されており、それぞれの制御を行う。送信動作において、信号処理部６１は、データ処理部４２からの非音声データを、通信端末５０へ出力する。また受信動作において、信号処理部６１は、受信した信号が音声以外の非音声データであればデータ処理部４２へ出力し、音声データであれば音声処理部６２へ出力する。
音声処理部６２は、声紋分析に必要な音声データを声紋分析部６４へ出力する。声紋分析部６４は、第１図に示した声紋分析部６と同様に声紋情報を算出し、記憶部６５へ出力する。記憶部６５は、制御部６３のプログラムやデータ処理部４２からのデータ等を保存する。また、記憶部６５は、声紋情報と個人情報をリンクさせたデータベースを保存する。さらに、記憶部６５は、声紋分析部６４で分析された声紋情報を一時的に保存する。
次に、着信による通話開始時の発信者特定の動作について、第９図と第１０図のフローチャートを用いて説明する。第９図は、実施の形態３に係る通信端末の発信者特定の動作の一例を示すフローチャートである。第１０図は、実施の形態３に係るサーバの発信者特定の動作の一例を示すフローチャートである。
通信端末５０において、ある発信者から着信があり、通話を開始すると（Ｓ３１）、ＭＰＵ７は声紋分析に必要な発信者の音声を、信号処理部３からサーバ６０へ転送する（Ｓ３２）。
サーバ６０において、信号処理部６１は通信端末５０から転送された音声を受信し（Ｓ４１）、制御部６３は声紋分析部６４において発信者の音声の声紋分析を行い（Ｓ４２）、その結果得られた発信者の声紋情報と記憶部６５のデータベース内の声紋情報を比較することにより発信者の声紋情報の検索を行い（Ｓ４３）、発信者の声紋情報がデータベースに登録済みであるか否かの判断を行う（Ｓ４４）。
サーバ６０において、発信者の声紋情報がデータベースに登録済みである場合（Ｓ４４，Ｙ）、制御部６３は発信者の声紋情報にリンクした個人情報をデータベースから読み出し、信号処理部６１から通信端末５０へ送信し（Ｓ４５）、サーバ６０のフローは終了する。
次に通信端末５０において、信号処理部３がサーバ６０から個人情報を受信すると（Ｓ３３，Ｙ）、受信した個人情報をＬＣＤ１７へ表示し（Ｓ３４）、通信端末５０のフローは終了する。
サーバ６０において、発信者の声紋情報がデータベースに登録されていない場合（Ｓ４４，Ｎ）、制御部６３は未登録を示す信号を、信号処理部６１から通信端末５０へ送信する（Ｓ４６）。
次に通信端末５０において、信号処理部３がサーバ６０から未登録を示す信号を受信すると（Ｓ３３，Ｎ）、ＭＰＵ７は「この発信者の声紋情報は登録されていません。登録しますか？」のような未登録メッセージをＬＣＤ１７へ表示する（Ｓ３５）。この未登録メッセージを見たユーザは、キーパッド１８を用いて声紋情報を登録するか否かを入力する。
次に通信端末５０において、ユーザからの入力が、登録を希望している場合（Ｓ３６，Ｙ）、ＭＰＵ７はユーザがキーパッド１８を用いて入力した発信者の個人情報を受け付け（Ｓ３７）、発信者の個人情報を信号処理部３からサーバ６０へ送信し（Ｓ３８）、通信端末５０のフローは終了する。また、ユーザからの入力が、登録を希望していない場合（Ｓ３６，Ｎ）、通信端末５０のフローは終了する。
次にサーバ６０において、信号処理部６１が、ユーザにより入力された発信者の個人情報を通信端末５０から受信した場合（Ｓ４７，Ｙ）、制御部６３は発信者の声紋情報と発信者の個人情報をリンクさせ、記憶部６５のデータベースへ保存し（Ｓ４８）、サーバ６０のフローは終了する。また、信号処理部６１が、ユーザにより入力された発信者の個人情報を通信端末５０から受信しない場合（Ｓ４７，Ｎ）、サーバ６０のフローは終了する。
以上のような処理によれば、着信による通話開始時に、ユーザはＬＣＤ１７に表示される個人情報を視認して発信者を特定することができる。また、サーバ６０内のデータベースに発信者の声紋情報が保存されていない場合、ユーザは発信者の個人情報を入力して通信端末５０から送信し、サーバ６０内のデータベースに発信者の声紋情報と個人情報をリンクさせて保存することができる。本実施の形態では、通信端末５０に比べて演算性能の高いサーバ６０が声紋分析や比較を行うことにより、高速に個人を特定することができるとともに、通信端末５０が個々に声紋分析部やデータベースを持つ必要がないことから、通信端末５０の回路規模を低減することができる。
なお、実施の形態１から３において、無線通信を行う通信端末を例に挙げて説明したが、有線通信を行う通信端末にも本発明を適用することができる。また、表示部としてＬＣＤを用いた視覚的な表示について説明したが、音声表示等を行うようにすることも可能である。以上、実施の形態１から３を説明したが、上述した実施の形態において説明された通信端末やサーバの構成及び動作は、本発明を実現するための一例であり、その構成は本発明の趣旨を逸脱しない範囲内において特に限定されず、適宜応用可能であることは言うまでもない。Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings. In the embodiment of the present invention, a communication terminal that performs wireless communication among communication terminals will be described as an example.
Embodiment 1 FIG.
In the present embodiment, an example will be described in which a communication terminal performs voiceprint analysis, searches for voiceprint information, and displays personal information when specifying a caller at an incoming call of the communication terminal.
First, the configuration and operation of the communication terminal will be described. FIG. 1 is a block diagram showing an example of the configuration of a communication terminal according to Embodiment 1 of the present invention. As shown in FIG. 1, the communication terminal 1 includes a transmission / reception antenna 2, a signal processing unit 3, a data processing unit 4, a voice processing unit 5, a voice print analysis unit 6, an MPU (Microprocessing Unit) 7, a RAM (Random Access Memory). ) 8, ROM (Read Only Memory) 9, FLASH-ROM 10, tone generator LSI (Large Scale Integrated Circuit) 11, microphone 12, speaker 13, external input / output unit 14, vibrator 15, LED (Light Emitting Diode) 16, and LCD A liquid crystal display 17 and a keypad 18 are included.
The MPU 7 includes a signal processing unit 3, a data processing unit 4, an audio processing unit 5, a voice print analysis unit 6, a RAM 8, a ROM 9, a FLASH-ROM 10, a sound source LSI 11, an external input / output unit 14, a vibrator 15, an LED 16, an LCD 17 and a keypad 18. And control each.
In the transmission operation, the signal processing unit 3 synthesizes the non-voice data from the data processing unit 4 and the voice data from the voice processing unit 5 and transmits the synthesized data to the outside via the transmission / reception antenna 2. In the reception operation, the signal processing unit 3 outputs to the data processing unit 4 if the signal received via the transmission / reception antenna 2 is non-voice data other than voice, and outputs to the voice processing unit 5 if the signal is voice data. .
The non-voice data is output to the LCD 17 as characters and images via the data processing unit 4. The voice processing unit 5 outputs the voice data to the outside as voice through the speaker 13 and outputs voice data necessary for voiceprint analysis to the voiceprint analysis unit 6. The audio processing unit 5 also outputs audio received from the outside via the microphone 12 to the signal processing unit 3 as audio data.
The voiceprint analysis unit 6 performs voiceprint analysis on the received voice data by using a process disclosed in, for example, Japanese Patent No. 3280825, and a voiceprint composed of a time distribution for each frequency, a voice time, a pitch frequency, and the like. Information is calculated and output to the RAM 8.
The ROM 9 stores a program of the MPU 7 and the like. The RAM 8 stores information necessary for executing the program of the MPU 7. The RAM 8 temporarily stores voiceprint information. The FLASH-ROM 10 stores a database in which voiceprint information and personal information are linked. Here, personal information includes, for example, name, age, gender, company name, telephone number, management number, and the like.
The tone generator LSI 11 generates a ring tone and the like. The external input / output unit 14 inputs / outputs data with an external PC or the like via a cable or the like. The vibrator 15 vibrates with an incoming call or the like. The LED 16 emits light when an incoming call is received. The LCD 17 displays characters and images. The keypad 18 receives input of personal information and the like from the user.
Next, a caller identification operation at the start of a call by incoming call will be described with reference to the flowchart of FIG. FIG. 2 is a flowchart showing an example of a caller specifying operation of the communication terminal according to the first embodiment. When there is an incoming call from a caller to the communication terminal 1 and a call is started (S1), the MPU 7 performs a voiceprint analysis of the caller's voice in the voiceprint analysis unit 6 (S2), and the result is stored in the RAM 8 as the voiceprint information of the caller. Save to
Next, the MPU 7 searches for the voiceprint information of the caller by comparing the voiceprint information of the caller stored in the RAM 8 with the voiceprint information in the database of the FLASH-ROM 10 (S3). It is determined whether or not it has already been registered (S4). When the caller's voiceprint information is already registered in the database (S4, Y), the MPU 7 reads out the personal information linked to the caller's voiceprint information from the database and displays it on the LCD 17 (S5), and the flow of the communication terminal 1 is as follows. finish.
On the other hand, if the caller's voiceprint information is not registered in the database (S4, N), the MPU 7 will send an unregistered message such as "This caller's voiceprint information is not registered. Do you want to register it?" Is displayed on the LCD 17 (S6). The user who sees this unregistered message uses the keypad 18 to input whether to register the caller's voiceprint information.
When the input from the user desires registration (S7, Y), the MPU 7 accepts the caller's personal information input by the user using the keypad 18 (S8), and the voiceprint information of the caller and the caller are received. Are stored in the database of the FLASH-ROM 10 (S9), and the flow of the communication terminal 1 ends. If the input from the user does not desire registration (S7, N), the flow of the communication terminal 1 ends.
According to the above processing, the user can identify the caller by visually recognizing the personal information displayed on the LCD 17 at the start of the incoming call. Further, when the voiceprint information of the caller is not stored in the database in the communication terminal 1, the user inputs the personal information of the caller, and links the voiceprint information of the caller and the personal information to the database in the communication terminal 1. Can be saved.
Embodiment 2. FIG.
In the present embodiment, an example will be described in which a communication terminal performs voiceprint analysis, an external server searches for voiceprint information, and a communication terminal displays personal information in specifying the caller at the incoming call of the communication terminal.
FIG. 3 is a block diagram showing an example of a configuration of a communication system (personal information display system) including a communication terminal and a server. As shown in FIG. 3, this communication system is composed of a communication terminal 1A, a radio base station 20, and a server 40. The communication terminal 1 </ b> A and the wireless base station 20 communicate wirelessly, and the wireless base station 20 and the server 40 communicate via a public network (communication line) 30.
First, the configuration and operation of the communication terminal 1A will be described. The communication terminal 1A has the same configuration as the communication terminal 1 shown in FIG. 1, but the FLASH-ROM 10 does not have a database. The MPU 7 transmits the voiceprint information output from the voiceprint analysis unit 6 and the personal information of the caller newly input from the keypad 18 from the signal processing unit 3 to the server 40, and the signal processing unit 3 receives the information from the server 40. The personal information of the sender is displayed on the LCD 17.
Next, the configuration and operation of the server 40 will be described. FIG. 4 is a block diagram showing an example of the configuration of the server according to Embodiment 2 of the present invention. The server 40 includes a signal processing unit 41, a data processing unit 42, a control unit 43, and a storage unit 44. The control unit 43 is connected to the signal processing unit 41, the data processing unit 42, and the storage unit 44, and controls each of them. In the transmission operation, the signal processing unit 41 transmits the data from the data processing unit 4 to the communication terminal 1A. In the reception operation, the signal processing unit 41 outputs the data received from the communication terminal 1 </ b> A to the data processing unit 42. The data processing unit 42 outputs the data to the storage unit 44. The storage unit 44 stores a program of the control unit 43, data from the data processing unit 42, and the like. The storage unit 44 stores a database in which voiceprint information and personal information are linked.
Next, a caller identification operation at the start of a call by incoming call will be described with reference to the flowcharts of FIGS. FIG. 5 is a flowchart showing an example of the caller specifying operation of the communication terminal according to the second embodiment. FIG. 6 is a flowchart showing an example of the caller specifying operation of the server according to the second embodiment.
When communication terminal 1A receives an incoming call from a certain caller and starts a call (S11), MPU 7 performs voiceprint analysis of the caller's voice in voiceprint analysis unit 6 (S12), and the result is used as the voiceprint information of the caller. Then, the signal processing unit 3 transmits it to the server 40 (S13).
In the server 40, the signal processing unit 41 receives the caller's voice print information from the communication terminal 1A (S21), and the control unit 43 sends the call by comparing the caller's voice print information with the voice print information in the storage unit 44. The voiceprint information of the sender is searched (S22), and it is determined whether or not the voiceprint information of the sender is already registered in the database (S23).
In the server 40, when the voiceprint information of the caller has been registered in the database (S23, Y), the control unit 43 reads the personal information linked to the voiceprint information of the caller from the database, and from the signal processing unit 41 to the communication terminal 1A. (S24), and the flow of the server 40 ends.
Next, in the communication terminal 1A, when the signal processing unit 3 receives personal information from the server 40 (S14, Y), the MPU 7 displays the received personal information on the LCD 17 (S15), and the flow of the communication terminal 1A ends.
In the server 40, when the voiceprint information of the caller is not registered in the database (S23, N), the control unit 43 transmits a signal indicating unregistration from the signal processing unit 41 to the communication terminal 1A (S25).
Next, in the communication terminal 1A, when the signal processing unit 3 receives a signal indicating unregistration from the server 40 (S14, N), the MPU 7 says, “The voiceprint information of this caller is not registered. Do you want to register? An unregistered message such as "is displayed on the LCD 17 (S16). The user who sees this unregistered message uses the keypad 18 to input whether to register voiceprint information.
Next, in the communication terminal 1A, when the input from the user desires to be registered (S17, Y), the MPU 7 accepts the personal information of the caller input by the user using the keypad 18 (S18), and sends the call. The personal information of the person is transmitted from the signal processing unit 3 to the server 40 (S19), and the flow of the communication terminal 1A ends. Further, when the input from the user does not desire registration (S17, N), the flow of the communication terminal 1A ends.
Next, in the server 40, when the signal processing unit 41 receives the personal information of the caller input by the user from the communication terminal 1A (S26, Y), the control unit 43 determines the voiceprint information of the caller and the personality of the caller. The information is linked and stored in the database of the storage unit 44 (S27), and the flow of the server 40 ends. When the signal processing unit 41 does not receive the caller's personal information input by the user from the communication terminal 1A (S26, N), the flow of the server 40 ends.
According to the above processing, the user can identify the caller by visually recognizing the personal information displayed on the LCD 17 at the start of the incoming call. If the sender's voiceprint information is not stored in the database in the server 40, the user inputs the sender's personal information and transmits it from the communication terminal 1A, and the sender's voiceprint information is stored in the database in the server 40. Personal information can be linked and saved. In the present embodiment, since the server 40 has a database collectively and the communication terminal 1A does not need to have a database individually, the circuit scale of the communication terminal 1A can be reduced.
In this embodiment, the communication terminal 1A does not have a database. However, the communication terminal 1A also has a database, and if the database in the communication terminal 1A does not have the voiceprint information of the caller, the database in the server 40 is used. It is good also as a structure to search.
Embodiment 3 FIG.
In the present embodiment, an external server performs voiceprint analysis, an external server searches for voiceprint information, and a communication terminal displays personal information in the identification of a caller at an incoming call of a communication terminal. An example will be described.
In the present embodiment, the caller identification is performed using the communication terminal and the server as in FIG. 3, but the communication terminal 50 is provided instead of the communication terminal 1A, and the server 60 is provided instead of the server 40.
First, the configuration and operation of the communication terminal 50 will be described. FIG. 7 is a block diagram showing an example of the configuration of the communication terminal according to Embodiment 3 of the present invention. In FIG. 7, the same reference numerals as those in FIG. 1 denote the same or corresponding parts as those in FIG. 1, and description thereof will be omitted here. The communication terminal 50 according to the present embodiment deletes the voiceprint analysis unit 6 and includes a voice processing unit 55 instead of the voice processing unit 5. The sound processing unit 55 outputs sound data as sound to the outside through the speaker 13 and outputs sound data necessary for voiceprint analysis to the signal processing unit 3. The audio processing unit 55 outputs audio received from the outside via the microphone 12 to the signal processing unit 3 as audio data.
The FLASH-ROM 10 does not have a database. Further, the MPU 7 transmits the voice data output from the voice processing unit 55 and the personal information of the caller newly input from the keypad 18 from the signal processing unit 3 to the server 60, and the signal processing unit 3 receives from the server 60. The personal information of the sender is displayed on the LCD 17.
Next, the configuration and operation of the server 60 will be described. FIG. 8 is a block diagram showing an example of the configuration of the server according to Embodiment 3 of the present invention. In FIG. 8, the same reference numerals as those in FIG. 8 denote the same or corresponding parts as those in FIG. 8, and description thereof will be omitted here. The server 60 in the present embodiment includes a signal processing unit 61 instead of the signal processing unit 41, a control unit 63 instead of the control unit 43, a storage unit 65 instead of the storage unit 44, and voice processing. A unit 62 and a voice print analysis unit 64 are provided.
The control unit 63 is connected to the signal processing unit 61, the data processing unit 42, the voice processing unit 62, the voice print analysis unit 64, and the storage unit 65, and controls each of them. In the transmission operation, the signal processing unit 61 outputs the non-voice data from the data processing unit 42 to the communication terminal 50. In the reception operation, the signal processing unit 61 outputs the received signal to the data processing unit 42 if the received signal is non-voice data other than voice, and outputs the received signal to the voice processing unit 62 if the received signal is voice data.
The voice processing unit 62 outputs voice data necessary for voiceprint analysis to the voiceprint analysis unit 64. The voiceprint analysis unit 64 calculates voiceprint information and outputs it to the storage unit 65 in the same manner as the voiceprint analysis unit 6 shown in FIG. The storage unit 65 stores a program of the control unit 63, data from the data processing unit 42, and the like. The storage unit 65 stores a database in which voiceprint information and personal information are linked. Further, the storage unit 65 temporarily stores the voiceprint information analyzed by the voiceprint analysis unit 64.
Next, a caller identification operation at the start of a call by incoming call will be described with reference to the flowcharts of FIG. 9 and FIG. FIG. 9 is a flowchart showing an example of a caller specifying operation of the communication terminal according to the third embodiment. FIG. 10 is a flowchart showing an example of the caller specifying operation of the server according to the third embodiment.
When there is an incoming call from a certain caller at the communication terminal 50 and a call is started (S31), the MPU 7 transfers the caller's voice necessary for voiceprint analysis from the signal processing unit 3 to the server 60 (S32).
In the server 60, the signal processing unit 61 receives the voice transferred from the communication terminal 50 (S41), and the control unit 63 performs voiceprint analysis of the caller's voice in the voiceprint analysis unit 64 (S42). The voiceprint information of the sender is searched by comparing the voiceprint information of the sender and the voiceprint information in the database of the storage unit 65 (S43), and whether or not the voiceprint information of the sender is already registered in the database. A determination is made (S44).
In the server 60, when the voiceprint information of the caller has been registered in the database (S44, Y), the control unit 63 reads the personal information linked to the voiceprint information of the caller from the database, and from the signal processing unit 61 to the communication terminal 50. (S45), the flow of the server 60 ends.
Next, in the communication terminal 50, when the signal processing unit 3 receives personal information from the server 60 (S33, Y), the received personal information is displayed on the LCD 17 (S34), and the flow of the communication terminal 50 ends.
In the server 60, when the voiceprint information of the caller is not registered in the database (S44, N), the control unit 63 transmits a signal indicating unregistration from the signal processing unit 61 to the communication terminal 50 (S46).
Next, in the communication terminal 50, when the signal processing unit 3 receives a signal indicating unregistration from the server 60 (S33, N), the MPU 7 says, “This caller's voiceprint information is not registered. Do you want to register? An unregistered message such as “is displayed on the LCD 17 (S35). The user who sees this unregistered message uses the keypad 18 to input whether to register voiceprint information.
Next, in the communication terminal 50, when the input from the user desires to register (S36, Y), the MPU 7 accepts the personal information of the caller input by the user using the keypad 18 (S37), and sends the call. The personal information of the person is transmitted from the signal processing unit 3 to the server 60 (S38), and the flow of the communication terminal 50 ends. If the input from the user does not desire registration (S36, N), the flow of the communication terminal 50 ends.
Next, in the server 60, when the signal processing unit 61 receives the caller's personal information input by the user from the communication terminal 50 (S47, Y), the control unit 63 sends the caller's voiceprint information and the caller's personal information. The information is linked and saved in the database of the storage unit 65 (S48), and the flow of the server 60 ends. When the signal processing unit 61 does not receive the caller's personal information input by the user from the communication terminal 50 (S47, N), the flow of the server 60 ends.
According to the above processing, the user can identify the caller by visually recognizing the personal information displayed on the LCD 17 at the start of the incoming call. If the sender's voiceprint information is not stored in the database in the server 60, the user inputs the sender's personal information and transmits it from the communication terminal 50, and the sender's voiceprint information is stored in the database in the server 60. Personal information can be linked and saved. In the present embodiment, the server 60 having a higher calculation performance than the communication terminal 50 can perform voiceprint analysis and comparison, whereby a person can be identified at a high speed, and the communication terminal 50 can individually identify a voiceprint analysis unit and a database. Therefore, the circuit scale of the communication terminal 50 can be reduced.
In Embodiments 1 to 3, the communication terminal that performs wireless communication has been described as an example. However, the present invention can also be applied to a communication terminal that performs wired communication. Moreover, although the visual display using LCD as a display part was demonstrated, it is also possible to perform an audio | voice display etc. Although the first to third embodiments have been described above, the configurations and operations of the communication terminals and servers described in the above-described embodiments are examples for realizing the present invention, and the configurations are the gist of the present invention. Needless to say, the present invention is not particularly limited within the range not departing from the above, and can be appropriately applied.

Industrial applicability

以上説明したように本発明によれば、着信による通話開始時に、画面に表示された発信者の個人情報を視認することにより、ユーザは発信者個人を正確に特定することができるようになり、例えば発信者を勘違いして相手に迷惑をかけるようなことを避けることができる。 As described above, according to the present invention, the user can accurately identify the individual caller by visually recognizing the caller's personal information displayed on the screen at the start of the incoming call. For example, it is possible to avoid annoying the other party by misunderstanding the caller.

Claims

A database for storing personal information and personal voiceprint information in association with each other;
A voiceprint analysis unit for extracting the voiceprint information of the caller from the voice of the caller at the start of a call by incoming call;
A computing unit that identifies personal information in the database for the sender by comparing the voiceprint information of the sender with voiceprint information in the database;
A display for displaying personal information of the identified caller;
A communication terminal comprising:

In the communication terminal according to claim 1,
An input unit for inputting personal information of the caller;
A communication terminal capable of registering, in the database, a set of the sender's voiceprint information and the sender's personal information obtained from the input unit.

A voiceprint information search server that can be connected to a communication terminal via a communication line,
A database for storing personal information and personal voiceprint information in association with each other;
A receiving unit for receiving information including voiceprint information from the communication terminal;
A computing unit for identifying personal information in the database by comparing the voiceprint information received by the receiving unit with the voiceprint information in the database;
A transmitting unit that transmits the personal information in the identified database to the communication terminal;
A voiceprint information search server.

A communication terminal that can be connected to the voiceprint information search server according to claim 3 via a communication line,
A voiceprint analysis unit for extracting the voiceprint information of the caller from the voice of the caller at the start of a call by incoming call;
A transmission unit that transmits the voiceprint information of the sender to the voiceprint information search server;
A receiver that receives the personal information of the sender from the voiceprint information search server;
A display unit for displaying personal information of the caller obtained from the receiving unit;
A communication terminal comprising:

In the communication terminal according to claim 4,
It has an input unit for entering the caller's personal information,
A communication terminal capable of registering, in the database of the voiceprint information search server, a set consisting of the voiceprint information of the sender and the personal information of the sender obtained from the input unit.

A personal information display system comprising: the voiceprint information search server according to claim 3; and the communication terminal according to claim 4.

A voiceprint information search server that can be connected to a communication terminal via a communication line,
A database for storing personal information and personal voiceprint information in association with each other;
A receiving unit for receiving information including voice from a communication terminal;
A voiceprint analysis unit for extracting voiceprint information from the voice received by the reception unit;
A computing unit for identifying personal information in the database by comparing the voiceprint information extracted by the voiceprint analysis unit with the voiceprint information in the database;
A transmitting unit that transmits the personal information in the identified database to the communication terminal;
A voiceprint information search server.

A communication terminal that can be connected to the voiceprint information search server according to claim 7 via a communication line,
A transmission unit for transferring a caller's voice to the voiceprint information search server at the start of a call by an incoming call;
A receiving unit for receiving personal information of the caller from the voiceprint information search server;
A display unit for displaying personal information of the caller obtained from the receiving unit;
A communication terminal comprising:

In the communication terminal according to claim 8,
It has an input unit for entering the caller's personal information,
A communication terminal capable of registering, in the database of the voiceprint information search server, a set consisting of the voiceprint information of the sender and the personal information of the sender obtained from the input unit.

A personal information display system comprising: the voiceprint information search server according to claim 7; and the communication terminal according to claim 8.

A personal information display method in a communication terminal that displays personal information of a caller at the start of an incoming call,
Storing personal personal information and personal voiceprint information as a database,
Extracting the caller's voiceprint information from the caller's voice;
Identifying personal information in the database for the sender by comparing the voiceprint information of the sender with voiceprint information in the database;
Displaying personal information of the identified caller;
A personal information display method in a communication terminal comprising:

A personal information display program stored in a computer-readable medium for causing a computer to display personal information of a caller at the start of an incoming call.
Storing personal personal information and personal voiceprint information as a database,
Extracting the caller's voiceprint information from the caller's voice;
Identifying personal information in the database for the sender by comparing the voiceprint information of the sender with voiceprint information in the database;
Displaying personal information of the identified caller;
Personal information display program that causes a computer to execute.