JP2010118955A

JP2010118955A - Portable cell phone terminal, voice processing method, and headset

Info

Publication number: JP2010118955A
Application number: JP2008291448A
Authority: JP
Inventors: Daisuke Sakai; 大輔酒井
Original assignee: Sony Ericsson Mobile Communications AB
Current assignee: Sony Mobile Communications AB
Priority date: 2008-11-13
Filing date: 2008-11-13
Publication date: 2010-05-27

Abstract

<P>PROBLEM TO BE SOLVED: To facilitate distinguishing the spoken voice of a user for the person called, even in a state where the user of the device is intoxicated. <P>SOLUTION: A terminal has an alcohol detecting part 110, which detects the concentration of the alcohol contained in breath and a microphone 108, which picks up the voice in the vicinity and generates voice signals. Moreover, the terminal has a communication processing part 101, which demodulates the received radio waves, extracts voice signals therefrom, converts the input voice signals to radio waves, and transmits the radio waves. According to the alcohol concentration detected by the alcohol detecting part 110, the speed and/or tone of the voice signals obtained by the microphone 108 are converted into a slower speed and or a lower tone and are transmitted to the communication processing part 101. <P>COPYRIGHT: (C)2010,JPO&INPIT

Description

本発明は、携帯電話端末、音声処理方法及びヘッドセットに関し、特に使用者の状態に応じて通話音声の速度や音程を変化させる技術に関する。 The present invention relates to a mobile phone terminal, a voice processing method, and a headset, and more particularly, to a technique for changing the speed and pitch of a call voice according to the state of a user.

一般に、例えば高齢化等に伴って受聴能力が低下してくると、その人における話し相手の発話音声を識別する能力も、低下する傾向にあることが知られている。このため、このような発話音声識別能力の低下を補う機能として、話し相手の声の速度を落として通話できる機能を備えた携帯電話端末が登場している。 In general, it is known that, for example, when the listening ability decreases with the aging of the population, the ability of the person to identify the voice of the other party also tends to decrease. For this reason, as a function to compensate for such a decrease in the spoken voice identification capability, a mobile phone terminal having a function capable of making a call while reducing the speed of the voice of the other party has appeared.

例えば特許文献１には、話し相手の話す言葉を理解しやすくするために、話し相手の発した音声をより遅い速度に変換する機能を有する電話機について記載されている。
特開２００１−２６８１７５号公報 For example, Patent Document 1 describes a telephone having a function of converting a voice uttered by a partner to a slower speed in order to make it easier to understand the words spoken by the partner.
JP 2001-268175 A

ところで、話し相手の発話音声を識別する能力が低下するのは、受聴能力が低下したときのみではない。例えば酔っぱらっている場合等にも、人は話し相手の発話音声を識別しにくくなることがある。このような場合には、人は相手の話す内容を正確に理解することが難しくなるため、通話先の相手との会話が成り立たなくなってしまう恐れがある。ところが、このように酔っぱらった状態における話し相手の発話音声の識別能力の低下を補う技術は、これまでに考案されてこなかった。 By the way, it is not only when the listening ability is lowered that the ability to identify the voice of the other party is lowered. For example, when a person is drunk, it may be difficult for a person to identify the voice of the other party. In such a case, it is difficult for a person to accurately understand what the other party speaks, and there is a risk that the conversation with the other party will not be established. However, no technology has been devised so far to compensate for the decline in the speech recognition ability of the other party in such a drunken state.

本発明はかかる点に鑑みてなされたものであり、装置の使用者が酔っぱらっている状態でも、使用者の発話音声を通話相手が識別しやすくすることを目的とする。 The present invention has been made in view of this point, and an object of the present invention is to make it easier for a call partner to identify a voice of a user even when the user of the apparatus is drunk.

本発明の携帯電話端末は、呼気に含まれるアルコールの濃度を検出するアルコール検出部と、周囲の音声を拾って音声信号を生成するマイクロフォンとを備えた。また、受信した電波を復調して音声信号を取り出すとともに、入力された音声信号を電波に変換して送信する通信処理部とを備えた。そして、アルコール検出部で検出されたアルコールの濃度に応じて、マイクロフォンで得られた音声信号の速度及び／又は音程を、より遅い速度又はより低い音程に変換して、通信処理部に伝送するようにしたものである。 The mobile phone terminal of the present invention includes an alcohol detection unit that detects the concentration of alcohol contained in exhaled breath, and a microphone that picks up surrounding sounds and generates a sound signal. In addition, the apparatus includes a communication processing unit that demodulates the received radio wave and extracts an audio signal, and converts the input audio signal into a radio wave and transmits the radio signal. Then, according to the alcohol concentration detected by the alcohol detection unit, the speed and / or pitch of the voice signal obtained by the microphone is converted into a slower speed or lower pitch and transmitted to the communication processing unit. It is a thing.

このようにしたことで、携帯電話端末の使用者の呼気にアルコール成分が含まれていた場合には、その濃度に応じて、使用者の声の速度や音程がより遅い速度又はより低い音程に変換されて通信処理部に伝送され、アンテナを介して通話相手に伝送される。 As a result, when the alcohol component is contained in the breath of the user of the mobile phone terminal, the speed or pitch of the user's voice is slower or lower depending on the concentration. It is converted and transmitted to the communication processing unit, and transmitted to the other party through the antenna.

本発明によると、携帯電話端末の使用者が酒に酔っており、その話す声が通常よりも早まっていたり高くなっており、聴き取りづらいような状況でも、通話相手は使用者の話す言葉をより理解しやすくなる。 According to the present invention, even if the user of the mobile phone terminal is drunk, and the speaking voice is faster or higher than usual, it is difficult to listen to the other party. It becomes easier to understand.

以下、本発明の一実施の形態を、添付図面を参照して説明する。図１は、本実施の形態の携帯電話端末１００の内部構成例を示すブロック図である。携帯電話端末１００は、マイクロプロセッサ等よりなる制御部１２０を備え、制御部１２０は、制御信号が伝送される制御ライン１５０又はデータが伝送されるデータライン１６０を介して、携帯電話端末１００内の各部と接続されている。そして制御部１２０は、これらのラインを通して各部と通信を行い、各部の動作制御を行う。 Hereinafter, an embodiment of the present invention will be described with reference to the accompanying drawings. FIG. 1 is a block diagram illustrating an internal configuration example of the mobile phone terminal 100 according to the present embodiment. The mobile phone terminal 100 includes a control unit 120 made of a microprocessor or the like. The control unit 120 includes a control line 150 in which a control signal is transmitted or a data line 160 in which data is transmitted. Connected to each part. And the control part 120 communicates with each part through these lines, and controls operation | movement of each part.

制御ライン１５０には、通信処理部１０１と、表示部１０３と、操作部１０４と、メモリ１０５と、音声処理部１０６とが接続されている。通信処理部１０１にはアンテナ１０２が接続してあり、通信処理部１０１は、アンテナ１０２で得られた電波を復調して音声信号を取り出したり、データライン１６０を介して伝送された音声信号を電波に変換してアンテナ１０２に出力する処理を行う。 A communication processing unit 101, a display unit 103, an operation unit 104, a memory 105, and an audio processing unit 106 are connected to the control line 150. An antenna 102 is connected to the communication processing unit 101, and the communication processing unit 101 demodulates the radio wave obtained by the antenna 102 to extract an audio signal or receives an audio signal transmitted via the data line 160 as an electric wave. Is converted to, and output to the antenna 102.

表示部１０３は、液晶パネル等で構成される表示パネルと、その表示パネルの駆動部とで構成され、着信した電話の電話番号や、アンテナ１０２を通して送受信される電子メールの文章等が表示される。操作部１０４は、数字などのダイヤルキーやその他の各種機能キーで構成される。そしてそれらのキーがユーザに押下された場合に、操作内容に応じた操作信号を生成して制御部１２０に供給する。 The display unit 103 includes a display panel constituted by a liquid crystal panel or the like and a drive unit of the display panel, and displays a telephone number of an incoming call, a text of an e-mail transmitted and received through the antenna 102, and the like. . The operation unit 104 includes dial keys such as numerals and other various function keys. When those keys are pressed by the user, an operation signal corresponding to the operation content is generated and supplied to the control unit 120.

メモリ１０５は、図示せぬＲＯＭ（Read Only Memory）やＲＡＭ（Random Access Memory）で構成され、メモリ１０５には、携帯電話端末１００の制御に必要なソフトウェア等が格納されている。またメモリ１０５には、制御部１２０で制御が行われる際に一時的に発生するデータ等も格納される。 The memory 105 is composed of a ROM (Read Only Memory) or a RAM (Random Access Memory) (not shown), and the memory 105 stores software necessary for controlling the mobile phone terminal 100. The memory 105 also stores data that is temporarily generated when the control unit 120 performs control.

音声処理部１０６には、入力された音声信号を音声として出力するスピーカ１０７と、周囲の音声を拾って音声信号に変換するマイクロフォン１０８とが接続してある。音声処理部１０６では、スピーカ１０７から入力された音声の出力処理と、マイクロフォン１０８からの入力された音声の音声処理が行われる。また音声処理部１０６は、アンテナ１０２が受信して通信処理部１０１で取り出された音声信号や、マイクロフォン１０８で得られた音声信号の速度を所定の速度に変換する音声変換部１０６ａを備える。 Connected to the audio processing unit 106 are a speaker 107 that outputs an input audio signal as audio, and a microphone 108 that picks up surrounding audio and converts it into an audio signal. The audio processing unit 106 performs output processing of audio input from the speaker 107 and audio processing of audio input from the microphone 108. The audio processing unit 106 also includes an audio conversion unit 106 a that converts the speed of the audio signal received by the antenna 102 and extracted by the communication processing unit 101 or the speed of the audio signal obtained by the microphone 108 to a predetermined speed.

音声変換部１０６ａは、例えば音声信号の基本周期を検出して同周期の信号を生成し、生成した信号を元の音声信号に合成することで、音声のピッチを変化させずに話速を遅い速度に変換する。また音声変換部１０６ａは、例えば音声信号をフーリエ変換して周波数を全体的に変位させた後に、逆フーリエ変換を行うことによって音声の音程を変化させる。音声変換部１０６ａには脈拍センサ１０９とアルコールセンサ１１０が接続されており、これらのセンサによる検出値に応じて、変換する速度又は音程を変えて変換処理を行う。この変換処理の詳細については後述する。 For example, the voice conversion unit 106a detects the basic period of the voice signal, generates a signal of the same period, and synthesizes the generated signal with the original voice signal, thereby reducing the speech speed without changing the voice pitch. Convert to speed. The voice conversion unit 106a changes the pitch of the voice by performing an inverse Fourier transform, for example, after performing a Fourier transform on the voice signal and displacing the entire frequency. A pulse sensor 109 and an alcohol sensor 110 are connected to the voice conversion unit 106a, and conversion processing is performed by changing the conversion speed or pitch according to the detection values of these sensors. Details of this conversion processing will be described later.

脈拍センサ１０９は、図示せぬ発光部と受光部とを備え、発光部が発光した赤外線が人体内に流れる血液に反射することにより発生する反射光を、受光部が受光する構成としてある。脈拍センサ１０９は、受光部で得た光の量によって脈拍の数を検出する。なお、本実施の形態では、発光部と受光部とを備えた脈拍センサ１０９を例に挙げたが、その他の方式で脈拍を検出するセンサを用いるようにしてもよい。 The pulse sensor 109 includes a light emitting unit and a light receiving unit (not shown), and the light receiving unit receives reflected light generated when the infrared light emitted from the light emitting unit is reflected on blood flowing in the human body. The pulse sensor 109 detects the number of pulses based on the amount of light obtained by the light receiving unit. In the present embodiment, the pulse sensor 109 including the light emitting unit and the light receiving unit is taken as an example. However, a sensor that detects the pulse by other methods may be used.

アルコールセンサ１１０は、例えば半導体ガスセンサ等で構成され、呼気に含まれるアルコールの濃度を検出する。なお、本実施の形態ではアルコールセンサ１１０として半導体ガスセンサを用いた場合を例示しているが、電気化学式等の他の方式でアルコール濃度を検出するセンサに適用してもよい。 The alcohol sensor 110 is constituted by a semiconductor gas sensor, for example, and detects the concentration of alcohol contained in exhaled breath. In the present embodiment, a semiconductor gas sensor is used as the alcohol sensor 110. However, the present invention may be applied to a sensor that detects alcohol concentration by other methods such as an electrochemical method.

次に、図２のブロック図を参照して、脈拍センサ１０９とアルコールセンサ１１０での検出結果に基づいて、音声の速度又は音程が変化されるまでの処理を行う各ブロックの説明を行う。アルコールセンサ１１０は、呼気に含まれるアルコールの濃度に応じた電圧を生成してコンパレータ１１１に出力する。コンパレータ１１１は、アルコールセンサ１１０から印加された電圧をデジタルの値に変換して、アルコール濃度検出値として制御部１２０に供給する。 Next, with reference to the block diagram of FIG. 2, each block that performs processing until the speed or pitch of the sound is changed based on the detection results of the pulse sensor 109 and the alcohol sensor 110 will be described. The alcohol sensor 110 generates a voltage corresponding to the concentration of alcohol contained in exhaled breath and outputs the voltage to the comparator 111. The comparator 111 converts the voltage applied from the alcohol sensor 110 into a digital value and supplies it to the control unit 120 as an alcohol concentration detection value.

脈拍センサ１０９は、検出した脈拍に応じたパルスを生成してオペアンプ１１２に出力する。脈拍センサ１０９での脈拍のカウントは、例えば最初の脈拍と次の脈拍との間隔を測定してその逆数をとることによって行われる。脈拍数検出の精度を向上させるために、所定数の拍数を検出してその平均化し、得られた平均値を脈拍数とするようにしてもよい。オペアンプ１１２は、脈拍センサ１０９から入力されるパルスを所定のゲインで増幅してコンパレータ１１３に出力する。コンパレータ１１３は、オペアンプ１１２で増幅されたパルスをデジタルの値に変換して、脈拍数の検出値として制御部１２０に供給する。 The pulse sensor 109 generates a pulse corresponding to the detected pulse and outputs it to the operational amplifier 112. The pulse count by the pulse sensor 109 is performed, for example, by measuring the interval between the first pulse and the next pulse and taking the reciprocal thereof. In order to improve the accuracy of pulse rate detection, a predetermined number of beats may be detected and averaged, and the average value obtained may be used as the pulse rate. The operational amplifier 112 amplifies the pulse input from the pulse sensor 109 with a predetermined gain and outputs the amplified pulse to the comparator 113. The comparator 113 converts the pulse amplified by the operational amplifier 112 into a digital value and supplies it to the control unit 120 as a detected value of the pulse rate.

制御部１２０は、アルコールセンサ１１０及び／又は脈拍センサ１０９から入力された検出値を、音声の速度や音程を変化させるための音声変換制御信号に変換して音声変換部１０６ａに出力する。 The control unit 120 converts the detection value input from the alcohol sensor 110 and / or the pulse sensor 109 into a voice conversion control signal for changing the speed and pitch of the voice, and outputs the voice conversion control signal to the voice conversion unit 106a.

音声変換部１０６ａは、制御部１２０から入力された音声変換制御信号に基づいて、マイクロフォン１０６から入力された音声信号の速度や音程を所定の速度又は音程に変換して、データライン１６０（図１参照）を介して通信処理部１０１に供給する。また、通信処理部１０１（図１参照）から供給された音声信号の速度や音程を所定の速度又は音声に変換して、スピーカ１０７に供給する。 The voice conversion unit 106a converts the speed and pitch of the voice signal input from the microphone 106 into a predetermined speed or pitch based on the voice conversion control signal input from the control unit 120, and the data line 160 (FIG. 1). To the communication processing unit 101. Further, the speed and pitch of the audio signal supplied from the communication processing unit 101 (see FIG. 1) is converted into a predetermined speed or audio and supplied to the speaker 107.

次に、図３を参照して、本実施の形態による携帯電話端末１００の構成例を説明する。図３は、携帯電話端末１００の外観図である。図３に示す携帯電話端末１００は折り畳み型の形状をしており、図３には開いた状態が示されている。 Next, a configuration example of the mobile phone terminal 100 according to the present embodiment will be described with reference to FIG. FIG. 3 is an external view of the mobile phone terminal 100. The mobile phone terminal 100 shown in FIG. 3 has a folding shape, and FIG. 3 shows an open state.

通話時に使用者の耳に接触する部分には、スピーカ１０７が配置されており、スピーカ１０７と隣接する位置に脈拍センサ１０９が配置されている。つまり脈拍センサ１０９は、通話時に接触する使用者の耳に赤外線を出射し、耳からの反射光を検出することにより使用者の脈拍数を検出する。 A speaker 107 is disposed at a portion that contacts the user's ear during a call, and a pulse sensor 109 is disposed at a position adjacent to the speaker 107. That is, the pulse sensor 109 detects the user's pulse rate by emitting infrared light to the user's ear that contacts during a call and detecting the reflected light from the ear.

なお、図４に示したように、携帯電話端末１００の背面等に脈拍センサ１０９を設けるようにしてもよい。配置位置は、通話中の使用者の指先が触れる位置とする。 In addition, as shown in FIG. 4, you may make it provide the pulse sensor 109 in the back surface etc. of the mobile telephone terminal 100. FIG. The arrangement position is a position where the fingertip of the user during a call touches.

図３に戻って説明を続けると、スピーカ１０７が配置された筐体の上部にはアンテナ１０２が設けてあり、スピーカ１０７の下方には表示部１０３が設けられている。 Returning to FIG. 3, the description is continued. The antenna 102 is provided in the upper part of the casing in which the speaker 107 is arranged, and the display unit 103 is provided below the speaker 107.

ヒンジ１１５を介して接続されたもう一方の筐体には、各種キー等よりなる操作部１０４が設けられており、操作部１０４の下方に、マイクロフォン１０８が配置されている。そしてマイクロフォン１０８と隣接した位置に、アルコールセンサ１１０が配置されている。すなわちアルコールセンサ１１０は、通話中の使用者から出される呼気がかかる位置に配置されており、通話時に、使用者の呼気に含まれるアルコール濃度を検出する。 The other housing connected via the hinge 115 is provided with an operation unit 104 including various keys, and a microphone 108 is disposed below the operation unit 104. An alcohol sensor 110 is disposed at a position adjacent to the microphone 108. In other words, the alcohol sensor 110 is disposed at a position where the exhaled breath from the user during the call is applied, and detects the alcohol concentration contained in the exhalation of the user during the call.

なお、本実施の形態では折り畳み型の携帯電話端末を例に挙げるが、これに限定されるものではない。例えば、表示部１０３と操作部１０４とが同一の筐体に配置されたスティックタイプや、表示部１０３と操作部１０４とが筐体の厚み方向に重ねて配置されるスライドタイプ等の、他の形態の携帯電話端末に適用してもよい。 In the present embodiment, a foldable mobile phone terminal is taken as an example, but the present invention is not limited to this. For example, other forms such as a stick type in which the display unit 103 and the operation unit 104 are arranged in the same casing, and a slide type in which the display unit 103 and the operation unit 104 are arranged in the thickness direction of the casing are used. It may be applied to other mobile phone terminals.

次に、図５のフローチャートを参照して、携帯電話端末１００による音声の変換処理の例について説明する。まず、操作部１０４の通話ボタン（図示略）等が押下されることにより通話が開始すると（ステップＳ１１）、アルコールセンサ１１０によって、使用者の呼気に含まれるアルコールの濃度が検出される（ステップＳ１２）。検出の結果、使用者の呼気にアルコールが含まれていたか否かが判断され（ステップＳ１３）、アルコールが検出された場合には、その検出量が０．１５ｍｇ／ｌ（リットル）未満であったかが判断される（ステップＳ１４）。 Next, an example of voice conversion processing by the mobile phone terminal 100 will be described with reference to the flowchart of FIG. First, when a call is started by pressing a call button (not shown) of the operation unit 104 (step S11), the alcohol sensor 110 detects the concentration of alcohol contained in the user's breath (step S12). ). As a result of the detection, it is determined whether or not alcohol is included in the user's breath (step S13). If alcohol is detected, whether or not the detected amount is less than 0.15 mg / l (liter). Determination is made (step S14).

アルコールセンサ１１０によるアルコールの検出量が０．１５ｍｇ／ｌ未満であった場合には、音声変換処理は行われず、ステップＳ１２に戻って処理が続けられる。次に、アルコールの検出量が０．１５ｍｇ／ｌから０．２５ｍｇ／ｌの間であるかが判断される（ステップＳ１５）。アルコールの検出量が０．１５ｍｇ／ｌから０．２５ｍｇ／ｌの範囲内であった場合には、通話相手の声の速度が０．８倍に変換され、かつ使用者の声の音程が１度下の音程に変換される（ステップＳ１６）。 If the amount of alcohol detected by the alcohol sensor 110 is less than 0.15 mg / l, the voice conversion process is not performed, and the process returns to step S12 and continues. Next, it is determined whether the detected amount of alcohol is between 0.15 mg / l and 0.25 mg / l (step S15). When the detected amount of alcohol is in the range of 0.15 mg / l to 0.25 mg / l, the voice speed of the other party is converted to 0.8 times and the pitch of the user's voice is 1 The pitch is converted to a lower pitch (step S16).

すなわち、アンテナ１０２（図１参照）を通して得られた通話相手の音声信号が、音声変換部１０６ａによって０．８倍の速度に変換されて、スピーカ１０７から放音される。そして、マイクロフォン１０８で得られた使用者の声の音程が１度下の音程に下げられて通信処理部１０１に伝送され、アンテナ１０２を介して通話相手に伝送される。 That is, the voice signal of the communication partner obtained through the antenna 102 (see FIG. 1) is converted to a speed 0.8 times by the voice conversion unit 106a and emitted from the speaker 107. Then, the pitch of the user's voice obtained by the microphone 108 is lowered to a pitch one degree lower, transmitted to the communication processing unit 101, and transmitted to the other party via the antenna 102.

アルコール検出量が０．１５ｍｇ／ｌ未満でもなく、０．１５ｍｇ／ｌ〜０．２５ｍｇ／ｌの範囲内でもない場合、つまり０．２５ｍｇ／ｌより多い場合には、通話相手の声の速度が０．７倍に、使用者の声の音程が２度下に変換される（ステップＳ１７）。０．２５ｍｇ／ｌは、酒酔い運転の基準値である。すなわち、アルコールセンサ１１０で検出されたアルコールの濃度が高いほど、通話相手の声の速度が遅くなり、使用者の声の音程も低くなるようになる。 When the detected alcohol amount is not less than 0.15 mg / l, and is not within the range of 0.15 mg / l to 0.25 mg / l, that is, more than 0.25 mg / l, the speed of the other party's voice is The pitch of the user's voice is converted down by a factor of 0.7 (step S17). 0.25 mg / l is a reference value for drunk driving. That is, the higher the alcohol concentration detected by the alcohol sensor 110, the slower the speed of the voice of the other party and the lower the pitch of the user's voice.

次に、通話が終了したか否かが判断され（ステップＳ１８）、通話が続いている間は、ステップＳ１２に戻って処理が続けられる。通話が終了した場合には、処理も終了となる。 Next, it is determined whether or not the call has ended (step S18), and while the call continues, the process returns to step S12 to continue the process. When the call is finished, the process is also finished.

ステップＳ１３において、アルコールセンサ１１０によってアルコールが検出されなかった場合は、脈拍センサ１０９による使用者の脈拍数のカウントが開始される（ステップＳ１９）。そして、脈拍のカウント開始から、予め設定された所定の時間が経過したか否かが判断され（ステップＳ２０）、所定時間が経過していない間は、脈拍のカウント処理が続けられる。脈拍のカウント開始から所定の時間が経過した場合には、脈拍数のカウントが停止される（ステップＳ２１）。 In step S13, when alcohol is not detected by the alcohol sensor 110, counting of the user's pulse rate by the pulse sensor 109 is started (step S19). Then, it is determined whether or not a predetermined time set in advance has elapsed from the start of pulse counting (step S20), and the pulse counting process is continued while the predetermined time has not elapsed. When a predetermined time has elapsed from the start of pulse counting, the counting of the pulse rate is stopped (step S21).

次に、カウントによって得られた脈拍数が０以上８０未満であるか否かが判断され（ステップＳ２２）、脈拍数が０以上８０未満であった場合には、音声変換処理が行われず、脈拍数をカウントするカウンタの値がリセットされる（ステップＳ２６）。 Next, it is determined whether or not the pulse rate obtained by counting is not less than 0 and less than 80 (step S22). If the pulse rate is not less than 0 and less than 80, the voice conversion process is not performed and the pulse rate is not determined. The value of the counter that counts the number is reset (step S26).

次に、脈拍数が８０以上１００未満であるか否かが判断され（ステップＳ２３）、脈拍数が８０以上１００未満であった場合には、使用者の声の速度が０．９倍に変換されて（ステップＳ２４）、通信処理部１０１とアンテナ１０２を経て通話相手に伝送される。脈拍数が０以上８０未満でもなく、８０以上１００未満でもない場合、つまり１００以上であった場合には、使用者の声の速度が０．８倍に変換される（ステップＳ２５）。 Next, it is determined whether or not the pulse rate is 80 or more and less than 100 (step S23). If the pulse rate is 80 or more and less than 100, the speed of the user's voice is converted to 0.9 times. Then (step S24), the data is transmitted to the other party via the communication processing unit 101 and the antenna 102. If the pulse rate is not 0 or more and less than 80, but not 80 or more and less than 100, that is, 100 or more, the speed of the user's voice is converted to 0.8 times (step S25).

音声変換処理がされた後はカウンタの値がリセットされ（ステップＳ２６）、通話が終了したか否かが判断される（ステップＳ２７）。通話が継続している場合には、ステップＳ１２に戻って処理が続けられる。通話が終了した場合には、処理もここで終了となる。 After the voice conversion process is performed, the counter value is reset (step S26), and it is determined whether or not the call is finished (step S27). If the call is continuing, the process returns to step S12 and the process is continued. If the call is terminated, the process is also terminated here.

脈拍数が０以上８０未満の正常値である場合には、通常通りの通話速度又は音程で通話が行われ、興奮状態又は緊張状態であることによって脈拍数が８０を超えた場合には、その数が増える程に使用者の声の速度がより遅い速度に変換されるようになる。 When the pulse rate is a normal value of 0 or more and less than 80, the call is performed at the normal call speed or pitch, and when the pulse rate exceeds 80 due to the excitement state or the tension state, As the number increases, the speed of the user's voice is converted to a slower speed.

なお、図５のフローチャートに示した各数値は一例であり、実施はこれらの数値に限定されるものではない。例えば、正常時における使用者の脈拍数をメモリ１０５等に登録しておき、登録した脈拍数と測定された脈拍数との差の大きさによって、変換する速度や音程を変化させるような構成としてもよい。また、図５の処理では、アルコールの検出を行ってから次に脈拍数の検出を行っているが、アルコールセンサ１１０によるアルコール濃度の検出と脈拍センサ１０９による脈拍数の検出とを同時に行うようにしてもよい。 In addition, each numerical value shown in the flowchart of FIG. 5 is an example, and implementation is not limited to these numerical values. For example, the user's pulse rate under normal conditions is registered in the memory 105 or the like, and the conversion speed and pitch are changed depending on the difference between the registered pulse rate and the measured pulse rate. Also good. In the process of FIG. 5, the detection of alcohol is performed after the detection of alcohol, but the detection of the alcohol concentration by the alcohol sensor 110 and the detection of the pulse rate by the pulse sensor 109 are performed simultaneously. May be.

上述した実施の形態によれば、使用者が酔っぱらっている場合には、その呼気に含まれるアルコールの濃度に応じて、通話相手の声の速度がよりゆっくり再生されるようになる。これにより、酔っぱらっていることにより通話相手の音声を識別能力が低下している場合であっても、通話相手の話す内容を使用者が理解できるようになる。 According to the above-described embodiment, when the user is drunk, the speed of the other party's voice is reproduced more slowly according to the concentration of alcohol contained in the exhalation. Thereby, even if the ability to discriminate the voice of the other party is reduced due to being drunk, the user can understand the contents spoken by the other party.

また、使用者が酔っている場合には、検出されたアルコールの濃度に応じて、使用者の声の音程もより低い音程に変換されて、通話相手に伝送されるようになる。これにより、使用者が酔っぱらっており、通常よりも高く聴き取りづらい音程で話している場合にも、通話相手にはその声の音程がより低い音程に変換されて届くようになる。よって、通話相手は、使用者が話す内容をより理解しやすくなる。 When the user is drunk, the pitch of the user's voice is converted to a lower pitch according to the detected alcohol concentration and transmitted to the other party. As a result, even when the user is drunk and speaking at a pitch that is higher than normal and difficult to hear, the voice of the other party is converted to a lower pitch and reaches the other party. Therefore, the other party can easily understand what the user speaks.

さらに、脈拍センサ１０９により検出された脈拍数の数に応じて、通話者の声の速度がより遅い速度に変換されるため、使用者が興奮状態又は緊張状態にあり、通常より早口で話してしまっている場合であっても、通話相手はその内容を理解しやすくなる。 Furthermore, since the speed of the caller's voice is converted to a slower speed according to the number of pulse rates detected by the pulse sensor 109, the user is in an excited state or a nervous state and speaks faster than usual. Even if it is closed, the other party can easily understand the content.

なお、上述した実施の形態では、使用者の呼気からアルコールが検出されたり、脈拍数が所定数以上であった場合に、使用者の声の速度や音程を変化させる構成を例に挙げたが、これらの程度に応じて、通話相手の声の大きさを大きくするような処理を行ってもよい。 In the above-described embodiment, the configuration in which the speed and pitch of the user's voice is changed when alcohol is detected from the user's breath or the pulse rate is equal to or greater than the predetermined number is given as an example. Depending on the degree, processing for increasing the loudness of the other party may be performed.

なお、上述した実施の形態では、本発明を携帯電話端末１００に適用した例を挙げたが、スピーカ１０７とマイクロフォン１０８を備えたヘッドセット等に適用するようにしてもよい。 In the above-described embodiment, the example in which the present invention is applied to the mobile phone terminal 100 has been described. However, the present invention may be applied to a headset including the speaker 107 and the microphone 108.

図６は、本発明が適用されたヘッドセットの構成例を示す図であり、図１と対応する箇所には同一の符号を付してある。図６に示したヘッドセット２００は耳掛け型であり、使用者の耳に掛けられる耳掛け１１６に、脈拍センサ１０９を取り付けてある。これにより、通話中の使用者の耳を流れる血流の量から、使用者の脈拍数を検出することができる。なお、脈拍数検出の精度をより高めるために、脈拍センサ１０９を使用者の耳に挟む形状としてもよい。 FIG. 6 is a diagram illustrating a configuration example of a headset to which the present invention is applied, and portions corresponding to those in FIG. 1 are denoted by the same reference numerals. The headset 200 shown in FIG. 6 is an ear hook type, and a pulse sensor 109 is attached to an ear hook 116 to be hung on a user's ear. Thereby, a user's pulse rate can be detected from the amount of blood flow that flows through the user's ear during a call. In order to further improve the accuracy of pulse rate detection, the pulse sensor 109 may be sandwiched between the user's ears.

耳掛け１１６の近辺にはスピーカ１０７が配置されており、ケーブル１１７を通して伝送された通話相手の音声が、使用者に伝送させる。耳掛け１１６の根本部分からは、任意の角度に折り曲げ可能なアームが伸びており、その先端部分にマイクロフォン１０６とアルコールセンサ１１０とが配置されている。 A speaker 107 is disposed in the vicinity of the ear hook 116, and the voice of the communication partner transmitted through the cable 117 is transmitted to the user. An arm that can be bent at an arbitrary angle extends from the base portion of the ear hook 116, and the microphone 106 and the alcohol sensor 110 are disposed at the tip portion thereof.

図７は、ヘッドセット２００の内部構成例を示すブロック図であり、図１と対応する箇所には同一の符号を付してある。ヘッドセット２００は、通話機能を有する外部の装置（図示略）と接続されることにより、外部装置を介して伝送される通話相手の音声が入力される音声入力部１１８を備える。そして、音声入力部１１８を通して伝送された音声信号の速度や音程を、所定の速度や音程に変換する音声変換部１０６ａと、音声変換部１０６ａに検出結果を出力する脈拍センサ１０９と、アルコールセンサ１１０とを備える。音声変換部１０６ａには、音声変換後の音声を出力するスピーカ１０７と、音声変換部１０６ａに音声信号を供給するマイクロフォン１０８が接続されている。 FIG. 7 is a block diagram showing an example of the internal configuration of the headset 200, and portions corresponding to those in FIG. The headset 200 includes an audio input unit 118 that is connected to an external device (not shown) having a call function, and receives the voice of the other party transmitted through the external device. The voice conversion unit 106a converts the speed and pitch of the voice signal transmitted through the voice input unit 118 into a predetermined speed and pitch, the pulse sensor 109 outputs the detection result to the voice conversion unit 106a, and the alcohol sensor 110. With. The audio conversion unit 106a is connected to a speaker 107 that outputs audio after audio conversion and a microphone 108 that supplies an audio signal to the audio conversion unit 106a.

なお、上述した実施形態例における一連の処理は、ハードウェアにより実行することができるが、ソフトウェアにより実行させることもできる。一連の処理をソフトウェアにより実行させる場合には、そのソフトウェアを構成するプログラムを、専用のハードウェアに組み込まれているコンピュータ、または、各種のプログラムをインストールすることで各種の機能を実行することが可能な例えば汎用のパーソナルコンピュータなどに所望のソフトウェアを構成するプログラムをインストールして実行させる。 The series of processes in the above-described embodiment can be executed by hardware, but can also be executed by software. When a series of processing is executed by software, it is possible to execute various functions by installing programs that make up the software into a computer built into dedicated hardware, or by installing various programs. For example, a general-purpose personal computer or the like installs and executes a program constituting desired software.

また、本明細書において、ソフトウェアを構成するプログラムを記述するステップは、記載された順序に沿って時系列的に行われる処理はもちろん、必ずしも時系列的に処理されなくとも、並列的あるいは個別に実行される処理をも含むものである。 Further, in this specification, the step of describing the program constituting the software is not limited to the processing performed in time series according to the described order, but is not necessarily performed in time series, either in parallel or individually. The process to be executed is also included.

本発明の一実施の形態による携帯電話端末の内部構成例を示すブロック図である。It is a block diagram which shows the internal structural example of the mobile telephone terminal by one embodiment of this invention. 本発明の一実施の形態による音声変換処理にかかわる各ブロックの構成例を示すブロック図である。It is a block diagram which shows the structural example of each block concerning the audio | voice conversion process by one embodiment of this invention. 本発明の一実施の形態による携帯電話端末の構成例を示す外観図である。It is an external view which shows the structural example of the mobile telephone terminal by one embodiment of this invention. 本発明の一実施の形態による端末を閉じた状態の例を示す斜視図である。It is a perspective view which shows the example of the state which closed the terminal by one embodiment of this invention. 本発明の一実施の形態による音声変換処理の例を示すフローチャートである。It is a flowchart which shows the example of the audio | voice conversion process by one embodiment of this invention. 本発明の他の実施例によるヘッドセットの構成例を示す外観図である。It is an external view which shows the structural example of the headset by the other Example of this invention. 本発明の他の実施例によるヘッドセットの内部構成例を示すブロック図である。It is a block diagram which shows the internal structural example of the headset by the other Example of this invention.

Explanation of symbols

１００…携帯電話端末、１０１…通信処理部、１０２…アンテナ、１０３…表示部、１０４…操作部、１０５…メモリ、１０６…音声処理部、１０６…マイクロフォン、１０６ａ…音声変換部、１０７…スピーカ、１０８…マイクロフォン、１０９…脈拍センサ、１１０…アルコールセンサ、１１１…コンパレータ、１１２…オペアンプ、１１３…コンパレータ、１１５…ヒンジ、１１６…耳掛け、１１７…ケーブル、１１８…音声入力部、１２０…制御部、１５０…制御ライン、１６０…データライン、２００…ヘッドセット DESCRIPTION OF SYMBOLS 100 ... Mobile phone terminal 101 ... Communication processing part 102 ... Antenna 103 ... Display part 104 ... Operation part 105 ... Memory 106 ... Audio | voice processing part 106 ... Microphone 106a ... Voice conversion part 107 ... Speaker DESCRIPTION OF SYMBOLS 108 ... Microphone, 109 ... Pulse sensor, 110 ... Alcohol sensor, 111 ... Comparator, 112 ... Operational amplifier, 113 ... Comparator, 115 ... Hinge, 116 ... Ear hook, 117 ... Cable, 118 ... Audio | voice input part, 120 ... Control part, 150 ... Control line, 160 ... Data line, 200 ... Headset

Claims

An alcohol detector for detecting the concentration of alcohol contained in exhaled breath;
A microphone that picks up the surrounding sound and generates a sound signal;
A communication processing unit that demodulates the received radio wave and extracts an audio signal, converts the input audio signal into a radio wave, and transmits the radio signal;
Depending on the alcohol concentration detected by the alcohol detection unit, the speed and / or pitch of the audio signal obtained by the microphone is converted to a slower speed or lower pitch and transmitted to the communication processing unit. A mobile phone terminal equipped with a voice converter.

It has a pulse detector that detects the user's pulse,
The mobile phone terminal according to claim 1, wherein the voice conversion unit converts the speed of the voice signal extracted by the communication processing unit to a slower speed according to the number of pulses detected by the pulse detection unit.

A speaker that outputs the voice signal processed by the voice converter as voice;
The voice conversion unit has a slower speed of the voice signal extracted by the communication processing unit in accordance with the alcohol concentration detected by the alcohol detection unit and / or the number of pulses detected by the pulse detection unit. The mobile phone terminal according to claim 2, wherein the mobile phone terminal is converted into a speed and output to the speaker.

The voice conversion unit increases the volume of the voice signal extracted by the communication processing unit according to the alcohol concentration detected by the alcohol detection unit and / or the number of pulses detected by the pulse detection unit. The mobile phone terminal according to claim 3, wherein the mobile phone terminal is converted into a volume.

The mobile phone terminal according to claim 3, wherein the alcohol detection unit is disposed in the vicinity of the speaker, and the pulse detection unit is disposed in the vicinity of the microphone.

Detecting the concentration of alcohol contained in exhaled breath;
Picking up surrounding sounds and generating an audio signal;
Converting the speed of the extracted audio signal to a slower speed in response to the detected alcohol concentration;
Converting the speed-converted audio signal into a radio wave and transmitting the radio signal.

An alcohol detector for detecting the concentration of alcohol contained in exhaled breath;
A microphone that picks up the surrounding sound and generates a sound signal;
An audio input unit to which an audio signal is input;
A voice conversion unit that converts the speed and / or pitch of the voice signal input to the voice input unit into a slower speed or a lower pitch according to the alcohol concentration detected by the alcohol detection unit;
A headset comprising: a speaker that outputs an audio signal whose speed is converted by the audio conversion unit as audio.