JP2008103878A

JP2008103878A - Broadcast receiver

Info

Publication number: JP2008103878A
Application number: JP2006283516A
Authority: JP
Inventors: Fumihiko Aoyama; 文彦青山; Shuichi Matsumoto; 修一松本
Original assignee: Alpine Electronics Inc
Current assignee: Alpine Electronics Inc
Priority date: 2006-10-18
Filing date: 2006-10-18
Publication date: 2008-05-01
Anticipated expiration: 2026-10-18
Also published as: JP4739162B2

Abstract

<P>PROBLEM TO BE SOLVED: To provide "a broadcast receiver" capable of more efficiently receiving speech input of reception frequency. <P>SOLUTION: Speech feature data representing features of uttered speeches of frequencies at which broadcast signals can be received in excellent reception states are stored in a priority recognition dictionary 11, and speech feature data representing features of uttered speeches of other frequencies are stored in a general recognition dictionary 12. Then when speech input is generated, a speech recognition engine 4 is caused to perform speech recognition processing for the frequency that the input speech represents by using the priority recognition dictionary 11 and when the speech recognition is successful, the recognized frequency is set as a reception frequency in a tuner 2. When the speech recognition ends in failure, the speech recognition engine 4 is caused to perform speech recognition for the frequency that the input speech represents by using the general recognition dictionary 12 and the recognized frequency is set as a reception frequency in the tuner 2. <P>COPYRIGHT: (C)2008,JPO&INPIT

Description

本発明は、ラジオ放送やＴＶ放送を受信する放送受信機において、受信チャネルの指定をユーザから音声によって受け付ける技術に関するものである。 The present invention relates to a technology for receiving a designation of a reception channel by voice from a user in a broadcast receiver that receives a radio broadcast or a TV broadcast.

ラジオ放送やＴＶ放送を受信する放送受信機において、受信チャネルの指定をユーザから音声によって受け付ける技術としては、ユーザから放送局名の音声入力を受け付け、音声入力された放送局名の放送局が行っている放送の周波数に受信周波数を切り替える技術が知られている。
特開2001-156595号公報 In a broadcast receiver that receives a radio broadcast or a TV broadcast, as a technique for receiving a designation of a reception channel by voice from a user, a voice station name is input from the user, and the broadcast station with the voice input is performed by the broadcast station name. A technique for switching a reception frequency to a broadcast frequency is known.
Japanese Patent Laid-Open No. 2001-156595

ラジオ放送やＴＶ放送を受信する放送受信機において、受信チャネルの指定をユーザから音声によって受け付ける場合、ユーザから周波数の音声入力を受け付け、受け付けた周波数に、受信周波数を切り替えるようにすることも考えられる。
しかし、この場合には、音声認識の候補となる周波数、すなわち、ユーザから音声入力される可能性のある周波数が多数存在するために、放送局名の音声入力を受け付ける場合に比べ、音声認識処理の処理量が大きくなり、また、誤認識の発生確率も大きくなる。なお、放送局名の音声入力を受け付ける場合においても、音声認識の候補となる放送局名が多数存在する場合は同様となる。 In a broadcast receiver that receives a radio broadcast or a TV broadcast, when receiving a designation of a reception channel from a user by voice, it may be possible to accept a voice input of a frequency from the user and switch the reception frequency to the received frequency. .
However, in this case, since there are many frequencies that are candidates for speech recognition, that is, frequencies that may be speech input from the user, speech recognition processing is performed compared to when receiving speech input of a broadcasting station name. And the probability of occurrence of misrecognition also increases. Note that the same applies to the case where voice input of a broadcast station name is accepted, when there are many broadcast station names that are candidates for speech recognition.

そこで、本発明は、より効率的に、精度良く、受信チャネルの指定をユーザから音声によって受け付けることのできる放送受信機を提供することを課題とする。 Therefore, an object of the present invention is to provide a broadcast receiver that can receive a designation of a reception channel from a user by voice more efficiently and accurately.

前記課題達成のために、本発明は、放送受信機を、受信チャネルとして設定された放送チャネルを受信するチューナと、受信状態が良好な放送チャネルを識別する受信状態良好チャネル識別手段と、ユーザの発話音声が表す放送チャネルを音声認識する音声認識手段と、前記音声認識手段が音声認識した放送チャネルを前記受信チャネルとして前記チューナに設定する受信チャネル設定手段と、前記受信状態良好チャネル識別手段が識別した受信状態が良好な放送チャネルの各々を表す各発話音声の音声認識用の音声認識辞書を優先音声認識辞書として作成し、他の放送チャネルの各々を表す各発話音声の音声認識用の音声認識辞書を一般音声認識辞書として作成する音声認識辞書設定手段とより構成し、前記音声認識手段において、前記優先音声認識辞書を用いたユーザの発話音声が表す放送チャネルの音声認識を行うと共に、当該優先音声認識辞書を用いた音声認識に失敗した場合に、前記一般音声認識辞書を用いたユーザの発話音声が表す放送チャネルの音声認識を行うことにより、前記ユーザの発話音声が表す放送チャネルの音声認識を行うようにしたものである。 To achieve the above object, the present invention provides a broadcast receiver, a tuner that receives a broadcast channel set as a reception channel, a reception state good channel identification unit that identifies a broadcast channel with a good reception state, a user's Voice recognition means for recognizing the broadcast channel represented by the uttered voice, reception channel setting means for setting the broadcast channel recognized by the voice recognition means as the reception channel in the tuner, and the good reception state channel identification means A speech recognition dictionary for speech recognition of each utterance voice representing each of the broadcast channels with good reception status is created as a priority speech recognition dictionary, and voice recognition for speech recognition of each utterance speech representing each of the other broadcast channels is created. Voice recognition dictionary setting means for creating a dictionary as a general voice recognition dictionary; When speech recognition of the broadcast channel represented by the user's speech using the speech recognition dictionary is performed and speech recognition using the priority speech recognition dictionary fails, the user's speech using the general speech recognition dictionary is The voice recognition of the broadcast channel represented by the user's uttered voice is performed by performing the voice recognition of the broadcast channel represented.

ここで、一般的に言って、ユーザは、チューナに設定する受信チャネルとして、受信状態が良好な放送チャネルを選定する蓋然性が高い。したがって、以上のように、まず受信状態が良好な放送チャネルだけを対象として音声認識を行い、当該音声認識に失敗した場合にのみ、他の放送チャネルを対象とする音声認識を行うようにすることにより、効率的かつ誤り少なく音声入力された放送チャネルを認識することができるようになる。 Here, generally speaking, a user has a high probability of selecting a broadcast channel having a good reception state as a reception channel set in the tuner. Therefore, as described above, first, speech recognition is performed only for a broadcast channel with a good reception state, and speech recognition for other broadcast channels is performed only when the speech recognition fails. As a result, it is possible to recognize a broadcast channel in which voice is input efficiently and with few errors.

ここで、以上のような放送受信機において、前記受信状態良好チャネル識別手段は、各放送チャネルの受信状態を当該放送チャネルで受信した放送をユーザに対して出力することなく調査するバックグランドシークによって、受信状態が良好な放送チャネルを識別するものであってよい。 Here, in the broadcast receiver as described above, the reception state good channel identification means performs a background seek to investigate the reception state of each broadcast channel without outputting the broadcast received on the broadcast channel to the user. The broadcast channel having a good reception state may be identified.

また、以上のような放送受信機において、前記優先音声認識辞書には、前記受信状態良好チャネル識別手段が識別した受信状態が良好な放送チャネルの各々に対応づけて、当該放送チャネルを表す発話音声の特徴を登録し、前記一般音声認識辞書には、前記受信状態良好チャネル識別手段が識別した受信状態が良好な放送チャネル以外の放送チャネルの各々に対応づけて、当該放送チャネルを表す発話音声の特徴を登録するようにしてもよい。そして、前記音声認識手段において、前記優先音声認識辞書にユーザの発話音声の特徴と第１のしきい値以上マッチする特徴が登録されている場合に、優先音声認識辞書に登録されている、ユーザの発話音声の特徴と最もマッチする特徴に対応する放送チャネルを前記ユーザの発話音声が表す放送チャネルとして認識し、前記優先音声認識辞書にユーザの発話音声の特徴と第１のしきい値以上マッチする特徴が登録されていない場合に、前記一般音声認識辞書にユーザの発話音声の特徴と第２のしきい値以上マッチする特徴が登録されているかどうかを調べ、前記一般音声認識辞書にユーザの発話音声の特徴と第２のしきい値以上マッチする特徴が登録されている場合に、当該一般音声認識辞書に登録されている、ユーザの発話音声の特徴と最もマッチする特徴に対応する放送チャネルを前記ユーザの発話音声が表す放送チャネルとして認識するようにしてよい。なお、この場合には、前記第１のしきい値は第２のしきい値よりも小さく設定するようにする。 In the broadcast receiver as described above, the priority speech recognition dictionary is associated with each of the broadcast channels with good reception status identified by the channel with good reception status identification speech, and the speech speech representing the broadcast channel. In the general speech recognition dictionary, the speech voice representing the broadcast channel is associated with each of the broadcast channels other than the broadcast channel with the good reception state identified by the reception state good channel identification means. Features may be registered. In the speech recognition means, the user registered in the priority speech recognition dictionary is registered in the priority speech recognition dictionary if a feature that matches the feature of the user's uttered speech is more than the first threshold value. The broadcast channel corresponding to the feature that best matches the feature of the uttered speech is recognized as the broadcast channel represented by the user's uttered speech, and the priority speech recognition dictionary matches the feature of the user's uttered speech with a first threshold or more. When the feature to be registered is not registered, it is checked whether or not the feature that matches the feature of the user's uttered speech in the general speech recognition dictionary for a second threshold or more is registered, and the user's speech feature is registered in the general speech recognition dictionary. When a feature that matches the feature of the uttered speech with the second threshold or more is registered, the feature of the user's uttered speech registered in the general speech recognition dictionary A broadcast channel corresponding to the feature that matches may be to recognize a broadcast channel that represents the speech of the user. In this case, the first threshold value is set smaller than the second threshold value.

このように第１、第２のしきい値を設定することにより、ユーザによって音声入力される蓋然性のより大きい受信状態な良好な放送チャネルが、よりユーザが音声入力した放送チャネルとして認識され易くなる。したがって、このようにすることにより、音声入力された放送チャネルの誤認識発生が、より抑制されることが期待できる。 By setting the first and second threshold values in this way, a good broadcast channel having a high probability of being input by the user and having a high probability of being received is more easily recognized as a broadcast channel input by the user. . Therefore, by doing so, it can be expected that the occurrence of erroneous recognition of a broadcast channel input by voice is further suppressed.

さて、ここで、前記放送チャネルが周波数チャネルである場合には、以上のような放送受信機は、前記放送チャネルを表す発話音声は当該放送チャネルの中心周波数を発話した音声とあるものとして構成するようにしてもよい。なお、この場合には、放送受信機を、さらに、表示装置と、前記受信状態良好チャネル識別手段が識別した受信状態が良好な放送チャネルの中心周波数を前記表示装置に表示する受信状態良好周波数表示手段とを備えて構成することが好ましい。または、以上のような放送受信機は、前記放送チャネルを表す発話音声は当該放送チャネルで放送を行っている放送局の放送局名を発話した音声であるものとして構成するようにしてもよい。なお、この場合には、放送受信機を、表示装置と、前記受信状態良好チャネル識別手段が識別した受信状態が良好な放送チャネルで放送を行っている放送局の放送局名を前記表示装置に表示する受信状態良好放送局表示手段とを備えて構成することが好ましい。 Now, when the broadcast channel is a frequency channel, the broadcast receiver as described above is configured such that the utterance voice representing the broadcast channel is the voice uttered the center frequency of the broadcast channel. You may do it. In this case, the broadcast receiver is further displayed on the display device and the center frequency of the broadcast channel with the good reception state identified by the display state good channel identification means is displayed on the display device. And means. Alternatively, the broadcast receiver as described above may be configured such that the uttered voice representing the broadcast channel is the voice uttered by the broadcast station name of the broadcast station broadcasting on the broadcast channel. In this case, the broadcast receiver is connected to the display device and the name of the broadcast station that is broadcasting on the broadcast channel with the good reception status identified by the reception status good channel identification means. It is preferable that the apparatus is provided with a broadcasting station display means for displaying a good reception state.

さて、前記課題達成のために、本発明は、さらに、受信チャネルとして設定された放送チャネルを受信するチューナと、１または複数の放送チャネルを登録したプリセット登録手段と、ユーザのプリセット放送チャネル選択操作に応じて、前記プリセット登録手段に登録されている放送チャネルの内から選定した放送チャネルを前記チューナに受信チャネルとして設定するプリセット選局手段と、ユーザの発話音声が表す放送チャネルを音声認識する音声認識手段と、前記音声認識手段が音声認識した放送チャネルを前記受信チャネルとして前記チューナに設定する受信チャネル設定手段と、前記プリセット登録手段に登録されている放送チャネルの各々を表す各発話音声の音声認識用の音声認識辞書を優先音声認識辞書として作成し、他の放送チャネルの各々を表す各発話音声の音声認識用の音声認識辞書を一般音声認識辞書として作成する音声認識辞書設定手段とを備えた放送受信機を提供する。ただし、前記音声認識手段は、前記優先音声認識辞書を用いたユーザの発話音声が表す放送チャネルの音声認識を行うと共に、当該優先音声認識辞書を用いた音声認識に失敗した場合に、前記一般音声認識辞書を用いたユーザの発話音声が表す放送チャネルの音声認識を行うことにより、前記ユーザの発話音声が表す放送チャネルの音声認識を行うものである。 In order to achieve the above object, the present invention further includes a tuner for receiving a broadcast channel set as a reception channel, preset registration means for registering one or more broadcast channels, and a preset broadcast channel selection operation by a user. And preset channel selection means for setting a broadcast channel selected from among the broadcast channels registered in the preset registration means as a reception channel in the tuner, and voice for voice recognition of the broadcast channel represented by the user's uttered voice Recognizing means, receiving channel setting means for setting the broadcast channel recognized by the voice recognizing means as the receiving channel in the tuner, and audio of each utterance voice representing each of the broadcast channels registered in the preset registering means Create a speech recognition dictionary for recognition as a priority speech recognition dictionary, Provides a broadcast receiver with a speech recognition dictionary setting means for generating a speech recognition dictionary for speech recognition of each speech as a general voice recognition dictionary representing each of the feed channels. However, the speech recognition means performs speech recognition of the broadcast channel represented by the user's speech using the priority speech recognition dictionary, and when the speech recognition using the priority speech recognition dictionary fails, the general speech By performing speech recognition of the broadcast channel represented by the user's uttered speech using the recognition dictionary, speech recognition of the broadcast channel represented by the user's uttered speech is performed.

ここで、一般的に言って、ユーザは、チューナに設定する受信チャネルとして、プリセット選局操作で選局可能な放送チャネルとしてプリセット登録手段に登録されている放送チャネルを選定する蓋然性が高い。したがって、以上のように、まず、プリセット登録手段に登録されているだけを対象として音声認識を行い、当該音声認識に失敗した場合にのみ、他の放送チャネルを対象とする音声認識を行うようにすることにより、効率的かつ誤り少なく音声入力された放送チャネルを認識することができるようになる。 Here, generally speaking, a user is highly likely to select a broadcast channel registered in the preset registration unit as a broadcast channel that can be selected by a preset channel selection operation as a reception channel set in the tuner. Therefore, as described above, first, speech recognition is performed only for those registered in the preset registration means, and speech recognition for other broadcast channels is performed only when the speech recognition fails. By doing so, it becomes possible to recognize a broadcast channel in which voice is input efficiently and with few errors.

以上のように、本発明によれば、より効率的に、精度良く、受信チャネルの指定をユーザから音声によって受け付けることができる。 As described above, according to the present invention, designation of a reception channel can be received from a user by voice more efficiently and accurately.

以下、本発明の実施形態に係る放送受信機の実施形態を、自動車に搭載されるラジオ放送受信機への適用を例にとり説明する。
図１に、本実施形態に係るラジオ放送受信機の構成を示す。
図示するように、ラジオ放送受信機は、アンテナ１を備えたチューナ２、マイクロフォン３から入力する音声の音声認識処理を行う音声認識エンジン４、各種メッセージ音声を生成する音声生成部５、チューナ２の出力する受信音声と音声生成部５が生成したメッセージ音声を必要に応じて合成しスピーカ７から出力する音声出力部６、入力装置８、表示装置９、制御部１０、優先認識辞書１１、一般認識辞書１２、辞書データベース１３、シーク周波数メモリ１４、プリセットメモリ１５とを備えている。 Hereinafter, an embodiment of a broadcast receiver according to an embodiment of the present invention will be described taking application to a radio broadcast receiver mounted on an automobile as an example.
FIG. 1 shows a configuration of a radio broadcast receiver according to the present embodiment.
As shown in the figure, the radio broadcast receiver includes a tuner 2 having an antenna 1, a speech recognition engine 4 that performs speech recognition processing of speech input from a microphone 3, a speech generation unit 5 that generates various message speech, and a tuner 2. The received voice to be output and the message voice generated by the voice generation unit 5 are synthesized as necessary and output from the speaker 7, the voice output unit 6, the input device 8, the display device 9, the control unit 10, the priority recognition dictionary 11, and the general recognition. A dictionary 12, a dictionary database 13, a seek frequency memory 14, and a preset memory 15 are provided.

さて、このような構成において、プリセットメモリ１５には、予めユーザによって１からｎまでのプリセット番号に対して登録された周波数のリストが登録されている。
また、シーク周波数メモリ１４には、制御部１０が、バックグランドシーク処理によって探索した、受信状態が所定レベル以上良好な周波数のリストが登録される。すなわち、制御部１０は、チューナ２の受信周波数を最小周波数と最大周波数の間で漸次的に変化させながら、各周波数における放送信号の受信状態を調べ、所定レベル以上良好な受信状態で放送信号を受信できる周波数をシーク周波数メモリ１４に登録するバックグランドシーク処理を繰り返し行う。ここで、このバックグランドシーク動作において、受信状態を調べるために設定された受信周波数でチューナ２が受信した放送信号を復調した音声はチューナ２から出力されない。また、このバックグランドシーク処理は、ラジオ放送受信機がユーザによってラジオ放送番組を聴くために利用されていない期間にのみ行うようにしても、常時行うようにしてもよい。なお、バックグランドシーク処理を常時行う場合には、ラジオ放送受信機がユーザによってラジオ放送番組を聴くために利用されている期間中は、ユーザのラジオ放送番組の利用を妨げないよう、チューナ２を間欠的に充分に短い時間だけバックグランドシーク処理に用いるようにする。または、チューナ２を二重化して設け、一方のチューナ２をラジオ放送番組の受信用に、他方のチューナ２をバックグランドシーク処理専用に用いることにより、ユーザのラジオ放送番組の利用を妨げることなく、常時、バックグランドシーク処理を行えるようにしてもよい。 In such a configuration, a list of frequencies registered in advance for preset numbers 1 to n by the user is registered in the preset memory 15.
Also, in the seek frequency memory 14, a list of frequencies having a good reception state of a predetermined level or higher, which is searched by the control unit 10 by the background seek process, is registered. That is, the control unit 10 checks the reception state of the broadcast signal at each frequency while gradually changing the reception frequency of the tuner 2 between the minimum frequency and the maximum frequency, and determines the broadcast signal in a good reception state above a predetermined level. The background seek process for registering the receivable frequencies in the seek frequency memory 14 is repeated. Here, in this background seek operation, the sound obtained by demodulating the broadcast signal received by the tuner 2 at the reception frequency set for examining the reception state is not output from the tuner 2. The background seek process may be performed only during a period when the radio broadcast receiver is not used for listening to the radio broadcast program by the user, or may be performed at all times. When the background seek process is always performed, the tuner 2 is set so as not to prevent the user from using the radio broadcast program while the radio broadcast receiver is used by the user to listen to the radio broadcast program. The background seek process is used intermittently for a sufficiently short time. Alternatively, the tuner 2 is provided in a duplex manner, and one tuner 2 is used for receiving a radio broadcast program and the other tuner 2 is used exclusively for background seek processing, thereby preventing the user from using the radio broadcast program, The background seek process may always be performed.

次に、辞書データベース１３には、チューナ２に受信周波数として設定可能な各周波数チャネルの中心周波数の各々に対して登録した、当該中心周波数を発話した音声の特徴を表すデータである音声特徴データが格納されている。すなわち、チューナ２に対して、５３０ｋＨｚから１３００ｋＨｚまで１ｋＨｚきざみに７７１個の受信周波数を設定することができるのであれば、図２に示すように「５３０ｋＨｚ」と発話した音声の特徴を表する音声特徴データ、「５３１ｋＨｚ」と発話した音声の特徴を表する音声特徴データ、...、「１２９９ｋＨｚ」と発話した音声の特徴を表する音声特徴データ、「１３００ｋＨｚ」と発話した音声の特徴を表する音声特徴データというように、７７１個の周波数に対応する音声特徴データを格納する。 Next, in the dictionary database 13, voice feature data that is registered for each center frequency of each frequency channel that can be set as a reception frequency in the tuner 2 and that represents the characteristics of the voice that uttered the center frequency is stored. Stored. That is, if it is possible to set 771 reception frequencies in increments of 1 kHz from 530 kHz to 1300 kHz for the tuner 2, an audio feature representing the characteristics of the speech uttered “530 kHz” as shown in FIG. Data, voice feature data representing the characteristics of the voice uttered as “531 kHz”, voice characteristics data representing the characteristics of the voice spoken as “1299 kHz”, and characteristics of the voice uttered as “1300 kHz” Audio feature data corresponding to 771 frequencies, such as audio feature data, is stored.

次に、優先認識辞書データと一般認識辞書データには、制御部１０が行う辞書更新処理によって、辞書データベース１３に格納されている音声特徴データが振り分けられて格納される。
図３に、この辞書更新処理の手順を示す。
図示するように、制御部１０は、この辞書更新処理において、チューナ２の最小受信周波数から最大受信周波数までの間で受信周波数を漸次的に変化させながら受信状態が良好な周波数を探索するバックグランドシーク処理を完了する度に（ステップ３０２）、今回完了したバックグランドシーク処理によって、シーク周波数メモリ１４に格納されている受信状態が良好な周波数のリストに変更が発生したかどうかを調べる（ステップ３０４）。そして、変更が発生していない場合には、そのまま、ステップ３０２に戻って、次の、バックグランドシーク処理の完了を待つ。 Next, the voice feature data stored in the dictionary database 13 is sorted and stored in the priority recognition dictionary data and the general recognition dictionary data by dictionary update processing performed by the control unit 10.
FIG. 3 shows the procedure of the dictionary update process.
As shown in the figure, in this dictionary update process, the control unit 10 searches for a frequency with a good reception state while gradually changing the reception frequency from the minimum reception frequency to the maximum reception frequency of the tuner 2. Each time the seek process is completed (step 302), it is checked whether or not the background seek process completed this time has changed the list of frequencies having a good reception state stored in the seek frequency memory 14 (step 304). ). If no change has occurred, the process returns to step 302 and waits for the completion of the next background seek process.

一方、シーク周波数メモリ１４に格納されている周波数のリストに変更があれば、辞書データベース１３から必要な音声特徴データを読み出しながら、優先認識辞書１１の内容を、シーク周波数メモリ１４に格納されている周波数リストに含まれる各周波数を発話した音声の特徴を表す音声特徴データのセットに更新する（ステップ３０６）。また、一般認識辞書１２の内容を、辞書データベース１３から必要な音声特徴データを読み出しながら、チューナ２に受信周波数として設定可能な周波数であって、シーク周波数メモリ１４に格納されている周波数リストに含まれていない各周波数を発話した音声の特徴を表す音声特徴データのセットに更新する（ステップ３０８）。 On the other hand, if there is a change in the list of frequencies stored in the seek frequency memory 14, the contents of the priority recognition dictionary 11 are stored in the seek frequency memory 14 while reading out necessary voice feature data from the dictionary database 13. Update to a set of voice feature data representing the characteristics of the voice that uttered each frequency included in the frequency list (step 306). The contents of the general recognition dictionary 12 are included in the frequency list stored in the seek frequency memory 14, which is a frequency that can be set as a reception frequency in the tuner 2 while reading out necessary voice feature data from the dictionary database 13. It is updated to a set of speech feature data representing the features of speech uttered at each frequency that has not been spoken (step 308).

そして、ステップ３０２に戻って、次の、バックグランドシーク処理の完了を待つ。
この結果、たとえば、バックグランドシーク処理によってシーク周波数メモリ１４の周波数リストが５４０ｋＨｚ、６００ｋＨｚ、７５０ｋＨｚ、８００ｋＨｚ、９１０ｋＨｚ５つの周波数のリストに更新された場合には、図２に示すように、優先認識辞書１１には、この５つの周波数各々についての、当該周波数を発話した音声の特徴を表す音声特徴データが格納され、一般認識辞書１２には、辞書データベース１３に登録されている７７１個の音声特徴データのうちの残る７６６個の周波数についての当該周波数を発話した音声の特徴を表す音声特徴データが格納されることになる。 Then, the process returns to step 302 and waits for the completion of the next background seek process.
As a result, for example, when the frequency list of the seek frequency memory 14 is updated to a list of five frequencies of 540 kHz, 600 kHz, 750 kHz, 800 kHz, and 910 kHz by background seek processing, as shown in FIG. Is stored for each of the five frequencies, and the general feature dictionary 12 contains 771 voice feature data registered in the dictionary database 13. Of the remaining 766 frequencies, speech feature data representing the features of speech uttering the frequencies is stored.

さて、このようなラジオ放送受信機において、受信したラジオ放送番組をユーザに対して出力する際の動作は次のようになる。
すなわち、チューナ２は、制御部１０から受信周波数として設定された周波数チャネルの放送信号を受信し、受信した放送信号を復調した音声を音声出力部６に受信音声として出力する。
一方、制御部１０は、入力装置８を介してユーザから指定された周波数や、音声入力によって指定された周波数を、受信周波数としてチューナ２に設定し、当該周波数の周波数チャネルの放送信号の受信及び受信音声の出力を行わせる。
なお、ユーザからの入力装置８を介した受信周波数とする周波数の指定は、ユーザの入力装置８に対する周波数増減操作に応じて受信周波数とする周波数を漸次的に切り替えることにより受け付ける他、ユーザの入力装置８に対するプリセット番号選択操作に応じて、ユーザから選択されたプリセット番号に対する周波数としてプリセットメモリ１５に登録されている周波数を受信周波数とすることにより受け付ける。また、ユーザからの入力装置８を介した受信周波数とする周波数の指定は、ユーザの入力装置８に対するシーク操作に応じて、シーク周波数メモリ１４に登録されている周波数を、順次、受信周波数とすることによっても受け付ける。 Now, in such a radio broadcast receiver, the operation when the received radio broadcast program is output to the user is as follows.
That is, the tuner 2 receives a broadcast signal of a frequency channel set as a reception frequency from the control unit 10 and outputs a sound obtained by demodulating the received broadcast signal to the sound output unit 6 as a received sound.
On the other hand, the control unit 10 sets a frequency designated by the user via the input device 8 or a frequency designated by voice input in the tuner 2 as a reception frequency, and receives a broadcast signal of a frequency channel of the frequency. Output received audio.
In addition, designation | designated of the frequency used as the receiving frequency via the input device 8 from a user accepts by changing the frequency used as a receiving frequency gradually according to the frequency increase / decrease operation with respect to the user's input device 8, and a user's input In response to the preset number selection operation on the device 8, the frequency registered in the preset memory 15 as the frequency for the preset number selected by the user is used as the reception frequency. In addition, the designation of the frequency to be the reception frequency via the input device 8 from the user is made such that the frequencies registered in the seek frequency memory 14 are sequentially set as the reception frequency in accordance with the seek operation on the input device 8 by the user. Also accept by.

一方、制御部１０は、ユーザからの音声入力による受信周波数とする周波数の指定の受け付けを、受信周波数音声入力受付処理によって行う。
図４に、この受信周波数音声入力受付処理の手順を示す。
図示するように、制御部１０は、この処理において、まず、ユーザからの音声入力の発生を待つ（ステップ４０２）。ユーザからの音声入力の有無は、たとえば、入力装置８に設けた、ユーザが音声入力時に押し下げる音声入力ボタンの押し下げ状態等に応じて判定する。
そして、音声入力が発生したならば、入力された音声を入力音声として録音しながら、音声認識エンジン４に、優先認識辞書１１を用いた音声認識を行わせ、入力音声に最もマッチする周波数を算出する（ステップ４０４）すなわち、このステップ４０４では、音声認識エンジン４は、優先認識辞書１１に格納されている音声特徴データであって、当該音声特徴データが表す特徴が、入力された音声の特徴に最も近似する音声特徴データを、当該近似の度合いを表すマッチスコアと共に算定し、算定した音声特徴データに対応する周波数を入力音声に最もマッチする周波数として制御部１０に、算定したマッチスコアと共に通知する。 On the other hand, the control unit 10 receives a designation of a frequency to be a reception frequency by a voice input from a user by a reception frequency voice input reception process.
FIG. 4 shows the procedure of the reception frequency voice input acceptance process.
As shown in the figure, in this process, the control unit 10 first waits for a voice input from the user (step 402). The presence / absence of voice input from the user is determined according to, for example, a state where a voice input button provided in the input device 8 is pressed down when the user inputs voice.
If a voice input occurs, the voice recognition engine 4 performs voice recognition using the priority recognition dictionary 11 while recording the input voice as the input voice, and calculates the frequency that best matches the input voice. That is, in step 404, the speech recognition engine 4 is the speech feature data stored in the priority recognition dictionary 11, and the feature represented by the speech feature data is converted into the feature of the input speech. The most approximate speech feature data is calculated together with a match score representing the degree of approximation, and the frequency corresponding to the calculated speech feature data is notified to the control unit 10 as the frequency that best matches the input speech along with the calculated match score. .

そして、次に、このように優先認識辞書１１を用いた音声認識によって算出した周波数のマッチスコアが、所定のしきい値Ｔｈ１以上であるかどうかを調べ（ステップ４０６）、Ｔｈ１以上であれば、算出した周波数を受信周波数としてチューナ２に設定し（ステップ４１４）、ステップ４０２に戻って、次の音声入力の発生を待つ。 Next, it is checked whether or not the frequency match score calculated by the speech recognition using the priority recognition dictionary 11 is equal to or greater than a predetermined threshold Th1 (step 406). The calculated frequency is set in the tuner 2 as a reception frequency (step 414), and the process returns to step 402 to wait for the next voice input.

一方、優先認識辞書１１を用いた音声認識によって算出した周波数のマッチスコアが、しきい値Ｔｈ１未満であれば（ステップ４０６）、音声認識エンジン４に、一般認識辞書１２を用いた音声認識を行わせ、録音しておいた入力音声に最もマッチする周波数を算出する（ステップ４０８）。すなわち、このステップ４０８では、音声認識エンジン４は、一般認識辞書１２に格納されている音声特徴データであって、当該音声特徴データが表す特徴が、録音しておいた入力音声の特徴に最も近似する音声特徴データを、当該近似の度合いを表すマッチスコアと共に算定し、算定した音声特徴データに対応する周波数を入力音声に最もマッチする周波数として制御部１０に、算定したマッチスコアと共に通知する。 On the other hand, if the frequency match score calculated by speech recognition using the priority recognition dictionary 11 is less than the threshold Th1 (step 406), the speech recognition engine 4 performs speech recognition using the general recognition dictionary 12. The frequency that best matches the recorded input voice is calculated (step 408). That is, in this step 408, the speech recognition engine 4 is the speech feature data stored in the general recognition dictionary 12, and the feature represented by the speech feature data is the closest to the feature of the recorded input speech. The voice feature data to be calculated is calculated together with a match score representing the degree of approximation, and the frequency corresponding to the calculated voice feature data is notified to the control unit 10 as the frequency that best matches the input voice, together with the calculated match score.

そして、次に、このように一般認識辞書１２を用いた音声認識によって算出した周波数のマッチスコアが、所定のしきい値Ｔｈ２以上であるかどうかを調べ（ステップ４１０）、Ｔｈ２以上であれば、算出した周波数を受信周波数としてチューナ２に設定し（ステップ４１６）、ステップ４０２に戻って、次の音声入力の発生を待つ。 Then, next, it is checked whether or not the frequency match score calculated by the speech recognition using the general recognition dictionary 12 is not less than a predetermined threshold Th2 (step 410). The calculated frequency is set in the tuner 2 as a reception frequency (step 416), and the process returns to step 402 to wait for the next voice input.

一方、一般認識辞書１２を用いた音声認識によって算出した周波数のマッチスコアがしきい値Ｔｈ２未満であれば、音声生成部５に、入力された音声に対応する周波数は存在しない旨を表すメッセージ音声を生成させ、音声出力部６を介してスピーカ７から出力させる（ステップ４１２）。そして、ステップ４０２に戻って、次の音声入力の発生を待つ。 On the other hand, if the match score of the frequency calculated by the speech recognition using the general recognition dictionary 12 is less than the threshold value Th2, a message speech indicating that there is no frequency corresponding to the input speech in the speech generation unit 5. Is generated and output from the speaker 7 via the audio output unit 6 (step 412). Then, the process returns to step 402 to wait for the next voice input.

以上、受信周波数音声入力受付処理について説明した。
このような受信周波数音声入力受付処理によれば、たとえば、優先認識辞書１１と一般認識辞書１２の内容が図２に示す状態にあるときに、ユーザが、現在受信状態が良好な周波数である７５０ｋＨｚにチューナ２の受信周波数を設定するために、「７５０ｋＨｚ」と発話して音声入力した場合には、優先認識辞書１１に、この７５０ｋＨｚの音声特徴データが格納されているので、優先認識辞書１１のみを用いた音声認識によって、しきい値Ｔｈ１以上のマッチスコアを持つ周波数として７５０ｋＨｚが認識され、受信周波数としてチューナ２に設定されることになる。一方、ユーザが、現在受信状態が良好な周波数ではない５３４ｋＨｚにチューナ２の受信周波数を設定するために、「５３４ｋＨｚ」と発話して音声入力した場合には、優先認識辞書１１に、この５３４ｋＨｚの音声特徴データが格納されていないので、優先認識辞書１１を用いた音声認識ではしきい値Ｔｈ１以上のマッチスコアを持つ周波数は算出されないが、ひき続き行われる一般認識辞書１２を用いた音声認識によって、しきい値Ｔｈ２以上のマッチスコアを持つ周波数として５３４ｋＨｚが認識され、支障なく受信周波数としてチューナ２に設定されることになる。 The reception frequency voice input reception process has been described above.
According to such reception frequency voice input reception processing, for example, when the contents of the priority recognition dictionary 11 and the general recognition dictionary 12 are in the state shown in FIG. In order to set the reception frequency of the tuner 2, when speech is input by speaking “750 kHz”, since the voice feature data of 750 kHz is stored in the priority recognition dictionary 11, only the priority recognition dictionary 11 is stored. 750 kHz is recognized as a frequency having a match score equal to or higher than the threshold value Th1, and is set in the tuner 2 as a reception frequency. On the other hand, when the user utters “534 kHz” and inputs a voice in order to set the reception frequency of the tuner 2 to 534 kHz, which is not a good frequency in the reception state, the 534 kHz of this 534 kHz Since no voice feature data is stored, a frequency having a match score equal to or higher than the threshold value Th1 is not calculated in the voice recognition using the priority recognition dictionary 11, but the voice recognition using the general recognition dictionary 12 is performed continuously. Thus, 534 kHz is recognized as a frequency having a match score equal to or greater than the threshold Th2, and the tuner 2 is set as the reception frequency without any trouble.

ここで、一般的に言って、ユーザは、チューナ２に設定する受信周波数として、受信状態が良好な周波数を選定する蓋然性が高い。したがって、以上のように、まず受信状態が良好な周波数だけを対象として音声認識を行うことにより、効率的かつ誤り少なく音声入力された周波数を認識することができるようになる。 Here, generally speaking, the user has a high probability of selecting a frequency having a good reception state as the reception frequency set in the tuner 2. Therefore, as described above, by first performing speech recognition only for frequencies with good reception conditions, it is possible to recognize the frequency of speech input efficiently and with few errors.

なお、前述したしきい値Ｔｈ１は、しきい値Ｔｈ２よりも小さな値とし、シーク周波数メモリ１４に登録されている受信状態が良好な周波数が、ユーザが音声入力した周波数として、他の周波数よりも認識され易くするようにしてもよい。
また、このような受信周波数音声入力受付処理において、ステップ４０６で優先認識辞書１１を用いた音声認識によって算出した周波数のマッチスコアがしきい値Ｔｈ１未満であると判定された場合には、音声認識エンジン４で優先認識辞書１１より探索した、入力された音声に対するマッチスコアが、上位の所定数の音声特徴データに対応する周波数の一覧を、たとえば、図５ａに示すように表示装置９に表示するようにしても良い。そして、この場合には、一覧を表示したならば、表示した一覧中からの周波数の選択、もしくは、表示した一覧中に音声入力した周波数が存在しない旨の指定の入力をユーザから受け付けるようにする。そして、一覧中からの周波数の選択を受け付けた場合には、受け付けた周波数を受信周波数としてチューナ２に設定するようにし、表示した一覧中に音声入力した周波数が存在しない旨の指定を受け付けたときには、ステップ４０８に進んで、上述のように一般認識辞書１２を用いた音声認識を行うようにする。 Note that the above-described threshold value Th1 is smaller than the threshold value Th2, and the frequency in which the reception state registered in the seek frequency memory 14 is good is the frequency that the user has input as a voice than the other frequencies. You may make it easy to recognize.
In such reception frequency voice input reception processing, if it is determined in step 406 that the frequency match score calculated by voice recognition using the priority recognition dictionary 11 is less than the threshold value Th1, voice recognition is performed. A list of frequencies corresponding to a predetermined number of higher-order voice feature data whose match score for the input voice searched by the priority recognition dictionary 11 in the engine 4 is displayed on the display device 9 as shown in FIG. 5a, for example. You may do it. In this case, when the list is displayed, selection of a frequency from the displayed list or designation input indicating that there is no voice input frequency in the displayed list is accepted from the user. . When the selection of the frequency from the list is accepted, the received frequency is set in the tuner 2 as the reception frequency, and when the designation that the frequency input by voice does not exist in the displayed list is accepted. Then, the process proceeds to step 408 to perform speech recognition using the general recognition dictionary 12 as described above.

また、同様に、ステップ４１０において、一般認識辞書１２を用いた音声認識によって算出した周波数のマッチスコアがしきい値Ｔｈ２未満であると判定された場合には、音声認識エンジン４で一般認識辞書１２より探索した、入力された音声に対するマッチスコアが上位の所定数の音声特徴データに周波数の一覧を表示装置９に表示するようにしても良い。そして、この場合には、一覧を表示したならば、表示した一覧中からの周波数の選択、もしくは、表示した一覧中に音声入力した周波数が存在しない旨の指定の入力をユーザから受け付けるようにする。そして、一覧中からの周波数の選択を受け付けた場合には、受け付けた周波数を受信周波数としてチューナ２に設定するようにし、表示した一覧中に音声入力した周波数が存在しない旨の指定を受け付けたときには、ステップ４１２に進んで、上述のように入力された音声に対応する周波数は存在しない旨を表すメッセージ音声を出力するようにする。 Similarly, if it is determined in step 410 that the frequency match score calculated by speech recognition using the general recognition dictionary 12 is less than the threshold Th2, the speech recognition engine 4 uses the general recognition dictionary 12 A list of frequencies may be displayed on the display device 9 for a predetermined number of voice feature data having a higher match score with respect to the input voice searched. In this case, when the list is displayed, selection of a frequency from the displayed list or designation input indicating that there is no voice input frequency in the displayed list is accepted from the user. . When the selection of the frequency from the list is accepted, the received frequency is set in the tuner 2 as the reception frequency, and when the designation that the frequency input by voice does not exist in the displayed list is accepted. The process proceeds to step 412 to output a message voice indicating that there is no frequency corresponding to the voice input as described above.

また、以上の実施形態において、制御部１０は、ユーザが受信状態が良好な周波数を音声入力前に認知できるように、図５ｂに示すように、シーク周波数メモリ１４に登録されている周波数リストに登録された周波数の一覧を表示装置９に表示するようにしてもよい。
また、以上の実施形態の辞書更新処理において、プリセットメモリ１５に登録されている周波数については無条件に、当該周波数に対応する音声特徴データを、一般認識辞書１２に格納せずに優先認識辞書１１に格納するようにしてもよい。ここで、プリセットメモリ１５に登録した周波数も、ユーザが、チューナ２に設定する受信周波数として選定する蓋然性が高い。したがって、このようにすることにより、効率的かつ誤り少なく音声入力された周波数を認識することができるようになる。 Further, in the above embodiment, the control unit 10 adds the frequency list registered in the seek frequency memory 14 to the frequency list registered in the seek frequency memory 14 as shown in FIG. A list of registered frequencies may be displayed on the display device 9.
Further, in the dictionary update process of the above embodiment, the speech recognition data corresponding to the frequency registered in the preset memory 15 is unconditionally stored in the priority recognition dictionary 11 without being stored in the general recognition dictionary 12. You may make it store in. Here, the frequency registered in the preset memory 15 is also highly likely to be selected as a reception frequency set by the user in the tuner 2. Therefore, by doing so, it is possible to recognize the frequency of voice input efficiently and with few errors.

ところで、以上の実施形態では、チューナ２に設定する受信周波数を、ユーザからの受信周波数とする周波数の音声入力によって受け付けるようにしたが、これは、チューナ２に設定する受信周波数を、受信周波数とする周波数で放送を行っている放送局の放送局名の音声入力によってユーザから受け付けるようにしてもよい。 By the way, in the above embodiment, the reception frequency set in the tuner 2 is received by voice input of the frequency that is the reception frequency from the user. This is because the reception frequency set in the tuner 2 is defined as the reception frequency. You may make it accept from a user by the audio | voice input of the broadcast station name of the broadcast station which is broadcasting on the frequency which carries out.

すなわち、この場合には、予め制御部１０に、各周波数と各周波数で放送を行っている放送局名の対応を記憶しておく。また、辞書データベース１３には、放送が行われている各周波数に対応づけて、当該周波数で放送を行っている放送局の放送局名を発話した音声の音声特徴データを格納しておく。そして、優先認識辞書１１には、シーク周波数メモリ１４に登録されている各周波数に対応づけて、当該周波数で放送を行っている放送局名の音声特徴データを、辞書データベース１３から転送して登録し、一般認識辞書１２には、残りの各周波数に対応づけて、当該周波数で放送を行っている放送局名の音声特徴データを、辞書データベース１３から転送して登録する。そして、ユーザから入力された放送局名を表す音声の特徴にしきい値Ｔｈ１以上最も近似する特徴を表す音声特徴データを優先認識辞書１１より探索し、探索した音声特徴データが対応づけて優先認識辞書１１に登録されている周波数を受信周波数としてチューナ２に設定するようにする。また、入力された音声の特徴にしきい値Ｔｈ１以上近似する特徴を表す音声特徴データが優先認識辞書１１に登録されていない場合には、ユーザから入力された放送局名を表す音声の特徴にしきい値Ｔｈ２以上最も近似する特徴を表す音声特徴データを一般認識辞書１２より探索し、探索した音声特徴データが対応づけて一般認識辞書１２に登録されている周波数を受信周波数としてチューナ２に設定するようにする。 That is, in this case, the correspondence between each frequency and the name of the broadcasting station that is broadcasting at each frequency is stored in the control unit 10 in advance. Further, the dictionary database 13 stores voice feature data of a voice uttered by a broadcasting station name of a broadcasting station broadcasting at the frequency in association with each frequency at which the broadcasting is performed. Then, in the priority recognition dictionary 11, the voice feature data of the name of the broadcasting station broadcasting at the frequency is transferred from the dictionary database 13 and registered in association with each frequency registered in the seek frequency memory 14. Then, in the general recognition dictionary 12, the voice feature data of the name of the broadcasting station broadcasting at the frequency is transferred from the dictionary database 13 and registered in association with the remaining frequencies. Then, voice feature data representing a feature that most closely approximates the threshold value Th1 to the voice feature representing the broadcast station name input by the user is searched from the priority recognition dictionary 11, and the searched voice feature data is associated with the priority recognition dictionary. 11 is set in the tuner 2 as a reception frequency. In addition, when voice feature data representing a feature approximating the threshold Th1 or more to the input voice feature is not registered in the priority recognition dictionary 11, the threshold is set as the voice feature representing the broadcast station name inputted by the user. Voice feature data representing a feature that is closest to a value Th2 or more is searched from the general recognition dictionary 12, and a frequency registered in the general recognition dictionary 12 in association with the searched voice feature data is set as a reception frequency in the tuner 2. To.

なお、この場合には、図５ｃに示すように、シーク周波数メモリ１４に登録されている周波数リストに登録されている各周波数で放送を行っている放送局名の一覧を表示装置９に表示するようにするのがよい。
また、以上の実施形態は、バックグランドシーク処理によって良好に受信できる周波数を探索し、探索した周波数のリストをシーク周波数メモリ１４に登録する代わりに、カーナビゲーション装置などの、現在位置する地域を算出する手段を設け、予め求めておいた地域と当該地域で放送が行われている周波数との対応に基づいて求まる、算出した現在位置する地域で良好に受信できるはずの周波数をシーク周波数メモリ１４に登録し、当該シーク周波数メモリ１４に登録された周波数に基づいて、上述のように優先認識辞書１１や一般認識辞書１２を構成するようにしてもよい。なお、この場合においても、優先認識辞書１１や一般認識辞書１２に登録する音声特徴データは、前述のように、各周波数を発話した音声の特徴を表すものであってもよいし、各周波数で放送を行っている放送局の放送局名を発話した音声の特徴を表すものであってもよい。 In this case, as shown in FIG. 5 c, a list of broadcast station names that are broadcasting at each frequency registered in the frequency list registered in the seek frequency memory 14 is displayed on the display device 9. It is better to do so.
Further, in the above embodiment, instead of searching for a frequency that can be satisfactorily received by the background seek process and registering a list of searched frequencies in the seek frequency memory 14, a region where the current position is located, such as a car navigation device, is calculated. A frequency that should be satisfactorily received in the area where the current position is calculated, which is obtained based on the correspondence between the area that has been obtained in advance and the frequency at which the broadcast is performed in the area. The priority recognition dictionary 11 and the general recognition dictionary 12 may be configured as described above based on the frequencies registered and registered in the seek frequency memory 14. In this case as well, the voice feature data registered in the priority recognition dictionary 11 and the general recognition dictionary 12 may represent the features of the voice uttered at each frequency, as described above. It may represent the characteristics of the voice uttered by the name of the broadcasting station that is broadcasting.

ところで、以上の実施形態は、ラジオ放送の受信を行うラジオ放送受信機への適用を例にとり説明したが、本実施形態はＴＶ放送を受信するＴＶ放送受信機についても同様に適用することができる。 By the way, although the above embodiment demonstrated taking the case of application to the radio broadcast receiver which receives a radio broadcast as an example, this embodiment can be applied similarly also to the TV broadcast receiver which receives TV broadcast. .

本発明の実施形態に係るラジオ放送受信機の構成を示すブロック図である。It is a block diagram which shows the structure of the radio broadcast receiver which concerns on embodiment of this invention. 本発明の実施形態に係る辞書データベース、優先認識辞書、一般認識辞書の構成例を示す図である。It is a figure which shows the structural example of the dictionary database which concerns on embodiment of this invention, a priority recognition dictionary, and a general recognition dictionary. 本発明の実施形態に係る辞書更新処理を示すフローチャートである。It is a flowchart which shows the dictionary update process which concerns on embodiment of this invention. 本発明の実施形態に係る受信周波数音声入力受付処理を示すフローチャートである。It is a flowchart which shows the reception frequency audio | voice input reception process which concerns on embodiment of this invention. 本発明の実施形態に係るラジオ放送受信機の表示画面例を示す図である。It is a figure which shows the example of a display screen of the radio broadcast receiver which concerns on embodiment of this invention.

Explanation of symbols

１…アンテナ、２…チューナ、３…マイクロフォン、４…音声認識エンジン、５…音声生成部、６…音声出力部、７…スピーカ、８…入力装置、９…表示装置、１０…制御部、１１…優先認識辞書、１２…一般認識辞書、１３…辞書データベース、１４…シーク周波数メモリ、１５…プリセットメモリ。 DESCRIPTION OF SYMBOLS 1 ... Antenna, 2 ... Tuner, 3 ... Microphone, 4 ... Voice recognition engine, 5 ... Voice generation part, 6 ... Voice output part, 7 ... Speaker, 8 ... Input device, 9 ... Display apparatus, 10 ... Control part, 11 Priority recognition dictionary, 12 General recognition dictionary, 13 Dictionary database, 14 Seek frequency memory, 15 Preset memory.

Claims

A broadcast receiver,
A tuner for receiving a broadcast channel set as a reception channel;
A reception state good channel identification means for identifying a broadcast channel having a good reception state;
Speech recognition means for recognizing the broadcast channel represented by the user's speech;
A reception channel setting means for setting the broadcast channel recognized by the voice recognition means in the tuner as the reception channel;
Create a speech recognition dictionary for speech recognition of each utterance voice representing each of the broadcast channels with good reception status identified by the reception status good channel identification means as a priority speech recognition dictionary, and represent each of the other broadcast channels Speech recognition dictionary setting means for creating a speech recognition dictionary for speech recognition of uttered speech as a general speech recognition dictionary;
The speech recognition means performs speech recognition of a broadcast channel represented by a user's speech using the priority speech recognition dictionary, and when the speech recognition using the priority speech recognition dictionary fails, the general speech recognition dictionary A broadcast receiver characterized by performing speech recognition of a broadcast channel represented by the user's uttered speech by performing speech recognition of the broadcast channel represented by the user's uttered speech using the.

The broadcast receiver according to claim 1,
The reception state good channel identification means identifies a broadcast channel with a good reception state by a background seek that investigates the reception state of each broadcast channel without outputting to the user the broadcast received on the broadcast channel. Broadcast receiver characterized by.

The broadcast receiver according to claim 1 or 2,
In the priority speech recognition dictionary, the characteristics of the speech sound representing the broadcast channel are registered in association with each of the broadcast channels with good reception status identified by the channel with good reception status identifying means, and the general speech In the recognition dictionary, the characteristics of the utterance voice representing the broadcast channel are registered in association with each broadcast channel other than the broadcast channel with the good reception state identified by the reception state good channel identification means,
The speech recognition means, when a feature that matches the feature of the user's utterance speech with a first threshold value or more is registered in the priority speech recognition dictionary, the user speech registered in the priority speech recognition dictionary The broadcast channel corresponding to the feature that most closely matches the feature of speech is recognized as the broadcast channel represented by the user's speech, and the feature that matches the feature of the user's speech to the priority speech recognition dictionary by a first threshold or more. Is not registered, it is checked whether or not a feature that matches the feature of the user's uttered speech in the general speech recognition dictionary with a second threshold value or more is registered, and the user's uttered speech is registered in the general speech recognition dictionary. If a feature that matches the feature of the second threshold or more is registered, the feature most closely matches the feature of the user's speech that is registered in the general speech recognition dictionary Recognizing a broadcast channel corresponding to the symptom as a broadcast channel representing the uttered voice of the user,
The broadcast receiver according to claim 1, wherein the first threshold value is set smaller than the second threshold value.

The broadcast receiver according to claim 1, 2, or 3,
The broadcast receiver, wherein the broadcast channel is a frequency channel, and the utterance voice representing the broadcast channel is a voice uttered by the center frequency of the broadcast channel.

The broadcast receiver according to claim 4, wherein
A display device;
A broadcast receiver comprising: a reception state good frequency display means for displaying a center frequency of a broadcast channel with a good reception state identified by the reception state good channel identification means on the display device.

The broadcast receiver according to claim 1, 2, or 3,
A broadcast receiver characterized in that the utterance voice representing the broadcast channel is a voice uttered by a broadcast station name of a broadcast station broadcasting on the broadcast channel.

The broadcast receiver according to claim 6, wherein
A display device;
A reception station with good reception status that displays on the display device a broadcast station name of a broadcasting station that is broadcasting on a broadcast channel with a good reception status identified by the reception status good channel identification means; Broadcast receiver.

A broadcast receiver,
A tuner for receiving a broadcast channel set as a reception channel;
Preset registration means for registering one or more broadcast channels;
Preset channel selection means for setting a broadcast channel selected from broadcast channels registered in the preset registration means as a reception channel in the tuner in response to a user's preset broadcast channel selection operation;
Speech recognition means for recognizing the broadcast channel represented by the user's speech;
A reception channel setting means for setting the broadcast channel recognized by the voice recognition means in the tuner as the reception channel;
Create a speech recognition dictionary for speech recognition of each utterance voice representing each broadcast channel registered in the preset registration means as a priority speech recognition dictionary, and for speech recognition of each utterance speech representing each of the other broadcast channels Voice recognition dictionary setting means for creating a voice recognition dictionary of the above as a general voice recognition dictionary,
The speech recognition means performs speech recognition of a broadcast channel represented by a user's speech using the priority speech recognition dictionary, and when the speech recognition using the priority speech recognition dictionary fails, the general speech recognition dictionary A broadcast receiver characterized by performing speech recognition of a broadcast channel represented by the user's uttered speech by performing speech recognition of the broadcast channel represented by the user's uttered speech using the.

In a broadcast receiver, a method for receiving audio input of a received broadcast channel that accepts designation of a broadcast channel received by a tuner by audio input,
Identifying a broadcast channel with good reception;
Create a speech recognition dictionary for speech recognition of each utterance speech representing each of the identified broadcast channels with good reception status as a priority speech recognition dictionary, and speech recognition speech for each utterance speech representing each of the other broadcast channels Creating a recognition dictionary as a general speech recognition dictionary;
The speech of the broadcast channel represented by the user's speech using the priority speech recognition dictionary is recognized, and when the speech recognition using the priority speech recognition dictionary fails, the user's speech using the general speech recognition dictionary Performing speech recognition of the broadcast channel represented by the user's uttered speech by performing speech recognition of the broadcast channel represented by the speech;
A voice input receiving method for a received broadcast channel, comprising: setting a broadcast channel recognized by voice as a broadcast channel received by the tuner.