JP6485214B2

JP6485214B2 - Electronic library system

Info

Publication number: JP6485214B2
Application number: JP2015105186A
Authority: JP
Inventors: 梨恵新井; 幸俊森下; 鈴木　正和; 正和鈴木
Original assignee: Dai Nippon Printing Co Ltd
Current assignee: Dai Nippon Printing Co Ltd
Priority date: 2015-05-25
Filing date: 2015-05-25
Publication date: 2019-03-20
Anticipated expiration: 2035-05-25
Also published as: JP2016218881A

Description

本発明は，インターネットを利用して電子資料を貸し出す電子図書館を音声操作できるようにした発明である。 The present invention is an invention in which an electronic library that lends electronic materials can be operated by voice using the Internet.

目の不自由な方（以下，「視覚障害者」と記す。）にとって，読書から得られる楽しみは健常者よりも大きいため，視覚障害者用の図書館として，視覚障害者用の図書（点字図書と録音図書）を視覚障害者に貸し出す点字図書館が既に設立されている。しかし，点字図書館が蔵書する視覚障害者用の図書はボランティアの協力を得て製作されるため，視覚障害者用の図書の蔵書数は通常の図書館よりも遥かに少なく，また，点字図書館は，視覚障害者用の図書を郵送で視覚障害者に貸し出すため，リアルタイムで視覚障害者用の図書を借りることができない問題もある。 For those who are blind (hereinafter referred to as “visually impaired”), the enjoyment gained from reading is greater than that of healthy people, so as a library for the visually impaired, books for the visually impaired (Braille books) Braille libraries have already been established to lend out books to the visually impaired. However, because the books for the visually impaired, which are collected by the Braille Library, are produced with the cooperation of volunteers, the number of books for the visually impaired is much smaller than the regular library. There is also a problem that books for the visually impaired cannot be borrowed in real time because books for the visually impaired are lent to the visually impaired by mail.

一方で，健常者用の図書館については，利用者の利便性を高めるために，インターネットを利用して電子資料を貸し出す電子図書館の普及が進んでいる。インターネットを用いた電子図書館に係る発明は既に開示され，例えば，特許文献１では，ユーザがインターネットを介して電子図書館にアクセスし，電子資料の検索、電子資料の貸出及び電子資料の返却などの手続をすることができるシステムが開示されている。 On the other hand, with regard to libraries for healthy people, electronic libraries that lend electronic materials using the Internet are increasing in order to improve convenience for users. Inventions related to electronic libraries using the Internet have already been disclosed. For example, in Patent Document 1, a user accesses an electronic library via the Internet, searches for electronic materials, rents electronic materials, and returns electronic materials. A system that can do this is disclosed.

視覚障害者が電子図書館を利用できれば，視覚障害者は，図書館の建屋に出向かなくとも電子資料を借りることができるし，また，視覚障害者が借りることのできる電子資料の数も増えるが，電子資料を貸し出す電子図書館は，視覚障害者が独力で利用し易いように構成されていない。 If visually impaired people can use electronic libraries, visually impaired people can borrow electronic materials without going to the library building, and the number of electronic materials that visually impaired people can borrow increases. Electronic libraries that lend electronic materials are not designed to be easily accessible by the visually impaired.

視覚障害者が独力で電子図書館を利用し易くするためには，画面を見ずに電子図書館を利用できればよく，音声を利用して電子図書館を操作できれば，視覚障害者が独力で利用し易くなると考えられる。 In order to make it easy for visually impaired people to use the electronic library by themselves, it is only necessary to be able to use the electronic library without looking at the screen, and if the electronic library can be operated using voice, it will be easier for visually impaired people to use the library independently. Conceivable.

音声を利用した図書システムに係る発明としては，特許文献２において，音声を利用することで，視覚障害者の利便性を高めた情報アクセスシステムが開示されている。特許文献２には，ユーザの音声を音声認識すること，音声認識結果に基づいて電子資料（刊行物）を検索することで，電子資料のテキストの音声合成を生成することなどが記載されているが，特許文献２に係る発明では，音声を利用して電子図書館を操作できることには着眼されていない。 As an invention relating to a book system using voice, Patent Document 2 discloses an information access system that uses voice to improve convenience for visually impaired persons. Patent Document 2 describes, for example, generating speech synthesis of text of electronic materials by recognizing user's speech and searching electronic materials (publications) based on speech recognition results. However, the invention according to Patent Document 2 does not focus on the fact that the electronic library can be operated using voice.

電子図書館は，複数のＷｅｂページを有し，Ｗｅｂページを任意に切り替えられるように構成されているが，画面を見ることができない視覚障害者にとって，音声により電子図書館のＷｅｂページを任意に切り替えることができたとしても，電子図書館のＷｅｂページを任意に切り替えて電子図書館を利用することは困難である。 The electronic library has a plurality of Web pages and can be switched arbitrarily. However, for visually impaired people who cannot see the screen, the Web page of the electronic library can be switched arbitrarily by voice. Even if it is possible, it is difficult to use the electronic library by arbitrarily switching the Web page of the electronic library.

特開２００８−２６２５１８号公報JP 2008-262518 A 特開平７−１５２７８７号公報JP-A-7-152787

そこで，本発明は，視覚障害者が音声により電子図書館を容易に利用できる電子図書館システムを提供することを課題とする。 Therefore, an object of the present invention is to provide an electronic library system in which a visually impaired person can easily use an electronic library by voice.

上述した課題を解決する第１の発明は，ネットワークを介して利用できる様々な電子図書館サービスを提供し，電子図書館サービスの呼び出し要求を受けると，呼び出し要求を受けた前記電子図書館サービスを実行し，前記電子図書館サービスの呼び出し要求をした装置に対して，前記電子図書館サービスの実行結果を送信する電子図書館サイトと，前記電子図書館サイトが提供している前記電子図書館サービスを利用するアプリケーションプログラムを実装した携帯端末と，前記携帯端末から音声の特徴量を受信すると，音声の特徴量を用いて音声認識を行い，音声認識により得られたテキスト形式の音声認識結果を前記携帯端末へ送信する音声認識サーバと，前記携帯端末からテキストを受信すると，このテキストの合成音声を生成して前記携帯端末へ送信する音声合成サーバとから少なくとも構成された電子図書館システムである。
視覚障害者であっても，音声により電子図書館を容易に利用できるように，前記アプリケーションプログラムが起動することで，前記携帯端末は，ユーザがマイクに入力した音声の特徴量を抽出して，音声の特徴量を前記音声認識サーバに送信し，ユーザがマイクに入力した音声に対応する音声認識結果を前記音声認識サーバから取得する特徴量抽出部と，前記電子図書館サイトの操作に係る手続きとその実行順序を記述した電子図書館操作フローが登録され，前記電子図書館操作フローに記述されている前記手続きを実行する際，音声入力を開始する動作を検知すると，前記特徴量抽出部を作動させて，ユーザがマイクに入力した音声の音声認識結果を前記特徴量抽出部から取得した後，前記特徴量抽出部から取得した音声認識結果を利用して，この時点の前記手続きに対応する前記前記電子図書館サービスを呼び出し，呼び出した前記前記電子図書館サービスの実行結果を前記電子図書館サイトから受信すると，前記音声合成サーバを利用して，前記電子図書館サイトから受信した実行結果に対応する合成音声を生成して音声出力する手続き実行部を備える。 The first invention that solves the above-mentioned problems provides various electronic library services that can be used via a network. When a call request for the electronic library service is received, the electronic library service that receives the call request is executed, An electronic library site that transmits the execution result of the electronic library service and an application program that uses the electronic library service provided by the electronic library site are installed on the device that has requested the electronic library service to be called. A voice recognition server that performs voice recognition using a voice feature amount and transmits a voice recognition result in a text format obtained by voice recognition to the portable terminal when a voice feature amount is received from the portable terminal and the portable terminal When text is received from the mobile terminal, a synthesized speech of the text is generated. Serial at least configured digital library system from a voice synthesizing server for transmitting to the portable terminal.
Even if it is a visually impaired person, when the application program is started so that the electronic library can be easily used by voice, the portable terminal extracts the feature amount of the voice input by the user to the microphone, and A feature amount extraction unit that transmits a feature recognition amount corresponding to a speech input by a user to a microphone from the speech recognition server, a procedure related to operation of the electronic library site, and When an electronic library operation flow describing the execution order is registered and when the procedure described in the electronic library operation flow is executed, when the operation of starting speech input is detected, the feature amount extraction unit is operated, After the speech recognition result of the voice input by the user to the microphone is acquired from the feature amount extraction unit, the speech recognition result acquired from the feature amount extraction unit is used. Then, when the electronic library service corresponding to the procedure at this time is called and the execution result of the called electronic library service is received from the electronic library site, the electronic library site is utilized using the speech synthesis server. A procedure execution unit for generating a synthesized speech corresponding to the execution result received from and outputting the synthesized speech.

上述した課題を解決する第２の発明は，第１の発明に記載した電子図書館システムにおいて，前記手続き実行部は，音声入力を開始する動作を検知すると，この時点の前記手続きに係る音声ガイダンスを音声出力してから，前記特徴量抽出部を作動させることを特徴とする。前記手続き実行部が，音声入力を開始する動作を検知すると，この時点の前記手続きに係る音声ガイダンスを音声出力するように構成することで，ユーザは，前記手続き実行部が実行しているこの時点の前記手続きを音声ガイダンスにより把握できる。 According to a second invention for solving the above-described problem, in the electronic library system according to the first invention, when the procedure execution unit detects an operation of starting voice input, the voice guidance related to the procedure at this time is displayed. The feature extraction unit is activated after outputting the voice. When the procedure execution unit detects an operation to start voice input, the voice guidance related to the procedure at this time is configured to be output by voice so that the user can Can be grasped by voice guidance.

上述した課題を解決する第３の発明は，第１の発明または第２の発明に記載した電子図書館システムにおいて，前記携帯端末はモーションセンサを備え，前記手続き実行部は，前記携帯端末が振られたことを，音声入力を開始する動作として検知することを特徴とする。音声入力を開始する動作は，音声入力を開始するためのボタンオブジェクトを選択する動作とすることもできるが，ユーザが視覚障害者であることを想定すると，前記携帯端末が振られたことを，音声入力を開始する動作として検知するようにすることが好適である。 According to a third invention for solving the above-described problem, in the electronic library system according to the first invention or the second invention, the mobile terminal includes a motion sensor, and the procedure execution unit is configured to shake the mobile terminal. Is detected as an operation of starting voice input. The action of starting voice input can be an action of selecting a button object for starting voice input, but assuming that the user is a visually impaired person, It is preferable to detect as an operation of starting voice input.

上述した課題を解決する第４の発明は，第１の発明から第３の発明のいずれか一つに記載した電子図書館システムにおいて，資料を検索する前記電子図書館サービスを呼び出し，前記特徴量抽出部から取得した音声認識結果に適合する資料を検索する資料検索手続き，前記資料検索手続きの検索結果に含まれるテキストの合成音声を音声出力した後，資料を選択する前記電子図書館サービスを呼び出し，前記特徴量抽出部から取得した音声認識結果に対応する資料を選択する資料選択手続き，前記電子図書館サイトにログインする前記電子図書館サービスを呼び出し，前記特徴量抽出部から取得した音声認識結果を利用して，前記電子図書館サイトにログインするログイン手続き，資料を貸出しする前記電子図書館サービスを呼び出し，前記資料選択手続きにて選択した資料を前記電子図書館サイトから借りる貸出手続き，資料のコンテンツを提供する前記電子図書館サービスを呼び出し，前記貸出手続きにて前記電子図書館サイトから借りた資料のコンテンツに含まれるテキストを合成音声により再生する再生手続きを順に実行することを，前記電子図書館サイトから資料を借りて読む操作に係る前記手続きとその実行順序として前記電子図書館操作フローに記述したことを特徴とする。第４の発明によれば，前記電子図書館サイトから資料を借りて読む一連の操作を音声により実施できる。 According to a fourth invention for solving the above-mentioned problem, in the electronic library system according to any one of the first to third inventions, the electronic library service for retrieving materials is called up, and the feature quantity extracting unit A document search procedure for searching for a document that matches the speech recognition result obtained from the above, a synthesized speech of text included in the search result of the document search procedure is output as voice, and the electronic library service for selecting the document is called, A material selection procedure for selecting a material corresponding to the speech recognition result acquired from the amount extraction unit, the electronic library service for logging in to the electronic library site is called, and the speech recognition result acquired from the feature amount extraction unit is used. Login procedure for logging in to the electronic library site, calling the electronic library service for renting materials, Lending procedures for borrowing materials selected in the fee selection procedure from the electronic library site, calling the electronic library service providing the content of the materials, and text included in the contents of the materials borrowed from the electronic library site in the lending procedure Is described in the electronic library operation flow as the procedure relating to the operation of borrowing and reading materials from the electronic library site and the execution order thereof. According to the fourth invention, a series of operations for borrowing and reading materials from the electronic library site can be performed by voice.

上述した課題を解決する第５の発明は，第４の発明に記載した電子図書館システムであって，前記資料選択手続きにおいて，ユーザが借りている資料が検索されると，資料を返却する前記電子図書館サービスを呼び出し，前記特徴量抽出部から取得した音声認識結果によって指定された資料を前記電子図書館サイトへ返却する返却手続きを実行することを，資料の返却操作に係る前記手続きとその実行順序として前記電子図書館操作フローに記述したことを特徴とする。第５の発明によれば，前記電子図書館サイトから借りた資料を返却する操作を音声により実施できる。 A fifth invention that solves the above-described problem is the electronic library system according to the fourth invention, wherein the electronic document system returns a material when a material borrowed by a user is searched in the material selection procedure. Calling the library service and executing the return procedure for returning the material specified by the voice recognition result acquired from the feature amount extraction unit to the electronic library site is performed as the procedure related to the material return operation and the execution order thereof. It is described in the electronic library operation flow. According to the fifth aspect, the operation of returning the materials borrowed from the electronic library site can be performed by voice.

上述した課題を解決する第６の発明は，第４の発明または第５の発明に記載した電子図書館システムであって，前記資料選択手続きにて選択した資料が貸出不可の場合，資料を予約する前記電子図書館サービスを呼び出し，前記資料選択手続きにて選択した資料を前記電子図書館サイトに予約する予約手続きを実行することを，資料の予約操作に係る前記手続きとその実行順序として前記電子図書館操作フローに記述したことを特徴とする。第６の発明によれば，前記電子図書館サイトに資料を予約する操作を音声により実施できる。 A sixth invention for solving the above-described problem is the electronic library system described in the fourth invention or the fifth invention, and reserves a material when the material selected in the material selection procedure cannot be lent. Calling the electronic library service and executing the reservation procedure for reserving the material selected in the material selection procedure in the electronic library site is the electronic library operation flow as the procedure relating to the material reservation operation and its execution order. It is characterized by having described it. According to the sixth invention, the operation of reserving materials at the electronic library site can be performed by voice.

上述した課題を解決する第７の発明は，第４の発明から第６の発明のいずれか一つに記載した電子図書館システムにおいて，前記アプリケーションプログラムが起動することによって前記携帯端末は，ユーザがマイクに入力した音声の声紋を認証する声紋認証部を備え，前記手続き実行部は，前記ログイン手続きにおいて，ユーザがマイクに入力した音声の声紋認証に前記声紋認証部が成功したときのみ，前記電子図書館サイトにログインする前記電子図書館サービスを呼び出すことを特徴とする。ユーザがマイクに入力した音声の声紋を認証することで，前記電子図書館サイトへのログインに係るセキュリティを高めることができる。 According to a seventh invention for solving the above-described problem, in the electronic library system according to any one of the fourth to sixth inventions, when the application program is activated, the portable terminal is connected to a microphone by a user. A voice print authentication unit for authenticating a voice print of a voice input to the electronic library, and the procedure executing unit is configured to perform the electronic library only when the voice print authentication unit succeeds in the voice print authentication of a voice input by a user to a microphone in the login procedure. Calling the electronic library service to log in to the site. By authenticating the voice print of the voice input to the microphone by the user, security related to login to the electronic library site can be enhanced.

上述した本発明に係る電子図書館システムによれば，電子図書館操作フローに従い，音声を利用する手続きが自動的に実行されるため，視覚障害者であっても，音声により電子図書館を容易に利用できるようになる。 According to the electronic library system according to the present invention described above, the procedure using voice is automatically executed according to the operation flow of the electronic library, so that even the visually impaired can easily use the electronic library by voice. It becomes like this.

本実施形態に係る電子図書館システムの構成を説明する図。The figure explaining the structure of the electronic library system which concerns on this embodiment. 本実施形態に係る携帯端末のブロック図。The block diagram of the portable terminal which concerns on this embodiment. 本実施形態に係る電子図書館操作フローを説明する図。The figure explaining the electronic library operation flow which concerns on this embodiment. 資料検索手続きにおいて，手続き実行部が実行する処理を説明する図。The figure explaining the process which a procedure execution part performs in a document search procedure. 電子図書館サイトのトップページを説明する図。The figure explaining the top page of an electronic library site. 資料選択手続きにおいて，手続き実行部が実行する処理を説明する図。The figure explaining the process which a procedure execution part performs in a data selection procedure. 資料の検索結果を表示する電子図書館サイトのページを説明する図。The figure explaining the page of the electronic library site which displays the search result of a document. 貸出手続きにおいて，手続き実行部が実行する処理を説明する図。The figure explaining the process which a procedure execution part performs in a loan procedure. ログイン手続きにおいて，手続き実行部が実行する処理を説明する図。The figure explaining the process which a procedure execution part performs in a login procedure. 再生手続きにおいて，手続き実行部が実行する処理を説明する図。The figure explaining the process which a procedure execution part performs in a reproduction | regeneration procedure. 資料返却手続きにおいて，手続き実行部が実行する処理を説明する図。The figure explaining the process which a procedure execution part performs in a material return procedure.

ここから，本発明の好適な実施形態を記載する。なお，以下の記載は本発明の技術的範囲を束縛するものでなく，理解を助けるために記述するものである。 From here, preferred embodiments of the present invention will be described. The following description is not intended to limit the technical scope of the present invention, but is provided to aid understanding.

ここから，本実施形態に係る電子図書館システム１について説明する。図１は，本実施形態に係る電子図書館システム１の構成を説明する図である。本実施形態に係る電子図書館システム１は，視覚障害者が独力で電子図書館を利用できるように発案されたシステムで，図１に図示したように，携帯端末２，電子図書館サイト３，音声認識サーバ４，音声合成サーバ５を含み，それぞれはネットワーク１ａを介してデータ通信できる。 From here, the electronic library system 1 which concerns on this embodiment is demonstrated. FIG. 1 is a diagram for explaining the configuration of an electronic library system 1 according to the present embodiment. An electronic library system 1 according to the present embodiment is a system designed so that a visually impaired person can use an electronic library by himself. As shown in FIG. 1, a portable terminal 2, an electronic library site 3, a voice recognition server 4 and the speech synthesis server 5, each of which can perform data communication via the network 1a.

電子図書館システム１を構成する電子図書館サイト３は，電子図書館サイト３で蓄積しているさまざまな電子資料（以下，単に「資料」と記す。）の検索，資料の貸出，資料の予約，資料の返却など電子図書館業務に係る様々な手続きを，携帯端末２で動作するアプリケーションプログラムが利用できるようにするため電子図書館サービスを提供し，本実施形態では，電子図書館サービスを呼び出すためのＡＰＩ（Application Programming Interface）を外部に公開している。電子図書館サイト３は，ＨＴＴＰ（HyperText Transfer Protocol）を利用して携帯端末２とデータ通信し，携帯端末２から呼び出されたＡＰＩに対応する電子図書館サービスに係る処理を実行し，その実行結果を携帯端末２へ返信する処理を行う。なお，電子図書館サイト３はクラウド型のシステムで構成することが好適である。 The electronic library site 3 constituting the electronic library system 1 searches various electronic materials accumulated in the electronic library site 3 (hereinafter simply referred to as “materials”), lends materials, reserves materials, An electronic library service is provided in order to make it possible for an application program operating on the mobile terminal 2 to use various procedures related to electronic library operations such as return, and in this embodiment, an API (Application Programming) for calling the electronic library service is provided. Interface) is open to the public. The electronic library site 3 performs data communication with the mobile terminal 2 using HTTP (HyperText Transfer Protocol), executes processing related to the electronic library service corresponding to the API called from the mobile terminal 2, and carries the execution result on the mobile phone. A process of returning to the terminal 2 is performed. The electronic library site 3 is preferably composed of a cloud type system.

電子図書館システム１を構成する音声認識サーバ４は，携帯端末２から受信した音声の特徴量を音声認識し，テキスト形式の音声認識結果を携帯端末２へ送信するサーバである。音声認識の手法は様々あるが，本実施形態に係る音声認識サーバ４は，認識単位の単語毎に音声の特徴量を保持し，携帯端末２から受信した音声の特徴量に対応する単語を特定して順に並べることでテキスト形式の音声認識結果を生成する。 The speech recognition server 4 that constitutes the electronic library system 1 is a server that recognizes speech features received from the mobile terminal 2 and transmits a text-format speech recognition result to the mobile terminal 2. Although there are various methods of speech recognition, the speech recognition server 4 according to the present embodiment holds a speech feature amount for each word in the recognition unit, and identifies a word corresponding to the speech feature amount received from the mobile terminal 2 Then, a speech recognition result in a text format is generated by arranging in order.

電子図書館システム１を構成する音声合成サーバ５は，携帯端末２から受信したテキストを解析し，テキストに含まれる文字に対応する音声を合成することで，所定形式（例えば，ＭＰ３形式）の合成音声を生成して携帯端末２へ送信するサーバである。音声合成の手法は様々あるが，対象となるテキストを形態素解析してテキストを発音記号列に変換し，発音記号列から音声合成に必要なパラメータを生成して，このパラメータを用いて音声波形を合成することで合成音声を生成する手法が既に知られている。 The speech synthesis server 5 constituting the electronic library system 1 analyzes the text received from the mobile terminal 2 and synthesizes speech corresponding to characters included in the text, thereby synthesizing speech in a predetermined format (for example, MP3 format). Is generated and transmitted to the mobile terminal 2. There are various speech synthesis methods, but the morphological analysis of the target text is performed, the text is converted into a phonetic symbol string, parameters necessary for speech synthesis are generated from the phonetic symbol string, and the speech waveform is generated using these parameters. A technique for generating synthesized speech by synthesis is already known.

電子図書館システム１を構成する携帯端末２は，ユーザ（ここでは，視覚障害者になる）が電子図書館サイト３を利用する際に用いる端末で，具体的には，スマートフォンやタブレットである。 A portable terminal 2 constituting the electronic library system 1 is a terminal used when a user (here, a visually impaired person) uses the electronic library site 3, and specifically, is a smartphone or a tablet.

図２は，本実施形態に係る携帯端末２のブロック図である。図２に図示したように，携帯端末２は，マイク２４，モーションセンサ２５，タッチパネル２６およびスピーカ２７を備える。携帯端末２が備えるマイク２４は，音を電気信号に変換する機器である。携帯端末２が備えるモーションセンサ２５は，携帯端末２の動きの変化を検出するセンサで，具体的には，３軸ジャイロセンサや３軸加速度センサである。携帯端末２が備えるタッチパネル２６は，液晶ディスプレイなどのディスプレイと，ディスプレイに直接触れることで操作を行う装置が組み合わされた電子機器である。携帯端末２が備えるスピーカ２７は，音声出力に用いるデバイスで，電気信号を音に変換する機器である。なお，音声出力端子をスピーカ２７の代わりに用いてもよい。 FIG. 2 is a block diagram of the mobile terminal 2 according to the present embodiment. As illustrated in FIG. 2, the mobile terminal 2 includes a microphone 24, a motion sensor 25, a touch panel 26 and a speaker 27. The microphone 24 included in the mobile terminal 2 is a device that converts sound into an electrical signal. The motion sensor 25 included in the mobile terminal 2 is a sensor that detects a change in the movement of the mobile terminal 2, and specifically, a 3-axis gyro sensor or a 3-axis acceleration sensor. The touch panel 26 included in the mobile terminal 2 is an electronic device in which a display such as a liquid crystal display and a device that performs an operation by directly touching the display are combined. The speaker 27 provided in the portable terminal 2 is a device used for outputting sound, and is a device that converts an electrical signal into sound. Note that an audio output terminal may be used instead of the speaker 27.

また，携帯端末２は，図２では図示していないプロセッサを動作させるコンピュータプログラムとして，電子図書館サイト３が公開しているＡＰＩを利用して，電子図書館サイト３が提供している電子図書館サービスを呼び出して，音声により電子図書館サイト３を操作できるように構成された電子図書館アプリケーション２０がインストールされ，図２に図示しているように，電子図書館アプリケーション２０は，手続き実行部２１，特徴量抽出部２２および声紋認証部２３を備える。 In addition, the mobile terminal 2 uses the API published by the electronic library site 3 as a computer program for operating a processor not shown in FIG. 2, and provides an electronic library service provided by the electronic library site 3. The electronic library application 20 configured to call and operate the electronic library site 3 by voice is installed. As illustrated in FIG. 2, the electronic library application 20 includes a procedure execution unit 21 and a feature amount extraction unit. 22 and a voiceprint authentication unit 23.

電子図書館アプリケーション２０が有する特徴量抽出部２２は，電子図書館サイト３を利用するユーザの音声を認識するために備えられた機能である。電子図書館アプリケーション２０の特徴量抽出部２２は，音声認識が指定されて手続き実行部２１から呼び出されると，ユーザがマイク２４に入力した音声の特徴量（例えば，周波数スペクトルなど）を抽出し，抽出した特徴量を音声認識サーバ４に送信して、ユーザがマイク２４に入力した音声を音声認識サーバ４に音声認識させた後，音声認識サーバ４から受信した音声認識結果を手続き実行部２１に引き渡す処理を実行する。また，電子図書館アプリケーション２０の特徴量抽出部２２は，音声認識と声紋認証が指定されて手続き実行部２１から呼び出されると，上述した内容に加え，ユーザがマイク２４に入力した音声を後述する声紋認証部２３に引き渡して，声紋認証部２３から声紋認証結果を受取り，音声認識サーバ４から受信した音声認識結果と声紋認証部２３から受信した声紋認証結果を手続き実行部２１に引き渡す処理を実行する。 The feature amount extraction unit 22 included in the electronic library application 20 is a function provided for recognizing a voice of a user who uses the electronic library site 3. When the voice recognition is specified and the procedure execution unit 21 is called, the feature quantity extraction unit 22 of the electronic library application 20 extracts and extracts the voice feature quantity (for example, frequency spectrum) input to the microphone 24 by the user. The received feature amount is transmitted to the speech recognition server 4, and the speech input by the user to the microphone 24 is caused to be recognized by the speech recognition server 4, and then the speech recognition result received from the speech recognition server 4 is delivered to the procedure execution unit 21. Execute the process. When the feature recognition unit 22 of the electronic library application 20 is called from the procedure execution unit 21 with voice recognition and voiceprint authentication specified, in addition to the above-described content, the voiceprint input by the user to the microphone 24 will be described later. Deliver it to the authentication unit 23, receive the voice print authentication result from the voice print authentication unit 23, and execute a process of transferring the voice recognition result received from the voice recognition server 4 and the voice print authentication result received from the voice print authentication unit 23 to the procedure execution unit 21. .

電子図書館アプリケーション２０が有する声紋認証部２３は，電子図書館サイト３を利用するユーザを声紋認識できるように備えられた機能で，特徴量抽出部２２から引き渡された音声の声紋と、予め声紋認証部２３に登録されている声紋を照合し、声紋認証結果を特徴量抽出部２２に引き渡す処理を実行する。 The voiceprint authentication unit 23 included in the electronic library application 20 is a function provided so that a user who uses the electronic library site 3 can recognize a voiceprint. The voiceprint passed from the feature amount extraction unit 22 and a voiceprint authentication unit in advance. The voice print registered in 23 is collated, and the voice print authentication result is transferred to the feature amount extraction unit 22.

電子図書館アプリケーション２０が有する手続き実行部２１は，音声による電子図書館サイト３の操作を制御するために備えられた機能である。複数のＷｅｂページを有する電子図書館サイト３は，Ｗｅｂページを任意に切り替えられるように構成されているが，Ｗｅｂページを見ることができない視覚障害者にとって，Ｗｅｂページを任意に切り替えることは困難である。そこで，本実施形態に係る手続き実行部２１は，電子図書館サイト３の操作に係る手続きとその実行順序を記述した電子図書館操作フローに従い，電子図書館操作フローに含まれる手続きを順に実行するように構成され，手続きを実行する際，音声入力を開始する動作を検知すると，特徴量抽出部２２を作動させて，ユーザがマイク２４に入力した音声の音声認識結果を特徴量抽出部２２から取得した後，特徴量抽出部２２から取得した音声認識結果を利用して，この時点，すなわち，音声入力を開始する動作を検知した時点の手続きに対応する電子図書館サービスを呼び出し，呼び出した電子図書館サービスの実行結果を電子図書館サイト３から受信すると，音声合成サーバ５を利用して，電子図書館サイト３から受信した実行結果に対応する合成音声を生成し，この合成音声を音声出力する処理を実行する。 The procedure execution unit 21 included in the electronic library application 20 is a function provided for controlling the operation of the electronic library site 3 by voice. The electronic library site 3 having a plurality of Web pages is configured so that the Web pages can be arbitrarily switched. However, it is difficult for the visually impaired who cannot see the Web pages to arbitrarily switch the Web pages. . Therefore, the procedure execution unit 21 according to the present embodiment is configured to sequentially execute the procedures included in the electronic library operation flow according to the electronic library operation flow describing the procedure related to the operation of the electronic library site 3 and the execution order thereof. When the operation for starting the voice input is detected when the procedure is executed, the feature amount extraction unit 22 is operated, and the voice recognition result of the voice input by the user to the microphone 24 is acquired from the feature amount extraction unit 22. , Using the speech recognition result acquired from the feature quantity extraction unit 22, call the electronic library service corresponding to the procedure at this time, that is, the time when the operation of starting the voice input is detected, and execute the called electronic library service When the result is received from the electronic library site 3, the speech synthesis server 5 is used to correspond to the execution result received from the electronic library site 3. It generates a synthesized voice that, the synthesized speech executes processing for audio output.

図３は，本実施形態に係る電子図書館操作フローを説明する図である。図３で図示した電子図書館操作フローには，音声を利用して電子図書館サイト３を操作する手続きとして，資料を検索する電子図書館サービスを呼び出し，特徴量抽出部２２から取得した音声認識結果に適合する資料を検索する資料検索手続きＴ１，資料検索手続きＴ１の検索結果に含まれるテキストの合成音声を音声出力した後，特徴量抽出部２２から取得した音声認識結果に対応する資料を選択する資料選択手続きＴ２，特徴量抽出部２２から取得した音声認識結果をログインキーワードとし，電子図書館サイト３にログインする電子図書館サービスを呼び出し，電子図書館サイト３にログインするログイン手続きＴ４，資料を貸出す電子図書館サービスを呼び出し，資料選択手続きＴ２にて選択した資料を電子図書館サイト３から借りる貸出手続きＴ３，資料のコンテンツを提供する電子図書館サービスを呼び出し，貸出手続きＴ３にて電子図書館サイト３から借りた資料を再生する再生手続きＴ５，資料を返却する電子図書館サービスを呼び出し，資料選択手続きＴ２にて選択した資料を電子図書館サイト３へ返却する返却手続きＴ６が含まれる。 FIG. 3 is a diagram for explaining an electronic library operation flow according to the present embodiment. In the electronic library operation flow shown in FIG. 3, the electronic library service for retrieving materials is called as a procedure for operating the electronic library site 3 by using voice, and is adapted to the voice recognition result obtained from the feature amount extraction unit 22 The material selection which selects the material corresponding to the speech recognition result acquired from the feature amount extraction unit 22 after outputting the synthesized speech of the text included in the retrieval result of the material retrieval procedure T1 and the material retrieval procedure T1 The procedure T2, the speech recognition result acquired from the feature quantity extraction unit 22 as a login keyword, the electronic library service for logging in to the electronic library site 3 and the login procedure T4 for logging in to the electronic library site 3 And borrow the material selected in the document selection procedure T2 from the electronic library site 3 Lending procedure T3, calling the electronic library service that provides the contents of the material, calling the electronic library service for returning the material borrowed from the electronic library site 3 in the lending procedure T3, calling the electronic library service for returning the material, and selecting the material T2 A return procedure T6 for returning the material selected in step 1 to the electronic library site 3 is included.

図３で図示した電子図書館操作フローによれば，電子図書館アプリケーション２０が起動した後に，手続き実行部２１が実行する手続きは資料検索手続きＴ１になり，資料検索手続きＴ１にて資料を検索すると，手続き実行部２１が実行する手続きは資料選択手続きＴ２に遷移する。資料選択手続きＴ２にて，資料を選択すると，手続き実行部２１が実行する手続きは貸出手続きＴ３に遷移する。貸出手続きＴ３において，電子図書館サイト３にログインしているか確認され，電子図書館サイト３にログインしていない場合，手続き実行部２１が実行する手続きはログイン手続きＴ４に遷移する。また，貸出手続きＴ３において，電子図書館サイト３にログインしている場合，または，ログイン手続きＴ４において，電子図書館サイト３にログインした後，資料選択手続きＴ２にて選択した資料を電子図書館サイト３から借りると，手続き実行部２１が実行する手続きは再生手続きＴ５に遷移する。再生手続きＴ５が終了すると，手続き実行部２１が実行する手続きは資料検索手続きＴ１に戻る。また，資料検索手続きＴ１において，電子図書館サイト３に返却する資料が検索された場合，手続き実行部２１が実行する手続きは返却手続きＴ６に遷移し，選択された資料を電子図書館サイト３へ返却すると，手続き実行部２１が実行する手続きは資料検索手続きＴ１に戻る。 According to the electronic library operation flow shown in FIG. 3, after the electronic library application 20 is started, the procedure executed by the procedure execution unit 21 is the material search procedure T1, and when the material is searched by the material search procedure T1, The procedure executed by the execution unit 21 transitions to a material selection procedure T2. When a material is selected in the material selection procedure T2, the procedure executed by the procedure execution unit 21 transitions to a lending procedure T3. In the lending procedure T3, it is confirmed whether or not the electronic library site 3 is logged in. When the electronic library site 3 is not logged in, the procedure executed by the procedure execution unit 21 transitions to the login procedure T4. In addition, when logging in to the electronic library site 3 in the lending procedure T3 or after logging in to the electronic library site 3 in the login procedure T4, the materials selected in the document selecting procedure T2 are borrowed from the electronic library site 3. Then, the procedure executed by the procedure execution unit 21 transitions to the reproduction procedure T5. When the reproduction procedure T5 ends, the procedure executed by the procedure execution unit 21 returns to the material search procedure T1. In addition, when a material to be returned to the electronic library site 3 is searched in the material search procedure T1, the procedure executed by the procedure execution unit 21 shifts to the return procedure T6, and the selected material is returned to the electronic library site 3. The procedure executed by the procedure execution unit 21 returns to the material retrieval procedure T1.

なお，電子図書館アプリケーション２０を起動させる操作は，電子図書館アプリケーション２０のアイコンを選択（ダブルタップ）する操作になり，視覚障害者でも電子図書館アプリケーション２０を起動できるように，携帯端末２は，音声を使用したアプリケーション起動，また，タッチパネル２６に表示されている内容を読み上げるスクリーンリーダに対応していることが望ましい。 Note that the operation of starting the electronic library application 20 is an operation of selecting (double-tapping) the icon of the electronic library application 20, and the portable terminal 2 generates a voice so that a visually impaired person can also start the electronic library application 20. It is desirable to support a screen reader that reads the contents displayed on the touch panel 26 and activates the used application.

ここから，図３で図示した電子図書館操作フローに含まれる各手続きにて，電子図書館アプリケーション２０の手続き実行部２１が実行する処理について詳細に説明する。図４は，資料検索手続きＴ１において，手続き実行部２１が実行する処理を説明する図である。資料検索手続きＴ１において，電子図書館アプリケーション２０の手続き実行部２１は，まず，音声入力を開始する動作を受け付ける状態になり，音声入力を開始する動作を検知すると（Ｓ１），この時点，すなわち，音声入力を開始する動作を検知した時点に対応する音声ガイダンス（例えば，「キーワードを話してください。」）をスピーカ２７から音声出力した後（Ｓ２），音声認識を指定して特徴量抽出部２２を作動させる（Ｓ３）。 From here, the processing executed by the procedure execution unit 21 of the electronic library application 20 in each procedure included in the electronic library operation flow illustrated in FIG. 3 will be described in detail. FIG. 4 is a diagram illustrating processing executed by the procedure execution unit 21 in the material search procedure T1. In the document retrieval procedure T1, the procedure execution unit 21 of the electronic library application 20 first enters a state of accepting an operation for starting voice input, and when an operation for starting voice input is detected (S1), at this time, that is, a voice. After voice guidance corresponding to the time point at which an input start operation is detected (for example, “Speak a keyword”) is output from the speaker 27 (S2), voice recognition is designated and the feature amount extraction unit 22 is set. Operate (S3).

音声入力を開始する動作は，携帯端末２のタッチパネル２６の中央部にボタンオブジェクトを表示し，このボタンオブジェクトを選択（タップ）する動作とすることもできるが，音声入力を開始する動作を，携帯端末２を振る動作とすることが好適である。音声入力を開始する動作を，携帯端末２を振る動作とし，音声入力を開始する動作を検知すると音声ガイダンスを音声出力するように構成することで，ユーザは，携帯端末２を振るだけで，電子図書館アプリケーション２０の手続き実行部２１が実行している現時点の手続きを把握できる。なお，携帯端末２を振る動作は，携帯端末２のモーションセンサ２５を利用して検知できる。 The operation for starting voice input can be an operation in which a button object is displayed at the center of the touch panel 26 of the portable terminal 2 and this button object is selected (tapped). It is preferable that the terminal 2 be shaken. The operation for starting the voice input is the operation for shaking the mobile terminal 2, and when the operation for starting the voice input is detected, the voice guidance is output by voice. The current procedure being executed by the procedure execution unit 21 of the library application 20 can be grasped. Note that the motion of shaking the mobile terminal 2 can be detected using the motion sensor 25 of the mobile terminal 2.

電子図書館アプリケーション２０の特徴量抽出部２２は，ユーザがマイク２４に音声を入力すると，ユーザがマイク２４に入力した音声の特徴量を抽出し，抽出した特徴量を音声認識サーバ４に送信した後，音声認識サーバ４から受信した音声認識結果を手続き実行部２１に引き渡し，手続き実行部２１は，ユーザがマイク２４に入力した音声の音声認識結果を取得する（Ｓ４）。 When the user inputs voice to the microphone 24, the feature quantity extraction unit 22 of the electronic library application 20 extracts the voice feature quantity input to the microphone 24 by the user and transmits the extracted feature quantity to the voice recognition server 4. The voice recognition result received from the voice recognition server 4 is transferred to the procedure execution unit 21, and the procedure execution unit 21 acquires the voice recognition result of the voice input by the user to the microphone 24 (S4).

電子図書館アプリケーション２０の手続き実行部２１は，特徴量抽出部２２から音声認識結果が引き渡されると，音声認識結果に従い処理を分岐する（Ｓ５）。特徴量抽出部２２から引き渡された音声認識結果が，予め手続き実行部２１に登録されているカテゴリーの場合，手続き実行部２１は，特徴量抽出部２２から引き渡された音声認識結果をカテゴリーとし，このカテゴリーに適合する資料を検索する電子図書館サービスを呼び出して（Ｓ６），このカテゴリーに適合する資料の検索結果をこの電子図書館サービスの実行結果として電子図書館サイト３から取得する（Ｓ７）。そして，電子図書館アプリケーション２０の手続き実行部２１は，電子図書館サービスの実行結果に対応する合成音声として，音声合成サーバ５を利用して，資料の検索結果に含まれる資料の件数を通知する合成音声を生成し，この合成音声をスピーカ２７から音声出力して（Ｓ１０），手続き実行部２１が実行する手続きを資料選択手続きＴ２に遷移させる。 When the procedure execution unit 21 of the electronic library application 20 receives the speech recognition result from the feature amount extraction unit 22, the procedure execution unit 21 branches the process according to the speech recognition result (S5). When the speech recognition result delivered from the feature quantity extraction unit 22 is a category registered in advance in the procedure execution unit 21, the procedure execution unit 21 sets the speech recognition result delivered from the feature quantity extraction unit 22 as a category, An electronic library service that searches for materials suitable for this category is called (S6), and a search result for materials suitable for this category is acquired from the electronic library site 3 as an execution result of this electronic library service (S7). Then, the procedure execution unit 21 of the electronic library application 20 uses the speech synthesis server 5 as the synthesized speech corresponding to the execution result of the electronic library service to notify the number of materials included in the material search result. And the synthesized speech is output from the speaker 27 (S10), and the procedure executed by the procedure execution unit 21 is shifted to the material selection procedure T2.

また，特徴量抽出部２２から引き渡された音声認識結果が，予め手続き実行部２１に登録されているカテゴリー以外の場合，電子図書館アプリケーション２０の手続き実行部２１は，特徴量抽出部２２から引き渡された音声認識結果を検索キーワードとし，この検索キーワードに適合する資料を検索する電子図書館サービスを呼び出して（Ｓ８），この検索キーワードに適合する資料の検索結果をこの電子図書館サービスの実行結果として電子図書館サイト３から取得する（Ｓ９）。そして，電子図書館アプリケーション２０の手続き実行部２１は，電子図書館サービスの実行結果に対応する合成音声として，音声合成サーバ５を利用して，資料の検索結果に含まれる資料の件数を通知する合成音声を生成し，スピーカ２７から合成音声を音声出力して（Ｓ１０），手続き実行部２１が実行する手続きを資料選択手続きＴ２に遷移させる。 If the speech recognition result delivered from the feature quantity extraction unit 22 is not a category registered in advance in the procedure execution unit 21, the procedure execution unit 21 of the electronic library application 20 is delivered from the feature quantity extraction unit 22. The electronic library service for retrieving the material that matches the search keyword is called as a search keyword (S8), and the search result of the material that matches the search keyword is used as the execution result of the electronic library service. Obtained from the site 3 (S9). Then, the procedure execution unit 21 of the electronic library application 20 uses the speech synthesis server 5 as the synthesized speech corresponding to the execution result of the electronic library service to notify the number of materials included in the material search result. Is generated, the synthesized speech is output from the speaker 27 (S10), and the procedure executed by the procedure execution unit 21 is shifted to the material selection procedure T2.

図５は，電子図書館サイト３のトップページ６を説明する図である。図５で図示したトップページ６は，ページ移動することなく表示内容を変更するタブとして，３つのタブ，「新着」，「ランキング」，「マイページ」が含まれる。「新着」のタブ６ａは，所定期間内に出版された新着を表示するタブで，「ランキング」のタブ６ｂは，所定期間内で貸出件数の多い資料をランキング形式で表示するタブで，「マイページ」のタブ６ｃは，ユーザに貸出している資料を少なくとも表示するタブである。例えば，音声認識結果が一致するカテゴリーが「新着」の場合，手続き実行部２１は，「新着」のタブ６ａが選択されたときの表示内容（所定期間内に出版された新着のリスト）を検索結果として電子図書館サイト３から取得する。また，図５で図示したトップページ６は，検索キーワードを入力する入力フォーム６ｄとキーワード検索を実行するボタン６ｅを有し，例えば，音声認識結果が「歴史」の場合，手続き実行部２１は，入力フォーム６ｄに「歴史」が入力されたときのキーワード検索の検索結果を電子図書館サイト３から取得する。 FIG. 5 is a diagram illustrating the top page 6 of the electronic library site 3. The top page 6 illustrated in FIG. 5 includes three tabs, “New Arrival”, “Ranking”, and “My Page”, as tabs for changing the display contents without moving the page. The “New Arrival” tab 6a is a tab for displaying new arrivals published within a predetermined period, and the “Ranking” tab 6b is a tab for displaying materials with a large number of loans in a predetermined period in a ranking format. The “page” tab 6c is a tab for displaying at least materials lent to the user. For example, when the category whose voice recognition result matches is “new arrival”, the procedure execution unit 21 searches the display contents (list of new arrivals published within a predetermined period) when the “new arrival” tab 6a is selected. As a result, it is obtained from the electronic library site 3. The top page 6 shown in FIG. 5 has an input form 6d for inputting a search keyword and a button 6e for executing a keyword search. For example, when the speech recognition result is “history”, the procedure execution unit 21 The search result of the keyword search when “history” is input to the input form 6 d is acquired from the electronic library site 3.

なお，図５において，「マイページ」のタブ６ｃは，ユーザが電子図書館サイト３にログインしている状態でのみ表示されるタブである。電子図書館サイト３にログインしている状態で，音声認識結果が一致するカテゴリーが「マイページ」の場合，手続き実行部２１は，「マイページ」のタブ６ｃが選択されたときの表示内容を検索結果として電子図書館サイト３から取得する。 In FIG. 5, the “My Page” tab 6 c is a tab that is displayed only when the user is logged in to the electronic library site 3. If the category with the same voice recognition result is “My Page” while logged in to the electronic library site 3, the procedure execution unit 21 searches the display contents when the “My Page” tab 6c is selected. As a result, it is obtained from the electronic library site 3.

図６は，資料選択手続きＴ２において，手続き実行部２１が実行する処理を説明する図である。資料選択手続きＴ２において，電子図書館アプリケーション２０の手続き実行部２１は，資料検索手続きＴ１にて電子図書館サイト３から取得した検索結果に含まれるテキストを音声合成サーバ５に送信して，検索結果に含まれるテキストの合成音声を音声合成サーバ５から取得した後，音声合成サーバ５から取得した合成音声をスピーカ２７から音声出力する（Ｓ２０）。 FIG. 6 is a diagram for explaining processing executed by the procedure execution unit 21 in the material selection procedure T2. In the document selection procedure T2, the procedure execution unit 21 of the electronic library application 20 transmits the text included in the search result acquired from the electronic library site 3 in the document search procedure T1 to the speech synthesis server 5, and is included in the search result. After the synthesized speech of the text to be obtained is acquired from the speech synthesis server 5, the synthesized speech acquired from the speech synthesis server 5 is output from the speaker 27 (S20).

電子図書館アプリケーション２０の手続き実行部２１は，電子図書館サイト３から取得した検索結果を音声出力すると，音声入力を開始する動作を受け付ける状態になり，音声入力を開始する動作を検知すると（Ｓ２１），この時点に対応する音声ガイダンス（例えば，「選択する資料のタイトルまたは資料の番号を話してください。」）をスピーカ２７から音声出力した後（Ｓ２２），音声認識を指定して特徴量抽出部２２を作動させる（Ｓ２３）。 When the procedure execution unit 21 of the electronic library application 20 outputs the search result acquired from the electronic library site 3 by voice, the procedure execution unit 21 enters a state of accepting an operation for starting voice input, and detects the operation for starting voice input (S21). A voice guidance corresponding to this time point (for example, “speak the title of the material to be selected or the number of the material to be selected”) is output from the speaker 27 (S22), then voice recognition is designated and the feature amount extraction unit 22 is specified. Is operated (S23).

電子図書館アプリケーション２０の特徴量抽出部２２は，ユーザがマイク２４に音声を入力すると，ユーザがマイク２４に入力した音声の特徴量を抽出し，抽出した特徴量を音声認識サーバ４に送信した後，音声認識サーバ４から受信した音声認識結果を手続き実行部２１に引き渡し，手続き実行部２１は，ユーザがマイク２４に入力した音声の音声認識結果を取得する（Ｓ２４）。 When the user inputs voice to the microphone 24, the feature quantity extraction unit 22 of the electronic library application 20 extracts the voice feature quantity input to the microphone 24 by the user and transmits the extracted feature quantity to the voice recognition server 4. The voice recognition result received from the voice recognition server 4 is delivered to the procedure execution unit 21, and the procedure execution unit 21 acquires the voice recognition result of the voice input by the user to the microphone 24 (S24).

電子図書館アプリケーション２０の手続き実行部２１は，特徴量抽出部２２から音声認識結果が引き渡されると，音声認識結果に従い処理を分岐する（Ｓ２５）。特徴量抽出部２２から引き渡された音声認識結果が，資料のタイトルまたは資料の番号の場合，手続き実行部２１は，音声認識結果（ここでは，タイトルまたは番号）で特定される資料を選択状態にし（Ｓ２８），手続き実行部２１が実行する手続きを貸出手続きＴ３に遷移させる。 When the procedure execution unit 21 of the electronic library application 20 receives the speech recognition result from the feature amount extraction unit 22, the procedure execution unit 21 branches the process according to the speech recognition result (S25). When the speech recognition result delivered from the feature quantity extraction unit 22 is a document title or document number, the procedure execution unit 21 selects a document identified by the speech recognition result (in this case, the title or number). (S28), the procedure executed by the procedure execution unit 21 is shifted to the lending procedure T3.

また，電子図書館アプリケーション２０の手続き実行部２１は，特徴量抽出部２２から引き渡された音声認識結果が，資料のタイトルまたは資料の番号の以外の場合，特徴量抽出部２２から引き渡された音声認識結果が，手続き実行部２１に予め登録されている並び替えに係る単語であるか確認し，特徴量抽出部２２から引き渡された音声認識結果が，並び替えに係る単語の場合，音声認識結果に対応する並び替えを実行する電子図書館サービスを呼び出し（Ｓ２６），並び替え後の検索結果を電子図書館サイト３から取得して（Ｓ２７），図６の先頭に戻る。また，特徴量抽出部２２から引き渡された音声認識結果が，上述の単語以外の場合，手続き実行部２１が実行する手続きを資料検索手続きＴ１に遷移させる。 Further, the procedure execution unit 21 of the electronic library application 20 recognizes the speech recognition delivered from the feature quantity extraction unit 22 when the speech recognition result delivered from the feature quantity extraction unit 22 is other than the material title or the material number. It is confirmed whether the result is a word related to rearrangement registered in advance in the procedure execution unit 21. If the speech recognition result delivered from the feature amount extraction unit 22 is a word related to rearrangement, The electronic library service that executes the corresponding rearrangement is called (S26), the search result after the rearrangement is acquired from the electronic library site 3 (S27), and the process returns to the top of FIG. If the speech recognition result delivered from the feature quantity extraction unit 22 is other than the above word, the procedure executed by the procedure execution unit 21 is shifted to the material search procedure T1.

図７は，資料の検索結果を表示する電子図書館サイト３のページ７を説明する図である。図７で図示したページ６では，「歴史」が検索キーワードとして入力され，「歴史」に適合する資料のリスト７ａが検索結果として表示され，検索結果には，検索結果に含まれる資料毎に，資料の表紙画像，資料のタイトル，資料の著者名および資料の出版日が含まれる。また，図７で図示したページ７には，検索結果を並び替えるときに選択するボタンオブジェクト７ｂが含まれ，このボタンオブジェクト７ｂを選択することで，検索結果の並び替えとして，新着順，名昇順，名降順のいずれかを選択できるようになっている。 FIG. 7 is a diagram for explaining the page 7 of the electronic library site 3 that displays the search result of the material. In the page 6 illustrated in FIG. 7, “history” is input as a search keyword, and a list 7 a of materials matching “history” is displayed as a search result. The search result includes, for each material included in the search result, Includes the cover image of the material, the title of the material, the name of the author of the material, and the date of publication of the material. Further, the page 7 illustrated in FIG. 7 includes a button object 7b to be selected when the search results are rearranged. By selecting this button object 7b, the search results are rearranged in the new arrival order, ascending order. , You can choose either descending order.

電子図書館アプリケーション２０の手続き実行部２１は，資料の検索結果として，資料のリスト７ａの内容を取得し，資料のリスト７ａに含まれるテキストを読み上げることになる。また，音声認識結果が歴史書２のタイトルの場合，手続き実行部２１は，歴史書２を選択状態にする。また，音声認識結果が新着順の場合，手続き実行部２１は，資料の検索結果を新着順に並び替える電子図書館サービスを呼び出す。 The procedure execution unit 21 of the electronic library application 20 acquires the contents of the material list 7a as a result of the material search, and reads out the text included in the material list 7a. When the speech recognition result is the title of the history book 2, the procedure execution unit 21 puts the history book 2 in a selected state. If the speech recognition results are in the order of arrival, the procedure execution unit 21 calls an electronic library service that rearranges the search results of the materials in the order of arrival.

図８は，貸出手続きＴ３において，手続き実行部２１が実行する処理を説明する図である。貸出手続きＴ３において，電子図書館アプリケーション２０の手続き実行部２１は，まず，電子図書館サイト３にログインしているか否かにより処理を分岐する（Ｓ３０）。電子図書館アプリケーション２０の手続き実行部２１は，電子図書館サイト３へのログイン状況を内部情報として管理し，この内部情報を参照して，電子図書館サイト３にログインしているか否かを判断する。 FIG. 8 is a diagram for explaining processing executed by the procedure execution unit 21 in the lending procedure T3. In the lending procedure T3, the procedure execution unit 21 of the electronic library application 20 first branches the process depending on whether or not the electronic library site 3 is logged in (S30). The procedure execution unit 21 of the electronic library application 20 manages the login status to the electronic library site 3 as internal information, and refers to the internal information to determine whether or not the electronic library site 3 is logged in.

電子図書館アプリケーション２０の手続き実行部２１は，電子図書館サイト３にログインしていなければ，手続き実行部２１が実行する手続きをログイン手続きＴ４に遷移させ，ログイン手続きＴ４に係る処理を実行する。また，手続き実行部２１は，電子図書館サイト３にログインしていれば，資料の貸出状況を提供する電子図書館サービスを呼び出し（Ｓ３１），資料選択手続きＴ２にて選択状態にした資料の貸出状況を電子図書館サイト３から取得し（Ｓ３２），選択状態にした資料をユーザが既に借りているか否かにより処理を分岐させる（Ｓ３３）。 If the procedure execution unit 21 of the electronic library application 20 is not logged in to the electronic library site 3, the procedure executed by the procedure execution unit 21 is shifted to the login procedure T4, and the process related to the login procedure T4 is executed. Also, if the procedure execution unit 21 is logged in to the electronic library site 3, the procedure execution unit 21 calls an electronic library service that provides the lending status of the material (S31), and displays the lending status of the material selected in the material selection procedure T2. The processing branches depending on whether the user has already borrowed the material acquired from the electronic library site 3 (S32) and selected (S33).

選択状態にした資料の貸出状況により，選択状態にした資料が貸出されており，かつ，この資料を借りている者がユーザ自身の場合，電子図書館アプリケーション２０の手続き実行部２１は，手続き実行部２１が実行する手続きを再生手続きＴ５に遷移させる。 If the selected material is lent out according to the lending status of the selected material, and the person who borrows this material is the user himself, the procedure execution unit 21 of the electronic library application 20 is the procedure execution unit. The procedure executed by 21 is shifted to the reproduction procedure T5.

選択状態にした資料をユーザが借りていない場合，電子図書館アプリケーション２０の手続き実行部２１は，選択状態にした資料が貸出可能か否かにより処理を分岐させる（Ｓ３４）。選択状態にした資料が貸出可能な場合，電子図書館アプリケーション２０の手続き実行部２１は，音声入力を開始する動作を受け付ける状態になり，音声入力を開始する動作を検知すると（Ｓ４２），この時点に対応する音声ガイダンス（例えば，「資料を借りますか。」）をスピーカ２７から音声出力した後（Ｓ４３），音声認識を指定して特徴量抽出部２２を作動させる（Ｓ４４）。 If the user has not borrowed the selected material, the procedure execution unit 21 of the electronic library application 20 branches the process depending on whether the selected material can be lent (S34). When the selected material can be lent, the procedure execution unit 21 of the electronic library application 20 enters a state of accepting an operation for starting voice input, and when detecting the operation for starting voice input (S42), at this time point After corresponding voice guidance (for example, “Do you borrow material?”) Is output from the speaker 27 (S43), voice recognition is designated and the feature amount extraction unit 22 is activated (S44).

特徴量抽出部２２は，ユーザがマイク２４に音声を入力すると，ユーザがマイク２４に入力した音声の特徴量を抽出し，抽出した特徴量を音声認識サーバ４に送信した後，音声認識サーバ４から受信した音声認識結果を手続き実行部２１に引き渡し，手続き実行部２１は，ユーザがマイク２４に入力した音声の音声認識結果を取得する（Ｓ４５）。 When the user inputs voice to the microphone 24, the feature quantity extraction unit 22 extracts the feature quantity of the voice input to the microphone 24 by the user, transmits the extracted feature quantity to the voice recognition server 4, and then the voice recognition server 4. The procedure execution unit 21 acquires the speech recognition result of the speech input by the user to the microphone 24 (S45).

電子図書館アプリケーション２０の手続き実行部２１は，ユーザがマイク２４に入力した音声の音声認識結果により処理を分岐させる（Ｓ４６）。電子図書館アプリケーション２０の手続き実行部２１は，音声認識結果が肯定を示す単語（例えば，「はい」）の場合，資料を貸し出す電子図書館サービスを呼び出し（Ｓ４７），資料を貸し出す電子図書館サービスの実行結果を電子図書館サイト３から受信すると，電子図書館サイト３から借りた資料，すなわち，資料選択手続きＴ２で選択状態にした資料のタイトルを音声合成サーバ５に送信して，資料のタイトルの合成音声を音声合成サーバ５から取得した後，音声合成サーバ５から取得した合成音声をスピーカ２７から音声出力し（Ｓ４８），手続き実行部２１が実行する手続きを再生手続きＴ５に遷移させる。なお，音声認識結果が肯定を示す単語でない場合，手続き実行部２１は，手続き実行部２１が実行する手続きを資料検索手続きＴ１に遷移させる。 The procedure execution unit 21 of the electronic library application 20 branches the process according to the voice recognition result of the voice input by the user to the microphone 24 (S46). If the speech recognition result is a positive word (for example, “Yes”), the procedure execution unit 21 of the electronic library application 20 calls the electronic library service that lends the material (S47), and the execution result of the electronic library service that lends the material. Is received from the electronic library site 3, the title of the material borrowed from the electronic library site 3, that is, the title of the material selected in the material selection procedure T 2 is transmitted to the speech synthesis server 5, and the synthesized speech of the material title is voiced. After obtaining from the synthesis server 5, the synthesized voice obtained from the voice synthesis server 5 is output from the speaker 27 (S48), and the procedure executed by the procedure execution unit 21 is shifted to the reproduction procedure T5. If the speech recognition result is not a word indicating affirmation, the procedure execution unit 21 transitions the procedure executed by the procedure execution unit 21 to the material search procedure T1.

選択状態にした資料が貸出不可の場合，電子図書館アプリケーション２０の手続き実行部２１は，音声入力を開始する動作を受け付ける状態になり，音声入力を開始する動作を検知すると（Ｓ３５），この時点に対応する音声ガイダンス（例えば，「資料を予約しますか。」）をスピーカ２７から音声出力した後（Ｓ３６），音声認識を指定して特徴量抽出部２２を作動させる（Ｓ３７）。 When the selected material cannot be lent out, the procedure execution unit 21 of the electronic library application 20 enters a state of accepting an operation for starting voice input, and when an operation for starting voice input is detected (S35), at this time point The corresponding voice guidance (for example, “Do you reserve material?”) Is output from the speaker 27 (S36), and then the feature recognition unit 22 is activated by specifying voice recognition (S37).

電子図書館アプリケーション２０の特徴量抽出部２２は，ユーザがマイク２４に音声を入力すると，ユーザがマイク２４に入力した音声の特徴量を抽出し，抽出した特徴量を音声認識サーバ４に送信した後，音声認識サーバ４から受信した音声認識結果を手続き実行部２１に引き渡し，手続き実行部２１は，ユーザがマイク２４に入力した音声の音声認識結果を取得する（Ｓ３８）。 When the user inputs voice to the microphone 24, the feature quantity extraction unit 22 of the electronic library application 20 extracts the voice feature quantity input to the microphone 24 by the user and transmits the extracted feature quantity to the voice recognition server 4. The voice recognition result received from the voice recognition server 4 is transferred to the procedure execution unit 21, and the procedure execution unit 21 acquires the voice recognition result of the voice input by the user to the microphone 24 (S38).

電子図書館アプリケーション２０の手続き実行部２１は，ユーザがマイク２４に入力した音声の音声認識結果により処理を分岐させる（Ｓ３９）。電子図書館アプリケーション２０の手続き実行部２１は，音声認識結果が肯定を示す単語の場合，資料を予約する電子図書館サービスを呼び出し（Ｓ４０），資料を予約する電子図書館サービスの実行結果を電子図書館サイト３から受信すると，電子図書館サイト３に予約した資料，すなわち，資料選択手続きＴ２で選択状態にした資料のタイトルを音声合成サーバ５に送信して，資料のタイトルの合成音声を音声合成サーバ５から取得した後，音声合成サーバ５から取得した合成音声をスピーカ２７から音声出力し（Ｓ４１），手続き実行部２１が実行する手続きを資料検索手続きＴ１に遷移させる。なお，音声認識結果が肯定を示す単語でない場合，手続き実行部２１は，手続き実行部２１が実行する手続きを資料検索手続きＴ１に遷移させる。 The procedure execution unit 21 of the electronic library application 20 branches the process according to the voice recognition result of the voice input to the microphone 24 by the user (S39). When the speech recognition result is a word indicating affirmation, the procedure execution unit 21 of the electronic library application 20 calls the electronic library service for reserving the material (S40), and the execution result of the electronic library service for reserving the material is stored in the electronic library site 3 Is received from the speech synthesis server 5 by transmitting to the speech synthesis server 5 the title of the material reserved in the electronic library site 3, that is, the title of the material selected in the material selection procedure T2. After that, the synthesized speech acquired from the speech synthesis server 5 is output from the speaker 27 (S41), and the procedure executed by the procedure execution unit 21 is shifted to the material retrieval procedure T1. If the speech recognition result is not a word indicating affirmation, the procedure execution unit 21 transitions the procedure executed by the procedure execution unit 21 to the material search procedure T1.

図９は，ログイン手続きＴ４において，手続き実行部２１が実行する処理を説明する図である。ログイン手続きＴ４において，電子図書館アプリケーション２０の手続き実行部２１は，音声入力を開始する動作を受け付ける状態になり，音声入力を開始する動作を検知すると（Ｓ５０），この時点に対応する音声ガイダンス（例えば，「ログインパスワードを話してください。」）をスピーカ２７から音声出力した後（Ｓ５１），音声認識と声紋認証を指定して特徴量抽出部２２を作動させる（Ｓ５２）。 FIG. 9 is a diagram for explaining processing executed by the procedure execution unit 21 in the login procedure T4. In the login procedure T4, the procedure execution unit 21 of the electronic library application 20 is in a state of accepting an operation for starting voice input. When the operation for starting voice input is detected (S50), voice guidance corresponding to this time (for example, , "Please tell me your login password") from the speaker 27 (S51), and then designates voice recognition and voiceprint authentication to activate the feature quantity extraction unit 22 (S52).

電子図書館アプリケーション２０の特徴量抽出部２２は，ユーザがマイク２４に音声を入力すると，ユーザがマイク２４に入力した音声の特徴量を抽出し，抽出した特徴量を音声認識サーバ４に送信した後，音声認識結果を音声認識サーバ４から受信する。また，特徴量抽出部２２は，ユーザがマイク２４に入力した音声を声紋認識部に引き渡し，声紋認証結果を声紋認証部２３から受信する。そして，特徴量抽出部２２は，音声認証結果と声紋認証結果を手続き実行部２１に引き渡し，手続き実行部２１は，音声認識結果と声紋認証結果を取得する（Ｓ５３）。 When the user inputs voice to the microphone 24, the feature quantity extraction unit 22 of the electronic library application 20 extracts the voice feature quantity input to the microphone 24 by the user and transmits the extracted feature quantity to the voice recognition server 4. The voice recognition result is received from the voice recognition server 4. The feature amount extraction unit 22 passes the voice input by the user to the microphone 24 to the voiceprint recognition unit, and receives the voiceprint authentication result from the voiceprint authentication unit 23. Then, the feature amount extraction unit 22 delivers the voice authentication result and the voice print authentication result to the procedure execution unit 21, and the procedure execution unit 21 acquires the voice recognition result and the voice print authentication result (S53).

電子図書館アプリケーション２０の手続き実行部２１は，特徴量抽出部２２から音声認識結果と声紋認証結果が引き渡しされると，まず，音声認識結果に従い処理を分岐する（Ｓ５４）。手続き実行部２１は，特徴量抽出部２２から引き渡された音声認識結果が，予め手続き実行部２１に登録されているログインパスワードと一致しない場合，ログインパスワードエラーを通知する音声ガイダンスをスピーカ２７から音声出力した後（Ｓ５５），図９の先頭に戻る。 When the speech recognition result and the voiceprint authentication result are delivered from the feature amount extraction unit 22, the procedure execution unit 21 of the electronic library application 20 first branches the process according to the speech recognition result (S54). If the voice recognition result delivered from the feature quantity extraction unit 22 does not match the login password registered in advance in the procedure execution unit 21, the procedure execution unit 21 provides voice guidance for notifying a login password error from the speaker 27. After outputting (S55), the process returns to the top of FIG.

また，特徴量抽出部２２から引き渡された音声認識結果が，予め手続き実行部２１に登録されているログインパスワードと一致する場合，電子図書館アプリケーション２０の手続き実行部２１は，特徴量抽出部２２から引き渡しされた声紋認証結果に従い処理を分岐する（Ｓ５６）。手続き実行部２１は，特徴量抽出部２２から引き渡しされた声紋認証結果によって声紋認証の失敗が示される場合，声紋認証エラーを通知する音声ガイダンスをスピーカ２７から音声出力した後（Ｓ５７），図９の先頭に戻る。 If the speech recognition result delivered from the feature quantity extraction unit 22 matches the login password registered in advance in the procedure execution unit 21, the procedure execution unit 21 of the electronic library application 20 receives the feature quantity extraction unit 22 from the feature quantity extraction unit 22. The process branches in accordance with the voiceprint authentication result delivered (S56). When the voiceprint authentication result delivered from the feature quantity extraction unit 22 indicates a voiceprint authentication failure, the procedure execution unit 21 outputs a voice guidance for notifying a voiceprint authentication error from the speaker 27 (S57), and FIG. Return to the top of.

電子図書館アプリケーション２０の手続き実行部２１は，特徴量抽出部２２から引き渡しされた声紋認証結果によって声紋認証の成功が示される場合，手続き実行部２１に登録されているユーザＩＤとログインパスワードとなる音声認証結果をパラメータとして，電子図書館サイト３にログインする電子図書館サービスを呼び出し（Ｓ５８），電子図書館サイト３にログインする電子図書館サービスの実行結果を電子図書館サイト３から取得すると，ユーザＩＤが電子図書館サイト３にログインしたことを通知するテキストを音声合成サーバ５に送信して，このテキストの合成音声を音声合成サーバ５から取得した後，音声合成サーバ５から取得した合成音声をスピーカ２７から音声出力して（Ｓ５９），手続き実行部２１が実行する手続きを貸出手続きＴ３に遷移させる。 The procedure execution unit 21 of the electronic library application 20, when the voice print authentication success is indicated by the voice print authentication result delivered from the feature amount extraction unit 22, the voice that becomes the user ID and login password registered in the procedure execution unit 21. When the authentication result is used as a parameter, an electronic library service for logging in to the electronic library site 3 is called (S58), and the execution result of the electronic library service for logging in to the electronic library site 3 is acquired from the electronic library site 3. 3 is transmitted to the speech synthesis server 5 and the synthesized speech of this text is acquired from the speech synthesis server 5, and then the synthesized speech acquired from the speech synthesis server 5 is output from the speaker 27. (S59), the procedure executed by the procedure execution unit 21 To transition to the lending procedures T3.

図１０は，再生手続きＴ５において，手続き実行部２１が実行する処理を説明する図である。再生手続きＴ５において，電子図書館アプリケーション２０の手続き実行部２１は，音声入力を開始する動作を受け付ける状態になり，音声入力を開始する動作を検知すると（Ｓ６０），この時点に対応する音声ガイダンス（例えば，「借りている資料を読みますか。」）をスピーカ２７から出力した後（Ｓ６１），音声認識を指定して特徴量抽出部２２を作動させる（Ｓ６２）。 FIG. 10 is a diagram for explaining processing executed by the procedure execution unit 21 in the reproduction procedure T5. In the playback procedure T5, the procedure execution unit 21 of the electronic library application 20 is in a state of accepting an operation for starting voice input. When the operation for starting voice input is detected (S60), voice guidance corresponding to this time (for example, , “Do you want to read the borrowed material?” Is output from the speaker 27 (S61), the voice recognition is designated and the feature amount extraction unit 22 is activated (S62).

電子図書館アプリケーション２０の特徴量抽出部２２は，ユーザがマイク２４に音声を入力すると，ユーザがマイク２４に入力した音声の特徴量を抽出し，抽出した特徴量を音声認識サーバ４に送信した後，音声認識サーバ４から受信した音声認識結果を手続き実行部２１に引き渡し，手続き実行部２１は，ユーザがマイク２４に入力した音声の音声認識結果を取得する（Ｓ６３）。 When the user inputs voice to the microphone 24, the feature quantity extraction unit 22 of the electronic library application 20 extracts the voice feature quantity input to the microphone 24 by the user and transmits the extracted feature quantity to the voice recognition server 4. The voice recognition result received from the voice recognition server 4 is delivered to the procedure execution unit 21, and the procedure execution unit 21 acquires the voice recognition result of the voice input by the user to the microphone 24 (S63).

電子図書館アプリケーション２０の手続き実行部２１は，ユーザがマイク２４に入力した音声の音声認識結果により処理を分岐させる（Ｓ６４）。電子図書館アプリケーション２０の手続き実行部２１は，音声認識結果が肯定を示す単語でない場合，電子図書館操作フローの状態を資料検索手続きＴ１に遷移させる。また，音声認識結果が肯定を示す単語の場合，手続き実行部２１は，資料のコンテンツを提供する電子図書館サービスを呼び出し（Ｓ６５），資料のコンテンツを提供する電子図書館サービスの実行結果として，電子図書館サイト３から借りた資料のコンテンツを電子図書館サイト３から取得する（Ｓ６６）。 The procedure execution unit 21 of the electronic library application 20 branches the process according to the voice recognition result of the voice input to the microphone 24 by the user (S64). If the speech recognition result is not a word indicating affirmation, the procedure execution unit 21 of the electronic library application 20 changes the state of the electronic library operation flow to the material search procedure T1. If the speech recognition result is a word indicating affirmation, the procedure execution unit 21 calls the electronic library service that provides the content of the material (S65), and the electronic library service is executed as the execution result of the electronic library service that provides the content of the material. The contents of the materials borrowed from the site 3 are acquired from the electronic library site 3 (S66).

電子図書館アプリケーション２０の手続き実行部２１は，資料のコンテンツを電子図書館サイト３から取得すると，電子図書館サイト３から取得した資料のコンテンツに含まれるテキストを音声合成サーバ５へ送信して，このテキストに対応する合成音声を取得し，音声合成サーバ５から取得した合成音声をスピーカ２７から音声出力することで，電子図書館サイト３から借りた資料を再生する（Ｓ６７）。なお，手続き実行部２１は，電子図書館サイト３から取得した資料の再生は，所定のページ数単位で行い，手続き実行部２１は，資料の再生終了を実行すると，手続き実行部２１が実行する手続きを資料検索手続きＴ１に遷移させる。 When the procedure execution unit 21 of the electronic library application 20 acquires the content of the material from the electronic library site 3, the procedure execution unit 21 transmits the text included in the content of the material acquired from the electronic library site 3 to the speech synthesis server 5, The corresponding synthesized voice is acquired, and the synthesized voice acquired from the voice synthesis server 5 is output from the speaker 27 as a voice to reproduce the material borrowed from the electronic library site 3 (S67). The procedure execution unit 21 reproduces the material acquired from the electronic library site 3 in units of a predetermined number of pages. The procedure execution unit 21 executes the procedure executed by the procedure execution unit 21 when the reproduction of the material is completed. Is shifted to the document retrieval procedure T1.

電子図書館アプリケーション２０の手続き実行部２１は，音声により資料の再生を操作できる機能に対応していることが好適である。電子図書館アプリケーション２０の手続き実行部２１は，音声により資料の再生を操作できる機能に対応している場合，電子図書館アプリケーション２０の手続き実行部２１は，資料を再生している間，ユーザの音声を受け付ける状態になり，ユーザがマイク２４に入力した単語に対応する再生操作を実施する。なお，資料の再生操作としては，再生終了，一時停止および再生再開が考えられる。 The procedure execution unit 21 of the electronic library application 20 preferably corresponds to a function capable of operating the reproduction of the material by voice. When the procedure execution unit 21 of the electronic library application 20 supports a function capable of operating the reproduction of the material by voice, the procedure execution unit 21 of the electronic library application 20 receives the user's voice while reproducing the material. It will be in the state which receives and performs reproduction | regeneration operation corresponding to the word which the user input into the microphone 24. FIG. It should be noted that the playback operation of the material can be the end of playback, pause and restart of playback.

図１１は，資料返却手続きにおいて，手続き実行部２１が実行する処理を説明する図である。図４において，「マイページ」のタブは，ユーザが電子図書館サイト３にログインしている状態でのみ有効なタブである。電子図書館サイト３にログインしている状態で，上述の資料検索手続きＴ１が実行され，音声認識結果が一致するカテゴリーが「マイページ」の場合，電子図書館アプリケーション２０の手続き実行部２１は，「マイページ」のタブが選択されたときの表示内容を検索結果として電子図書館サイト３から取得し，図１１で図示した返却手続きＴ６に係る処理が実行可能になる。 FIG. 11 is a diagram illustrating processing executed by the procedure execution unit 21 in the material return procedure. In FIG. 4, the tab “My Page” is valid only when the user is logged in to the electronic library site 3. In the state where the electronic library site 3 is logged in, the above-described material search procedure T1 is executed, and when the category whose voice recognition result matches is “My Page”, the procedure execution unit 21 of the electronic library application 20 reads “My page”. The display content when the “page” tab is selected is acquired from the electronic library site 3 as a search result, and the processing related to the return procedure T6 shown in FIG. 11 can be executed.

資料返却手続きにおいて，電子図書館アプリケーション２０の手続き実行部２１は，音声入力を開始する動作を受け付ける状態になり，音声入力を開始する動作を検知すると（Ｓ７０），資料返却手続きに対応する音声ガイダンス（例えば，「返却する資料を話してください」）をスピーカ２７から音声出力した後（Ｓ７１），音声認識を指定して特徴量抽出部２２を作動させる（Ｓ７２）。 In the document return procedure, the procedure execution unit 21 of the electronic library application 20 enters a state of accepting an operation for starting voice input, and when detecting the operation for starting voice input (S70), the voice guidance corresponding to the document return procedure ( For example, after “speak the material to be returned” is output from the speaker 27 (S71), the voice recognition is designated and the feature amount extraction unit 22 is operated (S72).

電子図書館アプリケーション２０の特徴量抽出部２２は，ユーザがマイク２４に音声を入力すると，ユーザがマイク２４に入力した音声の特徴量を抽出し，抽出した特徴量を音声認識サーバ４に送信した後，音声認識サーバ４から受信した音声認識結果を手続き実行部２１に引き渡し，手続き実行部２１は，ユーザがマイク２４に入力した音声の音声認識結果を取得する（Ｓ７３）。 When the user inputs voice to the microphone 24, the feature quantity extraction unit 22 of the electronic library application 20 extracts the voice feature quantity input to the microphone 24 by the user and transmits the extracted feature quantity to the voice recognition server 4. The voice recognition result received from the voice recognition server 4 is transferred to the procedure execution unit 21, and the procedure execution unit 21 acquires the voice recognition result of the voice input by the user to the microphone 24 (S73).

電子図書館アプリケーション２０の手続き実行部２１は，ユーザがマイク２４に入力した音声の音声認識結果により処理を分岐させる（Ｓ７４）。特徴量抽出部２２から引き渡された音声認識結果が，資料のタイトルまたは資料の番号でない場合，手続き実行部２１は，手続き実行部２１が実行する手続きを資料検索手続きＴ１に遷移させる。また，特徴量抽出部２２から引き渡された音声認識結果が，資料のタイトルまたは資料の番号のいずれかである場合，手続き実行部２１は，音声認識結果（ここでは，タイトルまたは番号）で特定される資料を選択状態にしてから（Ｓ７５），音声入力を開始する動作を受け付ける状態になり，音声入力を開始する動作を検知すると（Ｓ７６），資料返却手続きに対応する音声ガイダンス（例えば，「選択された資料を返却しますか」）をスピーカ２７から音声出力した後（Ｓ７７），音声認識を指定して特徴量抽出部２２を作動させる（Ｓ７８）。 The procedure execution unit 21 of the electronic library application 20 branches the process according to the voice recognition result of the voice input to the microphone 24 by the user (S74). If the speech recognition result delivered from the feature quantity extraction unit 22 is not a material title or material number, the procedure execution unit 21 transitions the procedure executed by the procedure execution unit 21 to the material search procedure T1. If the speech recognition result delivered from the feature quantity extraction unit 22 is either a material title or a material number, the procedure execution unit 21 is identified by the speech recognition result (here, the title or number). After selecting the material to be selected (S75), it is in a state of accepting the operation for starting voice input, and when the operation for starting voice input is detected (S76), the voice guidance corresponding to the material return procedure (for example, “Select” Is output from the speaker 27 (S77), voice recognition is designated and the feature amount extraction unit 22 is activated (S78).

電子図書館アプリケーション２０の特徴量抽出部２２は，ユーザがマイク２４に音声を入力すると，ユーザがマイク２４に入力した音声の特徴量を抽出し，抽出した特徴量を音声認識サーバ４に送信した後，音声認識サーバ４から受信した音声認識結果を手続き実行部２１に引き渡し，手続き実行部２１は，ユーザがマイク２４に入力した音声の音声認識結果を取得する（Ｓ７９）。 When the user inputs voice to the microphone 24, the feature quantity extraction unit 22 of the electronic library application 20 extracts the voice feature quantity input to the microphone 24 by the user and transmits the extracted feature quantity to the voice recognition server 4. The voice recognition result received from the voice recognition server 4 is transferred to the procedure execution unit 21, and the procedure execution unit 21 acquires the voice recognition result of the voice input by the user to the microphone 24 (S79).

電子図書館アプリケーション２０の手続き実行部２１は，ユーザがマイク２４に入力した音声の音声認識結果により処理を分岐させる（Ｓ８０）。電子図書館アプリケーション２０の手続き実行部２１は，音声認識結果が肯定を示す単語の場合，特徴量抽出部２２から音声認識結果が引き渡されると，資料を返却する電子図書館サービスを呼び出し（Ｓ８１），音声認識結果で特定される資料を電子図書館サイト３へ返却し，資料を返却する電子図書館サービスの実行結果を電子図書館サイト３から受信すると，電子図書館サイト３に返却した資料のタイトルを音声合成サーバ５に送信して，資料のタイトルの合成音声を音声合成サーバ５から取得した後，音声合成サーバ５から取得した合成音声をスピーカ２７から音声出力して（Ｓ８２），手続き実行部２１が実行する手続きを資料検索手続きＴ１に遷移させる。なお，音声認識結果が肯定を示す単語でない場合，手続き実行部２１は，手続き実行部２１が実行する手続きを資料検索手続きＴ１に遷移させる。 The procedure execution unit 21 of the electronic library application 20 branches the process according to the voice recognition result of the voice input to the microphone 24 by the user (S80). When the speech recognition result is an affirmative word, the procedure execution unit 21 of the electronic library application 20 calls the electronic library service that returns the material when the speech recognition result is delivered from the feature amount extraction unit 22 (S81). When the material specified by the recognition result is returned to the electronic library site 3 and the execution result of the electronic library service for returning the material is received from the electronic library site 3, the title of the material returned to the electronic library site 3 is converted into the speech synthesis server 5 And the synthesized speech of the title of the material is acquired from the speech synthesis server 5, and then the synthesized speech acquired from the speech synthesis server 5 is output from the speaker 27 (S82), and the procedure executed by the procedure execution unit 21 Is shifted to the document retrieval procedure T1. If the speech recognition result is not a word indicating affirmation, the procedure execution unit 21 transitions the procedure executed by the procedure execution unit 21 to the material search procedure T1.

１電子図書館システム
２携帯端末
２０電子図書館アプリケーション
２１手続き実行部
２２特徴量抽出部
２３声紋認証部
２４マイク
２５モーションセンサ
２６タッチパネル
２７スピーカ
３電子図書館サイト
４音声認識サーバ
５音声合成サーバ
DESCRIPTION OF SYMBOLS 1 Electronic library system 2 Portable terminal 20 Electronic library application 21 Procedure execution part 22 Feature-value extraction part 23 Voiceprint authentication part 24 Microphone 25 Motion sensor 26 Touch panel 27 Speaker 3 Electronic library site 4 Speech recognition server 5 Speech synthesis server

Claims

Provide various electronic library services that can be used via the network, and upon receiving a call request for an electronic library service, execute the electronic library service that has received the call request, and provide a device that has requested the electronic library service to be called An electronic library site for transmitting the execution result of the electronic library service;
A mobile terminal in which an application program using the electronic library service provided by the electronic library site is installed;
A voice recognition server that receives voice feature values from the portable terminal, performs voice recognition using the voice feature values, and transmits a voice recognition result in a text format obtained by voice recognition to the portable terminal;
An electronic library system comprising at least a voice synthesis server that receives a text from the portable terminal and generates a synthesized voice of the text and transmits the synthesized voice to the portable terminal,
When the application program starts, the portable terminal
A feature that extracts a feature amount of voice input by a user to a microphone, transmits the feature amount of voice to the voice recognition server, and acquires a voice recognition result corresponding to the voice input by the user to the microphone from the voice recognition server. A quantity extraction unit;
When an electronic library operation flow describing the procedure related to the operation of the electronic library site and its execution order is registered, and when the procedure described in the electronic library operation flow is executed, an operation to start voice input is detected , By operating the feature amount extraction unit to acquire the speech recognition result of the voice input by the user to the microphone from the feature amount extraction unit, and then using the speech recognition result acquired from the feature amount extraction unit, The electronic library service corresponding to the procedure at the time is called, and when the execution result of the called electronic library service is received from the electronic library site, it is received from the electronic library site using the speech synthesis server. It has a procedure execution unit that generates synthesized speech corresponding to the execution result and outputs it.
An electronic library system characterized by this.

When the procedure execution unit provided in the portable terminal by starting the application program detects an operation to start voice input, the procedure guidance unit outputs voice guidance related to the procedure at this time, and then extracts the feature amount. The electronic library system according to claim 1, wherein the electronic library system operates.

The portable terminal includes a motion sensor, and the procedure execution unit included in the portable terminal detects that the portable terminal is shaken as an operation of starting voice input when the application program is activated. The electronic library system according to claim 1, wherein the electronic library system is characterized.

Calls the electronic library service to retrieve materials, retrieves materials that match the speech recognition results obtained from the feature extraction unit, and outputs the synthesized speech of the text included in the retrieval results of the material retrieval procedures Then, the electronic library service for selecting the material is called, the material selection procedure for selecting the material corresponding to the speech recognition result acquired from the feature quantity extraction unit, the electronic library service for logging in to the electronic library site, Using the speech recognition result acquired from the feature extraction unit, the login procedure for logging in to the electronic library site, the electronic library service for renting materials are called, and the material selected by the material selection procedure is selected from the electronic library. Lending procedure to borrow from the site, the electronic book providing the contents of the material Calling a service and executing the reproduction procedure of reproducing the text included in the content of the material borrowed from the electronic library site in the lending procedure in order by synthesizing the sound is an operation of borrowing the material from the electronic library site and reading it. The electronic library system according to any one of claims 1 to 3, wherein the procedure and the execution order thereof are described in the electronic library operation flow.

When the material borrowed by the user is searched in the material selection procedure, the electronic library service that returns the material is called, and the material specified by the voice recognition result acquired from the feature amount extraction unit is stored in the electronic library site. 5. The electronic library system according to claim 4, wherein execution of a return procedure to return to the electronic library is described in the electronic library operation flow as the procedure relating to the return operation of the material and the execution order thereof.

In the lending procedure, if the material selected in the material selection procedure cannot be lent, the electronic library service that reserves the material is called, and the reservation procedure for reserving the material selected in the material selection procedure to the electronic library site 6. The electronic library system according to claim 4, wherein execution of the electronic library is described in the electronic library operation flow as the procedure relating to the reservation operation of the material and the execution order thereof.

When the application program is activated, the portable terminal includes a voice print authentication unit that authenticates a voice print of a voice input by a user to the microphone, and the procedure execution unit is configured to perform a voice test of the voice input by the user to the microphone in the login procedure. Calling the electronic library service for logging into the electronic library site only when the voiceprint authentication unit succeeds in voiceprint authentication,
The electronic library system according to any one of claims 4 to 6.