JP2006222568A

JP2006222568A - Narration support device, and document editing method and program thereof

Info

Publication number: JP2006222568A
Application number: JP2005032170A
Authority: JP
Inventors: Tamotsu Takada; 保高田
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 2005-02-08
Filing date: 2005-02-08
Publication date: 2006-08-24
Anticipated expiration: 2025-02-08
Also published as: JP4459077B2

Abstract

PROBLEM TO BE SOLVED: To provide a narration support device capable of finely adjusting and confirming reading timing.

SOLUTION: In the narration support device 9, a voice processing program 12, in accordance with a command entered from a control input unit 4, performs corrective editing that adjusts the reading timing of the voice waveform of a voice file 32 in which a reading of the narration document displayed on a display 5 is recorded. A voice/character processing program 13 then uses a voice recognition function to copy, from the corrected voice file 32 to the document file 31 of the narration document that provides the guide display of the reading timing, the display position of the mark M indicating the reading timing shown in the narration document and information on the reading time length.

COPYRIGHT: (C)2006,JPO&NCIPI

Description

The present invention relates to a narration support device used for reading a narration document aloud in television broadcasting, and to a document editing method and program thereof.

In television news programs and the like, an announcer reads a document aloud and must finish within the allotted time. To prepare, the announcer reads through the document in advance to settle the reading speed, but keeping to time during the actual broadcast depends on the announcer's skill. For this reason there are document reading devices (hereinafter referred to as narration support devices) that, much like music karaoke, calculate the reading speed from the number of characters in the document and the available time, and display the document text on screen at a timing matched to that speed (see, for example, Patent Document 1).

However, while this method is sufficient for keeping the reading time of the whole document within a predetermined time, it is not suited to reading with finely adjusted timing, such as reading out the names of program sponsors shown on the television screen in synchronization with the movement of the picture, or explaining and narrating scenery or footage in a news program. In addition, there was no rehearsal function for checking the state of the reading timing and the like, and no correction function.

Patent Document 1: JP 7-67034 A (page 3, FIG. 2)

Conventional document reading devices have the problem that they are not suited to reading with finely adjusted timing, such as reading out the names of program sponsors shown on the television screen in synchronization with the picture, or explaining and narrating scenery or footage in a news program.

The present invention has been made to solve the above problem, and its object is to provide a narration support device, and a document editing method and program thereof, that allow the reading timing to be finely adjusted and also confirmed.

To achieve the above object, the narration support device of the present invention is a narration support device that displays a narration document to be read aloud in a television broadcast, comprising: a database that stores a document file in which information of the narration document is written and a voice file in which a reading of the narration document is recorded; control input means for inputting commands for editing the document file and the voice file; a voice input unit that converts the voice of an announcer reading the narration document into a digital voice signal for the recording; a voice output unit that, in response to a command, reproduces the stored voice file and outputs it as sound; a display that shows the narration document and/or the voice waveform of the voice file on screen together with a time scale for referring to the reading timing of the narration document; a timer; and a control processing unit. The control processing unit includes a text processing program that, in response to an input command, divides the narration document into sentences in units of the syllables or phrases to be read, edits the document so that each sentence and a mark indicating its reading timing are displayed on screen in alignment with the time scale, and stores the result in the database; and a voice processing program that, in response to a command, reads the voice file from the database, edits the reading timing of each sentence and the time length over which each sentence is read by adjusting the voice waveform displayed on the display, and stores the result in the database.

The document editing method of the narration support device of the present invention is a document editing method for a narration support device that displays a narration document to be read aloud in a television broadcast. The narration support device comprises: a database that stores a document file in which information of the narration document is written and a voice file in which a reading of the narration document is recorded; control input means for inputting commands for editing the document file and/or the voice file; a voice input unit that converts the voice of an announcer reading the narration document into a digital voice signal for the recording; a voice output unit that, in response to a command, reproduces the stored voice file and outputs it as sound; a display that shows the narration document and/or the voice waveform of the voice file on screen together with a time scale for referring to the reading timing of the narration document; and a control processing unit. In response to an input command, the control processing unit divides the narration document into sentences in units of the syllables or phrases to be read, edits it into a narration document in which each sentence from the start of the narration and its reading timing are displayed on screen together with marks, and stores it in the database. When a command for editing the voice file is input, the control processing unit reads the voice file from the database with reference to a built-in timer, displays the voice waveform of the voice file on the display, edits the reading timing of each sentence written in the voice file and the time length over which each sentence is read, and stores the result in the database. It then adjusts the display of the reading timing of the narration document shown on screen according to the reading timing of each sentence as edited by adjusting the voice waveform.

Furthermore, the program of the narration support device of the present invention is a program for a narration support device having a database that stores a document file in which information of a narration document is written and a voice file in which a reading of the narration document is recorded. Based on commands from control input means for editing the document file and the voice file, the program divides the narration document into sentences in units of the syllables or phrases to be read, edits it into a narration document in which each sentence from the start of the narration and its reading timing are displayed on screen together with marks, and stores it in the database. When a command for editing the voice file is input, the program reads the voice file from the database, displays the voice waveform of the voice file on the display, edits, by adjusting the displayed voice waveform and referring to a timer, the reading timing of each sentence written in the voice file and the time length over which each sentence is read, and stores the result in the database. It then adjusts the display of the reading timing of the narration document shown on screen according to the reading timing of each sentence as edited by adjusting the voice waveform.

According to the present invention, it is possible to provide a narration support device in which the timing of the reading guide for a narration document can be finely adjusted, and also confirmed, by recording and editing the voice of a preliminary read-through of the narration document.

Embodiments of the present invention will be described below with reference to the drawings.

FIG. 1 is a block diagram showing the functional configuration of a narration support device according to an embodiment of the present invention.

In FIG. 1, the narration support device 9 includes a control processing unit (hereinafter referred to as the CPU) 1, a timer 2, a database 3, a control input unit 4 consisting of a keyboard, a mouse, and the like, a display 5, a voice input unit 6, a voice output unit 7, and a communication IF (interface) 8, all connected to one another by an internal bus or the like. As long as it has the above functional configuration, the narration support device 9 may be an information terminal such as a personal computer or a workstation.

The CPU 1 includes a text processing program 11 and a voice processing program 12 that process the document file 31 and the voice file 32 stored in the database 3, a voice/character processing program 13 that controls synchronization between the document sentences and timing marks shown on the display 5 and the voice waveform of the voice file 32, and a work memory 14.

The timer 2 serves as the reference clock that keeps the document file 31 and the voice file 32 operating in synchronization with respect to the reading timing of the document and the rise of the voice waveform, both described later.

The database 3 stores the document file 31 to be read by the announcer and the voice file 32 in which the announcer's reading of the document is recorded.

Through the control input unit 4, an operator enters the text of the narration document and commands for character display positions, processing and editing of the voice file, and so on. The data and commands are output onto the internal bus, and the CPU 1 then performs the prescribed processing. The CPU 1 writes the text created and edited with the text processing program 11 to the database 3 as the document file 31.

The display 5 shows the narration document to be read by the announcer and the voice waveform of the voice file 32 in which the announcer's reading was recorded. The narration document and the voice file are shown on separate screens, each with a guide time scale that indicates when each sentence is to be read, counted from the start of the narration; in other words, the elapsed time or the time remaining until reading begins. The narration document and the voice waveform may also be shown on the same screen, but the operation of the narration support device is described below for the case where they are shown on separate screens.

The voice input unit 6 A/D-converts the voice input from the microphone 61 and outputs the resulting digital voice signal to the internal bus. Using the voice processing program 12, the CPU 1 takes the digital voice signal from the internal bus, generates the voice file 32, and writes it to the database 3.

In response to a command from the control input unit 4, the voice output unit 7 D/A-converts the voice file 32 read out by the voice processing program 12 of the CPU 1, restores it to sound, and outputs it from the speaker 71.

The communication IF 8 receives a start (cue) signal used to change the character display of the document in synchronization with the broadcast video.

FIG. 2 is an example of the screen of the display 5 showing the narration document to be read by the announcer.

In FIG. 2, the screen shows the document sentences b1 to bn edited by the text processing program 11 and the voice processing program 12, with their reading start timings (hereinafter abbreviated as reading timings (t1 to t3)) indicated by marks M lined up along the time scale C. On the left of the screen are on-screen buttons B for entering recording, playback, and editing commands. The buttons B are operated by pointing and clicking with the mouse, or from the keyboard, of the control input unit 4.

The document sentences b1, b2, and b3 are “This program is brought to you by xyz, aiming for tomorrow,”, “OO Company, and”, and “the sponsors shown on screen”, and they are read at the reading timings (t1, t2, t3), respectively. In step with the reading speed (reading timing), the character color changes moment by moment, for example from “blue”, indicating text not yet read, to “orange”, indicating text already read.

The time scale C is a time ruler that gives the announcer advance notice, on screen, of the time until reading of each displayed document sentence begins. In FIG. 2, as time elapses from the narration start time, the color of the bar cb changes, for example from right to left, from “blue” (not yet read) to “orange” (already read). The order in which the document sentences are to be read is shown along this bar by the marks M “△1” to “△3”; for example, document sentence b1, “This program is brought to you by xyz, aiming for tomorrow,”, is placed below the “△1” position.

These changes in the character color and in the bar cb are carried out by the text processing program 11 of the CPU 1 referring to the timer 2.
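The timer-driven colouring described above can be sketched as follows. This is a minimal illustration, not the patented implementation: the function names, the use of milliseconds, and the assumption that characters within a sentence are recoloured at a uniform rate over a known sentence duration are all hypothetical.

```python
def chars_read(text, start_ms, duration_ms, elapsed_ms):
    """Number of leading characters that would be shown as "already read".

    Assumes (hypothetically) that the characters of one sentence are
    recoloured at a uniform rate between start_ms and start_ms + duration_ms.
    """
    if elapsed_ms <= start_ms:
        return 0
    if elapsed_ms >= start_ms + duration_ms:
        return len(text)
    return (elapsed_ms - start_ms) * len(text) // duration_ms


def split_by_colour(text, start_ms, duration_ms, elapsed_ms):
    """Split a sentence into its (read, unread) parts, e.g. orange and blue."""
    n = chars_read(text, start_ms, duration_ms, elapsed_ms)
    return text[:n], text[n:]


def bar_fraction(elapsed_ms, total_ms):
    """Fraction of the bar cb that has changed colour, clamped to [0, 1]."""
    if total_ms <= 0:
        raise ValueError("total_ms must be positive")
    return min(1.0, max(0.0, elapsed_ms / total_ms))
```

A display loop would call these with the current timer value on every refresh; the drawing direction of the bar is left to the rendering code.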

The display screen in FIG. 2 may also show, on the display 5, a clock section CL that presents the elapsed time from the start of reading as digits, clock hands, or the like, and a cue lamp Q that lights when a reading start timing arrives.

The announcer reads the document sentences b1 to bn aloud while watching the display 5. If, at that time, the on-screen “REC” button is pressed with the mouse of the control input unit 4, or “REC” is entered from the keyboard, as the record command, the reading voice is written to the database 3 as the voice file 32.

The voice processing program 12 can read out the recorded voice file 32 and edit and correct the utterance timing and the like. The voice/character processing program 13 uses its voice recognition function to synchronize the utterance start timing in the voice waveform of the voice file 32 with the reading-start mark M displayed in the narration document, so that the two coincide.

FIG. 3 is an example of the voice waveform shown on the display 5 when the voice file 32 is edited by the voice processing program 12.

In FIG. 3, the waveform editing screen shows the voice waveforms w1 to w3, which correspond to the document sentences b1 to b3, respectively. A voice waveform is the vibration waveform produced during utterance, and appears as groups of syllables, one group per sentence. A time scale TS is displayed above the voice waveforms. The time scale TS is synchronized with the time scale C of FIG. 2, but carries finer time graduations than the time scale C so that the timing of the voice waveforms can be adjusted precisely.

The waveform editing screen also has edit buttons E, which are used to edit the utterance timing and utterance duration of the voice waveforms as described later.

The voice processing program 12 has a timing setting function that, for example, lengthens or shortens the gap between the voice waveforms w1 and w2, in other words moves the utterance start timing, and a time compander function that lengthens or shortens the time over which the voice is uttered.

With a conventional recorder, if the playback time differed from the recording time, the voice pitch (how high or low the voice frequency is) changed in proportion to the speed. The time compander function instead compares the playback time length with the recording time length, in other words the playback speed with the recording speed, and uses a coded-speech correction technique that thins out voice information when the utterance is to become shorter and inserts voice information when it is to become longer. This extends or shortens the utterance time without changing the voice pitch, producing the same effect as changing the reading time.
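The idea of thinning out or inserting voice information without changing the pitch can be illustrated with a deliberately naive sketch. The source does not disclose the actual algorithm; the frame-based approach, the function name `stretch_frames`, and the `frame_len` parameter below are assumptions for illustration only. Practical systems typically cross-fade overlapping frames to avoid audible discontinuities at frame boundaries.

```python
def stretch_frames(samples, ratio, frame_len=4):
    """Re-time a sample list to roughly len(samples) * ratio samples.

    Whole frames are dropped (ratio < 1) or repeated (ratio > 1). Samples
    inside each frame keep their original spacing, so the local pitch is
    unchanged; only the overall duration changes.
    """
    if ratio <= 0 or frame_len <= 0:
        raise ValueError("ratio and frame_len must be positive")
    frames = [samples[i:i + frame_len] for i in range(0, len(samples), frame_len)]
    if not frames:
        return []
    n_out = max(1, round(len(frames) * ratio))
    out = []
    for k in range(n_out):
        # Map each output frame back to the nearest earlier input frame.
        src = min(len(frames) - 1, int(k / ratio))
        out.extend(frames[src])
    return out
```

With a ratio of 1.1 one frame in ten is repeated, and with a ratio of 0.5 every other frame is dropped; a ratio of 1.0 returns the input unchanged.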

Accordingly, the narration support device 9 according to the embodiment of the present invention records a document once as read by the announcer and edits the resulting voice file, thereby producing a model voice file that is equivalent to a reading in which the reading timing and reading time length have been corrected. The corrected file can also be used as the document reading voice itself.

The voice/character processing program 13 can also detect, by its voice recognition function, the start timings of the voice waveforms set in the voice file 32, and update the document file 31 so that the marks M indicating the reading timings of the narration document are corrected accordingly.

FIG. 4 is a flowchart explaining the operating procedure of the narration support device 9 in the embodiment of the present invention.

The processing performed by each component of the narration support device 9, and the operating procedure, are described below with reference to FIGS. 1 to 4.

In FIG. 1, the operator (document author) enters text from the control input unit 4 to create a draft of the narration document and writes it to the database 3 as the document file 31, named, for example, “Advertisement 1-L” (step s1 in FIG. 4).

This document contains the document sentences b1 to bn, divided at each phrase to be read. The reading timing of each of the document sentences b1 to bn may be set in advance by the document author, but the document creation procedure is described here for the case where the document is completed by recording the announcer's reading, editing the resulting voice file, and setting the voice and timing from it.

The announcer operates the control input unit 4 to read the document file 31 named “Advertisement 1-L” from the database 3 and show it on the display 5 (step s2).

FIG. 5 is an example of the screen when the document file 31 “Advertisement 1-L”, before correction, is shown on the display 5.

In FIG. 5, the bar cb of the time scale C above the document sentences b1 and b2 shows the numbers “△1” to “△3”, which indicate the reading order and reading timings set by the text processing program 11. When the video starts, the announcer enters the recording start command “REC” from the control input unit 4 (step s3) and begins recording a preliminary read-through of the document. Once recording has started, the voice reading the document is input from the microphone 61 to the voice input unit 6 (step s4).

The voice input unit 6 digitizes the voice input from the microphone 61 and outputs it to the internal bus (step s5), and the voice processing program 12 of the CPU 1 writes the digital voice sequentially to the voice file 32 in the database 3 (step s6). When the announcer has read to the end of the text and enters a recording end command, for example “END”, from the control input unit 4 (step s7), recording ends, and the voice file 32 is given a name, for example “Advertisement 1-V”, and saved in the database 3 (step s8).

Next, when the announcer enters a playback command, for example “PLY”, from the control input unit 4 in order to listen to the result of the reading (step s9), the CPU 1 reads the voice file 32 “Advertisement 1-V” from the database 3 and passes it to the voice output unit 7, and the voice reproduced by the voice output unit 7 is output from the speaker 71 (step s10).

Suppose that, after listening, the announcer judges that the reading needs correcting: the gap between document sentence b1 and document sentence b2 should be lengthened, and the reading time of document sentence b2, “OO Company”, should also be lengthened (Yes in step s11).

When the announcer enters the correction edit command “EDT” for the voice file 32 from the control input unit 4 in order to correct the timing (step s12), the voice waveform of the voice file 32 named “Advertisement 1-V” is shown on the display 5 (step s13).

FIG. 6 is the screen when the voice waveform of the voice file 32 “Advertisement 1-V”, before correction, is shown on the display 5.

In FIG. 6, the recording start timing is indicated by the mark “△0”, followed by the voice waveforms w1 to w3 of the readings of document sentences b1 to b3. The voice processing program 12, for example, observes the envelope of the voice waveform, detects the timing at which it rises, and outputs a detection signal to the internal bus.
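Detecting the rise of the waveform envelope can be sketched as below. The source only says that the envelope is observed and its rise detected; the moving-average envelope, the threshold, the `min_silence` rule, and all names here are assumptions chosen for illustration.

```python
def detect_onsets(samples, sample_rate, window, threshold, min_silence):
    """Return the times (in seconds) at which the envelope rises.

    The envelope is a moving average of |sample| over `window` samples.
    An onset is reported when the envelope reaches `threshold` after at
    least `min_silence` consecutive samples below it.
    """
    onsets = []
    silent_run = min_silence  # treat the start of the recording as silence
    acc = 0.0
    for i, x in enumerate(samples):
        acc += abs(x)
        if i >= window:
            acc -= abs(samples[i - window])
        envelope = acc / min(i + 1, window)
        if envelope >= threshold:
            if silent_run >= min_silence:
                onsets.append(i / sample_rate)
            silent_run = 0
        else:
            silent_run += 1
    return onsets
```

Each returned time would then be looked up against the timer and assigned to the corresponding mark M; choosing `min_silence` longer than the pauses between syllables keeps one onset per sentence rather than one per syllable.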

On receiving this detection signal, the voice/character processing program 13 refers to the timer 2 and reads the reading timing (t1) at the rising portion of the first voice waveform. Because the rise of the envelope corresponds to the start of reading of a document sentence, the program checks it against the document file 31 “Advertisement 1-L” and assigns the “△1” mark M, the reading start timing of document sentence b1, to the position of the reading timing (t1) on the time scale TS and on the bar cb of FIG. 5.

The same processing is performed for the other document sentences, so that the screen of FIG. 6 shows the marks “△1” to “△3” in correspondence with the voice waveforms w1 to w3.

Checked against the time scale TS, the reading timings (t1 to t3) of document sentences b1 to b3 are shown to be 0.7 seconds, 2.5 seconds, and 4.7 seconds after the start of recording, respectively. It can also be read from the waveform display that, for example, the reading time of document sentence b2 is about 0.8 seconds.

In FIG. 6, the “△0” mark M is shown at the recording start timing, and the reading timing (t1) of the first document sentence b1 is at the “△1” mark M. This first reading timing can also be adjusted so that it is synchronized with the cue signal for the start of the video. In that case, the video start cue is input to the narration support device 9 through the communication IF 8, and, with the “△0” mark M as the reference, the “△1” mark M for the reading start timing (t1) of document sentence b1 is adjusted before the reading timing (t2) of document sentence b2 is set.

For example, to begin reading 2 seconds after the video starts, dragging the “△1” mark M to the 2-second position on the time scale TS with the mouse of the control input unit 4 shifts every mark M and the whole set of voice waveforms to the left, that is, delays them by 1.3 seconds. The reading timing (t2) of document sentence b2 then becomes 2.5 + 1.3 = 3.8 seconds, and the reading timing (t3) of document sentence b3 becomes 4.7 + 1.3 = 6.0 seconds.
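The arithmetic of this whole-recording shift can be checked with a short sketch. The function name `shift_all` and the dictionary layout are hypothetical; times are held as integer milliseconds to avoid floating-point rounding.

```python
def shift_all(marks_ms, anchor, new_anchor_ms):
    """Move one mark to new_anchor_ms and shift every other mark equally."""
    offset = new_anchor_ms - marks_ms[anchor]
    return {name: t + offset for name, t in marks_ms.items()}


# Reading timings from the example: 0.7 s, 2.5 s and 4.7 s after recording start.
marks = {"t1": 700, "t2": 2500, "t3": 4700}

# Dragging the first mark to the 2-second position delays everything by 1.3 s.
shifted = shift_all(marks, "t1", 2000)
# shifted == {"t1": 2000, "t2": 3800, "t3": 6000}
```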

Next, to lengthen the gap between document sentences b1 and b2, the announcer operates the control input unit 4, places the pointer on the “△2” mark M, and drags it toward the left of the screen while referring to the time scale TS, delaying the utterance timing by 200 milliseconds (step s14).

Because the voice/character processing program 13 stores each mark M in association with the reading start position of the corresponding document sentence b1 to b3, the rising position of the recorded voice waveform, that is, the reading timing (t1), moves together with the mark M.

Alternatively, a command such as “D2” may be entered to delay the reading timing. Here the “D” in “D2” means delay, and the “2” is the number of unit delay steps, where one step corresponds to a delay of 100 ms. This processing delays the reading timing (t2) of “OO Company” in the document sentence b2 by two steps, that is, 200 ms. (Conversely, if the blank time is to be shortened, an advance command such as “A2” is entered.) As a result, the reading timing (t2) of the document sentence b2 becomes 4.0 seconds, while the reading timing (t3) of the document sentence b3 remains at 6.0 seconds.
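As a rough illustration of how such a command could be interpreted, the sketch below parses a “D<n>” or “A<n>” string and moves only the selected mark. The function name `apply_timing_command`, the ASCII spelling of the commands, and the dictionary layout are assumptions made for this example. Only the letter meanings and the 100 ms step size come from the text.

```python
import re

STEP_MS = 100  # one unit step corresponds to 100 ms, as stated in the text


def apply_timing_command(marks_ms, mark, command):
    """Apply a 'D<n>' (delay) or 'A<n>' (advance) command to one mark.

    Only the selected mark moves; all other marks keep their stored times.
    """
    match = re.fullmatch(r"([DA])(\d+)", command)
    if match is None:
        raise ValueError(f"unrecognized timing command: {command!r}")
    sign = 1 if match.group(1) == "D" else -1
    updated = dict(marks_ms)
    updated[mark] += sign * int(match.group(2)) * STEP_MS
    return updated


marks = {"t1": 2000, "t2": 3800, "t3": 6000}
print(apply_timing_command(marks, "t2", "D2"))
# {'t1': 2000, 't2': 4000, 't3': 6000}
```

Unlike the lead-time shift, this edit is local: t3 stays at 6000 ms, which matches the behavior described for the document sentence b3.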

The voice/character processing program 13 also moves the marks M “Δ1” and “Δ3” to the rising positions of the audio waveforms w1 and w3, so that the reading timings (t1) and (t3) coincide with the rise of each waveform (step s14-1).
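The text does not say how the rising position of a waveform is found. One simple possibility, shown here purely as an assumption, is an amplitude threshold: an onset is reported wherever the signal first exceeds the threshold after a sufficiently long quiet stretch. The function name `find_onsets` and its parameters are hypothetical.

```python
def find_onsets(samples, sample_rate, threshold=0.1, min_gap_s=0.3):
    """Return the times (in seconds) at which the waveform rises.

    A rise is a sample whose absolute amplitude reaches `threshold`
    after at least `min_gap_s` seconds of samples below the threshold.
    """
    min_gap = round(min_gap_s * sample_rate)
    quiet_run = min_gap  # treat the start of the recording as preceded by silence
    onsets = []
    for index, sample in enumerate(samples):
        if abs(sample) >= threshold:
            if quiet_run >= min_gap:
                onsets.append(index / sample_rate)
            quiet_run = 0
        else:
            quiet_run += 1
    return onsets


# Toy signal at 10 samples per second: silence, speech, silence, speech.
signal = [0.0] * 5 + [0.5] * 5 + [0.0] * 5 + [0.5] * 5
print(find_onsets(signal, sample_rate=10))  # [0.5, 1.5]
```

Each returned time would be the position to which the corresponding mark M is moved. Real speech would need a more robust detector, so this is only a sketch of the idea.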

Next, to lengthen the reading time length, the announcer drags, for example, over the audio waveform w2 of the portion of the document sentence b2 displayed on the display 5 in which “OO Company” is read out (step s15), and indicates its end point to set the extension section. The announcer then clicks “EXP” of the edit button E on the screen once as the extension command (step s16). Each click is set to lengthen the time by, for example, 10%. (Conversely, to shorten it, “COM” of the edit button E on the screen is clicked once.)

As another way of lengthening the reading time length, a command such as “E10” may be entered from the control input unit 4. The “E” in “E10” means extension of the reading time length, and the “10” means that the reading time length is lengthened by 10%. As a result, the voice processing program 12 of the CPU 1 extends the reading time length to 0.9 seconds by inserting correction bits into the audio signal of the portion in which “xyz aiming for tomorrow” is read out (step s17).
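The effect of an “E<percent>” command on the reading time length can be sketched as follows. The function name `stretched_length_ms` is hypothetical, and the 1000 ms input is an arbitrary illustration, since the text gives only the resulting length of 0.9 seconds and not the length before the extension. The sketch computes the target length only; it does not perform the actual insertion of correction bits into the audio signal.

```python
def stretched_length_ms(length_ms, command):
    """Return the reading time length after an 'E<percent>' command.

    'E10' lengthens the selected section by 10 percent. Integer
    arithmetic is used, so the result is rounded down to a whole millisecond.
    """
    if not command.startswith("E") or not command[1:].isdigit():
        raise ValueError(f"unrecognized extension command: {command!r}")
    percent = int(command[1:])
    return length_ms * (100 + percent) // 100


print(stretched_length_ms(1000, "E10"))  # 1100
```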

The announcer then enters the “END” command from the control input unit 4 to finish editing, and the CPU 1 overwrites the audio file 32 “advertisement 1-V” with the corrected content and ends editing of the audio file for the time being (step s18).

When the “END” command is entered, the narration document is displayed on the display 5 again. The voice/character processing program 13 of the CPU 1 reads, from the overwritten audio file 32, the corrected timing (t1) of the mark M “Δ1” against the time scale TS. It then writes the time of the corrected reading timing (t1) into the document file 31 “advertisement 1-L” and places the mark M “Δ1” for the cue timing alongside it on the bar cb. It also reads the reading time length of the document sentence b2 from the audio waveform w1 and writes it into the document file 31 “advertisement 1-L” as information attached to the document sentence b2.

Subsequently, for the reading timings (t2) and (t3) as well, the time information and the marks M “Δ2” and “Δ3” are corrected and the reading time lengths are written in the same way, so that the corrections made to the audio file 32 are copied (step s19). The document file 31 “advertisement 1-L” in the database 3 is then also overwritten (step s20). As a result, the marks M are displayed on the screen of the display 5 as shown in FIG. 2.
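The copy from the audio file to the document file in steps s19 and s20 might be modeled as below. The `Sentence` record, its field names, and the function `copy_timings` are hypothetical, since the text does not describe the file formats. The start times are those of the worked example, while the lengths are placeholder values.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class Sentence:
    text: str
    start_ms: int   # reading timing, i.e. the position of the mark M
    length_ms: int  # reading time length


def copy_timings(document, audio_segments):
    """Return the document sentences updated with the audio file's timings.

    audio_segments holds one (start_ms, length_ms) pair per sentence,
    in the same order as the document sentences.
    """
    if len(document) != len(audio_segments):
        raise ValueError("document and audio file have different sentence counts")
    return [
        replace(sentence, start_ms=start, length_ms=length)
        for sentence, (start, length) in zip(document, audio_segments)
    ]


document = [Sentence("b1", 700, 1000), Sentence("b2", 2500, 800), Sentence("b3", 4700, 1000)]
edited_audio = [(2000, 1000), (4000, 900), (6000, 1000)]  # lengths are placeholders
for sentence in copy_timings(document, edited_audio):
    print(sentence.text, sentence.start_ms, sentence.length_ms)
```

Returning new records rather than mutating the old ones keeps the stored document untouched until it is explicitly overwritten, mirroring the separate overwrite in step s20.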

When the announcer enters the “PLY” command again to check the corrections, the audio output unit 7 outputs the audio of the overwritten audio file “advertisement 1-V” from the speaker 71. The sentence processing program 11 checks the edited document file 31 against the timer 2 and, when playback of the recording starts, begins controlling the bar cb and the character color. On the screen of the display 5 in FIG. 2, the color of the bar cb and of the characters of the document sentences that have already been read out changes as time passes from the start of audio output.
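How far the character color has advanced at a given moment could be computed as in the following sketch. It assumes that the color moves through a sentence at a constant rate over its reading time length, which the text does not state. The function name `chars_read` is hypothetical.

```python
def chars_read(text, start_ms, length_ms, now_ms):
    """Return how many leading characters of `text` should be recolored.

    Assumes the highlight advances linearly from `start_ms` to
    `start_ms + length_ms` on the timer.
    """
    if now_ms <= start_ms:
        return 0
    if now_ms >= start_ms + length_ms:
        return len(text)
    return len(text) * (now_ms - start_ms) // length_ms


sentence = "OO Company"
print(chars_read(sentence, 4000, 1000, 4500))  # 5
```

Calling this function on every timer tick for each document sentence gives the portion to draw in the changed color. The same ratio could drive the color change of the bar cb.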

Entering two commands together, for example “PLY” and “EDT”, may cause the document display screen of FIG. 5 and the audio waveform display screen of FIG. 6 to be displayed one above the other on the same screen. When the audio file 32 is read out and the recording is played back, the sentence processing program 11, the voice processing program 12, or the voice/character processing program 13 may refer to the timer 2 and indicate, on the bar cb and the time scale, the time elapsed since the start of playback (or since the cue). The elapsed time is shown by the color change described above or by a moving vertical line. It may also be shown with numerals or with a display resembling clock hands.

If the announcer wishes to change the reading timing or the reading time length of the document sentence b2 further, step s14, or step s15 and the subsequent steps, are repeated to make the correction again. Even in this case, the reading timing of “OO Company” in the document sentence b3 stays at the value stored before the re-correction unless a correction command for the document sentence b3 is entered. The voice/character processing program 13 performs the correction so that the re-correction is confined to the portion relating to the document sentence b2.

If the position of “Δ1” is written and stored in the document file “advertisement 1-L” in advance as the setting of the reading timing (t1), the lead time from the start of the video to the start of reading is stored from the outset, and recording can be carried out with that lead time in place.

If necessary, the announcer can follow the same procedure for the document sentences b1 and b3 as for the document sentence b2, whereupon the processing programs edit the document file 31 “advertisement 1-L” and the audio file 32 “advertisement 1-V”.

Furthermore, when the final narration document is displayed and read out, the reproduced audio of the audio file 32 may be output from the speaker 71 at low volume as guidance audio. Because this audio is output in synchronization with the reading timing of the narration document shown on the display, the guidance effect can be enhanced. The finally edited audio file 32 “advertisement 1-V” may also be used as it is as audio material for broadcasting.

In this case, the narration is started either by a command such as “PLY” entered from the control input unit 4 or by the video start cue signal from the communication IF 8, and the reproduced audio is output from the speaker 71 in step with the start command. Instead of being output from the speaker 71, the reproduced audio may be output as audio material for broadcasting to an external device (not shown), either as a digital audio signal through the communication IF 8 or as an audio signal.

In Embodiment 2, the narration support device 9 has the configuration of Embodiment 1 shown in FIG. 1 with the voice/character processing program 13 omitted.

Accordingly, in Embodiment 2 the document file 31 and the audio file 32 are created in the same way as in Embodiment 1, but because the voice/character processing program 13 is omitted, the audio waveforms and the marks M are no longer associated automatically. As a result, the work of steps s14-1, s19, and s20 in FIG. 4 must be done by the operator or the announcer, who operates the control input unit 4 to enter commands and set the necessary parameters in the sentence processing program 11. This work consists of matching each audio waveform in FIG. 6 with its mark M, reading the display positions of the marks M from the audio waveform screen, and resetting the marks M in line with the time scale C of the narration document. However, the editing of the audio file by the voice processing program 12 and the reproduced audio are the same as in Embodiment 1, so the audio of the corrected audio file can be used as a model or the like, as in Embodiment 1.

As described above, the present invention provides a narration support device in which the audio of a read-through of the narration document is recorded and edited, so that the timing of the reading guide for the narration document can be both finely adjusted and checked.

FIG. 1 is a block diagram showing the functional configuration of the narration support device according to Embodiment 1 of the present invention.
FIG. 2 is a screen display example of the display showing a narration document.
FIG. 3 is a screen display example of the audio waveform of a narration document that has been read out.
FIG. 4 is a flowchart explaining the operation procedure of the narration support device in Embodiment 1 of the present invention.
FIG. 5 is a screen display example of a document before the reading timing is corrected.
FIG. 6 is a screen display example of the audio waveform of a document read through by the announcer.

Explanation of symbols

1 Control processing unit (CPU)
11 Sentence processing program
12 Voice processing program
13 Voice/character processing program
14 Work memory
2 Timer
3 Database storage unit
31 Document file
32 Audio file
4 Control input unit
5 Display
6 Audio input unit
7 Audio output unit
8 Communication IF (interface)
9 Narration support device
C, TS Time scale
M Mark

Claims

In a narration support device that displays a narration manuscript read out by a television broadcast,
A database for storing a manuscript file in which information of a narration manuscript is written, and a sound file in which a sound of reading out the narration manuscript is recorded;
Control input means for inputting a command for editing the original file and the audio file;
An audio input unit for converting an audio from which the announcer reads the narration document into a digital audio signal of the recorded audio;
An audio output unit that reproduces and outputs the stored audio file as audio by the command;
A display for displaying the voice waveform of the narration document or the voice file together with a time scale for referring to the reading timing of the narration document;
A timer; a sentence processing program that, according to the input command, divides the narration document to be read out into sentences by syllable or phrase unit, edits each sentence and a mark indicating the reading timing of each sentence for screen display in alignment with the time scale, and stores them in the database; and a speech processing program that reads each voice file from the database according to the command and, by adjusting the voice waveform displayed on the display according to the command, edits the reading timing of each sentence and the time length for which each sentence is read out and stores them in the database; the narration support apparatus being characterized by comprising the above.

The narration support apparatus according to claim 1, wherein each of the programs refers to the timer to display a lapse of time from the start of the narration on the display.

3. The time lapse is displayed by at least one of a color change of a bar displayed on the time scale and a color change of a character read out in each sentence. Narration support device.

In a narration support device that displays a narration manuscript read out by a television broadcast,
A database for storing a manuscript file in which information of a narration manuscript is written, and a sound file in which a sound of reading out the narration manuscript is recorded;
Control input means for inputting a command for editing the original file and the audio file;
An audio input unit for converting an audio from which the announcer reads the narration document into a digital audio signal of the recorded audio;
An audio output unit that reproduces and outputs the stored audio file as audio by the command;
A display for displaying the voice waveform of the narration document or the voice file together with a time scale for referring to the reading timing of the narration document;
A control unit having: a timer; a sentence processing program that, according to the input command, divides the narration document to be read out into sentences by syllable or phrase unit, edits each sentence and a mark indicating the reading timing of each sentence for screen display in alignment with the time scale, and stores them in the database; a speech processing program that reads each voice file from the database according to the command and, by adjusting the voice waveform displayed on the display according to the command, edits the reading timing of each sentence and the time length for which each sentence is read out and stores them in the database; and a voice/character processing program that performs processing to match the marks indicating the reading timing of each sentence displayed on the screen to the reading timing of each sentence edited by adjusting the speech waveform; the narration support apparatus being characterized by comprising the above.

The sentence processing program and the speech processing program refer to the timer, display the passage of time from the start of narration in accordance with the time scale, and display the speech waveform on the screen for displaying the speech waveform. Corresponding to each mark indicating the same, the marks are displayed on the screen in the same manner,
The voice / character processing program reads the reading timing of each sentence by voice recognition that detects the rising of the voice waveform from the voice file, and the reading timing is read with reference to the timer. The narration support apparatus according to claim 4, wherein each mark is displayed on the screen in the order at a time corresponding to a reading-out timing.

6. The narration support apparatus according to claim 5, wherein each of the programs refers to the timer to display a lapse of time from the start of the narration on the display.

The time elapsed is displayed by at least one of a color change of a bar displayed on the time scale and a color change of a character read out in each sentence. Narration support device.

When a command for starting narration is input from the control input unit, the audio output unit reads out the edited and stored audio file from the database, and generates reproduced audio as a guide audio for reading out the narration document. The narration support apparatus according to claim 1, wherein the narration support apparatus outputs a broadcast sound material in place of an announcer's read-out sound.

The narration support device further includes communication interface means,
8. The narration support apparatus according to claim 6, wherein a cue signal of a signal for starting a video corresponding to narration is input as a command for starting the narration through the communication interface means.

In a method for editing a narration support device for displaying a narration manuscript read out by a television broadcast,
The narration support device edits the original file or the audio file, a database that stores a manuscript file in which information of the narration manuscript is written, an audio file in which a voice that reads out the narration manuscript is recorded, and Control input means for inputting a command, a voice input unit for converting a voice that the announcer reads the narration document into a digital voice signal of the recorded voice, and playing back the voice file stored by the command as voice An audio output unit for outputting, a display for displaying the audio waveform of the narration document or the audio file together with a time scale for referring to the reading timing of the narration document, and a control processing unit,
The control processing unit
The narration manuscript to be read out by the input command is changed to a syllable to be read out or a sentence for each unit of the syllable, and each sentence from the start of the narration and each reading timing are edited into the narration manuscript displayed on the screen together with marks. Stored in the database
When a command for editing the voice file is input, the voice file is read from the database by referring to a built-in timer, and the voice waveform of the voice file is displayed on the display and written to the voice file. The reading timing of each sentence and the time length of each sentence read out are edited and stored in the database, and the narration displayed on the screen according to the reading timing of each sentence edited by adjusting the speech waveform A document editing method for a narration support apparatus, characterized in that a process for adjusting a display of a document reading timing is performed.

The control processing unit
With reference to the timer, the passage of time from the start of narration is displayed according to the time scale, and the same mark corresponding to each mark indicating the reading timing of each sentence on the screen displaying the speech waveform Are sequentially displayed on the screen, and the reading timing of each sentence is read by voice recognition that detects the rising of the voice waveform from the voice file, and each mark is set to the time at a time corresponding to the read reading timing. The narration support apparatus document editing method according to claim 10, wherein the screen is displayed in accordance with a scale.

12. The time lapse is displayed by at least one of a color change of a bar displayed on the time scale and a color change of a character read out in each sentence. Manuscript editing method of narration support device.

From a database that stores a document file in which information of a narration document is written, a sound file in which a voice that reads out the narration document is recorded, the document file, and a control input means for editing the sound file Based on the command
A syllable to read the narration manuscript, or a sentence per phrase unit, and edit each narration manuscript from the start of the narration and the respective reading timing to the narration manuscript displayed on the screen together with marks, and store in the database
When a command for editing the audio file is input, the audio file is read from the database, the audio waveform of the audio file is displayed on the display, and the displayed audio waveform is adjusted to the audio file. The read-out timing of each written sentence and the length of time during which each sentence is read out are edited by referring to a timer and stored in the database, and the read-out of each sentence edited by adjusting the speech waveform A program for a narration support apparatus that performs processing for adjusting display of a reading timing of a narration document displayed on the screen according to timing.

The program is
With reference to the timer, the passage of time from the start of narration is displayed according to the time scale, and the same mark corresponding to each mark indicating the reading timing of each sentence on the screen displaying the speech waveform Are sequentially displayed on the screen, and the reading timing of each sentence is read by voice recognition that detects the rising of the voice waveform from the voice file, and each mark is set to the time at a time corresponding to the read reading timing. 14. The program for a narration support apparatus according to claim 13, wherein the narration support apparatus displays the screen in accordance with a scale.

The program refers to the timer according to at least one of a color change of a bar displayed on the time scale and a color change of a character to be read out of each sentence. The narration support apparatus program according to claim 14, wherein the narration support apparatus program is displayed.

The narration support apparatus program according to claim 13, wherein the program edits the length of time to be read out without changing the pitch of the voice.