KR940008867B1

KR940008867B1 - Editing method of human speech message using computer

Info

Publication number: KR940008867B1
Application number: KR1019920009210A
Authority: KR
Inventors: 박경원
Original assignee: 삼성전자 주식회사; 정용문
Priority date: 1992-05-28
Filing date: 1992-05-28
Publication date: 1994-09-28
Also published as: KR930023821A

Abstract

accoring to input selection key, reading a voice message data from the specified file in disk and loading the data into memory; displaying the voice message sample on the display device; selecting a part of voice message waveform and deleting the other part of the displayed waveform according to the input selection key; sending a reproducing command to the voice message processing unit according to the reproduction key; and finally selecting an output device and saving the selected waveform to disk.

Description

How to edit voice message using computer

제1도는 본 발명을 수행하기 위한 시스템의 블럭구성도.1 is a block diagram of a system for carrying out the present invention.

제2도는 본 발명에 따른 음성메세지 편집기능의 흐름도.2 is a flowchart of a voice message editing function according to the present invention.

제3도는 본 발명을 수행하기 위한 메모리 맵 구성도.3 is a memory map configuration diagram for carrying out the present invention.

본 발명은 음성메세지(voice message)를 처리하는 방법에 관한 것으로, 특히 음성처리장치와 컴퓨터를 이용하여 녹음된 음성메세지를 편집하는 방법에 관한 것이다.The present invention relates to a method of processing a voice message, and more particularly, to a method of editing a voice message recorded using a voice processing device and a computer.

일반적으로 음성자동응답장치, 음성우편장치등의 음성메세지 처리시스템은 가입자가 안내메세지, 광고메세지등의 음성메세지를 녹음하려면, 전화기를 이용하여 음성메세지 처리시스템에 전화를 걸거나 아나로그 전송선로를 이용하여 녹음하고자 하는 음성메세지를 송신하여 녹음을 하여 왔다.In general, a voice message processing system such as a voice answering device or a voice mail device is used for a subscriber to record a voice message such as an announcement message or an advertisement message by using a telephone to call the voice message processing system or to establish an analog transmission line. It has been recording by sending a voice message to be recorded by using.

한편 음성메세지 처리시스템에서 사용되는 음성메세지의 음성 품질은 해당 시스템의 기능이나 신뢰성 이상으로 중요한 것으로, 일반 사용자들은 해당 시스템의 음성메세지를 듣고 해당 시스템의 품질을 판단하게 된다.On the other hand, the voice quality of the voice message used in the voice message processing system is more important than the function or reliability of the system, and general users hear the voice message of the system and judge the quality of the system.

따라서 상기한 바와 같은 종래의 음성메세지 처리시스템에서는 전화기를 이용하여 음성메세지를 녹음함에 따라 전송로상의 각종 잡음(noise)으로 인하여 음질이 현저하게 저하되며 전화기가 처리할 수 있는 음성 대역의 특성상 모든 음성 대역, 특히 고주파 대역에 대하여 모두 좋은 특성을 갖을 수 없는 문제점이 있었다.Therefore, in the conventional voice message processing system as described above, the voice quality is remarkably degraded due to various noises on the transmission path as the voice message is recorded using the telephone. There was a problem that the band, in particular high frequency band can not have all good characteristics.

또한 반드시 음성메세지 처리시스템이 있어야만 녹음이 가능하게 되는 불편함이 있었다.In addition, it was inconvenient to be able to record only if there is a voice message processing system.

이에따라 상기한 문제점을 해결하기 위한 기술로서 본원 출원인과 동일 출원인에 의해 특허출원된 제92-9191호가 있다. 상기 특허출원 제92-9191호는 별도의 음성메세지 처리시스템을 사용하지 않고서도 컴퓨터를 이용하여 음성메세지를 음질의 저하없이 녹음 및 재생한다.Accordingly, there is a patent application No. 92-9191 filed by the same applicant as the present application as a technique for solving the above problems. The patent application No. 92-9191 records and plays back voice messages without degrading sound quality using a computer without using a separate voice message processing system.

그러나 상기한 종래의 음성메세지 처리시스템 및 상기 특허출원 제92-9191호에 있어서는 녹음된 음성이 이상할 경우 즉, 음성의 크기(size)가 크거나 작을 경우 또는 앞에 무음부분이 길 경우에는 처음부터 다시 녹음을 하여야 하는 문제점이 있었다. 또한 녹음된 음성에 대하여 적합여부도 녹음된 음성을 운용자가 청취하여 판단함으로써 부정확하게 되는 문제점이 있었다.However, in the above-described conventional voice message processing system and the patent application No. 92-9191, when the recorded voice is abnormal, that is, when the size of the voice is large or small or when the silent portion is long, There was a problem that you need to record again. In addition, there was a problem that whether or not conformity to the recorded voice is incorrect by the operator listening to judge the recorded voice.

따라서 본 발명의 목적은 음성처리장치를 구비한 컴퓨터를 이용하여 녹음된 음성메세지를 보고 들으면서 필요한 부분만을 편집하여 음성메세지 처리시스템에 적합한 음성메세지를 만들 수 있는 컴퓨터를 이용한 음성메세지 편집 방법을 제공함에 있다.Accordingly, an object of the present invention is to provide a voice message editing method using a computer capable of making a voice message suitable for a voice message processing system by editing only a necessary part while watching and listening to a voice message recorded using a computer equipped with a voice processing device. have.

이하 본 발명을 첨부된 도면을 참조하여 상세히 설명한다.Hereinafter, the present invention will be described in detail with reference to the accompanying drawings.

제1도는 본 발명을 수행하기 위한 시스템의 블럭구성도로서, 전화기(10)와 오디오장치(12)와 마이크(14)와 스피커(16)와 헤드폰(18)등의 외부의 음성 입출력장치들이 연결되는 음성처리장치(56)가 컴퓨터 본체(58)에 컴퓨터의 시스템버스(103)를 통하여 연결되며, 상기 컴퓨터 본체(58)에는 모니터(52)와 키보드(54)가 연결된다. 또한 상기 컴퓨터 본체(58)에는 음성메세지 처리시스템(60)이 연결될 수도 있다.1 is a block diagram of a system for carrying out the present invention, in which external voice input / output devices such as a telephone 10, an audio device 12, a microphone 14, a speaker 16, and a headphone 18 are connected. The audio processing device 56 is connected to the computer main body 58 through the system bus 103 of the computer, and the monitor 52 and the keyboard 54 are connected to the computer main body 58. In addition, the voice message processing system 60 may be connected to the computer main body 58.

상기 컴퓨터 본체(58)는 시스템버스(103)를 통하여 서로 접속되는 디스크 드라이버(42)와 롬(44)과 램(46)과 제어부(48)와 입출력 인터페이스부(50)와 디스크 드라이버(42)로 구성되는 일반적인 컴퓨터이다.The computer main body 58 includes a disk driver 42, a ROM 44, a RAM 46, a controller 48, an input / output interface 50, and a disk driver 42 connected to each other via a system bus 103. It is a common computer that consists of.

상기 음성처리장치(56)는 아나로그 인터페이스부(20)와 코덱(22)과 데이타변환부(24)와 ADPCM(Adaptive Differential Pulse Code Modulation)부(26)와 DTMF(Dual Tone Multi-Frequency)수신부(28)와 리셋트신호발생부(30)와 제어부(32)와 롬(34)과 램(36)과 공통메모리(38)와 어드레스영역 선택부(40)로 구성된다. 상기 음성 처리장치(56)는 상기 컴퓨터 본체(58)에 내장시킬 수도 있다.The voice processing device 56 includes an analog interface unit 20, a codec 22, a data converter 24, an ADPCM (Adaptive Differential Pulse Code Modulation) unit 26, and a DTMF (Dual Tone Multi-Frequency) receiver. And the reset signal generator 30, the controller 32, the ROM 34, the RAM 36, the common memory 38, and the address area selector 40. The voice processing device 56 may be incorporated in the computer main body 58.

상기 제1도에 대한 구성 및 동작은 전술한 특허출원 제92-9191호에 상세히 게재되어 있으므로 생략한다.The configuration and operation of FIG. 1 are described in detail in the aforementioned patent application No. 92-9191, and thus will be omitted.

제2도는 본 발명에 따른 음성메세지 편집기능의 흐름도로서, 상기 제2도의 흐름도는 제1도의 제어부(48)의 수행 흐름도이다.2 is a flowchart of a voice message editing function according to the present invention, and the flowchart of FIG. 2 is a flowchart of the operation of the controller 48 of FIG.

제3도는 본 발명을 수행하기 위한 메모리 맵 구성도이다.3 is a memory map configuration diagram for carrying out the present invention.

제1도의 디스크 드라이버(42)의 디스크에 녹음된 PCM 또는 ADPCM의 음성데이타의 화일이 저장되어 있는 상태에서 운용자가 키보드(54)의 편집키를 입력시키면, 제어부(48)는 제2도의 (201)단계에서 이를 인식하고 (202)-(203)단계에서 디스크 드라이버(42)를 통하여 디스크에서 운용자가 지정하는 화일의 음성데이타를 독출하여 램(46)에 저장하고 독출한 음성데이타의 파형을 모니터(52)의 화면에 표시한다.When the operator inputs the edit key of the keyboard 54 while the file of the recorded PCM or ADPCM audio data is stored on the disk of the disk driver 42 of FIG. 1, the controller 48 displays (201) in FIG. In step (202) to step (203), the disk driver 42 reads the voice data of the file designated by the operator from the disk, stores it in the RAM 46, and monitors the waveform of the read voice data. It is displayed on the screen of 52.

상기와 같은 상태에서 운용자가 화면에 표시되는 파형을 보고 부분적으로 선택하기 위한 선택키를 입력하면, 제어부(48)는 (204)-(205)단계에서 상기 선택키입력에 따른 파형의 음성데이타만을 남겨두고 나머지 부분은 삭제한다. 다음에 운용자가 상기 선택한 파형의 음성데이타를 재생하여 들어보고자 재생키를 입력하면, 제어부(48)는 (206)-(207)단계에서 공통메모리(38)의 제3도와 같은 송신명령 저장버퍼를 이용하여 음성처리장치(56)에 재생시작명령을 전송하며, (208)단계에서 운용자의 선택에 따라 전화기(10), 스피커(16), 헤드폰(18)중 하나의 출력장치를 선택설정한후, (209)단계에서 상기 선택한 음편파형의 음성데이타를 공통메모리(38)의 제3도와 같이 제1, 제2PCM데이타 저장버퍼와 제1,제2ADPCM데이타 저장버퍼를 통하여 음성처리장치(56)로 전송한다. 그러면 음성처리장치(56)의 제어부(32)는 아날로그 인터페이스부(20)를 제어하여 상기 설정된 하나의 출력장치로 스위칭하도록 한후 코덱(22)과 데이타변환부(24)와 ADPCM부(26)를 제어하여 컴퓨터 본체(58)로 부터 입력되는 음성메세지의 음성신호를 처리하여 상기 설정된 출력장치로 출력하도록 한다. 그리고 상기 제어부(32)는 공통메모리(38)의 상기 음성데이타를 독출하여 상기 음성데이타가 ADPCM데이타일 경우에는 ADPCM부(26)에 1바이트/250㎲의 속도로 써주며 PCM데이타일 경우에는 데이타변환부(24)에 1바이트/125㎲의 속도로 써준다.In the above state, when the operator inputs a selection key for viewing and partially selecting the waveform displayed on the screen, the controller 48 performs only the voice data of the waveform according to the selection key input in steps 204 and 205. Leave the rest and delete the rest. Next, when the operator inputs a playback key for reproducing and listening to the audio data of the selected waveform, the control unit 48 stores the transmission command storage buffer as shown in FIG. 3 of the common memory 38 in steps 206 to 207. Transmits a playback start command to the voice processing device 56 by using the controller. In step 209, the voice data of the selected polarization waveform is transferred to the voice processing device 56 through the first and second PCM data storage buffers and the first and second ADPCM data storage buffers as shown in FIG. 3 of the common memory 38. send. Then, the control unit 32 of the voice processing device 56 controls the analog interface unit 20 to switch to the set output device, and then the codec 22, the data converter 24, and the ADPCM unit 26 are controlled. The control unit processes the audio signal of the voice message input from the computer main body 58 to output to the set output device. The controller 32 reads the voice data of the common memory 38, writes the voice data to the ADPCM unit 26 at a rate of 1 byte / 250 Hz when the voice data is ADPCM data, and in the case of PCM data, the data is read. The converter 24 writes at a rate of 1 byte / 125 Hz.

이에따라 상기 음성데이타가 ADPCM데이타인 경우에는 ADPCM부(26)를 통하여 PCM데이타로 변환되고, 다시 데이타변환부(24)에서 직렬 데이타로 변환된후 코덱(22)에 인가되어 아나로그의 음성신호로 변환된다. 상기 음성데이타가 PCM데이타인 경우에는 바로 데이타변환부(24)에서 직렬 데이타로 변환된후 코덱(22)에 인가되어 아나로그의 음성신호로 변환된다. 상기 아나로그의 음성신호는 아나로그 인터페이스부(20)를 거쳐 상기 설정된 출력장치에 출력된다.Accordingly, when the voice data is ADPCM data, it is converted into PCM data by the ADPCM unit 26, and then converted into serial data by the data converter 24, and then applied to the codec 22 to convert the voice data into an analog signal. Is converted. If the voice data is PCM data, it is immediately converted into serial data by the data converter 24 and then applied to the codec 22 to be converted into an analog voice signal. The audio signal of the analog is output to the set output device via the analog interface unit 20.

상기와 같은 상태에서 제어부(48)는 상기 (203)단계로 진행하여 전술한 과정을 반복한다. 이에따라 음성데이타가 최적으로 조절되게 된다.In the above state, the controller 48 proceeds to step 203 and repeats the above-described process. Accordingly, the voice data is optimally adjusted.

상기 출력되는 음성이 최적으로 조절이 됨에따라 운용자가 기록키를 입력하면, 제어부(48)는 (211)-(212)단계에서 상기 편집된 음성데이타를 디스크 드라이버(42)를 통하여 디스크에 저장한다.When the operator inputs a recording key as the output voice is optimally adjusted, the controller 48 stores the edited voice data on the disc through the disc driver 42 in steps 211 and 212. .

따라서 운용자는 모니터(52)의 화면에 표시되는 파형을 보고 파형을 부분적으로 선택하여 그 부분만을 재생하여 들어본후 선택하는 파형의 부분들을 조절함으로써 최적의 음성을 만들게 되는 것이다.Therefore, the operator looks at the waveform displayed on the screen of the monitor 52, selects the waveform partly, plays back only the part, and adjusts the part of the selected waveform to make the optimum voice.

한편 상기한 바와 같이 화면에 음성데이타의 파형을 표시하는 것을 상세히 설명하면 다음과 같다.On the other hand, the display of the waveform of the voice data on the screen as described above in detail as follows.

첫번째로 디스크로 부터 독출한 PCM 또는 ADPCM데이타를 그대로 이용할 경우에는 그래픽(graphic)화면에서 가로가 640화소(pixel)라고 가정할때, PCM데이타는 1바이트의 음성데이타가 125㎲이므로 640 125㎲=80㎳의 음성을 표시하며 ADPCM데이타는 1바이트의 음성데이타가 250㎲이므로 640 125㎲=160㎳의 음성을 표시한다. 이것을 적당한 비율로 샘플링(sampling)하면 더 많은 분량의 데이타를 표시할 수 있다.First, if PCM or ADPCM data read from the disk is used, assuming that the width is 640 pixels on the graphic screen, PCM data is 125㎲ because Byte 125㎲ = 1 byte audio data. The voice of 80㎳ is displayed and ADPCM data is 250㎲ of 1 byte, so the voice of 640 125㎲ = 160㎳ is displayed. Sampling this at a reasonable rate allows you to display more data.

두번째로 유음 카운터를 이용하는 경우에는 음성데이타를 128바이트단위로 유음 또는 무음을 판단한다. 즉, 128바이트내의 유음갯수를 레벨로 정한다. 그러므로 데이타는 0∼128사이의 값이 되며, 0은 녹음시 입력이 없는 부분이고 128은 모두 데이타가 유효한 경우이다. 이것을 상기한 바와같이 640화소에 표시하면 128 250㎲ 640=20480㎳로 약 20초 분량의 데이타가 된다. 이 데이타로 적당히 샘플링을 하면 수분길이의 음성을 표시하여 편집이 가능하게 된다.Secondly, in the case of using the sound counter, the sound data is judged to be sounded or silent in 128 byte units. That is, the number of noises in 128 bytes is set as the level. Therefore, the data is between 0 and 128, where 0 is no input and 128 are all valid. If this is displayed on the 640 pixel as described above, it is 128 250 ms 640 = 20480 ms, which is about 20 seconds of data. If you sample properly with this data, you will be able to display and edit voices with moisture lengths.

상술한 바와 같이 본 발명은 음성처리장치를 구비한 컴퓨터를 이용하여 녹음된 음성메세지를 보고 들으면서 필요한 부분만을 편집하여 음성메세지 처리시스템에 적합한 음성메세지를 만들 수 있는 잇점이 있다.As described above, the present invention has an advantage of making a voice message suitable for a voice message processing system by editing only a necessary part while watching and listening to a recorded voice message using a computer equipped with a voice processing apparatus.

Claims

Control means for converting a voice signal input from at least one set input device into a digital voice data, converting the voice data into an analog voice signal and outputting at least one set output device to a voice message processing device. And a memory means, a disk drive means, and a key input means for displaying a voice message using a computer, wherein the voice data of a designated file is read out from a disc through the disk drive means in response to an edit key input. Storing in the means; displaying the sound polarization waveform of the read voice data on the display means; partially selecting the displayed waveform according to selection key input and deleting the rest; In response, transmits a playback start command to the voice message processing apparatus to set the playback state. And setting the output device, selecting and setting the output device, and transmitting the voice data of the selected waveform to the voice processing device, and storing the voice data of the selected waveform on the disc through the disk driving means in response to a recording key input. Voice message editing method using a computer, characterized in that consisting of.