KR930010781A

KR930010781A - Document reading system

Info

Publication number: KR930010781A
Application number: KR1019910020922A
Authority: KR
Inventors: 곽동후
Original assignee: 정용문; 삼성전자 주식회사
Priority date: 1991-11-22
Filing date: 1991-11-22
Publication date: 1993-06-23

Abstract

원고 내용을 읽어 문자정보를 인식한후 인식한 문자정보를 음성합성하는 문서낭독 시스템을 제공한다. 이를 위하여 원고의 이미지 정보를 픽셀 단위의 이진화의 화상데이타로 변환한다. 그리고 이런 화상데이타를 수신하여 1문자열 단위로 추출하고, 추출한 1문자열의 화상상태를 분석하여 각 문자들을 인식한다. 이후 문자인식 데이타를 수신하여 실시간으로 수신한 문자코드에 대한 음소데이타 베이스를 결합한후 음의 크기, 피치, 유무성음정보, 성도 계수등의 음의 특성을 구하고, 이를 바탕으로 음을 합성하여 출력한다.The present invention provides a document reading system that reads the text and recognizes the text information, and then synthesizes the recognized text information by voice. To this end, image information of the original is converted into image data of binarization in units of pixels. Then, the image data is received and extracted in units of one string, and each character is recognized by analyzing the image state of the extracted one string. After receiving the character recognition data, combine the phoneme data base for the received character code in real time, and then obtain the sound characteristics such as loudness, pitch, voice information, vocal coefficients, etc. and synthesize the sound based on this. .

Description

Document reading system

본 내용은 요부공개 건이므로 전문내용을 수록하지 않았음Since this is an open matter, no full text was included.

제1도는 문서낭독 시스템의 구성도.1 is a block diagram of a document reading system.

제2도는 문자인식 흐름도.2 is a character recognition flow chart.

제5도는 음성합성기의 블럭구성도.5 is a block diagram of a speech synthesizer.

Claims

In a document reading system, a scanning process of converting image information of an original into image data of binarization in units of pixels, receiving the image data, extracting the image data in units of one string using a horizontal histogram, and extracting the extracted image state of one string Analyze each character by analyzing, and combine the phoneme database of the received character code by receiving the character recognition data, and then obtain the sound characteristics such as loudness, tooth value, presence and absence voice information, vocal tract coefficient, etc. A method of reading a document, characterized in that it consists of a process of synthesizing sound on the basis.

The method of claim 1, wherein the character recognition process comprises: extracting one string from the received image data according to the distribution state of the number of pixels, analyzing the number of pixels in the extracted one string, and separating the character strings into character area units; Classifying a character type of the corresponding character area after separating the character area; classifying a Hangul type according to the Hangul temporary phoneme configuration state in the classification process; separating the phoneme of the character after classifying the Hangul type; Recognizing the vowels and consonants of each phoneme after separating the phonemes, generating a code of the corresponding character, a large classification process for classifying the presence or absence of uppercase and lowercase letters in the classification process, and recognizing the corresponding alphabetic character after performing the large classification process. Document reading method, characterized in that consisting of a small classification process for generating an English code.

The method according to claim 1 or 2, wherein the speech synthesis process comprises: analyzing the syntax of the received character codes to determine the classification and type of the sentence, searching for a space to be read, and processing the text into symbols and symbols; After the analysis, the process of processing the synthesis unit for the rhyme process, the process of accent of the connection section and the pause sentence, the volume of the accent, and the process of performing the process of the connection process by performing a silent and tactile process after the luck; And a process of synthesizing the audio signal after the connection process.

In the document reading system, an image scanner converts an image information of an original into binarized image data, and receives the image data, extracts the image data in units of one string, and analyzes the extracted image state of each character string. After combining the recognized character recognizer and the phoneme data base for the received character code by receiving the character recognition data, the sound characteristics such as loudness, pitch, voice information, and vocal coefficients are obtained. And a document synthesizer configured to synthesize a speech synthesizer.

※ Note: The disclosure is based on the initial application.