KR19990036555A

KR19990036555A - Intelligent video capture and message display device

Info

Publication number: KR19990036555A
Application number: KR1019980027120A
Authority: KR
Inventors: 잭슨 씨.에스. 창; 데이비드 디.에스. 호; 프레드 에이취.와이. 첸; 샘슨 엑스.에스. 왕; 링 이
Original assignee: 예쿠이; 인벤텍 코오포레이션
Priority date: 1997-10-22
Filing date: 1998-07-06
Publication date: 1999-05-25

Abstract

영상을 포착하여, 카메라에 의해 포착된 영상을 가진 물체에 관련된 정보를 처리하여 디스플레이하는 영상 포착 및 메시지 디스플레이 장치는 전면 단부 및 후면 단부를 가진 본체, 렌즈의 전면에서 물체의 광 영상을 포착하는 렌즈 장착 림(rim)을 통해 상기 전면 단부에 결합된 렌즈, 상기 본체 상에서 렌즈에 대해 미리 정해진 위치에 장착된 디스플레이, 상기 본체상에서 상기 디스플레이에 대해 미리 정해진 위치에 장착된 스위치와, 한 단부가 상기 본체에 결합되는 2개의 단부를 가진 전면 단부로 부터 본체 앞으로 연장한 스탠드-오프 암 및, 미리 정해진 각도로 상기 스탠드-오프 암의 다른 단부에 결합된 횡 영상폭 유도장치를 포함한 거리 유도 장치를 구비하고 있다.An image capturing and message display apparatus for capturing an image and processing and displaying information related to an object having an image captured by a camera includes a main body having a front end and a rear end and a lens for capturing an optical image of an object from the front of the lens. A lens coupled to the front end via a mounting rim, a display mounted at a predetermined position relative to the lens on the body, a switch mounted at a predetermined position relative to the display on the body, and one end of the body A distance induction device including a stand-off arm extending forward from the front end having two ends coupled to the body and a lateral image width induction device coupled to the other end of the stand-off arm at a predetermined angle; have.

영상 포착 장치는 사용자가 선택한 물체의 영상을 나타내는 신호를 연속적으로 발생시켜, 실시간내에서 수신된 신호로 표시된 영상을 디스플레이하고, 스위치의 활성화와 동시에 이에 의해 선택된 물체의 정지 영상을 찍어, 특정 물체로서 정지 영상을 식별하여, 상기 물체에 관한 정보를 검색한다.The image capturing apparatus continuously generates a signal representing an image of an object selected by the user, displays an image displayed as a signal received in real time, and simultaneously captures a still image of the selected object by activating the switch, thereby making it a specific object. Identifies a still image and retrieves information about the object.

Description

Intelligent video capture and message display device

본 발명은 일반적으로 영상 처리 시스템에 관한 것으로서, 특히, 물체의 영상을 포착하고, 포착된 영상을 인식된 물체로서 인식하며, 인식된 물체에 관련된 정보를 검색하며, 검색된 정보를 디스플레이상에 디스플레이하는 카메라를 가진 시스템에 관한 것이다.The present invention relates generally to an image processing system, and more particularly, to capture an image of an object, recognize the captured image as a recognized object, retrieve information related to the recognized object, and display the retrieved information on a display. It relates to a system with a camera.

현대 사회는 사람이 흡수할 많은 정보 및 지식을 가진 환경에 있는 데, 그런 정보는 많은 포맷으로 이용할 수 있다. 예를 들면, 이는 텔레비젼 프로그램, 라디오 방송, 인터넷등으로 부터 도래할 수 있다. 그럼에도 불구하고, 가장 광범위하게 이용된 정보 소스는 여전히 인쇄된 매체, 즉 신문, 잡지 또는 책에서 비롯된다. 신문 또는 책은 이들이 정보를 인쇄된 포맷으로 제공하여, 신뢰 가능한 미래의 참조물이 될 수 있게 하는 다른 매체와는 다르다. 인터넷으로 부터 획득된 정보 조차도 미래 참조물을 위한 하드카피(hardcopy)로 인쇄된다.Modern society is in an environment with a lot of information and knowledge for humans to absorb, and that information is available in many formats. For example, it can come from television programs, radio broadcasts, the Internet, and the like. Nevertheless, the most widely used sources of information still come from printed media, ie newspapers, magazines or books. Newspapers or books are unlike any other medium in which they provide information in a printed format, making it a reliable future reference. Even information obtained from the Internet is printed in hardcopy for future reference.

학생들은 교과서 및, 선생에 의해 제공된 다른 인쇄자료를 판독함으로써 지식을 습득한다. 전문가는 시대의 흐름을 알도록 전문 잡지 또는 주간지를 판독한다. 기술자 및 과학자는 그들 자신이 각 전문 분야에서 최근의 개발에 뒤떨어지지 않도록 교과서, 기술 저널 또는 무역 잡지를 판독한다. 세일즈맨 및 상인은 이들의 소유한 제품의 추세를 알도록 무역 잡지를 판독한다. 인쇄된 포맷으로 이용 가능한 대량의 정보는 필수적으로 이런 자료를 판독함에 있어 인간의 일상 생활의 중요부분을 바치게 한다. 서류를 판독할 시에, 사람은 때때로 사전에서 단어 또는 숙어를 찾아 볼 필요가 있을 수 있다. 사전에서 잘모르는 수많은 단어를 찾을 필요가 있을 경우, 사람은 실망하게 될 수 있다. 이는 사람이 판독할 수 없게 하여, 배울 사람의 능력을 방해할 수 있다.Students acquire knowledge by reading textbooks and other printed materials provided by the teacher. Experts read professional magazines or weekly magazines to know the trends of the times. Technicians and scientists read textbooks, technical journals or trade magazines to keep up with recent developments in their respective areas of expertise. Salesmen and merchants read trade magazines to see trends in their owned products. The large amounts of information available in the printed format make it an essential part of human daily life in reading these materials. When reading a document, a person may sometimes need to look up words or idioms in a dictionary. If you need to find hundreds of unknown words in a dictionary, people can be disappointed. This makes it impossible for a person to read, which can interfere with a person's ability to learn.

전자 사전은 사전의 속성(features)을 제공하는 장치이지만, 사람이 구 사전의 페이지를 넘기지 않고 단어의 의미를 찾게 한다. 더욱이, 최근의 기술적 발전으로, 단어의 발음을 나타낼 수 있는 음성 속성을 가진 전자 사전의 제조가 가능해졌다. 그러나, 종래 전자 사전의 하나의 주요 결점은 사용자가 바라는 단어의 키를 누르게 할 필요가 있다는 것이다. 최근에, 수기(handwriting) 인식 기술은 특정 기록 패드상의 수기를 통해 단어를 입력시켰다. 이런 키-인(key-in) 또는 수기 동작은 느려지는 경향이 있고, 에러가 발생되기 쉬운데, 이는 서류에 대한 이해를 더욱 느리게 한다. 인쇄상의 에러로 전자 사전이 부정확한 단어를 찾게 될 경우, 사용자가 문장의 의미를 잘못 해석하게 할 수 있다. 이는, 특히 사람이 인쇄상의 에러를 행했음을 사용자가 이해하지 못할 경우에도 적용된다. 그래서, 전자 사전은 통상적인 타입-인(type-in) 또는 기록 방법과 다른 입력을 액셉트하는 것이 바람직하다.Electronic dictionaries are devices that provide the features of a dictionary, but allow a person to find the meaning of a word without turning the pages of an old dictionary. Moreover, recent technological developments have made it possible to manufacture electronic dictionaries with phonetic attributes that can represent the pronunciation of words. However, one major drawback of conventional electronic dictionaries is that the user needs to press a key of the desired word. Recently, handwriting recognition techniques have entered words through handwriting on specific writing pads. This key-in or handwriting behavior tends to be slow and error prone, which slows the understanding of the document. If a typographical error causes the electronic dictionary to find an incorrect word, it may cause the user to misinterpret the meaning of the sentence. This applies especially if the user does not understand that a person has made a typographical error. Thus, it is desirable for the electronic dictionary to accept input that is different from the usual type-in or recording method.

전자 사전을 전자 서류철과 같은 사이즈로 휴대할 수 있게 되었지만, 여전히 운반하기에는 부피가 너무 크다. 그래서, 전자 사전을 포켓에 넣을 수 있게 하여, 사용자가 운반하는 데에 너무 짐이 되지 않게 쉽게 이용할 수 있도록 하는 것이 바람직하다.Electronic dictionaries can be carried in the same size as electronic filings, but they are still too bulky to carry. Thus, it is desirable to be able to put the electronic dictionary in a pocket so that the user can easily use it so that it is not too burdensome to carry.

종래 스캐너의 대부분은 서류의 전체 페이지 또는 한 라인을 스캐닝한다. 이는 전자 사전내에 이용하기에는 적절치 않다. 전자 사전이 통상적으로 단일 단어의 정의를 제공할 뿐이기 때문에, 서류의 전체 페이지 또는 한 라인 조차 스캐닝하는 것은 너무 짐이 되고, 비경제적일 수 있다. 그래서, 하나 이상의 단어의 영상을 한 번에 포착할 수 있는 가벼운 휴대용 장치를 갖는 것이 바람직하다.Most of the conventional scanners scan an entire page or a line of documents. This is not suitable for use in electronic dictionaries. Since electronic dictionaries typically only provide the definition of a single word, scanning an entire page or even a line of a document can be too burdensome and uneconomical. Thus, it is desirable to have a lightweight portable device that can capture images of more than one word at a time.

그래서, 본 발명의 목적은 카메라로 물체의 영상을 포착할 수 있고, 물체와 관련된 정보를 장치상에 디스플레이할 수 있는 장치를 제공하는 것이다.It is therefore an object of the present invention to provide a device capable of capturing an image of an object with a camera and displaying information related to the object on the device.

본 발명의 다른 목적은 사용자가 단어의 키를 수동으로 누를 필요성을 제거하도록 단어의 영상을 포착할 수 있고, 단어에 관련된 정보를 장치에 부착된 디스플레이상에 디스플레이할 수 있는 전자 사전을 제공하는 것이다.Another object of the present invention is to provide an electronic dictionary capable of capturing an image of a word so as to eliminate the need for a user to manually press a key of the word, and displaying information related to the word on a display attached to the device. .

본 발명의 또다른 목적은 사용자가 편안하게 운반할 수 있는 휴대용 전자 사전을 제공하는 것이다.Another object of the present invention is to provide a portable electronic dictionary which can be carried comfortably by a user.

도 1 은 양호한 실시예의 영상 포착 및 메시지 디스플레이 장치의 사시도.1 is a perspective view of an image capture and message display apparatus of the preferred embodiment;

도 2 는 다른 양호한 실시예의 영상 포착 및 메시지 디스플레이 장치의 사시도.2 is a perspective view of an image capture and message display device of another preferred embodiment;

도 3 은 양호한 실시예의 주요 소자를 도시한 블록도.3 is a block diagram showing the main elements of a preferred embodiment;

도 4 는 양호한 실시예의 CCD 기초 영상 센서의 개략도.4 is a schematic diagram of a CCD elementary image sensor of the preferred embodiment;

도 5 는 양호한 실시예의 영상 포착 동작 및 메시지 디스플레이 동작을 설명한 흐름도.Fig. 5 is a flowchart for explaining an image capturing operation and a message displaying operation in a preferred embodiment.

도 6 은 양호한 실시예의 영상인식 프로세스를 설명한 흐름도.6 is a flow chart illustrating an image recognition process of a preferred embodiment.

도 7 은 양호한 실시예의 매칭(matching) 동작에서 물체의 특성 픽셀 선택에 대해 설명한 일례도.FIG. 7 is an exemplary diagram illustrating characteristic pixel selection of an object in a matching operation of the preferred embodiment. FIG.

도 8 은 양호한 실시예의 루프 인식 서브프로세스의 매칭 동작을 설명한 흐름도.8 is a flowchart illustrating a matching operation of the loop aware subprocess of the preferred embodiment.

도 9 는 정보 탐색(look-up) 동작을 설명한 흐름도.9 is a flowchart illustrating an information look-up operation.

본 발명의 특정 실시예에서, 하나 이상의 선택된 물체의 하나 이상의 영상을 포착 및 디스플레이하고, 선택된 영상을 처리하여, 컴퓨터 시스템에 의해 인식 가능한 코드를 가진 특정 물체로서 상기 선택된 영상을 식별하고, 상기 물체에 관한 정보를 검색 및 디스플레이하는 장치가 기술된다. 일 실시예에서, 장치는 전면 단부 및 후면 단부를 가진 본체, 렌즈의 전면에서 물체의 광 영상을 포착하는 렌즈 장착 림(rim)을 통해 상기 전면 단부에 결합된 렌즈, 상기 본체상에 장착된 디스플레이, 상기 본체상에 위치된 스위치 및, 거리 유도장치(distance guide)를 포함한다. 이런 거리 유도장치는 스탠드-오프(stand-off) 암 및 횡 영상폭 유도장치는 더 포함한다. 스탠드-오프 암은 본체의 전면에서 앞으로 연장하고, 2개의 단부를 갖는데, 한 단부는 본체에 결합되고, 다른 단부는 횡 영상폭 유도 장치에 결합된다. 영상 포착 장치는 사용자가 선택한 물체의 영상을 나타내는 신호를 연속적으로 발생시켜, 실시간내에서 수신된 신호로 표시된 영상을 디스플레이하고, 스위치의 활성화와 동시에 이에 의해 선택된 물체의 정지(still) 영상을 찍어, 특정 물체로서 정지 영상을 식별하여, 상기 물체에 관한 정보를 검색한다.In certain embodiments of the present invention, one or more images of one or more selected objects are captured and processed, and the selected images are processed to identify the selected images as specific objects having a code recognizable by a computer system, An apparatus for retrieving and displaying information related is described. In one embodiment, the device comprises a body having a front end and a rear end, a lens coupled to the front end via a lens-mounted rim that captures an optical image of an object at the front of the lens, a display mounted on the body. And a switch located on the body and a distance guide. Such distance inducing devices further include stand-off arms and lateral image width inducing devices. The stand-off arm extends forward from the front of the body and has two ends, one end coupled to the body and the other end coupled to the transverse width guide device. The image capturing apparatus continuously generates a signal representing an image of an object selected by the user, displays an image represented by a signal received in real time, and simultaneously captures a still image of the selected object by activating the switch, A still image is identified as a specific object, and information about the object is retrieved.

본 발명의 한 잇점은 카메라로 물체의 영상을 포착할 수 있고, 물체와 관련된 정보를 장치상에 디스플레이할 수 있는 정치를 제공한다는 것이다.One advantage of the present invention is that it provides a stationary camera capable of capturing an image of an object and displaying information related to the object on a device.

본 발명의 다른 잇점은 사용자가 키보드 또는 기록 패드로 부터 단어를 입력시킬 필요성을 제거하도록 단어를 포착할 수 있고, 단어에 관련된 정보를 장치에 장착된 디스플레이상에 디스플레이할 수 있는 전자사전을 제공하는 것이다.Another advantage of the present invention is to provide an electronic dictionary capable of capturing words to eliminate the need for a user to enter words from a keyboard or recording pad, and displaying information related to the words on a display mounted to the device. will be.

본 발명의 또다른 잇점은 사용자가 편안하게 운반할 수 있는 휴대용 전자 사전을 제공하는 것이다.Another advantage of the present invention is to provide a portable electronic dictionary that can be carried comfortably by the user.

본 발명의 상기 및 다른 목적, 특성 및 잇점은 도면을 참조로 양호한 실시예의 아래의 상세 설명을 판독한 후에 본 기술 분야의 숙련자에게는 명백해진다.The above and other objects, features and advantages of the present invention will become apparent to those skilled in the art after reading the following detailed description of the preferred embodiments with reference to the drawings.

본 발명이 많은 형태로 실시될 수 있지만, 양호한 상세 사항은 도 1 내지 9 에 개략적으로 도시되며, 이런 기술은 본 발명을 설명하는 실시예로 제한하는 것으로 해석되지 않는다.While the invention may be embodied in many forms, preferred details are shown schematically in Figs. 1-9, and such techniques are not to be construed as limiting the embodiments to the invention.

본 발명은 영상 포착 및 메시지 디스플레이 장치를 특징으로 한다. 이런 장치는 물체의 광영상을 포착한다. 이런 물체는 단어, 숫자, 화상 및, 인쇄 자료의 2차원 포맷의 그래프이거나, 시계, 개, 차량과 같은 3차원 물체일 수 있다. 전자 사전에서, 물체는 많은 문자로 구성된 단어일 수 있다. 상기 장치는 단어를 문자 서브파트(subpart)로 분리하여, 각 문자의 광영상 신호를 디지털 신호로 변환한다. 그 다음, 상기 장치는 그 단어에 의해 인식 프로세스를 수행시켜, 데이터베이스내에서 단어에 관련된 정보를 탐색하고, 발견된 정보를 검색하여, 상기 장치의 비디오 디스플레이상에 정보를 디스플레이한다.The present invention features an image capture and message display apparatus. These devices capture optical images of objects. Such objects can be graphs in two-dimensional formats of words, numbers, images, and printed materials, or three-dimensional objects such as watches, dogs, and vehicles. In an electronic dictionary, an object may be a word composed of many letters. The device divides a word into character subparts and converts the optical image signal of each character into a digital signal. The device then performs a recognition process by that word to search for information related to the word in the database, retrieve the found information and display the information on the video display of the device.

도 1 에서, 영상 포착 및 메시지 디스플레이 장치의 양호한 실시예의 사시도가 설명된다. 영상 포착 및 메시지 디스플레이 장치(20)는 본체(12)를 갖는다. 본 실시예에서, 본체(12)는 사용자가 그의 손안에 편안히 잡을 수 있는 펜을 닮았다. 상기 장치(10)는 전면 단부(14) 및 후면 단부(16)를 갖고 있다. 렌즈(18)는 본체(12)의 전면 단부(14)에 위치된다. 본체(12) 내부의 렌즈(18)뒤에는 물체의 광 영상 포착용(도시되지 않은) CCD 카메라가 장착된다. 렌즈(18)는 그 주변에 렌즈 장착 림(19)을 갖고 있다. 림(19)은 소정의 다른 적당한 방식으로 본체(12)에 스냅(snap)되고, 밀착되며, 스레드(thread)되거나 고정될 수 있다. 스탠드-오프 암(20)은 본체(12)의 전면 단부(14)에서 앞으로 연장된다. 스탠드-오프 암(20)은 2개의 단부를 갖는데, 암(20)의 한 단부(22)는 본체(12)에 접속된다. 횡 영상폭 유도 장치(24)는 적당한 수단을 통해 암(20)에 고정된 암(20)의 말단부(26)에 접속된다. 횡 영상폭 유도장치(24)는 미리 정해진 각도에서 암(20)에 고정될 수 있다. 횡 영상폭 유도 장치(24)가 스탠드-오프 암(20)상에 고정되는 각도는 장치(10)의 동작에 영향을 주지 않는다. 유도장치(24)는 또한 암(20)에 수직으로 고정될 수 있다. 선택적으로, 횡 영상폭 유도장치(24)는 볼 조인트(ball joint)로 스탠드-오프 암(20)에 결합될 수 있음으로써, 유도장치(24)는 본체(12)에 대한 소정의 각도로 될 유연성을 가질 수 있다. 유도장치(24)를 암(20)에 접속하는 볼 조인트를 이용한 한가지 잇점은 사용자가 기울인 위치에서도 편안하게 사용할 수 있다는 것이다. 다른 잇점은 거리 유도 기구를 더욱 튼튼하게 하고, 암(20)을 파괴하지 않게 한다는 것이다. 스탠드-오프 암(20) 및 횡 영상폭 유도 장치(24)는 서로 거리 유도장치(25)를 형성한다. 암(20)의 길이는 렌즈(18)의 초점길이로 결정됨으로써, 렌즈(18)는 항상 렌즈의 앞에 암(20)의 길이와 같은 거리에서 물체상에 접속된다. 본체(12)의 후면 단부(16)에서, 캡(42)은 본체(12)상에 고정된다. 렌즈 장착 림(19)과 마찬가지로, 캡(42)은 본체(12)상에 스냅되거나, 본체(12)상에 스크루(screw)될 수 있다(도시되지 않은) 배터리는 장치(10)의 동작을 위한 전원을 공급하도록 후면 단부(16)에서 본체(12) 내부에 장착된다.1, a perspective view of a preferred embodiment of an image capture and message display apparatus is described. The image capture and message display apparatus 20 has a main body 12. In this embodiment, the body 12 resembles a pen that a user can comfortably hold in his hand. The device 10 has a front end 14 and a back end 16. The lens 18 is located at the front end 14 of the body 12. Behind the lens 18 inside the main body 12, a CCD camera (not shown) for capturing an optical image of an object is mounted. The lens 18 has a lens mounting rim 19 around it. Rim 19 may be snapped to, adhered to, threaded or secured to body 12 in any other suitable manner. The stand-off arm 20 extends forward at the front end 14 of the body 12. The stand-off arm 20 has two ends, one end 22 of the arm 20 being connected to the body 12. The transverse width guide device 24 is connected to the distal end 26 of the arm 20 which is fixed to the arm 20 by any suitable means. The lateral image width induction device 24 may be fixed to the arm 20 at a predetermined angle. The angle at which the lateral width guide device 24 is fixed on the stand-off arm 20 does not affect the operation of the device 10. Guide device 24 may also be fixed perpendicularly to arm 20. Optionally, the transverse width guide device 24 can be coupled to the stand-off arm 20 by a ball joint, so that the guide device 24 can be at an angle to the body 12. It can have flexibility. One advantage of using the ball joint to connect the induction device 24 to the arm 20 is that the user can comfortably use it even in a tilted position. Another advantage is that the distance guidance mechanism is more robust and does not destroy the arm 20. The stand-off arm 20 and the lateral image width induction device 24 form a distance induction device 25 with each other. The length of the arm 20 is determined by the focal length of the lens 18 so that the lens 18 is always connected to the object at the same distance as the length of the arm 20 in front of the lens. At the rear end 16 of the body 12, the cap 42 is fixed on the body 12. Like the lens mounted rim 19, the cap 42 can be snapped onto the body 12, or screwed onto the body 12 (not shown). It is mounted inside the body 12 at the rear end 16 to supply power for it.

플랫 패널(flat-panel) 디스플레이인 디스플레이(28)는 장치(10)의 본체(12)상에 위치된다. 양호한 실시예에서, 디스플레이는 액정 디스플레이이다. 다수의 버튼(30, 32)은 디스플레이(28)의 강도를 조정하기 위해 디스플레이(28)의 근처에 위치된다. 제 1 버튼(30)을 누름으로서 디스플레이의 강도가 증가된다. 제 2 버튼(32)을 누름으로서 디스플레이의 강도가 감소된다. 또한 본체상에는 다수의 스위치(34, 36, 38, 40)가 있다. 제 1 스위치(34)는 장치(10)내의 렌즈 및 다른 소자를 통해 물체의 속사(snap-shot) 영상을 찍기 위한 것이다. 다른 소자 및 그의 동작에 대해서는 아래에서 기술된다. 다른 3개의 스위치(36, 38, 40)는, 렌즈의 초점을 조정하고, 물체를 관찰하기 위한 줌(zoom) 동작을 제어하며, 물체의 샘플 발음을 생성시키며, 음성 인식 동작을 활성화시키고, 통신 동작을 활성화시키거나, 단순히 온/오프 스위치 역할을 하는 바와 같은 다른 응용에 이용될 수 있다. 이런 3개의 스위치에 대한 다른 대안은 이런 스위치가 시프트 키, 알토 키(alt-key) 또는 제어키로서 이용되어, 스위치의 응용을 조합할 수 있다는 것이다. 이런 부가적인 특징은 또한 디스플레이의 강도를 제어하는 버튼(30,32)에 적용될 수 있다.The display 28, which is a flat-panel display, is located on the body 12 of the device 10. In a preferred embodiment, the display is a liquid crystal display. A number of buttons 30, 32 are located near the display 28 to adjust the intensity of the display 28. By pressing the first button 30 the intensity of the display is increased. The intensity of the display is reduced by pressing the second button 32. There are also a number of switches 34, 36, 38, 40 on the body. The first switch 34 is for taking a snap-shot image of the object through the lens and other elements in the device 10. Other elements and their operation are described below. The other three switches 36, 38, 40 adjust the focus of the lens, control the zoom operation for observing the object, generate sample pronunciation of the object, activate the voice recognition operation, and communicate It can be used for other applications such as activating operation or simply acting as an on / off switch. Another alternative to these three switches is that such switches can be used as a shift key, alt-key or control key to combine the application of the switches. This additional feature can also be applied to buttons 30 and 32 that control the intensity of the display.

장치(10)는 횡 영상폭 유도장치(24) 바로위의 렌즈(18) 앞에 물체의 광 영상을 포착하고, 물체의 광 영상 신호를 발생시키며, 제 1 스위치(34)로 부터의 지시 사항(indications)을 수신함과 동시에 물체를 식별하여 인식하며, 물체에 관련된 정보를 탐색하며, 그리고 물체의 영상 및, 그에 관련된 정보를 디스플레이상에 디스플레이할 수 있다. 이런 물체는 단어, 숫자 또는 다른 형태의 물체일 수 있다.The device 10 captures an optical image of the object in front of the lens 18 directly above the lateral image induction device 24, generates an optical image signal of the object, and provides instructions from the first switch 34 ( At the same time as receiving the indications, the object can be identified and recognized, search for information related to the object, and an image of the object and the related information can be displayed on the display. Such objects can be words, numbers or other forms of objects.

장치(10)는 펜처럼 휴대할 수 있다. 일례의 전자 사전에서, 영상 포착 및 메시지 디스플레이 장치(10)는 단어의 광 영상을 포착하고, 단어에 관련된 정의(definition) 및 다른 정보를 찾으며, 그리고 단어 및 그의 정의와 정보를 디스플레이상에 디스플레이한다. 일례의 개인 정보 관리(PIM) 시스템에서, 영상 포착 및 메시지 디스플레이 장치(10)는 사람의 이름 또는 조직 명칭을 포착하고, 관련 사람의 전화 및/또는 팩시밀리 번호, 주소, 회의 스케쥴 및/또는 의사 일정 등을 찾는다. 특정 날짜의 카렌더 사건 또는 지정(events or appointments)은 또한 카렌더 날짜를 포착하여, 그 날짜를 인식하며, 그 당시의 스케쥴을 검색하며, 그리고 그 스케쥴을 디스플레이상에 디스플레이함으로써 검색될 수 있다. 또한, 사무용 카드상의 정보는 영상 포착 장치에 의해 포착되어, 데이터베이스내에 저장될 수 있다. 데이터베이스내에 저장된 정보는 사무용 카드상의 키 요소를 식별하여, 영상 포착 장치를 통해 상기 요소의 정지 영상을 포착함으로써 검색될 수 있다.The device 10 can be carried like a pen. In an example electronic dictionary, image capture and message display apparatus 10 captures an optical image of a word, finds definitions and other information related to the word, and displays the word and its definition and information on a display. . In an example personal information management (PIM) system, the image capture and message display device 10 captures a person's name or organization name and includes the person's telephone and / or facsimile number, address, meeting schedule and / or agenda. Find your back. Calendar events or appointments of a particular date can also be retrieved by capturing the calendar date, recognizing the date, retrieving the schedule at that time, and displaying the schedule on a display. In addition, the information on the office card can be captured by the image capturing apparatus and stored in the database. The information stored in the database can be retrieved by identifying the key element on the business card and capturing a still image of the element via the image capturing device.

렌즈(18)는 원형 렌즈이다. 그러나, 전자 사전에서, 영상이 장치에 의해 포착되는 물체는 보통 직각형이다. 이는, 영어 또는 다른 서양 언어에서, 각 단어가 많은 알파벳에 의해 생성되는 이유이다. 이런 알파벳을 함께 놓을 시에, 상기 알파벳은 보통 좌에서 우로 배열되고, 직각형 단어를 유발시킨다. 그래서, 직각형의 물체의 광 영상을 포착하기 위하여, 렌즈(18)는 원형일 필요가 없다. 이는 본 발명의 영상 포착 프로세스가 거의 직각형인 물체에 관계하기 때문에 원형 렌즈의 부분일 수 있다. 선택적으로, 렌즈는 또한 렌즈의 두께를 줄이도록 프레스널(Fresnel) 렌즈의 부분 또는 프레스널 렌즈일 수 있다. 이는 또한 원통형 렌즈 또는 원통형 렌즈의 부분일 수 있다. 그러나, 중국어 또는 한국어와 같은 외국어에 대하여, 단어는 보통 사각형이다. 그래서, 중국어 또는 한국어 전자 사전에 이용된 렌즈는 직각형 대신에 원형일 수 있다.Lens 18 is a circular lens. However, in an electronic dictionary, the object whose image is captured by the device is usually rectangular. This is why in English or other Western languages, each word is produced by many alphabets. When putting these alphabets together, the alphabets are usually arranged from left to right, resulting in rectangular words. Thus, in order to capture an optical image of a rectangular object, the lens 18 need not be circular. This may be part of a circular lens because the image capture process of the present invention relates to an object that is nearly rectangular. Optionally, the lens may also be part of a Fresnel lens or Fresnel lens to reduce the thickness of the lens. It may also be a cylindrical lens or part of a cylindrical lens. However, for foreign languages such as Chinese or Korean, words are usually square. Thus, lenses used in Chinese or Korean electronic dictionaries may be circular instead of rectangular.

도 2 는 양호한 실시예의 선택적 실시예의 사시도이다. 이런 도면에는 물체의 영상을 포착하여, 상기 물체에 관련된 정보를 디스플레이하기 위해 컴퓨터 마우스와 닮은 제 2 장치(44)가 도시된다. 도 1 에서 언급된 장치(10)와 상기 제 2 장치(44)를 구별하기 위하여, 상기 제 2 장치(44)는 본 발명에서 마우스(44)로서 식별된다.2 is a perspective view of an alternative embodiment of the preferred embodiment. This figure shows a second device 44 that resembles a computer mouse to capture an image of an object and display information related to the object. In order to distinguish the device 10 mentioned in FIG. 1 from the second device 44, the second device 44 is identified as a mouse 44 in the present invention.

마우스(44)는 본체(46)를 가지고 있다. 마우스(44)는 전면 단부(48) 및 후면 단부(50)를 갖는다. 마우스(44)는 또한 상부(52), (도시되지 않은)하부 및 2개의 측면부(54, 56)를 갖는다. 스탠드-오프 운반자(carrier)(58)는 전면 단부(48)에 대해 미리 정해진 위치에 (도시되지 않은) 렌즈를 위치시킨다. 스탠드-오프 운반자(58)는 상향 상승부(62) 섹션 및 플랫폼(64) 섹션을 포함한다. 상향 상승부(62)는 본체(46)의 전면 단부(48)에 회전 가능하게 고정됨으로써, 플랫폼(64)은 본체(46)위의 미리 정해진 높이까지 올려진다. 플랫폼(64)은 상위 표면(66) 및 하위 표면(68)을 포함한다. 렌즈는 광 축이 스탠드-오프 운반자(58)의 상향 상승부(62)와 병렬인 식으로 하위 표면(68)상에 설치된다. 다수의 올려진 측면(70)으로 에워싸인 상위 표면(66) 및 하위 표면(68)은(도시되지 않은) 하우징을 한정한다. 상위표면(66) 및 하위표면(68)간의 거리는 렌즈뒤에 CCD 카메라를 놓기 위한 충분한 공간을 제공한다.The mouse 44 has a body 46. The mouse 44 has a front end 48 and a back end 50. Mouse 44 also has an upper portion 52, a lower portion (not shown), and two side portions 54, 56. The stand-off carrier 58 positions the lens (not shown) at a predetermined position relative to the front end 48. The stand-off carrier 58 includes an upward rise 62 section and a platform 64 section. The upward rise 62 is rotatably fixed to the front end 48 of the body 46, whereby the platform 64 is raised to a predetermined height above the body 46. Platform 64 includes an upper surface 66 and a lower surface 68. The lens is mounted on the lower surface 68 in such a way that the optical axis is in parallel with the upward rise 62 of the stand-off carrier 58. Upper surface 66 and lower surface 68 surrounded by a number of raised sides 70 define a housing (not shown). The distance between the upper surface 66 and the lower surface 68 provides sufficient space for placing the CCD camera behind the lens.

마우스(44)의 전면 단부(48)의 하부 근처에서 전면단부(48)로 안내 부재(72)가 연장된다. 안내부재(72)는 개구(74)를 한정하는 거의 직각형의 부재이다. 개구(74)는 또한 거의 직각형이다. 안내 부재(72)는 렌즈의 광 축이 개구(74)를 통과하는 식으로 위치된다.Guide member 72 extends to front end 48 near the bottom of front end 48 of mouse 44. Guide member 72 is a substantially rectangular member defining opening 74. The opening 74 is also nearly rectangular. Guide member 72 is positioned in such a way that the optical axis of the lens passes through opening 74.

양호한 실시예에서, 플랫폼(64)은 축(69)을 통해 상향 상승부(62)에 회전 가능하게 고정된다. 플랫폼(64)은 회전되어, 상향 상승부(62)와 병렬이 되도록 하향으로 접힌 플랫폼(64)을 가질 수 있다. 상향 상승부(62)는 본체(46)에 대한 (도시되지 않은) 축상에 회전 가능하게 설치된다. 상향 상승부(62) 및 플랫폼(64)을 포함하는 스탠드-오프 운반자(58)는 상향 상승부(62)의 앞과 그와 나란히 하향으로 접힐 수 있다. 스탠드-오프 운반자(58)는 그때 본체(46)내로 슬라이드(slide)될 수 있다. 안내 부재(72)는 (도시되지 않은) 슬라이드 가능 운반자상에 설치되고, 마우스(44)의 본체(46)내로 쑥 들어갈 수 있다. 이런 특징의 상세 메카니즘은 계류중인 출원에서 기술된다.In the preferred embodiment, the platform 64 is rotatably fixed to the upward rise 62 through the axis 69. The platform 64 may be rotated to have the platform 64 folded downward to be in parallel with the upward rise 62. The upward rise 62 is rotatably mounted on an axis (not shown) with respect to the body 46. The stand-off carrier 58 including the upward rise 62 and the platform 64 can be folded downward in front of and in parallel with the upward rise 62. Stand-off carrier 58 may then slide into body 46. Guide member 72 is mounted on a slidable carrier (not shown) and can dent into body 46 of mouse 44. The detailed mechanism of this feature is described in pending applications.

디스플레이(76)는 본체(46)의 상부(52)상에 위치된다. 양호한 실시예에서, 디스플레이는 LCD 디스플레이이다. 다수의 스위치(78, 80)는 본체(46)의 상부(52)상에 위치된다. 선택적으로, 스위치(82)는 또한 본체(46)의 어느하나 또는 둘 측면(54, 56)상에 위치될 수 있다. 제 1 스위치(78)는 마우스(44)내의 CCD 카메라 및 렌즈를 통해 물체의 속사 영상을 찍기 위한 것이다. 다른 스위치(72, 74)는 전술된 다른 응용에 이용될 수 있다. 분실(compartment)은 배터리가 저장될 수 있는 마우스(44)의 하부에서 접근하기 쉬운 본체(46)내에 위치된다. 배터리 분실은 그 위치내에 배터리를 수용하는 스냅-온 형 덮개(snap-on type cover)를 갖는다.Display 76 is located on top 52 of body 46. In a preferred embodiment, the display is an LCD display. Multiple switches 78, 80 are located on top 52 of body 46. Optionally, the switch 82 may also be located on either or both sides 54, 56 of the body 46. The first switch 78 is for taking a fast-shooting image of the object through the CCD camera and the lens in the mouse 44. Other switches 72 and 74 can be used for the other applications described above. The compartment is located in the main body 46 which is accessible at the bottom of the mouse 44 where the battery can be stored. Battery loss has a snap-on type cover that houses the battery in its location.

도 1 의 영상 포착 및 메시지 디스플레이 장치(10)에서 기술된 렌즈와 마찬가지로, 마우스형 장치(44)내에 사용된 렌즈는 또한 많은 형태로 제조될 수 있다. 이는 원형 렌즈, 원형 렌즈의 부분, 프레스널 렌즈, 프레스널 렌즈의 부분, 원통형 렌즈 또는 원통형 렌즈의 부분일 수 있다.Like the lenses described in the image capture and message display device 10 of FIG. 1, the lenses used in the mouse-like device 44 can also be manufactured in many forms. It may be a circular lens, part of a circular lens, Fresnel lens, part of Fresnel lens, cylindrical lens or part of cylindrical lens.

도 3 은 물체(84)의 영상을 포착하여, 물체(84)에 관련된 정보를 디스플레이(28)상에 디스플레이하는 장치(10) 또는 마우스(44)의 양호한 실시예의 블록도이다. 아래의 기술은 장치(10)에 관하여 설명되지만, 장치(10) 및 마우스(44)의 내부 기능 구조 및 동작이 동일하기 때문에, 마우스(44)에도 균등하게 적용할 수 있다. 장치(10)는, 렌즈(18)(또는 마우스(44)내의 (62)) 및, 물체(84)의 광영상을 감지하는 2차원 영상 센서(88)를 가진 카메라(86), 아날로그-디지털(A/D) 변환기(90), 디지털 신호 프로세서(DSP)(92), 판독 전용 기억장치(ROM)(94), 임의 접근 기억장치(RAM)(96), 디스플레이(28), 스위치를 포함한다.3 is a block diagram of a preferred embodiment of a device 10 or mouse 44 that captures an image of an object 84 and displays information related to the object 84 on a display 28. Although the description below is described with respect to the device 10, the internal functional structures and operations of the device 10 and the mouse 44 are the same, and therefore, the same can be applied to the mouse 44 evenly. Device 10 is an analog-digital camera 86 with lens 18 (or 62 in mouse 44) and a two-dimensional image sensor 88 that senses an optical image of an object 84. (A / D) converter 90, digital signal processor (DSP) 92, read only memory (ROM) 94, random access memory (RAM) 96, display 28, switch do.

2차원 영상 센서(88)는 렌즈(18)뒤에 위치된다. 2차원 영상 센서(88)는 렌즈(16)를 통과한 물체(90)의 광 영상 신호를 수신하고, 광 영상 신호를 아날로그 영상 신호로 변환하여, 아날로그 영상 신호를 아날로그-디지털(A/D) 변환기(90)로 전송한다. A/D 변환기(90)는 아날로그 영상 신호로 부터 디지털 영상 신호를 발생시켜, 디지털 영상 신호를 디지털 신호 프로세서(DSP)(92)로 보낸다. 스위치(34)는 DSP(92)에 결합되고, 사용자에 의해 제어되어, 장치(10)의 횡 영상 폭 유도장치(24) 위(도 1 참조), 또는 안내 부재(72)의 개구(74)내에 (도 2 참조) 있는 물체(84)의 속사 정지 영상을 찍는다. DSP(92)는 ROM(94) 및 RAM(96)에 결합된다. ROM(94)은 DSP(80)의 동작을 제어하기 위한 명령어를 포함한다. 또한, ROM(94)내에는 물체(84)에 관련된 모든 정보를 포함하는 데이터 베이스나, 장치(10)에 의해 수행될 문자 인식 동작을 위한 템플레이트 라이브러리(template library)가 포함된다. ROM(94)은 또한 플래시 기억장치형 임으로써, 최근 정보가 장치(10)상의 전원을 턴 오프하기 전에 저장될 수 있다. RAM(96)은 DSP(92)에 의해 제어된 장치 동작을 위한 일반적인 기억장치이다. DSP(92)는 2차원 영상 센서(88)에 전기적으로 결합되어, 2차원 영상 센서(88)의 동작을 제어한다.The two-dimensional image sensor 88 is located behind the lens 18. The two-dimensional image sensor 88 receives an optical image signal of the object 90 passing through the lens 16, converts the optical image signal into an analog image signal, and converts the analog image signal into an analog-digital (A / D) signal. Send to converter 90. The A / D converter 90 generates a digital video signal from the analog video signal, and sends the digital video signal to a digital signal processor (DSP) 92. The switch 34 is coupled to the DSP 92 and controlled by the user, such that the opening 74 of the lateral image width induction device 24 of the device 10 (see FIG. 1) or of the guide member 72. A quick still image of the object 84 within (see FIG. 2) is taken. DSP 92 is coupled to ROM 94 and RAM 96. The ROM 94 includes instructions for controlling the operation of the DSP 80. Also included in ROM 94 is a database containing all information related to object 84, or a template library for character recognition operations to be performed by device 10. The ROM 94 is also flash memory type so that recent information can be stored before turning off the power on the device 10. The RAM 96 is a general storage device for the device operation controlled by the DSP 92. The DSP 92 is electrically coupled to the two-dimensional image sensor 88 to control the operation of the two-dimensional image sensor 88.

확성기, 스피커 및 통신 수단은 또한 상기 장치내에 포함되고, 모두 DSP(92)에 결합되어, 음성 인식, 음성 합성 또는 통신 특성을 제공한다. 부가적인 스위치가 또한 DSP(92)에 결합되어, 확성기, 스피커, 통신 또는 장치의 온/오프를 제어할 수 있다.Loudspeakers, speakers and communication means are also included within the apparatus and are all coupled to the DSP 92 to provide speech recognition, speech synthesis or communication characteristics. Additional switches may also be coupled to the DSP 92 to control on / off of loudspeakers, speakers, communications, or devices.

양호한 실시예에서, 장치(10)는 물체(84)의 영상을 포착한다. 물체(84)의 영상을 포착하기 위하여, 렌즈(18)는 물체(84)에서 거리(100)으로 떨어져 위치되어야 한다. 물체(84)의 선명한 영상을 포착할 카메라(86)에 대하여, 거리(100)는 렌즈(18)의 초점 길이 근처내에 있어야 한다. 펜형 장치(10)의 경우에, 거리는 계산되어, 스탠드-오프 암(20)의 길이만큼 미리 정해진다. 마우스형 장치(44)의 경우에, 거리(100)는 안내 부재(72)의 개구(74)내에 위치된 물체와 렌즈 사이의 거리, 즉 대략 상향 상승부(62)의 길이이다.In the preferred embodiment, the device 10 captures an image of the object 84. In order to capture an image of the object 84, the lens 18 must be positioned at a distance 100 from the object 84. For camera 86 to capture a clear image of object 84, distance 100 should be within the focal length of lens 18. In the case of the pen-shaped device 10, the distance is calculated and predetermined by the length of the stand-off arm 20. In the case of the mouse-like device 44, the distance 100 is the distance between the lens and the object located in the opening 74 of the guide member 72, ie, approximately the length of the upward rise 62.

물체(84)는 서류의 단어와 같이 문서 포맷으로 되어 있다. 영상 포착 장치(10)는 개구(74)내 또는 횡 영상폭 유도장치(24)위의 물체(84)의 영상을 포착한다. 예를 들면, 전자 사전에서, 양호한 실시예의 영상 포착 장치(10)는 카메라로 사진을 찍는 것처럼 단어의 영상을 단일 동작으로 포착한다. 종래 기술에서, 단어의 영상은 통상적으로 서류의 페이지 또는 라인의 일부로서 컴퓨터내로 스캔된다. 종래의 스캐너는 서류의 동일 라인내의 다른 단어로부터 한 단어를 선택적으로 스캔할 수 없다. 그래서, 전자 사전의 사용자는 단어의 정의를 검색하기 위해 스캐너를 입럭장치로서 사용할 수 없다. 또한, PIM 사용자는 통상적인 스캐너를 통해 카렌더 날짜를 스캔함으로써 특정 날짜의 지정을 질의할 수 없다. 본 발명의 영상 포착 장치(10)는 단일 단어, 많은 단어나, 단어의 길이에 의존하는 단어의 일부의 영상을 포착할 수 있다. 그래서, 이런 장치는 작은 물체나, 항목의 일부의 영상을 포착하기 위한 유연성을 더 많이 제공한다.The object 84 is in document format like a word in a document. The image capturing device 10 captures an image of the object 84 in the opening 74 or on the lateral image width inducing device 24. For example, in an electronic dictionary, the image capturing apparatus 10 of the preferred embodiment captures an image of a word in a single operation as if taking a picture with a camera. In the prior art, an image of a word is typically scanned into a computer as part of a page or line of a document. Conventional scanners cannot selectively scan one word from another word in the same line of the document. Thus, the user of the electronic dictionary cannot use the scanner as an input device to retrieve the definition of a word. In addition, PIM users cannot query the assignment of a specific date by scanning the calendar date through a conventional scanner. The image capturing apparatus 10 of the present invention can capture an image of a single word, many words, or a part of a word depending on the length of the word. Thus, such devices provide more flexibility for capturing images of small objects or parts of items.

물체의 광 영상은 렌즈를 통과하여, 2차원 영상센서(88)상에 투사된다. 양호한 실시예에서, 2차원 영상 센서(20)는 예를 들어 물체의 광 영상 신호를 2차원 지향 아날로그 영상 신호로 변환하는 전하 결합 장치(CCD) 광 영상 센서 모듈을 포함한다. 이런 변환 프로세스는 기술분야에 잘 알려져 있다. 아날로그 영상 신호는 그때 아날로그-디지털 변환기(90)에 의해 디지털 영상 신호로 변환된다. 디지털 영상신호는 그때 DSP(92)에 의해 처리되어, 물체의 영상을 디스플레이하는 디스플레이(28)로 전송된다. 이런 모든 프로세스는 실시간에 수행되어, 디스플레이상에 나타난 물체의 영상은 렌즈의 범주내에 있는 물체의 영상을 반사시킨다. 사용자는 실시간에 디스플레이상에서 장치에 의해 포착된 물체의 영상을 볼 수 있다. 일단 사용자가 사람이 더욱 많은 정보를 획득하기를 바라는 디스플레이상에서의 물체를 볼 경우, 사용자는 스위치(34)를 눌려 물체의 속사 정지 영상을 간단히 찍을 수 있다. DSP(90)는 스위치(34)가 눌려졌음을 검출하여, 영상이 디스플레이되는 물체가 선택되고, 상기 물체에 관한 정보가 검색되어 디스플레이됨을 인식한다.An optical image of the object passes through the lens and is projected onto the two-dimensional image sensor 88. In a preferred embodiment, the two-dimensional image sensor 20 comprises a charge coupled device (CCD) optical image sensor module for converting, for example, an optical image signal of an object into a two-dimensional directed analog image signal. Such conversion processes are well known in the art. The analog video signal is then converted into a digital video signal by the analog-to-digital converter 90. The digital video signal is then processed by the DSP 92 and transmitted to the display 28 displaying the image of the object. All these processes are performed in real time, such that the image of the object shown on the display reflects the image of the object within the scope of the lens. The user can see an image of the object captured by the device on the display in real time. Once the user sees an object on the display that a person wishes to acquire more information, the user can simply press the switch 34 to take a quick still image of the object. The DSP 90 detects that the switch 34 is pressed, and recognizes that an object on which an image is displayed is selected, and that information about the object is retrieved and displayed.

다른 실시예에서, 사용자는 스위치(34)를 "온" 위치에 놓아 영상 포착 동작을 활성화시킬 수 있다. 활성화될 시에, 영상 포착 장치는 렌즈의 범주내에 위치된 물체의 영상을 포착한다. 영상 포착 장치의 사용자는 디스플레이상에 포착된 물체의 영상을 볼 수 있다. 물체의 속사 정지 영상은 스위치를 풀어줌과 동시에 찍힌다.In other embodiments, the user may place the switch 34 in the "on" position to activate the image capture operation. When activated, the image capture device captures an image of an object located within the scope of the lens. The user of the image capturing apparatus can see an image of the object captured on the display. Rapid still images of the object are taken at the same time the switch is released.

2차원 영상 센서(88)는 ROM(94)내에 내장된 영상 센서 드라이버를 이용한다. 양호한 실시예에서, 영상 센서 드라이버는 컴퓨터 프로그램 형태로 DSP(92)의 동작을 제어하는 명령어의 시퀀스를 포함한다. 2차원 영상 센서(88)는 영상 센서 드라이버의 명령어에 따라 광 영상 신호를 물체의 아날로그 영상 신호로 변환시킨다.The two-dimensional image sensor 88 uses an image sensor driver embedded in the ROM 94. In a preferred embodiment, the image sensor driver includes a sequence of instructions that control the operation of the DSP 92 in the form of a computer program. The 2D image sensor 88 converts an optical image signal into an analog image signal of an object according to a command of an image sensor driver.

양호한 실시예에서, DSP(92)는 또한 마이크로 컴퓨터, 마이크로 제어기, CISC 프로세서 또는 RISC 프로세서일 수 있다. 프로세서형은 영상 포착 장치(10)의 동작의 결과에 영향을 받지 않는다. 디지털 신호 프로세서는 큰 수를 다룰 수 있는 것으로 알려져 있다. 계산 다항식이 신호 처리 기법에 상당히 이용되기 때문에, 디지털 신호 프로세서는 이런 응용에 이용될 최상의 프로세서형인 것처럼 보인다.In a preferred embodiment, the DSP 92 may also be a microcomputer, microcontroller, CISC processor or RISC processor. The processor type is not affected by the result of the operation of the image capturing apparatus 10. Digital signal processors are known to handle large numbers. Because computational polynomials are heavily used in signal processing techniques, digital signal processors appear to be the best processor type for these applications.

도 4 는 시프트 게이트(106)를 통해 CCD 아날로그 시프트 레지스터(104)에 접속된 영상 감지 회로(102)를 포함하는 2차원 영상 센서(88)의 개략도이다. 영상 감지 회로(102)는 2차원 매트릭스 포맷으로 배치된(포토사이트(photosites)로 알려진) 다수의 광 감지 소자를 포함하는 데, 각 소자는 병렬로 결합된 포토-다이오드(108) 및 캐패시터(110)를 포함한다. 영상 감지 회로(102)의 한 단부는 영상 포착장치(10)의 섀시 접지(112)에 접속되는 반면에, 영상 감지 회로(102)의 다른 단부는 시프트 게이트(106)의 한 측면에 결합된다. 시프트 게이트(106)의 다른 측면은 CCD 아날로그 시프트 레지스터(104)에 결합된다. 시프트 게이트(106)를 폐쇄함으로써, 목표 물체의 광무반사(reflected off)는 포토-다이오드(108)의 상태를 변화시켜, 광 신호의 시퀀스를 발생시킨다. 광 신호의 시퀀스는 CCD 아날로그 시프트 레지스터(104)로 클록된다. CCD 아날로그 시프트 레지스터(104)를 통해, 아날로그 영상 신호가 발생된다.4 is a schematic diagram of a two-dimensional image sensor 88 including an image sensing circuit 102 connected to a CCD analog shift register 104 via a shift gate 106. Image sensing circuitry 102 includes a number of light sensing elements (also known as photosites) arranged in a two dimensional matrix format, each device having photo-diode 108 and capacitor 110 coupled in parallel. ). One end of the image sensing circuit 102 is connected to the chassis ground 112 of the image capture device 10, while the other end of the image sensing circuit 102 is coupled to one side of the shift gate 106. The other side of the shift gate 106 is coupled to the CCD analog shift register 104. By closing the shift gate 106, the reflected off of the target object changes the state of the photo-diode 108, generating a sequence of optical signals. The sequence of optical signals is clocked into the CCD analog shift register 104. Through the CCD analog shift register 104, an analog image signal is generated.

선택적으로, CCD 아날로그 시프트 레지스터는 CMOS-기초 아날로그 시프트 레지스터로 대체될 수 있으며, 포토 다이오드(108)는 포토 레지스터로 대체될 수 있다. 이런 대체로, CCD-기초 2차원 영상 센서(88)는 CMOS-기초 영상 센서가 된다.Optionally, the CCD analog shift register can be replaced with a CMOS-based analog shift register and the photodiode 108 can be replaced with a photo resistor. As such, the CCD-based two-dimensional image sensor 88 becomes a CMOS-based image sensor.

도 5 에서, 양호한 실시예의 장치(10)에 의해 수행된 영상 포착 동작이 설명된다. 사용자는 영상 포착 동작을 활성화시켜, 사용자가 영상 포착 동작을 활성화시킬 카메라 스위치를 누르게 할 필요가 있다(단계(114)). 물체의 광 영상은 렌즈(18)를 통과하고, 카메라(86)에 의해 검출된다. 광 영상 신호는 2차원 아날로그 영상 신호로 변환된다(단계(116)). 그 다음, 아날로그 영상 신호는 아날로그-디지털 변환기에 의해 디지털 영상 신호로 변환된다(단계(118)). 프리 프로세스 동작에서, DSP는 필터링 서브프로세스를 통해 수신된 디지털 영상 신호의 잡음을 제거하여(단계(120)), 필터된 영상 신호를 발생시킨다.In FIG. 5, an image capturing operation performed by the apparatus 10 of the preferred embodiment is described. The user needs to activate the image capture operation, causing the user to press a camera switch to activate the image capture operation (step 114). An optical image of the object passes through the lens 18 and is detected by the camera 86. The optical image signal is converted into a two-dimensional analog image signal (step 116). The analog video signal is then converted into a digital video signal by an analog-to-digital converter (step 118). In the pre-process operation, the DSP removes noise in the digital video signal received through the filtering subprocess (step 120) to generate a filtered video signal.

그 다음, DSP 는 광 영상 인식 서브프로세스를 수행시킨다. 전자 사전에서, 본 발명의 인식 동작은 신규 광 문자 인식 서브프로세스이며(단계(124)), 이의 상세 동작은 도 8 에서 설명된다. 그 다음 DSP는 인식된 단어를 데이터베이스내의 단어와 매칭함으로써 문자 또는 단어 매칭 서브프로세스를 수행시킨다(단계(126)). 양호한 실시예의 이런 매칭 서브프로세스는 아래의 도 9 에서 설명된다. DSP(92)가 데이터베이스내의 매칭 단어를 발견한 후, 데이터베이스로 부터 단어에 관련된 정보를 검색한다(단계(128)). 검색된 정보 및, 데이터베이스내에서 발견된 단어는 그때 장치(10)의 디스플레이(28)상에 디스플레이된다(단계(130)).The DSP then performs an optical image recognition subprocess. In the electronic dictionary, the recognition operation of the present invention is a novel optical character recognition subprocess (step 124), the detailed operation of which is described in FIG. The DSP then performs a character or word matching subprocess by matching the recognized word with a word in the database (step 126). This matching subprocess of the preferred embodiment is described in FIG. 9 below. After the DSP 92 finds a matching word in the database, it retrieves information related to the word from the database (step 128). The retrieved information and the words found in the database are then displayed on display 28 of device 10 (step 130).

본 기술분야의 숙련자는 알 수 있듯이, 양호한 실시예는 장치와 물체 사이에서 각도에 대한 고도의 공차를 갖도록 행해짐으로써, 사람은 펜을 비스듬히 잡는 것처럼 장치를 편안하게 잡을 수 있다. 더욱이, 신규 카메라 구경측정(calibration) 특성은 양호한 실시예에서 구현되어, 영상 좌표는 회전될 수 있으며, 이런 특성은 사용자가 물체에 대해 카메라를 자유롭게 기울일 수 있게 한다.As will be appreciated by those skilled in the art, the preferred embodiment is made to have a high tolerance for angle between the device and the object so that a person can comfortably hold the device as if holding the pen at an angle. Moreover, new camera calibration features are implemented in the preferred embodiment such that the image coordinates can be rotated, which allows the user to tilt the camera freely with respect to the object.

도 6 은 양호한 실시예의 광학 문자 인식 서브프로세스를 설명한 흐름도이다. 이런 서브프로세스는 필터된 디지털 영상 신호의 매트릭스의 수신으로 개시하고, 이런 매트릭스는 광 신호를 나타내는 데이터 포인트를 포함하며, 이는 X 및 Y 좌표에 의하여 매트릭스로 배열된다(단계(132)). 매트릭스의 각 요소는 P_xy(x,y)의 픽셀의 그레이-스케일 값을 나타내는 데, 여기서(x,y)는 매트릭스내의 픽셀 P_xy의 위치이다. 그 다음, 2진 그레이 포맷의 영상 데이터는 그레이-스케일 필터된 디지털 영상 신호로 부터 생성된다(단계(134)). 이런 프로세스의 기본 동작은 그레이-스케일 포맷 또는 칼라 스케일의 영상 신호를 2진 그레이 포맷으로 변환시킬 수 있다. 이런 동작은 그레이의 각종 셰이드의 픽셀을 백색이나 흑색(또는 0 이나 1) 픽셀로 변환시키는 데, 이는 픽셀이 흑색 도트 또는 도트없는 픽셀중 어느 하나인 것을 의미한다. 각종 알고리즘은 그레이-스케일 데이터가 어떻게 이진 데이터로 변환되는 지를 결정하는 데 이용될 수 있다. 2진 데이터를 생성시킨 후, 필터링 프로세스는 영상내에서 도트 잡음을 제거하도록 수행된다(단계(136)). 클리어(clear) 영상 데이터 매트릭스는 필터링 서브프로세스의 결과로서 획득될 수 있다.6 is a flow chart illustrating an optical character recognition subprocess in a preferred embodiment. This subprocess begins with the reception of a matrix of filtered digital video signals, which matrix contains data points representing the optical signals, which are arranged in a matrix by X and Y coordinates (step 132). Each element of the matrix represents the gray-scale value of a pixel of P _xy (x, y), where (x, y) is the position of pixel P _{xy in} the matrix. Image data in binary gray format is then generated from the gray-scale filtered digital image signal (step 134). The basic operation of this process is to convert an image signal in gray-scale format or color scale to binary gray format. This operation converts the pixels of the various shades of gray to white or black (or 0 or 1) pixels, meaning that the pixels are either black dots or pixels without dots. Various algorithms can be used to determine how gray-scale data is converted to binary data. After generating the binary data, the filtering process is performed to remove dot noise in the image (step 136). Clear image data matrix may be obtained as a result of the filtering subprocess.

다음 단계는 물체의 영상 블록을 생성시킬 수 있다. 영상 블록을 생성시키기 전에, 물체의 영상은 먼저 서브파트로 분리되어야 한다. 전자 사전에서, 단어 프레이밍(framing) 서브프로세스는 각 포착된 단어의 영상에서 수행된다. 단어 프레이밍 서브프로세스는 먼저 단어 전후의 공간을 검출하여 수행된다. 단어의 각 문자는 또한 단어 프레이밍 프로세스에 의해 프레임되는 데, 각 문자는 문자의 불연속성을 인식함으로써 프레임된다. 이런 프로세스는 필터된 영상 신호로 부터 단어 프레이밍 또는 문자 프레이밍 프로세스를 통해 단어 영상 블록 또는 문자 영상 블록을 생성시킨다(단계(138)). 이런 단계는 다중 영상 블록의 포인트 밀도에 따라 영상 데이터 매트릭스로 부터 상기 다중 영상 블록을 생성시킨다. 전자 사전에서, 이는 단어내의 각 문자를 분리시킬 필요가 있는 데, 이는 문자간의 공간을 검출함으로써 달성된다. 이런 프로세스의 목적은 뒤따른 프로세스내에서 인식하기가 더욱 쉬운 문자를 획득하는 것이다. 문자 영상 블록으로 부터, 문자는 루프 인식 서브프로세스를 통해 인식될 수 있다(단계(140)). 필수적으로, 앞선 서브프로세스를 통해 재생된 각 문자의 인식하고, 영상 포착 장치에 의해 포착된 단어를 재구성할 정확한 순서로 각 문자를 배치함으로써 상기 루프 인식 서브프로세스를 수행시킨다. 이런 단계의 소정의 견지는 도 7 및 도 8 에서 더욱 더 설명된다. 이전의 서브프로세스(단계(132 내지 140))는 단어의 모든 문자가 문자 영상 블록내에 놓여졌을 때까지(단계(142)) 계속한다.The next step may be to create an image block of the object. Before generating an image block, the image of the object must first be separated into subparts. In an electronic dictionary, word framing subprocesses are performed on an image of each captured word. The word framing subprocess is first performed by detecting the space before and after the word. Each letter of a word is also framed by the word framing process, where each letter is framed by recognizing the discontinuity of the letter. This process generates a word image block or a character image block from the filtered image signal through a word framing or character framing process (step 138). This step generates the multiple image blocks from the image data matrix according to the point densities of the multiple image blocks. In an electronic dictionary, this needs to separate each letter in a word, which is accomplished by detecting the space between letters. The purpose of this process is to obtain characters that are easier to recognize in subsequent processes. From the character image block, the character may be recognized through a loop recognition subprocess (step 140). Essentially, the loop recognition subprocess is performed by recognizing each character reproduced through the preceding subprocess and placing each character in the correct order to reconstruct the word captured by the image capturing apparatus. Certain aspects of this step are further described in FIGS. 7 and 8. The previous subprocess (steps 132 through 140) continues until all the letters of the word have been placed in the character image block (step 142).

루프 인식 서브프로세스를 통해 문자를 매칭하는 상기 동작은 일례로서 문자 "z"를 이용한 도 7 에서 더욱 더 설명된다. 제 1 단계는 다수의 특정 픽셀을 식별하는 것이다. 이런 예에서, 4개의 특정 픽셀은 cp-1 144, cp-2 146, cp-3 148 및 cp-4 150 로 식별된다. 그후, 다른 특성 픽셀에 관련한 각 특성 픽셀의 상대 위치를 결정한다.The above operation of matching characters through a loop aware subprocess is further described in FIG. 7 using the character "z" as an example. The first step is to identify a number of specific pixels. In this example, four specific pixels are identified as cp-1 144, cp-2 146, cp-3 148 and cp-4 150. The relative position of each feature pixel relative to the other feature pixel is then determined.

예를 들면, cp-2 146 는 cp-1 144 의 우측에 있고, cp-3 148은 cp-1 144 아래에 있으며, cp-4 150 은 cp-1 144의 하위 우측에 있다. 다음 단계는 각 특성 픽셀에서 문자내에 형성된 각도를 식별하는 것이다. 예를 들면, cp-3 148 의 각도는 cp-2 146 및 cp-3 148 을 연결한 라인과, cp-3 148 및 cp-4 150을 연결한 라인간의 각도이다. 이런 예에서, 각도는 대략 60도이다. 다음 단계는 2개의 특성 픽셀간의 거리를 결정하는 것이다. 이런 거리는 하나의 결정된 거리에 관한 비율로 나타낼 수 있다. 이런 비율은 문자의 서로 다른 폰트(font) 사이즈를 지지하는 것이 바람직하다. 특성 픽셀의 상대 간계를 발견한 후, 매창 프로세서는 이런 문자의 특성을 호스트 시스템의 템플레이트 라이브러리내에 미리 저장된 문자의 특성과 비교함으로써 수행된다. 템플레이트 라이브러리는 데이터베이스의 다른 형태이다.For example, cp-2 146 is to the right of cp-1 144, cp-3 148 is below cp-1 144, and cp-4 150 is to the lower right of cp-1 144. The next step is to identify the angle formed within the character at each feature pixel. For example, the angle of cp-3 148 is an angle between a line connecting cp-2 146 and cp-3 148 and a line connecting cp-3 148 and cp-4 150. In this example, the angle is approximately 60 degrees. The next step is to determine the distance between the two characteristic pixels. This distance can be expressed as a ratio with respect to one determined distance. This ratio preferably supports different font sizes of characters. After finding the relative tricks of the feature pixels, the engraver processor is performed by comparing the properties of these characters with the properties of the characters previously stored in the template library of the host system. Template libraries are another form of database.

도 8 은 도 6 의 단계(140)에 의해 식별된 문자루프인식 서브프로세스를 설명한 흐름도이다. 전자 사전의 루프 인식 서브프로세스에서, 먼저 포착된 문자 영상의 스트로크(strokes)를 식별한다(단계(152)). 일례로서 도 7 의 문자 "z" 를 이용하여, 3개의 스트로크가 식별될 수 있는데, 제 1 스트로크는 cp-1 144에서 cp-2 146 까지이고, 제 2 스트로크는 cp-2 146 에서 cp-3 148 까지이며, 제 3 스트로크는 cp-3 148 에서 cp-4 150 까지이다. 다음 단계는 각 스트로크의 상대 길이를 결정하는 것이다(단계(154)). 문자 "z" 에서, 이런 단게는 각 스트로크의 상대 길이, 즉 cp-1 144 및 cp-2 146 간의 거리인 제1 스트로크의 상대 길이, cp-2 146 및 cp-3 148 간의 거리인 제 2 스트로크의 상대 길이와, cp-3 148 및 cp-4 150 간의 거리인 제 3 스트로크의 상대 길이를 결정한다. 다음 단계는 영상 포착 장치에 의해 포착된 문자 영상의 스트로크 카운트를 결정하는 것이다(단계(156)). 문자 "z"에서, 스트로크 카운트는 3인데, 이는 제 1, 2 및 3 스트로크로서 상기와 같이 식별된다. 다음 단계는 2개의 연결 스트로크 사이에서의 각도를 결정하는 것이다(단계(158)). 전술된 바와 같이, 특성 픽셀 cp-3 에서의 각도는 제 2 스트로크 및 제 3 스트로크 사이의 각도이다. 이런 예에서, 제 2 스트로크 및 제 3 스트로크는 2개의 연결 스트로크이다. 또한, 상기와 같이 식별된 제 1 스트로크 및 제 2 스트로크는 문자 "z" 내의 2개의 연결 스트로크이다. 2개의 연결 스트로크, 즉, 제 1 스트로크 및 제 2 스트로크는 또한 이런 단계에서 결정되는 특성 픽셀 cp-2 146 에서의 각도를 형성한다. 다음 단계는 각 스트로크의 방향을 결정하는 것이다(단계(160)). 다시 일례로서 문자 "z" 를 이용하여, 제 1 스트로크는 통상적으로 좌측에서 우측으로 그려지고, 제 2 스트로크는 통상적으로 상위 우측에서 하위 우측으로 그려지며, 제 3 스트로크는 통상적으로 좌측에서 우측으로 그려진다. 그 다음, 포착된 문자 영상의 특징 정보는 상기 단계에서 결정되는 정보를 포함하는 스트로크 특성으로 부터 생성된다(단계(162)). 이런 특징 정보는 각 스트로크의 상대 길이, 문자 영상의 스트로크 카운트수, 2개의 연결 스트로크 사이에 형성된 각도 및 각 스트로크의 방향을 포함한다. 포착된 문자 영상의 특징 정보는 그때 템플레이트 라이브러리내의 문자의 특성과 비교된다(단계(164)). 템플레이트 라이브러리는 인식될 각 독특한 그래픽 심볼에 대한 컴퓨터내의 프로토타입(prototype) 영상을 포함한 데이터베이스이다. 템플레이트 라이브러리내의 문자의 특성과 포착된 문자 영상의 특징 정보를 비교한 후, 완전한 매치가 발견되었는지를 결정할 필요가 있다(단계(166)). 완전한 매치가 발견되었을 경우, 매칭 서브프로세스의 결과로서 발견된 문자는 참조문자로서 이용된다(단계(168)). 어떤 완전한 매치도 발견되지 않았을 경우, 특성이 포착된 문자 영상의 특징 정보에 가장 가까운 문자는 참조 문자로서 선택된다(단계(170)).FIG. 8 is a flow chart illustrating the character loop recognition subprocess identified by step 140 of FIG. In the loop recognition subprocess of the electronic dictionary, first the strokes of the captured character image are identified (step 152). Using the letter “z” in FIG. 7 as an example, three strokes can be identified, where the first stroke is from cp-1 144 to cp-2 146 and the second stroke is from cp-2 146 to cp-3 Up to 148 and the third stroke is from cp-3 148 to cp-4 150. The next step is to determine the relative length of each stroke (step 154). In the letter "z", this step is the relative length of each stroke, i.e. the second stroke, the relative length of the first stroke, the distance between cp-1 144 and cp-2 146, and the distance between cp-2 146 and cp-3 148. Determine the relative length of and the relative length of the third stroke, which is the distance between cp-3 148 and cp-4 150. The next step is to determine the stroke count of the character image captured by the image capturing apparatus (step 156). In the letter "z", the stroke count is three, which is identified as above as the first, second and third strokes. The next step is to determine the angle between the two connecting strokes (step 158). As described above, the angle at characteristic pixel cp-3 is the angle between the second stroke and the third stroke. In this example, the second stroke and the third stroke are two connecting strokes. Further, the first stroke and the second stroke identified as above are two connecting strokes in the letter "z". The two connecting strokes, namely the first stroke and the second stroke, also form an angle in the characteristic pixel cp-2 146 determined at this stage. The next step is to determine the direction of each stroke (step 160). Again using the letter "z" as an example, the first stroke is typically drawn from left to right, the second stroke is typically drawn from upper right to lower right, and the third stroke is typically drawn from left to right. . The characteristic information of the captured character image is then generated from the stroke characteristic including the information determined in the above step (step 162). This feature information includes the relative length of each stroke, the number of stroke counts of the character image, the angle formed between the two connecting strokes, and the direction of each stroke. The characteristic information of the captured character image is then compared with the characteristic of the character in the template library (step 164). The template library is a database containing prototype images in the computer for each unique graphic symbol to be recognized. After comparing the characteristics of the characters in the template library with the characteristic information of the captured character image, it is necessary to determine whether a complete match has been found (step 166). If a perfect match is found, the character found as a result of the matching subprocess is used as a reference character (step 168). If no complete match is found, the character closest to the characteristic information of the character image in which the characteristic is captured is selected as the reference character (step 170).

영상 포착 장치에 의해 포착된 단어의 모든 문자 영상이 식별된 후, 다음 서브프로세스는 상기 단어에 관련된 정보를 데이터베이스로 부터 검색할 수 있다. 식별된 단어의 정보를 검색하는 서브프로세스는 도 9 에 도시된 흐름도에서 설명된다. 이런 단어 매칭 서브프로세스는 기억장치로 부터 인식된 워드의 검색으로 개시한다(단계(172)). 인식된 워드를 검색한 후, 다음 절차는 인식된 워드와 매치하는 데이터 베이스내의 단어를 발견하는 것이다(단계(174)). 그 다음, 매치된 단어가 데이터베이스내에서 발견되는지의 여부에 간해 결정이 행해진다(단계(176)). 단어가 데이터 베이스내에서 발견될 수 있는지 없는지 포착된 단어에 소정의 정보를 제공하는 것이 필수적이다. 매치된 단어가 데이터베이스내에서 발견될 경우, 매치된 단어에 관련된 정의 및 정보는 데이터베이스로 부터 검색된다(단계(178)). 또한, 포착된 단어에 가장 근접한 데이터베이스내의 단어는 선택되고, 그의 정의 및 정보는 데이터 베이스로부터 검색된다(단계(180)). 데이터베이스로 부터 정보를 검색한 후, 단어, 그의 정의 및 관련된 정보는 장치의 디스플레이상에 디스플레이된다(단계(182)).After all the character images of the words captured by the image capturing device are identified, the next subprocess can retrieve information related to the words from the database. The subprocess for retrieving the information of the identified word is described in the flowchart shown in FIG. This word matching subprocess begins with the retrieval of the recognized word from storage (step 172). After retrieving the recognized word, the next procedure is to find a word in the database that matches the recognized word (step 174). A decision is then made as to whether or not a matched word is found in the database (step 176). It is essential to provide some information to the captured word whether the word can be found in the database. If a matched word is found in the database, definitions and information related to the matched word are retrieved from the database (step 178). In addition, the words in the database closest to the captured words are selected, and their definitions and information are retrieved from the database (step 180). After retrieving the information from the database, the words, their definitions and related information are displayed on the display of the device (step 182).

본 발명이 특히 소정의 실시예를 참조로 기술되었지만, 본 기술분야의 숙련자에게는 다양한 변경 및 수정이 가능한 것으로 이해된다. 따라서, 아래의 청구의범위는 본 발명의 참 정신 및 범주내에서 그런 모든 변경 및 수정을 커버한다.Although the present invention has been described in particular with reference to certain embodiments, it is understood that various changes and modifications are possible to those skilled in the art. Accordingly, the following claims are intended to cover all such variations and modifications within the true spirit and scope of this invention.

Claims

An apparatus for capturing and displaying one or more images of at least one selected object, the apparatus processing the selected image to identify the selected image as a specific object having a code recognizable by the apparatus, and to retrieve and display information about the selected object. Apparatus for image capture and display of

Processor,

Storage devices and displays containing information about a plurality of identifiable objects,

A camera having a switch for taking a rapid still image of an object and generating a signal representing a still image of the object,

The camera continuously generates a signal representing an image of an object selected by the user, and passes this signal to the processor, which continuously displays the image represented by the signal in real time, and simultaneously with the activation of the switch, Taking a rapid still image of the object selected by the processor, and wherein the processor identifies the still image as a specific object and retrieves information about the object.

The method of claim 1,

The camera

A lens for sensing an optical image of at least one selected object,

An image sensor behind the lens operative to receive an optical image signal of the object through the lens and generate an analog image signal of the object,

An analog-to-digital converter for generating a digital video signal of the object in response to the analog video signal of the object from the image sensor;

And a processor for displaying a digital image of the object on a display in response to the digital image signal of the object generated by the analog-to-digital converter.

The method of claim 2,

And the image sensor is a CCD-based image sensor.

The method of claim 2,

And the image sensor is a CMOS-based image sensor.

The method of claim 1,

And the device is a personal information management system.

The method of claim 1,

And said display is a flat pattern display.

The method of claim 1,

And said display is a liquid crystal display.

A method for capturing an image of an object, the method comprising: capturing an image of an object by displaying the image as a specific object by a device and identifying the information, and searching for and displaying information on the device;

Continuously capturing an optical image signal of an object to generate a signal representing an image of the object,

Processing the signal to generate an image indicated by the signal, and displaying the image;

Generating a still image simultaneously with activation of a switch controllable by a user,

Identifying the still image as a particular object having a code recognizable by a processor in the device,

Recognizing a specific object through an image recognition subprocess,

Matching the recognized object with an object previously stored in a storage device,

Retrieving information about the matched object from the storage device;

And displaying the information on a display.

The method of claim 8,

The recognition process is

Generating a gray-scale digital image data matrix from a digital image signal of a specific object,

Generating a digital video signal of a specific object in binary-gray format from a gray-scale digital video signal,

Framing the particular object from a binary-gray format digital video signal of the particular object;

And generating an image block of the object from the frame of the object.

The method of claim 8,

The subprocess that matches the recognized object with the object in the database, and identifies the matched object,

Separating the object into a plurality of subparts,

Identifying a plurality of feature pixels in each subpart, wherein the properties of the subparts depend on the properties of each feature pixel, wherein the identifying step determines the relative position of each feature pixel relative to another feature pixel of the subpart. Identifying a plurality of feature pixels, including determining an angle formed at each feature pixel of the subpart, and determining a distance between two feature pixels of the subpart,

Comparing the characteristics of the subpart with other subparts in the database,

Selecting a subpart from a database whose characteristic matches the subpart of the object, or selecting a subpart from a database whose characteristic is closest to the characteristic of the subpart of the object.

The method of claim 8,

The recognition subprocess is an optical character recognition process.

An electronic dictionary device for capturing and displaying one or more images of one or more selected objects, the electronic dictionary device processing the selected images to identify the selected images as specific words with codes recognizable by the device, and searching and displaying information about them. An electronic dictionary device for capturing and displaying an image of an object

Processor,

A storage device and display comprising information relating to a plurality of identifiable words,

A camera having a switch for taking a rapid still image of a word and generating a signal representing the still image of the word,

The camera continuously generates a signal representing an image of a word selected by the user and passes this signal to the processor, the processor continuously displaying the image represented by the signal in real time, and simultaneously with the activation of the switch, And taking a quick still image of the word selected by the processor, and wherein the processor identifies the still image as a specific word and retrieves information about the same.

The method of claim 12,

The camera

A lens for detecting an optical image of one or more selected words,

An image sensor behind the lens operative to receive an optical image signal of a word through the lens and to generate an analog image signal of the word;

An analog-to-digital converter for generating a digital video signal of a word in response to the analog video signal of the word from the image sensor;

And a processor for displaying a digital image of the word on a display in response to the digital image signal of the word generated by the analog-to-digital converter.

The method of claim 13,

And said image sensor is a CCD-based image sensor.

The method of claim 13,

And said image sensor is a CMOS-based image sensor.

The method of claim 12,

And the device is a personal information management system.

The method of claim 12,

And the display is a flat pattern display.

The method of claim 12,

And said display is a liquid crystal display.

A method of capturing an image of a word, comprising: displaying and identifying the image as a specific word by an electronic dictionary device, searching for information on the same, and displaying the image on the electronic dictionary device,

Continuously capturing an optical image signal of a word and generating a signal representing the image of the word,

Generating a still image of a word simultaneously with the activation of a switch controllable by a user,

Identifying the still image as a specific word having a code recognizable by a processor in the device,

Recognizing a specific word through an image recognition subprocess,

Matching the recognized word with an object previously stored in a storage device,

Retrieving information about the matched word from the storage device;

And displaying the information on a display of an electronic dictionary.

The method of claim 19,

The recognition process is

Generating a gray-scale digital image data matrix from a digital image signal of a specific word,

Generating a digital video signal of a specific word in binary-gray format from a gray-scale digital video signal,

Framing the specific word from a binary-gray format digital image signal of a specific object;

And generating an image block of the word from the frame of the word.

The method of claim 19,

The subprocess that matches the recognized word with a word in the database, and identifies the matched word,

Separating the word into a plurality of subparts,

Identifying a plurality of characteristic pixels within each subpart, wherein the characteristics of the subparts depend on the characteristics of each characteristic pixel, wherein the identifying step determines the relative position of each characteristic pixel relative to the other characteristic pixels of the subpart. Identifying a plurality of characteristic pixels, including determining an angle formed at each characteristic pixel of the subpart, and determining a distance between two characteristic pixels of the subpart,

Selecting a subpart from a database whose characteristic matches the subpart of the word, or selecting a subpart from a database whose characteristic is closest to the characteristic of the subpart of the word.

The method of claim 19,

A recognition subprocess is an optical character recognition process.